A method for delivering resources to clients in a distributed computing environment. At least a first resource associated with a first content provider and maintained on an origin server references a second resource. The second resource is associated with a network formed by a plurality of repeater servers operable to serve the second resource to clients on behalf of the first content provider, the origin server being distinct from the plurality of repeater servers. Responsive to a request that causes the first resource to be served to a client from the origin server, at least one of the plurality of repeater servers is selected to serve the second resource to the client. If a copy of the second resource is available on the selected repeater server, the copy of the second resource is served to the client from the selected repeater server; otherwise, if a copy of the second resource is not available on the selected repeater server, the second resource is replicated on the selected repeater server.
|
43. An internet content delivery method for delivering resources from multiple content providers to multiple clients via a network of repeater servers, said resources including web pages associated with said content providers, at least some of said web pages including references to other resources, and wherein a web page associated with a first content provider includes a reference to a first resource also associated with said first content provider, wherein the web page is maintained on a storage device associated with an origin server, the method comprising:
after said web page has been served to a requesting client from the storage device, serving the first resource to the requesting client from a particular repeater server in the network of repeater servers by:
(A) if a copy of the first resource is not cached in a cache storage associated with the particular repeater server, wherein the cached storage is distinct from the storage device associated with the origin server, said particular repeater server uses at least a table on the particular repeater server and information included in a request for the first resource to determine an origin server associated with the first resource and attempts to obtain a copy of said first resource from a peer repeater server or from the origin server determined to be associated with the first resource; and then serving the copy of the first resource by the particular repeater server to the requesting client; otherwise,
(B) if a copy of the first resource is cached in the cache storage, serving the copy of the first resource by the particular server to the requesting client.
23. A method for delivering resources to clients from a network of repeater servers on behalf of content providers, wherein at least a first resource maintained on a storage device associated with an origin server references a second resource, the method comprising
configuring at least a plurality of the repeater servers in the network of repeater servers to serve at least the second resource to clients, wherein each of the repeater servers comprises a cache storage distinct from the storage device associated with the origin server;
responsive to a request that causes the origin server to serve the first resource to a client from the storage device associated with the origin server, selecting at least one of the plurality of repeater servers in the network from which to serve the second resource to the client;
responsive to the selected repeater server being requested to serve the second resource:
checking to determine whether a copy of the second resource is available in the cache storage of the selected repeater server;
if a copy of the second resource is available in the cache storage of the selected repeater server, serving the copy of the second resource to the client from the cache storage of the selected repeater server; otherwise,
if a copy of the second resource is not available in the cache storage of the selected repeater server, using at least a table on the selected repeater server and information included in a request for the second resource to determine an origin server associated with the second resource, and replicating the second resource in the cache storage of the selected repeater server, wherein the replicating act comprises:
requesting a copy of the second resource from the determined origin server.
1. A method for delivering resources to clients in a distributed computing environment, wherein at least a first resource associated with a first content provider and maintained on a storage device associated with an origin server references a second resource, the method comprising:
providing a network formed by a plurality of repeater servers configured to serve at least the second resource to clients on behalf of the first content provider, wherein each of the plurality of repeater servers comprises a cache storage distinct from the storage device associated with the origin server; and
responsive to a request that causes the origin server to serve the first resource to a client from the storage device associated with the origin server, selecting at least one of the plurality of repeater servers to serve the second resource to the client; and
responsive to the selected repeater server being requested to serve the second resource:
checking to determine whether a copy of the second resource is available in the cache storage of the selected repeater server;
if a copy of the second resource is available in the cache storage of the selected repeater server, serving the copy of the second resource to the client from the cache storage of the selected repeater server; otherwise,
if a copy of the second resource is not available in the cache storage of the selected repeater server, using at least a table on the selected repeater server and information included in a request for the second resource to determine an origin server associated with the second resource, and replicating the second resource in the cache storage of the selected repeater server, wherein the replicating act comprises:
requesting a copy of the second resource from the determined origin server.
32. A method for delivering resources to clients in a distributed computing environment, wherein at least a first resource associated with a first content provider and maintained on a storage device associated with an origin server references a second resource, the method comprising:
providing a network formed by a plurality of repeater servers configured to serve at least the second resource to clients on behalf of the first content provider, wherein each of the plurality of repeater servers comprises a cache storage distinct from the storage device associated with the origin server; and
responsive to a request that causes the origin server to serve the first resource to a client from the storage device:
selecting a repeater server in the network from which to serve the second resource to the client; and
causing the client to be provided an address of the selected repeater server;
responsive to the selected repeater server being requested to serve the second resource:
checking to determine whether a copy of the second resource is available in the cache storage of the selected repeater server;
if a copy of the second resource is available on the cache storage of the selected repeater server, serving the copy of the second resource to the client from the cache storage of the selected repeater server; otherwise,
if a copy of the second resource is not available in the cache storage of the selected repeater server, using at least a table on the selected repeater server and information included in a request for the second resource to determine an origin server associated with the second resource, and replicating the second resource in the cache storage of the selected repeater server, wherein the replicating act comprises:
requesting a copy of the second resource from the determined origin server.
2. A method as recited in
requesting a copy of the second resource from a peer repeater server in the network.
3. A method as recited in
associating the second resource with at least one of the plurality of repeater servers via a second URL, wherein the second URL is at least partially distinct from the first URL and wherein at least one of the plurality of repeater servers associated with the second resource via the second URL comprises the selected repeater server.
4. A method as recited in
forming the second URL by prepending data to the first domain name.
5. A method as recited in
modifying the textual reference to comprise the second URL.
6. A method as recited in
7. A method as recited in
transmitting to the client an HTTP REDIRECT command comprising the modified textual reference.
8. A method as recited in
9. A method as recited in
10. A method as recited in
a step for determining a Group Reduction table and a Link Cost table, wherein said Group Reduction table and said Link Cost table are operable for use in identifying the selected repeater server.
11. A method as recited in
12. A method as recited in
by the reflector, sending the request to the origin server on which the first resource is maintained.
14. A method as recited in
15. A method as recited in
16. A method as recited in
assigning the second resource a second URL identifying at least the selected repeater server, wherein the second URL is at least partially distinct from the first URL.
17. A method as recited in
forming the second URL by prepending data identifying the selected repeater server to the first domain name, and wherein the method further comprises:
modifying the textual reference to comprise the second URL.
18. A method as recited in
configuring each of the plurality of repeater servers with an activity logging module for use in recording an identity of each content provider associated with resources served therefrom, wherein responsive to the act of serving,
recording, by the activity logging module, information concerning delivery of at least the second resource by the selected repeater server in conjunction with at least identification of the first content provider.
19. A method as recited in
20. A method as recited in
configuring the table on each of the repeater servers to comprise a listing of subscribers authorized to have resources served to clients therefrom, wherein the subscribers comprise the plurality of content providers.
21. A method as recited in
by a reflector, intercepting the request and determining that the first resource is to be served to the client by the origin server on which the first resource is maintained; and
by the reflector, sending the request to the origin server on which the first resource is maintained.
22. A method as recited in
24. A method as recited in
associating the second resource with at least one of the plurality of repeater servers via a second URL, wherein the second URL is at least partially distinct from the first URL and wherein the plurality of repeater servers associated with the second resource via the second URL comprises the selected repeater server.
25. A method as recited in
forming the second URL by prepending data to the first domain name.
26. A method as recited in
configuring each of the plurality of repeater servers with an activity logging module for use in recording an identity of each content provider associated with resources served therefrom, wherein responsive to the act of serving,
recording, by the activity logging module, information concerning delivery of at least the second resource by the selected repeater server in conjunction with at least identification of the first content provider.
27. A method as recited in
28. A method as recited in
configuring the table on each of the repeater servers to comprise a listing of subscribers authorized to have resources served to clients therefrom, wherein the subscribers comprise each of the plurality of content providers.
29. A method as recited in
by a reflector, intercepting the request and determining that the first resource is to be served to the client by the origin server on which the first resource is maintained; and
by the reflector, sending the request to the origin server on which the first resource is maintained.
30. A method as recited in
31. A method as recited in
33. A method as defined in
34. A method as recited in
35. A method as recited in
by the reflector, intercepting the request and determining that the first resource is to be served to the client by the origin server on which the first resource is maintained; and
by the reflector, sending the request to the origin server on which the first resource is maintained.
36. A method as recited in
assigning the second resource a second URL identifying at least the selected repeater server, wherein the second URL is at least partially distinct from the first URL.
37. A method as recited in
forming the second URL by prepending data identifying the selected repeater server to the first domain name.
38. A method as recited in
modifying the textual reference to comprise the second URL.
39. A method as recited in
40. A method as recited in
configuring each of the plurality of repeater servers with an activity logging module for use in recording an identity of each content provider associated with resources served therefrom, wherein responsive to the act of serving,
recording, by the activity logging module, information concerning delivery of at least the first resource by the particular repeater server in conjunction with at least identification of the first content provider.
41. A method as recite in
42. A method as recited in
configuring the table on each of the repeater servers to comprise a listing of subscribers authorized to have resources served to clients therefrom, wherein the subscribers comprise the plurality of content providers.
44. An internet method as in
45. An internet method as in
46. An internet method as in
47. An internet method as in
48. A method as recited in
configuring the table on each of the repeater servers to comprise a listing of subscribers authorized to have resources served to clients therefrom, wherein the subscribers comprise the plurality of content providers.
49. A method as recited in
configuring each of the repeater servers with an activity logging module for use in recording an identity of each content provider associated with resources served therefrom, wherein responsive to the act of serving,
recording, by the activity logging module of the particular repeater server, information related to the serving of the copy of the first resource by the particular repeater server in conjunction with at least identification of the first content provider.
50. A method as recite in
|
This application is a continuation of and claims priority to pending U.S. patent application Ser. No. 11/441,253, filed May 26, 2006, which is a continuation of U.S. patent application Ser. No. 11/065,412, filed Feb. 23, 2005, pending, which is a continuation of U.S. patent application Ser. No. 09/612,598, filed Jul. 7, 2000, abandoned, which is a division of U.S. patent application Ser. No. 09/021,506, filed Feb. 10, 1998, patented as U.S. Pat. No. 6,185,598, the contents of each of which are hereby fully incorporated herein by reference. This application is a continuation of and claims priority to U.S. patent application Ser. No. 11/065,412, filed Feb. 23, 2005, pending, which is a continuation of U.S. patent application Ser. No. 09/612,598, filed Jul. 7, 2000, abandoned, which is a division of U.S. patent application Ser. No. 09/021,506, filed Feb. 10, 1998, patented as U.S. Pat. No. 6,185,598.
This invention relates to replication of resources in computer networks.
The advent of global computer networks, such as the Internet, have led to entirely new and different ways to obtain information. A user of the Internet can now access information from anywhere in the world, with no regard for the actual location of either the user or the information. A user can obtain information simply by knowing a network address for the information and providing that address to an appropriate application program such as a network browser.
The rapid growth in popularity of the Internet has imposed a heavy traffic burden on the entire network. Solutions to problems of demand (e.g., better accessibility and faster communication links) only increase the strain on the supply. Internet Web sites (referred to here as “publishers”) must handle ever-increasing bandwidth needs, accommodate dynamic changes in load, and improve performance for distant browsing clients, especially those overseas. The adoption of content-rich applications, such as live audio and video, has further exacerbated the problem.
To address basic bandwidth growth needs, a Web publisher typically subscribes to additional bandwidth from an Internet service provider (ISP), whether in the form of larger or additional “pipes” or channels from the ISP to the publisher's premises, or in the form of large bandwidth commitments in an ISP's remote hosting server collection. These increments are not always as fine-grained as the publisher needs, and quite often lead times can cause the publisher's Web site capacity to lag behind demand.
To address more serious bandwidth growth problems, publishers may develop more complex and costly custom solutions. The solution to the most common need, increasing capacity, is generally based on replication of hardware resources and site content (known as mirroring), and duplication of bandwidth resources. These solutions, however, are difficult and expensive to deploy and operate. As a result, only the largest publishers can afford them, since only those publishers can amortize the costs over many customers (and Web site hits).
A number of solutions have been developed to advance replication and mirroring. In general, these technologies are designed for use by a single Web site and do not include features that allow their components to be shared by many Web sites simultaneously.
Some solution mechanisms offer replication software that helps keep mirrored servers up-to-date. These mechanisms generally operate by making a complete copy of a file system. One such system operates by transparently keeping multiple copies of a file system in synch. Another system provides mechanisms for explicitly and regularly copying files that have changed. Database systems are particularly difficult to replicate, as they are continually changing. Several mechanisms allow for replication of databases, although there are no standard approaches for accomplishing it. Several companies offering proxy caches describe them as replication tools. However, proxy caches differ because they are operated on behalf of clients rather than publishers.
Once a Web site is served by multiple servers, a challenge is to ensure that the load is appropriately distributed or balanced among those servers. Domain name-server-based round-robin address resolution causes different clients to be directed to different mirrors.
Another solution, load balancing, takes into account the load at each server (measured in a variety of ways) to select which server should handle a particular request.
Load balancers use a variety of techniques to route the request to the appropriate server. Most of those load-balancing techniques require that each server be an exact replica of the primary Web site. Load balancers do not take into account the “network distance” between the client and candidate mirror servers.
Assuming that client protocols cannot easily change, there are two major problems in the deployment of replicated resources. The first is how to select which copy of the resource to use. That is, when a request for a resource is made to a single server, how should the choice of a replica of the server (or of that data) be made. We call this problem the “rendezvous problem”. There are a number of ways to get clients to rendezvous at distant mirror servers. These technologies, like load balancers, must route a request to an appropriate server, but unlike load balancers, they take network performance and topology into account in making the determination.
A number of companies offer products which improve network performance by prioritizing and filtering network traffic.
Proxy caches provide a way for client aggregators to reduce network resource consumption by storing copies of popular resources close to the end users. A client aggregator is an Internet service provider or other organization that brings a large number of clients operating browsers to the Internet. Client aggregators may use proxy caches to reduce the bandwidth required to serve web content to these browsers. However, traditional proxy caches are operated on behalf of Web clients rather than Web publishers.
Proxy caches store the most popular resources from all publishers, which means they must be very large to achieve reasonable cache efficiency. (The efficiency of a cache is defined as the number of requests for resources which are already cached divided by the total number of requests.) Proxy caches depend on cache control hints delivered with resources to determine when the resources should be replaced. These hints are predictive, and are necessarily often incorrect, so proxy caches frequently serve stale data. In many cases, proxy cache operators instruct their proxy to ignore hints in order to make the cache more efficient, even though this causes it to more frequently serve stale data.
Proxy caches hide the activity of clients from publishers. Once a resource is cached, the publisher has no way of knowing how often it was accessed from the cache.
This invention provides a way for servers in a computer network to off-load their processing of requests for selected resources by determining a different server (a “repeater”) to process those requests. The selection of the repeater can be made dynamically, based on information about possible repeaters.
If a requested resource contains references to other resources, some or all of these references can be replaced by references to repeaters.
Accordingly, in one aspect, this invention is a method of processing resource requests in a computer network. First a client makes a request for a particular resource from an origin server, the request including a resource identifier for the particular resource, the resource identifier sometimes including an indication of the origin server. Requests arriving at the origin server do not always include an indication of the origin server; since they are sent to the origin server, they do not need to name it. A mechanism referred to as a reflector, co-located with the origin server, intercepts the request from the client to the origin server and decides whether to reflect the request or to handle it locally. If the reflector decides to handle the request locally, it forwards it to the origin server, otherwise it selects a “best” repeater to process the request. If the request is reflected, the client is provided with a modified resource identifier designating the repeater.
The client gets the modified resource identifier from the reflector and makes a request for the particular resource from the repeater designated in the modified resource identifier.
When the repeater gets the client's request, it responds by returning the requested resource to the client. If the repeater has a local copy of the resource then it returns that copy, otherwise it forwards the request to the origin server to get the resource, and saves a local copy of the resource in order to serve subsequent requests.
The selection by the reflector of an appropriate repeater to handle the request can be done in a number of ways. In the preferred embodiment, it is done by first pre-partitioning the network into “cost groups” and then determining which cost group the client is in. Next, from a plurality of repeaters in the network, a set of repeaters is selected, the members of the set having a low cost relative to the cost group which the client is in. In order to determine the lowest cost, a table is maintained and regularly updated to define the cost between each group and each repeater. Then one member of the set is selected, preferably randomly, as the best repeater.
If the particular requested resource itself can contain identifiers of other resources, then the resource may be rewritten (before being provided to the client). In particular, the resource is rewritten to replace at least some of the resource identifiers contained therein with modified resource identifiers designating a repeater instead of the origin server. As a consequence of this rewriting process, when the client requests other resources based on identifiers in the particular requested resource, the client will make those requests directly to the selected repeater, bypassing the reflector and origin server entirely.
Resource rewriting must be performed by reflectors. It may also be performed by repeaters, in the situation where repeaters “peer” with one another and make copies of resources which include rewritten resource identifiers that designate a repeater.
In a preferred embodiment, the network is the Internet and the resource identifier is a uniform resource locator (URL) for designating resources on the Internet, and the modified resource identifier is a URL designating the repeater and indicating the origin server (as described in step B3 below), and the modified resource identifier is provided to the client using a REDIRECT message. Note, only when the reflector is “reflecting” a request is the modified resource identifier provided using a REDIRECT message.
In another aspect, this invention is a computer network comprising a plurality of origin servers, at least some of the origin servers having reflectors associated therewith, and a plurality of repeaters.
The above and other objects and advantages of the invention will be apparent upon consideration of the following detailed description, taken in conjunction with the accompanying drawings, in which the reference characters refer to like parts throughout and in which:
Thus, a repeater can be considered as a dedicated proxy server that maintains a partial or sparse mirror of the origin server 102, by implementing a distributed coherent cache of the origin server. A repeater may maintain a (partial) mirror of more than one origin server. In some embodiments, the network 100 is the Internet and repeaters mirror selected resources provided by origin servers in response to clients' HTTP (hypertext transfer protocol) and FTP (file transfer protocol) requests.
A client 106 connects, via the network 100, to origin server 102 and possibly to one or more repeaters 104a etc.
Origin server 102 is a server at which resources originate. More generally, the origin server 102 is any process or collection of processes that provide resources in response to requests from a client 106. Origin server 102 can be any off-the-shelf Web server. In a preferred embodiment, origin server 102 is typically a Web server such as the Apache server or Netscape Communications Corporation's Enterprise™ server.
Client 106 is a processor requesting resources from origin server 102 on behalf of an end user. The client 106 is typically a user agent (e.g., a Web browser such as Netscape Communications Corporation's Navigator™) or a proxy for a user agent. Components other than the reflector 108 and the repeaters 104a, 104b, etc., may be implemented using commonly available software programs. In particular, this invention works with any HTTP client (e.g., a Web browser), proxy cache, and Web server. In addition, the reflector 108 might be fully integrated into the data server 112 (for instance, in a Web Server). These components might be loosely integrated based on the use of extension mechanisms (such as so-called add-in modules) or tightly integrated by modifying the service component specifically to support the repeaters.
Resources originating at the origin server 102 may be static or dynamic. That is, the resources may be fixed or they may be created by the origin server 102 specifically in response to a request. Note that the terms “static” and “dynamic” are relative, since a static resource may change at some regular, albeit long, interval.
Resource requests from the client 106 to the origin server 102 are intercepted by reflector 108 which for a given request either forwards the request on to the origin server 102 or conditionally reflects it to some repeater 104a, 104b, etc. in the network 100. That is, depending on the nature of the request by the client 106 to the origin server 102, the reflector 108 either serves the request locally (at the origin server 102), or selects one of the repeaters (preferably the best repeater for the job) and reflects the request to the selected repeater. In other words, the reflector 108 causes requests for resources from origin server 102, made by client 106, to be either served locally by the origin server 102 or transparently reflected to the best repeater 104a, 104b, etc. The notion of a best repeater and the manner in which the best repeater is selected are described in detail below.
Repeaters 104a, 104b, etc. are intermediate processors used to service client requests thereby improving performance and reducing costs in the manner described herein. Within repeaters 104a, 104b, etc., are any processes or collections of processes that deliver resources to the client 106 on behalf of the origin server 102. A repeater may include a repeater cache 110, used to avoid unnecessary transactions with the origin server 102.
The reflector 108 is a mechanism, preferably a software program, that intercepts requests that would normally be sent directly to the origin server 102. While shown in the drawings as separate components, the reflector 108 and the origin server 102 are typically co-located, e.g., on a particular system such as data server 112. (As discussed below, the reflector 108 may even be a “plug in” module that becomes part of the origin server 102.
Uniform Resource Locators
Each location in a computer network has an address which can generally be specified as a series of names or numbers. In order to access information, an address for that information must be known. For example, on the World Wide Web (“the Web”) which is a subset of the Internet, the manner in which information address locations are provided has been standardized into Uniform Resource Locators (URLs). URLs specify the location of resources (information, data files, etc.) on the network.
The notion of URLs becomes even more useful when hypertext documents are used. A hypertext document is one which includes, within the document itself, links (pointers or references) to the document itself or to other documents. For example, in an on-line legal research system, each case may be presented as a hypertext document. When other cases are cited, links to those cases can be provided. In this way, when a person is reading a case, they can follow cite links to read the appropriate parts of cited cases.
In the case of the Internet in general and the World Wide Web specifically, documents can be created using a standardized form known as the Hypertext Markup Language (HTML). In HTML, a document consists of data (text, images, sounds, and the like), including links to other sections of the same document or to other documents. The links are generally provided as URLs, and can be in relative or absolute form. Relative URLs simply omit the parts of the URL which are the same as for the document including the link, such as the address of the document (when linking to the same document), etc. In general, a browser program will fill in missing parts of a URL using the corresponding parts from the current document, thereby forming a fully formed URL including a fully qualified domain name, etc.
A hypertext document may contain any number of links to other documents, and each of those other documents may be on a different server in a different part of the world. For example, a document may contain links to documents in Russia, Africa, China and Australia. A user viewing that document at a particular client can follow any of the links transparently (i.e., without knowing where the document being linked to actually resides). Accordingly, the cost (in terms of time or money or resource allocation) of following one link versus another may be quite significant.
URLs generally have the following form (defined in detail in T. Berners-Lee et al, Uniform Resource Locators (URL), Network Working Group, Request for Comments: 1738, Category: Standards Track, December 1994, located at “http://ds.internic.net/rfc/rfc1738.txt”, which is hereby incorporated herein by reference):
scheme://host[:port]/url-path
where “scheme” can be a symbol such as “file” (for a file on the local system), “ftp” (for a file on an anonymous FTP file server), “http” (for a file on a file on a Web server), and “telnet” (for a connection to a Telnet-based service). Other schemes, can also be used and new schemes are added every now and then. The port number is optional, the system substituting a default port number (depending on the scheme) if none is provided. The “host” field maps to a particular network address for a particular computer. The “url-path” is relative to the computer specified in the “host” field. A url-path is typically, but not necessarily, the pathname of a file in a web server directory.
For example, the following is a URL identifying a file “F” in the path “A/B/C” on a computer at “www.uspto.gov”:
http://www.uspto.gov/A/B/C/F
In order to access the file “F” (the resource) specified by the above URL, a program (e.g., a browser) running on a user's computer (i.e., a client computer) would have to first locate the computer (i.e., a server computer) specified by the host name. I.e., the program would have to locate the server “www.uspto.gov”. To do this, it would access a Domain Name Server (DNS), providing the DNS with the host name (“www.uspto.gov”). The DNS acts as a kind of centralized directory for resolving addresses from names. If the DNS determines that there is a (remote server) computer corresponding to the name “www.uspto.gov”, it will provide the program with an actual computer network address for that server computer. On the Internet this is called an Internet Protocol (or IP) address and it has the form “123.345.456.678”. The program on the user's (client) computer would then use the actual address to access the remote (server) computer.
The program opens a connection to the HTTP server (Web server) on the remote computer “www.uspto.gov” and uses the connection to send a request message to the remote computer (using the HTTP scheme). The message is typically an HTTP GET request which includes the url-path of the requested resource, “A/B/C/F”. The HTTP server receives the request and uses it to access the resource specified by the url-path “A/B/C/F”. The server returns the resource over the same connection.
Thus, conventionally HTTP client requests for Web resources at an origin server 102 are processed as follows (see
There are many variations of this basic model. For example, in one variation, instead of providing the client with the resource, the origin server can tell the client to re-request the resource by another name. To do so, in A7 the server 102 sends back to the client 106 a reply called a “REDIRECT” which contains a new URL indicating the other name. The client 106 then repeats the entire sequence, normally without any user intervention, this time requesting the resource identified by the new URL.
System Operation
In this invention reflector 108 effectively takes the place of an ordinary Web server or origin server 102. The reflector 108 does this by taking over the origin server's IP address and port number. In this way, when a client tries to connect to the origin server 102, it will actually connect to the reflector 108. The original Web server (or origin server 102) must then accept requests at a different network (IP) address, or at the same IP address but on a different port number. Thus, using this invention, the server referred to in A3-A7 above is actually a reflector 108.
Note that it is also possible to leave the origin server's network address as it is and to let the reflector run at a different address or on a different port. In this way the reflector does not intercept requests sent to the origin server, but can still be sent requests addressed specifically to the reflector. Thus the system can be tested and configured without interrupting its normal operation.
The reflector 108 supports the processing as follows (see
By using a rule base (B2), it is possible to selectively reflect resources. There are a number of reasons that certain particular resources cannot be effectively repeated (and therefore should not be reflected), for instance:
In addition, the reflector 108 can be configured so that requests from certain network addresses (e.g., requests from clients on the same local area network as the reflector itself) are never reflected. Also, the reflector may choose not to reflect requests because the reflector is exceeding its committed aggregate information rate, as described below.
A request which is reflected is automatically mirrored at the repeater when the repeater receives and processes the request.
The combination of the reflection process described here and the caching process described below effectively creates a system in which repeatable resources are migrated to and mirrored at the selected reflector, while non-repeatable resources are not mirrored.
Alternate Approach
Placing the origin server name in the reflected URL is generally a good strategy, but it may be considered undesirable for aesthetic or (in the case, e.g., of cookies) certain technical reasons.
It is possible to avoid the need for placing both the repeater name and the server name in the URL. Instead, a “family” of names may be created for a given origin server, each name identifying one of the repeaters used by that server.
For instance, if www.example.com is the origin server, names for three repeaters might be created:
wr1.example.com
wr2.example.com
wr3.example.com
The name “wr1.example.com” would be an alias for repeater 1, which might also be known by other names such as “wr1.anotherExample.com” and “wr1.example.edu”.
If the repeater can determine by which name it was addressed, it can use this information (along with a table that associates repeater alias names with origin server names) to determine which origin server is being addressed. For instance, if repeater 1 is addressed as wr1.example.com, then the origin server is “www.example.com”; if it is addressed as “wr1.anotherExample.com”, then the origin server is “www.anotherExample.com”.
The repeater can use two mechanisms to determine by which alias it is addressed:
How a Repeater Handles a Request
When a browser receives a REDIRECT response (as produced in B3), it reissues a request for the resource using the new resource identifier (URL) (A1-A5). Because the new identifier refers to a repeater instead of the origin server, the browser now sends a request for the resource to the repeater which processes a request as follows, with reference to
Note that the bottom row of
Selecting the Best Repeater
If the reflector 108 determines that it will reflect the request, it must then select the best repeater to handle that request (as referred to in step B3-1). This selection is performed by the Best Repeater Selector (BRS) mechanism described here.
The goal of the BRS is to select, quickly and heuristically, an appropriate repeater for a given client given only the network address of the client. An appropriate repeater is one which is not too heavily loaded and which is not too far from the client in terms of some measure of network distance. The mechanism used here relies on specific, compact, pre-computed data to make a fast decision. Other, dynamic solutions can also be used to select an appropriate repeater.
The BRS relies on three pre-computed tables, namely the Group Reduction Table, the Link Cost Table, and the Load Table. These three tables (described below) are computed off-line and downloaded to each reflector by its contact in the repeater network.
The Group Reduction Table places every network address into a group, with the goal that addresses in a group share relative costs, so that they would have the same best repeater under varying conditions (i.e., the BRS is invariant over the members of the group).
The Link Cost Table is a two dimensional matrix which specifies the current cost between each repeater and each group. Initially, the link cost between a repeater and a group is defined as the “normalized link cost” between the repeater and the group, as defined below. Over time, the table will be updated with measurements which more accurately reflect the relative cost of transmitting a file between the repeater and a member of the group. The format of the Link Cost Table is <Group ID> <Group ID> <link cost>, where the Group ID's are given as AS numbers.
The Load Table is a one dimensional table which identifies the current load at each repeater. Because repeaters may have different capacities, the load is a value that represents the ability of a given repeater to accept additional work. Each repeater sends its current load to a central master repeater at regular intervals, preferably at least approximately once a minute. The master repeater broadcasts the Load Table to each reflector in the network, via the contact repeater.
A reflector is provided entries in the Load Table only for repeaters which it is assigned to use. The assignment of repeaters to reflectors is performed centrally by a repeater network operator at the master repeater. This assignment makes it possible to modify the service level of a given reflector. For instance, a very active reflector may use many repeaters, whereas a relatively inactive reflector may use few repeaters.
Tables may also be configured to provide selective repeater service to subscribers in other ways, e.g., for their clients in specific geographic regions, such as Europe or Asia.
Measuring Load
In the presently preferred embodiments, repeater load is measured in two dimensions, namely
1. requests received by the repeater per time interval (RRPT), and
2. bytes sent by the repeater per time interval (BSPT).
For each of these dimensions, a maximum capacity setting is set. The maximum capacity indicates the point at which the repeater is considered to be fully loaded. A higher RRPT capacity generally indicates a faster processor, whereas a higher BSPT capacity generally indicates a wider network pipe. This form of load measurement assumes that a given server is dedicated to the task of repeating.
Each repeater regularly calculates its current RRPT and BSPT, by accumulating the number of requests received and bytes sent over a short time interval. These measurements are used to determine the repeater's load in each of these dimensions. If a repeater's load exceeds its configured capacity, an alarm message is sent to the repeater network administrator.
The two current load components are combined into a single value indicating overall current load. Similarly, the two maximum capacity components are combined into a single value indicating overall maximum capacity. The components are combined as follows:
current-load=B×current RRPT+(1−B)×current BSPT
max-load=B×max RRPT+(1−B)×max BSPT
The factor B, a value between 0 and 1, allows the relative weights of RRPT and BSPT to be adjusted, which favors consideration of either processing power or bandwidth.
The overall current load and overall maximum capacity values are periodically sent from each repeater to the master repeater, where they are aggregated in the Load Table, a table summarizing the overall load for all repeaters. Changes in the Load Table are distributed automatically to each reflector.
While the preferred embodiment uses a two-dimensional measure of repeater load, any other measure of load can be used.
Combining Link Costs and Load
The BRS computes the cost of servicing a given client from each eligible repeater. The cost is computed by combining the available capacity of the candidate repeater with the cost of the link between that repeater and the client. The link cost is computed by simply looking it up in the Link Cost table.
The cost is determined using the following formula:
threshold=K*max-load
capacity=max(max-load−current-load,e)
capacity=min(capacity,threshold)
cost=link-cost*threshold/capacity
In this formula, e is a very small number (epsilon) and K is a tuning factor initial set to 0.5. This formula causes the cost to a given repeater to be increased, at a rate defined by K, if its capacity falls below a configurable threshold.
Given the cost of each candidate repeater, the BRS selects all repeaters within a delta factor of the best score. From this set, the result is selected at random.
The delta factor prevents the BRS from repeatedly selecting a single repeater when scores are similar. It is generally required because available information about load and link costs loses accuracy over time. This factor is tunable.
Best Repeater Selector (BRS)
The BRS operates as follows, with reference to
Given a client network address and the three tables described above:
Preferably the results of the BRS processing are maintained in a local cache at the reflector 108. Thus, if the best repeater has recently been determined for a given client (i.e., for a given network address), that best repeater can be reused quickly without being re-determined. Since the calculation described above is based on statically, pre-computed tables, if the tables have not changed then there is no need to re-determine the best repeater.
Determining the Group Reduction and Link Cost Tables
The Group Reduction Table and Link Cost Table used in BRS processing are created and regularly updated by an independent procedure referred to herein as NetMap. The NetMap procedure is run by executing several phases (described below) as needed.
The term Group is used here to refers to an IP “address group”.
The term Repeater Group refers to a Group that contains the IP address of a repeater.
The term link cost refers to a statically determined cost for transmitting data between two Groups. In a presently preferred implementation, this is the minimum of the sums of the costs of the links along each path between them. The link costs of primary concern here are link costs between a Group and a Repeater Group.
The term relative link cost refers to the link cost relative to other link costs for the same Group which is calculated by subtracting the minimum link cost from a Group to any Repeater Group from each of its link costs to a Repeater Group. The term Cost Set refers to a set of Groups that are equivalent in regard to Best Repeater Selection. That is, given the information available, the same repeater would be selected for any of them.
The NetMap procedure first processes input files to create an internal database called the Group Registry. These input files describe groups, the IP addresses within groups, and links between groups, and come a variety of sources, including publicly available Internet Routing Registry (IRR) databases, BGP router tables, and probe services that are located at various points around the Internet and use publicly available tools (such as “traceroute”) to sample data paths. Once this processing is complete, the Group Registry contains essential information used for further processing, namely (1) the identity of each group, (2) the set of IP addresses in a given group, (3) the presence of links between groups indicating paths over which information may travel, and (4) the cost of sending data over a given link.
The following processes are then performed on the Group Registry file.
Calculate Repeater Group link costs
The NetMap procedure calculates a “link cost” for transmission of data between each Repeater Group and each Group in the Group Registry. This overall link cost is defined as the minimum cost of any path between the two groups, where the cost of a path is equal to the sum of the costs of the individual links in the path. The link cost algorithm presented below is essentially the same as algorithm #562 from ACM journal Transactions on Mathematical Software: “Shortest Path From a Specific Node to All Other Nodes in a Network” by U. Pape, ACM TOMS 6 (1980) pp. 450-455, http://www.netlib.org/toms/562.
In this processing, the term Repeater Group refers to a Group that contains the IP address of a repeater. A group is a neighbor of another group if the Group Registry indicates that there is a link between the two groups.
Calculate Cost Sets
A Cost Set is a set of Groups that are equivalent with respect to Best Repeater Selection. That is, given the information available, the same repeater would be selected for any of them.
The “cost profile” of a Group G is defined herein as the set of costs between G and each Repeater. Two cost profiles are said to be equivalent if the values in one profile differ from the corresponding values in the other profile by a constant amount.
Once a client Group is known, the Best Repeater Selection algorithm relies on the cost profile for information about the Group. If two cost profiles are equivalent, the BRS algorithm would select the same repeater given either profile.
A Cost Set is then a set of groups that have equivalent cost profiles.
The effectiveness of this method can be seen, for example, in the case where all paths to a Repeater from some Group A pass through some other Group B. The two Groups have equivalent cost profiles (and are therefore in the same Cost Set) since whatever Repeater is best for Group A is also going to be best for Group B, regardless of what path is taken between the two Groups.
By normalizing cost profiles, equivalent cost profiles can be made identical. A normalized cost profile is a cost profile in which the minimum cost has the value zero. A normalized cost profile is computed by finding the minimum cost in the profile, and subtracting that value from each cost in the profile.
Cost Sets are then computed using the following algorithm:
The algorithm for finding Cost Sets employs a hash table to reduce the time necessary to determine whether the desired Cost Set already exists. The hash table uses a hash value computed from cost profile of G.
Each Cost Set is then numbered with a unique Cost Sent Index number. Cost Sets are then used in a straightforward manner to generate the Link Cost Table, which gives the cost from each Cost Set to each Repeater.
As described below, the Group Reduction Table maps every IP address to one of these Cost Sets.
Build IP Map
The IP Map is a sorted list of records which map IP address ranges to Link Cost Table keys. The format of the IP map is:
<base IP address> <max IP address> <Link Cost Table key>
where IP addresses are presently represented by 32-bit integers. The entries are sorted by descending base address, and by ascending maximum address among equal base addresses, and by ascending Link Cost Table key among equal base addresses and maximum addresses. Note that ranges may overlap.
The NetMap procedure generates an intermediate IP map containing a map between IP address ranges and Cost Set numbers as follows:
The IP map file is then sorted by descending base address, and by ascending maximum address among equal base addresses, and by ascending Cost Set number among equal base addresses and maximum addresses. The sort order for the base address and maximum address minimizes the time to build the Group Reduction Table and produces the proper results for overlapping entries.
Finally, the NetMap procedure creates the Group Reduction Table by processing the sorted IP map. The Group Reduction Table maps IP addresses (specified by ranges) into Cost Set numbers. Special processing of the IP map file is required in order to detect overlapping address ranges, and to merge adjacent address ranges in order to minimize the size of the Group Reduction Table.
An ordered list of address range segments is maintained, each segment consisting of a base address B and a Cost Set number N, sorted by base address B. (The maximum address of a segment is the base address of the next segment minus one.) The following algorithm is used:
A reserved LAN address range is an address range reserved for use by LANs which should not appear as a global Internet address. LOCAL is a special Cost Set index different from all others, indicating that the range maps to a client which should never be reflected. REPEATER is a special Cost Set index different from all others, indicating that the address range maps to a repeater. NOGROUP is a special Cost Set index different from all others, indicating that this range of addresses has no known mapping.
Rewriting HTML Resources
As explained above with reference to
Rewriting requires that a repeater has been selected (as described above with reference to the Best Repeater Selector). Rewriting uses a so-called BASE directive. The BASE directive lets the HTML identify a different base server. (The base address is normally the address of the HTML resource.)
Rewriting is performed as follows:
An extension of HTML, known as XML, is currently being developed. The process of rewriting URLs will be similar for XML, with some differences in the mechanism that parses the resource and identifies embedded URLs.
Handling Non-HTTP Protocols
This invention makes it possible to reflect references to resources that are served by protocols other than HTTP, for instance, the File Transfer Protocol (FTP) and audio/video stream protocols. However, many protocols do not provide the ability to redirect requests. It is, however, possible to redirect references before requests are actually made by rewriting URLs embedded in HTML pages. The following modifications to the above algorithms are used to support this capability.
In F4, the rewriter rewrites URLs for servers if those servers appear in a configurable table of cooperating origin server or so-called co-servers. The reflector operator can define this table to include FTP servers and other servers. A rewritten URL that refers to a non-HTTP resource takes the form:
http:<repeater>/<origin server>@proxy=<scheme>[:<type>]@/resource
where <scheme> is a supported protocol name such as “ftp”. This URL format is an alternative to the form shown in B3.
In C3, the repeater looks for a protocol embedded in the arriving request. If a protocol is present and the requested resource is not already cached, the repeater uses the selected protocol instead of the default HTTP protocol to request the resource when serving it and storing it in the cache.
System Configuration and Management
In addition to the processing described above, the repeater network requires various mechanisms for system configuration and network management. Some of these mechanisms are described here.
Reflectors allow their operators to synchronize repeater caches by performing publishing operations. The process of keeping repeater caches synchronized is described below. Publishing indicates that a resource or collection of resources has changed.
Repeaters and reflectors participate in various types of log processing. The results of logs collected at repeaters are collected and merged with logs collected at reflectors, as described below.
Adding Subscribers to the Repeater Network
When a new subscriber is added to the network, information about the subscriber is entered in a Subscriber Table at the master repeater and propagated to all repeaters in the network. This information includes the Committed Aggregate Information Rate (CAIR) for servers belonging to the subscriber, and a list of the repeaters that may be used by servers belonging to the subscriber.
Adding Reflectors to the Repeater Network
When a new reflector is added to the network, it simply connects to and announces itself to a contact repeater, preferably using a securely encrypted certificate including the repeater's subscriber identifier.
The contact repeater determines whether the reflector network address is permitted for this subscriber. If it is, the contact repeater accepts the connection and updates the reflector with all necessary tables (using version numbers to determine which tables are out of date).
The reflector processes requests during this time, but is not “enabled” (allowed to reflect requests) until all of its tables are current.
Keeping Repeater Caches Synchronized
Repeater caches are coherent, in the sense that when a change to a resource is identified by a reflector, all repeater caches are notified, and accept the change in a single transaction.
Only the identifier of the changed resource (and not the entire resource) is transmitted to the repeaters; the identifier is used to effectively invalidate the corresponding cached resource at the repeater. This process is far more efficient than broadcasting the content of the changed resource to each repeater.
A repeater will load the newly modified resource the next time it is requested.
A resource change is identified at the reflector either manually by the operator, or through a script when files are installed on the server, or automatically through a change detection mechanism (e.g., a separate process that checks regularly for changes).
A resource change causes the reflector to send an “invalidate” message to its contact repeater, which forwards the message to the master repeater. The invalidate message contains a list of resource identifiers (or regular expressions identifying patterns of resource identifiers) that have changed. (Regular expressions are used to invalidate a directory or an entire server.) The repeater network uses a two-phase commit process to ensure that all repeaters correctly invalidate a given resource.
The invalidation process operates as follows:
The master broadcasts a “phase 1” invalidation request to all repeaters indicating the resources and regular expressions describing sets of resources to be invalidated.
When each repeater receives the phase 1 message, it first places the resource identifiers or regular expressions into a list of resource identifiers pending invalidation.
Any resource requested (in C3) that is in the pending invalidation list may not be served from the cache. This prevents the cache from requesting the resource from a peer cache which may not have received an invalidation notice. Were it to request a resource in this manner, it might replace the newly invalidated resource by the same, now stale, data.
The repeater then compares the resource identifier of each resource in its cache against the resource identifiers and regular expressions in the list.
Each match is invalidated by marking it stale and optionally removing it from the cache. This means that a future request for the resource will cause it to retrieve a new copy of the resource from the reflector.
When the repeater has completed the invalidation, it returns an acknowledgment to the master. The master waits until all repeaters have acknowledged the invalidation request.
If a repeater fails to acknowledge within a given period, it is disconnected from the master repeater. When it reconnects, it will be told to flush its entire cache, which will eliminate any consistency problem. (To avoid flushing the entire cache, the master could keep a log of all invalidations performed, sorted by date, and flush only files invalidated since the last time the reconnecting repeater successfully completed an invalidation. In the presently preferred embodiments this is not done since it is believed that repeaters will seldom disconnect.) When all repeaters have acknowledged invalidation (or timed out) the repeater broadcasts a “phase 2” invalidation request to all repeaters. This causes the repeaters to remove the corresponding resource identifiers and regular expressions from the list of resource identifiers pending invalidation.
In another embodiment, the invalidation request will be extended to allow a “server push”. In such requests, after phase 2 of the invalidation process has completed, the repeater receiving the invalidation request will immediately request a new copy of the invalidated resource to place in its cache.
Logs and Log Processing
Web server activity logs are fundamental to monitoring the activity in a Web site. This invention creates “merged logs” that combine the activity at reflectors with the activity at repeaters, so that a single activity log appears at the origin server showing all Web resource requests made on behalf of that site at any repeater.
This merged log can be processed by standard processing tools, as if it had been generated locally.
On a periodic basis, the master repeater (or its delegate) collects logs from each repeater. The logs collected are merged, sorted by reflector identifier and timestamp, and stored in a dated file on a per-reflector basis. The merged log for a given reflector represents the activity of all repeaters on behalf of that reflector. On a periodic basis, as configured by the reflector operator, a reflector contacts the master repeater to request its merged logs. It downloads these and merges them with its locally maintained logs, sorting by timestamp. The result is a merged log that represents all activity on behalf of repeaters and the given reflector.
Activity logs are optionally extended with information important to the repeater network, if the reflector is configured to do so by the reflector operator. In particular, an “extended status code” indicates information about each request, such as:
In addition, activity logs contain a duration, and extended precision timestamps. The duration makes it possible to analyze the time required to serve a resource, the bandwidth used, the number of requests handled in parallel at a given time, and other quite useful information. The extended precision timestamp makes it possible to accurately merge activity logs.
Repeaters use the Network Time Protocol (NTP) to maintain synchronized clocks. Reflectors may either use NTP or calculate a time bias to provide roughly accurate timestamps relative to their contact repeater.
Enforcing Committed Aggregate Information Rate
The repeater network monitors and limits the aggregate rate at which data is served on behalf of a given subscriber by all repeaters. This mechanism provides the following benefits:
For each subscriber, a “threshold aggregate information rate” (TAIR) is configured and maintained at the master repeater. This threshold is not necessarily the committed rate, it may be a multiple of committed rate, based on a pricing policy.
Each repeater measures the information rate component of each reflector for which it serves resources, periodically (typically about once a minute), by recording the number of bytes transmitted on behalf of that reflector each time a request is delivered. The table thus created is sent to the master repeater once per period. The master repeater combines the tables from each repeater, summing the measured information of each reflector over all repeaters that serve resources for that reflector, to determine the “measured aggregate information rate” (MAIR) for each reflector.
If the MAIR for a given reflector is greater than the TAIR for that reflector, the MAIR is transmitted by the master to all repeaters and to the respective reflector.
When a reflector receives a request, it determines whether its most recently calculated MAIR is greater than its TAIR. If this is the case, the reflector probabilistically decides whether to suppress reflection, by serving the request locally (in B2). The probability of suppressing the reflection increases as an exponential function of the difference between the MAIR and the CAIR.
Serving a request locally during a peak period may strain the local origin server, but it prevents this subscriber from taking more than allocated bandwidth from the shared repeater network.
When a repeater receives a request for a given subscriber (in C2), it determines whether the subscriber is running near its threshold aggregate information rate. If this is the case, it probabilistically decides whether to reduce its load by redirecting the request back to the reflector. The probability increases exponentially as the reflector's aggregate information rate approaches its limit.
If a request is reflected back to a reflector, a special character string is attached to the resource identifier so that the receiving reflector will not attempt to reflect it again. In the current system, this string has the form “src=overload”.
The reflector tests for this string in B2.
The mechanism for limiting Aggregate Information Rate described above is fairly coarse. It limits at the level of sessions with clients (since once a client has been reflected to a given repeater, the rewriting process tends to keep the client coming back to that repeater) and, at best, individual requests for resources. A more fine-grained mechanism for enforcing TAIR limits within repeaters operates by reducing the bandwidth consumption of a busy subscriber when other subscribers are competing for bandwidth.
The fine-grained mechanism is a form of data “rate shaping”. It extends the mechanism that copies resource data to a connection when a reply is being sent to a client. When an output channel is established at the time a request is received, the repeater identifies which subscriber the channel is operating for, in C2, and records the subscriber in a data field associated with the channel. Each time a “write” operation is about to be made to the channel, the Metered Output Stream first inspects the current values of the MAIR and TAIR, calculated above, for the given subscriber. If the MAIR is larger than the TAIR, then the mechanism pauses briefly before performing the write operation. The length of the pause is proportional to the amount the MAIR exceeds the TAIR. The pause ensures that tasks sending other resources to other clients, perhaps on behalf of other subscribers, will have an opportunity to send their data.
Repeater Network Resilience
The repeater network is capable of recovering when a repeater or network connection fails.
A repeater cannot operate unless it is connected to the master repeater. The master repeater exchanges critical information with other repeaters, including information about repeater load, aggregate information rate, subscribers, and link cost.
If a master fails, a “succession” process ensures that another repeater will take over the role of master, and the network as a whole will remain operational. If a master fails, or a connection to a master fails through a network problem, any repeater attempting to communicate with the master will detect the failure, either through an indication from TCP/IP, or by timing out from a regular “heartbeat” message it sends to the master.
When any repeater is disconnected from its master, it immediately tries to reconnect to a series of potential masters based on a configurable file called its “succession list”.
The repeater tries each system on the list in succession until it successfully connects to a master. If in this process, it comes to its own name, it takes on the role of master, and accepts connections from other repeaters. If a repeater which is not at the top of the list becomes the master, it is called the “temporary master”.
A network partition may cause two groups of repeaters each to elect a master. When the partition is corrected, it is necessary that the more senior master take over the network. Therefore, when a repeater is temporary master, it regularly tries to reconnect to any master above it in the succession list. If it succeeds, it immediately disconnects from all of the repeaters connected to it. When they retry their succession lists, they will connect to the more senior master repeater.
To prevent losses of data, a temporary master does not accept configuration changes and does not process log files. In order to take on these tasks, it must be informed that it is primary master by manual modification of its successor list. Each repeater regularly reloads its successor list to determine whether it should change its idea of who the master is.
If a repeater is disconnected from the master, it must resynchronize its cache when it reconnects to the master. The master can maintain a list of recent cache invalidations and send to the repeater any invalidations it was not able to process while disconnected. If this list is not available for some reason (for instance, because the reflector has been disconnected too long), the reflector must invalidate its entire cache.
A reflector is not permitted to reflect requests unless it is connected to a repeater. The reflector relies on its contact repeater for critical information, such as load and Link Cost Tables, and current aggregate information rate. A reflector that is not connected to a repeater can continue to receive requests and handle them locally.
If a reflector loses its connection with a repeater, due to a repeater failure or network outage, it continues to operate while it tries to connect to a repeater.
Each time a reflector attempts to connect to a repeater, it uses DNS to identify a set of candidate repeaters given a domain name that represents the repeater network. The reflector tries each repeater in this set until it makes a successful contact. Until a successful contact is made, the reflector serves all requests locally. When a reflector connects to a repeater, the repeater can tell it to attempt to contact a different repeater; this allows the repeater network to ensure that no individual repeater has too many contacts.
When contact is made, the reflector provides the version number of each of its tables to its contact repeater. The repeater then decides which tables should be updated and sends appropriate updates to the reflector. Once all tables have been updated, the repeater notifies the reflector that it may now start reflecting requests.
Using a Proxy Cache within a Repeater
Repeaters are intentionally designed so that any proxy cache can be used as a component within them. This is possible because the repeater receives HTTP requests and converts them to a form recognized by the proxy cache.
On the other hand, several modifications to a standard proxy cache have been or may be made as optimizations. This includes, in particular, the ability to conveniently invalidate a resource, the ability to support cache quotas, and the ability to avoid making an extra copy of each resource as it passes from the proxy cache through the repeater to the requester.
In a preferred embodiment, a proxy cache is used to implement C3. The proxy cache is dedicated for use only by one or more repeaters. Each repeater requiring a resource from the proxy cache constructs a proxy request from the inbound resource request. A normal HTTP GET request to a server contains only the pathname part of the URL—the scheme and server name are implicit. (n an HTTP GET request to a repeater, the pathname part of the URL includes the name of the origin server on behalf of which the request is being made, as described above.) However, a proxy agent GET request takes an entire URL. Therefore, the repeater must construct a proxy request containing the entire URL from the path portion of the URL it receives. Specifically, if the incoming request takes the form:
GET/<origin server>/<path>
then the repeater constructs a proxy request of the form:
GET http://<origin server>/<path>
and if the incoming request takes the form:
GET <origin server>@proxy=<scheme>:<type>@/<path>
then the repeater constructs a proxy request of the form:
GET <scheme>://<origin server>/<path>
Cache Control
HTTP replies contain directives called cache control directives, which are used to indicate to a cache whether the attached resource may be cached and if so, when it should expire. A Web site administrator configures the Web site to attach appropriate directives. Often, the administrator will not know how long a page will be fresh, and must define a short expiration time to try to prevent caches from serving stale data. In many cases, a Web site operator will indicate a short expiration time only in order to receive the requests (or hits) that would otherwise be masked by the presence of a cache. This is known in the industry as “cache-busting”. Although some cache operators may consider cache-busting to be impolite, advertisers who rely on this information may consider it imperative.
When a resource is stored in a repeater, its cache directives can be ignored by the repeater, because the repeater receives explicit invalidation events to determine when a resource is stale. When a proxy cache is used as the cache at the repeater, the associated cache directives may be temporarily disabled. However, they must be re-enabled when the resource is served from the cache to a client, in order to permit the cache-control policy (including any cache-busting) to take effect.
The present invention contains mechanisms to prevent the proxy cache within a repeater from honoring cache control directives, while permitting the directives to be served from the repeater.
When a reflector serves a resource to a repeater in B4, it replaces all cache directives by modified directives that are ignored by the repeater proxy cache. It does this by prefixing a distinctive string such as “wr-” to the beginning of the HTTP tag. Thus, “expires” becomes “wr-expires”, and “cache-control” becomes “wr-cache-control”. This prevents the proxy cache itself from honoring the directives. When a repeater serves a resource in C4, and the requesting client is not another repeater, it searches for HTTP tags beginning with “wr-” and removes the “wr-”. This allows the clients retrieving the resource to honor the directives.
Resource Revalidation
There are several cases where a resource may be cached so long as the origin server is consulted each time it is served. In one case, the request for the resource is attached to a so-called “cookie”. The origin server must be presented with the cookie to record the request and determine whether the cached resource may be served or not. In another case, the request for the resource is attached to an authentication header (which identifies the requester with a user id and password). Each new request for the resource must be tested at the origin server to assure that the requester is authorized to access the resource.
The HTTP 1.1 specification defines a reply header tided “Must-Revalidate” which allows an origin server to instruct a proxy cache to “revalidate” a resource each time a request is received. Normally, this mechanism is used to determine whether a resource is still fresh. In the present invention, Must-Revalidate makes it possible to ask an origin server to validate a request that is otherwise served from a repeater.
The reflector rule base contains information that determines which resources may be repeated but must be revalidated each time they are served. For each such resource, in B4, the reflector attaches a Must-Revalidate header. Each time a request comes to a repeater for a cached resource marked with a Must-Revalidate header, the request is forwarded to the reflector for validation prior to serving the requested resource.
Cache Quotas
The cache component of a repeater is shared among those subscribers that reflect clients to that repeater. In order to allow subscribers fair access to storage facilities, the cache may be extended to support quotas.
Normally, a proxy cache may be configured with a disk space threshold T. Whenever more than T bytes are stored in the cache, the cache attempts to find resources to eliminate.
Typically a cache uses the least-recently-used (LRU) algorithm to determine which resources to eliminate; more sophisticated caches use other algorithms. A cache may also support several threshold values—for instance, a lower threshold which, when reached, causes a low priority background process to remove items from the cache, and a higher threshold which, when reached, prevents resources from being cached until sufficient free disk space has been reclaimed.
If two subscribers A and B share a cache, and more resources of subscriber A are accessed during a period of time than resources of subscriber B, then fewer of B's resources will be in the cache when new requests arrive. It is possible that, due to the behavior of A, B's resources will never be cached when they are requested. In the present invention, this behavior is undesirable. To address this issue, the invention extends the cache at a repeater to support cache quotas.
The cache records the amount of space used by each subscriber in DS, and supports a configurable threshold TS for each subscriber.
Whenever a resource is added to the cache (at C3), the value DS is updated for the subscriber providing the resource. If DS is larger than TS, the cache attempts to find resources to eliminate, from among those resources associated with subscriber S. The cache is effectively partitioned into separate areas for each subscriber.
The original threshold T is still supported. If the sum of reserved segments for each subscriber is smaller than the total space reserved in the cache, the remaining area is “common” and subject to competition among subscribers.
Note, this mechanism might be implemented by modifying the existing proxy cache discussed above, or it might also be implemented without modifying the proxy cache—if the proxy cache at least makes it possible for an external program to obtain a list of resources in the cache, and to remove a given resource from the cache.
Rewriting from Repeaters
When a repeater receives a request for a resource, its proxy cache may be configured to determine whether a peer cache contains the requested resource. If so, the proxy cache obtains the resource from the peer cache, which can be faster than obtaining it from the origin server (the reflector). However, a consequence of this is that rewritten HTML resources retrieved from the peer cache would identify the wrong repeater. Thus, to allow for cooperating proxy caches, resources are preferably rewritten at the repeater.
When a resource is rewritten for a repeater, a special tag is placed at the beginning of the resource. When constructing a reply, the repeater inspects the tag to determine whether the resource indicates that additional rewriting is necessary. If so, the repeater modifies the resource by replacing references to the old repeater with references to the new repeater.
It is only necessary to perform this rewriting when a resource is served to the proxy cache at another repeater.
Repeater-Side Include
Sometimes, an origin server constructs a custom resource for each request (for instance, when inserting an advertisement based on the history of the requesting client). In such a case, that resource must be served locally rather than repeated. Generally, a custom resource contains, along with the custom information, text and references to other, repeatable, resources.
The process that assembles a “page” from a text resource and possibly one or more image resources is performed by the Web browser, directed by HTML. However, it is not possible using HTML to cause a browser to assemble a page using text or directives from a separate resource. Therefore, custom resources often necessarily contain large amounts of static text that would otherwise be repeatable.
To resolve this potential inefficiency, repeaters recognize a special directive called a “repeater side include”. This directive makes it possible for the repeater to assemble a custom resource, using a combination of repeatable and local resources. In this way, the static text can be made repeatable, and only the special directive need be served locally by the reflector.
For example, a resource X might consist of custom directives selecting an advertising banner, followed by a large text article. To make this resource repeatable, the Web site administrator must break out a second resource, Y, to select the banner. Resource X is modified to contain a repeater-side include directive identifying resource Y, along with the article. Resource Y is created and contains only the custom directives selecting an ad banner. Now resource X is repeatable, and only resource Y, which is relatively small, is not repeatable.
When a repeater constructs a reply, it determines whether the resource being served is an HTML resource, and if so, scans it for repeater-side include directives. Each such directive includes a URL, which the repeater resolves and substitutes in place of the directive. The entire resource must be assembled before it is served, in order to determine its final size, as the size is included in a reply header ahead of the resource.
Thus, a method and apparatus for dynamically replicating selected resources in computer networks is provided. One skilled in the art will appreciate that the present invention can be practiced by other than the described embodiments, which are presented for purposes of illustration and not limitation, and the present invention is limited only by the claims that follow.
Farber, David A., Greer, Richard E., Swart, Andrew D., Balter, James A.
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
4495570, | Jan 14 1981 | Hitachi, Ltd. | Processing request allocator for assignment of loads in a distributed processing system |
4591983, | Jul 09 1984 | Teknowledge, Inc. | Hierarchical knowledge system |
4594704, | Nov 23 1983 | Compagnie Industrielle des Telecommunications Cit-Alcatel | Spare subscriber terminal device in a digital concentrator |
4726017, | May 21 1985 | Fla. | Multidrop data concentrator communication network |
4803641, | Jun 06 1984 | Tecknowledge, Inc. | Basic expert system tool |
4839798, | Nov 07 1984 | Hitachi, Ltd. | Method and apparatus for controlling job transfer between computer systems |
4847784, | Jul 13 1987 | Teknowledge, Inc.; TEKNOWLEDGE, INC , A CORP OF DE | Knowledge based tutor |
4920432, | Jan 12 1988 | Lodgenet Interactive Corporation | System for random access to an audio video data library with independent selection and display at each of a plurality of remote locations |
4922417, | Oct 24 1986 | BELL TELEPHONE LABORATORIES, INCORPORATED, A CORP OF NEW YORK; AT&T TECHNOLOGIES, INCORPORATED, A CORP OF NEW YORK | Method and apparatus for data hashing using selection from a table of random numbers in combination with folding and bit manipulation of the selected random numbers |
4943932, | Nov 25 1986 | Cimflex Teknowledge Corporation; TEKNOWLEDGE, INC | Architecture for composing computational modules uniformly across diverse developmental frameworks |
4949187, | Dec 16 1988 | VIDEO-ON-DEMAND VENTURES LLC | Video communications system having a remotely controlled central source of video and audio data |
4949248, | Jul 15 1988 | CANTOR FIRZGERALD SECURITIES | System for shared remote access of multiple application programs executing in one or more computers |
5029232, | Jan 12 1989 | Level 3 Communications, LLC | Satellite communications network |
5130792, | Feb 01 1990 | USA VIDEO CORPORATION; USA VIDEO TECHNOLOGY CORPORATION | Store and forward video system |
5132992, | Jan 07 1991 | Greenwich Information Technologies, LLC | Audio and video transmission and receiving system |
5136716, | Sep 04 1987 | ENTERASYS NETWORKS, INC | Session control in network for digital data processing system which supports multiple transfer protocols |
5172413, | Dec 20 1990 | SASKTEL, 2121 SASKATCHEWAN DRIVE, REGINA, SASKATCHEWAN, CANADA, S4P 3Y2 | Secure hierarchial video delivery system and method |
5191573, | Jun 13 1988 | SIGHTSOUND TECHNOLOGIES, LLC | Method for transmitting a desired digital video or audio signal |
5253275, | Jan 07 1991 | Greenwich Information Technologies, LLC | Audio and video transmission and receiving system |
5253341, | Mar 04 1991 | GLOBAL PATENT HOLDINGS, LLC | Remote query communication system |
5287499, | Mar 22 1989 | TTI Inventions C LLC | Methods and apparatus for information storage and retrieval utilizing a method of hashing and different collision avoidance schemes depending upon clustering in the hash table |
5287537, | Nov 15 1985 | Data General Corporation | Distributed processing system having plural computers each using identical retaining information to identify another computer for executing a received command |
5291554, | May 28 1992 | TV Answer, Inc. | Shared-price custom video rentals via interactive TV |
5341477, | Feb 24 1989 | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | Broker for computer network server selection |
5371532, | May 15 1992 | TTI Inventions A LLC | Communications architecture and method for distributing information services |
5410343, | Sep 27 1991 | Verizon Patent and Licensing Inc | Video-on-demand services using public switched telephone network |
5414455, | Jul 07 1993 | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | Segmented video on demand system |
5442389, | Dec 28 1992 | AT&T IPM Corp | Program server for interactive television system |
5442390, | Jul 07 1993 | TIVO INC | Video on demand with memory accessing and or like functions |
5442749, | Aug 22 1991 | Sun Microsystems, Inc. | Network video server system receiving requests from clients for specific formatted data through a default channel and establishing communication through separate control and data channels |
5471622, | Oct 04 1989 | Paralogic, Inc. | Run-time system having nodes for identifying parallel tasks in a logic program and searching for available nodes to execute the parallel tasks |
5475615, | Dec 23 1993 | COMCAST MO GROUP, INC | Method and system for sizing interactive video delivery systems |
5508732, | Mar 22 1993 | IBM Corporation | Data server, control server and gateway architecture system and method for broadcasting digital video on demand |
5515511, | Jun 06 1994 | International Business Machines Corporation | Hybrid digital/analog multimedia hub with dynamically allocated/released channels for video processing and distribution |
5519435, | Sep 01 1994 | INTEL NETWORK SYSTEMS, INC | Multi-user, on-demand video storage and retrieval system including video signature computation for preventing excessive instantaneous server data rate |
5522070, | Mar 19 1992 | Fujitsu Limited | Computer resource distributing method and system for distributing a multiplicity of processes to a plurality of computers connected in a network |
5528281, | Sep 27 1991 | Verizon Patent and Licensing Inc | Method and system for accessing multimedia data over public switched telephone network |
5539621, | Jan 14 1994 | PDACO LTD | Network communication unit with client and resource node array double layer ICs on printed board with connectors on housing |
5542087, | Oct 15 1993 | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | Linear hashing for distributed records |
5544313, | May 11 1994 | Cisco Technology, Inc | Baton passing optimization scheme for load balancing/configuration planning in a video-on-demand computer system |
5544327, | Mar 01 1994 | Cisco Technology, Inc | Load balancing in video-on-demand servers by allocating buffer to streams with successively larger buffer requirements until the buffer requirements of a stream can not be satisfied |
5550577, | May 19 1993 | ALCATEL N V | Video on demand network, including a central video server and distributed video servers with random access read/write memories |
5550863, | Jan 07 1991 | Greenwich Information Technologies, LLC | Audio and video transmission and receiving system |
5550982, | Jun 24 1993 | CONGRESS FINANCIAL CORPORATION NEW ENGLAND , A MASSACHUSETTS CORPORATION | Video application server |
5557317, | May 20 1994 | NEC Corporation | Video-on-demand system with program relocation center |
5572643, | Oct 19 1995 | INTERNETAD SYSTEMS LLC | Web browser with dynamic display of information objects during linking |
5590288, | Jul 30 1991 | Restaurant Technology, Inc. | Distributed data processing system and method utilizing peripheral device polling and layered communication software |
5592611, | Mar 14 1995 | AUTONOMY, INC | Stand-in computer server |
5594910, | Mar 23 1989 | International Business Machines Corporation | Interactive computer network and method of operation |
5603026, | Dec 07 1994 | YORANSO CONSULTING LIMITED LIABILITY COMPANY | Application-specific conflict resolution for weakly consistent replicated databases |
5619648, | Nov 30 1994 | Alcatel Lucent | Message filtering techniques |
5623656, | Dec 15 1994 | THE CHASE MANHATTAN BANK, AS COLLATERAL AGENT | Script-based data communication system and method utilizing state memory |
5625781, | Oct 31 1995 | International Business Machines Corporation | Itinerary list for interfaces |
5627829, | Oct 07 1993 | SAMSUNG ELECTRONICS CO , LTD | Method for reducing unnecessary traffic over a computer network |
5630067, | Jul 29 1994 | International Business Machines Corporation | System for the management of multiple time-critical data streams |
5633999, | Apr 07 1993 | Nonstop Networks Limited | Workstation-implemented data storage re-routing for server fault-tolerance on computer networks |
5634006, | Aug 17 1992 | International Business Machines Corporation | System and method for ensuring QOS in a token ring network utilizing an access regulator at each node for allocating frame size for plural transmitting applications based upon negotiated information and priority in the network |
5638443, | Nov 23 1994 | CONTENTGUARD HOLDINGS, INC | System for controlling the distribution and use of composite digital works |
5644714, | Jan 14 1994 | PDACO LTD | Video collection and distribution system with interested item notification and download on demand |
5646676, | May 30 1995 | International Business Machines Corporation | Scalable interactive multimedia server system for providing on demand data |
5649186, | Aug 07 1995 | Open Invention Network, LLC | System and method for a computer-based dynamic information clipping service |
5659729, | Feb 01 1996 | Oracle America, Inc | Method and system for implementing hypertext scroll attributes |
5666362, | Jul 25 1995 | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | Method and apparatus for asynchronous PPP and synchronous PPP conversion |
5671279, | Nov 13 1995 | Meta Platforms, Inc | Electronic commerce using a secure courier system |
5675734, | Jun 13 1988 | SIGHTSOUND TECHNOLOGIES, LLC | System for transmitting desired digital video or audio signals |
5682512, | Jun 30 1995 | Intel Corporation | Use of deferred bus access for address translation in a shared memory clustered computer system |
5699513, | Mar 31 1995 | GENERAL DYNAMICS C4 SYSTEMS, INC | Method for secure network access via message intercept |
5708780, | Jun 07 1995 | Soverain IP, LLC | Internet server access control and monitoring systems |
5712979, | Sep 20 1995 | ULOGIN LLC | Method and apparatus for attaching navigational history information to universal resource locator links on a world wide web page |
5715453, | May 31 1996 | PayPal, Inc | Web server mechanism for processing function calls for dynamic data queries in a web page |
5721914, | Sep 14 1995 | Verizon Patent and Licensing Inc | System and method for hierarchical data distribution |
5734831, | Apr 26 1996 | Sun Microsystems, Inc. | System for configuring and remotely administering a unix computer over a network |
5740423, | Dec 28 1995 | CSG Systems, Inc | System and method for accessing distributed data on a plurality of databases |
5742762, | May 19 1995 | Telogy Networks, Inc.; TELOGY NETWORKS, INC | Network management gateway |
5751961, | Jan 31 1996 | HANGER SOLUTIONS, LLC | Integrated internet system for translating logical addresses of internet documents to physical addresses using integrated service control point |
5761507, | Mar 05 1996 | GLOBALFOUNDRIES Inc | Client/server architecture supporting concurrent servers within a server with a transaction manager providing server/connection decoupling |
5761663, | Jun 07 1995 | GOOGLE LLC | Method for distributed task fulfillment of web browser requests |
5764906, | Nov 07 1995 | Francap Corporation | Universal electronic resource denotation, request and delivery system |
5774660, | Aug 05 1996 | RESONATE INC | World-wide-web server with delayed resource-binding for resource-based load balancing on a distributed resource multi-node network |
5774668, | Jun 07 1995 | Microsoft Technology Licensing, LLC | System for on-line service in which gateway computer uses service map which includes loading condition of servers broadcasted by application servers for load balancing |
5777988, | Apr 17 1996 | HANGER SOLUTIONS, LLC | System and method for equalizing delay in a dynamic packet switching network |
5777989, | Dec 19 1995 | International Business Machines Corporation; IBM Corporation | TCP/IP host name resolution for machines on several domains |
5778187, | May 09 1996 | Two-Way Media Ltd | Multicasting method and apparatus |
5784058, | May 28 1996 | Oracle America, Inc | User-controllable persistent browser display pages |
5794253, | Jul 12 1996 | Microsoft Technology Licensing, LLC | Time based expiration of data objects in a store and forward replication enterprise |
5796952, | Mar 21 1997 | THE NIELSEN COMPANY US , LLC, A DELAWARE LIMITED LIABILITY COMPANY | Method and apparatus for tracking client interaction with a network resource and creating client profiles and resource database |
5799141, | Jun 09 1995 | EMC Corporation | Real-time data protection system and method |
5802106, | Dec 06 1996 | Symantec Corporation | Method for rapid data rate detection in a packet communication environment without data rate supervision |
5802291, | Mar 30 1995 | Sun Microsystems, Inc. | System and method to control and administer distributed object servers using first class distributed objects |
5812769, | Sep 20 1995 | TWINTECH E U II, LIMITED LIABILITY COMPANY | Method and apparatus for redirecting a user to a new location on the world wide web using relative universal resource locators |
5815664, | Mar 20 1995 | Fujitsu Limited | Address reporting device and method for detecting authorized and unauthorized addresses in a network environment |
5819092, | Nov 08 1994 | Microsoft Technology Licensing, LLC | Online service development tool with fee setting capabilities |
5826031, | Jun 10 1996 | Sun Microsystems, Inc. | Method and system for prioritized downloading of embedded web objects |
5828847, | Apr 19 1996 | Oracle America, Inc | Dynamic server switching for maximum server availability and load balancing |
5832506, | Mar 29 1996 | Intel Corporation | Directory for network servers |
5832514, | Jun 26 1996 | Microsoft Technology Licensing, LLC | System and method for discovery based data recovery in a store and forward replication process |
5835718, | Apr 10 1996 | HANGER SOLUTIONS, LLC | URL rewriting pseudo proxy server |
5838906, | Oct 17 1994 | EOLAS TECHNOLOGIES INC | Distributed hypermedia method for automatically invoking external application providing interaction and display of embedded objects within a hypermedia document |
5845303, | Dec 06 1994 | AKAMAI TECHNOLOGIES, INC | Document processing using frame-based templates with hierarchical tagging |
5856974, | Feb 13 1996 | EMC Corporaton | Internetwork address mapping gateway |
5862339, | Jul 09 1996 | Microsoft Technology Licensing, LLC | Client connects to an internet access provider using algorithm downloaded from a central server based upon client's desired criteria after disconnected from the server |
5867706, | Dec 19 1996 | International Business Machines Corp. | Method of load balancing across the processors of a server |
5867799, | Apr 04 1996 | HUDSON BAY MASTER FUND LTD | Information system and method for filtering a massive flow of information entities to meet user information classification needs |
5870546, | Feb 21 1996 | DISNEY ENTERPRISES, INC | Method and apparatus for redirection of server external hyper-link reference |
5870559, | Apr 11 1997 | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | Software system and associated methods for facilitating the analysis and management of web sites |
5878212, | Jul 31 1995 | AT&T Corp.; AT&T IPM Corp | System for updating mapping or virtual host names to layer-3 address when multimedia server changes its usage state to busy or not busy |
5884038, | May 02 1997 | RPX Corporation | Method for providing an Internet protocol address with a domain name server |
5890171, | Aug 06 1996 | Microsoft Technology Licensing, LLC | Computer system and computer-implemented method for interpreting hypertext links in a document when including the document within another document |
5893116, | Sep 30 1996 | Oracle International Corporation | Accessing network resources using network resource replicator and captured login script for use when the computer is disconnected from the network |
5894554, | Apr 23 1996 | Microsoft Corporation | System for managing dynamic web page generation requests by intercepting request at web server and routing to page server thereby releasing web server to process other requests |
5896533, | Jul 06 1995 | Intel Corporation | Accessing internets world-wide web through object linking and embedding technology |
5903723, | Dec 21 1995 | INCYTE PHARMACEUTICALS, INC | Method and apparatus for transmitting electronic mail attachments with attachment references |
5907704, | Apr 03 1995 | Q LIQUIDATING TRUST | Hierarchical encapsulation of instantiated objects in a multimedia authoring system including internet accessible objects |
5913028, | Oct 06 1995 | RPX Corporation | Client/server data traffic delivery system and method |
5913033, | Dec 20 1996 | International Business Machines Corporation | Apparatus and method for retrieving information using standard objects |
5918010, | Feb 07 1997 | ABOUT, INC | Collaborative internet data mining systems |
5919247, | Jul 24 1996 | GOLDMAN SACHS BANK USA, AS SUCCESSOR COLLATERAL AGENT | Method for the distribution of code and data updates |
5920701, | Jan 19 1995 | HANGER SOLUTIONS, LLC | Scheduling data transmission |
5925106, | Apr 05 1996 | Oracle America, Inc | Method and apparatus for obtaining and displaying network server information |
5931904, | Oct 11 1996 | RAKUTEN, INC | Method for reducing the delay between the time a data page is requested and the time the data page is displayed |
5933832, | Sep 10 1997 | Kabushiki Kaisha Toshiba | Retrieval system for frequently updated data distributed on network |
5935207, | Jun 03 1996 | Rovi Technologies Corporation | Method and apparatus for providing remote site administrators with user hits on mirrored web sites |
5940831, | Aug 28 1996 | NEC Corporation | Hypermedia system and method of managing directories and directory data originating from a node link structure in a directory server |
5944780, | May 05 1997 | AT&T Properties, LLC; AT&T INTELLECTUAL PROPERTY II, L P | Network with shared caching |
5945989, | Mar 25 1997 | VOICECOM TELECOMMUNICATIONS, LLC | Method and apparatus for adding and altering content on websites |
5956489, | Jun 07 1995 | Microsoft Technology Licensing, LLC | Transaction replication system and method for supporting replicated transaction-based services |
5956716, | Jun 07 1995 | Intervu, Inc | System and method for delivery of video data over a computer network |
5958008, | Oct 15 1996 | MICRO FOCUS LLC | Software system and associated methods for scanning and mapping dynamically-generated web documents |
5961596, | Feb 14 1996 | Hitachi, Ltd.; Hitachi ULSI Engineering Corp. | Method of monitoring a computer system, featuring performance data distribution to plural monitoring processes |
5966440, | Jun 13 1988 | SIGHTSOUND TECHNOLOGIES, LLC | System and method for transmitting desired digital video or digital audio signals |
5968121, | Aug 13 1997 | Microsoft Technology Licensing, LLC | Method and apparatus for representing and applying network topological data |
5973696, | Aug 08 1996 | Conexant Systems, Inc | Embedded web server |
5978791, | Apr 11 1995 | Personalweb Technologies, LLC | Data processing system using substantially unique identifiers to identify data items, whereby identical data items have the same identifiers |
5983005, | May 09 1996 | Two-Way Media Ltd | Multicasting method and apparatus |
5983214, | Apr 04 1996 | HUDSON BAY MASTER FUND LTD | System and method employing individual user content-based data and user collaborative feedback data to evaluate the content of an information entity in a large information communication network |
5983227, | Jun 12 1997 | DIGI PORTAL LLC | Dynamic page generator |
5987606, | Mar 19 1997 | Bascom Global Internet Services, Inc. | Method and system for content filtering information retrieved from an internet computer network |
5991809, | Jul 25 1996 | XCELERA INC | Web serving system that coordinates multiple servers to optimize file transfers |
5996025, | Oct 31 1997 | SANDPIPER CDN, LLC | Network transparent access framework for multimedia serving |
6002720, | Jan 07 1991 | Greenwich Information Technologies, LLC | Audio and video transmission and receiving system |
6003030, | Jun 07 1996 | AKAMAI TECHNOLOGIES, INC | System and method for optimized storage and retrieval of data on a distributed computer network |
6006264, | Aug 01 1997 | Cisco Technology, Inc | Method and system for directing a flow between a client and a server |
6012090, | Mar 14 1997 | Alcatel Lucent | Client-side parallel requests for network services using group name association |
6014686, | Jun 21 1996 | HANGER SOLUTIONS, LLC | Apparatus and methods for highly available directory services in the distributed computing environment |
6014698, | May 19 1997 | AT HOME BONDHOLDERS LIQUIDATING TRUST | System using first banner request that can not be blocked from reaching a server for accurately counting displays of banners on network terminals |
6018516, | Nov 14 1997 | GEN DIGITAL INC | Method for minimizing unneeded retransmission of packets in a packet communication environment supporting a plurality of data link rates |
6021426, | Jul 31 1997 | AKAMAI TECHNOLOGIES, INC | Method and apparatus for dynamic data transfer on a web page |
6026440, | Jan 27 1997 | International Business Machines Corporation | Web server account manager plug-in for monitoring resources |
6029175, | Oct 26 1995 | Teknowledge Corporation | Automatic retrieval of changed files by a network software agent |
6029176, | Nov 25 1997 | NIELSEN COMPANY US , LLC , THE | Manipulating and analyzing data using a computer system having a database mining engine resides in memory |
6035332, | Oct 06 1997 | NCR Voyix Corporation | Method for monitoring user interactions with web pages from web server using data and command lists for maintaining information visited and issued by participants |
6038216, | Nov 01 1996 | Symantec Corporation | Method for explicit data rate control in a packet communication environment without data rate supervision |
6038310, | Aug 01 1994 | British Telecommunications public limited company | Service node for a telephony network |
6038610, | Jul 17 1996 | Microsoft Technology Licensing, LLC | Storage of sitemaps at server sites for holding information regarding content |
6041307, | Jan 23 1998 | WSOU Investments, LLC | Technique for effectively managing resources in a network |
6041324, | Nov 17 1997 | International Business Machines Corporation | System and method for identifying valid portion of computer resource identifier |
6044405, | Apr 12 1996 | COLORADO WSC, LLC | Service network incorporating geographically-remote hubs linked by high speed transmission paths |
6046980, | Dec 09 1996 | GEN DIGITAL INC | System for managing flow bandwidth utilization at network, transport and application layers in store and forward network |
6049831, | Nov 02 1996 | Level 3 Communications, LLC | System for transmitting network-related information where requested network information is separately transmitted as definitions and display information |
6052718, | Jan 07 1997 | Cisco Technology, Inc | Replica routing |
6052730, | Jan 10 1997 | BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY, THE | Method for monitoring and/or modifying web browsing sessions |
6061738, | Jun 27 1997 | D&I Systems, Inc. | Method and system for accessing information on a network using message aliasing functions having shadow callback functions |
6065051, | Apr 15 1998 | Meta Platforms, Inc | Apparatus and method for communication between multiple browsers |
6065062, | Dec 10 1997 | Cisco Systems, Inc. | Backup peer pool for a routed computer network |
6070191, | Oct 17 1997 | THE CHASE MANHATTAN BANK, AS COLLATERAL AGENT | Data distribution techniques for load-balanced fault-tolerant web access |
6078943, | Feb 07 1997 | International Business Machines Corporation; IBM Corporation | Method and apparatus for dynamic interval-based load balancing |
6081829, | Jan 31 1996 | Open Invention Network, LLC | General purpose web annotations without modifying browser |
6081835, | Apr 04 1996 | SUFFOLK TECHNOLOGIES, LLC | Internet server and method of controlling an internet server |
6085193, | Sep 29 1997 | GOOGLE LLC | Method and system for dynamically prefetching information via a server hierarchy |
6092112, | Jun 17 1996 | Matsushita Electric Industrial Co., Ltd. | Distributing information through an open network to many and unspecific clients being in different retaining situations with an information server |
6092204, | Oct 01 1996 | AT&T Corp | Filtering for public databases with naming ambiguities |
6094680, | Jun 27 1996 | Microsoft Technology Licensing, LLC | System and method for managing distributed resources on networks |
6098078, | Dec 29 1995 | THE CHASE MANHATTAN BANK, AS COLLATERAL AGENT | Maintaining consistency of database replicas |
6105028, | Jun 26 1997 | Hewlett-Packard Company | Method and apparatus for accessing copies of documents using a web browser request interceptor |
6108673, | Feb 25 1997 | International Business Machines Corporation | System for creating a form from a template that includes replication block |
6108703, | Jul 14 1998 | Massachusetts Institute of Technology | Global hosting system |
6112231, | Oct 18 1996 | AT&T Corp. | Server to cache protocol for improved web performance |
6112239, | Jun 18 1997 | Intervu, Inc | System and method for server-side optimization of data delivery on a distributed computer network |
6112240, | Sep 03 1997 | International Business Machines Corporation | Web site client information tracker |
6115357, | Jun 29 1998 | GEN DIGITAL INC | Method for pacing data flow in a packet-based network |
6115752, | May 21 1998 | Oracle America, Inc | System and method for server selection for mirrored sites |
6119143, | May 22 1997 | International Business Machines Corporation | Computer system and method for load balancing with selective control |
6119163, | Jul 06 1998 | Two-Way Media Ltd | Multicasting method and apparatus |
6125388, | May 31 1994 | TMI SOLUTIONS, LLC | System for transporting information objects between a user station and multiple remote sources based upon user modifiable object manifest stored in the user station |
6125394, | Jun 06 1997 | HANGER SOLUTIONS, LLC | Computer system having a plurality of resources and utilizing a selection mechanism to select the resources based upon historical loading |
6128601, | Aug 28 1997 | Cisco Technology, Inc | Active client to communications network connection apparatus and method |
6128660, | Mar 21 1996 | Intel Corporation | Network match maker |
6134583, | Jul 01 1996 | Oracle America, Inc | Method, system, apparatus and article of manufacture for providing identity-based caching services to a plurality of computer systems (#16) |
6144375, | Aug 14 1998 | CLOUD SOFTWARE GROUP, INC | Multi-perspective viewer for content-based interactivity |
6144702, | Jan 07 1991 | Greenwich Information Technologies, LLC | Audio and video transmission and receiving system |
6144996, | May 13 1998 | PALO ALTO NETWORKS, INC | Method and apparatus for providing a guaranteed minimum level of performance for content delivery over a network |
6151624, | Feb 03 1998 | R2 SOLUTIONS LLC | Navigating network resources based on metadata |
6154738, | Mar 27 1998 | PRODUCT ASSOCIATION TECHNOLOGIES, LLC | Methods and apparatus for disseminating product information via the internet using universal product codes |
6154744, | Jun 07 1995 | AKAMAI TECHNOLOGIES, INC | System and method for optimized storage and retrieval of data on a distributed computer network |
6154753, | Sep 15 1995 | COLORADO WSC, LLC | Document management system and method for business quality modeling |
6154777, | Jul 01 1996 | Oracle America, Inc | System for context-dependent name resolution |
6163779, | Sep 29 1997 | International Business Machines Corporation | Method of saving a web page to a local hard drive to enable client-side browsing |
6167427, | Nov 28 1997 | Alcatel Lucent | Replication service system and method for directing the replication of information servers based on selected plurality of servers load |
6173311, | Feb 13 1997 | GOOGLE LLC | Apparatus, method and article of manufacture for servicing client requests on a network |
6173322, | Jun 05 1997 | Hewlett Packard Enterprise Development LP | Network request distribution based on static rules and dynamic performance data |
6178160, | Dec 23 1997 | Cisco Technology, Inc | Load balancing of client connections across a network using server based algorithms |
6181690, | Jul 18 1997 | AT&T Corp | Toll-free internet service |
6181867, | Jun 07 1995 | Intervu, Inc | Video storage and retrieval system |
6185598, | Feb 10 1998 | SANDPIPER CDN, LLC | Optimized network resource location |
6185619, | Nov 07 1997 | Raytheon BBN Technologies Corp | Method and apparatus for balancing the process load on network servers according to network and serve based policies |
6189030, | Feb 21 1996 | DISNEY ENTERPRISES, INC | Method and apparatus for redirection of server external hyper-link references |
6189039, | Apr 10 1997 | Level 3 Communications, LLC | Selective tunneling of streaming data |
6195680, | Jul 23 1998 | SANDPIPER CDN, LLC | Client-based dynamic switching of streaming servers for fault-tolerance and load balancing |
6222856, | Jul 02 1996 | Microsoft Technology Licensing, LLC | Adaptive bandwidth throttling for individual virtual services supported on a network server |
6226618, | Aug 13 1998 | SANDPIPER CDN, LLC | Electronic content delivery system |
6226642, | Sep 11 1997 | Wistron Corporation | Content modification of internet web pages for a television class display |
6230196, | Nov 12 1997 | International Business Machines Corporation | Generation of smart HTML anchors in dynamic web page creation |
6243760, | Jun 24 1997 | Transcore Link Logistics Corporation | Information dissemination system with central and distributed caches |
6256675, | May 06 1997 | AT&T Corp | System and method for allocating requests for objects and managing replicas of objects on a network |
6263313, | Oct 22 1998 | SANDPIPER CDN, LLC | Method and apparatus to create encoded digital content |
6266699, | Apr 17 1996 | NOKIA SOLUTIONS AND NETWORKS GMBH & CO KG | Control in an intelligent network |
6269394, | Jun 07 1995 | System and method for delivery of video data over a computer network | |
6272566, | Nov 18 1998 | SANDPIPER CDN, LLC | System for maintaining proper buffering within video play list |
6282569, | Sep 11 1993 | International Business Machines Corp. | Name server computer having a load levelling facility to spread the load from client computers across a plurality of server computers |
6282574, | Mar 06 1997 | Verizon Patent and Licensing Inc | Method, server and telecommunications system for name translation on a conditional basis and/or to a telephone number |
6286045, | May 19 1997 | AT HOME BONDHOLDERS LIQUIDATING TRUST | Information storage and delivery over a computer network using centralized intelligence to monitor and control the information being delivered |
6292905, | Oct 02 1997 | Round Rock Research, LLC | Method for providing a fault tolerant network using distributed server processes to remap clustered network resources to other servers during server failure |
6298041, | Nov 01 1996 | Symantec Corporation | Method for explicit data rate control in a packet communication environment without data rate supervision |
6311214, | Aug 06 1998 | DIGIMARC CORPORATION AN OREGON CORPORATION | Linking of computers based on optical sensing of digital data |
6314565, | May 19 1997 | Intervu, Inc | System and method for automated identification, retrieval, and installation of multimedia software components |
6324580, | Sep 03 1998 | Oracle America, Inc | Load balancing for replicated services |
6327622, | Sep 03 1998 | Oracle America, Inc | Load balancing in a network environment |
6330602, | Apr 14 1997 | AVAYA Inc | Scaleable web server and method of efficiently managing multiple servers |
6332195, | Feb 09 1996 | McAfee, LLC | Secure server utilizing separate protocol stacks |
6347085, | Aug 16 1996 | NET2PHONE, INC | Method and apparatus for establishing communications between packet-switched and circuit-switched networks |
6351775, | May 30 1997 | SAP SE | Loading balancing across servers in a computer network |
6360256, | Jul 01 1996 | Oracle America, Inc | Name service for a redundant array of internet servers |
6360262, | Nov 24 1997 | International Business Machines Corporation | Mapping web server objects to TCP/IP ports |
6370571, | Mar 05 1997 | AT HOME BONDHOLDERS LIQUIDATING TRUST | System and method for delivering high-performance online multimedia services |
6370580, | Jul 25 1997 | XCELERA C O VIK BROTHERS INTERNATIONAL; XCELERA INC | Web serving system that coordinates multiple servers to optimize file transfers |
6398245, | Aug 13 1998 | SANDPIPER CDN, LLC | Key management system for digital content player |
6412000, | Nov 25 1997 | CA, INC | Method for automatically classifying traffic in a packet communications network |
6415280, | Apr 11 1995 | Personalweb Technologies, LLC | Identifying and requesting data in network using identifiers which are based on contents of data |
6418421, | Aug 13 1998 | SANDPIPER CDN, LLC | Multimedia player for an electronic content delivery system |
6418461, | Oct 06 1997 | Verizon Patent and Licensing Inc | Intelligent call switching node in an intelligent distributed network architecture |
6421726, | Mar 14 1997 | AKAMAI TCHNOLOGIES, INC | System and method for selection and retrieval of diverse types of video data on a computer network |
6424992, | Dec 23 1996 | International Business Machines Corporation | Affinity-based router and routing method |
6425005, | Oct 06 1997 | Verizon Patent and Licensing Inc | Method and apparatus for managing local resources at service nodes in an intelligent network |
6434622, | May 09 1996 | Two-Way Media Ltd | Multicasting method and apparatus |
6442549, | Jul 25 1997 | MEC MANAGEMENT, LLC | Method, product, and apparatus for processing reusable information |
6452925, | Apr 18 1996 | Intellectual Ventures II LLC | Universal access multimedia data network |
6456630, | Dec 08 1997 | CA, INC | Method for data rate control for heterogenous or peer internetworking |
6460082, | Jun 17 1999 | SANDPIPER CDN, LLC | Management of service-oriented resources across heterogeneous media servers using homogenous service units and service signatures to configure the media servers |
6463454, | Jun 17 1999 | SANDPIPER CDN, LLC | System and method for integrated load distribution and resource management on internet environment |
6463508, | Jul 19 1999 | SANDPIPER CDN, LLC | Method and apparatus for caching a media stream |
6470389, | Mar 14 1997 | AT&T Corp; Lucent Technologies, INC | Hosting a network service on a cluster of servers using a single-address image |
6480893, | Jul 25 1997 | CLEARWAY TECHNOLOGY, LLC | Web serving system |
6484204, | May 06 1997 | AT&T Corp. | System and method for allocating requests for objects and managing replicas of objects on a network |
6496856, | Jun 07 1995 | AKAMAI TECHNOLOGIES, INC | Video storage and retrieval system |
6502125, | Jun 07 1995 | AKAMAI TECHNOLOGIES, INC | System and method for optimized storage and retrieval of data on a distributed computer network |
6502215, | Oct 06 1995 | Micron Technology, Inc. | Self-test RAM using external synchronous clock |
6557054, | May 31 1994 | TMI SOLUTIONS, LLC | Method and system for distributing updates by presenting directory of software available for user installation that is not already installed on user station |
6581090, | Oct 14 1996 | XCELERA INC | Internet communication system |
6587837, | Aug 13 1998 | SANDPIPER CDN, LLC | Method for delivering electronic content from an online store |
6591299, | Nov 25 1997 | CA, INC | Method for automatically classifying traffic with enhanced hierarchy in a packet communications network |
6611862, | May 31 1994 | TMI SOLUTIONS, LLC | User station software that controls transport and presentation of content from a remote source |
6654807, | Feb 10 1998 | SANDPIPER CDN, LLC | Internet content delivery network |
6658464, | May 31 1994 | TMI SOLUTIONS, LLC | User station software that controls transport, storage, and presentation of content from a remote source |
6665706, | Jun 07 1995 | Akamai Technologies, Inc. | System and method for optimized storage and retrieval of data on a distributed computer network |
6741563, | Nov 01 1996 | GEN DIGITAL INC | Method for explicit data rate control in a packet communication environment without data rate supervision |
6763377, | Mar 03 2000 | SANDPIPER CDN, LLC | Asset management and scheduling graphical user interface for media streamer |
6779031, | Dec 12 1997 | Level 3 Communications, LLC | Network architecture with event logging |
6785704, | Dec 20 1999 | GOOGLE LLC | Content distribution system for operation over an internetwork including content peering arrangements |
6799221, | Jun 18 1997 | AKAMAI TECHNOLOGIES, INC | System and method for server-side optimization of data delivery on a distributed computer network |
6859791, | Aug 13 1998 | SANDPIPER CDN, LLC | Method for determining internet users geographic region |
6915329, | Jul 25 1996 | Xcelera | Web serving system |
6928442, | Apr 11 1995 | Personalweb Technologies, LLC | Enforcement and policing of licensed content using content-based identifiers |
6944676, | Jun 24 1997 | Transcore Link Logistics Corporation | Information dissemination system and method with central and distributed caches |
6963910, | Mar 03 2000 | SANDPIPER CDN, LLC | Graphical user interface for creating assets |
6973485, | Oct 07 1997 | Hitachi, Ltd. | Proxy server selecting server and proxy server |
7047300, | Feb 10 1998 | Sprint Communications Company L.P.; SPRINT COMMUNICATIONS COMPANY, L P | Survivable and scalable data system and method for computer networks |
7054935, | Feb 10 1998 | SANDPIPER CDN, LLC | Internet content delivery network |
7061923, | Oct 06 1997 | Verizon Patent and Licensing Inc | Method and apparatus for managing local resources at service nodes in an intelligent network |
7080153, | May 09 1996 | Two-Way Media Ltd | Multicasting method and apparatus |
7103564, | Aug 17 2000 | SANDPIPER CDN, LLC | Method and apparatus for performing personalization based on classification |
7110984, | Aug 13 1998 | SANDPIPER CDN, LLC | Updating usage conditions in lieu of download digital rights management protected content |
7117259, | Mar 03 2000 | SANDPIPER CDN, LLC | Server time window for multiple selectable servers in a graphical user interface |
7188085, | Jul 20 2001 | SANDPIPER CDN, LLC | Method and system for delivering encrypted content with associated geographical-based advertisements |
7206748, | Aug 13 1998 | SANDPIPER CDN, LLC | Multimedia player toolkit for electronic content delivery |
7254645, | Mar 30 2000 | NEC Corporation | Quality assured network service provision system compatible with a multi-domain network and service provision method and service broker device |
7266686, | May 09 1996 | Two-Way Media Ltd | Multicasting method and apparatus |
7600120, | May 09 1996 | Two-Way Media Ltd | System for delivering media |
7697415, | Oct 06 1997 | Verizon Patent and Licensing Inc | Method and apparatus for managing local resources at service nodes in an intelligent network |
7802310, | Apr 11 1995 | Personalweb Technologies, LLC | Controlling access to data in a data processing system |
7945539, | Apr 11 1995 | Personalweb Technologies, LLC | Distributing and accessing data in a data processing system |
7945544, | Apr 11 1995 | Personalweb Technologies, LLC | Similarity-based access control of data in a data processing system |
7945693, | Feb 10 1998 | SANDPIPER CDN, LLC | Controlling subscriber information rates in a content delivery network |
7949662, | Apr 11 1995 | Personalweb Technologies, LLC | De-duplication of data in a data processing system |
7949779, | Feb 10 1998 | SANDPIPER CDN, LLC | Controlling subscriber information rates in a content delivery network |
8001096, | Apr 11 1995 | Personalweb Technologies, LLC | Computer file system using content-dependent file identifiers |
8060613, | Feb 10 1998 | SANDPIPER CDN, LLC | Resource invalidation in a content delivery network |
8082262, | Apr 11 1995 | Personalweb Technologies, LLC | Methods, systems, and devices supporting data access in a data processing system |
8099420, | Apr 11 1995 | Personalweb Technologies, LLC | Accessing data in a data processing system |
20010052024, | |||
20020078233, | |||
20030149581, | |||
20040139097, | |||
20050038851, | |||
20050114296, | |||
20050198334, | |||
20060218265, | |||
20070233705, | |||
20070233706, | |||
20070233884, | |||
20080215755, | |||
20110219120, | |||
CA2202572, | |||
EP801487, | |||
EP817444, | |||
EP824236, | |||
EP865180, | |||
GB2281793, | |||
JP10027148, | |||
JP5162529, | |||
JP7066829, | |||
JP8044643, | |||
JP8328583, | |||
WO9642041, | |||
WO9711429, | |||
WO9729423, | |||
WO9804985, | |||
WO9806033, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Jan 22 2007 | SAAVIS COMMUNICATIONS CORPORATION | MOUNT SHASTA ACQUISITION LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 029876 | /0610 | |
Jan 22 2007 | MOUNT SHASTA ACQUISITION LLC | Level 3 Communications, LLC | MERGER SEE DOCUMENT FOR DETAILS | 029876 | /0659 | |
May 30 2007 | Level 3 Communications, LLC | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
Date | Maintenance Schedule |
Oct 30 2015 | 4 years fee payment window open |
Apr 30 2016 | 6 months grace period start (w surcharge) |
Oct 30 2016 | patent expiry (for year 4) |
Oct 30 2018 | 2 years to revive unintentionally abandoned end. (for year 4) |
Oct 30 2019 | 8 years fee payment window open |
Apr 30 2020 | 6 months grace period start (w surcharge) |
Oct 30 2020 | patent expiry (for year 8) |
Oct 30 2022 | 2 years to revive unintentionally abandoned end. (for year 8) |
Oct 30 2023 | 12 years fee payment window open |
Apr 30 2024 | 6 months grace period start (w surcharge) |
Oct 30 2024 | patent expiry (for year 12) |
Oct 30 2026 | 2 years to revive unintentionally abandoned end. (for year 12) |