An exemplary method of resource allocation is provided. A relationship is estimated between at least one controlled resource and at least one dependent parameter. A resource allocation of the at least one controlled resource is adjusted to effect a desired system goal expressed in terms of the at least one dependent parameter.
|
13. A system of resource allocation, comprising:
a system controller comprising a processor and a memory for estimating a relationship between at least one controlled resource and at least one dependent parameter; and
adjusting a resource allocation of the at least one controlled resource to achieve a desired system goal expressed in terms of the at least one dependent parameter, wherein adjusting the resource allocation comprises,
(a) setting a current allocation of the at least one controlled resource,
(b) obtaining at least one sample measurement of the at least one dependent parameter at the current allocation,
(c) combining the at least one sample measurement with the current allocation to form a tuple, and
(d) determining a new allocation based on a previous tuple and the tuple, wherein an upper bound and a lower bound each correspond to one of the previous tuple and the tuple, respectively, based upon a value of each tuple, and a value of the system goal is no more than the upper bound and no less than the lower bound.
7. A computer readable medium embodying instructions executed by a processor to perform a method of resource allocation, the method comprising:
estimating a relationship between at least one controlled resource and at least one dependent parameter; and
adjusting a resource allocation of the at least one controlled resource to effect a desired system goal expressed in terms of the at least one dependent parameter, wherein adjusting the resource allocation comprises,
(a) setting a current allocation of the at least one controlled resource,
(b) obtaining at least one sample measurement of the at least one dependent parameter at the current allocation,
(c) combining the at least one sample measurement with the current allocation to form a tuple, and
(d) determining a new allocation based on a previous tuple and the tuple, wherein an upper bound and a lower bound each correspond to one of the previous tuple and the tuple, respectively, based upon a value of each tuple, and a value of the system goal is no more than the upper bound and no less than the lower bound.
1. A method of resource allocation of processing time in a processor, comprising:
estimating a relationship between the processing time and a network bandwidth; and
adjusting a resource allocation of the processing time to effect a desired system goal expressed in terms of the network bandwidth, wherein adjusting the resource allocation comprises,
(a) setting a current allocation of the processing time,
(b) obtaining at least one sample measurement of the network bandwidth at the current allocation,
(c) combining the at least one sample measurement with the current allocation to form a tuple, and
(d) determining a new allocation based on a previous tuple and the tuple, wherein an upper bound and a lower bound each correspond to one of the previous tuple and the tuple, respectively, based upon a value of each tuple, and a value of the system goal is no more than the upper bound and no less than the lower bound, wherein the upper bound represents a first measure of processing time corresponding to a smallest observed network bandwidth measurement greater than the system goal and the lower bound represents a second measure of processing time corresponding to a largest observed network bandwidth measurement less than the system goal,
wherein the method is performed by the processor.
2. The method of
5. The method of
6. The method of
a desired utilization level of the network bandwidth;
a minimization of a distance metric between the desired utilization level and an actual utilization level of the network bandwidth;
a maximization of the network bandwidth; or
a minimization of the network bandwidth.
8. The computer readable medium of
9. The computer readable medium of
10. The computer readable medium of
11. The computer readable medium of
12. The computer readable medium of
a desired utilization level of the at least one dependent parameter;
a minimization of a distance metric between the desired utilization level and an actual utilization level of the at least one dependent parameter;
a maximization of the at least one dependent parameter; or
a minimization of the at least one dependent parameter.
14. The system of
a computer system comprising a central processing unit (CPU), the computer system operatively connected to the system controller;
wherein the system controller adjusts the resource allocation of CPU in the computer system.
15. The system of
a complex system capable of controlling the at least one controlled resource.
16. The system of
17. The system of
18. The system of
19. The system of
a desired utilization level of the at least one dependent parameter;
a minimization of a distance metric between the desired utilization level and an actual utilization level of the at least one dependent parameter;
a maximization of the at least one dependent parameter; or
a minimization of the at least one dependent parameter.
20. The method of
21. The method of
22. The method of
a pre-determined distance from a target utilization level;
a maximum number of iterations; and
a maximum amount of elapsed real time.
23. The computer readable medium of
24. The computer readable medium of
25. The computer readable medium of
a pre-determined distance from a target utilization level;
a maximum number of iterations; and
a maximum amount of elapsed real time.
26. The system of
27. The system of
28. The system of
a pre-determined distance from a target utilization level;
a maximum number of iterations; and
a maximum amount of elapsed real time.
|
This invention was made with Government support under Contract No.: H98230-04-3-0001 awarded by the U.S. Department of Defense. The Government has certain rights in this invention.
1. Field of the Invention
The present invention relates generally to the field of system resource allocation, and, more particularly, to system control via resource allocation to satisfy measurable performance objectives.
2. Description of the Related Art
System management generally involves one or more steps of resource allocation. A good allocation of resources is essential for maintaining efficient operation of the system, and generally results in neither over-provisioning which is costly, nor under-provisioning, which can sacrifice the quality of the system output. Indeed, effective allocation of the system resources is an important part of maintaining or exceeding system performance requirements and targets. Resources that can be allocated may include, but are not limited to, central processing unit (“CPU”), bandwidth, memory, database handles, threads, and CPU scheduler priority, and input/output (“I/O”) scheduler priority.
One step in allocating resources involves defining one or more desired goals for the system functioning. As used herein, the term “goal” refers to the maximization/minimization of, the achievement of a desired upper/lower bound on, or attempting to achieve some target value of, some function of one or more measurable system characteristics. These characteristics may include measurements of the utilization of system resources, volume or quality measurements on system outputs, and the like. Inter-dependencies between resources may be very important when the goal is based on system resource utilization; that is, utilization of one resource may depend on the utilization of one or more other resources.
For example, in a computer system, the percentage of the CPU time used for an application running on the system (i.e., the application's CPU utilization) may depend on the amount of network bandwidth allocated to pass data to that application. In other words, in this example, CPU utilization and network bandwidth are two inter-dependent resources in a computer system. The system goals will often be expressed in terms of one resource. Consider, for example, a system with two resources, A and B. If resource A depends upon resource B, in the manner defined above, then, by adapting the allocation of resource B, one can indirectly control the utilization of resource A. In the above example, the resources can be equally swapped, and network bandwidth utilization can be controlled via the allocation of CPU resources.
Given different allocations of a specific resource, measurements of the system can be taken to determine the degree to which goals are satisfied. These measurements can be used to adaptively “learn” the relationships between resources of the system and system goals (recall that the system goal(s) may depend directly on utilization of some system resources). The relationships between resources and goals in systems are usually too complex to be known a priori, and difficult (or, as in stochastic systems, impossible) to obtain exactly. Thus, the learned estimates of these relationships can be very useful substitutes for exact relationships when deciding resource allocations in the system to achieve system goals.
In complex systems, which are precisely those systems that require control methods, the measurements are “noisy” (i.e., they include random variations over a short time scale), thereby making them inaccurate to some degree. Examples of these measurements in a computer system include the amount of CPU utilization by a particular process and the amount of memory utilization by a particular process. CPU utilization measurements may be “noisy” over short time intervals because they depend on (a) the complicated operation of operating system (“OS”) schedulers, (b) background system processes that use an often time-varying amount of CPU utilization, and (c) different states of program execution that may cause temporary fluctuations in the CPU utilization.
Moreover, obtaining the measurements can often perturb the system. For example, consider an example of obtaining measurements of the quality of a system given a particular resource allocation. Building a model of the relationship between the resource and the quality of the system may require setting the system to many different resource allocation levels. However, perturbing the system by changing resource allocation levels, especially when done repeatedly, may result in oscillations of one or more dependent quantities in the system, including the system measurements that figure into the goal function(s)).
Other methods for system management often suffer from one or more of the following drawbacks:
In one aspect of the present invention, a method of resource allocation is provided. The method includes the steps of estimating a relationship between at least one controlled resource and at least one dependent parameter; and adjusting a resource allocation of the at least one controlled resource to effect a desired system goal expressed in terms of the at least one dependent parameter.
In another aspect of the present invention, a machine-readable medium having instructions stored thereon for execution by a processor to perform a method of resource allocation is provided. The method includes the steps of estimating a relationship between at least one controlled resource and at least one dependent parameter; and adjusting a resource allocation of the at least one controlled resource to effect a desired system goal expressed in terms of the at least one dependent parameter.
In yet another aspect of the present invention, a system of resource allocation is provided. The system includes a system controller for (a) estimating a relationship between at least one controlled resource and at least one dependent parameter, and (b) adjusting a resource allocation of the at least one controlled resource to achieve a desired system goal expressed in terms of the at least one dependent parameter.
The invention may be understood by reference to the following description taken in conjunction with the accompanying drawings, in which like reference numerals identify like elements, and in which:
Illustrative embodiments of the invention are described below. In the interest of clarity, not all features of an actual implementation are described in this specification. It will of course be appreciated that in the development of any such actual embodiment, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which will vary from one implementation to another. Moreover, it will be appreciated that such a development effort might be complex and time-consuming, but would nevertheless be a routine undertaking for those of ordinary skill in the art having the benefit of this disclosure.
While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof have been shown by way of example in the drawings and are herein described in detail. It should be understood, however, that the description herein of specific embodiments is not intended to limit the invention to the particular forms disclosed, but on the contrary, the intention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the invention as defined by the appended claims.
It is to be understood that the systems and methods described herein may be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof. In particular, at least a portion of the present invention is preferably implemented as an application comprising program instructions that are tangibly embodied on one or more computer readable medium (e.g., hard disk, magnetic floppy disk, RAM, ROM, CD ROM, etc.) and executable by any device or machine comprising suitable architecture, such as a general purpose digital computer having a processor, memory, and input/output interfaces. It is to be further understood that, because some of the constituent system components and process steps depicted in the accompanying Figures are preferably implemented in software, the connections between system modules (or the logic flow of method steps) may differ depending upon the manner in which the present invention is programmed. Given the teachings herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations of the present invention.
In an exemplary embodiment of the present invention, a novel system controller is presented, which simultaneously performs the following:
(a) learns the relationship between the controlled resource(s) and the dependent parameters. In some controlled systems system goal function(s) are expressed in terms of the dependent parameters; and
(b) adjusts resource allocation(s) of the controlled resource(s) to drive the system towards the desired goal points.
The simultaneous achievement of these two steps differentiates the present invention from prior art. Further, the described method is able to operate efficiently, requiring minimal state (i.e., minimal memory overhead in a software implementation, or minimal component complexity in an analog implementation) and with infrequent or no perturbations to the running system.
The resource allocation methods described herein may be applied to any of a variety of types of systems, as contemplated by those skilled in the art. Solely for illustrative purposes, the exemplary system illustrated herein concerns a computer system. However, it should be appreciated that the present invention is not so limited and may be applied to any system which can be controlled, including, but not limited to, manufacturing systems, logistic systems, and chemical systems.
The invention described herein may apply to any case where allocations of one or more controllable resources are adjusted to effect a desired system response (i.e., a goal). Further, the present invention does not rely on prior known models of resource and goal relationships.
We begin by summarizing some of the terminology used in the present invention. We assume that a complex system utilizes at least two resources for executing an application. One resource is referred herein as a controlled resource. The controlled resource is a resource in which a user of the complex system has control over its utilization. Dependent parameters are measurable characteristics of the system which are used in measuring how well the system is satisfying or approaching the desired goal(s). A controlled resource can also be a dependent parameter, if it figures into the goal function. The utilization level of another system resource (i.e., a system resource other than the controlled resource) may be a dependent parameter, in which case this other system resource may be equivalently referred to as a dependent resource. The value of the dependent parameter is dependent, at least in part, upon the allocation of the controlled resource. We use the term goal to refer generally to any measurable performance objective, which can be expressed in terms of minimizing, maximizing, or minimizing the error around some desired level of a goal function with parameters that may include the controlled resource and the dependent parameter(s). Exemplary embodiments of the present invention build an estimate of the relationship between the controlled resource(s) and the dependent parameter(s) that we refer to as the model.
We present exemplary methods for adaptively learning the relationship between one or more controlled resources and one or more dependent parameters in a complex system in which data concerning the level of satisfaction of a goal may be noisy (i.e., inaccurate). The relation can be used, for example, to determine a preferred allocation of a controlled resource to achieve performance targets measured in terms of the dependent parameter(s). The exemplary methods described herein are efficient and require minimal data storage, and, as such, may be particularly useful in real-time control of a complex system, in which resources must be allocated in a relatively short amount of time. The method is particularly preferable to existing art when applied to systems where altering resource allocations can cause significant perturbations to the system. Since the method is very economical in the number of times it requires the controlled resource allocation to be adjusted, the negative effects of such changes are smaller than those in existing methods.
The model (i.e., the estimated relationship between control resource(s) and dependent parameter(s)) may be initialized to a simple linear function or some other preset function. At each iteration (i.e., step) of the method, the controlled resource allocation is set, and one or more sample measurements of the dependent parameter are obtained at this controlled resource allocation so as to learn the relationship between the controlled resource allocation(s) and the dependent parameter(s). If more than one sample is taken, the samples may be combined in such a way as to reduce the impact of noise or inaccurate data (e.g., simple averaging, low-pass filtering). The controlled resource allocation and the dependent parameter measurement form a tuple.
After measuring the dependent parameter(s), the tuple (i.e., controlled resource allocation, dependent parameter measurement) is a data point in an n-dimensional space of real numbers (Rn). The model is updated by the addition of this tuple. In a preferred embodiment, the list of previously-recorded tuples can be connected by line segments to form a piece-wise linear function which is suitable for use as a model. However, only a few such tuples of the piecewise-linear model need to be stored at any time. For example, when the goal involves trying to achieve a particular utilization level (referred herein as a “target level”) of a single dependent parameter which varies monotonically with a controlled resource, only two tuples will need to be stored, a lower bound and an upper bound, on either side of the target level (i.e., the tuple with the largest observed dependent parameter measurement that is less than the target, and the tuple with the smallest observed dependent parameter measurement that is greater than the target). The method described will then choose a new allocation of the controlled resource which is predicted to result in a new tuple with the dependent parameter measurement between the current upper and lower bounds. Thus the new allocation should become either the upper or the lower bound after the next iteration. In the above example, at each step, the distance between the two bounds decreases, as the points which are computed by the method are successively closer to the target level.
When a stopping criterion is reached, the best value of the controlled resource is provided for achieving the goal. Stopping criteria may include, but are not limited to, pre-determined distance from the target value, a maximum number of iterations, a maximum amount of elapsed real time, and the like.
Although not so limited, we present, for the sake of simplicity, an illustrative method of the present invention on a computing system that handles stream processing (e.g., streaming multimedia data). Another application of the present invention may include capacity allocation in logistic or manufacturing systems. In such systems, the controlled resource is the level of capacity to add to the system. The dependent parameter is the level of quality of logistic or manufacturing system, which is observable, but real-time measurements may be inaccurate. An exemplary goal for such a system is achieving a set of desired processing capacity allocations for the system. Another goal may include satisfying response time objectives. Yet another application of the present invention may be in chemical production and processing systems.
We now consider a typical computing system composed of a set of processes (e.g., running instances of some data-processing program) hosted on a single node. In this illustrative application, the controlled resource is network bandwidth, the dependent resource is the CPU, and the goal is a target number of CPU share on the node.
The method builds an estimate of the relationship between each controlled resource and the values of the dependent parameter(s) only, as opposed to estimating explicitly the relationships between many controlled resources and each dependent parameter. This approximation allows for the method to be very rapid and to require minimal data storage. Interactions across controlled resources affect the system in two ways: (a) through a constraint on the total amount of controlled resource available, and (b) through the noise in measurements.
More specifically, the goal of the system depends upon the unknown relationship between each dependent parameter and the controlled resource. The mapping describing each of these relationships is constructed iteratively, and the next value of the resource level is determined by examining the direction that leads to a value of the goal metric closer to the desired value.
At each iteration, sample measurements of the dependent parameter(s) are taken at the current allocation of the controlled resource. As these measurements may be noisy, multiple measurements may be taken. Using a statistical technique such as one which determines the expected value from a set of random measurements, a single expected value can be obtained. One example of such a technique is to remove outliers (i.e., measurements more than a given number of standard deviations from the mean of the measurements) and then to take the mean of the remaining values.
The expected value is then used to update the model. The expected value adjusts the piecewise-linear function so that the new point (i.e., controlled resource-goal value pair) is included in the model function. In other words, the piecewise-linear function has as many segments with different slopes as the number of iterations plus one.
For many goal functions, only two points need to be stored for each process since one segment of the relationship function will contain the target value within its lower and upper limit. At each iteration, in this example, either the upper or lower limit will be updated by a new point which is closer to the target value. Hence, the distance between the two extremities of the active piece will shrink at each step.
Let Ci(Bi) be a smoothed estimate (e.g., an expected value) of the unknown relationship function for some value i. Then, Bi represents the value of the controlled resource. The goal function which depends upon the unknown relation Ci(Bi) is given by the notation F(B). The goal function may include n different components i=1 . . . n, each with its own relations to be learned by the method, Ci(Bi).
Following are the steps of the method in an exemplary embodiment of the method in which the goal function is to try to achieve target values for the dependent parameter, in this case referred to as Ci.
(1) Initialization: The target values be referred to by the n-vector, T. Set iteration counter, j=1. Set initial controlled resource value to a given starting point Bi or set to Bi=maximum_controlled_resource_capacity/n for every i=1, . . . n if no initial point is provided. Define the initial values Ci(LB(i))=0 and Ci(UB(i))=1, for all i=1, . . . n.
(2) Main loop: While the stopping criterion has not been reached, repeat:
(f) Update: Set B(j+1)=B(j)−G(j)(B)*S(j), and set j=j+1.
The particular embodiments disclosed above are illustrative only, as the invention may be modified and practiced in different but equivalent manners apparent to those skilled in the art having the benefit of the teachings herein. Furthermore, no limitations are intended to the details of construction or design herein shown, other than as described in the claims below. It is therefore evident that the particular embodiments disclosed above may be altered or modified and all such variations are considered within the scope and spirit of the invention. Accordingly, the protection sought herein is as set forth in the claims below.
Pendarakis, Dimitrios, Wynter, Laura, Silber, Jeremy I.
Patent | Priority | Assignee | Title |
8862765, | Oct 18 2007 | ARRIS ENTERPRISES LLC | Fair bandwidth redistribution algorithm |
Patent | Priority | Assignee | Title |
5077661, | May 03 1989 | Hewlett-Packard Company | Assignment-dependent resource allocation method |
6393455, | Mar 28 1997 | SHELL ELASTOMERS LLC | Workload management method to enhance shared resource access in a multisystem environment |
6584488, | Apr 12 1999 | TREND MICRO INCORPORATED | Controlling allocation of system resources with an enhanced priority calculation |
6785889, | Jun 15 2000 | Aurema, Inc.; AUREMA, PTY LTD | System and method for scheduling bandwidth resources using a Kalman estimator with active feedback |
6876668, | May 24 1999 | Cisco Technology, Inc. | Apparatus and methods for dynamic bandwidth allocation |
20020032850, | |||
20040107281, | |||
20040136379, | |||
20040143664, | |||
20040210871, | |||
20050038834, | |||
EP346039, | |||
EP1469387, | |||
WO2005017783, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Oct 18 2005 | PENDARAKIS, DIMITRIOS | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 018091 | /0180 | |
Oct 18 2005 | SILBER, JEREMY I | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 018091 | /0180 | |
Oct 19 2005 | WYNTER, LAURA | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 018091 | /0180 | |
Jan 06 2006 | International Business Machines Corporation | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
May 11 2010 | ASPN: Payor Number Assigned. |
Dec 27 2013 | REM: Maintenance Fee Reminder Mailed. |
May 18 2014 | EXP: Patent Expired for Failure to Pay Maintenance Fees. |
Date | Maintenance Schedule |
May 18 2013 | 4 years fee payment window open |
Nov 18 2013 | 6 months grace period start (w surcharge) |
May 18 2014 | patent expiry (for year 4) |
May 18 2016 | 2 years to revive unintentionally abandoned end. (for year 4) |
May 18 2017 | 8 years fee payment window open |
Nov 18 2017 | 6 months grace period start (w surcharge) |
May 18 2018 | patent expiry (for year 8) |
May 18 2020 | 2 years to revive unintentionally abandoned end. (for year 8) |
May 18 2021 | 12 years fee payment window open |
Nov 18 2021 | 6 months grace period start (w surcharge) |
May 18 2022 | patent expiry (for year 12) |
May 18 2024 | 2 years to revive unintentionally abandoned end. (for year 12) |