A system includes a server associated with a resource utilization, a database storing historical data including resource utilization values over a first time period, and a processor. The processor identifies, from the historical data, a maximum resource utilization value and determines a duration of time for which the resource utilization exceeds a percentage of the maximum. The processor predicts, based on the historical data, a maximum predicted resource utilization value over a second time period, later than the first. The processor also determines, based on the historical data, an upper bound of a resource utilization confidence interval. The processor generates, based on the maximum value over the first time period, the duration of time, the maximum predicted value over the second time period, and the upper bound, a recommendation to consolidate the server with a second server and/or to release computational resources. The processor transmits the recommendation to an administrator.
|
8. A method comprising:
accessing historical data for a first server, the first server associated with a resource utilization, the historical data comprising time-series values of the resource utilization of the first server collected from the first server over a first period of time;
identifying, from historical data, a maximum value of the resource utilization of the first server from the time-series values of the resource utilization collected from the first server over the first period of time;
determining, from the historical data, a first duration of time comprising a number of distinct times during which the time-series values of the resource utilization of the first server collected from the first server over the first period of time exceed a given percentage of the maximum value of the resource utilization;
predicting, based on the historical data, a maximum predicted value of the resource utilization of the first server over a second period of time later than the first period of time;
determining, based on the historical data, a first confidence interval for the resource utilization of the first server, the first confidence interval comprising an upper bound of the first confidence interval;
generating, based at least on the maximum value of the resource utilization of the first server collected from the first server over the first period of time, the first duration of time, the maximum predicted value of the resource utilization of the first server over the second period of time, and the upper bound of the first confidence interval, a recommendation comprising at least one of:
a recommendation to consolidate the first server with a second server; and
a recommendation to release computation resources of the first server to the second server;
transmitting the recommendation to a user device associated with a system administrator;
in response to generating the recommendation:
identifying one or more applications running on the first server; and
transferring the one or more applications running on the first server to the second server to release the computational resources of the first server.
1. A system comprising:
a first server associated with a resource utilization;
a database configured to store historical data for the first server comprising time-series values of the resource utilization of the first server collected from the first server over a first period of time; and
a hardware processor configured to:
identify, from the historical data, a maximum value of the resource utilization from the time-series values of the resource utilization of the first server collected from the first server over the first period of time;
determine, from the historical data, a first duration of time comprising a number of distinct times during which the time-series values of the resource utilization of the first server collected from the first server over the first period of time exceed a given percentage of the maximum value of the resource utilization;
predict, based on the historical data, a maximum predicted value of the resource utilization of the first server over a second period of time later than the first period of time;
determine, based on the historical data, a first confidence interval for the resource utilization of the first server, the first confidence interval comprising an upper bound of the first confidence interval;
generate, based at least on the maximum value of the resource utilization of the first server collected from the first server over the first period of time, the first duration of time, the maximum predicted value of the resource utilization of the first server over the second period of time, and the upper bound of the first confidence interval, a recommendation comprising at least one of:
a recommendation to consolidate the first server with a second server; and
a recommendation to release computational resources of the first server to the second server;
transmit the recommendation to a user device associated with a system administrator;
in response to generating the recommendation:
identify one or more applications running on the first server; and
transfer the one or more applications running on the first server to the second server to release the computational resources of the first server.
14. An apparatus comprising:
a hardware processor configured to:
access a database storing historical data for a first server, the first server associated with a resource utilization, the historical data comprising time-series values of the resource utilization of the first server collected from the first server over a first period of time;
identify, from the historical data, a maximum value of the resource utilization of the first server from the time-series values of the resource utilization collected from the first server over the first period of time;
determine, from the historical data, a first duration of time comprising a number of distinct times during which the time-series values of the resource utilization of the first server collected from the first server over the first period of time exceed a given percentage of the maximum value of the resource utilization;
predict, based on the historical data, a maximum predicted value of the resource utilization of the first server over a second period of time later than the first period of time;
determine, based on the historical data, a first confidence interval for the resource utilization of the first server, the first confidence interval comprising an upper bound of the first confidence interval;
generate, based at least on the maximum value of the resource utilization of the first server collected from the first server over the first period of time, the first duration of time, the maximum predicted value of the resource utilization of the first server over the second period of time, and the upper bound of the first confidence interval, a recommendation comprising at least one of:
a recommendation to consolidate the first server with a second server; and
a recommendation to release computation resources of the first server to the second server;
transmit the recommendation to a user device associated with a system administrator;
in response to generating the recommendation:
identify one or more applications running on the first server; and
transfer the one or more applications running on the first server to the second server to release the computational resources of the first server.
2. The system of
3. The system of
the processor is further configured to:
determine whether a percentage of the first duration of time over the first period of time is less than a threshold; and
in response to determining the percentage of the first duration of time over the first period of time is less than the threshold, remove the maximum value of the resource utilization from the historical data, wherein:
predicting the maximum predicted value of the resource utilization of the first server over the second period of time is further based on the determination that the percentage of the first duration of time over the first period of time is less than the threshold; and
determining the first confidence interval for the resource utilization of the first server is further based on the determination that the percentage of the first duration of time over the first period of time is less than the threshold.
4. The system of
computer processing units (CPUs);
random-access memory (RAM); and
disk storage.
5. The system of
implementing a gradient descent algorithm; and
implementing a simplex method.
6. The system of
7. The system of
the processor is further configured to:
determine, from the historical data, a plurality of duration of times during which the values of the resource utilization of the first server collected from the first server over the first period of time exceed a given percentage of the maximum value of the resource utilization;
determine whether a percentage of a total of the duration of times over the first period of time is more than a threshold; and
in response to determining the percentage of the plurality of duration of times over the first period of time is more than the threshold, determine the maximum value of the resource utilization from the historical data is normal.
9. The method of
10. The method of
determining whether a percentage of the first duration of time over the first period of time is less than a threshold; and
in response to determining the percentage of the first duration of time over the first period of time is less than the threshold, removing the maximum value of the resource utilization from the historical data, wherein:
predicting the maximum predicted value of the resource utilization of the first server over the second period of time is further based on the determination that the percentage of the first duration of time over the first period of time is less than the threshold; and
determining the first confidence interval for the resource utilization of the first server is further based on the determination that the percentage of the first duration of time over the first period of time is less than the threshold.
11. The method of
computer processing units (CPUs);
random-access memory (RAM); and
disk storage.
12. The method of
implementing a gradient descent algorithm; and
implementing a simplex method.
13. The method of
15. The apparatus of
16. The apparatus of
the processor is further configured to:
determine whether a percentage of the first duration of time over the first period of time is less than a threshold;
in response to determining the percentage of the first duration of time over the first period of time is less than the threshold, remove the maximum value of the resource utilization from the historical data, wherein:
predicting the maximum predicted value of the resource utilization of the first server over the second period of time is further based on the determination that the percentage of the first duration of time over the first period of time is less than the threshold; and
determining the first confidence interval for the resource utilization of the first server is further based on the determination that the percentage of the first duration of time over the first period of time is less than the threshold.
17. The apparatus of
computer processing units (CPUs);
random-access memory (RAM); and
disk storage.
18. The apparatus of
|
This invention relates generally to server architecture and usage, and specifically to a real-time server capacity optimization tool.
In many large data centers, server utilization rates may not be optimal. For example, some servers may be underutilized, resulting in an inefficient use of computer resources and unnecessary infrastructure complexity. On the other hand, other servers may be operating at or near capacity, risking performance issues and potential failures of the applications running on the servers.
This disclosure contemplates a real-time server capacity optimization tool that may be used to monitor a set of servers and to automatically reconfigure the servers to address server underutilization and/or overutilization. The tool identifies sub-optimal server capacity by monitoring and analyzing the historical resource utilization of each server in terms of its percentage utilization of disk storage, CPU, and/or RAM. Based on this historical resource utilization data, the tool predicts maximum resource utilizations of the server, from which the tool may identify an under/overutilization of the server. In response to identifying an underutilized server, the tool may release a portion of the computational resources of the server and/or consolidate the server with one or more other servers. In this manner, certain embodiments may free up valuable computational resources for use by overutilized servers and improve the overall efficiency of the server system. Certain embodiments of the tool are described below.
According to one embodiment, a system includes a first server, a database, and a hardware processor. The first server is associated with a resource utilization. The database stores historical data for the first server. The historical data includes values of the resource utilization of the first server collected from the first server over a first period of time. The hardware processor identifies, from the historical data, a maximum value of the resource utilization of the first server collected from the first server over the first period of time. The processor also determines, from the historical data, a first duration of time for which the values of the resource utilization of the first server collected from the first server over the first period of time exceed a given percentage of the maximum value. The processor additionally predicts, based on the historical data, a maximum predicted value of the resource utilization of the first server over a second period of time later than the first period of time. The processor further determines, based on the historical data, a first confidence interval for the resource utilization of the first server. The first confidence interval includes an upper bound. The processor also generates, based at least on the maximum value of the resource utilization of the first server collected from the first server over the first period of time, the first duration of time, the maximum predicted value of the resource utilization of the first server over the second period of time, and the upper bound, a recommendation. The recommendation includes at least one of a recommendation to consolidate the first server with a second server and a recommendation to release computational resources of the first server. The processor additionally transmits the recommendation to a system administrator.
According to another embodiment, a method includes accessing historical data for a first server. The first server is associated with a resource utilization. The historical data includes values of the resource utilization of the first server collected from the first server over a first period of time. The method also includes identifying, from historical data, a maximum value of the resource utilization of the first server collected from the first server over the first period of time. The method additionally includes determining, from the historical data, a first duration of time for which the values of the resource utilization of the first server collected from the first server over the first period of time exceed a given percentage of the maximum value. The method further includes predicting, based on the historical data, a maximum predicted value of the resource utilization of the first server over a second period of time later than the first period of time. The method also includes determining, based on the historical data, a first confidence interval for the resource utilization of the first server. The first confidence interval includes an upper bound. The method additionally includes generating, based at least on the maximum value of the resource utilization of the first server collected from the first server over the first period of time, the first duration of time, the maximum predicted value of the resource utilization of the first server over the second period of time, and the upper bound, a recommendation. The recommendation includes at least one of a recommendation to consolidate the first server with a second server and a recommendation to release computation resources of the first server. The method further includes transmitting the recommendation to a system administrator.
According to a further embodiment, an apparatus includes a hardware processor. The processor accesses a database storing historical data for a first server. The first server is associated with a resource utilization. The historical data includes values of the resource utilization of the first server collected from the first server over a first period of time. The processor also identifies, from the historical data, a maximum value of the resource utilization of the first server collected from the first server over the first period of time. The processor additionally determines, from the historical data, a first duration of time for which the values of the resource utilization of the first server collected from the first server over the first period of time exceed a given percentage of the maximum value. The processor further predicts, based on the historical data, a maximum predicted value of the resource utilization of the first server over a second period of time later than the first period of time. The processor also determines, based on the historical data, a first confidence interval for the resource utilization of the first server. The first confidence interval includes an upper bound. The processor additionally generates, based at least on the maximum value of the resource utilization of the first server collected from the first server over the first period of time, the first duration of time, the maximum predicted value of the resource utilization of the first server over the second period of time, and the upper bound, a recommendation. The recommendation includes at least one of a recommendation to consolidate the first server with a second server and a recommendation to release computation resources of the first server. The processor further transmits the recommendation to a system administrator.
Certain embodiments provide one or more technical advantages. As an example, an embodiment reduces the overall computational resources consumed by a data center, by consolidating underutilized servers. As another example, an embodiment improves the performance of the applications running on the servers, by reducing the occurrence of downtime associated with application failures arising from server overutilization. The system described in the present disclosure may particularly be integrated into a practical application of a server capacity optimization tool for use by an organization to reduce infrastructure complexity and improve the efficiency of the organization's server resources, as compared to a traditional server system operating without the tool. In particular, the tool may be used to continuously monitor any number of servers belonging to the organization, to automatically identify sub-optimal server capacity and reconfigure the servers accordingly.
Certain embodiments may include none, some, or all of the above technical advantages. One or more other technical advantages may be readily apparent to one skilled in the art form the figures, descriptions, and claims included herein.
For a more complete understanding of the present disclosure, reference is now made to the following description, taken in conjunction with the accompanying drawings, in which:
Embodiments of the present disclosure and its advantages may be understood by referring to
I. System Overview
Devices 106 may be used by users 104 to send inputs 138 to capacity optimizing tool 102 and/or to receive outputs 142 from capacity optimizing tool 102. In certain embodiments, input 138 may include a request for capacity optimizing tool 102 to transmit recommendations to user 104, as output 142. For example, in certain embodiments, capacity optimizing tool 102 may analyze historical data 124 at regular intervals and then transmit recommendations 142, based on such analysis, to user 104 in response to a request 138 for such recommendations. As another example, in some embodiments, capacity optimizing tool 102 may both analyze historical data 124 and transmit recommendations 142 in response to receiving a request 138 for such recommendations from user 104. In some embodiments, transmitting recommendations 142 to user 104 may include displaying a graphical user interface on device 106, where the graphical user interface depicts the recommendations.
In certain embodiments, inputs 138 may include inputs for use by capacity optimizing tool 102 in analyzing historical data 124. For example, as described in further detail below, in the discussion of
Devices 106 include any appropriate device for communicating with components of system 100 over network 108. For example, devices 106 may be a telephone, a mobile phone, a computer, a laptop, a wireless or cellular telephone, a tablet, a server, and IoT device, and/or an automated assistant, among others. This disclosure contemplates devices 106 being any appropriate device for sending and receiving communications over network 108. Device 106 may also include a user interface, such as a display, a microphone, keypad, or other appropriate terminal equipment usable by user 104. In some embodiments, an application executed by a processor of device 106 may perform the functions described herein.
Network 108 facilitates communication between and amongst the various components of system 100. This disclosure contemplates network 108 being any suitable network operable to facilitate communication between the components of system 100. Network 108 may include any interconnecting system capable of transmitting audio, video, signals, data, messages, or any combination of the preceding. Network 108 may include all or a portion of a public switched telephone network (PSTN), a public or private data network, a local area network (LAN), a metropolitan area network (MAN), a wide area network (WAN), a local, regional, or global communication or computer network, such as the Internet, a wireline or wireless network, an enterprise intranet, or any other suitable communication link, including combinations thereof, operable to facilitate communication between the components.
Servers 110 may be located on, or otherwise connected to, network 108. System 100 may include any number of servers 110a through 110n. Servers 110 may include application servers, database servers, file servers, mail servers, print servers, web servers, or any other type of server that provides computational functionality to users 104. For example, servers 110 may be used to test and/or run applications 112a through 112d. An application 112 submitted to servers 110 may use one or more servers 110 when executing. When an application uses more than one server 110, communication between those servers used by the application may occur over network 108.
Each server 110a through 110n is associated with a set of computational resources. These computational resources may include one or more central processing units (CPUs), an amount of random-access memory (RAM), an amount of disk storage, and/or any other computational resources associated with servers 110a through 110n. Applications 112a through 112d, running on servers 110a through 110n, consume at least a portion of the computational resources of the servers. For example, application 110a, running on server 110a, consumes a portion of the CPU, RAM, and disk storage of server 110a. Accordingly, each server 110a through 110n is associated with a resource utilization 140, indicating the percentage utilization of one or more of the server's computational resources. For example, the resource utilization for server 110a may indicate that 30% of the server's CPUs are currently in use, 45% of the server's RAM is currently in use, and 23% of the server's disk storage is currently in use.
In certain embodiments, servers 110 may include cloud-based servers. For example, servers 110 may include virtual servers running in a cloud computing environment offered by a hosting provider. In such embodiments, adjusting the computational resources associated with a given cloud-based server may include sending a reconfiguration command 144 to the hosting provider. For example, a reconfiguration command 144 may be used to release a portion of the computational resources associated with cloud-based server 110a. In some embodiments, servers 110 may include on-premises, physical servers. For example, servers 110 may include physical servers located in a datacenter operated by an organization running capacity optimizing tool 102. In such embodiments, adjusting the computational resources associated with a given physical server may include sending a recommendation 142 to a system administrator 104, to physically adjust the computation resources of the server. For example, in response to receiving a recommendation 142 to release 50% of the RAM associated with server 110a, a system administrator 104 may physically remove one or more RAM cards from server 110a.
Database 122 stores historical data 124 gathered from servers 110a through 110n. For example, for each server 110a through 110n, database 122 may store historical data 124a through 124n, corresponding to the historical resource utilization of the server, collected from the server over time. For each server 110, historical data 124 may include the percentage utilization of the server's CPU, RAM, and/or disk storage over time. In certain embodiments, historical data 124 is collected from each server 110 at regular time intervals. For example, historical data 124 may be collected from each server 110 every second, minute, or any other appropriate time interval. In some embodiments, historical data 124 may be collected from each server 110 at irregular intervals.
As seen in
Processor 126 is any electronic circuitry, including, but not limited to microprocessors, application specific integrated circuits (ASIC), application specific instruction set processor (ASIP), and/or state machines, that communicatively couples to memory 128 and controls the operation of capacity optimizing tool 102. Processor 126 may be 8-bit, 16-bit, 32-bit, 64-bit or of any other suitable architecture. Processor 126 may include an arithmetic logic unit (ALU) for performing arithmetic and logic operations, processor registers that supply operands to the ALU and store the results of ALU operations, and a control unit that fetches instructions from memory and executes them by directing the coordinated operations of the ALU, registers and other components. Processor 126 may include other hardware and software that operates to control and process information. Processor 126 executes software stored on memory to perform any of the functions described herein. Processor 126 controls the operation and administration of capacity optimizing tool 102 by processing information received from network 108, device(s) 106, memory 128, and interface 132. Processor 126 may be a programmable logic device, a microcontroller, a microprocessor, any suitable processing device, or any suitable combination of the preceding. Processor 126 is not limited to a single processing device and may encompass multiple processing devices.
Memory 128 may store, either permanently or temporarily, data, operational software, or other information for processor 126. Memory 128 may include any one or a combination of volatile or non-volatile local or remote devices suitable for storing information. For example, memory 128 may include random access memory (RAM), read only memory (ROM), magnetic storage devices, optical storage devices, or any other suitable information storage device or a combination of these devices. The software represents any suitable set of instructions, logic, or code embodied in a computer-readable storage medium. For example, the software may be embodied in memory 128, a disk, a CD, or a flash drive. In particular embodiments, the software may include an application executable by processor 126 to perform one or more of the functions described herein.
In certain embodiments, memory 128 may also store a library of reconfiguration commands 130. The library of reconfiguration commands 130 may include commands that may be submitted to the host provider of servers 110, to reconfigure the computational resources associated with servers 110. For example, the library of reconfiguration commands 130 may include commands to consolidate servers 110, release computational resources associated with a server 110, and/or add computational resources to a server 110.
Interface 132 is configured to enable wired and/or wireless communications. Interface 132 is configured to communicate data between capacity optimizing tool 102 and other components of system 100 (e.g., other network devices, systems, or domain(s)). For example, interface 132 may comprise a WIFI interface, a local area network (LAN) interface, a wide area network (WAN) interface, a modem, a switch, or a router. Processor 126 is configured to send and receive data using interface 132. Interface 132 may be configured to use any suitable type of communication protocol as would be appreciated by one of ordinary skill in the art.
Modifications, additions, or omissions may be made to the systems described herein without departing from the scope of the invention. For example, system 100 may include any number of users 104, devices 106, networks 108, servers 110, and databases 122. The components may be integrated or separated. Moreover, the operations may be performed by more, fewer, or other components. Additionally, the operations may be performed using any suitable logic comprising software, hardware, and/or other logic.
As described above, capacity optimizing tool 102 identifies servers 110 operating at sub-optimal capacity, based on historical data 124 collected from the servers and stored in database 122. Capacity optimizing tool 102 then generates reconfiguration recommendations 142 for such servers.
As illustrated in
Data analyzer 134 performs a number of different data analysis techniques on historical data 124, the results of which may be used by recommendation algorithm 216 to identify those servers 110 operating at sub-optimal capacities. Examples of potential data analysis techniques that may be used by data analyzer 134 are presented below.
a. Maximum Forecasted Capacity
As a first example, in certain embodiments, and as illustrated in
In certain embodiments, data analyzer 134 may determine maximum predicted resource utilizations 204 by applying forecasting algorithm 202 to historical data 124 to determine a set of predicted resource utilizations covering the given future time interval and then identifying the maximum predicted resource utilization from the set of predicted resource utilizations. For example,
Forecasting algorithm 202 may be any suitable algorithm configured to predict values for the future resource utilization 304 of a server 110, based on the historical resource utilization 302 of the server, stored in historical data 124. For example, in certain embodiments, forecasting algorithm 202 may perform a time-series analysis of historical resource utilization 302. In some embodiments, forecasting algorithm 202 may decompose the historical resource utilization 302 into a trend component, a seasonal component, and a random component, and predict values for the future resource utilization 304, based on these individual components. Examples of specific forecasting algorithms 202 that may be used by data analyzer 134 include the Holt-Winters forecasting algorithm, the Autoregressive Integrated Moving Average (ARIMA) forecasting algorithm, and the Error, Trend, Seasonality (ETS) forecasting algorithm, among others. Additionally, forecasting algorithm 202 may be a machine learning algorithm trained to predict values for future resource utilization 304, based on historical resource utilization 302. For example, forecasting algorithm 202 may be a long short-term memory (LTSM) machine learning algorithm, a recurrent neural network (RNN), or any other suitable machine learning algorithm.
b. Capacity Confidence Interval
As another example of a potential data analysis technique that may be used by data analyzer 134 to analyze historical data 124, in certain embodiments, and as illustrated in
Data analyzer 134 may use confidence interval algorithm 206 to estimate confidence intervals for the historical resource utilizations of each server 110a through 110n. For example, data analyzer 134 may use confidence interval algorithm 206 to estimate confidence intervals for the historical CPU utilization of server 110a, the historical RAM utilization of server 110a, the historical disk storage utilization of server 110a, and/or the historical utilization of any other appropriate computational resource of server 110a. As an example,
Confidence interval algorithm 206 may be any suitable algorithm configured to estimate a confidence interval for the resource utilization of a server 110, where the confidence interval is associated with a given confidence level that the actual resource utilization of the server lies within the estimated range. For example, in certain embodiments, confidence interval algorithm 206 corresponds to a gradient descent algorithm. As another example, in certain embodiments, confidence interval algorithm 206 corresponds to a simplex method.
c. Capacity Spike Analysis
As a further example of a potential data analysis technique that may be used by data analyzer 134 to analyze historical data 124, in certain embodiments, and as illustrated in
In certain embodiments, spike detecting algorithm 210 may identify spikes in historical resource utilization data 124, by identifying the maximum value of the historical resource utilization data, and then determining (1) the number of times 212 that the historical resource utilization data exceeds a given percentage of the identified maximum and/or (2) the percentage of time 214 for which the historical resource utilization data exceeds the given percentage of the identified maximum. For each server 110a through 110n, spike detecting algorithm 210 may perform such analysis on any component of historical resource utilization data 124. For example, spike detecting algorithm 210 may perform such analysis on the historical CPU utilization data, the historical RAM utilization data, the historical disk storage utilization data, and/or the historical utilization data for any other suitable computational resource of server 110.
As illustrated in
After identifying maximum 314, spike detecting algorithm 210 next calculates a given percentage of maximum 314. This percentage may be any suitable percentage. For example, as illustrated in
Spike detecting algorithm 210 next determines the number of distinct times 212a that the historical CPU utilization data 312 exceeds the given percentage 316 of the maximum 314, and the percentage 214a of total time for which the historical CPU utilization data 312 exceeds the given percentage 316 of the maximum 314. For example, as illustrated in
In contrast to
In certain embodiments, recommendation generator 136 may use the determination that spike 324 in historical CPU utilization data 322 is likely anomalous, to generate reconfiguration recommendation 142. In some embodiments, data analyzer 134 may use the determination that spike 324 in historical CPU utilization data 322 is likely anomalous to modify historical CPU utilization data 322. For example, in response to determining that spike 324 is likely anomalous, data analyzer 134 may remove spike 324 from historical CPU utilization data 322. Data analyzer 134 may then apply forecasting algorithm 202 and confidence interval algorithm 206 to the modified historical CPU utilization data 322 to obtain potentially more accurate values for the maximum predicted CPU utilization 204, generated by forecasting algorithm 202, and the upper bound 208 of the confidence interval, generated by confidence interval algorithm 206, as compared with the values obtained from the unmodified historical CPU utilization data.
Data analyzer 134 may be a sub-processing component of capacity optimizing tool 102. For example, data analyzer 134 may include non-transitory computer readable instructions stored in memory 128 and executed by processor 126. For example, memory 128 may store forecasting algorithm 202, confidence interval algorithm 206, spike detecting algorithm 210, and/or any other suitable algorithm used by data analyzer 134 to analyze historical data 124. An example algorithm for data analyzer 134 is as follows: (1) access historical data 124; (2) apply spike detecting algorithm 210 to historical data 124 to identify a number 212 of spikes in historical data 124 and/or a percentage duration 214 of the spikes; (3) if spike detecting algorithm determines that the spikes are likely anomalous, remove the spikes from historical data 124; (4) apply forecasting algorithm 202 to historical data 124 to determine maximum predicted resource utilization 204; (5) apply confidence interval algorithm 206 to historical data 124 to determine a confidence interval for historical data 124, with upper bound 208; and (6) provide recommendation generator 136 with maximum predicted utilization 204 and upper bound 208.
d. Recommendation Generation
As illustrated in
As an example, in certain embodiments, recommendation algorithm 216 may generate reconfiguration recommendations 142 based on the maximum predicted resource utilizations 204. As described above, maximum predicted resource utilizations 204 correspond to predictions of the maximum resource utilization of servers 110 over a given time interval in the future. For example, for each server 110a through 110n, maximum predicted resource utilizations 204 may include a maximum predicted CPU utilization over the future time interval, a maximum predicted RAM utilization over the future time interval, a maximum predicted disk storage utilization over the future time interval, and/or a maximum predicted utilization of any other suitable computation resource over the future time interval. Based on these maximum predicted future utilizations, recommendation algorithm may determine that one or more servers 110 are underutilized or overutilized. For example, recommendation algorithm 216 may determine that a server 110 is underutilized if a maximum predicted utilization of one or more of its computational resources is below a given threshold. Similarly, recommendation algorithm 216 may determine that a server 110 is overutilized if a maximum predicted utilization of one or more of its computational resources is above a given threshold.
As another example, in certain embodiments, recommendation algorithm 216 may generate reconfiguration recommendations 142 based on the upper bounds 206 of the confidence intervals generated by confidence interval algorithm 206. As described above, upper bound 206 corresponds to an estimate of the maximum possible value of the historical resource utilization of a given server 110 (e.g., if the confidence level associated with the confidence interval is set at 99.997%, there is a 99.997% likelihood that the maximum historical resource utilization lies below upper bound 208). For example, for each server 110a through 110n, upper bound 206 may correspond to an estimate of the maximum possible value of the historical CPU utilization, the historical RAM utilization, the historical disk storage utilization, and/or the historical utilization of any other appropriate computational resource. Based on these estimates of the maximum possible historical resource utilizations, recommendation algorithm may determine that one or more servers 110 are underutilized or overutilized. For example, recommendation algorithm 216 may determine that a server 110 is underutilized if an estimate of the maximum possible historical value of one or more of its computational resources is below a given threshold. Similarly, recommendation algorithm 216 may determine that a server 110 is overutilized if an estimate of the maximum possible historical value of one or more of its computation resources is above a given threshold.
As another example, in certain embodiments, recommendation algorithm 216 may generate reconfiguration recommendations 142 based on both the maximum predicted resource utilizations 204, generated by forecasting algorithm 202, and the upper bounds 208, generated by confidence interval algorithm 206. For example, recommendation algorithm 216 may determine whether a server 110 is underutilized/overutilized based on the larger of the maximum predicted resource utilizations 204 of the server and the upper bounds 208 estimated for the server, the smaller of the maximum predicted resource utilizations 204 of the server and the upper bounds 208 estimated for the server, the average of the maximum predicted resource utilizations 204 of the server and the upper bounds 208 estimated for the server, or any other suitable combination of the maximum predicted resource utilizations 204 of the server and the upper bounds 208 estimated for the server. As an example, recommendation algorithm 216 may determine that a given server 110 is underutilized if both a maximum predicted utilization of one or more of its computational resources is below a given threshold and the upper bound 208 of the confidence interval calculated for the one or more computational resources is also below the given threshold. As another example, recommendation algorithm 216 may determine that a given server 110 is overutilized if either (or both) of the maximum predicted resource utilization 204 of the server and the upper bound 208 estimated for the server exceed a given threshold.
In certain embodiments, recommendation algorithm 216 may take into account the spikes in historical resource utilization data 124, identified by spike detecting algorithm 210, in determining whether a server 110 is overutilized or underutilized. For example, recommendation algorithm 216 may determine that a given server 110 is likely not overutilized, despite the fact that one or both of the maximum predicted resource utilizations 204 of the server and the upper bounds 208 estimated for the server exceed a given threshold, where spike detecting algorithm 210 has detected one or more anomalous spikes in the historical data, which may be leading to overestimates of the maximum predicted resource utilizations 204 and/or upper bounds 208.
In some embodiments, recommendation algorithm 216 may use a machine learning algorithm to determine whether a given server 110 is underutilized or overutilized. Such a machine learning algorithm may be trained to identify underutilized and/or overutilized servers 110 based on the output from data analyzer 134.
In response to a determination that one or more servers 110 are underutilized, recommendation algorithm 216 may generate recommendations 142 to consolidate two or more of the identified servers and/or release computational resources associated with one or more of the identified servers. As an example, consider the maximum predicted/estimated resource utilizations for servers 110a through 110d, depicted in Table 1. These maximum predicted/estimated resource utilizations may correspond to maximum predicted utilizations 204, generated by forecasting algorithm 202, estimated upper bounds 208 of confidence intervals generated by confidence interval algorithm 206, or some combination of the maximum predicted utilizations 204 and upper bounds 208 (e.g., the larger of the two values, the smaller of the two values, an average of the two values, etc.).
TABLE 1
Example Maximum Predicted Resource Utilizations
Max
Max
Predicted/
Predicted/
Available
Available
Estimated
Estimated
CPU
RAM
CPU utilization
RAM utilization
Server
(cores)
(GB)
(%)
(cores)
(%)
(GB)
110a
1
2
40%
0.4
20%
0.4
110b
4
8
20%
0.8
10%
0.8
110c
2
4
105%
2.1
90%
3.6
110d
1
4
65%
0.65
70%
2.8
Based on the maximum predicted/estimated resource utilizations listed in Table 1, recommendation algorithm 216 may determine that first server 110a and second server 110b are underutilized and that third server 110c is overutilized. For example, recommendation algorithm 216 may determine that first server 110a and second server 110b are underutilized because they are associated with maximum predicted/estimated utilizations that are below 50%. On the other hand, recommendation algorithm 216 may determine that third server 110c is overutilized because it is associated with a maximum predicted/estimated CPU utilization that is over 100%. Accordingly, recommendation algorithm 216 may generate recommendations 142 to consolidate first server 110a with second server 110b and/or release computational resources associated with first server 110a and/or second server 110b. Recommendation algorithm 216 may also generate a recommendation to add computational resources to third server 110c. As an example, recommendation algorithm 216 may generate recommendations 142 suggesting that (1) first server 110a should be consolidated with second server 110b (e.g., the applications running on first server 110a should be transferred to second server 110b, and the computation resources associated with first server 110a should be released) and (2) the computational resources gained by consolidating first server 110a and second server 110b should be transferred to third server 110c (e.g., the CPU core and 2 GB of memory associated with first server 110a should be transferred to third server 110a). As another example, recommendation algorithm 216 may generate recommendations 142 suggesting that (1) 2 CPU cores and 6 GB of RAM should be released from second server 110b and (2) at least one of the released CPU cores should be transferred to third server 110c.
Recommendation generator 126 may use any suitable algorithm 216 to generate recommendations 142. For example, in certain embodiments, recommendation generator 126 may use a bin packing algorithm to identify two or more underutilized servers 110 to consolidate into one or more servers.
In certain embodiments, recommendation generator 136 may transmit recommendations 142 to system administrators 104. Such system administrators may then choose whether or not to implement the recommendations. In some embodiments, recommendation generator 136 may automatically implement recommendations 142. For example, recommendation generator 136 may generate reconfiguration commands 144 to implement recommendations 142. Reconfiguration commands 144 may be executed on servers 110 to reconfigure the servers. For example, in certain embodiments, reconfiguration commands 144 may be transmitted to the hosting provider of servers 110, for execution by the hosting provider. In certain embodiments, reconfiguration commands 144 may be generated based on a library of available reconfiguration commands 130 stored in memory 128. For example, library 130 may include commands to transfer applications from a first server 110a to a second server 110b, to release computational resources associated with a given server 110, and/or to add computational resources to a given server 110. In some embodiments, recommendation generator 136 may generate reconfiguration commands 144 based on an analysis of the current status of the servers. For example, in certain embodiments in which recommendations 142 include a recommendation to consolidate first server 110a with second server 110b, recommendation generator 136 may first identify those applications 112 currently installed on and/or running on first server 110a. Recommendation generator 136 may then generate reconfiguration commands 144 that include commands to transfer those applications currently installed and running on first server 110a to second server 110b and then to release the computational resources associated with first server 110a.
Recommendation generator 136 may be a sub-processing component of capacity optimizing tool 102. For example, recommendation generator 136 may include non-transitory computer readable instructions stored in memory 128 and executed by processor 126. An example algorithm for recommendation generator 136 is as follows: (1) receive output data from data analyzer 134, where the output data includes maximum predicted resource utilizations 204 generated by forecasting algorithm 202, upper bounds 208 generated by confidence interval algorithm 206, and/or the number of spikes 212 and the duration 214 of such spikes, determined by spike detecting algorithm 210; (2) identify, based on the output data received from data analyzer 134, whether any of servers 110 are underutilized or overutilized; (3) generate recommendation 142, where recommendation 142 includes recommendations to consolidate two or more underutilized servers, release computational resources associated with underutilized servers, and/or add computational resources to overutilized servers; and (4) transmit recommendation 142 to system administrator 104.
III. Method of Identifying Sub-Optimal Server Capacity
If, in step 408, capacity optimizing tool 102 determines that the duration of the spike is less than the threshold, in step 410 capacity optimizing tool 102 modifies the selected component of historical data 124a. For example, capacity optimizing tool 102 may remove the spike from historical data 124a.
In step 412, capacity optimizing tool 102 uses forecasting algorithm 202 to predict a maximum future resource utilization 204 for server 110a, based on the selected component of historical data 124a. In step 414, capacity optimizing tool 102 uses confidence interval algorithm 206 to generate a confidence interval for the selected component of historical data 124a, where the confidence interval includes upper bound 208.
In step 416, capacity optimizing tool 102 determines whether historical data 124a includes any additional components associated with computational resources belonging to server 110a. If, in step 416, capacity optimizing tool 102 determines that historical data 124a includes one or more additional components, method 400 returns to step 404. If, in step 416 capacity optimizing tool 102 determines that historical data 124a does not include any additional components, method 400 proceeds to step 418.
In step 418 capacity optimizing tool 102 generates reconfiguration recommendation 142, based on the maximum future resource utilizations 204 generated by forecasting algorithm 202, the upper bounds 208 determined by confidence interval algorithm 206, and/or the spikes identified by spike detecting algorithm 210. In step 420 capacity optimizing tool 102 determines whether recommendation 142 includes a recommendation to consolidate server 110a with another server 110 and/or to release a portion of the computational resources associated with server 110a.
If, in step 420 capacity optimizing tool 102 determines that recommendation 142 includes a recommendation to consolidate server 110a with another server 110 and/or to release a portion of the computational resources associated with server 110a, in step 422 capacity optimizing tool 102 determines whether server 110a is a physical, on-premises server or a cloud-based server. If, in step 422 capacity optimizing tool 102 determines that server 110a is a physical, on premises server, in step 424 capacity optimizing tool 102 sends recommendation 142 to a user device 106 associated with a system administrator 104. If, in step 422 capacity optimizing tool 102 determines that server 110a is a cloud-based server, in step 426 consolidates server 110a with another server 110 and/or releases a portion of the computational resources associated with server 110a, according to recommendation 142. For example, capacity optimizing tool 102 may generate reconfiguration commands 144 and execute reconfiguration commands 144 on server 110a.
Modifications, additions, or omissions may be made to method 400 depicted in
Although the present disclosure includes several embodiments, a myriad of changes, variations, alterations, transformations, and modifications may be suggested to one skilled in the art, and it is intended that the present disclosure encompass such changes, variations, alterations, transformations, and modifications as falling within the scope of the appended claims.
Ramasamy, Suki, Ganesan, Mahesh, Chaudhari, Vipul, Kasyap, Srinivasan Bhaskaran, Sharma, Debendra
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
10043255, | Feb 20 2018 | Capital One Services, LLC | Utilizing a machine learning model to automatically visually validate a user interface for multiple platforms |
10193811, | Jul 05 2017 | Juniper Networks, Inc. | Flow distribution using telemetry and machine learning techniques |
10210751, | Aug 04 2017 | Verizon Patent and Licensing Inc | Identification of traffic control mechanisms using machine learning |
10270644, | May 17 2018 | Accenture Global Solutions Limited | Framework for intelligent automated operations for network, service and customer experience management |
10277524, | Feb 23 2018 | Capital One Services, LLC | Monitoring data streams and scaling computing resources based on the data streams |
10332261, | Apr 26 2018 | Capital One Services, LLC | Generating synthetic images as training dataset for a machine learning network |
10341253, | Sep 19 2016 | Accenture Global Solutions Limited | Automatic consolidation of network resources |
10353863, | Apr 11 2018 | Capital One Services, LLC | Utilizing machine learning to determine data storage pruning parameters |
10410056, | Apr 16 2019 | Capital One Services, LLC | Computer vision based asset evaluation |
10419607, | Feb 06 2019 | Capital One Services, LLC; CAPITAL ONE SERVICES LLC | Identifying a media item to present to a user device via a communication session |
10423403, | Nov 29 2018 | Capital One Services, LLC | Utilizing a machine learning model to predict metrics for an application development process |
10439891, | Apr 08 2014 | KYNDRYL, INC | Hyperparameter and network topology selection in network demand forecasting |
10445738, | Nov 13 2018 | Capital One Services, LLC | Detecting a transaction volume anomaly |
10452441, | Oct 15 2018 | Accenture Global Solutions Limited | Determining an allocation of computing resources for a job |
10468014, | Feb 06 2019 | Capital One Services, LLC | Updating a speech generation setting based on user speech |
10474981, | Sep 05 2018 | Capital One Services, LLC | Utilizing machine learning to generate augmented reality delivery instructions for delivering an item to a location |
10484301, | Sep 30 2016 | NUTANIX INC | Dynamic resource distribution using periodicity-aware predictive modeling |
10489512, | Feb 14 2018 | Capital One Services, LLC | Utilizing machine learning models to identify insights in a document |
10490191, | Jan 31 2019 | Capital One Services, LLC | Interacting with a user device to provide automated testing of a customer service representative |
10521419, | Aug 31 2018 | Accenture Global Solutions Limited | Identifying an issue associated with data |
10525850, | Jul 31 2019 | Capital One Services, LLC | Automatic vehicle configuration based on sensor data |
10565475, | Apr 24 2018 | Accenture Global Solutions Limited | Generating a machine learning model for objects based on augmenting the objects with physical properties |
7171668, | Dec 17 2001 | ServiceNow, Inc | Automatic data interpretation and implementation using performance capacity management framework over many servers |
7305471, | Apr 14 2000 | Microsoft Technology Licensing, LLC | Capacity planning for server resources |
7403886, | May 23 2000 | Microsoft Technology Licensing, LLC | Load stimulation tool for server resource capacity planning |
8055951, | Apr 10 2007 | International Business Machines Corporation | System, method and computer program product for evaluating a virtual machine |
9213565, | Jun 28 2013 | VMWARE, INC | Methods and systems for mining datacenter telemetry data |
9363190, | Jul 31 2013 | The University of Melbourne; MANJRASOFT PTY LTD | System, method and computer program product for energy-efficient and service level agreement (SLA)-based management of data centers for cloud computing |
9372734, | Aug 27 2013 | Bank of America Corporation | Outage window scheduler tool |
9565129, | Sep 30 2014 | AIRBNB, INC | Resource provisioning planning for enterprise migration and automated application discovery |
9910696, | Apr 23 2015 | International Business Machines Corporation | Machine learning for virtual machine migration plan generation |
20070233866, | |||
20120053925, | |||
20150039764, | |||
20150052250, | |||
20180024860, | |||
20180114136, | |||
20180165109, | |||
20180349844, | |||
20180373579, | |||
20190036789, | |||
20190147371, | |||
20190163517, | |||
20190182120, | |||
20190250949, | |||
20190306736, | |||
20190317826, | |||
20190370083, | |||
20190386889, | |||
20200065213, | |||
20200104230, | |||
20200183758, | |||
20210042820, | |||
20210182387, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Mar 09 2020 | RAMASAMY, SUKI | Bank of America Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 052095 | /0966 | |
Mar 09 2020 | GANESAN, MAHESH | Bank of America Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 052095 | /0966 | |
Mar 09 2020 | KASYAP, SRINIVASAN BHASKARAN | Bank of America Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 052095 | /0966 | |
Mar 10 2020 | SHARMA, DEBENDRA | Bank of America Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 052095 | /0966 | |
Mar 11 2020 | CHAUDHARI, VIPUL | Bank of America Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 052095 | /0966 | |
Mar 12 2020 | Bank of America Corporation | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
Mar 12 2020 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Date | Maintenance Schedule |
Dec 13 2025 | 4 years fee payment window open |
Jun 13 2026 | 6 months grace period start (w surcharge) |
Dec 13 2026 | patent expiry (for year 4) |
Dec 13 2028 | 2 years to revive unintentionally abandoned end. (for year 4) |
Dec 13 2029 | 8 years fee payment window open |
Jun 13 2030 | 6 months grace period start (w surcharge) |
Dec 13 2030 | patent expiry (for year 8) |
Dec 13 2032 | 2 years to revive unintentionally abandoned end. (for year 8) |
Dec 13 2033 | 12 years fee payment window open |
Jun 13 2034 | 6 months grace period start (w surcharge) |
Dec 13 2034 | patent expiry (for year 12) |
Dec 13 2036 | 2 years to revive unintentionally abandoned end. (for year 12) |