A method of controlling power at a central processing unit is disclosed. The method may include moving to a higher cpu frequency after a transient performance deadline has expired, entering an idle state, and resetting the transient performance deadline based on an effective transient budget.

Patent: 8909962
Priority: Dec 16 2009
Filed: Nov 11 2010
Issued: Dec 09 2014
Expiry: Jan 03 2032
Extension: 418 days
Assg.orig Entity: Large
1. A method of controlling power at a central processing unit (cpu) of a portable computing device, the method comprising:
determining a first time value by computing an amount of time required to perform tasks scheduled for execution in the cpu;
determining a second time value by computing an amount of time that the cpu may remain in a busy state after the first time value without impacting a responsiveness of the portable computing device;
computing a transient performance deadline as a sum of the first and second time values;
commencing the performance of the scheduled tasks and a transient workload by the cpu;
determining whether the transient performance deadline for the cpu has expired;
moving to a higher cpu frequency level in response to determining that the transient performance deadline has expired and prior to the cpu entering an idle state;
entering the idle state; and
resetting the transient performance deadline based on an effective transient budget while in the idle state.
21. A computing device, comprising:
a memory; and
a processor coupled to the memory, wherein the processor is configured with processor executable instructions to perform operations comprising:
determining a first time value by computing an amount of time required to perform tasks scheduled for execution in the processor;
determining a second time value by computing an amount of time that the processor may remain in a busy state after the first time value without impacting a responsiveness of the computing device;
computing a transient performance deadline as a sum of the first and second time values;
commencing the performance of the scheduled tasks and a transient workload;
determining whether the transient performance deadline has expired;
moving to a higher processor frequency level in response to determining that the transient performance deadline has expired and prior to entering an idle state;
entering the idle state; and
resetting the transient performance deadline based on an effective transient budget while in the idle state.
31. A non-transitory computer-readable storage medium having stored thereon computer-executable instructions configured to cause a processor to perform operations comprising:
determining a first time value by computing an amount of time required to perform tasks scheduled for execution in a central processing unit (cpu) of a computing device;
determining a second time value by computing an amount of time that the cpu may remain in a busy state after the first time value without impacting a responsiveness of the computing device;
computing a transient performance deadline as a sum of the first and second time values;
commencing the performance of the scheduled tasks and a transient workload in the cpu;
determining whether the transient performance deadline for the cpu has expired;
moving to a higher cpu frequency level in response to determining that the transient performance deadline has expired and prior to the cpu entering an idle state;
entering the idle state; and
resetting the transient performance deadline based on an effective transient budget while in the idle state.
11. A portable computing device, comprising:
means for determining a first time value comprising means for computing an amount of time required to perform tasks scheduled for execution in a central processing unit (cpu) of the portable computing device;
means for determining a second time value comprising means for computing an amount of time that the cpu may remain in a busy state after the first time value without impacting a responsiveness of the portable computing device;
means for computing a transient performance deadline as a sum of the first and second time values;
means for commencing the performance of the scheduled tasks and a transient workload by the cpu;
means for determining whether the transient performance deadline has expired;
means for causing the cpu to move to a higher cpu frequency level in response to determining that the transient performance deadline has expired and prior to the cpu entering an idle state;
means for causing the cpu to enter the idle state; and
means for resetting the transient performance deadline based on an effective transient budget while the cpu is in the idle state.
2. The method of claim 1, further comprising:
exiting the idle state.
3. The method of claim 2, further comprising:
determining whether the cpu frequency has increased to a maximum cpu frequency.
4. The method of claim 3, further comprising:
rescheduling a timer in response to determining that the cpu frequency has not increased to a maximum cpu frequency.
5. The method of claim 1, further comprising:
determining an upcoming cpu frequency level for an upcoming busy cycle.
6. The method of claim 5, further comprising:
determining the effective transient budget by multiplying the transient performance deadline by a next higher cpu frequency level and dividing a result by a difference between the next higher cpu frequency level and the upcoming cpu frequency level, wherein the next higher cpu frequency level is one level higher than the upcoming cpu frequency level.
7. The method of claim 6, further comprising:
adding an overhead to the effective transient budget.
8. The method of claim 7, wherein adding an overhead to the effective transient budget comprises adding a clock switch overhead, a clock scheduling overhead, a voltage change overhead, or a combination thereof.
9. The method of claim 8, wherein resetting the transient performance deadline based on an effective transient budget while in the idle state comprises setting the transient performance deadline equal to an end idle time plus the effective transient budget and the overhead.
10. The method of claim 8, wherein resetting the transient performance deadline based on an effective transient budget while in the idle state comprises setting the transient performance deadline equal to a current time plus the effective transient budget and the overhead.
12. The portable computing device of claim 11, further comprising:
means for causing the cpu to exit the idle state.
13. The portable computing device of claim 12, further comprising:
means for determining whether a cpu frequency has increased to a maximum cpu frequency.
14. The portable computing device of claim 13, further comprising:
means for rescheduling a timer in response to determining that the cpu frequency has not increased to a maximum cpu frequency.
15. The portable computing device of claim 11, further comprising:
means for determining an upcoming cpu frequency level for an upcoming busy cycle.
16. The portable computing device of claim 15, further comprising means for determining the effective transient budget by multiplying the transient performance deadline by a next higher cpu frequency level and dividing a result by a difference between the next higher cpu frequency level and the upcoming cpu frequency level, wherein the next higher cpu frequency level is one level higher than the upcoming cpu frequency level.
17. The portable computing device of claim 16, further comprising:
means for adding an overhead to the effective transient budget.
18. The portable computing device of claim 17, wherein means for adding an overhead to the effective transient budget comprises means for adding a clock switch overhead, a clock scheduling overhead, a voltage change overhead, or a combination thereof.
19. The portable computing device of claim 18, wherein means for resetting the transient performance deadline based on an effective transient budget while in the idle state comprises means for setting the transient performance deadline equal to an end idle time plus the effective transient budget and the overhead.
20. The portable computing device of claim 18, wherein means for resetting the transient performance deadline based on an effective transient budget while in the idle state comprises means for setting the transient performance deadline equal to a current time plus the effective transient budget and the overhead.
22. The computing device of claim 21, wherein the processor is configured with processor-executable instructions to perform operations further comprising:
exiting the idle state.
23. The computing device of claim 22, wherein the processor is configured with processor-executable instructions to perform operations further comprising:
determining whether a processor frequency has increased to a maximum processor frequency.
24. The computing device of claim 23, wherein the processor is configured with processor-executable instructions to perform operations further comprising:
rescheduling a timer in response to determining that the processor frequency has not increased to a maximum processor frequency.
25. The computing device of claim 21, wherein the processor is configured with processor-executable instructions to perform operations further comprising:
determining an upcoming processor frequency level for an upcoming busy cycle.
26. The computing device of claim 25, wherein the processor is configured with processor-executable instructions to perform operations such that the effective transient budget is determined by multiplying the transient performance deadline by a next higher processor frequency level and dividing a result by a difference between the next higher processor frequency level and the upcoming frequency level, wherein the next higher processor frequency level is one level higher than the upcoming processor frequency level.
27. The computing device of claim 26, wherein the processor is configured with processor-executable instructions to perform operations further comprising:
adding an overhead to the effective transient budget.
28. The computing device of claim 27, wherein the processor is configured with processor-executable instructions to perform operations such that adding the overhead comprises adding a clock switch overhead, a clock scheduling overhead, a voltage change overhead, or a combination thereof.
29. The computing device of claim 28, wherein the processor is configured with processor-executable instructions to perform operations such that resetting the transient performance deadline based on the effective transient budget while in the idle state comprises setting the transient performance deadline equal to an end idle time plus the effective transient budget and the overhead.
30. The computing device of claim 28, wherein the processor is configured with processor-executable instructions to perform operations such that resetting the transient performance deadline based on the effective transient budget while in the idle state comprises setting the transient performance deadline equal to a current time plus the effective transient budget and the overhead.
32. The non-transitory computer-readable storage medium of claim 31, wherein the stored computer-executable instructions are configured to cause the processor to perform operations further comprising:
exiting the idle state.
33. The non-transitory computer-readable storage medium of claim 32, wherein the stored computer-executable instructions are configured to cause the processor to perform operations further comprising:
determining whether a cpu frequency has increased to a maximum cpu frequency.
34. The non-transitory computer-readable storage medium of claim 33, wherein the stored computer-executable instructions are configured to cause the processor to perform operations further comprising:
rescheduling a timer in response to determining that the cpu frequency has not increased to a maximum cpu frequency.
35. The non-transitory computer-readable storage medium of claim 31, wherein the stored computer-executable instructions are configured to cause the processor to perform operations further comprising:
determining an upcoming cpu frequency level for an upcoming busy cycle.
36. The non-transitory computer-readable storage medium of claim 35, wherein the stored computer-executable instructions are configured to cause the processor to perform operations such that the effective transient budget is determined by multiplying the transient performance deadline by a next higher cpu frequency level and dividing a result by a difference between the next higher cpu frequency level and the upcoming cpu frequency level, wherein the next higher cpu frequency level is one level higher than the upcoming cpu frequency level.
37. The non-transitory computer-readable storage medium of claim 36, wherein the stored computer-executable instructions are configured to cause the processor to perform operations further comprising:
adding an overhead to the effective transient budget.
38. The non-transitory computer-readable storage medium of claim 37, wherein the stored computer-executable instructions are configured to cause the processor to perform operations such that adding the overhead comprises adding a clock switch overhead, a clock scheduling overhead, a voltage change overhead, or a combination thereof.
39. The non-transitory computer-readable storage medium of claim 38, wherein the stored computer-executable instructions are configured to cause the processor to perform operations such that resetting the transient performance deadline based on the effective transient budget comprises setting the transient performance deadline equal to an end idle time plus the effective transient budget and the overhead.
40. The non-transitory computer-readable storage medium of claim 38, wherein the stored computer-executable instructions are configured to cause the processor to perform operations such that resetting the transient performance deadline based on the effective transient budget comprises setting the transient performance deadline equal to a current time plus the effective transient budget and the overhead.

The present application claims priority to U.S. Provisional Patent Application Ser. No. 61/286,991, entitled SYSTEM AND METHOD OF DYNAMICALLY CONTROLLING POWER IN A CENTRAL PROCESSING UNIT, filed on Dec. 16, 2009, the contents of which are fully incorporated by reference.

The present application is related to, and incorporates by reference, U.S. patent application Ser. No. 12/944,140, entitled SYSTEM AND METHOD FOR CONTROLLING CENTRAL PROCESSING UNIT POWER BASED ON INFERRED WORKLOAD PARALLELISM, by Rychlik et al., filed concurrently. The present application is related to, and incorporates by reference, U.S. patent application Ser. No. 12/944,202, entitled SYSTEM AND METHOD FOR CONTROLLING CENTRAL PROCESSING UNIT POWER IN A VIRTUALIZED SYSTEM, by Rychlik et al., filed concurrently. The present application is related to, and incorporates by reference, U.S. patent application Ser. No. 12/944,321, entitled SYSTEM AND METHOD FOR ASYNCHRONOUSLY AND INDEPENDENTLY CONTROLLING CORE CLOCKS IN A MULTICORE CENTRAL PROCESSING UNIT, by Rychlik et al., filed concurrently. The present application is related to, and incorporates by reference, U.S. patent application Ser. No. 12/944,378, entitled SYSTEM AND METHOD FOR CONTROLLING CENTRAL PROCESSING UNIT POWER WITH REDUCED FREQUENCY OSCILLATIONS, by Thomson et al., filed concurrently. The present application is related to, and incorporates by reference, U.S. patent application Ser. No. 12/944,561, entitled SYSTEM AND METHOD FOR CONTROLLING CENTRAL PROCESSING UNIT POWER WITH GUARANTEED STEADY STATE DEADLINES, by Thomson et al., filed concurrently. The present application is related to, and incorporates by reference, U.S. patent application Ser. No. 12/944,564, entitled SYSTEM AND METHOD FOR DYNAMICALLY CONTROLLING A PLURALITY OF CORES IN A MULTICORE CENTRAL PROCESSING UNIT BASED ON TEMPERATURE, by Sur et al., filed concurrently.

Portable computing devices (PCDs) are ubiquitous. These devices may include cellular telephones, portable digital assistants (PDAs), portable game consoles, palmtop computers, and other portable electronic devices. In addition to the primary function of these devices, many include peripheral functions. For example, a cellular telephone may include the primary function of making cellular telephone calls and the peripheral functions of a still camera, a video camera, global positioning system (GPS) navigation, web browsing, sending and receiving emails, sending and receiving text messages, push-to-talk capabilities, etc. As the functionality of such a device increases, the computing or processing power required to support such functionality also increases. Further, as the computing power increases, there exists a greater need to effectively manage the processor, or processors, that provide the computing power.

Accordingly, what is needed is an improved method of controlling power within a multicore CPU.

In the figures, like reference numerals refer to like parts throughout the various views unless otherwise indicated.

FIG. 1 is a front plan view of a first aspect of a portable computing device (PCD) in a closed position;

FIG. 2 is a front plan view of the first aspect of a PCD in an open position;

FIG. 3 is a block diagram of a second aspect of a PCD;

FIG. 4 is a block diagram of a processing system;

FIG. 5 is a flowchart illustrating a first aspect of a method of dynamically controlling power within a CPU;

FIG. 6 is a flowchart illustrating a first portion of a second aspect of a method of dynamically controlling power within a CPU;

FIG. 7 is a flowchart illustrating a second portion of the second aspect of a method of dynamically controlling power within a multicore CPU;

FIG. 8 is an exemplary graph showing the dynamic clock and voltage scaling (DCVS) controlled CPU frequency plotted over time; and

FIG. 9 is an exemplary graph showing effective transient response times for various performance levels.

The word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any aspect described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects.

In this description, the term “application” may also include files having executable content, such as: object code, scripts, byte code, markup language files, and patches. In addition, an “application” referred to herein, may also include files that are not executable in nature, such as documents that may need to be opened or other data files that need to be accessed.

The term “content” may also include files having executable content, such as: object code, scripts, byte code, markup language files, and patches. In addition, “content” referred to herein, may also include files that are not executable in nature, such as documents that may need to be opened or other data files that need to be accessed.

As used in this description, the terms “component,” “database,” “module,” “system,” and the like are intended to refer to a computer-related entity, either hardware, firmware, a combination of hardware and software, software, or software in execution. For example, a component may be, but is not limited to being, a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer. By way of illustration, both an application running on a computing device and the computing device may be a component. One or more components may reside within a process and/or thread of execution, and a component may be localized on one computer and/or distributed between two or more computers. In addition, these components may execute from various computer readable media having various data structures stored thereon. The components may communicate by way of local and/or remote processes such as in accordance with a signal having one or more data packets (e.g., data from one component interacting with another component in a local system, distributed system, and/or across a network such as the Internet with other systems by way of the signal).

Referring initially to FIG. 1 and FIG. 2, an exemplary portable computing device (PCD) is shown and is generally designated 100. As shown, the PCD 100 may include a housing 102. The housing 102 may include an upper housing portion 104 and a lower housing portion 106. FIG. 1 shows that the upper housing portion 104 may include a display 108. In a particular aspect, the display 108 may be a touch screen display. The upper housing portion 104 may also include a trackball input device 110. Further, as shown in FIG. 1, the upper housing portion 104 may include a power on button 112 and a power off button 114. As shown in FIG. 1, the upper housing portion 104 of the PCD 100 may include a plurality of indicator lights 116 and a speaker 118. Each indicator light 116 may be a light emitting diode (LED).

In a particular aspect, as depicted in FIG. 2, the upper housing portion 104 is movable relative to the lower housing portion 106. Specifically, the upper housing portion 104 may be slidable relative to the lower housing portion 106. As shown in FIG. 2, the lower housing portion 106 may include a multi-button keyboard 120. In a particular aspect, the multi-button keyboard 120 may be a standard QWERTY keyboard. The multi-button keyboard 120 may be revealed when the upper housing portion 104 is moved relative to the lower housing portion 106. FIG. 2 further illustrates that the PCD 100 may include a reset button 122 on the lower housing portion 106.

Referring to FIG. 3, an exemplary, non-limiting aspect of a portable computing device (PCD) is shown and is generally designated 320. As shown, the PCD 320 includes an on-chip system 322 that includes a multicore CPU 324. The multicore CPU 324 may include a zeroth core 325, a first core 326, and an Nth core 327.

As illustrated in FIG. 3, a display controller 328 and a touch screen controller 330 are coupled to the multicore CPU 324. In turn, a display/touchscreen 332 external to the on-chip system 322 is coupled to the display controller 328 and the touch screen controller 330.

FIG. 3 further indicates that a video encoder 334, e.g., a phase alternating line (PAL) encoder, a séquentiel couleur à mémoire (SECAM) encoder, or a national television system(s) committee (NTSC) encoder, is coupled to the multicore CPU 324. Further, a video amplifier 336 is coupled to the video encoder 334 and the display/touchscreen 332. Also, a video port 338 is coupled to the video amplifier 336. As depicted in FIG. 3, a universal serial bus (USB) controller 340 is coupled to the multicore CPU 324. Also, a USB port 342 is coupled to the USB controller 340. A memory 344 and a subscriber identity module (SIM) card 346 may also be coupled to the multicore CPU 324. Further, as shown in FIG. 3, a digital camera 348 may be coupled to the multicore CPU 324. In an exemplary aspect, the digital camera 348 is a charge-coupled device (CCD) camera or a complementary metal-oxide semiconductor (CMOS) camera.

As further illustrated in FIG. 3, a stereo audio CODEC 350 may be coupled to the multicore CPU 324. Moreover, an audio amplifier 352 may be coupled to the stereo audio CODEC 350. In an exemplary aspect, a first stereo speaker 354 and a second stereo speaker 356 are coupled to the audio amplifier 352. FIG. 3 shows that a microphone amplifier 358 may also be coupled to the stereo audio CODEC 350. Additionally, a microphone 360 may be coupled to the microphone amplifier 358. In a particular aspect, a frequency modulation (FM) radio tuner 362 may be coupled to the stereo audio CODEC 350. Also, an FM antenna 364 is coupled to the FM radio tuner 362. Further, stereo headphones 366 may be coupled to the stereo audio CODEC 350.

FIG. 3 further indicates that a radio frequency (RF) transceiver 368 may be coupled to the multicore CPU 324. An RF switch 370 may be coupled to the RF transceiver 368 and an RF antenna 372. As shown in FIG. 3, a keypad 374 may be coupled to the multicore CPU 324. Also, a mono headset with a microphone 376 may be coupled to the multicore CPU 324. Further, a vibrator device 378 may be coupled to the multicore CPU 324. FIG. 3 also shows that a power supply 380 may be coupled to the on-chip system 322. In a particular aspect, the power supply 380 is a direct current (DC) power supply that provides power to the various components of the PCD 320 that require power. Further, in a particular aspect, the power supply is a rechargeable DC battery or a DC power supply that is derived from an alternating current (AC) to DC transformer that is connected to an AC power source.

FIG. 3 further indicates that the PCD 320 may also include a network card 388 that may be used to access a data network, e.g., a local area network, a personal area network, or any other network. The network card 388 may be a Bluetooth network card, a WiFi network card, a personal area network (PAN) card, a personal area network ultra-low-power technology (PeANUT) network card, or any other network card well known in the art. Further, the network card 388 may be incorporated into a chip, i.e., the network card 388 may be a full solution in a chip, and may not be a separate network card 388.

As depicted in FIG. 3, the display/touchscreen 332, the video port 338, the USB port 342, the camera 348, the first stereo speaker 354, the second stereo speaker 356, the microphone 360, the FM antenna 364, the stereo headphones 366, the RF switch 370, the RF antenna 372, the keypad 374, the mono headset 376, the vibrator 378, and the power supply 380 are external to the on-chip system 322.

In a particular aspect, one or more of the method steps described herein may be stored in the memory 344 as computer program instructions. These instructions may be executed by the multicore CPU 324 in order to perform the methods described herein. Further, the multicore CPU 324, the memory 344, or a combination thereof may serve as a means for executing one or more of the method steps described herein in order to dynamically control the power of each CPU, or core, within the multicore CPU 324.

Referring to FIG. 4, a processing system is shown and is generally designated 400. In a particular aspect, the processing system 400 may be incorporated into the PCD 320 described above in conjunction with FIG. 3. As shown, the processing system 400 may include a multicore central processing unit (CPU) 402 and a memory 404 connected to the multicore CPU 402. The multicore CPU 402 may include a zeroth core 410, a first core 412, and an Nth core 414. The zeroth core 410 may include a zeroth dynamic clock and voltage scaling (DCVS) algorithm 416 executing thereon. The first core 412 may include a first DCVS algorithm 417 executing thereon. Further, the Nth core 414 may include an Nth DCVS algorithm 418 executing thereon. In a particular aspect, each DCVS algorithm 416, 417, 418 may be independently executed on a respective core 410, 412, 414.

Moreover, as illustrated, the memory 404 may include an operating system 420 stored thereon. The operating system 420 may include a scheduler 422 and the scheduler 422 may include a first run queue 424, a second run queue 426, and an Nth run queue 428. The memory 404 may also include a first application 430, a second application 432, and an Nth application 434 stored thereon.

In a particular aspect, the applications 430, 432, 434 may send one or more tasks 436 to the operating system 420 to be processed at the cores 410, 412, 414 within the multicore CPU 402. The tasks 436 may be processed, or executed, as single tasks, threads, or a combination thereof. Further, the scheduler 422 may schedule the tasks, threads, or a combination thereof for execution within the multicore CPU 402. Additionally, the scheduler 422 may place the tasks, threads, or a combination thereof in the run queues 424, 426, 428. The cores 410, 412, 414 may retrieve the tasks, threads, or a combination thereof from the run queues 424, 426, 428 as instructed, e.g., by the operating system 420, for processing, or execution, of those tasks and threads at the cores 410, 412, 414.
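By way of illustration only, the relationship between the scheduler 422, the run queues 424, 426, 428, and the cores 410, 412, 414 may be sketched as follows. The class name, the method names, and the round-robin placement policy are assumptions made for this sketch; the description does not specify how the scheduler chooses a run queue.

```python
from collections import deque


class Scheduler:
    """Hypothetical scheduler holding one run queue per core."""

    def __init__(self, num_cores):
        self.run_queues = [deque() for _ in range(num_cores)]
        self._next_queue = 0

    def submit(self, task):
        # Assumption: tasks are placed round-robin; the source does not
        # state a placement policy.
        index = self._next_queue
        self.run_queues[index].append(task)
        self._next_queue = (index + 1) % len(self.run_queues)
        return index

    def fetch(self, core):
        # A core retrieves the oldest task from its own run queue,
        # or None when the queue is empty.
        queue = self.run_queues[core]
        return queue.popleft() if queue else None
```

A core that finds its run queue empty receives `None` in this sketch, which corresponds to the point at which the core may enter an idle state.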

FIG. 4 also shows that the memory 404 may include a parallelism monitor 440 stored thereon. The parallelism monitor 440 may be connected to the operating system 420 and the multicore CPU 402. Specifically, the parallelism monitor 440 may be connected to the scheduler 422 within the operating system 420.

Referring to FIG. 5, a first aspect of a method of dynamically controlling the power of a central processing unit is shown and is generally designated 500. The method 500 may commence at block 502 with a do loop in which, when a device is powered on, the following steps may be performed.

At block 504, a power controller, e.g., a dynamic clock and voltage scaling (DCVS) algorithm, may monitor one or more CPUs. At decision 506, the power controller may determine whether a transient performance deadline for a CPU has expired. If not, the method 500 may end. Otherwise, if the transient performance deadline has expired, the method 500 may proceed to block 508 and the power controller may move the CPU to a higher performance level, i.e., a next higher operating frequency. In one aspect, the controller may move the CPU to a maximum performance level, i.e., a maximum CPU frequency. However, in another aspect, the CPU may not jump to a maximum performance level. The CPU may jump to an intermediate level and then jump again, either to the maximum level or to another higher performance level. The number of intermediate jumps and the amount of time between jumps may be used to determine the frequency value of the jump.

At block 510, the CPU may enter an idle condition. Further, at block 512, the transient performance deadline may be reset. At block 514, the CPU may exit the idle condition. Moving to decision 516, the power controller may determine whether the upcoming CPU frequency is at a maximum CPU frequency. If so, the method 500 may end. Otherwise, if the CPU frequency is not at the maximum CPU frequency, the method may proceed to block 518 and the timer may be rescheduled. Then, the method 500 may end.
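As a non-limiting sketch, decision 506, block 508, and decision 516 of the method 500 may be expressed as follows. The frequency table, the `CpuState` type, and the function names are hypothetical, and treating the deadline as expired once the current time reaches it is an assumption.

```python
from dataclasses import dataclass

# Hypothetical frequency levels in MHz; the source lists no actual values.
FREQ_LEVELS_MHZ = [245, 384, 768, 1000]


@dataclass
class CpuState:
    level: int        # index into FREQ_LEVELS_MHZ
    deadline: float   # absolute time at which the transient deadline expires


def step_on_deadline(cpu, now, jump_to_max=False):
    """Decision 506 and block 508: raise the level if the deadline expired.

    Returns True when the performance level actually changed.
    """
    if now < cpu.deadline:
        return False
    top = len(FREQ_LEVELS_MHZ) - 1
    # Either jump straight to the maximum or take one intermediate step.
    new_level = top if jump_to_max else min(cpu.level + 1, top)
    changed = new_level != cpu.level
    cpu.level = new_level
    return changed


def needs_timer_reschedule(cpu):
    """Decision 516 and block 518: reschedule only below the maximum level."""
    return cpu.level < len(FREQ_LEVELS_MHZ) - 1
```

The `jump_to_max` flag reflects the two aspects described above: a direct jump to the maximum CPU frequency, or a jump to an intermediate level followed by further jumps.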

Referring to FIG. 6, a second aspect of a method of dynamically controlling the power of a central processing unit is shown and is generally designated 600. Beginning at block 602, a central processing unit (CPU) may enter an idle state. At block 604, a power controller, e.g., a dynamic clock and voltage scaling (DCVS) algorithm, may set a start idle time (StartIdleTime) equal to a current time (CurrentTime). Further, at block 606, the power controller may determine a busy time (BusyTime) by subtracting a start idle time (StartIdleTime) from an end idle time (EndIdleTime).

At block 608, the CPU may enter a software wait for interrupt (SWFI) condition. At block 610, the CPU may exit the SWFI condition. Moving to block 612, the power controller may set an end idle time (EndIdleTime) equal to a current time (CurrentTime). Further, at block 614, the power controller may determine an idle time (IdleTime) by subtracting the start idle time (StartIdleTime) from the end idle time (EndIdleTime). At block 616, the power controller may determine an upcoming CPU frequency (CPUFreq) from an updated steady state filter (UpdateSteadyStateFilter), a busy time (BusyTime), and an idle time (IdleTime). Thereafter, the method 600 may continue to block 702 of FIG. 7.
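The bookkeeping of blocks 604 through 616 may be sketched as shown below. Two points are assumptions rather than statements of the source: the sketch takes BusyTime as the interval between the previous EndIdleTime and the new StartIdleTime, and `pick_frequency` is a hypothetical stand-in for UpdateSteadyStateFilter, whose internals are not described here.

```python
class IdleTracker:
    """Hypothetical holder for the timestamps used in blocks 604 to 614."""

    def __init__(self, now):
        self.start_idle_time = now
        self.end_idle_time = now   # end of the previous idle period
        self.busy_time = 0.0
        self.idle_time = 0.0

    def enter_idle(self, now):
        # Block 604: StartIdleTime = CurrentTime.
        self.start_idle_time = now
        # Block 606 (assumed operand order): time spent busy since the
        # previous idle period ended.
        self.busy_time = self.start_idle_time - self.end_idle_time

    def exit_idle(self, now):
        # Block 612: EndIdleTime = CurrentTime.
        self.end_idle_time = now
        # Block 614: IdleTime = EndIdleTime - StartIdleTime.
        self.idle_time = self.end_idle_time - self.start_idle_time


def pick_frequency(busy_time, idle_time, current_mhz, levels_mhz):
    """Stand-in for block 616: lowest level covering the observed load."""
    total = busy_time + idle_time
    if total <= 0:
        return levels_mhz[0]
    required = current_mhz * busy_time / total
    for level in sorted(levels_mhz):
        if level >= required:
            return level
    return max(levels_mhz)
```

For example, a CPU running at 1000 MHz that was busy for 6 time units and idle for 4 needed about 600 MHz of capacity, so this stand-in selects the 768 MHz level from the hypothetical table 245, 384, 768, 1000.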

At block 702, the power controller may determine an effective transient budget (EffectiveTransientBudget) using the following formula:
EffectiveTransientBudget=(TransientResponseDeadline*NextCPUFreq)/(NextCPUFreq−CPUFreq)

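where TransientResponseDeadline is the transient response deadline, NextCPUFreq is the higher CPU frequency to which the CPU may jump, and CPUFreq is the upcoming CPU frequency determined at block 616.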

In a particular aspect, a clock scheduling overhead (ClockSchedulingOverhead) and a clock switch overhead (ClockSwitchOverhead) may also be added to the EffectiveTransientBudget. Further, a voltage change overhead (VoltageChangeOverhead) may be added to the EffectiveTransientBudget. Moving to block 704, the power controller may set a deadline to jump to a higher frequency (SetJumpToFrequency) equal to the end idle time (EndIdleTime) plus the effective transient budget (EffectiveTransientBudget). In another aspect, the deadline to jump may be the current time plus the transient budget. Thereafter, the method 600 may end.
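The calculation of blocks 702 and 704 may be sketched as shown below. The function names, the single `overhead_ms` parameter that lumps the optional overheads together, and the choice to return infinity when no higher frequency exists are assumptions made for illustration.

```python
import math


def effective_transient_budget(deadline_ms: float,
                               cpu_freq: float,
                               next_cpu_freq: float,
                               overhead_ms: float = 0.0) -> float:
    """EffectiveTransientBudget per block 702, in milliseconds.

    overhead_ms is a hypothetical parameter standing for the sum of the
    optional clock scheduling, clock switch, and voltage change overheads.
    """
    if next_cpu_freq <= cpu_freq:
        # Assumption: with no higher frequency to jump to, the budget is unbounded.
        return math.inf
    return deadline_ms * next_cpu_freq / (next_cpu_freq - cpu_freq) + overhead_ms


def jump_deadline(end_idle_time_ms: float, budget_ms: float) -> float:
    """Deadline to jump per block 704: EndIdleTime plus EffectiveTransientBudget."""
    return end_idle_time_ms + budget_ms


# A 16 ms deadline at 750 MHz with a 1000 MHz target gives a 64 ms budget.
budget = effective_transient_budget(16.0, 750, 1000)
print(budget, jump_deadline(100.0, budget))  # 64.0 164.0
```

The frequencies only need to share a unit, because the formula depends on their ratio.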

In a particular aspect, the method 600 described in conjunction with FIG. 6 and FIG. 7 may be used to calculate the amount of time that the CPU may remain at the frequency determined by the DCVS before the transient deadline is exhausted and schedule a jump to the higher CPU frequency by that amount of time in the future. If idle is reentered prior to the jump to the higher frequency, the scheduled jump may be cancelled. The method 600 may delay the jump to the higher frequency by the amount of time determined as the EffectiveTransientBudget.

It is to be understood that the method steps described herein need not necessarily be performed in the order as described. Further, words such as “thereafter,” “then,” “next,” etc. are not intended to limit the order of the steps. These words are simply used to guide the reader through the description of the method steps. Moreover, the methods described herein are described as executable on a portable computing device (PCD). The PCD may be a mobile telephone device, a portable digital assistant device, a smartbook computing device, a netbook computing device, a laptop computing device, a desktop computing device, or a combination thereof.

In a particular aspect, a DCVS algorithm is a mechanism which measures CPU load/idle time and dynamically adjusts the CPU clock frequency to track the workload in an effort to reduce power consumption while still providing satisfactory system performance. As the workload changes, the change in CPU throughput may track, but also necessarily lag, the changes in the workload. Unfortunately, this may introduce a problem in cases where the workload has Quality of Service (QoS) requirements, as the DCVS algorithm may not track the workload quickly enough. Further, tasks may fail.

Many DCVS techniques involve measuring the steady state performance requirements of the CPU and setting the CPU frequency and voltage to the lowest level that may meet the steady state CPU usage. This is typically done by measuring the CPU utilization (percentage busy) over a period of time and setting the CPU performance level to one in which the average CPU utilization falls between a high and a low threshold. The averaging period is optimized to minimize the frequency of changing clock frequencies, while maintaining reasonable responsiveness. In order to respond to transient workloads and/or the start of new workloads, panic inputs may have been utilized to quickly bring up the CPU frequency.
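The steady state approach described above may be sketched as a simple threshold rule. The function name, the threshold values of 0.3 and 0.8, and the one-step-at-a-time policy are hypothetical and are chosen only to make the idea concrete.

```python
from typing import Sequence


def steady_state_level(busy_ms: float,
                       window_ms: float,
                       current_index: int,
                       levels_mhz: Sequence[int],
                       low: float = 0.3,
                       high: float = 0.8) -> int:
    """Return the index of the performance level to use for the next window."""
    if window_ms <= 0:
        raise ValueError("window_ms must be positive")
    utilization = busy_ms / window_ms  # fraction of the averaging period spent busy
    if utilization > high and current_index < len(levels_mhz) - 1:
        return current_index + 1       # too busy: step up one level
    if utilization < low and current_index > 0:
        return current_index - 1       # mostly idle: step down one level
    return current_index               # utilization lies between the thresholds
```

Because such a rule reacts only after a full averaging period, it tends to lag sudden workload changes, which is the gap the transient performance guarantee is meant to close.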

In order to avoid the problem of the DCVS lagging the workload and causing tasks to fail, the system and methods disclosed herein provide a transient performance guarantee. The transient performance guarantee may be defined as the maximum amount of time that a continuously busy pulse may be delayed, as compared to running at the higher performance level. This may be accomplished by getting to the higher performance level prior to the transient performance deadline expiring and resetting the deadline whenever the CPU goes idle, since if the CPU is idle, it is by definition not in an oversubscribed state. As disclosed herein, the timer may be rescheduled to preserve the QoS guarantee whenever the system comes out of idle and the system CPU is not running at the maximum frequency.

In order to minimize the power impact of the transient performance guarantee, the present system and methods minimize the likelihood that an incoming pulse may require a frequency increase in order to meet the deadline. This may be accomplished by delaying the change in frequency, i.e., performance level, until the effective transient budget has been exhausted and then jumping straight to the higher performance level and staying there until the pulse is complete, as shown in FIG. 8.

In a particular aspect, the effective transient budget is calculated as the transient response deadline scaled to the current performance level. For example, if the CPU is running at 75% of the maximum clock rate and the transient response deadline is 16 ms, the effective transient budget is 64 ms, i.e., 16 ms/(1-0.75). The effective transient budget represents how long the CPU may run at the current performance level prior to exhausting the budget. If the CPU is idle, the effective transient budget may be the same as the transient response deadline. If the CPU is at the maximum performance level, the effective transient budget is infinite, as shown in FIG. 9.
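One way to see where the scaling comes from is the short derivation below. It assumes that CPU throughput is proportional to clock frequency, which is a simplification. The symbols D (transient response deadline), f (current frequency), f_max (higher frequency), and T (time spent at the current frequency) are introduced here for illustration.

```latex
% In time T at frequency f the CPU completes fT cycles, which would have
% taken fT / f_max at the higher frequency. The delay accumulated is therefore
\[
  \text{delay}(T) = T - \frac{fT}{f_{\max}} = T\left(1 - \frac{f}{f_{\max}}\right).
\]
% Requiring the delay not to exceed the transient response deadline D gives
\[
  T\left(1 - \frac{f}{f_{\max}}\right) \le D
  \quad\Longrightarrow\quad
  T \le \frac{D}{1 - f/f_{\max}} = \frac{D\, f_{\max}}{f_{\max} - f}.
\]
% With f / f_max = 0.75 and D = 16 ms:
\[
  T \le \frac{16\ \text{ms}}{1 - 0.75} = 64\ \text{ms}.
\]
```

Under the same assumption, f = 0 gives T = D and f approaching f_max makes the bound grow without limit, which matches the idle and maximum performance level cases described above.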

Using the methods described herein, the system may provide a strict bound on the maximum amount of time a task might run at some level other than the maximum level, and therefore implicitly provide a calculable bound on completion for tasks that require QoS guarantees, while still allowing dynamic CPU clock scaling. The bound may be set based on what tasks are currently running, a global system property, DCVS algorithm design or other properties, and may be entirely disabled if the system is not running any tasks that have QoS requirements or if the CPU is running at max clock.

In a particular aspect, the present methods may be extended by, instead of jumping to the maximum frequency when the deadline has expired, setting shorter internal effective deadlines and jumping to one, or more, intermediate frequencies, while still ensuring that the CPU is at the maximum frequency before the maximum QoS delay has been exhausted. Further, the present methods may substantially ensure that a well defined transient QoS is maintained, while simultaneously reducing overall CPU power.

The system and methods described herein may utilize opportunistic sampling. In other words, the system and methods may check for timer expiration on a periodic basis. In other aspects, the system and methods may not utilize opportunistic sampling.

In one or more exemplary aspects, the functions described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored as one or more instructions or code on a computer program product such as a machine readable medium, i.e., a non-transitory computer-readable medium. Computer-readable media include computer storage media that facilitate transfer of a computer program from one place to another. A storage medium may be any available medium that may be accessed by a computer. By way of example, and not limitation, such non-transitory computer-readable media may comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that may be used to store desired program code in the form of instructions or data structures and that may be accessed by a computer. Disk and disc, as used herein, include compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and blu-ray disc, where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of non-transitory computer-readable media.

Although selected aspects have been illustrated and described in detail, it will be understood that various substitutions and alterations may be made therein without departing from the spirit and scope of the present invention, as defined by the following claims.

Rychlik, Bohuslav, Iranli, Ali, Sur, Sumit, Gargash, Norman S., Thomson, Steven S.

Executed on | Assignor | Assignee | Conveyance | Frame/Reel/Doc
Aug 06 2010 | SUR, SUMIT | Qualcomm Incorporated | ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS) | 0256870637 pdf
Aug 06 2010 | GARGASH, NORMAN S | Qualcomm Incorporated | ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS) | 0256870637 pdf
Aug 09 2010 | RYCHLIK, BOHUSLAV | Qualcomm Incorporated | ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS) | 0256870637 pdf
Aug 10 2010 | IRANLI, ALI | Qualcomm Incorporated | ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS) | 0256870637 pdf
Aug 14 2010 | THOMSON, STEVEN S | Qualcomm Incorporated | ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS) | 0256870637 pdf
Nov 11 2010 | Qualcomm Incorporated | (assignment on the face of the patent)
Date Maintenance Fee Events
May 09 2018 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity.
May 11 2022 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity.

Date Maintenance Schedule
Dec 09 2017 | 4 years fee payment window open
Jun 09 2018 | 6 months grace period start (w surcharge)
Dec 09 2018 | patent expiry (for year 4)
Dec 09 2020 | 2 years to revive unintentionally abandoned end. (for year 4)
Dec 09 2021 | 8 years fee payment window open
Jun 09 2022 | 6 months grace period start (w surcharge)
Dec 09 2022 | patent expiry (for year 8)
Dec 09 2024 | 2 years to revive unintentionally abandoned end. (for year 8)
Dec 09 2025 | 12 years fee payment window open
Jun 09 2026 | 6 months grace period start (w surcharge)
Dec 09 2026 | patent expiry (for year 12)
Dec 09 2028 | 2 years to revive unintentionally abandoned end. (for year 12)