A method for packet processing on a multi-core processor. According to one embodiment of the invention, a first set of one or more processing cores are configured to include the capability to process packets belonging to a first set of one or more packet types, and a second set of one or more processing cores are configured to include the capability to process packets belonging to a second set of one or more packet types, where the second set of packet types is a subset of the first set of packet types. Packets belonging to the first set of packet types are processed at a processing core of either the first or second set of processing cores. Packets belonging to the second set of packet types are processed at a processing core of the first set of processing cores.
|
1. A method for packet processing on a multi-core processor having a plurality of processing cores wherein each packet being processed includes a plurality of processing phases each identifiable by a phase identifier, the method comprising:
processing a first set of one or more packet types by a first set of one or more of the plurality of processing cores to include the capability to process packets belonging to a first set of one or more packet types;
processing a second set of one or more packet types by a second set of one or more of the plurality of processing, the second set of packet types being a subset of the first set of packet types;
receiving a first packet;
determining that the first packet belongs to the second set of packet types and processing the first packet at one of the first or second set of processing cores;
receiving a second packet;
determining that the second packet does not belong to the second set of packet types and processing the second packet at one of the first set of processing cores,
receiving a third, fourth, and fifth packet, the third and fourth packet belonging to the same flow, wherein at least one of the processing phases of the third packet is performed prior to at least one of the processing phases of the fourth packet;
assigning the third and fourth packets to different sets of the plurality of processing cores;
processing the at least one of the processing phases of the third packet and stalling the processing of the fourth packet;
upon determining that processing of the fourth packet is stalled, de-scheduling the processing of the fourth packet and allowing the processing core that was assigned to process the fourth packet to process the fifth packet;
upon the completion of the at least one processing phases of the third packet, releasing the stalling of the processing of the fourth packet allowing one of the sets of processing cores to resume processing of the fourth packet at the particular processing phase previously de-scheduled; and
upon each processing phase being completed for a particular packet, associating the corresponding phase identifier with that particular packet.
9. A network element, comprising:
an interface;
a memory;
a packet work assignment unit;
a first set of one or more processing cores; and
a second set of one or more processing cores,
wherein a first set of one or more packet types are processed by the first set of one or more of the plurality of processing cores,
wherein a second set of one or more packet types are processed by a second set of one or more of the plurality of processing cores, the second set of packet types being a subset of the first set of packet types,
wherein the packet work assignment unit receives a first packet, a second packet, a third packet, a fourth packet, and a fifth packet,
wherein the packet work assignment unit assigns packets to the first set of one or more packet types or to the second set of one or more packet types corresponding to the packet type supported by the first set of one or more processing cores and the second set of one or more processing cores,
wherein the third and fourth packet belonging to the same flow,
wherein at least one of the processing phases of the third packet is performed prior to at least one of the processing phases of the fourth packet;
wherein the third and fourth packets are assigned to different sets of the plurality of processing cores;
wherein the at least one of the processing phases of the third packet stalls the processing of the fourth packet by one of the sets of the plurality of processing cores;
wherein the stalling of the fourth packet is detected by one of the sets of the plurality of processing cores that processing core de-scheduling the processing of the fourth packet and allowing the processing core that was assigned to process the fourth packet to process the fifth packet;
wherein one of the sets of the plurality of processing cores completes the at least one processing phases of the third packet, releases the stalled of the processing of the fourth packet allowing one of the sets of processing cores to resume processing of the fourth packet at the particular processing phase previously de-scheduled; and
upon each processing phase being completed for a particular packet, associating the corresponding phase identifier with that particular packet.
15. A non-transitory computer readable storage medium having embodied thereon a program executable by a processor to perform operations for packet processing on a multi-core processor having a plurality of processing cores, wherein each packet being processed includes a plurality of processing phases each identifiable by a phase identifier, the operations comprising:
processing a first set of one or more packet types by a first set of one or more of the plurality of processing cores to include the capability to process packets belonging to a first set of one or more packet types;
processing a second set of one or more packet types by a second set of one or more of the plurality of processing, the second set of packet types being a subset of the first set of packet types;
receiving a first packet;
determining that the first packet belongs to the second set of packet types and processing the first packet at one of the first or second set of processing cores;
receiving a second packet;
determining that the second packet does not belong to the second set of packet types and processing the second packet at one of the first set of processing cores,
receiving a third, fourth, and fifth packet, the third and fourth packet belonging to the same flow, wherein at least one of the processing phases of the third packet is performed prior to at least one of the processing phases of the fourth packet;
assigning the third and fourth packets to different sets of the plurality of processing cores;
processing the at least one of the processing phases of the third packet and stalling the processing of the fourth packet;
upon determining that processing of the fourth packet is stalled, de-scheduling the processing of the fourth packet and allowing the processing core that was assigned to process the fourth packet to process the fifth packet;
upon the completion of the at least one processing phases of the third packet, releasing the stalling of the processing of the fourth packet allowing one of the sets of processing cores to resume processing of the fourth packet at the particular processing phase previously de-scheduled; and
upon each processing phase being completed for a particular packet, associating the corresponding phase identifier with that particular packet.
4. The method of
5. The method of
6. The method of
7. The method of
8. The method of
10. The network element of
11. The network element of
12. The network element of
13. The network element of
14. The network element of
a first set of one or more network packet processing modules processing a first set of one or more packet types on the first set of one or more processing cores;
a second set of one or more network packet processing modules processing a second set of one or more packet types on the second set of one or more processing cores, the second set of packet types being a subset of the first set of packet types;
a third set of one or more processing cores;
a third set of one or more network packet processing modules processing a third set of the one or more packet types on the third set of one or more processing cores, the third set of packet types being a subset of the first set of packet types.
16. The non-transitory computer readable storage medium of
17. The non-transitory computer readable storage medium of
18. The non-transitory computer readable storage medium of
19. The non-transitory computer readable storage medium of
20. The non-transitory computer readable storage medium of
21. The non-transitory computer readable storage medium of
22. The non-transitory computer readable storage medium of
23. The non-transitory computer readable storage medium of
|
The present application is a continuation and claims the benefit of U.S. patent application Ser. No. 12/240,892, now U.S. Pat. No. 7,990,974, entitled “PACKET PROCESSING ON A MULTI-CORE PROCESSOR”, filed on Aug. 2, 2011.
1. Field
Embodiments of the invention relate to the field of packet processing; and more specifically, to packet processing on a multi-core processor.
2. Background
A multi-core processor (e.g., a multi-core network processor, a multi-core general purpose processor, etc.) is a processor with two or more processing cores. Multi-core processors may increase processing performance. However, the packet processing architecture of a system is modified to realize the processing performance advantages of a multi-core processor. One packet processing architecture includes executing the same packet processing modules on each core. Thus, in this packet processing architecture, each core may process any packet.
Another packet processing architecture includes a single processing core or processor (e.g., a separate general purpose CPU) only processing “exception” packets, while the other cores only process “non-exception” packets. In this architecture, typically all packets are received by the processing cores that process the “non-exception” packets. Upon determining a packet is an “exception” packet, that packet is forwarded to the processing core or processor dedicated for processing “exception” packets.
The invention may best be understood by referring to the following description and accompanying drawings that are used to illustrate embodiments of the invention. In the drawings:
In the following description, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In other instances, well-known circuits, structures and techniques have not been shown in detail in order not to obscure the understanding of this description. Those of ordinary skill in the art, with the included descriptions, will be able to implement appropriate functionality without undue experimentation.
References in the specification to “one embodiment”, “an embodiment”, “an example embodiment”, etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to effect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
In the following description and claims, the terms “coupled” and “connected,” along with their derivatives, may be used. It should be understood that these terms are not intended as synonyms for each other. “Coupled” is used to indicate that two or more elements, which may or may not be in direct physical or electrical contact with each other, co-operate or interact with each other. “Connected” is used to indicate the establishment of communication between two or more elements that are coupled with each other.
The techniques shown in the figures can be implemented using code and data stored and executed on one or more electronic devices (e.g., a computer end station, a network element, etc.). Such electronic devices store and communicate (internally and with other electronic devices over a network) code and data using machine-readable media, such as machine storage media (e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices; phase-change memory) and machine communication media (e.g., electrical, optical, acoustical or other form of propagated signals—such as carrier waves, infrared signals, digital signals, etc.). In addition, such electronic devices typically include a set of one or more processors coupled to one or more other components, such as a storage device, one or more user input/output devices (e.g., a keyboard, a touchscreen, and/or a display), and a network connection. The coupling of the set of processors and other components is typically through one or more busses and bridges (also termed as bus controllers). The storage device and signals carrying the network traffic respectively represent one or more machine storage media and machine communication media. Thus, the storage device of a given electronic device typically stores code and/or data for execution on the set of one or more processors of that electronic device. Of course, one or more parts of an embodiment of the invention may be implemented using different combinations of software, firmware, and/or hardware.
As used herein, a network element (e.g., a router, switch, bridge, secure router, unified threat management (UTM) appliance (e.g., including functionality including but not limited to firewall, gateway, anti-virus, anti-spyware, intrusion detection system, intrusion prevention system, web content filtering, and/or IPsec VPN) is a piece of networking equipment, including hardware and software that communicatively interconnects other equipment on the network (e.g., other network elements, computer end stations, etc.). A secure router includes, but is not limited to, routing functionality, switching functionality and security functionality (e.g., firewall, gateway, anti-virus, anti-spyware, intrusion detection system, intrusion prevention system, web content filtering, and/or IPsec VPN, user authentication, client security posture assessment, etc.).
A method and apparatus for packet processing on a multi-core processor is described. In one embodiment of the invention, a first set of one or more processing cores are loaded with packet processing modules that include the capability of processing packets belonging to a first set of one or more packet types, and a second set of one or more processing cores are loaded with packet processing modules that include the capability of processing packets belonging to a second set of one or more packet types, the second set of packet types being a subset of the first set of packet types.
According to one embodiment of the invention, the first set of packet types includes each packet to be processed by the multi-core processor. For example, each ingress packet (including control packets, data packets, packets relating to configuration, etc.) may be processed by the first set of one or more processing cores. The second set of packet types is a subset of the first set of packet types. For example, the second set of one or more processing cores is capable of processing a subset of the ingress packets (e.g., data packets). According to one embodiment of the invention, the second set of packet types includes substantially all ingress packets.
In one embodiment of the invention, each processing core may be assigned any packet regardless of the packet processing capabilities included on the processing cores. If a processing core does not include the capability of processing a particular packet, that packet is redirected to a processing core that includes the capability to process that packet. In an alternative embodiment of the invention, processing cores are assigned packets to process based on their processing capabilities.
In one embodiment of the invention, a packet goes through various processing stages during its processing lifecycle. Each processing stage is identifiable with a processing stage identification. Certain processing stages and/or portions of processing stages require locks to be placed on certain packet flows. For example, if the processing results of a certain stage or received packet needs to be consumed by a subsequent stage or a subsequently received packet, that subsequent stage or subsequently received packet is stalled. If a processing core encounters a lock, the processing core may de-schedule the packet and receive a different packet to process. After the lock is removed, the previously de-scheduled packet is assigned to a core for processing.
The complete firmware 130 is loaded onto the processing core 110A (e.g., the processing core 110A executes the complete firmware 130). According to one embodiment of the invention, the complete firmware 130 includes the same processing capabilities as if the processing core 110A was the only processing core available on the multi-core processor 100, including operating system support. For example, the code base for a single core network element may be substantially similar to the complete firmware 130, which minimizes modifications to an existing code base. The subset firmware 140A-140N is loaded onto the processing cores 110B-110N respectively. The subset firmware 140A-140N is a subset of the features and processing capability of the complete firmware 130. According to one embodiment of the invention, the subset firmware 140A-140N does not include operating system functionality. Thus, the amount of code loaded onto each of the processing cores 110B-110N is typically less than the amount of code loaded on to the processing core 110A. According to one embodiment of the invention, the complete firmware 130 may process each packet that the subset firmware 140A-140N may process, and additional packets and configuration management messages that the subset firmware 140A-140N cannot process.
The packet work assignment unit 105 assigns (schedules) work for the processing cores 110A-110N. The packet work assignment unit 105 may assign packets by a number of mechanisms. For example, the packet work assignment unit 105 may include one or more ingress packet buffers (e.g., ingress queue(s)) and assign packets to be processed to the processing cores 110A-110N based on the availability of the processing cores, and based on the order of receipt of the incoming packets. In one embodiment of the invention the processing cores 110A-110N request work from the packet work assignment unit 105 when they are available (e.g., not processing a packet), while in an alternative embodiment of the invention the packet work assignment unit 105 polls the processing cores 110A-110N to determine the availability of the processing cores 110A-110N.
In one embodiment of the invention, the packet work assignment unit 105 separates packets into one or more flows, depending on characteristics of the packet (e.g., source IP address, destination IP address, source port, destination port, protocol, etc.). According to one embodiment of the invention, a particular flow may be assigned to a particular processing core (e.g., all packets from that flow may be assigned to that processing core), while in an alternative embodiment of the invention packets from the same flow may be assigned to different processing cores.
The firmware 215 is loaded onto the processing core 210 and the firmware 225 is loaded onto the processing core 220. According to one embodiment of the invention, the firmware 215 includes a complete operating system that includes the functionality to process all ingress packets (including control packets, data packets, configuration packets, etc.) and the firmware 225 includes a subset of the features of the firmware 215. However, it should be understood that in alternative embodiments of the invention the firmware 215 does not include a complete operating system and may not include the functionality to process all ingress packets. The firmware 215 includes the processing type determination module 240, one or more non-network packet processing modules 230, and one or more network packet processing modules 250. The processing type determination module 240 parses work and determines the appropriate processing module to process that work. The one or more non-network packet processing modules 230 are used to process non-network packets (e.g., packets relating to configuration management, etc.). The one or more network packet processing module(s) 250 are used to process each ingress network packet (e.g., data packets, control packets, etc.). According to one embodiment of the invention, the non-network packet processing module(s) 230 and the network packet processing module(s) 250 together define the types of packets the firmware 215 supports processing. Packets belonging to the non-network packet processing module(s) 230 and/or the network packet processing module(s) 250 are packets that may be processed by the non-network packet processing module(s) and/or the network packet processing module(s) 250.
According to one embodiment of the invention, the firmware 225 includes a subset of the features of the firmware 215. The firmware 225 includes the processing type determination module 260 and one or more subset network packet processing modules 270. The network packet processing module(s) 270 is a subset of the network packet processing module(s) 250. For example, the network packet processing module(s) 270 cannot process each packet that the network packet processing module(s) 250 may process. According to one embodiment of the invention, control packets may be processed by the network packet processing module(s) 250 and cannot be processed by the subset network packet processing module(s) 270. While in one embodiment of the invention the firmware 225 includes support for processing packets requiring operating system support, in an alternative embodiment of the invention the firmware 225 does not include support for processing packets that require operating system support. According to one embodiment of the invention, the subset network packet processing module(s) 270 define the types of packets the firmware 225 supports processing. Packets belonging to the subset network packet processing module(s) 270 are packets that may be processed by the subset network packet processing module(s) 270.
According to one embodiment of the invention, the network packet processing module(s) 270 includes support for processing substantially most packets (i.e., most of the packets to be processed by the multi-core processor 200 are capable of being processed by the subset network packet processing module(s) 270). Packets that cannot be processed by the subset network packet processing module(s) 270 may be re-directed to the processing core 210 (these packets may be referred to as “exception” packets). According to one embodiment of the invention, the processing type determination module 260 determines whether the subset network packet processing module(s) 270 is capable of processing an incoming packet, and if so, which of the subset network packet processing module(s) can process that packet. For example, the processing type determination module 260 determines the type of the packet, and whether processing that type of packet is supported by one or more of the subset network packet processing module(s) 270.
In one embodiment of the invention, the majority of network packets (e.g., “non-exception” packets such as certain data packets, etc.) may be processed by either the processing core 210 or the processing core 220. In other words, the processing core 210 and the processing core 220 may each process the majority of packets. However, for the minority of network packets (e.g., “exception” packets such as certain control packets, configuration packets, etc.), only the processing core 210 is capable of processing those packets.
This multi-core packet processing architecture increases throughput, efficiency, is scalable with two or more processing cores, and reduces the amount of code base required for packet processing. For example, in a typical “slow path” architecture, which has a set of one or more processing cores being dedicated for processing only “exception” packets, those processing cores are idle when not processing “exception” packets. By way of example, in previous multi-core processing architectures, if the multi-core processor has two processing cores, one being dedicated for processing “exception” packets and the other being dedicated for processing “non-exception” packets, it should be understood that the processing core dedicated for processing “exception” packets remains idle most of the time (“exception” packets are encountered much less frequently than “non-exception” packets). In contrast, embodiments of the invention include a first set of one or more processing cores capable of processing “exception” packets and “non-exception” packets (e.g., all packets), and a second set of one or more processing cores capable of processing “non-exception” packets (e.g., the majority of packets). Thus, embodiments of the invention reduce the amount of time any of the processing cores of a multi-core processor are idle, which increases packet processing throughput and efficiency.
In addition, unlike other multi-core packet processing architectures which load the same packet processing capability on each processing core (which increases the code base and complexity of the architecture), embodiments of the invention allow for a reduced code size. For example, in some embodiments of the invention only a single processing core is loaded with a complete firmware (e.g., including an operating system that is capable of processing configuration management, data packets, control packets, etc.) while the other processing cores are loaded with a certain subset of that firmware to process the majority of the packets that the multi-core processor will encounter. Of course it should be understood that in alternative embodiments of the invention more than one processing core is loaded with a complete firmware.
Referring back to
At operation 2, the packet work assignment unit 205 assigns the packet 208 to be processed by the processing core 220. The packet work assignment unit 205 may assign the packet 208 in any number of ways, including causing a reference to be inserted into a work queue of the processing core 220, causing the packet 208 to be read into a memory of the processing core 220, etc. At operation 3, the processing type determination module 260 determines the type of the packet 208 and whether the firmware 225 (loaded on the processing core 220) is capable of processing the packet 208 (e.g., whether the packet 208 belongs to the subset network packet processing module(s) 270). There are many ways the processing type determination module 260 determines whether the firmware 225 has the functionality to process the packet 208. For example, in one embodiment of the invention, the processing type determination module 206 determines the type of the packet 208 (e.g., whether the packet is a data packet, a control packet, a packet related to configuration, etc.) and whether the subset network packet processing module(s) 270 includes one or more modules necessary to process the packet 208. According to one embodiment of the invention, the processing type determination module 206 determines the type of packet 208 from the header of the packet. In an alternative embodiment of the invention, the packet work assignment unit associates each work item with a processing phase identifier which the processing type determination module 260 may use when determining whether the firmware 225 has the functionality to process that packet.
If the processing type determination module 260 determines that the packet 208 belongs to the subset network packet processing module(s) 270 and thus may be processed by the firmware 225, at operation 4A the packet is processed with the appropriate module. Thus, in operation 4A, the packet 208 is determined to be a “non-exception” packet and is processed by one or more of the subset network packet processing module(s) 270.
If the processing type determination module 260 determines that the packet 208 does not belong to the subset network packet processing module(s) 270 and thus cannot be processed by the firmware 225, at operation 4B the packet is redirected to the core 210. Thus, in operation 4B, the packet 208 is determined to be an “exception” packet which cannot be processed by the firmware 225. The packet may be redirected in any number of ways. For example, the packet 208 may be returned to the packet work assignment unit 205, which may then place the packet into a specialized queue for the processing core 210. Thus, when the processing core 210 is available for work, the packet work assignment unit 205 assigns the packet 208 to the processing core 210. Of course, other redirection mechanisms may be used in embodiments of the invention described herein. Sometime later, after being assigned the packet 208 which has been redirected, at operation 5, the firmware 215 processes the packet 208 with the appropriate packet processing module (e.g., one or more of the non-network packet processing module(s) 230 and network packet processing module(s) 250).
At operation 6, the packet work assignment unit 205 assigns the packet 208 to be processed by the processing core 210. Since the firmware 215 includes a complete operating system capable of processing any ingress packet, the processing type determination module 240 determines which packet processing module(s) are used to process packets of that type (e.g., whether the packet is a non-network packet (e.g., related to configuration management) or a network packet), and at operation 7, that packet is processed with the appropriate packet processing module. It should be understood that with reference to operations 6-7, the packet 208 may either be an “exception” packet or a “non-exception” packet with respect to the processing core 220. In either case, the firmware 215 loaded on the processing core 210 is capable of processing the packet 208. Of course, it should be understood that the firmware 215 may not always be successful in processing each packet (e.g., if the packet is malformed and/or corrupted).
Although
The firmware 315 is loaded onto the processing core 310. According to one embodiment of the invention, the firmware 315 is similar to the firmware 215 described in reference with
The firmware 325A-325N is loaded onto the processing cores 320A-320N respectively. According to one embodiment of the invention, the firmware 325A-325N are each similar to the firmware 225 as described with reference to
At operation 1, the packet 308 enters the packet work assignment unit 305. For example, the packet 308 was received by an interface into the network element and stored in memory and a reference to that packet was placed into an ingress queue associated with the packet work assignment unit 305. Similarly as described with reference to the packet work assignment unit 205, the packet work assignment unit 305 may assign the packet 308 to any of the processing cores of the multi-core processor 300 (e.g., the processing core 310 or any of the processing cores 320A-320N). In addition, in some embodiments the packet work assignment unit 305 assigns the packet 308 to a processing core which supports processing the packet 308. The operations 2-5 describe the operations performed when the packet work assignment unit 305 assigns the packet 308 to be processed by the processing core 320N, the operations 6-9 describe the operations performed when the packet work assignment unit 305 assigns the packet 308 to be processed by the processing core 320A, and the operations 10-100 describe the operations performed when the packet work assignment unit 305 assigns the packet 308 to be processed by the processing core 310.
At operation 2, the packet work assignment unit 305 assigns the packet 308 to be processed by the processing core 320N. The packet work assignment unit 305 may assign the packet 308 with numerous mechanisms (e.g., as described with reference to
If the packet work assignment unit 305 determines to assign the packet 308 to the processing core 320A, the following operations are performed. At operation 6, the packet work assignment unit 305 assigns the packet 308 to be processed by the processing core 320A. Sometime later at operation 7, the packet 308 is processed by the subset network packet processing module(s) 370A upon the processing type determination module 360A determining that the firmware 325A includes the functionality to process the packet 308. However, if the processing type determination module 360A determines that the firmware 325A does not include the functionality to process the packet 308, at operation 8 the packet 308 is redirected to the processing core 310 for processing. As described with reference to
At operation 10, the packet work assignment unit 305 assigns the packet 308 to be processed by the processing core 310. Since the firmware 315 includes a complete operating system capable of processing any ingress packet, the processing type determination module 340 determines which packet processing module(s) are used to process packets of that type (e.g., whether the packet is a non-network packet (e.g., related to configuration management) or a network packet), and sometime later at operation 11, that packet is processed with the appropriate packet processing module. It should be understood that with reference to operations 10-11, the packet 308 may either be an “exception” packet or a “non-exception” packet in respect to the processing cores 320A-320N. In either case, the firmware 315 may process the packet 308. Of course, it should be understood that the firmware 315 may not always be successful in processing each packet (e.g., if the packet is malformed and/or corrupted). Thus, the multi-core packet processing architecture of the embodiments described herein may be implemented on a multi-core processor having two or more processing cores.
At block 440, the assigned processing core determines the processing required for the packet. Flow moves from block 440 to block 450. At block 450, a determination is made whether the assigned core supports processing the received packet (e.g., whether the packet belongs to one or more packet processing modules loaded on the assigned core). The assigned processing core may not support processing the received packet for a number of reasons (e.g., the assigned processing core does not include the appropriate packet processing module(s) to process the packet, the assigned processing core is configured not to process packets of that type, etc.). If the assigned processing core supports processing the packet, then flow moves to block 460 where the packet is processed. However, if the assigned processing core does not support processing the packet, then flow moves to block 470. At block 470, a determination is made whether another processing core supports processing the packet. If another processing core does not support processing the packet, then flow moves to block 480 where the packet is dropped. If another processing core supports processing the packet, then flow moves to block 490, where the packet is redirected to that processing core. Flow then moves to block 495 where the packet is then processed.
At block 510, the packet processing enters into an initial processing phase. As described previously, in some embodiments of the invention, the multi-core processor separates received packets into traffic flows. Typically, packets belonging to the same flow have the same source and destination addresses. Separating packets into traffic flows assists in the ordering and synchronizing of packets. In one embodiment of the invention, the initial processing phase 510 includes assigning the packet to a traffic flow based on properties of the packet (e.g., source IP address, destination IP address, source port, destination port, protocol, etc.). In addition to separating packets into traffic flows, in some embodiments of the invention the initial processing phase 510 includes assigning a tag type and value to the packet which determines the order in which the packet is processed (order as compared to other packets of the same traffic flow) and whether multiple packets with the same tag type and value may be processed concurrently. At the conclusion of the initial processing phase, a phase identifier indicating that the processing stage has been completed is stored. Flow moves from block 510 to block 520.
At block 520, the packet enters into a packet flow lookup phase, where the packet flow list is accessed. At the conclusion of the packet flow lookup phase, a phase identifier indicating that the processing stage has been completed is stored. Flow moves from block 520 to block 530. At block 530, the packet enters into the packet flow processing phase. According to one embodiment of the invention, only a single processing core may have access to the packet flow (for a given flow) at a given time. Thus, a lock on that flow must be obtained. In order to prevent head of the line blocking, according to some embodiments of the invention, a processing core that encounters a lock for a given packet de-schedules that packet (e.g., returns that packet back to the ingress queue) and receives another packet to process. Since a processing phase identifier is associated with that packet, the packet may be resumed at that specific processing phase as soon as the lock is removed. At the conclusion of the packet flow processing phrase, a phase identifier indicating that the processing phase has been completed is stored. Flow moves from block 530 to block 540.
At block 540, the packet enters into a security association processing phase. It should be understood that not every packet will enter into a security association processing phase. The security association processing phase will be skipped if the packet does not require security association processing. At the conclusion of the security association processing phase, a phase identifier indicating that the processing phase has been completed is stored. Flow moves from block 540 to block 550. At block 550, the packet enters into an output processing phase.
The packet work assignment unit 305 includes the input queue 620 and the de-schedule queue 630. The input queue 620 stores packets to be processed and to be assigned to a processing core, while the de-schedule queue 620 stores packets which have been assigned and de-scheduled, which will be described in greater detail later herein. While in one embodiment of the invention the input queue 620 and the de-schedule queue 630 are implemented as a FIFO (first-in-first-out) queue, in alternative embodiments of the invention the input queue 620 and the de-schedule queue 630 is implemented differently. The packets 610A-610B, 612, and 614 are stored in the input queue 620. According to one embodiment of the invention, the packets 610A-610B are associated with the same flow (e.g., they have the same source and destination). In addition, for the following discussion, at least certain processing phases of the packet 610B cannot be processed until certain processing phases of the packet 610A have completed processing (i.e., certain packet processing phases must be completed in order).
At operation 1, the packet work assignment unit 305 assigns the packet 610A to be processed by the processing core 310 (illustrated by the packet 610A being inside a dashed box within the processing core 310). With reference to
At operation 3, the packet work assignment unit 305 assigns the packet 610B to the processing core 320A (represented by the packet 610B inside a dashed box within the processing core 320A). According to one embodiment of the invention, the packet 610B is assigned to the processing core 320A prior to the processing core 310 releasing the lock. The processing core 320A processes the packet 610B until it reaches the processing phase when it encounters the lock. As described with reference to
According to one embodiment of the invention, the packet work assignment unit 305 gives priority to scheduling packets in the de-schedule queue 630. For example, packets in the de-schedule queue 630 are scheduled prior to scheduling packets in the ingress queue 620. Thus, at operation 8, the packet work assignment unit 305 assigns the processing core 320N to process the packet 610B from the de-schedule queue 630. At operation 9, the processing core 320N processes the packet 610B. According to one embodiment of the invention, the processing core 320N resumes processing of the packet 610B at the particular processing phase that encountered the lock (e.g., by determining the appropriate processing phase by reading the processing phase identifier associated with the packet 610B).
At operation 10, the packet work assignment unit 305 assigns the packet 614 to the processing core 310 (represented by the packet 614 illustrated in a dashed box within the processing core 310). At operation 11, the processing core 310 processes the packet 614.
At block 710, a processing core receives the scheduling of packet processing work from the packet work assignment unit. Flow moves from block 710 to block 720. At block 720, a determination is made whether the packet processing work is associated with a particular processing phase. For example, the processing core determines whether a processing phase identifier is associated with the packet. If there is no packet processing phase associated with the packet, flow moves to block 724 where processing begins. If there is a packet processing phase associated with the packet, flow moves to block 728 where processing begins at that processing phase. Flow moves from block 724 and 728 to block 730.
At block 730, a determination is made whether the processing is stalled by a lock. If the processing is not stalled by a lock, then flow moves to block 760 where the packet work is processed. However, if the processing is stalled by a lock, then flow moves to block 740 where the packet is de-scheduled (e.g., returned to the packet work assignment unit). Flow moves from block 740 to block 750. At block 750, the processing core receives scheduling of packet work of a different packet from the packet work assignment unit, and flow moves back to block 720.
While
While embodiments of the invention have described a multi-core processing architecture having one or more of the processing cores including a complete firmware, it should be understood that in some embodiments of the invention a set of two or more processing cores may collectively include the functionality of a complete firmware. For example, in one embodiment of the invention, each processing core of the collective complete firmware set may include the capability of processing at least certain data packets, and one or more processing cores of the set may include the capability of processing configuration packets while different one or more processing cores may include the capability of processing control packets.
It should be understood that the types of packets described herein (e.g., control packets, data packets, configuration packets) are exemplary, additional and/or alternative types of packets may be used in embodiments of the invention described herein.
While the flow and data flow diagrams in the figures show a particular order of operations performed by certain embodiments of the invention, it should be understood that such order is exemplary (e.g., alternative embodiments may perform the operations in a different order, combine certain operations, overlap certain operations, etc.)
While the invention has been described in terms of several embodiments, those skilled in the art will recognize that the invention is not limited to the embodiments described, can be practiced with modification and alteration within the spirit and scope of the appended claims. The description is thus to be regarded as illustrative instead of limiting.
Gmuender, John E., Harutyunov, Iosif, Minkin, Ilya, Nguyen, Huy Minh, Marthur, Manish
Patent | Priority | Assignee | Title |
10459777, | Sep 29 2008 | SonicWALL Inc. | Packet processing on a multi-core processor |
10594841, | Oct 10 2012 | Fortinet, Inc. | Configuring initial settings of a network security device via a hand-held computing device |
10970144, | Sep 29 2008 | SonicWALL Inc. | Packet processing on a multi-core processor |
9098330, | Sep 29 2008 | QUEST SOFTWARE INC F K A DELL SOFTWARE INC ; Aventail LLC | Packet processing on a multi-core processor |
9137158, | May 31 2012 | Hitachi, Ltd. | Communication apparatus and communication method |
9201683, | Dec 28 2012 | Futurewei Technologies, Inc. | Methods for task scheduling through locking and unlocking an ingress queue and a task queue |
9274826, | Dec 28 2012 | FUTUREWEI TECHNOLOGIES, INC | Methods for task scheduling through locking and unlocking an ingress queue and a task queue |
9535773, | Sep 29 2008 | QUEST SOFTWARE INC F K A DELL SOFTWARE INC ; Aventail LLC | Packet processing on a multi-core processor |
9898356, | Sep 29 2008 | SonicWALL Inc. | Packet processing on a multi-core processor |
Patent | Priority | Assignee | Title |
6675221, | Apr 06 2000 | GOOGLE LLC | Method and apparatus for customizing and fowarding parameters in a network processor |
7664823, | Sep 24 2003 | Cisco Technology, Inc. | Partitioned packet processing in a multiprocessor environment |
7702896, | Oct 03 2006 | American Megatrends, Inc. | Interactive firmware recovery |
7990974, | Sep 29 2008 | QUEST SOFTWARE INC F K A DELL SOFTWARE INC ; Aventail LLC | Packet processing on a multi-core processor |
20030235194, | |||
20060056406, | |||
20080127198, | |||
20100169636, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Oct 01 2008 | NGUYEN, HUY MINH | SONICWALL, INC | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF 3RD INVENTOR S NAME PREVIOUSLY RECORDED AT REEL: 031142 FRAME: 0767 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNMENT | 039375 | /0674 | |
Oct 01 2008 | MINKIN, ILYA | SONICWALL, INC | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF 3RD INVENTOR S NAME PREVIOUSLY RECORDED AT REEL: 031142 FRAME: 0767 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNMENT | 039375 | /0674 | |
Oct 01 2008 | NGUYEN, HUY MINH | SONICWALL, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 031142 | /0767 | |
Oct 01 2008 | MINKIN, ILYA | SONICWALL, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 031142 | /0767 | |
Oct 01 2008 | MARTHUR, MANISH | SONICWALL, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 031142 | /0767 | |
Oct 02 2008 | GMUENDER, JOHN E | SONICWALL, INC | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF 3RD INVENTOR S NAME PREVIOUSLY RECORDED AT REEL: 031142 FRAME: 0767 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNMENT | 039375 | /0674 | |
Oct 02 2008 | HARUTYUNOV, IOSIF | SONICWALL, INC | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF 3RD INVENTOR S NAME PREVIOUSLY RECORDED AT REEL: 031142 FRAME: 0767 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNMENT | 039375 | /0674 | |
Oct 02 2008 | GMUENDER, JOHN E | SONICWALL, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 031142 | /0767 | |
Oct 02 2008 | HARUTYUNOV, IOSIF | SONICWALL, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 031142 | /0767 | |
Jul 23 2010 | SONICWALL, INC | PSM MERGER SUB DELAWARE , INC C O THOMA BRAVO, LLC | MERGER SEE DOCUMENT FOR DETAILS | 031142 | /0840 | |
Jul 23 2010 | PSM MERGER SUB DELAWARE , INC | SONICWALL, INC | CHANGE OF NAME SEE DOCUMENT FOR DETAILS | 031142 | /0927 | |
Aug 02 2011 | SonicWall, Inc. | (assignment on the face of the patent) | / | |||
Jan 23 2013 | SONICWALL, INC | SONICWALL L L C | CONVERSION AND NAME CHANGE | 037289 | /0403 | |
Apr 08 2015 | SONICWALL L L C | DELL SOFTWARE INC | MERGER SEE DOCUMENT FOR DETAILS | 037286 | /0619 | |
May 24 2016 | MATHUR, MANISH | SONICWALL, INC | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF 3RD INVENTOR S NAME PREVIOUSLY RECORDED AT REEL: 031142 FRAME: 0767 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNMENT | 039375 | /0674 | |
Sep 07 2016 | DELL PRODUCTS, L P | CREDIT SUISSE AG, CAYMAN ISLANDS BRANCH, AS COLLATERAL AGENT | SECURITY AGREEMENT | 040030 | /0187 | |
Sep 07 2016 | DELL SOFTWARE INC | CREDIT SUISSE AG, CAYMAN ISLANDS BRANCH, AS COLLATERAL AGENT | SECURITY AGREEMENT | 040030 | /0187 | |
Sep 07 2016 | Aventail LLC | THE BANK OF NEW YORK MELLON TRUST COMPANY, N A , AS NOTES COLLATERAL AGENT | SECURITY AGREEMENT | 040039 | /0642 | |
Sep 07 2016 | Aventail LLC | CREDIT SUISSE AG, CAYMAN ISLANDS BRANCH, AS COLLATERAL AGENT | SECURITY AGREEMENT | 040030 | /0187 | |
Sep 07 2016 | Dell Products L P | THE BANK OF NEW YORK MELLON TRUST COMPANY, N A , AS NOTES COLLATERAL AGENT | SECURITY AGREEMENT | 040039 | /0642 | |
Sep 07 2016 | DELL SOFTWARE INC | THE BANK OF NEW YORK MELLON TRUST COMPANY, N A , AS NOTES COLLATERAL AGENT | SECURITY AGREEMENT | 040039 | /0642 | |
Oct 31 2016 | THE BANK OF NEW YORK MELLON TRUST COMPANY, N A | Aventail LLC | RELEASE OF SECURITY INTEREST IN CERTAIN PATENTS PREVIOUSLY RECORDED AT REEL FRAME 040039 0642 | 040521 | /0016 | |
Oct 31 2016 | Credit Suisse AG, Cayman Islands Branch | DELL SOFTWARE INC | RELEASE BY SECURED PARTY SEE DOCUMENT FOR DETAILS | 040521 | /0467 | |
Oct 31 2016 | THE BANK OF NEW YORK MELLON TRUST COMPANY, N A | Dell Products L P | RELEASE OF SECURITY INTEREST IN CERTAIN PATENTS PREVIOUSLY RECORDED AT REEL FRAME 040039 0642 | 040521 | /0016 | |
Oct 31 2016 | THE BANK OF NEW YORK MELLON TRUST COMPANY, N A | DELL SOFTWARE INC | RELEASE OF SECURITY INTEREST IN CERTAIN PATENTS PREVIOUSLY RECORDED AT REEL FRAME 040039 0642 | 040521 | /0016 | |
Oct 31 2016 | Credit Suisse AG, Cayman Islands Branch | Aventail LLC | RELEASE BY SECURED PARTY SEE DOCUMENT FOR DETAILS | 040521 | /0467 | |
Oct 31 2016 | Credit Suisse AG, Cayman Islands Branch | DELL PRODUCTS, L P | RELEASE BY SECURED PARTY SEE DOCUMENT FOR DETAILS | 040521 | /0467 | |
Oct 31 2016 | DELL SOFTWARE INC | CREDIT SUISSE AG, CAYMAN ISLANDS BRANCH, AS COLLATERAL AGENT | SECOND LIEN PATENT SECURITY AGREEMENT | 040587 | /0624 | |
Oct 31 2016 | DELL SOFTWARE INC | CREDIT SUISSE AG, CAYMAN ISLANDS BRANCH, AS COLLATERAL AGENT | FIRST LIEN PATENT SECURITY AGREEMENT | 040581 | /0850 | |
Nov 01 2016 | DELL SOFTWARE INC | QUEST SOFTWARE INC | CHANGE OF NAME SEE DOCUMENT FOR DETAILS | 046040 | /0277 | |
Dec 30 2016 | QUEST SOFTWARE INC | SONICWALL US HOLDINGS, INC | INTELLECTUAL PROPERTY SECURITY AGREEMENT | 041073 | /0001 | |
Dec 30 2016 | QUEST SOFTWARE INC | SONICWALL US HOLDINGS INC | CORRECTIVE ASSIGNMENT TO CORRECT THE THE NATURE OF CONVEYANCE PREVIOUSLY RECORDED AT REEL: 041073 FRAME: 0001 ASSIGNOR S HEREBY CONFIRMS THE INTELLECTUAL PROPERTY ASSIGNMENT | 042168 | /0114 | |
Nov 14 2017 | Credit Suisse AG, Cayman Islands Branch | Aventail LLC | CORRECTIVE ASSIGNMENT TO CORRECT THE ASSIGNEE PREVIOUSLY RECORDED AT REEL: 040587 FRAME: 0624 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNMENT | 044811 | /0598 | |
Nov 14 2017 | Credit Suisse AG, Cayman Islands Branch | QUEST SOFTWARE INC F K A DELL SOFTWARE INC | CORRECTIVE ASSIGNMENT TO CORRECT THE ASSIGNEE PREVIOUSLY RECORDED AT REEL: 040587 FRAME: 0624 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNMENT | 044811 | /0598 | |
May 18 2018 | SONICWALL US HOLDINGS INC | UBS AG, STAMFORD BRANCH, AS COLLATERAL AGENT | FIRST LIEN PATENT SECURITY AGREEMENT | 046321 | /0414 | |
May 18 2018 | CREDIT SUISSE AG, CAYMAN ISLANDS BRANCH, AS COLLATERAL AGENT | Aventail LLC | RELEASE OF FIRST LIEN SECURITY INTEREST IN PATENTS RECORDED AT R F 040581 0850 | 046211 | /0735 | |
May 18 2018 | CREDIT SUISSE AG, CAYMAN ISLANDS BRANCH, AS COLLATERAL AGENT | QUEST SOFTWARE INC F K A DELL SOFTWARE INC | RELEASE OF FIRST LIEN SECURITY INTEREST IN PATENTS RECORDED AT R F 040581 0850 | 046211 | /0735 | |
May 18 2018 | SONICWALL US HOLDINGS INC | UBS AG, STAMFORD BRANCH, AS COLLATERAL AGENT | SECOND LIEN PATENT SECURITY AGREEMENT | 046321 | /0393 |
Date | Maintenance Fee Events |
Apr 21 2017 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Jul 19 2021 | REM: Maintenance Fee Reminder Mailed. |
Nov 24 2021 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Nov 24 2021 | M1555: 7.5 yr surcharge - late pmt w/in 6 mo, Large Entity. |
Date | Maintenance Schedule |
Nov 26 2016 | 4 years fee payment window open |
May 26 2017 | 6 months grace period start (w surcharge) |
Nov 26 2017 | patent expiry (for year 4) |
Nov 26 2019 | 2 years to revive unintentionally abandoned end. (for year 4) |
Nov 26 2020 | 8 years fee payment window open |
May 26 2021 | 6 months grace period start (w surcharge) |
Nov 26 2021 | patent expiry (for year 8) |
Nov 26 2023 | 2 years to revive unintentionally abandoned end. (for year 8) |
Nov 26 2024 | 12 years fee payment window open |
May 26 2025 | 6 months grace period start (w surcharge) |
Nov 26 2025 | patent expiry (for year 12) |
Nov 26 2027 | 2 years to revive unintentionally abandoned end. (for year 12) |