Embodiments describe an approach for improving path selection in multi-virtual input/output sever environment. Embodiments, send one or more inquiry commands to one or more virtual input/output server's (VIOSs). Further, embodiments, receive one or more server-side cache properties from the one or more VIOSs. Additionally, embodiments, determine one or more preferred paths based on the one or more server-side cache properties, and perform one or more input/output's (I/O's) on the one or more preferred paths.
|
1. A method for improving path selection in multi-virtual input/output server environment, the method comprising:
receiving, by the one or more processors, one or more server-side cache properties from one or more virtual input/output server's (VIOSs);
determining, by the one or more processors, one or more preferred paths based on the one or more server-side cache properties;
calculating, by the one or more processors, a value for the one or more server-side cache properties, wherein the one or more server-side cache properties have a predetermine value;
ranking, by the one or more processors, the one or more preferred paths from highest to lowest preference based on the calculated value of the one or more server-side cache properties; and
outputting, by the more or more processors, a list of the one of more server-side cache properties based on the ranking.
7. A method for improving path selection in multi-virtual input/output server environment, the method comprising:
marking, by the one or more processors, one or more detected failed paths;
marking, by the one or more processors, cache on the detected one or more failed paths;
sending, by the one or more processors, one or more cache invalidation requests to one or more virtual input/output servers (VIOSs);
determining, by the one or more processors, the one or more VIOS's have not responded with completion to the one or more invalidation requests;
detecting, by the one or more processors, when the failed paths are back online;
resending, by the one or more processors, the one or more cache invalidation requests when the failed paths are back online;
clearing, by the one or more processors, the marked cache and the cache invalidation requests on the one or more failed paths; and
failing over, by the one or more processors, the one or more I/O's onto the next preferred path.
13. A computer system for improving path selection in multi-virtual input/output server environment, the computer system comprising:
one or more computer processors;
one or more computer readable storage devices;
program instructions stored on the one or more computer readable storage devices for execution by at least one of the one or more computer processors, the stored program instructions comprising:
program instructions to receive one or more server-side cache properties from one or more virtual input/output server's (VIOSs);
program instructions to determine one or more preferred paths based on the one or more server-side cache properties;
program instructions to calculate a value for the one or more server-side cache properties, wherein the one or more server-side cache properties have a predetermine value;
program instructions to rank the one or more preferred paths from highest to lowest preference based on the calculated value of the one or more server-side cache properties; and
program instructions to output a list of the one of more server-side cache properties based on the ranking.
2. The method of
detecting, by the one or more processors, path failure in one or more paths; and
marking, by the one or more processors, the one or more detected failed paths.
3. The method of
marking, by the one or more processors, one or more cache invalidation in progress on the one or more failed paths; and
sending, by the one or more processors, one or more cache invalidation requests to the one or more VIOSs.
4. The method of
determining, by the one or more processors, the one or more VIOS's have responded with completion to the one or more invalidation requests.
5. The method of
clearing, by the one or more processors, one or more cache invalidation flag on the one or more failed paths; and
failing over, by the one or more processors, the one or more I/O's onto the next preferred path.
6. The method of
determining, by the one or more processors, the one or more VIOS's have not responded with completion to the one or more invalidation requests;
detecting, by the one or more processors, when the failed paths are back online;
resending, by the one or more processors, the one or more cache invalidation requests when the failed paths are back online;
clearing, by the one or more processors, one or more cache invalidation flag on the one or more failed paths; and
failing over, by the one or more processors, the one or more I/O's onto the next preferred path.
8. The method of
determining, by the one or more processors, the one or more VIOS's have responded with completion to the one or more invalidation requests.
9. The method of
clearing, by the one or more processors, one or more cache invalidation flag on the one or more failed paths; and
failing over, by the one or more processors, the one or more I/O's onto the next preferred path.
10. The method of
sending, by one or more processors, one or more inquiry commands to one or more virtual input/output server's (VIOSs); and
receiving, by the one or more processors, one or more server-side cache properties from the one or more VIOSs.
11. The method of
determining, by the one or more processors, one or more preferred paths based on the one or more server-side cache properties; and
performing, by the one or more processors, one or more input/outputs (I/Os) on the one or more preferred paths.
12. The method of
calculating, by the one or more processors, a value for the one or more server-side cache properties, wherein the one or more server-side cache properties have a predetermine value;
ranking, by the one or more processors, the one or more preferred paths from highest to lowest preference based on the calculated value of the one or more server-side cache properties; and
outputting, by the more or more processors, a list of the one of more server-side cache properties based on the ranking.
14. The computer system of
program instructions to detect path failure in one or more paths; and
program instructions to mark the one or more detected failed paths.
15. The computer system of
program instructions to mark one or more cache invalidation in progress on the one or more failed paths; and
program instructions to send one or more cache invalidation requests to the one or more VIOSs.
16. The computer system of
program instructions to determine the one or more VIOS's have responded with completion to the one or more invalidation requests
program instructions to clear one or more cache invalidation flag on the one or more failed paths; and
program instructions to fail over the one or more I/O's onto the next preferred path.
17. The computer system of
program instructions to determine the one or more VIOS's have not responded with completion to the one or more invalidation requests;
program instructions to detect when the failed paths are back online;
program instructions to resend the one or more cache invalidation requests when the failed paths are back online;
program instructions to clear one or more cache invalidation flag on the one or more failed paths; and
program instructions to fail over the one or more I/O's onto the next preferred path.
18. The method of
19. The method of
20. The computer system of
|
The present invention relates generally to the field of storage virtualization, and more particularly to managing the storage path between virtual disks and client logical partitions (LPARs) based on identified cache properties.
Server-side caching software provides the capability to cache storage data of target disks from a Storage Area Network (SAN) to faster flash disks (e.g., solid-state disk (SSD), non-volatile memory express (NVMe), etc.) on a server. Based on storage data access patterns of an application, a caching algorithm identifies the most frequently accessed data, i.e., “hot data,” and caches the hot data on a cache disk. All future read requests for the hot data are served from the cached data on the cache disk, improving application read performance. Further, the caching software monitors write operations and invalidates the cache before write data is sent to the SAN.
Embodiments of the present invention disclose a method, a system, and a computer program product for improving path selection in multi-virtual input/output server environment. Embodiments of the present invention discloses sending, by one or more processors, one or more inquiry commands to one or more virtual input/output server's (VIOSs). Receiving, by said one or more processors, one or more server-side cache properties from said one or more VIOSs. Determining, by said one or more processors, one or more preferred paths based on said one or more server-side cache properties, and performing, by said one or more processors, one or more input/outputs (I/Os) on said one or more preferred paths.
Embodiments of the present invention disclose a method, a system, and a computer program product for improving path selection in multi-virtual input/output server environment. Embodiments of the present invention discloses detecting, by said one or more processors, path failure in one or more paths. Marking, by said one or more processors, said one or more detected failed paths. Marking, by said one or more processors, said cache on said detected one or more failed paths. Sending, by said one or more processors, one or more cache invalidation requests to one or more virtual input/output servers (VIOSs). Clearing, by said one or more processors, said marked cache and said cache invalidation requests on said one or more failed paths, and failing over, by said one or more processors, said one or more I/O's onto said next preferred path.
In a virtualized environment (e.g., a power system virtualized environment (PSVE)), a Virtual Input/Output Server (VIOS) is responsible for storage virtualization to client logical partitions (LPARs), wherein the storage can be virtualized to a client through multiple VIOSs creating a multipath high-availability environment. For client virtual disks, a VIOS emulates small computer system interface (SCSI) targets and supports SCSI primary commands such as, but not limited to, inquiry, mode sense, mode select, etc. For example, a response for these commands is prepared by a VIOS. In a PSVE, the VIOS can implement server-side caching for virtualized disks that are mapped to client LPARs. If server-side caching is enabled for a virtual disk, then the VIOS will use a flash device to cache “hot” data of the virtual disk. However, not all VIOSs in a high availability multipath environment can have server-side caching enabled. It is important to note that server-side cache properties and cache properties have the same meaning and can be used interchangeably.
Currently, there is no method to identify cache properties for a virtualized disk such as, but not limited to: whether cache is enabled for one or more virtualized disks, the amount of cache available for a virtualized disk, latency of one or more caching disks, the bandwidth of the one or more cache disk, etc. In embodiments of the present invention, on the server-side cache environment, when a path fails over from one VIOS to another VIOS, the cache can be invalidated at the first VIOS because the data cached at the first VIOS can be updated through the fail over path. Embodiments of the present invention can identify one or more preferable paths between one or more VIOSs that have a server-side cache, do not have a server-side cache, and/or a less preferable cache.
Embodiments of the present invention can retrieve server-side cache properties of a virtualized disk from a VIOS, these embodiments can give one or more clients the flexibility to determine one or more preferred input/output (IO) paths and also provide a method to invalidate the cache at the failed path's VIOS. For example, a client retrieves server-side cache properties from a VIOS on each path by sending inquiry commands to vendor specific inquiry pages. It should be noted that a client can be computer hardware or software that accesses a service made available by a server.
In various embodiments of the present invention, a VIOS SCSI target emulator supports a vendor specific inquiry page for a virtualized disk. This vendor specific inquiry page comprises server-side cache properties such as, but not limited to: an identity of an enabled cache for one or more virtualized disks, an amount of cache available to a virtualized disk, latency of one or more caching disks, the bandwidth of the one or more cache disks, etc. In various embodiments, while configuring the path, one or more clients send one or more inquiry commands to one or more VIOSs to get one or more vendor specific inquiry pages from one or more VIOS SCSIs and save the associated server-side cache properties in the metadata of one or more identified paths. In various embodiments, after configuring the paths, the one or more clients determine one or more preferred paths based on the cache properties. For example, if caching is enabled for a cache disk at only one VIOS, then that path is preferred over the other paths that do not have enabled cache and/or paths that have less preferable cache. In another example, if more than one VIOS has caching enabled for a cache disk, the client can choose the path that gives better performance based on the cache properties. In some embodiments, the one or more clients determine one or more preferred paths based on the server-side cache properties saved in the metadata of the one or more identified paths. In various embodiments, the cache stored on the metadata of a path corresponds to one particular path (e.g., the path the cache corresponds to).
However, when the preferred path fails, the one or more clients fail-over to the next preferred VIOS, wherein the one or more clients will start using the preferred path associated with the next preferred VIOS as the path for input/output (I/O) (e.g., read, write). In a particular embodiment, one or more clients can write to the same data which is cached on the failed VIOS. In this particular embodiment, the cache at the VIOS of the failed path needs to be invalidated. If the cache in the failed path's VIOS is not invalidated, there can be a data integrity issue.
Embodiments of the present invention solve the problem of data integrity issues in multi-VIOS environments when a path has failed and improves the art of data storage by: (i) enabling VIOS SCSI target emulators to support one or more vendor specific mode pages for cache invalidation; (ii) sending a cache invalidation request to one or more VIOSs on the failed path, whenever there is a path failure; (iii) marking the path as failed and marking the path as “cache invalidation in progress”; (iv) failing-over the I/O to the next preferred path; (vi) clearing a “cache invalidation in progress” flag if VIOS on the failed path responds to cache invalidation request; (vii) if VIOS on the failed path does not respond to cache invalidation request, marking the failed path as “cache invalidation needed” and sending a cache invalidation request again whenever the failed path comes online; (viii) clearing one or more cache invalidation flags on one or more failed paths whenever it sends a cache invalidation request and marking the path as “cache invalidation in progress”; (ix) clearing one or more cache invalidation flags; and/or (x) selecting one or more paths not marked “cache invalidation needed” or “cache invalidation in progress” for I/O.
In various embodiments, computing device 110 can be, but is not limited to, a standalone device, a server, a laptop computer, a tablet computer, a netbook computer, a personal computer (PC), a smart phone, a desktop computer, a smart television, a smart watch, a radio, a stereo system, a cloud based service (e.g., a cognitive cloud based service), and/or any programmable electronic computing device capable of communicating with various components and devices within distributed data processing environment 100, via network 130 or any combination therein. In general, computing device 110 is representative of any programmable mobile device or a combination of programmable mobile devices capable of executing machine-readable program instructions and communicating with users of other mobile devices via network 130 and/or capable of executing machine-readable program instructions and communicating with server computer 120.
In other embodiments, computing device 110 can represent any programmable electronic computing device or combination of programmable electronic computing devices capable of executing machine readable program instructions, manipulating executable machine-readable instructions, and communicating with server computer 120 and other computing devices (not shown) within distributed data processing environment 100 via a network, such as network 130. Computing device 110 can include an instance of user interface 106. Computing device 110 and user interface 106 can allow a user to interact with path enhancing component (PEC) 122 on server computer 120 in various ways, such as sending program instructions, receiving messages, sending data, inputting data, editing data, collecting data, and/or receiving data. In various embodiments, not depicted in
User interface (UI) 106 provides an interface to PEC 122 on server computer 120 via network 130 for a user of computing device 110. In one embodiment, UI 106 can be a graphical user interface (GUI) or a web user interface (WUI) and can display text, documents, web browser windows, user options, application interfaces, and instructions for operation. UI 106 can include information (such as graphic, text, and sound) a program presents to a user and control sequences the user employs to control the program. In another embodiment, UI 106 can be mobile application software providing an interface between a user of computing device 110 and server computer 120. Mobile application software, or an “app,” can be a computer program designed to run on smart phones, tablet computers and other mobile devices. In an embodiment, UI 106 can enable the user of computing device 110 to send data, input data, edit data (annotations), collect data and/or receive data.
Server computer 120 can be a standalone computing device, a management server, a web server, a mobile computing device, one or more client servers, or any other electronic device or computing system capable of receiving, sending, and processing data. In other embodiments, server computer 120 can represent a server computing system utilizing multiple computers such as, but not limited to, a server system, such as in a cloud computing environment. In another embodiment, server computer 120 can be a laptop computer, a tablet computer, a netbook computer, a personal computer (PC), a desktop computer, a personal digital assistant (PDA), a smart phone, or any other programmable electronic device capable of communicating with computing device 110, shared storage 124, and other computing devices (not shown) within distributed data processing environment 100 via network 130. In another embodiment, server computer 120 represents a computing system utilizing clustered computers and components (e.g., database server computers, application server computers, etc.) that act as a single pool of seamless resources when accessed within distributed data processing environment 100. Server computer 120 can include internal and external hardware components, as depicted, and described in further detail with respect to
Each of shared storage 124 and local storage 108 can be a data/knowledge repository and/or a database that can be written and/or read by one or a combination of PEC 122, server computer 120 and computing device 110. In the depicted embodiment, shared storage 124 resides in distributed data processing environment 100 and local storage 108 resides on computing device 110. In another embodiment, shared storage 124 and/or local storage 108 can reside elsewhere within distributed data processing environment 100, provided that each can access and is accessible by computing device 110, server computer 120. Shared storage 124 and/or local storage 108 can be implemented with any type of storage device capable of storing data and configuration files that can be accessed and utilized by server computer 120, such as a database server, a hard disk drive, or a flash memory. In other embodiments, shared storage 124 and/or local storage 108 can be hard drives, memory cards, computer output to laser disc (cold storage), and/or any form of data storage known in the art.
In some embodiments, shared storage 124 and/or local storage 108 can be one or more cloud storage systems and/or databases linked to a cloud network. In various embodiments, PEC 122 can search, identify, match, and/or retrieve historic information of cache properties, preferred paths, and/or failed paths that have been previously identified and/or stored on a knowledge repository (e.g., shared storage 124 and/or local storage 108). For example, PEC 122 will search and/or store the features/symptoms related to a problem (e.g., failed paths, marked paths, cache properties, etc.) on shared storage 124, in which PEC 122 can access at a later time in order to determine and/or configure one or more preferred paths. In various embodiments, the knowledge repository enables PEC 122 to be self-learning and enables PEC 122 to utilize historic data to solve an identified issue (e.g., determining if the cache at the VIOS of failed path needs to be invalidated) and/or provide a solution to the identified issue (e.g., selecting a preferred path).
In various embodiments, PEC 122 can be on one or more server computers 120 (e.g., clients/client computers). In other embodiments, PEC 122 can be instructed by one or more client computers. In various embodiments, PEC 122 can instruct and/or enable a VIOS SCSI target emulator to support one or more vendor specific inquiry pages for a virtualized disk. In various embodiments, PEC 122 can send one or more inquiry commands for one or more vendor specific pages to one or more VIOSs to retrieve server-side cache properties. Vendor specific inquiry pages comprise server-side cache properties such as: the enabled cache for one or more virtualized disks, the amount of cache available for use by a virtualized disk, latency of one or more caching disks, the bandwidth of the one or more caching disks, etc. In various embodiments, while configuring the path, PEC 122 can send one or more inquiry commands, via one or more clients, to retrieve one or more vendor specific inquiry pages. In various embodiments, PEC 122 can store the server-side cache properties of the one or more retrieved vendor specific inquiry pages in the metadata of the one or more paths associated with the server-side cache properties (e.g., associated path(s)).
In some embodiments, the server-side cache properties and/or metadata can be stored on local storage 108 and/or shared storage 124. In various embodiments, PEC 122 configures one or more paths based on the retrieved server-side cache properties of the one or more vendor specific inquiry pages and/or metadata of one or more associated paths. In various embodiments, via one or more clients, PEC 122 can determine one or more preferred paths based on the retrieved server-side cache properties of the one or more vendor specific inquiry pages (e.g., server-side cache properties), metadata of one or more associated paths, and/or the one or more configured paths. For example, if caching is enabled for the cache disk at only one VIOS, then PEC 122 will prefer this particular path over the other paths whose cache is not enabled. In another example, if more than one VIOS has caching enabled for the cache disk, PEC 122 will choose the path that gives an enhanced performance based on the cache properties. In various embodiments, PEC 122 can rank the server-side cache properties from most effective to least effective based on the performance of the server-side cache properties and overall ability to improve the input/output (I/O) on a path. For example, the enabled cache for one or more virtualized disks can have the highest ranking/priority, the amount of cache available in a virtualized disk can have the second highest ranking/priority, latency of one or more caching disks can have the third highest ranking/priority, and the bandwidth of the one or more virtualized disks can have the fourth highest ranking/priority. In various embodiments, PEC 122 can utilize the ranking of server-side cache properties to determine the preferred path. For example, the path with the highest weighted ranking, based on the cache properties, is the most preferred path, the path with the second highest weighted ranking, based on the cache properties, is the second most preferred path, etc. In various embodiments, PEC 122 can output a descending list of server-side cache properties based on the weighted ranking of the server-side cache properties (e.g., highest weighted rank to lowest weighted rank).
In various embodiments, PEC 122 improves the art of data storage by solving the problem of data integrity issues when there is a failed path by invalidating the cached on the failed paths VIOS. In various embodiments, PEC 122 can solve the problem of data integrity issues of failed paths by instructing and/or enabling one or more VIOS SCSI target emulators to support one or more vendor specific mode page for a SCSI mode select command. In various embodiments, if PEC 122 detects, determines, and/or is alerted of a path fail, then PEC 122 can send one or more cache invalidation requests to one or more VIOSs on the failed path (e.g., send SCSI mode select command to a vendor specific mode page). In this particular embodiment, PEC 122 marks the path as failed and marks the cache of the failed path as invalidated. In various embodiments, PEC 122 fails-over the I/O to the next preferred path. In various embodiments, PEC 122 can instruct the VIOS on the failed path to invalidate the cache and submit a response to PEC 122 with completion. In various embodiments, PEC 122 clears at least one “cache invalidation in progress” flag. In some embodiments, if one or more VIOSs on the failed path do not respond with completion within a predetermined timeout period, PEC 122 can mark the path as “cache invalidation needed” and send a cache invalidation request again whenever the failed path comes online. For example, in a multipath input/output (MPIO) with health-check enabled, a device driver or PEC 122 sends a health-check command (inquiry/tur), periodically, and if the command succeeds the path is marked as online).
In various embodiments, PEC 122 can clear one or more cache invalidation flags on one or more paths whenever PEC 122 sends a cache invalidation request and marks the path as “cache invalidation in progress.” In various embodiments, if cache invalidation is successful, then PEC 122 can clear one or more cache invalidation flags. In various embodiments, PEC 122 can select unmarked paths for I/O. In various embodiments, PEC 122 can return server-side caching properties for an inquiry command by one or more VIOSs; determine preferred paths based on the server-side cache properties; enable a SCSI Mode Select command to invalidate cache of a virtualized disk at a failed preferred VIOS; and/or determine and govern the use of one or more failed paths by marking the failed paths and avoiding the failed paths, even when the failed paths come back online, until the cache is invalidated on the one or more VIOSs in order to avoid data integrity issues.
In one particular embodiment, as depicted in
MPIO subsystem 140 is a device driver designed to support a multipath I/O configuration. In various embodiments, PEC 122 can reside on MPIO subsystem 140; however, in other embodiments, not depicted in
Cache disk 125 and cache disk 133 are mechanisms for improving the time it takes to read from or write to a hard disk. Cache disk 125 and cache disk 133 can be a part of the hard disk. Further, cache disk 125 and cache disk 133 can also be a specified portion of random access memory (RAM). In general, cache disks (e.g., cache disk 125 and cache disk 133) can hold data that has recently been read and, in some cases, adjacent data areas that are likely to be accessed next. Cache engine 123 and cache engine 131 can be dedicated network servers or services acting as a server that save Web pages or other Internet content locally by placing previously requested information in temporary storage, i.e., cache. Cache engine 123 and cache engine 131 can both speed up access to data and reduce demand on an enterprise's bandwidth. Cache engine 123 and cache engine 131 can also enable users to access content offline, including media files or other documents. Virtual disk 137 can emulate an actual disk storage device, such as an optical disc drive, a floppy disk drive, or a hard disk drive. To other programs, a virtual drive looks and behaves like an actual physical device. Virtual disk 137 can be in any of the following forms: a disk image, a computer file that contains the exact data structure of an actual storage device; a logical disk (also known as vdisk), an array of two or more actual drives that cooperatively act like a single device; and/or a random-access memory (RAM) disk, which stores its data in RAM instead of on a storage device. Hypervisor 136 can be computer software, firmware, or hardware that creates and runs virtual machines. Hypervisor 136 can run one or more virtual machines and present guest operating systems with a virtual operating platform and manage the execution of the guest operating systems.
In one particular embodiment, as depicted in
In this particular embodiment, PEC 122 evaluates both paths by reading the server-side cache properties of each path and calculating the path rankings based on the respective cache properties. In this particular embodiment, PEC 122 determines and selects path 102 as the preferred I/O path because the cache is enabled for VIOS 128, and because of lower latency in cache disk 133; whereas, the cache was not enabled in VIOS 126 or the latency of cache disk 125 was far greater than that of cache disk 133.
In step 202, PEC 122 can send an inquiry command to a VIOS. In various embodiments, PEC 122 can send one or more inquiry commands to one or more VIOSs (e.g., VIOS 126 and/or VIOS 128) requesting one or more vendor specific inquiry pages to receive server-side cache properties. For example, PEC 122 can send an inquiry command to VIOS 126 and/or VIOS 128.
In step 204, PEC 122 receives server-side cache properties from the VIOS. In various embodiments, PEC 122 can receive one or more server-side cache properties from one or more VIOSs. For example, PEC 122 can receive one or more server-side cache properties from VIOS 126 and/or VIOS 128.
In step 206, PEC 122 determines a preferred path based on the server-side cache properties. In various embodiments, PEC 122 can determine one or more preferred paths based on the received server-side cache properties submitted by the one or more VIOSs. For example, VIOS 126 and VIOS 128 each send their associated server-side cache properties to PEC 122. More specifically, VIOS 126 sends the service-side cache properties associated with path 101 and VIOS 128 sends the server-side cache properties associated with path 102 to PEC 122. In various embodiments, each server-side cache property can have a predetermined ranking, priority number, and/or value, which PEC 122 can utilize to calculate and determine one or more preferred paths. For example, the enabled cache for one or more virtualized disks can have the highest value, the amount of cache available in a virtualized disk can have the second highest value, latency of one or more caching disks can have the third highest value, and the bandwidth of the one or more virtualized disks can have the fourth highest value. In this particular example, the path with the highest weighted calculated value, based on the cache properties, is the preferred path. In other embodiments, PEC 122 can output a list of preferred paths. For example, the path with the highest weighted calculated value is preferred path 1, the path with the second highest calculated value is preferred path 2, etc.
In step 208, PEC 122 performs I/O on the preferred path. In various embodiments, PEC 122 can perform I/O on the one or more preferred paths.
In step 302, PEC 122 detects a path failure. In various embodiments, PEC 122 can determine that one or more I/O paths have failed. In various embodiments, PEC 122 can determine if one or more paths have failed by monitoring the one or more paths, being alerted that the one or more paths have failed, and/or detecting that the one or more paths have failed. For example, PEC 122 receiving a failed data transfer notification and/or system error.
In step 304, PEC marks the path as failed. In various embodiments, PEC 122 can mark the one or more detected failed paths as a failed path.
In step 306, PEC 122 marks the failed path as “cache invalidation in progress.” In various embodiments, PEC 122 can mark one or more detected and/or marked failed paths as “cache invalidation in progress.”
In step 308, PEC 122 sends a cache invalidation request to the VIOS on the failed path. In various embodiments, PEC 122 can send one or more cache invalidation requests to one or more VIOSs on the one or more failed paths. For example, if PEC 122 detects path 101 has failed and PEC 122 marks the path as failed (e.g., marking the cache on path 101) then PEC 122 sends a cache invalidation request to VIOS 126 requesting VIOS 126 to invalidate the cache on the marked path to avoid data integrity issues. In another example, PEC 122 can clear the cache on the marked path to avoid data integrity issues. In some embodiments, one or more VIOSs can clear the cache associated with the cache invalidation request. In various embodiments, PEC 122 can use SCSI in-band communication to send one or more cache invalidation requests from a guest operating system (OS) to one or more VIOSs on the one or more failed paths.
In decision step 310, PEC 122 determines if a response has been received. In various embodiments, PEC 122 can determine if one or more responses (e. g., completion response(s)) to the one or more cache invalidation request have be received from the one or more VIOSs. In some embodiments, if one or more VIOSs on the one or more failed paths do not respond with completion within a predetermined timeout period (“No” branch, decision step 310), PEC 122 can mark the path as “cache invalidation needed” and proceed to step 312. In step 312, PEC 122 can resend a cache invalidation request to the failed path. In various embodiments, PEC 122 can resend one or more cache invalidation requests to one or more VIOSs on one or more failed path. In various embodiments, PEC 122 can detect when the one or more failed paths are online and can resend the cache invalidation request to the one or more failed paths (e.g., VIOSs) that have not responded once the one or more paths are back online. For example, in a multipath input/output (MPIO) subsystem 140 of
In step 314, PEC 122 clears the cache invalidation flag (e.g., “cache invalidation in progress”) on the failed path. In various embodiments, PEC 122 can clear the cache invalidation flag on the one or more failed paths. In various embodiments, not depicted in
In step 316, PEC performs an I/O fail-over to the next preferred path. In various embodiments, PEC 122 can fail-over one or more I/Os to the one or more next preferred paths. For example, if a path fails then one or more clients will search for available paths to the one or more virtual disks previously accesses by the failed path. In this particular example, PEC 122 looks for one or more preferred paths using an inquiry command, based on the server-side cache properties of VIOS 126 that were collected when the paths were configured. In another example, PEC 122 can fail-over I/O to the next preferred path based on a ranking list (e.g., the path with the next highest calculated value). In some embodiments, not depicted in
Memory 402 and persistent storage 405 are computer readable storage media. In this embodiment, memory 402 includes random access memory (RAM). In general, memory 402 can include any suitable volatile or non-volatile computer readable storage media. Cache 403 is a fast memory that enhances the performance of processors 401 by holding recently accessed data, and data near recently accessed data, from memory 402.
Program instructions and data used to practice embodiments of the present invention can be stored in persistent storage 405 and in memory 402 for execution by one or more of the respective processors 401 via cache 403. In an embodiment, persistent storage 405 includes a magnetic hard disk drive. Alternatively, or in addition to a magnetic hard disk drive, persistent storage 405 can include a solid-state hard drive, a semiconductor storage device, read-only memory (ROM), erasable programmable read-only memory (EPROM), flash memory, or any other computer readable storage media that is capable of storing program instructions or digital information.
The media used by persistent storage 405 can also be removable. For example, a removable hard drive can be used for persistent storage 405. Other examples include optical and magnetic disks, thumb drives, and smart cards that are inserted into a drive for transfer onto another computer readable storage medium that is also part of persistent storage 405.
Communications unit 407, in these examples, provides for communications with other data processing systems or devices. In these examples, communications unit 407 includes one or more network interface cards. Communications unit 407 can provide communications through the use of either or both physical and wireless communications links. Program instructions and data used to practice embodiments of the present invention can be downloaded to persistent storage 405 through communications unit 407.
I/O interface(s) 406 enables for input and output of data with other devices that can be connected to each computer system. For example, I/O interface 406 can provide a connection to external devices 408 such as a keyboard, keypad, a touch screen, and/or some other suitable input device. External devices 408 can also include portable computer readable storage media such as, for example, thumb drives, portable optical or magnetic disks, and memory cards. Software and data used to practice embodiments of the present invention can be stored on such portable computer readable storage media and can be loaded onto persistent storage 405 via I/O interface(s) 406. I/O interface(s) 406 also connect to display 409.
Display 409 provides a mechanism to display data to a user and can be, for example, a computer monitor.
The programs described herein are identified based upon the application for which they are implemented in a specific embodiment of the invention. However, it should be appreciated that any particular program nomenclature herein is used merely for convenience, and thus the invention should not be limited to use solely in any specific application identified and/or implied by such nomenclature.
The present invention can be a system, a method, and/or a computer program product. The computer program product can include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention.
The computer readable storage medium can be any tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium can be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network can comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
Computer readable program instructions for carrying out operations of the present invention can be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions can execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer can be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection can be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) can execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
These computer readable program instructions can be provided to a processor of a general-purpose computer, a special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions can also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
The computer readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams can represent a module, a segment, or a portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the blocks can occur out of the order noted in the Figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks can sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
The descriptions of the various embodiments of the present invention have been presented for purposes of illustration but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the invention. The terminology used herein was chosen to best explain the principles of the embodiment, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.
Rathi, Sanket, Thatikonda, Vamshikrishna, Anumula, Venkata Naga Seshaiah
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
6453392, | Nov 10 1998 | International Business Machines Corporation | Method of and apparatus for sharing dedicated devices between virtual machine guests |
6904477, | Apr 13 2001 | Oracle America, Inc | Virtual host controller interface with multipath input/output |
6973529, | Sep 28 2001 | International Business Machines Corporation | Multiple I/O path selection among disparate channel paths |
7793139, | Dec 12 2006 | International Business Machines Corporation | Partial link-down status for virtual Ethernet adapters |
7930416, | May 11 1999 | June Ray Limited | Load balancing technique implemented in a data network device utilizing a data cache |
8019966, | Jun 09 2008 | International Business Machines Corporation | Data sharing utilizing virtual memory having a shared paging space |
8041987, | Nov 10 2008 | International Business Machines Corporation | Dynamic physical and virtual multipath I/O |
8281306, | Jun 06 2008 | International Business Machines Corporation | Managing assignment of partition services to virtual input/output adapters |
8458413, | Oct 27 2010 | International Business Machines Corporation | Supporting virtual input/output (I/O) server (VIOS) active memory sharing in a cluster environment |
8495412, | Sep 10 2010 | International Business Machines Corporation | Autonomous propagation of virtual input/output (VIO) operation(s) to second VIO server (VIOS) due to a detected error condition at a first VIOS |
8521922, | Jan 10 2011 | ServiceNow, Inc; International Business Machines Corporation | Input/output (I/O) path selection based on workload types |
8902736, | Apr 07 2011 | Veritas Technologies LLC | Selecting an alternative path for an input/output request |
9262351, | Mar 27 2014 | International Business Machines Corporation | Inter-adapter cooperation for multipath input/output systems |
9800459, | Apr 01 2015 | EMC IP HOLDING COMPANY LLC | Dynamic creation, deletion, and management of SCSI target virtual endpoints |
20120066543, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Nov 21 2018 | THATIKONDA, VAMSHIKRISHNA | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 047624 | /0552 | |
Nov 21 2018 | ANUMULA, VENKATA NAGA SESHAIAH | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 047624 | /0552 | |
Nov 22 2018 | RATHI, SANKET | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 047624 | /0552 | |
Nov 29 2018 | International Business Machines Corporation | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
Nov 29 2018 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Jul 01 2024 | REM: Maintenance Fee Reminder Mailed. |
Date | Maintenance Schedule |
Nov 10 2023 | 4 years fee payment window open |
May 10 2024 | 6 months grace period start (w surcharge) |
Nov 10 2024 | patent expiry (for year 4) |
Nov 10 2026 | 2 years to revive unintentionally abandoned end. (for year 4) |
Nov 10 2027 | 8 years fee payment window open |
May 10 2028 | 6 months grace period start (w surcharge) |
Nov 10 2028 | patent expiry (for year 8) |
Nov 10 2030 | 2 years to revive unintentionally abandoned end. (for year 8) |
Nov 10 2031 | 12 years fee payment window open |
May 10 2032 | 6 months grace period start (w surcharge) |
Nov 10 2032 | patent expiry (for year 12) |
Nov 10 2034 | 2 years to revive unintentionally abandoned end. (for year 12) |