In one aspect, transitioning among replication modes in a storage system is provided. An aspect includes during an active replication session in which a first replication mode is performed, validating a precondition for transitioning to a second replication mode that is different from the first mode of replication. An aspect further includes suspending data flow for the active session, establishing a base snap set pair for a mode transition cycle, and preparing for the transitioning including updating, for the mode transition cycle, the base snap set pair on both a source system and a target system of the storage system, and creating a snap set against an active snap set on the target system as a mode transition cycle snap set. An aspect further includes implementing the mode transition cycle, and entering a replication session, upon completion of the mode transition cycle, for the second mode of replication.
|
1. A method for seamless transitioning replication modes in a storage system, the method comprising:
during an active replication session in which a first mode of replication is performed at the storage system:
validating a precondition for transitioning to a second mode of replication, wherein the second mode of replication is different than the first mode of replication;
suspending data flow for the active replication session;
establishing a base snap set pair for a mode transition cycle;
preparing for the transitioning from the first replication mode to the second replication mode including updating, for the mode transition cycle, the base snap set pair on both a source system and a target system of the storage system, and creating a snap set against an active snap set on the target system as a mode transition cycle snap set;
implementing the mode transition cycle; and
entering a replication session, upon completion of the mode transition cycle, for the second mode of replication.
15. A computer program product for seamless transitioning replication modes in a storage system, the computer program product embodied on a non-transitory computer readable medium and including instructions that, when executed by a computer causes the computer to perform operations comprising:
during an active replication session in which a first mode of replication is performed at the storage system:
validating a precondition for transitioning to a second mode of replication, wherein the second mode of replication is different than the first mode of replication;
suspending data flow for the active replication session;
establishing a base snap set pair for a mode transition cycle;
preparing for the transitioning from the first replication mode to the second replication mode including updating, for the mode transition cycle, the base snap set pair on both a source system and a target system of the storage system, and creating a snap set against an active snap set on the target system as a mode transition cycle snap set;
implementing the mode transition cycle; and
entering a replication session, upon completion of the mode transition cycle, for the second mode of replication.
8. A system for seamless transitioning replication modes in a storage system, the system comprising:
a memory comprising computer-executable instructions; and
a processor executing the computer-executable instructions, the computer-executable instructions when executed by the processor cause the processor to perform operations comprising:
during an active replication session in which a first mode of replication is performed at the storage system:
validating a precondition for transitioning to a second mode of replication, wherein the second mode of replication is different than the first mode of replication;
suspending data flow for the active replication session;
establishing a base snap set pair for a mode transition cycle;
preparing for the transitioning from the first replication mode to the second replication mode including updating, for the mode transition cycle, the base snap set pair on both a source system and a target system of the storage system, and creating a snap set against an active snap set on the target system as a mode transition cycle snap set;
implementing the mode transition cycle; and
entering a replication session, upon completion of the mode transition cycle, for the second mode of replication.
2. The method of
for transitioning to async replication mode as the second mode of replication, determining availability of a recovery snap set pair for use as the base snap set pair during the transitioning; and
for transitioning to sync or metro replication mode as the second mode of replication, determining availability of a synchronized snap set pair for use as the base snap set pair during the transitioning.
3. The method of
when the first mode of replication is async mode, stopping IO mirroring in a data path for the active replication session; and
when the first mode of replication is sync or metro mode, waiting for the active replication session to complete.
4. The method of
wherein establishing the base snap set pair for the mode transition cycle includes:
when the first mode of replication is async mode, the base snap set pair comprises a most recent synchronized snap set pair; and
when the first mode of replication is sync or metro mode, the base snap set pair comprises a most recent recovery snap set pair, and
wherein preparing for the transitioning from the first replication mode to the second replication mode includes creating a snap set against an active snap set on the target system as a mode transition cycle snap set.
5. The method of
when the second mode of replication is async replication mode, setting recovery data replication flow from the source system to the target system; and
when the second mode of replication is sync or metro replication, enabling IO mirroring on a consistency group at the source system.
6. The method of
when the second mode of replication is sync or metro replication mode, transferring a delta between the consistency group at the source system and a most recent base snap set from the mode transition cycle to the mode transition cycle snap set at the target system; and
when the second mode of replication is async replication, creating a new transition cycle snap set against the consistency group on the source system, and transferring a delta between the new transition cycle snap set and a most recent active snap set to the mode transition cycle snap set.
7. The method of
when the second mode of replication is sync replication mode, assigning the active snap set to the transition cycle snap set and continuing IO mirroring from the consistency group to a most recent active snap set at the target system;
when the second mode of replication is metro replication mode, refreshing the consistency group to the transition cycle snap set, and initiating bi-directional IO mirroring; and
when the second mode of replication is async replication mode, initiating async replication cycle scheduling.
9. The system of
for transitioning to async replication mode as the second mode of replication, determining availability of a recovery snap set pair for use as the base snap set pair during the transitioning; and
for transitioning to sync or metro replication mode as the second mode of replication, determining availability of a synchronized snap set pair for use as the base snap set pair during the transitioning.
10. The system of
when the first mode of replication is async mode, stopping IO mirroring in a data path for the active replication session; and
when the first mode of replication is sync or metro mode, waiting for the active replication session to complete.
11. The system of
wherein establishing the base snap set pair for the mode transition cycle includes:
when the first mode of replication is async mode, the base snap set pair comprises a most recent synchronized snap set pair, and
when the first mode of replication is sync or metro mode, the base snap set pair comprises a most recent recovery snap set pair, and
wherein preparing for the transitioning from the first replication mode to the second replication mode includes creating a snap set against an active snap set on the target system as a mode transition cycle snap set.
12. The system of
when the second mode of replication is async replication mode, setting recovery data replication flow from the source system to the target system; and
when the second mode of replication is sync or metro replication, enabling IO mirroring on a consistency group at the source system.
13. The system of
when the second mode of replication is sync or metro replication mode, transferring a delta between the consistency group at the source system and a most recent base snap set from the mode transition cycle to the mode transition cycle snap set at the target system; and
when the second mode of replication is async replication, creating a new transition cycle snap set against the consistency group on the source system, and transferring a delta between the new transition cycle snap set and a most recent active snap set to the mode transition cycle snap set.
14. The system of
when the second mode of replication is sync replication mode, assigning the active snap set to the transition cycle snap set and continuing IO mirroring from the consistency group to a most recent active snap set at the target system;
when the second mode of replication is metro replication mode, refreshing the consistency group to the transition cycle snap set, and initiating bi-directional IO mirroring; and
when the second mode of replication is async replication mode, initiating async replication cycle scheduling.
16. The computer program product of
for transitioning to async replication mode as the second mode of replication, determining availability of a recovery snap set pair for use as the base snap set pair during the transitioning; and
for transitioning to sync or metro replication mode as the second mode of replication, determining availability of a synchronized snap set pair for use as the base snap set pair during the transitioning.
17. The computer program product of
wherein establishing the base snap set pair for the mode transition cycle includes:
when the first mode of replication is async mode, the base snap set pair comprises a most recent synchronized snap set pair; and
when the first mode of replication is sync or metro mode, the base snap set pair comprises a most recent recovery snap set pair; and
wherein preparing for the transitioning from the first replication mode to the second replication mode includes creating a snap set against an active snap set on the target system as a mode transition cycle snap set.
18. The computer program product of
when the second mode of replication is async replication mode, setting recovery data replication flow from the source system to the target system; and
when the second mode of replication is sync or metro replication, enabling IO mirroring on a consistency group at the source system.
19. The computer program product of
when the second mode of replication is sync or metro replication mode, transferring a delta between the consistency group at the source system and a most recent base snap set from the mode transition cycle to the mode transition cycle snap set at the target system; and
when the second mode of replication is async replication, creating a new transition cycle snap set against the consistency group on the source system, and transferring a delta between the new transition cycle snap set and a most recent active snap set to the mode transition cycle snap set.
20. The computer program product of
when the second mode of replication is sync replication mode, assigning the active snap set to the transition cycle snap set and continuing IO mirroring from the consistency group to a most recent active snap set at the target system;
when the second mode of replication is metro replication mode, refreshing the consistency group to the transition cycle snap set, and initiating bi-directional IO mirroring; and
when the second mode of replication is async replication mode, initiating async replication cycle scheduling.
|
A replication session can be implemented using various replication modes, such as synchronous (sync) replication, asynchronous (async) replication, and active/active replication, which as also known as metro replication.
Sync replication is a business continuity process that mirrors data updates between two systems to prevent data loss and downtime. When sync replication is turned on for a production storage object, the system mirrors the data to a target system as part of handling write requests from an initiator, and only responds to the initiator after the writes have been persisted on both the source and target systems.
In asynchronous (async) replication, once data has been written to a primary storage or production location, additional write operations to the primary storage can be performed without having to wait for the secondary or target storage site to complete its write operations.
Active/Active replication (also known as “metro replication”) enables multiple database instances to support a common application. Synchronization among the databases is achieved through bidirectional replication capabilities.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described herein in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.
One aspect may provide a method for seamless transitioning among replication modes in a storage system. The method includes, during an active replication session in which a first mode of replication is performed at the storage system, validating a precondition for transitioning to a second mode of replication. The second mode of replication is different from the first mode of replication. The method also includes suspending data flow for the active replication session, establishing a base snap set pair for a mode transition cycle, and preparing for the transitioning from the first replication mode to the second replication mode including updating, for the mode transition cycle, the base snap set pair on both a source system and a target system of the storage system, and creating a snap set against an active snap set on the target system as a mode transition cycle snap set. The method further includes implementing the mode transition cycle, and entering a replication session, upon completion of the mode transition cycle, for the second mode of replication.
Another aspect may provide a system to provide seamless transitioning among replication modes in a storage system. The system includes a memory having computer-executable instructions. The system also includes a processor operated by a storage system. The processor executes the computer-executable instructions. When executed by the processor, the computer-executable instructions cause the processor to perform operations. The operations include, during an active replication session in which a first mode of replication is performed at the storage system, validating a precondition for transitioning to a second mode of replication. The second mode of replication is different from the first mode of replication. The operations also include suspending data flow for the active replication session, establishing a base snap set pair for a mode transition cycle, and preparing for the transitioning from the first replication mode to the second replication mode including updating, for the mode transition cycle, the base snap set pair on both a source system and a target system of the storage system, and creating a snap set against an active snap set on the target system as a mode transition cycle snap set. The operations further include implementing the mode transition cycle, and entering a replication session, upon completion of the mode transition cycle, for the second mode of replication.
Another aspect may provide a computer program product embodied on a non-transitory computer readable medium. The computer program product includes instructions that, when executed by a computer at a storage system, causes the computer to perform operations. The operations include, during an active replication session in which a first mode of replication is performed at the storage system, validating a precondition for transitioning to a second mode of replication. The second mode of replication is different from the first mode of replication. The operations also include suspending data flow for the active replication session, establishing a base snap set pair for a mode transition cycle, and preparing for the transitioning from the first replication mode to the second replication mode including updating, for the mode transition cycle, the base snap set pair on both a source system and a target system of the storage system, and creating a snap set against an active snap set on the target system as a mode transition cycle snap set. The operations further include implementing the mode transition cycle, and entering a replication session, upon completion of the mode transition cycle, for the second mode of replication.
Objects, aspects, features, and advantages of embodiments disclosed herein will become more fully apparent from the following detailed description, the appended claims, and the accompanying drawings in which like reference numerals identify similar or identical elements. Reference numerals that are introduced in the specification in association with a drawing figure may be repeated in one or more subsequent figures without additional description in the specification in order to provide context for other features. For clarity, not every element may be labeled in every figure. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating embodiments, principles, and concepts. The drawings are not meant to limit the scope of the claims included herewith.
Synchronous (sync), asynchronous (async), and metro replication modes each offers different levels of data protection and also replicates different objects to a remote or target cluster. For example, async replication replicates data changes in cycles with the cycle intervals determined by recovery point object (RPO) settings. Sync replication mirrors each data write operation of a production consistency group (CG) to an active snap set at a remote or target system. Metro replication intercepts application write operations to a production CG on both a source and target system of the storage system and mirrors the write operations to a peer production CG at a remote cluster.
Based on changing operational and performance requirements of a business or organization, there are instances in which switching among these modes can be advantageous to business operations. Each of these modes offers distinct advantages and disadvantages. For example, one advantage to sync replication is that there is typically no data loss during site disaster recovery, and no additional on disk data and metadata read operations, as incoming data gets mirrored to the remote site during the write operations. In addition, sync replication by design also offers reduced snap creation/deletion operations, which can reduce latencies in the replication process. On the other hand, when using sync replication a user may experience higher input/output (TO) latency or even IO timeouts if the system is overloaded, or when there is network congestion between a source and target site. In other words, while sync replication provides greater protection, it requires that the system have sufficient capacity and that the data update patterns are relatively consistent.
In async replication, there is little impact on the host IO latency as data is replicated by a background process after it has been written. Potentially, much less data could be replicated using techniques such as write aggregation, compression, and deduplication. Another advantage of async replication is that is allows more control over how the process uses system resources via RPOs and bandwidth limits' settings, etc. It also adds system load by periodically creating/deleting snap sets, and reading/scanning snap data content for data replication. Increases to RPO setting of async replication reduce burden and async replication snap creation/deletion/scan overhead. However, it also increases the replication lag time between the source and target systems. In addition, during disaster recovery, there may be increased lag times between the source and target systems, as the content on the target may not be up to date.
Advantages of metro replication include continuous operations; that is, applications can continue to run even when a server is lost or when a storage system is taken offline. In addition, metro replication enables easy and seamless storage migration, active/active stretch clusters, and active/passive stretch clusters in situations where application objects are not LU-aligned.
In view of the above-referenced advantages and disadvantages, switching among these replication modes can be desirable in order to provide optimal data protection as well as overall system performance. However, switching among them can be challenging as different modes require different preconditions to be enabled. In addition, without an effective framework, transitioning between modes could potentially cause up to six different switch flows, and each flow would require several different states to handle the transitions.
The embodiments described herein provide a way to support all combinations of mode switching, which can greatly simplify the development and testing efforts of replication mode transitions. A solution includes a framework that identifies generic functions common to all replication modes and adjusts for certain (custom) functions that are particular to each of the replication modes.
Before describing embodiments of the concepts, structures, and techniques sought to be protected herein, some terms are explained. The following description includes a number of terms for which the definitions are generally known in the art. However, the following glossary definitions are provided to clarify the subsequent description and may be helpful in understanding the specification and claims.
As used herein, the term “storage system” is intended to be broadly construed so as to encompass, for example, private or public cloud computing systems for storing data as well as systems for storing data comprising virtual infrastructure and those not comprising virtual infrastructure. As used herein, the terms “client,” “host,” and “user” refer, interchangeably, to any person, system, or other entity that uses a storage system to read/write data. In some embodiments, the term “storage device” may also refer to a storage array including multiple storage devices. In certain embodiments, a storage medium may refer to one or more storage mediums such as a hard drive, a combination of hard drives, flash storage, combinations of flash storage, combinations of hard drives, flash, and other storage devices, and other types and combinations of computer readable storage mediums including those yet to be conceived. A storage medium may also refer both physical and logical storage mediums and may include multiple level of virtual to physical mappings and may be or include an image or disk image. A storage medium may be computer-readable and may also be referred to herein as a computer-readable program medium.
In certain embodiments, the term “I/O request” or simply “I/O” or “TO” may be used to refer to an input or output request, such as a data read or data write request.
In certain embodiments, a storage device may refer to any non-volatile memory (NVM) device, including hard disk drives (HDDs), solid state drivers (SSDs), flash devices (e.g., NAND flash devices), and similar devices that may be accessed locally and/or remotely (e.g., via a storage attached network (SAN) (also referred to herein as storage array network (SAN)).
In certain embodiments, a storage array (sometimes referred to as a disk array) may refer to a data storage system that is used for block-based, file-based or object storage, where storage arrays can include, for example, dedicated storage hardware that contains spinning hard disk drives (HDDs), solid-state disk drives, and/or all-flash drives (e.g., the XtremIO all flash drive, available from DELL/EMC of Hopkinton Mass.). In certain embodiments, a data storage entity may be any one or more of a file system, object storage, a virtualized device, a logical unit, a logical unit number, a logical volume, a logical device, a physical device, and/or a storage medium.
In certain embodiments, a physical storage unit may be a physical entity, such as a disk or an array of disks, for storing data in storage locations that can be accessed by address, where physical storage unit is used interchangeably with physical volume. In certain embodiments, a data storage entity may be any one or more of a file system, object storage, a virtualized device, a logical unit, a logical unit number, a logical volume, a logical device, a physical device, and/or a storage medium.
In certain embodiments, a consistency group (CG) may refer to a technique for protecting data in a replication process. A consistency group ensures that all write operations to a production volume are also written to a corresponding copy or copies in write-order so that the copy can be relied on for use, if needed, instead of the production volume.
In certain embodiments, a snapshot may refer to differential representations of an image, i.e. the snapshot may have pointers to the original volume and may point to log volumes for changed locations. In certain embodiments, a snapshot may refer to differential representations of the state of a system. Snapshots may be combined into a snapshot array, which may represent different images over a time period or different states of a system over a time period.
In certain embodiments, a journal may be a record of write transactions (e.g., I/O data) issued to a storage system, which may be used to maintain a duplicate storage system, and to roll back the duplicate storage system to a previous point in time. In some embodiments, each entry in a journal contains, apart from the I/O data itself, I/O metadata that can include information such as a volume identifier (ID), the I/O block offset within the volume, the I/O length, and a time stamp of the I/O.
In certain embodiments, XtremIO, available from Dell EMC of Hopkinton, Mass.) is a type of content addressable storage array that uses all flash technology. Flash, as is understood, is a solid-state (SS) random access media type that can read any address range with no latency penalty, in comparison to a hard disk drive (HDD) which has physical moving components which require relocation when reading from different address ranges and thus significantly increasing the latency for random I/O data.
In certain embodiments, a data protection strategy that can be advantageous for use with computer systems, especially networked storage systems, is checkpointing. A checkpoint, as used herein, contains a consistent point in time image of an entire system, including configuration, logical volume mapping metadata, physical on disk layout metadata, and actual user data. In certain embodiments, a checkpoint preserves the state of a system at a given point in time by saving one or more snapshots of, for example, a file system, or an application at one or more points in time. A checkpoint can preserve a snapshot of an application's state, so that it can restart from that point in case of failure, which can be useful for long running applications that are executed in failure-prone computing systems. If a checkpoint is used, an application periodically writes large volumes of snapshot data to persistent storage in an attempt to capture its current state. Thus, if there is a failure, the application can recover by rolling-back its execution state to a previously saved checkpoint.
In certain embodiments, a “checkpoint” refers at least to an entity created by a checkpoint process, where the checkpoint process performs actions to preserve the state of an apparatus, system, or other entity (including software entities) at a particular time. Advantageously, a checkpoint includes information such as user data, the configuration of the apparatus, user metadata, and other information related to the internal state of the apparatus or system. For example, some storage systems (including XtremIO), in accordance with certain embodiments herein, also provide some kind of checkpoint feature, to provide an ability to preserve system state including user data and metadata at some defined point in time in order to restore this state after system malfunction or corruption. In certain embodiments, the checkpoint corresponds to a frozen, immutable re representation of the state of a system or apparatus at certain point in time, including user data, metadata, and the system configuration. In certain embodiments, the checkpoint is stored in a dedicated, reserved location within the system. In certain embodiments, the checkpoint is able to be created in an online, dynamic environment, where the checkpoint creation is transparent to entities having I/O interactions with the system.
For a file system, the accuracy and consistency of a file system is necessary to relate applications and data, so a checkpoint provides a way to provide periodic backup of file server state to allow system recovery in the event of faults or failures. When data corruption is detected, one of the checkpoints can be used for file system recovery. Similarly, a checkpoint, in a virtualization context, is a snapshot of the state of a virtual machine. Like a restore point in MICROSOFT WINDOWS operating systems, a checkpoint allows an administrator to restore an entity (e.g., a computer system, a file system, an application, a virtual machine, etc.) to a previous state. Checkpoints also can be used to create backups before conducting updates. Should an update fail or cause problems, an administrator can return the virtual machine to its state prior to the update. A recover action is used to return the system to the checkpoint state.
While vendor-specific terminology may be used herein to facilitate understanding, it is understood that the concepts, techniques, and structures sought to be protected herein are not limited to use with any specific commercial products. In addition, to ensure clarity in the disclosure, well-understood methods, procedures, circuits, components, and products are not described in detail herein.
The phrases, “such as,” “for example,” “e.g.,” “exemplary,” and variants thereof, are used herein to describe non-limiting embodiments and are used herein to mean “serving as an example, instance, or illustration.” Any embodiments herein described via these phrases and/or variants is not necessarily to be construed as preferred or advantageous over other embodiments and/or to exclude the incorporation of features from other embodiments. In addition, the word “optionally” is used herein to mean that a feature or process, etc., is provided in some embodiments and not provided in other embodiments.” Any particular embodiment of the invention may include a plurality of “optional” features unless such features conflict.
Turning now to
Source site 102 may include a host 104, a storage application 106, and data storage 108. In some embodiments, data storage 108 may include one or more storage volumes (e.g., V1), that operate as active, or production, volumes.
Host 104 may perform I/O operations on storage 108 (e.g., read data from and write data to storage 108). In some embodiments, the I/O operations may be intercepted by and controlled by the storage application 106. As changes are made to data stored on storage 108 via the I/O operations from host 104, or over time as storage system 100 operates, storage application 106 may perform data replication from the source site 102 to the target site 112 over a communication network 110.
In some embodiments, the communication network 110 may include internal (e.g., short distance) communication links (not shown) to transfer data between storage volumes for storing replicas 118 (also referred to herein as snap sets), such as an InfiniBand (IB) link or Fibre Channel (FC) link. In other embodiments, the communication link 110 may be a long-distance communication network of a storage area network (SAN), e.g., over an Ethernet or Internet (e.g., TCP/IP) link that may employ, for example, the iSCSI protocol.
In illustrative embodiments, storage system 100 may employ a snap set (or replication) mechanism to replicate data between source site 102 and target site 112. A snap set (or replica) may be created from data within storage 108 and transferred to the target site 112 during a data replication cycle using a data replication process, such as sync replication, async replication, and/or metro replication.
Data replication may be performed based on data replication policies that may define various settings for data recovery operations, shown as policy 114 in target site 112. For example, policy 114 may define a plurality of attributes, such as a frequency with which replicas are generated and how long each replica 118 is kept at target site 112. In some embodiments, policy 114 defines metrics for use in snap set creation and replication process determinations.
Policy 114 may define attributes for determining when to switch replication modes between sync, async, and metro replication modes. In embodiments, the policy 114 may define customer-selected requirements with respect to operational efficiencies. For example, a customer may indicate a desire or preference to use synchronous replication unless or until existing activities are impacted by performance and/or data safety issues, and then the customer desires to use async replication. In other embodiments, the switch among replication modes may be initiated upon request by an administrator or representative of the storage system.
As described herein, in example embodiments, the data replication mode performed by the system may be synchronous data replication with snap sets created in dynamic intervals during operation of storage system 100. The timing of synchronous replication cycles and the retention of the replicas 118 may be managed by replica manager 116 of target site 112. In some embodiments, the data replication mode may be asynchronous data replication. In other embodiments, the data replication mode may be metro replication.
In addition to managing replicas 118 according to a policy 114 (e.g., a replication and/or retention policy), the replica manager 116 may also include a cycle counter 117 to track generations of snap sets over time. The replica manager 116 may facilitate the transition functions described herein.
It will be understood that the roles of the source site 102 and the target site 112 may be reversed in instances, e.g., in which an event occurring on the source site 102 causes the target site 112 to intercept I/Os and take on the role of snap set creation and replication to the source site. This role reversal is referred to as a failover event. In this manner, the processes described herein apply equally to the target site. In this instance, the source site 102 becomes the new target site, and the target site 112 becomes the new source site, as shown in
Referring now to
Turning now to
In block 304, the process 300 validates a precondition for transitioning to a second mode of replication. The second mode of replication is different than the first mode of replication and can be any mode of replication other than the first mode of replication.
In block 306, the process 300 suspends the data flow for the active replication session. The process 300 suspends IO write operations from the source system to a target system (e.g., a device at the source system or a device at the target system).
In block 308, the process 300 establishes a base snap set pair for a mode transition cycle. The particular base snap set pair established will depend upon the mode of replication from which the replication session is transitioned. This step is further described in
In block 310, the process 300 prepares for the transitioning from the first replication mode to the second replication mode by updating the base snap set pair on both the source system and the target system of the storage system. This step is further described in
In block 312, the process 300 implements the mode transition cycle, and upon completion of the mode transition cycle, the process resumes the active replication session using the second replication mode in block 314. These steps are further described in the framework of
Turning now to
A generic function common to all replication modes is performed as part of a precondition validation feature. As shown in column 410, validating preconditions includes determining the stability of the session state and that sufficient system resources are available to transition between replication modes. Examples of system resources include additional volume slots and additional capacity for snap creation, to name a few. Generic functions also include identifying a mode request, e.g., from a customer.
Customized precondition validations for mode transitions are performed for each mode of replication as well. In column 404, precondition validations for transitioning from sync replication mode include determining the availability of recovery snap set pairs, which are used as a base snap set pair in a mode transition cycle process. Likewise, validations for transitioning from metro replication mode include determining the availability of recovery snap set pairs (as shown in column 408) for use as a base snap set pair during the mode transition cycle process. In column 406, precondition validations for transitioning from async replication mode include determining the availability of verified and synchronized snap set pairs for use as the base snap set pair in the mode transition cycle process.
Prior to transitioning between replication modes, data flow for the active replication cycle is suspended. In order to transition from async replication mode to sync or metro replication mode, data is suspended by waiting for the current cycle to complete (column 406). In transitioning from sync or metro replication mode to async replication mode, data flow is suspended by stopping IO mirroring in the data path of the session (columns 404, and 408, respectively).
Also prior to transitioning between replication modes, a base snap set pair is established for use in the mode transition cycle. In transitioning from sync or metro replication mode, the base snap set pair is the most recent recovery snap set pair from the sync/metro replication session (columns 404 and 408, respectively), while in transitioning from async replication mode to either sync or metro replication mode, the base snap set pair is the most recent synchronized (and verified) snap set pair from the async replication session (column 406).
In preparing for the transition between replication modes, a generic function that is common to all replication modes includes updating a mode transition cycle base snap set (SS) on the source and target (column 410). Another generic function includes creating a snap set against an active snap set on the target system as a mode transition cycle snap set (column 410).
Customized functions in preparing for the transition are performed. This preparation includes, in transitioning from sync or metro replication to async replication, a mode transition cycle base snap set is assigned to a most recent recovery snap set (columns 404 and 408, respectively). In transitioning from async replication to sync or metro replication, the custom preparation includes assigning a mode transition cycle snap set using the most recent synchronized snap set (column 406).
The mode transition cycle is performed via custom functions for each of the replication modes. In transitioning to async mode from sync or metro replication mode, data mirroring is enabled on the source system (consistency group) (columns 404 and 408, respectively). In transitioning to sync or metro mode from async mode, recovery data replication flow is set from the source system to the target system (column 406).
As part of the mode transitioning, a transition replication cycle is performed via customized functions for each of the replication modes. In transitioning to sync or metro replication from async replication, a delta between the consistency group and the last transition cycle base snap set is transferred to a target transition cycle snap set (columns 404 and 408, respectively). The mode transitioning cycle custom functions also include, in transitioning to sync or metro replication from async mode, transferring a delta between the consistency group at the source system and a most recent base snap set from the mode transition cycle to the mode transition cycle snap set at the target system (columns 404 and 408, respectively). In transitioning to async replication from sync or metro mode, the custom functions of the mode transitioning cycle include creating a new transition cycle snap set against the consistency group on the source system, and transferring a delta between the new transition cycle snap set and a most recent active snap set to the mode transition cycle snap set (column 406).
Upon completion of the mode transition cycle, a post process is performed with custom functions for each of the replication modes. To enter sync replication mode, as the target transition cycle snap set is now equivalent to the source product consistency group, the active snap set is assigned to the transition cycle snap set and IO mirroring is continued from the consistency group to a most recent active snap set at the target system (column 404). To enter async replication mode, as the target transition cycle snap set is now equivalent to the source transition cycle snap set, async replication cycle scheduling is initiated (column 406). To enter metro replication mode, as the target transition cycle snap set is now equivalent to the source production consistency group, the consistency group is refreshed to the transition cycle snap set and bi-direction IO mirroring is initiated (column 408).
In some embodiments, the source site 102 and/or target site 112 may be implemented as one or more computers, such as a computer 500 as shown in
Process 300 shown in
The processes described herein are not limited to the specific embodiments described. For example, process 300 is not limited to the specific processing order shown in
Processor 502 may be implemented by one or more programmable processors executing one or more computer programs to perform the functions of the system. As used herein, the term “processor” is used to describe an electronic circuit that performs a function, an operation, or a sequence of operations. The function, operation, or sequence of operations can be hard coded into the electronic circuit or soft coded by way of instructions held in a memory device. A “processor” can perform the function, operation, or sequence of operations using digital values or using analog signals. In some embodiments, the “processor” can be embodied in an application specific integrated circuit (ASIC). In some embodiments, the “processor” can be embodied in a microprocessor with associated program memory. In some embodiments, the “processor” can be embodied in a discrete electronic circuit. The “processor” can be analog, digital or mixed-signal.
While illustrative embodiments have been described with respect to processes of circuits, described embodiments may be implemented as a single integrated circuit, a multi-chip module, a single card, or a multi-card circuit pack. Further, as would be apparent to one skilled in the art, various functions of circuit elements may also be implemented as processing blocks in a software program. Such software may be employed in, for example, a digital signal processor, micro-controller, or general-purpose computer. Thus, described embodiments may be implemented in hardware, a combination of hardware and software, software, or software in execution by one or more processors.
Some embodiments may be implemented in the form of methods and apparatuses for practicing those methods. Described embodiments may also be implemented in the form of program code, for example, stored in a storage medium, loaded into and/or executed by a machine, or transmitted over some transmission medium or carrier, such as over electrical wiring or cabling, through fiber optics, or via electromagnetic radiation. A non-transitory machine-readable medium may include but is not limited to tangible media, such as magnetic recording media including hard drives, floppy diskettes, and magnetic tape media, optical recording media including compact discs (CDs) and digital versatile discs (DVDs), solid state memory such as flash memory, hybrid magnetic and solid state memory, non-volatile memory, volatile memory, and so forth, but does not include a transitory signal per se. When embodied in a non-transitory machine-readable medium, and the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the method.
When implemented on a processing device, the program code segments combine with the processor to provide a unique device that operates analogously to specific logic circuits. Such processing devices may include, for example, a general purpose microprocessor, a digital signal processor (DSP), a reduced instruction set computer (RISC), a complex instruction set computer (CISC), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA), a programmable logic array (PLA), a microcontroller, an embedded controller, a multi-core processor, and/or others, including combinations of the above. Described embodiments may also be implemented in the form of a bitstream or other sequence of signal values electrically or optically transmitted through a medium, stored magnetic-field variations in a magnetic recording medium, etc., generated using a method and/or an apparatus as recited in the claims.
Various elements, which are described in the context of a single embodiment, may also be provided separately or in any suitable subcombination. It will be further understood that various changes in the details, materials, and arrangements of the parts that have been described and illustrated herein may be made by those skilled in the art without departing from the scope of the following claims.
In the above-described flow chart of
Some embodiments may be implemented in the form of methods and apparatuses for practicing those methods. Described embodiments may also be implemented in the form of program code, for example, stored in a storage medium, loaded into and/or executed by a machine, or transmitted over some transmission medium or carrier, such as over electrical wiring or cabling, through fiber optics, or via electromagnetic radiation. A non-transitory machine-readable medium may include but is not limited to tangible media, such as magnetic recording media including hard drives, floppy diskettes, and magnetic tape media, optical recording media including compact discs (CDs) and digital versatile discs (DVDs), solid state memory such as flash memory, hybrid magnetic and solid state memory, non-volatile memory, volatile memory, and so forth, but does not include a transitory signal per se. When embodied in a non-transitory machine-readable medium and the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the method.
When implemented on one or more processing devices, the program code segments combine with the processor to provide a unique device that operates analogously to specific logic circuits. Such processing devices may include, for example, a general purpose microprocessor, a digital signal processor (DSP), a reduced instruction set computer (RISC), a complex instruction set computer (CISC), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA), a programmable logic array (PLA), a microcontroller, an embedded controller, a multi-core processor, and/or others, including combinations of one or more of the above. Described embodiments may also be implemented in the form of a bitstream or other sequence of signal values electrically or optically transmitted through a medium, stored magnetic-field variations in a magnetic recording medium, etc., generated using a method and/or an apparatus as recited in the claims.
For example, when the program code is loaded into and executed by a machine, such as the computer of
In some embodiments, a storage medium may be a physical or logical device. In some embodiments, a storage medium may consist of physical or logical devices. In some embodiments, a storage medium may be mapped across multiple physical and/or logical devices. In some embodiments, storage medium may exist in a virtualized environment. In some embodiments, a processor may be a virtual or physical embodiment. In some embodiments, a logic may be executed across one or more physical or virtual processors.
For purposes of illustrating the present embodiment, the disclosed embodiments are described as embodied in a specific configuration and using special logical arrangements, but one skilled in the art will appreciate that the device is not limited to the specific configuration but rather only by the claims included with this specification. In addition, it is expected that during the life of a patent maturing from this application, many relevant technologies will be developed, and the scopes of the corresponding terms are intended to include all such new technologies a priori.
The terms “comprises,” “comprising”, “includes”, “including”, “having” and their conjugates at least mean “including but not limited to”. As used herein, the singular form “a,” “an” and “the” includes plural references unless the context clearly dictates otherwise. Various elements, which are described in the context of a single embodiment, may also be provided separately or in any suitable subcombination. It will be further understood that various changes in the details, materials, and arrangements of the parts that have been described and illustrated herein may be made by those skilled in the art without departing from the scope of the following claims.
Chen, Xiangping, Kronrod, Svetlana
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
10037369, | Jun 26 2015 | EMC IP HOLDING COMPANY LLC | Storage tiering in replication target based on logical extents |
10082959, | Dec 27 2011 | EMC IP HOLDING COMPANY LLC | Managing data placement in storage systems |
10095428, | Mar 30 2016 | EMC IP HOLDING COMPANY LLC | Live migration of a tree of replicas in a storage system |
10152381, | Apr 27 2017 | EMC IP HOLDING COMPANY LLC | Using storage defragmentation function to facilitate system checkpoint |
10176046, | Jun 29 2017 | EMC IP HOLDING COMPANY LLC | Checkpointing of metadata into user data area of a content addressable storage system |
10235066, | Apr 27 2017 | EMC IP HOLDING COMPANY LLC | Journal destage relay for online system checkpoint creation |
7475124, | Sep 25 2002 | EMC IP HOLDING COMPANY LLC | Network block services for client access of network-attached data storage in an IP network |
8327103, | Jun 28 2010 | EMC IP HOLDING COMPANY LLC | Scheduling data relocation activities using configurable fairness criteria |
8380928, | Dec 17 2009 | EMC IP HOLDING COMPANY LLC | Applying data access activity measurements |
8429346, | Dec 28 2009 | EMC IP HOLDING COMPANY LLC | Automated data relocation among storage tiers based on storage load |
8515911, | Jan 06 2009 | EMC IP HOLDING COMPANY LLC | Methods and apparatus for managing multiple point in time copies in a file system |
8539148, | Dec 22 2010 | EMC IP HOLDING COMPANY LLC | Deduplication efficiency |
8566483, | Dec 17 2009 | EMC IP HOLDING COMPANY LLC | Measuring data access activity |
8583607, | Mar 28 2012 | EMC IP HOLDING COMPANY LLC | Managing deduplication density |
8683153, | Sep 29 2010 | EMC IP HOLDING COMPANY LLC | Iterating for deduplication |
8712976, | Mar 28 2012 | EMC IP HOLDING COMPANY LLC | Managing deduplication density |
8775388, | Sep 29 2010 | EMC IP HOLDING COMPANY LLC | Selecting iteration schemes for deduplication |
8782324, | Jun 28 2012 | EMC IP HOLDING COMPANY LLC | Techniques for managing placement of extents based on a history of active extents |
8799601, | Jun 28 2012 | EMC IP HOLDING COMPANY LLC | Techniques for managing deduplication based on recently written extents |
8909887, | Sep 25 2012 | EMC IP HOLDING COMPANY LLC | Selective defragmentation based on IO hot spots |
8930746, | Jun 30 2012 | EMC IP HOLDING COMPANY LLC | System and method for LUN adjustment |
8954699, | Jun 28 2012 | EMC IP HOLDING COMPANY LLC | Techniques for identifying IO hot spots using range-lock information |
8977812, | Mar 30 2011 | EMC IP HOLDING COMPANY LLC | Iterating in parallel for deduplication |
9152336, | Jun 30 2012 | EMC IP HOLDING COMPANY LLC | System and method for LUN adjustment |
9304889, | Sep 24 2014 | EMC IP HOLDING COMPANY LLC | Suspending data replication |
9355112, | Dec 31 2012 | EMC IP HOLDING COMPANY LLC | Optimizing compression based on data activity |
9384206, | Dec 26 2013 | EMC IP HOLDING COMPANY LLC | Managing data deduplication in storage systems |
9395937, | Dec 27 2013 | EMC Corporation | Managing storage space in storage systems |
9449011, | Dec 28 2012 | EMC IP HOLDING COMPANY LLC | Managing data deduplication in storage systems |
9459809, | Jun 30 2014 | EMC IP HOLDING COMPANY LLC | Optimizing data location in data storage arrays |
9460102, | Dec 26 2013 | EMC IP HOLDING COMPANY LLC | Managing data deduplication in storage systems based on I/O activities |
9477431, | Dec 28 2012 | EMC IP HOLDING COMPANY LLC | Managing storage space of storage tiers |
9513814, | Mar 29 2011 | EMC IP HOLDING COMPANY LLC | Balancing I/O load on data storage systems |
9529545, | Dec 26 2013 | EMC IP HOLDING COMPANY LLC | Managing data deduplication in storage systems based on storage space characteristics |
9542125, | Sep 25 2012 | EMC IP HOLDING COMPANY LLC | Managing data relocation in storage systems |
9594514, | Jun 27 2013 | EMC IP HOLDING COMPANY LLC | Managing host data placed in a container file system on a data storage array having multiple storage tiers |
9684593, | Nov 30 2012 | EMC IP HOLDING COMPANY LLC | Techniques using an encryption tier property with application hinting and I/O tagging |
9710187, | Jun 27 2013 | EMC IP HOLDING COMPANY LLC | Managing data relocation in storage systems |
9811288, | Dec 30 2011 | EMC IP HOLDING COMPANY LLC | Managing data placement based on flash drive wear level |
9817766, | Dec 28 2012 | EMC IP HOLDING COMPANY LLC | Managing relocation of slices in storage systems |
Date | Maintenance Fee Events |
Jan 23 2019 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Jan 23 2024 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Date | Maintenance Schedule |
Aug 04 2023 | 4 years fee payment window open |
Feb 04 2024 | 6 months grace period start (w surcharge) |
Aug 04 2024 | patent expiry (for year 4) |
Aug 04 2026 | 2 years to revive unintentionally abandoned end. (for year 4) |
Aug 04 2027 | 8 years fee payment window open |
Feb 04 2028 | 6 months grace period start (w surcharge) |
Aug 04 2028 | patent expiry (for year 8) |
Aug 04 2030 | 2 years to revive unintentionally abandoned end. (for year 8) |
Aug 04 2031 | 12 years fee payment window open |
Feb 04 2032 | 6 months grace period start (w surcharge) |
Aug 04 2032 | patent expiry (for year 12) |
Aug 04 2034 | 2 years to revive unintentionally abandoned end. (for year 12) |