A method for reducing data loss includes a first computing step for computing an intermediate result for each redundancy information entity of a redundancy set by processing respectively associated data information entities of a given data set on at least two main diagonals of a parity check matrix representing an error correction coding scheme. The method further includes a second computing step for computing the information content of the respective redundancy information entity dependent on the respective intermediate result.
|
3. A computer program product embodied on a computer readable storage medium having computer readable code thereon for reducing data loss, the data comprising a given redundancy set of at least two redundancy information entities (R) associated to a given data set of at least two data information entities (D), the information content of the redundancy set being computed dependent on the information content of the data set by applying an error correction coding scheme represented by a parity check matrix (H) wherein each redundant information entity (R) is represented by a row and each information entity of the data is represented by a column, and at least two square sub-matrices of the parity check matrix (H) having a main diagonal with elements being predominantly non-zero and having a number of rows and columns equal to a number (r) of redundancy information entities (R) in the redundancy set and representing consecutively placed data information entities (D) of the data set, the computer readable code comprising program instructions that when executed by a computer apparatus direct the computer apparatus to:
compute in a first computing operation an intermediate result (T) for the redundancy information entities (R) by process-ing the data information entities (D) on the at least two main diagonals, and
compute in a second computing operation the information content of the redundancy information entities (R) dependent on the intermediate result (T); wherein:
at least one square sub-matrix of the parity check matrix (H) with elements of the main diagonal being predominantly non-zero further has a neighboring diagonal with elements being predominantly non-zero;
the second computing step comprises processing the data information entities (D) on the respective neighboring diagonal utilizing the intermediate result (T); and
the respective information content of each redundancy information entity (R) in the redundancy set is computed as exclusive-OR of the respective information content of all data information entities (D) in the data set represented by a non-zero element in the respective row of the parity check matrix (H).
2. A device for reducing data loss at a plurality of data storage units, the device being in communication with the plurality of storage units, the data of the plurality of data storage units comprising a given redundancy set of at least two redundancy information entities (R) associated to a given data set of at least two data information entities (D), the information content of the redundancy set being computed dependent on the information content of the data set by applying an error correction coding scheme represented by a parity check matrix (H) wherein each redundant information entity (R) is represented by a row and each information entity of the data is represented by a column, and at least two square sub-matrices of the parity check matrix (H) having a main diagonal with elements being predominantly non-zero and having a number of rows and columns equal to a number (r) of redundancy information entities (R) in the redundancy set and representing consecutively placed data information entities (D) of the data set, being operable to:
compute in a first computing operation at a computing engine of the device an intermediate result (T) for the redundancy information entities (R) by processing the data information entities (D) on the at least two main diagonals, and
compute in a second computing operation at the computing engine of the device the information content of the redundancy information entities (R) dependent on the intermediate result (T); wherein:
at least one square sub-matrix of the parity check matrix (H) with elements of the main diagonal being predominantly non-zero further has a neighboring diagonal with elements being predominantly non-zero;
the second computing operation comprises processing the data information entities (D) on the respective neighboring diagonal utilizing the intermediate result (T); and
the respective information content of each redundancy information entity (R) in the redundancy set is computed as exclusive-OR of the respective information content of all data information entities (D) in the data set represented by a non-zero element in the respective row of the parity check matrix (H).
1. A method for creating an error correction coding scheme which reduces data loss at a plurality of data storage units, the data of the plurality of data storage units comprising a given redundancy set of at least two redundancy information entities (R) associated to a given data set of at least two data information entities (D), the information content of the redundancy set being computed dependent on the information content of the data set, the method comprising the steps of:
a base selection step (S2) for selecting a base coding scheme for a computing engine in communication with the plurality of storage units, the base coding scheme represented by a base matrix wherein each redundant information entity (R) is represented by a row and each information entity is represented by a column; and
a matrix setup step (S3) for setting up a target matrix (H′) with a subset of columns of the base matrix and for varying the order of columns with respect to the base matrix until the target matrix (H′) satisfies a given pattern of non-zero elements to at least a given extent;
wherein:
the given pattern of non-zero elements is selected to comprise a main diagonal with elements being predominantly non-zero of a square pattern sub-matrix of the target matrix (H′) having a number of rows and columns equal to a number (r) of redundancy information entities (R) in the redundancy set;
the given pattern of non-zero elements is selected to further comprise a neighboring diagonal disposed adjacent to the main diagonal of the square pattern sub-matrix of the target matrix (H′), the elements of the neighbouring diagonal being chosen to be predominantly non-zero;
the base matrix is selected to have the least possible number (nz) of non-zero elements for a given hamming distance (dmin) of the base coding scheme, number (n) of data information entities (D) in the data set and number (r) of redundancy information entities (R) in the redundancy set;
in the matrix setup step (S3), the order of the columns is varied until each square check sub-matrix of the target matrix (H′) with a number of columns equal to the number (r) of redundancy information entities (R) in the redundancy set has a rank equal to the number (r) of redundancy information entities (R) in the redundancy set;
the created error correction scheme is based on respectively computing the exclusive-or of the information content of all data information entities (D) represented by a non-zero element in each row of the target matrix (H′); and
the base coding scheme is based on one of the following: a hamming code or an extended hamming code.
|
1. Technical Field
The present invention relates to a method for creating an error correction coding scheme for reducing data loss. It further relates to a method, a device, a computer program product and a computer program for reducing data loss. It further relates to a system for protecting data stored on at least one storage unit against uncorrectable media errors.
2. Background Art
A storage unit is, for example, based on at least one magnetic disk or optical disk or on solid state memory as a storage medium. As the storage capacity of individual storage units grows, the probability of encountering at least one media-related error while reading data stored on at least one storage medium of a storage unit also increases. Data is lost when the error cannot be corrected by re-reading the specific part of the medium. Reliability of systems comprising two or more storage units can be increased by storing redundant data distributed to the two or more storage units. Such systems are known as redundant array of independent disks (RAID). A RAID configured system primarily reduces data loss due to a complete failure of a storage unit.
U.S. 2005/0108594 A1 discloses a method to protect data on a disk drive against uncorrectable media errors. The protection against uncorrectable media errors is provided for a RAID configured storage system by a technique in which redundancy information sectors are associated with data information sectors. The data information sectors and the redundancy information sectors are written as a single segment on a single storage unit. The redundancy information is either based on a Reed-Solomon code, an XOR-based code or one-dimensional parity.
Accordingly, it is desirable to provide a method for creating a coding scheme for reducing data loss that is simpler than previously-proposed techniques. It is also desirable to provide a method, a device, a computer program and a computer program product for reducing data loss that is simpler than previously-proposed techniques. It is further also desirable to provide a system for protecting data stored on at least one storage unit against uncorrectable media errors that is simpler and more reliable than previously-proposed techniques.
According to an embodiment of a first aspect of the present invention, there is provided a method for creating an error correction coding scheme for reducing data loss, the data comprising a given redundancy set of at least two redundancy information entities associated to a given data set of at least two data information entities, the information content of the redundancy set being computed dependent on the information content of the data set, the method comprising the steps of: a base selection step for selecting a base coding scheme represented by a base matrix wherein each redundant information entity is represented by a row and each information entity is represented by a column, and a matrix setup step for setting up a target matrix with a subset of columns of the base matrix and for varying the order of columns in respect to the base matrix until the target matrix satisfies a given pattern of non-zero elements to at least a given extent. This feature enables construction of a computing engine for computing the information content of the redundancy set, which is simpler than computing engines used in previously-proposed techniques. The given pattern of non-zero elements influences the complexity of the error correction coding scheme and, by this, also its implementation in the computing engine. The information entity may be one bit or byte or a sector on a storage unit or any other suitable entity for storing or transmitting or receiving information.
According to a preferred embodiment of the first aspect of the invention, the given pattern of non-zero elements is selected to comprise a main diagonal with elements being predominantly non-zero of a square pattern sub-matrix of the target matrix having a number of rows and columns equal to a number of redundancy information entities in the redundancy set. The given pattern of non-zero elements is thus simpler and regular and enables a simpler construction of the computing engine.
In this respect, it is advantageous that the given pattern of non-zero elements is selected to further comprise a neighboring diagonal disposed adjacent to the main diagonal of the square pattern sub-matrix of the target matrix, the elements of the neighbouring diagonal being chosen to be predominantly non-zero. This enables the construction of the computing engine such that elements of the neighboring diagonal can be processed by utilizing an intermediate result computed from elements of the main diagonal. The computing engine can thus be more efficient.
According to a further preferred embodiment of the first aspect of the invention, the base matrix is selected to have the least possible number of non-zero elements for a given Hamming distance of the base coding scheme, number of data information entities in the data set and number of redundancy information entities in the redundancy set. This enables a reduced number of operations for computing the information content of the redundancy set.
According to a further preferred embodiment of the first aspect of the invention, in the matrix setup step, the order of the columns is varied until each square check sub-matrix of the target matrix with a number of columns equal to the number of redundancy information entities in the redundancy set has a rank equal to the number of redundancy information entities in the redundancy set. This enables recovering of up to the number of redundancy information entities in the redundancy set consecutive unreadable data information entities, also called “erasures”. By this, the data can be protected more reliably against data loss and the possibility of data loss is reduced.
According to a further preferred embodiment of the first aspect of the invention, the created error correction scheme is based on respectively computing the exclusive-or of the information content of all data information entities represented by a non-zero element in each row of the target matrix. This enables a simpler and higher performance error correction coding scheme with a reduced overhead for computing the redundancy set and that is also more readily implemented compared to previously-proposed techniques.
In this respect, it is advantageous if the base coding scheme is based on one of the following: a Hamming code or an extended Hamming code. This enables more reliability of the error correction coding scheme.
According to an embodiment of a second aspect of the invention, there is provided a method for reducing data loss, the data comprising a given redundancy set of at least two redundancy information entities associated to a given data set of at least two data information entities, the information content of the redundancy set being computed dependent on the information content of the data set by applying an error correction coding scheme represented by a parity check matrix wherein each redundant information entity is represented by a row and each information entity of the data is represented by a column, and at least two square sub-matrices of the parity check matrix having a main diagonal with elements being predominantly non-zero and having a number of rows and columns equal to a number of redundancy information entities in the redundancy set and representing consecutively placed data information entities of the data set, comprising: a first computing step for computing an intermediate result for the redundancy information entities by processing the data information entities on the at least two main diagonals, and a second computing step for computing the information content of the redundancy information entities dependent on the intermediate result. Due to the at least two square sub-matrices of the parity check matrix having elements of the main diagonal being predominantly non-zero the computation of information content of the redundancy set is simpler.
According to a preferred embodiment of the second aspect of the invention, at least one square sub-matrix of the parity check matrix with elements of the main diagonal being predominantly non-zero further has a neighboring diagonal with elements being predominantly non-zero and the second computing step comprises processing the data information entities on the respective neighboring diagonal utilizing the intermediate result. This enables to compute the information content of the redundancy set more efficiently.
According to a further preferred embodiment of the second aspect of the invention, the respective information content of each redundancy information entity in the redundancy set is computed as exclusive-or of the respective information content of all data information entities in the data set represented by a non-zero element in the respective row of the parity check matrix. This enables computation of the information content of the redundancy set with reduced overhead resulting in higher performance. The method is further easier to implement.
According to an embodiment of a third aspect of the present invention, there is provided a device for reducing data loss. The device corresponds to an embodiment of the second aspect of the present invention and the advantages thereof.
According to an embodiment of a fourth aspect of the present invention, there is provided a system for protecting data stored on at least one storage unit against uncorrectable media errors. The system comprises a device embodying the third aspect of the present invention and at least one storage unit. Each information entity represents a sector on the at least one storage unit. The system corresponds to the device and the advantages thereof.
According to a preferred embodiment of the fourth aspect of the invention, the system is configured as a redundant array of independent storage units. The configuration is also known as a redundant array of independent disks (RAID). This enables more reliability, specifically in the case of a complete failure of one storage unit. Data loss is thus reduced by inter-disk redundancy provided by the redundant array of independent storage units and the intra-disk redundancy provided by the redundancy set. The advantageous embodiment of the third aspect of the invention is not limited to disks and also may comprise any other kind of storage unit.
According to an embodiment of a fifth aspect of the present invention, there is provided a computer program product for reducing data loss comprising a computer readable medium embodying program instructions executable by a computer. The program instructions correspond to an embodiment of the second aspect of the invention and the advantages thereof.
According to an embodiment of the sixth aspect of the present invention, there is provided a computer program for reducing data loss comprising program instructions. The program instructions correspond to an embodiment of the second aspect of the invention and the advantages thereof.
Reference will now be made, by way of example, to the accompanying drawings, in which:
Data is organized in stripes 202. The stripes 202 span all five storage units 100. Each individual storage unit 100 of the system comprises one strip 203 for each stripe 202. Each strip 203 comprises four segments 204 and each segment 204 comprises sixteen chunks of data, each comprising eight sectors. Each segment 204 therefore has 128 sectors.
In the RAID level 5 configured system, each strip 203 carries either user data E or RAID parity data P. The RAID parity data P is computed as modulo 2 sum, also known as exclusive-or or XOR, of all user data E in the same stripe 202. The location of the RAID parity data P is respectively rotated from one storage unit 100 to one of the other storage units 100 in the array 200 in successive stripes. If one of the storage units 100 fails, the respective user data E and RAID parity data P stored on the failed storage unit 100 can be recovered from the other user data E and RAID parity data P stored in the same stripe 202 on the other storage units 100 that are still working. A RAID level 5 configured system allows rebuilding of the information content of one failed storage unit 100 by recovering the lost data and writing it on a spare storage unit 100 included in the system.
Before the reconstruction is finished, data loss will happen in a RAID level 5 configured system if either a second storage unit 100 fails or a media error occurs on one of the other storage units 100. As the storage capacity of individual storage units 100 grows, the total number of bytes that are read during a rebuild operation becomes larger. This increases the probability of encountering an uncorrectable media error, typically resulting in one or more sectors becoming unreadable. The occurrence of uncorrectable media errors is particularly problematic when combined with a failure of one storage unit 100 in the system. For example, if one storage unit 100 fails in a RAID level 5 configured system, the rebuild process reads all the data on the remaining storage units 100 in order to rebuild the lost data on the spare storage unit 100. During this phase, an uncorrectable media error on any of the still working storage units 100 in the array 200 would lead to data loss because there is no way to reconstruct the information content of the uncorrectable sectors. The risk of data loss in this vulnerable phase becomes worse due to the continuous rapid increase of disk capacity and much slower advance in disk bandwidth and disk reliability. Theoretical and field results have shown that the dominant source of data loss in RAID level 5 configured systems is media-related failure during rebuilding.
The risk of data loss due to one or more media errors can be reduced by providing intra-disk redundancy, also called Sector Protection through Intra-Disk REdundancy (SPIDRE).
The concept of intra-disk redundancy can be applied to any of the existing RAID architectures or levels, e.g. RAID level 5, level 51, level 6 or level N+3. RAID redundancy provided by the RAID redundancy data P primarily reduces loss of data stored on the array 200 due to the failure of one complete storage unit 100. The intra-disk redundancy reduces loss of data stored on each individual storage unit 100 due to uncorrectable media errors. As can be appreciated, the concept of intra-disk redundancy can also be applied to a system comprising only one single storage unit 100.
Every modification of user data E in segment 204, e.g. the fourth chunk of data of segment 204 on the second storage unit HDD2, should be accompanied by the update of the intra-disk redundancy data S associated to the modified user data E in the same segment 204, e.g. the ninth chunk of data of segment 204 on the second storage unit HDD2. Further, the RAID parity data P corresponding to the modified user data E should also be updated, e.g. the fourth chunk of data of segment 204 on the eighth storage unit HDD8. Due to the update of the RAID parity data P the intra-disk redundancy data S associated to the updated RAID parity data P should also be updated, e.g. the ninth chunk of data of segment 204 on the eighth storage unit HDD8. Writing these four chunks of data individually would lead to four requests. By storing the intra-disk redundancy data S consecutively with the user data E in the same segment 204, only a first request 205 and a second request 206 are used to update the chunk of user data E, the chunk of RAID parity data P and the respective corresponding chunks of intra-disk redundancy data S.
In normal operation, i.e. without failure of any storage unit 100, user data E can be read from the at least one storage unit 100 without also reading the corresponding RAID parity data P. Accordingly, user data E can be read from the at least one storage unit 100 without also reading intra-disk redundancy data S as long as no media error occurs while reading the user data E. Reading of several consecutive chunks of user data E is advantageously done with a single request. This request may also comprise reading the intra-disk redundancy data S if it is located between chunks of user data E covered by the request. As can be appreciated, the information content of the intra-disk redundancy data S can be ignored in this case.
Each request for updating user data E uses the reading and writing of at least two chunks of data on at least two storage units 100: the modified user data E and RAID parity data P, respectively, and the respectively corresponding intra-disk redundancy data S. Reading and writing each of the four chunks of data with an individual request would use four requests for reading plus four requests for writing for each update of user data E. By applying the first request 205 and the second request 206, updating of user data E is achieved with only two requests for reading plus two requests for writing. One or more additional chunks of data, particularly user data E or RAID parity data P, that are logically placed in-between the modified chunk of user data E or RAID parity data P and the corresponding chunk of intra-disk redundancy data S may be read and written with the same request, respectively. The requested number of chunks of data for each request therefore depends on the distance between the modified chunk of user data E or RAID parity data P and the corresponding chunk of intra-disk redundancy data S. The average number of chunks of data of all requests for reading and/or writing are reduced by placing the intra-disk redundancy data S approximately in the middle of segment 204 compared to placing it at the beginning or the end of segment 204. In the present embodiment, the average number of chunks of data read and/or written per request is about 5.27. A typical ratio of seek time for each request to the time used to read one chunk of data of, for example, 4 KB is about 50 to 1. The effect of reading and/or writing more than one chunk of data with each request on the overall read/write performance, particularly when updating user data E, is therefore smaller compared to accessing each of the four chunks of data of the above example for updating user data E, RAID parity data P and intra-disk redundancy data S with individual requests.
As an example, each chunk of data has a size of 4 KB. Segment 204 has 128 sectors. Each chunk of data is divided into eight sectors. The chunk of data carrying the intra-disk redunancy data S is considered as a redundancy set of redundancy information sectors R. A number r of redundancy information sectors R is 8. Each chunk of user data E has eight data information sectors D. There are fifteen chunks of user data E in segment 204. A number n of data information sectors D carrying user data E therefore is 120. All 120 data information sectors D of all chunks of user data E of segment 204 are considered a data set. The redundancy set is associated with the data set in segment 204. The information content of the redundancy set is computed dependent on the information content of the data set. Particularly, the redundancy information sectors R in the redundancy set each can be considered as a parity that is computed dependent on a respectively associated subset of the data set, i.e. each redundancy information sector R is computed as parity of a respectively associated set of data information sectors D that is a subset of the data set.
In case a chunk of user data E cannot be read correctly from storage unit 100, this chunk of user data E is marked as a so called “erasure”. The information content of the erasure can in some cases be recovered using the information content of other chunks of user data E and the intra-disk redundancy data S of the same segment 204 on the same storage unit 100. The error correction capability, or more precisely, the erasure recovery capability achieved by providing the redundancy set that is associated with the data set in segment 204 depends on the respective selection of data information sectors D that are used for computing the respective redundancy information sector R. It is desirable to compute the information content of the redundancy set with as few computing operations as possible so as to reduce the need to use a complex computing engine 106.
Both the third and the fourth type of the parity check matrix H have the least number nz of non-zero elements for the given minimum Hamming distance dmin of 4 or 3, respectively, and for the number r of redundancy information sectors R of 8 and the number n of data information sectors D of 120.
It is possible to ensure, for a minimum Hamming distance dmin of 3, that parity check matrix H has the least number nz of non-zero elements, because the columns of the parity-check matrix H representing the Hamming code are formed from a set of binary tuples with the number r of redundancy information sectors R elements representing all non-zero numbers up to two raised to the power of the number r of redundancy information sectors R minus one. The columns of the parity check matrix H can be sorted to have columns with increasing number nz of non-zero elements from left to right. If less than two raised to the power of the number r of redundancy information sectors R minus one columns are used for representing all sectors of segment 204, the least number nz of non-zero elements can be guaranteed by selecting the left-most columns. For a minimum Hamming distance dmin greater than 3 the least number of non-zero elements cannot be guaranteed in the same way.
The third and the fourth type of the parity check matrix H are also modified such that they show the same properties regarding the error correction capability of correcting up to the number r of redundancy information sectors R, i.e. up to eight, consecutive sectors with a media error as the interleaved parity check code. In comparison to the interleaved parity check code, the extended Hamming code has the advantage of a better error correction capability due to the minimum Hamming distance dmin of 4 or 3, respectively. This additionally enables correcting any three and two sector media errors in segment 204, respectively. In general, the minimum Hamming distance dmin minus one single media errors can be corrected in segment 204. Using the extended Hamming code thus leads to improved protection on storage unit 100 and therefore to increased reliability of the storage unit 100 but also increases the computing power of the computing engine 106 for performing the higher number of exclusive-or operations due to the higher number nz of non-zero elements in the parity check matrix H compared to the interleaved parity check code. Error correction capability of the interleaved parity check code can be applied to most high-end storage units 100, particularly for most high-end hard disk drives (such as, for example, incorporating small computer system interfaces, SCSI). One of the extended Hamming codes with a higher error correction capability compared to the interleaved parity check code may be considered for low-end storage units 100, particularly for low-end hard disk drives (such as, for example, incorporating, advanced technology attachment systems, ATA or serial advanced technology attachments systems, SATA).
In order to reduce the need to use a complicated computing engine 106 and to guarantee certain properties of the data protection, e.g. the error correction capability of correcting up to the number r of redundancy information sectors R consecutive unreadable sectors due to a media error, the parity check matrix H can be improved. Two metrics are introduced for better comparison of different error correction coding schemes based on different parity check matrices H.
A first metric is XORO, the XOR Overhead. XORO is a measure of the computational cost of programming an XOR engine to complete all the exclusive-or operations for a given task, e.g. computing the information content of the redundancy set. The XOR engine is represented by the computing engine 106. For a single exclusive-or computation with a number k of operands, XORO is defined as XORO(k)=k+1. XORO does not account for the size of the operands.
The given task consumes memory bandwidth for sub-tasks such as moving data or parity between the storage unit 100 and the external memory 108, sending user data E from the external memory 108 to the host, e.g. the computer system, through the host interface 104 or moving data or parity into or out of the XOR engine, i.e. the computing engine 106. The consumption of memory bandwidth is quantified by a second metric called MBWC. If, for example, the given task is to compute the exclusive-or of a given number of chunks of data, e.g. of user data E or RAID parity data P, received from the host and writing these plus the computed result, e.g. the intra-disk redundancy data S, to the at least one storage unit 100, the memory bandwidth consumption MBWC in completing this given task is made up of several components. In a first component, the given number of chunks of data received from the host are written to the external memory 108. In a second component, the given number of chunks of data are read from the external memory 108 into the computing engine 106. In a third component, the computed result is written back to the external memory 108. In a fourth component, the given number of chunks and the computed result are written to the at least one storage unit 100. The total number of chunks of data transferred to and from the external memory is therefore three times the given number of chunks of data plus two for the computed result. In this example, all chunks of data are of the same size, e.g. 4 KB.
The computation of XORO is further illustrated with the IPC-8 (128, 120) code. Each of the redundancy information sectors R is the result of exclusive-or operations on fifteen distinct data information sectors D from among the total 120 data information sectors D of segment 204. The interleaved dependence of redundancy information sectors R on data information sectors D is captured in the 8 by 128 parity check matrix H with a regular pattern comprising sixteen distinct 8 by 8 pattern sub-matrices having elements on a respective main diagonal being non-zero. All other elements of each 8 by 8 pattern sub-matrix are zero. If all 120 data information sectors D of segment 204 are stored in the external memory 108, each of the eight redundancy information sectors R can be computed using an exclusive-or operation with fifteen source operands, i.e. data information sectors D, and one destination operand, i.e. redundancy information sector R. The XORO value for the computation of each redundancy information sector R is therefore equal to sixteen and the XORO value for the computation of all eight redundancy information sectors R is 128. In this case, computation of each redundancy information sector R is done sector by sector. The complexity of the computing engine can be reduced taking into account that the 120 data information sectors D are stored consecutively in contiguous locations in the external memory 108. All eight redundancy information sectors R can then be computed with a single exclusive-or operation with fifteen source operands and one destination operand. However, in this case, each operand spans eight consecutive sectors. The computed redundancy information sectors R are also stored consecutively in the external memory 108. The XORO value for computing all eight redundancy information sectors R is therefore 16. The MBWC value does not change due to the different computation of the redundancy information sectors R and equals 47 chunks of data. In contrast to the above-described case, computation of each redundancy information sector R is done chunk by chunk in the present scenario.
The method begins with a step S1 as an entry point. In a step S2, the base selection step is performed. Additionally, the number r of redundancy information sectors R and a number L of sectors in segment 204 are set. For example, the number r of redundancy information sectors R is 8 and the number L of sectors in segment 204 is 128, comprising both the eight redundancy information sectors R and the 120 data information sectors D. In a step S3 a target matrix H′ is set up. In this example the target matrix H′ is set up as a square identity matrix with a number of rows and columns equal to the number r of redundancy information sectors R. The identity matrix represents the redundancy information sectors R. Further, a randomized base matrix H1 is set up by randomly changing the order of columns of the selected base matrix, i.e. the selected parity check matrix H, excluding the first number r of redundancy information sectors R columns, if these already represent the identity matrix, as it is the case if the fourth type of the parity check matrix H is selected as the base matrix. Additionally, a vector I is set up comprising one element for each sector in segment 204, i.e. the number of elements of vector I being equal to the number L of sectors. The first number r of redundancy information sectors R elements of vector I are set to one, the other elements are set to zero. In vector I, all columns of the randomized base matrix H1 that are also present in the target matrix H′ are marked by a one.
In a step S4, a value of a first variable i is set to the number r of redundancy information sectors R plus one. Accordingly, in a step S5, a value of a second variable j is set to the number r of redundancy information sectors R plus one. The first variable i represents an index of the current column in target matrix H′. The second variable j represents an index of the current column in randomized base matrix H1. In a step S6, it is checked if the element of vector I pointed to by the second variable j is equal to zero. If this is the case, i.e. the corresponding column of the randomized base matrix H1 is not present in the target matrix H′ yet, the column of the randomized base matrix H1 pointed to by the second variable j is appended to the target matrix H′ at the position pointed to by the first variable i in a step S7.
In a step S8, it is checked if the target matrix H′ satisfies a given set of predetermined conditions. The predetermined conditions depend on the requirements for the resulting error correction coding scheme. This given set of predetermined conditions may, for example, comprise the target matrix H′ exhibiting the given pattern of non-zero elements, e.g. that elements on the main diagonal of the respective pattern sub-matrix are predominantly non-zero. This can, for example, easily be checked by masking the corresponding elements and counting the number of non-zero elements covered by the mask. If the counted number of non-zero elements per pattern sub-matrix exceeds a given threshold the pattern can be considered to be present in the respective pattern sub-matrix of the target matrix H′.
The set of predetermined conditions may also comprise the target matrix H′ exhibiting a given property regarding the error correction capability of the resulting error correction coding scheme. For example, to achieve the capability of correcting up to the number r of redundancy information sectors R of consecutive unreadable sectors in segment 204, each square check sub-matrix of the target matrix H′ with a number of columns equal to the number r of redundancy information sectors R should have a rank equal to the number r of redundancy information sectors R.
If the target matrix H′ satisfies all predetermined conditions in the given set of predetermined conditions to a given extent the column appended to the target matrix H′ is kept and the current element of vector I is set to one in a step S9. Satisfying the predetermined conditions to a given extent means, for example, that not all of the distinct square sub-matrices of the target matrix H′ should exhibit the given pattern but at least a given number or percentage of the sub-matrices exhibit the given pattern. In a step S10, the value of the first variable i is increased by one. In a step S11 it is checked if the value of the first variable i is equal to the number L of sectors in segment 204. If this is the case, the method ends in a step S12. Otherwise the method continues in step S5.
If the target matrix H′ in step S8 does not satisfy all predetermined conditions in the given set of predetermined conditions to the given extent the column appended to the target matrix H′ is deleted in a step S13 . The value of the second variable j is then increased in a step S14 to try the next column of the randomized base matrix H1. In a step S15, it is checked if the value of the second variable j is equal to the number L of sectors in segment 204. If this is not the case, the method continues in step S6. Otherwise there are no more columns available to try. In this case, the method continues in step S3, i.e. the target matrix H′ and the vector I are reset and a new randomized base matrix H1 is created from the base matrix by randomly reordering the columns of the selected base matrix. The method also continues in step S14 if the current element of vector I is not equal to zero in step S6.
The XORO value and the MBWC value can be further reduced by enabling the computing engine 106 to perform operations on intermediate results stored in the internal memory 107 and to overwrite source operands with the computed result or with an intermediate result T and by taking advantage of the given pattern of non-zero elements comprising the main diagonal and the neighboring diagonal of pattern sub-matrices being predominantly non-zero. Operations performed only on the data stored in the internal memory 107 do not contribute to the MBWC value because no data movement between the computing engine 106 and the external memory 108 is used. The contribution to the XORO value is 2 for these operations. It is therefore advantageous to compute the redundancy information sectors R by utilizing intermediate results T stored in the internal memory 107 of the computing engine 106.
For the example shown in
This results to a XORO value of 7×2, i.e. 14, and a MBWC value of zero for this step. The fourteenth, fifteenth and sixteenth main diagonal are processed as follows:
The remaining non-zero elements of the parity check matrix H can then be processed individually sector by sector. There are 164 non-zero elements in the parity check matrix H not located on one of the main diagonals or on one of the neighboring diagonals. Additionally, there are eight elements on main diagonals or neighboring diagonals that are zero. These together require additional 172 exclusive-or operations with the operand size of one sector. This results in an additional XORO value of 172 and MBWC value of 172 sectors. The total XORO value is therefore 202 and the MBWC value is 300 sectors or 37.5 chunks of data for computing the information content of the redundancy set. An additional fifteen plus sixteen chunks of data contribute to the MBWC value due to moving fifteen chunks of user data E from the host to the external memory 108 and to moving the fifteen chunks of user data E and the computed intra-disk redundancy data S as the result to the at least one storage unit 100. The total MBWC thus is about 69 chunks of data.
In order to place the redundancy information sectors R approximately in the middle of segment 204 as described above the columns of parity check matrix H can be shifted cyclically. For example, the original columns C1 to C64 of the the embodiment of the parity check matrix H become columns C65 to C128 and the original columns C65 to C 128 become columns C1 to C64. By this the redundancy information sectors R are moved from columns C1 to C8 to columns C65 to C72. It can be verified that this cyclic shift of columns does not change the capability of correcting up to eight consecutive unreadable sectors in segment 204 for the embodiment of the parity check matrix H.
The method shown in
The embodiments explained above are based on sectors as information entity. Alternatively, information entities can also represent a bit, a byte or any other suitable entity of information. The redundancy information sectors R and the data information sectors D represent specific embodiments and can more generally be considered as redundancy information entities R and data information entities D, respectively. Further, reduction of data loss due to media errors on a storage unit 100 is only one of many applications. For example, loss of data transmitted or received over a radio channel or cable can also be reduced by applying the error correction coding scheme presented above.
It will be understood that the present invention has been described purely by way of example, and modifications of detail can be made within the scope of the invention.
Each feature disclosed in the description, and (where appropriate) the claims and drawings may be provided independently or in any appropriate combination.
Iliadis, Ilias, Dholakia, Ajay, Hu, Xiaoyu, Elftheriou, Evangelos
Patent | Priority | Assignee | Title |
10216574, | Oct 24 2012 | SanDisk Technologies, Inc | Adaptive error correction codes for data storage systems |
10303547, | Jun 04 2014 | Pure Storage, Inc. | Rebuilding data across storage nodes |
11593203, | Jun 04 2014 | Pure Storage, Inc. | Coexisting differing erasure codes |
12066895, | Jun 04 2014 | Pure Storage, Inc. | Heterogenous memory accommodating multiple erasure codes |
8321762, | Nov 14 2005 | GLOBALFOUNDRIES Inc | Method for creating an error correction coding scheme |
8972826, | Oct 24 2012 | SanDisk Technologies, Inc | Adaptive error correction codes for data storage systems |
9021339, | Nov 29 2012 | Western Digital Technologies, INC | Data reliability schemes for data storage systems |
9059736, | Dec 03 2012 | SanDisk Technologies, Inc | Methods, solid state drive controllers and data storage devices having a runtime variable raid protection scheme |
9214963, | Dec 21 2012 | Western Digital Technologies, INC | Method and system for monitoring data channel to enable use of dynamically adjustable LDPC coding parameters in a data storage system |
Patent | Priority | Assignee | Title |
7058873, | Nov 07 2002 | Carnegie Mellon University | Encoding method using a low density parity check code with a column weight of two |
7260763, | Mar 11 2004 | Microsoft Technology Licensing, LLC | Algebraic low-density parity check code design for variable block sizes and code rates |
7278085, | Mar 06 2003 | Maxtor Corporation | Simple error-correction codes for data buffers |
7328397, | Mar 19 2003 | U S BANK NATIONAL ASSOCIATION, AS COLLATERAL AGENT | Method for performing error corrections of digital information codified as a symbol sequence |
7370264, | Dec 19 2003 | STMICROELECTRONICS FRANCE | H-matrix for error correcting circuitry |
7461329, | Mar 22 2004 | Canon Kabushiki Kaisha | Channel encoding adapted to error bursts |
CN610062, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Oct 25 2006 | ILIADIS, ILIAS | International Business Machines Corporation | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF THE SECOND ASSIGNOR S LAS T NAME PREVIOUSLY RECORDED AT REEL: 019958 FRAME: 0412 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNOR | 035561 | /0700 | |
Oct 25 2006 | HU, XIAO-YU | International Business Machines Corporation | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF THE SECOND ASSIGNOR S LAS T NAME PREVIOUSLY RECORDED AT REEL: 019958 FRAME: 0412 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNOR | 035561 | /0700 | |
Oct 25 2006 | ILIADIS, ILIAS | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 019958 | /0412 | |
Oct 25 2006 | HU, XIAO-YU | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 019958 | /0412 | |
Oct 31 2006 | DHOLAKIA, AJAY | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 019958 | /0412 | |
Oct 31 2006 | DHOLAKIA, AJAY | International Business Machines Corporation | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF THE SECOND ASSIGNOR S LAS T NAME PREVIOUSLY RECORDED AT REEL: 019958 FRAME: 0412 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNOR | 035561 | /0700 | |
Nov 02 2006 | ELEFTHERIOUS, EVANGELOS | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 019958 | /0412 | |
Nov 02 2006 | ELEFTHERIOU, EVANGELOS | International Business Machines Corporation | CORRECTIVE ASSIGNMENT TO CORRECT THE SPELLING OF THE SECOND ASSIGNOR S LAS T NAME PREVIOUSLY RECORDED AT REEL: 019958 FRAME: 0412 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNOR | 035561 | /0700 | |
Nov 09 2006 | International Business Machines Corporation | (assignment on the face of the patent) | / | |||
Jun 29 2015 | International Business Machines Corporation | GLOBALFOUNDRIES U S 2 LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 036550 | /0001 | |
Sep 10 2015 | GLOBALFOUNDRIES U S INC | GLOBALFOUNDRIES Inc | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 036779 | /0001 | |
Sep 10 2015 | GLOBALFOUNDRIES U S 2 LLC | GLOBALFOUNDRIES Inc | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 036779 | /0001 |
Date | Maintenance Fee Events |
Sep 03 2010 | ASPN: Payor Number Assigned. |
Apr 25 2014 | REM: Maintenance Fee Reminder Mailed. |
Sep 14 2014 | EXP: Patent Expired for Failure to Pay Maintenance Fees. |
Date | Maintenance Schedule |
Sep 14 2013 | 4 years fee payment window open |
Mar 14 2014 | 6 months grace period start (w surcharge) |
Sep 14 2014 | patent expiry (for year 4) |
Sep 14 2016 | 2 years to revive unintentionally abandoned end. (for year 4) |
Sep 14 2017 | 8 years fee payment window open |
Mar 14 2018 | 6 months grace period start (w surcharge) |
Sep 14 2018 | patent expiry (for year 8) |
Sep 14 2020 | 2 years to revive unintentionally abandoned end. (for year 8) |
Sep 14 2021 | 12 years fee payment window open |
Mar 14 2022 | 6 months grace period start (w surcharge) |
Sep 14 2022 | patent expiry (for year 12) |
Sep 14 2024 | 2 years to revive unintentionally abandoned end. (for year 12) |