A method and apparatus for processing a received digital signal that has been corrupted by a channel is disclosed. The method includes storing the received digital signal and receiving a partially corrected sequence of symbols that includes an output of a preliminary denoising system operating on the received digital signal. Information specifying a signal degradation function that measures the signal degradation that occurs if a symbol having the value I is replaced by a symbol having the value J is utilized to generate a processed digital signal by replacing each symbol having a value I in a context of that symbol in the received digital signal with a symbol having a value J if replacement reduces a measure of overall signal degradation in the processed digital signal relative to the received digital signal as measured by the degradation function and the partially corrected sequence of symbols.
1. An apparatus for denoising an input noisy signal, the apparatus comprising:
one or more memories; and
a controller that
receives the noisy signal z that includes a number of sequentially ordered symbols, each symbol having a position,
stores the noisy signal z in the one or more memories,
receives a signal r, output from a preliminary denoising system that operates on the received noisy signal z, that includes a number of sequentially ordered symbols, each symbol having a position,
stores the signal r in the one or more memories, and
produces an output signal z′ by replacing a symbol within each of a number of different subsequences that occur in the noisy signal z with a corresponding replacement symbol that the controller computes to provide a minimal estimated signal degradation.
8. A method for denoising a noisy signal and partially corrected signal to generate an output signal, the method comprising:
receiving the noisy signal z that includes a number of sequentially ordered symbols, each symbol having a position,
storing the noisy signal z in one or more memories,
receiving the partially corrected signal r, output from a preliminary denoising system that operates on the received noisy signal z, that includes a number of sequentially ordered symbols, each symbol having a position,
storing the partially corrected signal r in the one or more memories, and
producing the output signal z′ by replacing a symbol within each of a number of different subsequences that occur in the noisy signal z with a corresponding replacement symbol that the controller computes to provide a minimal estimated signal degradation.
14. A computer readable medium encoded with a data processing program for denoising a noisy signal and a partially corrected signal to generate an output signal by:
receiving the noisy signal z that includes a number of sequentially ordered symbols, each symbol having a position,
storing the noisy signal z in one or more memories,
receiving the partially corrected signal r, output from a preliminary denoising system that operates on the received noisy signal z, that includes a number of sequentially ordered symbols, each symbol having a position,
storing the partially corrected signal r in the one or more memories, and
producing the output signal z′ by replacing a symbol within each of a number of different subsequences that occur in the noisy signal z with a corresponding replacement symbol that the controller computes to provide a minimal estimated signal degradation.
2. The apparatus of
for each of a number of different symbol subsequences, z(q), about symbol zq, that occur in the received noisy signal z,
counting a number of occurrences of each symbol at the corresponding positions p in signal r, rp, for positions p in the received noisy signal z at which z(p) is equal to z(q) and storing the counted number of occurrences in the one or more memories; and
for each of the number of symbol subsequences, z(q), in the received noisy signal z,
replacing symbol zq of subsequence z(q) in all occurrences of subsequence z(q), at positions zp, in the noisy signal z with a replacement symbol zq′ which produces a minimal computed signal degradation.
3. The apparatus of
a degradation function C( ) that ;
the received noisy signal z;
the signal r; and
the counts of the number of occurrences of each symbol at the corresponding positions p in signal r, rp, for positions p in the received noisy signal z at which z(p) is equal to z(q).
4. The apparatus of
5. The apparatus of
where C(rp,zq′) is the degradation estimated for replacing the symbol rp at position p in the signal r with symbol zq′; and
p represents the positions in the signals r and z at which z(p) is equal to z(q).
6. The apparatus of
7. The apparatus of
9. The method of
for each of a number of different symbol subsequences, z(q), about symbol zq, that occur in the received noisy signal z,
counting a number of occurrences of each symbol at the corresponding positions p in signal r, rp, for positions p in the received noisy signal z at which z(p) is equal to z(q) and storing the counted number of occurrences in the one or more memories; and
for each of the number of symbol subsequences, z(q), in the received noisy signal z,
replacing symbol zq of subsequence z(q) in all occurrences of subsequence z(q), zp, in the noisy signal z with a replacement symbol zq′ which produces a minimal computed signal degradation.
10. The method of
11. The method of
where C(rp, zq′) is the degradation estimated for replacing the symbol rp at position p in the signal r with symbol zq′; and
p represents the positions in the signals r and z at which z(p) is equal to z(q).
12. The method of
13. The method of
The present invention relates to signal processing, and more particularly, to the correction of errors introduced into a signal by the transmission or processing of that signal.
The present invention can be more easily understood in terms of a simple exemplary system. Consider a telephone conversation in which a person talks into a microphone whose output is digitized and then transmitted to a second person via various telephone lines and switch systems. The speaker at the second person's location receives a sequence of digital values that are then played back to the second person. In general, the received sequence will differ from the transmitted sequence because of errors introduced by the transmission system, digital-to-analog converters, and analog to digital converters. For example, noise in the transmission system results in some of the digital values in the transmitted sequence being altered. One goal of a denoising system is to remove as many of these noise errors as possible.
The simple example discussed above is an example of a more general problem that is encountered in a wide range of applications. In general, an input digital signal that consists of a sequence of “symbols” is transmitted through a “communication link” and is received as an output digital signal at the output of the communication link. The output digital signal also consists of a sequence of “symbols”. Each of the symbols is chosen from a predetermined set of symbols, referred to as an alphabet. The output signal is assumed to be written in the same alphabet as the input signal.
In the simplest case, the signals are binary signals in which the alphabet consists of the symbols “0” and “1”. In this case the input and output signals consist of a sequence of 0s and 1s. However, other alphabets are commonly used. For example, a digitized signal in which each symbol is represented by an integer between 0 and M−1 is commonly used in broadband data transmission systems for connecting users to the Internet via a digital subscriber loop (DSL).
While the above examples refer to communication systems, it should be noted that this type of noise problem is present in a number of data processing systems. For example, the storage of data files on a magnetic disk drive can be viewed as the transmission of a digital signal through a communication link, the disk drive. The input signal is a sequence of symbols, e.g., bytes of data, which are chosen from a predetermined alphabet. In the case of byte data, each symbol has an integer value chosen from the set [0, 1, . . . , 255]. The retrieved file from the disk drive also consists of a sequence of symbols chosen from this set. The input signal symbols are processed by the electronics of the disk drive and stored in the form of localized magnetic fields that are read to generate the output signal. Noise in the digital to analog circuitry that converts the symbols to and from the magnetic fields introduces errors into the output signal. In addition, the magnetic fields can be altered during storage by random events that introduce additional errors.
Similarly, digital photography may be viewed as involving the transmission of a signal through a channel that corrupts the signal. In this case, the signal is the image, which is corrupted by noise in the photodetectors.
The present invention includes a method and apparatus for processing a received digital signal that includes a sequence of symbols that has been corrupted by a channel to generate a processed digital signal. The method includes storing the received digital signal and receiving a partially corrected sequence of symbols that includes an output of a preliminary denoising system operating on the received digital signal. Information specifying a signal degradation function that measures the signal degradation that occurs if a symbol having the value I is replaced by a symbol having the value J is utilized to generate a processed digital signal by replacing each symbol having a value I in a context of that symbol in the received digital signal with a symbol having a value J if replacement reduces a measure of overall signal degradation in the processed digital signal relative to the input digital signal as determined using the degradation function and the partially corrected sequence of symbols. The method can be practiced on a dedicated apparatus or on a general purpose data processing system.
The present invention provides a method for reducing the signal degradation resulting from the noise that is introduced into a digital signal when the signal is processed by a system that introduces noise errors. The processing system that introduces the noise will be referred to as the “channel” in the following discussion because such a system is analogous to a transmission channel over which the signal is sent.
Refer now to
It is assumed that a preliminary denoising system 120 operates on z to generate a first approximation to a denoised signal 24, r=r1, r2, . . . , rn by changing various members of the z sequence in a manner that is not known to those receiving r. Consider a subsequence of 2k+1 symbols in z that is centered about zq. Here, k is an integer. The manner in which k is chosen will be discussed in more detail below. Denote this subsequence by z(q). That is, z(q)=zq−k, zq−k+1, . . . , zq, zq+1, . . . , zq+k. The subsequence z(q) shall sometimes be referred to in what follows as the reference subsequence for index q. Assume that k is chosen such that this subsequence appears at a number of locations in z. That is, z(p)=z(q) for a number of different values of p. The present invention is based on the assumption that if the preliminary denoising system changes the value of zq, it should also change the value of zp in the same manner for each of the other occurrences of this subsequence.
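The grouping of positions p that share a reference subsequence can be sketched as follows. This is only an illustrative sketch: the function name `occurrences` is hypothetical, the signal is assumed to be held in a Python list, and positions are 0-indexed here whereas the text numbers symbols from 1.

```python
from collections import defaultdict


def occurrences(z, k):
    """Map each reference subsequence z(p) to the positions p where it occurs.

    Only positions with k symbols available on both sides are indexed.
    """
    index = defaultdict(list)
    for p in range(k, len(z) - k):
        # The 2k+1 symbols centered on position p form the reference subsequence.
        index[tuple(z[p - k:p + k + 1])].append(p)
    return index


z = [0, 1, 0, 0, 1, 0, 0, 1, 0]
index = occurrences(z, k=1)
```

In this small signal the subsequence 0, 1, 0 occurs at three positions, so z(p)=z(q) holds for three values of p.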
The present invention examines the output of the preliminary denoising system and determines a value to be assigned to zq and each of the zp's based on a measure of the signal degradation that occurs when a symbol is mistakenly replaced by another symbol. This resulting new sequence 22, z′, is then output from the present invention. The present invention assumes that there is a quantified measure of the degradation introduced into the output signal by replacing a symbol having the value A in the input signal by a symbol having the value B in the output signal. The degradation may be different for different values of A and B. In the following discussion this degradation measure will be referred to simply as the “degradation” and denoted by C(A,B).
In systems that utilize an alphabet that contains more than two symbols, C(A,B) will often depend on the difference between A and B. For example, consider a digital signal that is generated by converting an analog time varying signal to a sequence of digital values utilizing an 8-bit analog-to-digital converter. The resulting digital signal is a sequence of symbols chosen from an alphabet having 256 symbols corresponding to the digital values 0 through 255. Assume that the output signal is to be converted back into an analog signal and played back to a human observer. The error in the output signal resulting from a symbol being altered by 1 is usually much less than the error resulting from a symbol being altered by 2, and so on. Hence, the degradation function will depend on the amount by which the symbol is changed in this case.
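As an illustration only, a degradation function of this kind can be tabulated as a matrix indexed by the two symbol values. The sketch below assumes a squared-difference cost, which is one plausible choice and is not specified in the text; the name `squared_difference_cost` is hypothetical.

```python
def squared_difference_cost(alphabet_size):
    """Return C as a nested list with C[a][b] = (a - b) ** 2.

    Assumption: squared difference is only one possible degradation measure
    that grows with the amount by which a symbol is changed.
    """
    return [[(a - b) ** 2 for b in range(alphabet_size)]
            for a in range(alphabet_size)]


# Table for the 256-symbol alphabet of an 8-bit converter.
C = squared_difference_cost(256)
```

With this table, C[A][A] is zero and the cost of altering a symbol by 1 is smaller than the cost of altering it by 2.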
The manner in which the present invention defines the correct symbol to use in place of zq can be more easily understood with reference to
In the second part of the algorithm, the counts from the first part are used to estimate the degradation that would result in the signal for the various possible choices of symbol values to which zq could be changed. Consider the case in which zq is changed to the value K. The algorithm computes the degradation estimate D(K) as follows:
$$D(K) = \sum_{J} N(J)\, C(J,K)$$
as shown at 56, where N(J) is the number of positions p with z(p)=z(q) at which rp has the value J, and the sum runs over all symbol values J in the alphabet. The algorithm then sets z′q equal to Kmin, defined as the value of K for which D(K) has the minimum value.
The manner in which the algorithm alters the output of the preliminary denoiser can be more easily understood with reference to a simple example. Consider the case in which the cost of making an error is the same for all errors, i.e., C(I,J)=C0 for all I that are different from J. It should be noted that C(I,I)=0 for all I. In this case, D(K) will be S(K)C0, where S(K) is the sum of N(J) for J different from K. Now assume that N(1)>>N(J) for J different from 1. That is, in the vast majority of the cases, the preliminary denoiser substituted the value 1 for the symbol at the middle of each subsequence equal to z(q) in the noisy signal. In this case, D(K) will have its minimum value for K=1, since all of the other values of D(K) will include N(1) in the S(K) term. Hence, for this degradation function, the algorithm of the present invention sets the output z′p for all p for which z(p)=z(q) to that value taken on by the majority of the rp, the output of the preliminary denoiser, for such indices p.
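The two parts of the algorithm can be sketched in a few lines. This is a minimal sketch under stated assumptions, not a definitive implementation: the names `denoise` and `hamming` are hypothetical, D(K) is taken to be the sum over J of N(J)·C(J,K) (the form implied by the equal-cost example above), ties are broken in favor of the first symbol in `alphabet`, and symbols too close to either end to have a context are copied from r.

```python
from collections import Counter, defaultdict


def denoise(z, r, k, cost, alphabet):
    """Count r symbols per reference subsequence of z, then minimize D(K)."""
    n = len(z)
    counts = defaultdict(Counter)
    # First part: N(J) for each reference subsequence z(p).
    for p in range(k, n - k):
        counts[tuple(z[p - k:p + k + 1])][r[p]] += 1
    # Second part: for each subsequence choose K minimizing
    # D(K) = sum over J of N(J) * C(J, K).
    best = {}
    for ctx, n_j in counts.items():
        best[ctx] = min(
            alphabet,
            key=lambda K: sum(c * cost(J, K) for J, c in n_j.items()),
        )
    out = list(r)  # end symbols without a full context keep the value from r
    for p in range(k, n - k):
        out[p] = best[tuple(z[p - k:p + k + 1])]
    return out


def hamming(a, b):
    """Equal cost for every error, zero cost for no change."""
    return 0 if a == b else 1


z = [0, 1, 0, 0, 1, 0, 0, 1, 0]
r = [0, 0, 0, 0, 0, 0, 0, 1, 0]  # preliminary denoiser missed the last 0, 1, 0
z_prime = denoise(z, r, k=1, cost=hamming, alphabet=[0, 1])
```

In this example the preliminary denoiser replaced the center of 0, 1, 0 with 0 in two of three occurrences, so with the equal-cost function the majority value 0 is used at all three positions.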
The above-described embodiments utilized a 2k long sequence surrounding the symbol being processed to define the 2k+1 symbol reference subsequence whose instances in z and the corresponding symbols in r are examined to determine the output symbol that is to be used in place of the symbol being processed. To simplify the following discussion of the more general cases, it is useful to define a “context” for the symbol being processed. Consider a symbol in the output signal. A subsequence of symbols having fixed values and in a predetermined location with respect to that symbol will be referred to as the “context” of that symbol. In the preceding example, the context of the symbol zq was the k symbols on each side of zq. Denote the k symbols on the left of zq by a=a1, a2, . . . , ak and the k symbols on the right of zq by b=b1, b2, . . . , bk. Then the reference subsequence used to determine the replacement symbol for zq can be written as z(q)=azqb. It should be noted, however, that other contexts can be utilized in the present invention. For example, the sequence ending with the symbol zq, i.e., azq, could have been utilized. Similarly, the sequence beginning with zq, i.e., zqb, could have been utilized. Furthermore, the lengths of the sequences a and b could be different.
In addition, contexts in which the sequences a and/or b have “wild cards” can also be utilized. That is, a may be written in the form a1, a2, . . . , aw, . . . , ak, where aw can be a string of symbols in which the symbols in the string can take on any value. Similarly, the symbols of the context do not need to be adjacent to the symbol being processed as long as they are in a predetermined location relative to that symbol. The above general definition of the context of a symbol and the induced reference subsequence applies also to multi-dimensional signals such as two-dimensional image data.
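One way to represent such general contexts, offered only as a sketch, is a list of offsets relative to the symbol being processed. Offsets that are left out behave as wild cards. The function name `context_key` and the offset representation are assumptions made for illustration and do not appear in the text.

```python
def context_key(z, q, offsets):
    """Return the context of z[q] as the symbols at positions q + d, d in offsets.

    `offsets` must be non-empty. Relative positions not listed in `offsets`
    act as wild cards. Returns None if any position falls outside the signal.
    """
    positions = [q + d for d in offsets]
    if min(positions) < 0 or max(positions) >= len(z):
        return None
    return tuple(z[i] for i in positions)


z = [1, 2, 9, 4, 1, 3, 9, 4]
# Context: the symbol two places to the left and the symbol one place to the right.
# The symbol immediately to the left (offset -1) is a wild card.
key_a = context_key(z, 2, offsets=(-2, 1))
key_b = context_key(z, 6, offsets=(-2, 1))
```

Here positions 2 and 6 share the same context even though the symbols immediately to their left differ, because that location is a wild card.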
Refer again to
Controller 111 examines the sequences stored in memory 114 to determine if z(j−k′−1) has been received earlier. If not, controller 111 makes a new entry in memory 114 for the subsequence. The entry includes the L symbols that make up the subsequence and M counters for keeping track of the results from preliminary denoising system 120 for this sequence. Controller 111 then records the preliminary denoising system result in the appropriate counter. That is, controller 111 increments the counter corresponding to the symbol value rj−k′−1. When all of the symbols from both of the sequences z and r have been received and processed, the first pass is complete.
In the second pass, controller 111 sequentially goes through the stored z sequence and replaces each symbol with the symbol determined by the algorithm discussed above with reference to
It should be noted that the received signals z and r do not need to be stored in a high-speed memory. At any given time, controller 111 during the first pass needs L symbols from z, and only one symbol from r. Hence, the received signal can be stored on a disk drive with the exception of a small buffer for storing the L symbols currently being utilized. Only the context memory 114 needs to be a high-speed memory.
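The limited buffering described here can be sketched with a fixed-length window. The sketch below is illustrative only: `first_pass_streaming` is a hypothetical name, the context is assumed to be the k symbols on each side of the symbol being processed (so L = 2k + 1), both streams are assumed to have the same length, and a dictionary of counters stands in for the context memory.

```python
from collections import Counter, defaultdict, deque
from itertools import islice


def first_pass_streaming(z_stream, r_stream, k):
    """Count r symbols per context while holding only L = 2k + 1 symbols of z."""
    window = deque(maxlen=2 * k + 1)    # small buffer of the current L symbols
    counts = defaultdict(Counter)       # stands in for the context memory
    # The first k symbols of r have no full context, so they are skipped.
    r_iter = islice(r_stream, k, None)
    for symbol in z_stream:
        window.append(symbol)
        if len(window) == window.maxlen:
            center_r = next(r_iter)     # the single r symbol needed at this step
            counts[tuple(window)][center_r] += 1
    return counts


z = [0, 1, 0, 0, 1, 0, 0, 1, 0]
r = [0, 0, 0, 0, 0, 0, 0, 1, 0]
counts = first_pass_streaming(iter(z), iter(r), k=1)
```

Because the function consumes plain iterators, the two signals could be read sequentially from slow storage while only the window and the counters reside in fast memory.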
The above examples assume a value for L has been determined. The present invention provides the greatest benefits in those cases in which the received sequence z has reference subsequences that are repeated a statistically significant number of times so that the counter values corresponding to any such subsequence lead to an accurate characterization of the behavior of the preliminary denoiser. If the number of observed occurrences of the reference subsequences in the received sequence is small, the accuracy of the N(J) counts discussed above might be low, and hence, the accuracy of the estimates D(K) will likewise be low. If the accuracy of these counts is sufficiently low, the wrong decision with respect to the correct output symbol will be made.
The number of occurrences of a reference subsequence depends to some degree on the length of the context. Consider the case in which a symbol z having a context of length L−1 is to be processed as described above. Further assume that the corresponding reference subsequence azb appears Q times, where Q>>1 and Q/M>>1, but the longer reference subsequence tazb does not appear frequently for any value of t. Then a reference subsequence that is longer than L will have far fewer occurrences, and the statistical accuracy of the counts will be degraded relative to the case in which the smaller context was used. Hence, choosing too large a value for L can result in decision errors.
For any fixed L, the system can only exploit correlations among L samples or fewer in the input signal. The greater the extent of the input correlation that can be effectively exploited, the better the performance. In contrast to the above considerations, this argues against making L too small.
From the above discussion, it is clear that there is an optimum value of L. This optimum can be determined empirically. If the length of the correlated sequences in the input signal does not change markedly over time, an optimum value for L can be determined experimentally by utilizing exemplary input signals and comparing the results of denoising for various values of L.
In principle, L can be determined for any particular output signal by denoising the signal using a number of different L values. In such a system, the value of L can be decreased from some upper bound until a value that provides satisfactory statistical accuracy is found. A reasonable starting value for L is given by [log(n)/log(M)], where n is the number of symbols in the z sequence and M is the number of symbols in the alphabet.
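The starting value can be computed directly from n and M. In the sketch below the function name `starting_context_length` is hypothetical, and the ratio is returned unrounded because the rounding implied by the bracket notation is not recoverable from the text; the caller would round it to an integer in whichever direction is intended.

```python
import math


def starting_context_length(n, M):
    """Return log(n) / log(M) for a signal of n symbols over an M-symbol alphabet.

    The result is not rounded; rounding to an integer L is left to the caller.
    """
    return math.log(n) / math.log(M)


# One million byte-valued symbols: roughly 2.5, so L of 2 or 3 is a starting point.
L_start = starting_context_length(1_000_000, 256)
```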
Refer now to
If the statistical accuracy of the counts for this reference subsequence is too low, controller 111 looks for a smaller context as shown at 158. If such a context is present, the associated reference subsequence is chosen and the process repeated as shown at 160 and 152. If no smaller context is available, z′j is set to rj, i.e., the value provided by the preliminary denoising system as shown at 159. The process continues by incrementing j as shown at 157 and repeating the process until all of the symbols that are to be processed have been processed. As noted above, the symbols on the ends of the sequence z′ that are too close to an end to have a context are set to the values in the corresponding positions in the sequence r.
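The fallback from larger to smaller contexts can be sketched as follows. This is a sketch under several assumptions that are not taken from the text: contexts are symmetric with half-widths listed in `k_values`, statistical accuracy is judged by a simple minimum number of occurrences `min_count`, and the names `build_counts` and `denoise_with_fallback` are hypothetical.

```python
from collections import Counter, defaultdict


def build_counts(z, r, k):
    """Count the r symbols observed at the center of each 2k+1 subsequence of z."""
    counts = defaultdict(Counter)
    for p in range(k, len(z) - k):
        counts[tuple(z[p - k:p + k + 1])][r[p]] += 1
    return counts


def denoise_with_fallback(z, r, k_values, cost, alphabet, min_count):
    """Try the largest context first and fall back to smaller ones, then to r."""
    tables = {k: build_counts(z, r, k) for k in k_values}
    out = list(r)  # default: the value provided by the preliminary denoiser
    for j in range(len(z)):
        for k in sorted(k_values, reverse=True):
            if j - k < 0 or j + k >= len(z):
                continue  # too close to an end for this context
            n_j = tables[k][tuple(z[j - k:j + k + 1])]
            if sum(n_j.values()) >= min_count:  # assumed accuracy criterion
                out[j] = min(
                    alphabet,
                    key=lambda K: sum(c * cost(J, K) for J, c in n_j.items()),
                )
                break
    return out


z = [0, 1, 0, 0, 1, 0, 0, 1, 0]
r = [0, 0, 0, 0, 0, 0, 0, 1, 0]
equal_cost = lambda a, b: 0 if a == b else 1
z_prime = denoise_with_fallback(z, r, [2, 1], equal_cost, [0, 1], min_count=3)
```

In this example no five-symbol subsequence occurs three times, so the three-symbol context is used where it has enough occurrences and the value from r is kept elsewhere.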
The above-described embodiments of the present invention have utilized a denoising apparatus that directly processes the received signal and has specific memories for use in storing the various parameters, contexts, and degradation functions. However, the present invention can be practiced on a general-purpose data processing system to which a copy of the received signal from the channel and a copy of the output of the preliminary denoising system have been transferred by loading an appropriate data processing program into that data processing system. Embodiments in which the preliminary denoising system operates on the same data processing system can also be practiced.
The above-described embodiments utilize separate memories for storing the degradation function, list of contexts, and the received signals. However, embodiments in which a single memory is used to store two or more of these quantities can also be constructed without departing from the teachings of the present invention. Accordingly, it is to be understood that the separate memories discussed above can be part of a larger memory.
Various modifications to the present invention will become apparent to those skilled in the art from the foregoing description and accompanying drawings. Accordingly, the present invention is to be limited solely by the scope of the following claims.
Seroussi, Gadiel, Weinberger, Marcelo, Ordentlich, Erik, Weissman, Itschak, Verdu, Sergio
Patent | Priority | Assignee | Title |
5533033, | Dec 19 1994 | The United States of America as represented by the Director, National | Device for and method of correcting errors in formatted modem transmissions |
5596565, | Mar 19 1994 | Sony Corporation | Method and apparatus for recording MPEG-compressed video data and compressed audio data on a disk |
5764658, | Oct 29 1993 | Mitsubishi Denki Kabushiki Kaisha | Data receiving apparatus and method |
6058216, | Sep 03 1996 | I-CHIPS TECHNOLOGY INCORPORATED | Apparatus for encoding image data |
6161209, | Mar 28 1997 | Her Majesty the Queen in right of Canada, as represented by the Minister | Joint detector for multiple coded digital signals |
6199186, | Apr 03 1998 | Infineon Technologies AG | Screening for undetected errors in data transmission systems |
6307487, | Sep 23 1998 | Qualcomm Incorporated | Information additive code generator and decoder for communication systems |
6848080, | Nov 05 1999 | Microsoft Technology Licensing, LLC | Language input architecture for converting one text form to another text form with tolerance to spelling, typographical, and conversion errors |
7047472, | Oct 17 2003 | Hewlett Packard Enterprise Development LP | Method for correcting noise errors in a digital signal |
7266466, | Mar 28 2002 | Koninklijke Philips Electronics N V | Watermark time scale searching |
20040010746, | |||
20040085917, | |||
JP10079938, | |||
JP2005124198, | |||
JP7177199, | |||
RE38871, | Oct 12 1994 | Mitsubishi Denki Kabushiki Kaisha | Data receiving apparatus and method |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Jan 20 2004 | ORDENTLICH, ERIK | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 015309 | /0358 | |
Jan 20 2004 | SEROUSSI, GADIEL | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 015309 | /0358 | |
Jan 20 2004 | WEINBERGER, MARCELO | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 015309 | /0358 | |
Jan 26 2004 | Hewlett-Packard Development Company, L.P. | (assignment on the face of the patent) | / | |||
Mar 05 2004 | WEISSMAN, ITSCHAK | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 015309 | /0358 | |
Jun 17 2004 | VERDU, SERGIO | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 015309 | /0358 |
Date | Maintenance Fee Events |
Oct 31 2014 | REM: Maintenance Fee Reminder Mailed. |
Mar 22 2015 | EXP: Patent Expired for Failure to Pay Maintenance Fees. |
Date | Maintenance Schedule |
Mar 22 2014 | 4 years fee payment window open |
Sep 22 2014 | 6 months grace period start (w surcharge) |
Mar 22 2015 | patent expiry (for year 4) |
Mar 22 2017 | 2 years to revive unintentionally abandoned end. (for year 4) |
Mar 22 2018 | 8 years fee payment window open |
Sep 22 2018 | 6 months grace period start (w surcharge) |
Mar 22 2019 | patent expiry (for year 8) |
Mar 22 2021 | 2 years to revive unintentionally abandoned end. (for year 8) |
Mar 22 2022 | 12 years fee payment window open |
Sep 22 2022 | 6 months grace period start (w surcharge) |
Mar 22 2023 | patent expiry (for year 12) |
Mar 22 2025 | 2 years to revive unintentionally abandoned end. (for year 12) |