A method and apparatus for the elimination of color from a multi-color image document is described. All color information for every picture element of the image of the document is provided concurrently and for every picture element PEL the image signal of all provided colors are analyzed and for every picture element that image signal of that color is selected which has the minimum contrast relative to the background of the image document.
|
1. Apparatus for elimination of color from a multi-color image of a document having an image and a background comprising:
a) means for providing concurrently signals for all color information of each picture element of the image of said document; b) means for determining a maximum signal value for all color information signals for analyzing said color information signal of all input colors of each picture element simultaneously and for outputting only said signal having the determined maximum value; c) means for converting said signal having a maximum value to a value representative of either a black or white picture element with the color dropped out of said image; d) means for assembling a data base of the values representing all picture elements of an image with the color dropped out of said image, for character recognition, and e) a weighting adder to which said color information signals are input, said adder combining and weighting said color information signals for providing a weighted color grey value for archiving said image.
7. A method of eliminating color from multi-color image documents having an image and a background, in processing of said document for optical character recognition, comprising the steps of:
a) providing concurrently all color information signals for each picture element of said image of said document; b) analyzing said color information signals of all colors for each picture element; c) selecting a color information signal for each picture element which has the minimum contrast relative to said background of said document; d) outputting only said selected color information signal for each picture element having said minimum contrast; e) converting said selected color information signal to a value representative of the picture element with the color dropped out of the image; f) assembling a data base of the values representing the picture elements with the color dropped out forming a data base representative of a black and white image, and g) weighting and combining said color image signals into one output, providing a weighted color grey signal for archiving purposes.
0. 9. Apparatus for elimination of representations of color from a scanned data collection electronically representing a multi-color document, said document having an image and a background comprising:
a) means for providing concurrently signals for all color information of each picture element of the image of said document; b) means for determining a maximum signal value for all color information signals for analyzing said color information signals of all input colors of each picture element simultaneously and for outputting only said color information signal having said determined maximum value; c) means for converting said signal having said maximum value to a digital value representative of either a black or white picture element with the color dropped out of the image, and d) means for assembling a data base of said digital values representing all picture elements of said document image with the color dropped out of the image, for character recognition, thereby converting said scanned data collection to a data base of digital values representing said document image in a black and white form with all color representation eliminated.
6. A method of eliminating color from multi-color image documents having an image and a background, in processing of said document for optical character recognition, comprising the steps of:
a) providing concurrently all color information signals for each picture element of said image of said document, said providing performed simultaneously; b) analyzing said color information signals of all colors for each picture element; c) selecting a color information signal for each picture element which has the minimum contrast relative to said background of said document; d) outputting only said selected color information signal for each picture element having said minimum contrast; e) converting said selected color information signal to a value representative of the picture element with the color dropped out of the image; f) assembling a data base of the values representing the picture elements with the color dropped out forming a data base representative of a black and white image, wherein said simultaneously providing is performed by projecting an image of a picture element onto an inclined one line ccd color image sensor providing an approximately simultaneous output of the primary color component signals, wherein said inclined sensor is inclined by approximately 45 degrees.
8. A method of eliminating color from multi-color image documents having an image and a background, in processing of said document for optical character recognition, comprising the steps of:
a) providing concurrently all color information signals for each picture element of said image of said document, said providing concurrently all color information performed in a time multiplexed manner; b) analyzing said color information signals of all colors for each picture element; c) selecting a color information signal for each picture element which has the minimum contrast relative to said background of said document; d) outputting only said selected color information signal for each picture element having said minimum contrast; e) converting said selected color information signal to a value representative of the picture element with the color dropped out of the image; f) assembling a data base of the values representing the picture elements with the color dropped out forming a data base representative of a black and white image; said time-multiplex manner comprising scanning each picture element of an image for each primary color component and delaying results of each primary color component scan until all color scan results of each pel are available; and g) weighting and combining said color image signals into one output, providing a weighted color grey signal for archiving purposes.
2. The apparatus of
3. The apparatus of
4. The apparatus of
5. The apparatus of
0. 10. The apparatus of
0. 11. The apparatus of
0. 12. The apparatus of
0. 13. The apparatus of
|
This application is a continuation of U.S. patent application Ser. No. 08/135,820, filed Oct. 13, 1993, now abandoned.
This invention pertains to a method and an apparatus for the elimination of color from multi-color image documents.
For character recognition purposes it is very advisable if not vital, to eliminate or to drop-out all image information which is redundant, such as for example the pre-printed form. Only the filled-in information is to be used for the optical recognition process. To facilitate the forms drop-out, the banking industry uses colored forms printed in colors such as red, green, yellow, and blue.
Usually optical readers of multi-font recognition capability in the banking and retail industries are equipped with optical drop-out color filters. These filters are placed in the optical path on a mechanical filter bank. It is known in the art to shift this filter bank by mechanical means such that the right filter is placed in the optical path, i.e. red, green, or blue color filter, for example, is placed in the optical path. The operator of such optical reading equipment has first to analyze the actual document in order to assess which drop-out color is to be used. Then the applicable color filter has to be manually adjusted using a slider in the optical path. So, for a red pre-printed form, a red color filter will be used. Due to the rules of physics the contrast of the pre-printed areas will strongly be reduced, or will be dropped-out but the filled-in information which is not printed and written in the same color, in this example red, remains untouched on the document. In most cases the contrast of the filled-in information is increased related to the background color. This afore-mentioned described procedure has to be performed for each color depending on the color of the pre-printed form.
It is known that the same effect is achieved with color lamps and color cameras if only one color is used at a time.
The above described simple methods fail totally if multi-color background is used, as for example especially in euro-cheques. If there for example a red drop-out color is used, then the red lines are dropped-out but not the green and the blue lines. In case of a blue filter, the blue lines are dropped-out but not the red and the green ones. In case of a green filter, the green lines are dropped-out but not the red and the blue ones. Thus, the background is not totally dropped-out.
There are other types of forms drop-out methods such as forms subtraction and spatial filtering methods. In forms subtraction the image of the empty pre-printed form is in a way subtracted from the filled-in form of the same type. In this method a mask matching method is used. This method has certain sensitivity regarding form shrinkage or expansion, form rotation, tolerances in the printing, and it shows a strong dependency on resolution. As a result usually there are still some parts of the background remaining on the document.
In forms drop-outs by spatial filtering methods there are used spatial frequency filters, logical filters and density filters, as described in many books and articles about digital image processing. These filters can reduce but not really drop-out the background without destroying the filled-in information to a certain extent, so-called erosion of the information, or with parts of the background left, so-called artifacts.
As a conclusion it can be said that traditional color drop-out methods as described before for mono color pre-printed forms are most efficient but fail for multi-color documents.
Therefore the objects of the present invention are to provide a method and an apparatus which efficiently remove or eliminate multi-color images contained on documents without using the handling of mechanical filter adjustment or the like.
This object as well as other objects are solved advantageously basically by applying the features described herein. Further advantageous embodiments are also described.
In accordance with the present invention color picture information is used, then for every single picture element all the image signals of all colors, normally the three colors red, green, and blue are analyzed, and finally by a special electronic or logical set-up automatically only the image signal of that color which has the minimum contrast relative to the document background are selected. For the analyzing purpose the color information is concurrently provided for every picture element of the image.
By this inventive basic solution an automatic picture element related color drop-out or elimination from the background is provided. This is performed without using any mechanical adjustments of filters as in the prior art.
In accordance with an advantageous further development of the present invention the color image signals are weighted and combined to a secondary output to provide a grey scale signal for archiving purposes of the image document.
Further advantages and details will be apparent from the following more detailed description of the invention given in conjunction with the embodiments shown in the drawing.
In accordance to a preferred further embodiment of the present invention the color data on lines R, G, and B from the color data sources 2, 3, and 4 are input to an adder 6. On output line 7 of adder 6 a signal WCG indicates the weighted color grey data output from adder 6 after having added the weighted colors. This is performed to have a grey value of each picture element for archiving purposes to thus be able to archive the image document.
The method underlining the schematic block diagram of
In the schematic diagram of
The picture related output value DOC can be described in a more general way by the following equation:
The drop-out color value DOCij for the i-th row and the j-th column is the maximum value for all available color values C. In the most general case this could be n colors and C would be Ck=1 to n, i, j for every picture element PEL. In this general case the general expression can be expressed by the following equation:
The drop-out color data DOC are used preferably for optical character recognition purposes. The output data WCG on line 7 provide the weighted color grey data. These data are a result of a weighted averaging process. It can be expressed generally by the following equation:
The coefficients aB, aG and aR indicate the different weights for the different color data R, G, and B. If they are equal to 1 then WCG is the normal average of the normalized color values. In the most general case for n colors the following equation can be used:
A practical application is this, if e.g. all coefficients are set to 0 except one, than the signal WCG on line 7 of the arrangement as shown in
In
Similar to
In
In
It is the task of transputer 410 within maximum finder 41 to find in accordance with equation (1) and (2) above, the maximum of the three applied color values for each single picture element and it also runs a dynamic thresholding algorithm to provide the signal as a 1-bit black or white data on output line 45. Transputer 460 may run in parallel to speed-up the overall complete process. This transputer 460 weights, as already described, the color data according to the equation (3) and (4) above, and thus provides the WCG data. The programs to run the different tasks are loaded into the transputers. It might be possible in a stripped-down version to run both tasks on only one transputer. It then has to run the task of maximum finding, dynamic thresholding, and weighting and combining the colors to one grey value.
In
After the frame buffers for all colors are stored the described three functions of maximum finding, dynamic thresholding, and weighting of colors can be performed synchronized to each other.
In
The result is indicated by data set DROP and reference numeral 610. This data set is used in further steps.
Furthermore, in step 62 the weighted color grey data for each picture element and every color is calculated in accordance with the equation:
This result is indicated by data set WCG and reference numeral 620.
This means that by this the weighted color grey value WCG for archiving purposes of every picture element and thus the overall total image is determined. Next in step 63 using as input data set DROP 610, there is calculated the dynamic threshold THRij for every picture element PEL. In step 64 it is decided if the value DROPij<THRij resulting in the drop-out color data for every picture element if i runs from 1 to n and j runs from 1 to m. The resulting data set DOC is indicated by reference numeral 640.
It is necessary for the working of the present invention to have concurrently all color information for every picture element of the image. This can be achieved by several methods, be it simultaneously or by a time multiples method.
An example for the simultaneous method is given by a color camera that provides the primary color components, normally red R, green G, and blue B for each picture element PEL simultaneously. An example of the principle working arrangement of such a camera is shown in FIG. 7. Basically camera 70 contains a focusing lens system 72 which focuses the incoming image, indicated by arrow 71, onto a first semi-transparent mirror 73. The reflected part is then again reflected at mirror 75 to form, for example, the blue string. That part of image 71 which is not reflected from semi-transparent mirror 73 is partly reflected at a second semi-transparent mirror 74 onto mirror 76 to form for example the red string. The part which passes semi-transparent mirror 74 forms for example the green string. All three images raised from the blue string, the green string, and the red string for example are focused by objective lens systems 751, 741, and 761, onto a blue filter 752 or a green filter 742 or a red filter 762 respectively. Those filtered images are sensed by matrix sensors which might be in charged coupled device technology or television technology. Sensor 753 is to issue, for example on output line 754, the blue signal B, or sensor 743 which images the green signal G on output line 744 or sensor 763 which outputs the red signal R on output line 764. Thus, at the output the three basic colors B, G, and R are present and can be used for further application. The signals are then for example input to maximum finder 1 in
Another example for the simultaneously gaining color information for every single picture element PEL is shown in FIG. 8.
In the time multiplex method for providing the color information concurrently to the maximum finder 1, as for example in
This operation, however, is only applicable when the forms feed pitch is synchronized with the scan period. For that purpose a schematic representation of the use of delay lines for synchronization is shown in
The method in accordance with the present invention and the different embodiments for realizing this method have been shown in the figures. Furthermore, there has been shown alternate embodiments for providing concurrently color information for every single picture element.
In advantageous manner the invention as described is able to remove color from multi-color image documents, especially multi-color cheques, to thus prepare the optical character recognition of those documents. Thus, it is easier to concentrate on all the filled-in information. Important aspects of the present invention are that concurrently all color information for every picture element is provided, that for every picture element the image signals of all provided colors are analyzed and that finally for every picture element that image signal is selected which has the minimum contrast to its background or in other words which has the maximum value in the sense of being the brightest signal.
Patent | Priority | Assignee | Title |
7153378, | Nov 21 2003 | 2502851 ONTARIO LIMITED | Product labelling |
8385811, | Feb 11 2003 | Data Recognition Corporation | System and method for processing forms using color |
8892895, | May 07 2002 | Data Recognition Corporation | Integrated system for electronic tracking and control of documents |
Patent | Priority | Assignee | Title |
3938088, | Dec 30 1971 | Xerox Corporation | Character coding and recognition system |
4547896, | Jun 29 1981 | Tokyo Shibaura Denki Kabushiki Kaisha | Printed matter identifying apparatus |
4547897, | Feb 01 1983 | Honeywell Inc. | Image processing for part inspection |
5041993, | Jun 12 1987 | Smiths Industries Public Limited Company | Method of processing sub-images of an image field |
5070531, | Nov 06 1989 | OCE-NEDERLAND B V | Method of and means for processing image data |
5220620, | Oct 09 1989 | Fujitsu Limited | Color image data processing apparatus |
5241609, | Feb 05 1990 | Konica Corporation | Marker color conversion apparatus for color image processing device |
5283698, | Apr 08 1991 | Canon Kabushiki Kaisha | Image reading apparatus |
5285271, | May 14 1991 | Hewlett-Packard Company | Digital color matrixing circuit |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Apr 02 1999 | International Business Machines Corp. | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
Sep 22 2004 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Oct 20 2008 | REM: Maintenance Fee Reminder Mailed. |
Apr 10 2009 | EXP: Patent Expired for Failure to Pay Maintenance Fees. |
Date | Maintenance Schedule |
Oct 14 2006 | 4 years fee payment window open |
Apr 14 2007 | 6 months grace period start (w surcharge) |
Oct 14 2007 | patent expiry (for year 4) |
Oct 14 2009 | 2 years to revive unintentionally abandoned end. (for year 4) |
Oct 14 2010 | 8 years fee payment window open |
Apr 14 2011 | 6 months grace period start (w surcharge) |
Oct 14 2011 | patent expiry (for year 8) |
Oct 14 2013 | 2 years to revive unintentionally abandoned end. (for year 8) |
Oct 14 2014 | 12 years fee payment window open |
Apr 14 2015 | 6 months grace period start (w surcharge) |
Oct 14 2015 | patent expiry (for year 12) |
Oct 14 2017 | 2 years to revive unintentionally abandoned end. (for year 12) |