The present invention relates to devices and methods for the measurement and/or for the specification of the perceptual intensity of a visual image, or the perceptual distance between a pair of images. Grayscale test and reference images are processed to produce test and reference luminance images. A luminance filter function is convolved with the reference luminance image to produce a local mean luminance reference image. Test and reference contrast images are produced from the local mean luminance reference image and the test and reference luminance images respectively, followed by application of a contrast sensitivity filter. The resulting images are combined according to mathematical prescriptions to produce a just noticeable difference (JND) value indicative of a Spatial Standard Observer (SSO). Some embodiments include masking functions, window functions, special treatment for images lying on or near borders and pre-processing of test images.
24. A method of performing confined convolution, the method comprising:
receiving an image;
padding the image with zeros to provide a first intermediate image;
convolving the first intermediate image with a selected non-negative kernel function to obtain a second intermediate image;
cropping the second intermediate image to obtain a third intermediate image;
receiving said third intermediate image, I3(x,y)=PCC{K(x,y),I(x,y)}; and
forming a fourth intermediate image,
defined as $I_4(x,y) = K(x,y) \otimes_c I(x,y) = \mathrm{PCC}\{K(x,y),\, I(x,y)\} \,/\, \mathrm{PCC}\{K(x,y)/\sum_x \sum_y K(x,y),\, I(x,y)\}$.
1. A method of processing an image, the method comprising:
producing a test image;
producing a test luminance image from the test image;
producing a reference image;
producing a reference luminance image from the reference image;
producing a local mean luminance reference image as a convolution of the reference luminance image and a luminance filter function;
producing a test contrast image in the absence of temporal filtering;
producing a reference contrast image;
producing a difference image; and
producing a just noticeable difference image as a mathematical combination of the difference image,
wherein the convolution is defined as confined convolution, which comprises:
receiving an image;
padding the image with zeros to provide a first intermediate image;
convolving the first intermediate image with a selected non-negative kernel function to obtain a second intermediate image;
cropping the second intermediate image to obtain a third intermediate image;
receiving said third intermediate image, I3(x,y)=PCC{K(x,y),I(x,y)}; and
forming a fourth intermediate image,
defined as $I_4(x,y) = K(x,y) \otimes_c I(x,y) = \mathrm{PCC}\{K(x,y),\, I(x,y)\} \,/\, \mathrm{PCC}\{K(x,y)/\sum_x \sum_y K(x,y),\, I(x,y)\}$.
2. A method of spatially processing an image, the method comprising:
spatially producing a test image with a test image dimension of nx pixels in the x direction (width) and ny pixels in the y direction (height) having Gtest (x,y) which is defined to be the grayscale of the pixel at column x and row y;
spatially producing a reference image with a reference image dimension of nx pixels in the x direction (width) and ny pixels in the y direction (height) having Greference (x,y) which is defined to be the grayscale of the pixel at column x and row y;
wherein spatially producing the test and reference images includes:
providing viewing angles subtended in each image in the x and y directions defined by sx and sy respectively, the viewing angles sx, sy can be derived from a viewing distance and an image size in a display by the equation as follows, once to compute sx and once to compute sy:
$\tan\{\pi \cdot \mathrm{size(degrees)}/360\} = (0.5 \cdot \mathrm{size})/(\mathrm{viewing\ distance})$; and
providing a width and height for each pixel, px and py as follows:
producing a test contrast image;
producing a reference contrast image;
producing a difference image; and
producing a just noticeable difference image as a mathematical combination of the difference image.
26. A method of spatially processing an image, the method comprising:
producing a spatial test image with a test image dimension of nx pixels in the x direction (width) and ny pixels in the y direction (height) having Gtest (x,y) which is defined to be the grayscale of the pixel at column x and row y;
producing a spatial reference image with a reference image dimension of nx pixels in the x direction (width) and ny pixels in the y direction (height) having Greference (x,y) which is defined to be the grayscale of the pixel at column x and row y;
wherein spatially producing the test and reference images includes:
providing viewing angles subtended in each image in the x and y directions defined by sx and sy respectively, the viewing angles sx, sy can be derived from a viewing distance and an image size in a display by the equation as follows, once to compute sx and once to compute sy:
$\tan\{\pi \cdot \mathrm{size(degrees)}/360\} = (0.5 \cdot \mathrm{size})/(\mathrm{viewing\ distance})$; and
providing a width and height for each pixel, px and py as follows:
producing a test contrast image;
producing a reference contrast image;
producing a difference image; and
producing a just noticeable difference image as a mathematical combination of the difference image with a window function.
25. A method of processing a spatial image, the method comprising:
producing a spatial test image with a test image dimension of nx pixels in the x direction (width) and ny pixels in the y direction (height) having Gtest (x,y) which is defined to be the grayscale of the pixel at column x and row y;
producing a spatial reference image from the spatial test image with a reference image dimension of nx pixels in the x direction (width) and ny pixels in the y direction (height) having Greference (x,y) which is defined to be the grayscale of the pixel at column x and row y;
wherein spatially producing the test and reference images includes:
providing viewing angles subtended in each image in the x and y directions defined by sx and sy respectively, the viewing angles sx, sy can be derived from a viewing distance and an image size in a display by the equation as follows, once to compute sx and once to compute sy:
$\tan\{\pi \cdot \mathrm{size(degrees)}/360\} = (0.5 \cdot \mathrm{size})/(\mathrm{viewing\ distance})$; and
providing a width and height for each pixel, px and py as follows:
producing a test contrast image;
producing a reference contrast image;
producing a difference image; and
producing a just noticeable difference image as a mathematical combination of the difference image.
3. The method of
4. The method of
5. The method of
6. The method of
7. The method of
8. The method of
9. The method of
10. The method of
12. The method of
13. The method of
14. The method of
15. The method of
16. The method of
17. The method of
18. The method of
19. The method of
20. The method of
21. The method of
22. The method of
23. The method of
the test image having first and second opposing sides; and
performing a convolution of the test image with a selected filter that isolates the first and second opposing sides of the test image from each other, to thereby form the reference image.
29. The method of
This application is a continuation of prior application Ser. No. 11/045,041, filed Jan. 24, 2005, now U.S. Pat. No. 7,783,130.
The invention described herein was made by employees of the United States Government and may be manufactured and used by or for the Government for governmental purposes without payment of any royalties thereon or therefor.
3.a. Technical Field of the Invention
This invention relates generally to the field of devices and methods for the specification and measurement of the perceptual intensity of one or more visual images and, more particularly, to the rapid and efficient determination of a visibility metric for such images.
3.b. Description of the Prior Art
Vision is the means by which most people acquire and process information about the world around them. Numerous objects intended for human use include a component of information to be identified visually by a human observer. Some everyday examples include information displayed on a screen or page, and keys or buttons to be pressed on a keyboard, telephone, calculator or remote control unit, among many other examples. Therefore, it is reasonable that the design of such objects include specifications to ensure that the visual information is accessible to typical human observers, that is, that the information is visible. Providing a means for measuring and specifying visibility, a “visibility metric,” is an objective of the present invention.
A significant challenge in designing standards for visibility is that such standards are based upon models of the human visual sense. However, vision is a complex and only partially understood process. Previous standards for visibility have thus tended to be complex, difficult to use and not sufficiently general to serve as a standard method or methods for the specification and measurement of visibility. The performance of various visibility metrics has been reviewed by Ahumada and coworkers in two publications: Society for Information Display, International Symposium, Digest of Technical Papers Vol. 24, pp. 305-308 (1993) and Vol. 26, pp. 45-48 (1995). The contents of both publications are incorporated herein by reference.
Other examples of visibility metrics include the work of Lubin and co-workers: U.S. Pat. No. 6,654,504; US Patent Application Publication 2002/0031277; and “A Human System Model for Objective Picture Quality Measurements,” Proceedings, International Broadcasters' Convention, Amsterdam, The Netherlands, pp. 498-503 (1997). These methods developed by Lubin and co-workers require extensive calibration for each application in addition to suffering from the disadvantage of complexity. These methods are chiefly intended for image quality evaluation.
Other methods for estimating visibility include those of Barten, “The SQRI Method: A New Method for the Evaluation of Visible Resolution on a Display,” Proceedings of the Society for Information Display, Vol. 28, pp. 253-262 (1987). In addition to complexity, the Barten method suffers from the further disadvantage of being appropriate primarily for the specification of displays such as television monitors.
Standards for the measurement and specification of color are known in the art and widely used. However, such color standards typically do not address the spatial pattern employed in a visual signal (for example, the shape of a letter). Consequently, such methods are not appropriate for specifying or measuring visibility.
Thus, a need exists in the art for a standard specification and measurement of visibility, sufficiently general to be applicable to large classes of visual information but sufficiently simple for widespread implementation and use, including embedding into inexpensive systems.
Accordingly and advantageously, the present invention relates to systems and techniques for processing visual information to produce a single numerical value for the visibility metric indicative of a Spatial Standard Observer (SSO). Advantages of the SSO include a simple and efficient design that produces an accurate visibility metric with relatively few calculations.
Some embodiments of the present invention use a Minkowski sum directly over filtered image pixels. This technique avoids the need for complicated spatial frequency filter banks, with a corresponding gain in simplicity and computational efficiency.
A particular form of Contrast Sensitivity Filter (CSF) is used in some embodiments of the present invention which combines radial- and oblique-effect filters. This permits accurate predictions of the visibility of oblique patterns such as half-toning and rasterizing artifacts.
Viewing distance and image resolution are jointly treated in an advantageous manner in some embodiments of the present invention. The use of this feature causes the computed value of image visibility to be substantially independent of image resolution (except to the extent that the resolution actually alters the visibility of the information in the image).
A window function is advantageously employed in some embodiments of the present invention in such a manner as to represent the reduction in visibility with distance from the observer's region of fixation.
It is advantageous in some embodiments of the present invention to use convolution operations along with the window function. In this manner it is feasible to simulate the scanning of an image by the eye of the observer.
Pooling the data accumulates the visibility over the scan and is advantageously employed in some embodiments of the present invention.
When images are located near a border region, it may occur that the border has a markedly different intensity (typically darker) than that of the image and the general image background. In such cases, it is advantageous in some embodiments of the present invention to introduce special procedures for handling border effects. Two examples are presented. One includes at least a portion of the border in the definition of “image,” leading to an enhanced image that is then processed by the SSO. Another approach is to attenuate the image contrast near the border.
The SSO provides a standardized measure of visibility, allowing comparisons to be made of visibility measurements taken in a wide variety of applications, locations and times. Manufacturing and engineering specifications of visibility in standardized units can then be made.
Furthermore, SSO visibility measurements are not limited by target size. Thus, very large or very small displays can use SSO.
The SSO further provides the feasibility of making simple, automated measurements of the visibility of visual information, not requiring the use of human observers to estimate visibility. Simplicity of measurement is an important feature of SSO in order to allow SSO to be adopted in a wide variety of applications and at low cost.
SSO has numerous potential areas of application. We note a few applications as illustrative of the utility of SSO, not thereby limiting the scope of SSO to only those enumerated. Many other applications are apparent to those with ordinary skill in the art, within the scope of the present invention. Possible applications include:
These are among the advantages achieved in accordance with various embodiments of the present invention as described in detail below.
To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures.
The techniques of the present invention can readily be understood by considering the following detailed description in conjunction with the following drawings, in which:
After considering the following description, those skilled in the art will clearly realize that the teachings of the invention can be readily utilized for determining the probable visibility of various graphical or visual depictions and displays as viewed by a typical human observer. In particular, the present invention relates generally to systems and techniques for processing one or more images to produce a single numerical value, or “visibility metric,” indicative of a “Spatial Standard Observer” (SSO). Advantages of the present invention include techniques for the rapid evaluation of the SSO.
The present invention relates generally to devices and methods for the measurement and/or for the specification of the perceptual intensity of a visual image. Other embodiments relate generally to devices and methods for the measurement and/or for the specification of differences in perception or “perceptual distance” between two or more visual images. Such devices and methods can be advantageously used in situations in which it is desired to measure or to specify visibility or visual intensity. Examples include the determination of visibility and/or discriminability of text, graphic elements, labels, icons, among other visual images. Examples also include the determination of visibility and/or discriminability between images, such as an original image and a compressed digital form of that image. Some embodiments of the present invention can also be advantageously used to quantify the visibility of blemishes on a display as might be useful, for example, in providing objective determinations of pass/fail criteria in the manufacture of displays.
In essence, various embodiments of the present invention operate on a digital image (or an analog image following digitization) or on a pair of digital images. An arbitrary number of images can be compared by repeated pairwise comparisons. Thus, for economy of language we will describe applications of the present invention to a single digital image or to the comparison of two digital images, understanding that this is by way of illustration and not limitation since multiple images can be handled by multiple applications of such pairwise comparisons. Analogue images can be handled within the scope of the present invention following digitization by any of numerous digitization techniques well-known in the art, such as use of a digital camera, a scanner, among other devices and digitization techniques known in the field.
In the comparison of two digital images, it is advantageous in some embodiments of the present invention to pre-process the images to erase any inessential differences before presenting them as input to the SSO. Such removal of inessential differences can improve the speed of SSO processing, further enhancing the range of potential applications amenable to SSO processing.
Also, by way of illustration and not limitation, it will be presumed in our descriptions that the images are viewed on a particular display called the reference display, and viewed at a particular viewing distance. Techniques are well-known in the art for translating an image on a non-reference display into a digital representation as it would appear on the reference display, and for translating from an arbitrary viewing distance and angle to a standard viewing distance and angle.
Typical inputs in the construction of a Spatial Standard Observer (SSO) are two digital images having (or scaled so as to have) the same size, called herein a test image and a reference image. G(x,y) is defined to be the grayscale of the pixel at column x and row y; Gtest(x,y), Greference(x,y) for the test and reference images respectively. We take the dimension of the image to be nx pixels in the x direction (width) and ny pixels in the y direction (height). Typical values are nx=640 and ny=480.
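Letting sx and sy be the viewing angles subtended by the image in the x and y directions respectively, the viewing angles sx, sy can be derived from the viewing distance and the image size in the plane of the display by the use of Eq. 1 twice, once to compute sx and once to compute sy.
$\tan\left(\frac{\pi \cdot \mathrm{size(degrees)}}{360}\right) = \frac{0.5 \cdot \mathrm{size(cm)}}{\mathrm{viewing\ distance(cm)}}$ Eq. 1a
$\mathrm{size(degrees)} = \frac{180}{\pi} \cdot \frac{\mathrm{size(cm)}}{\mathrm{viewing\ distance(cm)}}$ Eq. 1b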
Eq. 1b follows from Eq. 1a only when the ratio (size/(viewing distance)) is much less than one. But this is true in virtually all cases of practical interest so we use Eq. 1b hereinafter. Also, the designation of cm in Eq. 1a and 1b is for convenience, since it is only necessary that “size” and “viewing distance” be expressed in the same units of length.
The width and height of each pixel, px and py respectively, are given by Eq. 2, with px, py in degrees if sx and sy are in degrees.
$p_x = s_x / n_x, \quad p_y = s_y / n_y$ Eq. 2
Typical values are sx=8 deg. and sy=6 deg., yielding, for nx=640 and ny=480, typical values of px=py=(1/80) deg.
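The viewing-geometry step can be sketched in a few lines of Python. This is a minimal illustration rather than the patented implementation: the function names are hypothetical, and the pixel size is assumed to be the viewing angle divided by the pixel count, which is consistent with the typical values quoted above (8 deg. over 640 pixels gives 1/80 deg.).

```python
import math

def viewing_angle_exact(size, distance):
    """Solve Eq. 1a for the angle: tan(pi*s/360) = 0.5*size/distance."""
    return (360.0 / math.pi) * math.atan(0.5 * size / distance)

def viewing_angle_small(size, distance):
    """Small-angle form (Eq. 1b), valid when size/distance is much less than one."""
    return (180.0 / math.pi) * size / distance

def pixel_size(s_deg, n_pixels):
    """Angular extent of one pixel in degrees (assumed form of Eq. 2)."""
    return s_deg / n_pixels

# Hypothetical display: an image 16 cm wide viewed from 114.6 cm.
sx_exact = viewing_angle_exact(16.0, 114.6)
sx_small = viewing_angle_small(16.0, 114.6)
px = pixel_size(8.0, 640)  # typical values from the text
```

Both functions accept any length unit, provided the size and the viewing distance share it.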
The test and reference images, Gtest(x,y) and Greference(x,y) respectively, may contain noise, or may differ in those image components having high spatial frequencies whose visibilities are not of interest for the particular image analysis under consideration. In addition, the images may be captured at a higher resolution or larger area than is necessary for the particular image analysis. For these and other reasons, it may be useful to pre-process the test and reference images to remove noise, remove high frequency components and other components not significantly affecting the visibility analysis, to reduce image resolution, and/or to crop the image to a rectangle of interest (or other convenient shape). Such operations can be performed by filtering, downsampling and cropping, pursuant to some embodiments of the present invention. Such operations are optional and, when employed, can be employed in any combination, sequence or number. That is, multiple steps of each operation can be performed whenever advantageous to do so, and the sequence of various operations or combinations can also be adjusted for the particular image processing task at hand. To be concrete in our description, we describe typical pre-processing operations, individually and in a particular sequence, understanding thereby that the present invention is not limited to the particular steps, sequence, number or type of operations described.
It is convenient in some embodiments to pre-filter the test and reference images by convolution with a pre-filter function PF(x,y) pursuant to Eq. 2.1
$G'(x,y) = PF(x,y) \otimes G(x,y)$ Eq. 2.1
for Gtest(x,y) and Greference(x,y) respectively. The G′ function of Eq. 2.1, the pre-processed image, is then used in place of G in subsequent image processing, including in Eqs. 3, 4 and following.
In some embodiments of the present invention, it is convenient to use a pre-filter function PF(x,y) given by Eq. 2.2.
in which pscale is a parameter, conveniently taken to be 0.125 degree in some embodiments.
The test and reference images can be downsampled by integer factors in the x and y directions {dx, dy} respectively, by selecting every dx-th column and dy-th row from the original image to create a new, downsampled image G″(x,y). This operation is conveniently expressed in terms of a “downsampling operator” DS as
$G''(x,y) = DS(G'(x,y),\, d_x,\, d_y)$ Eq. 2.3
The new dimensions of the test and reference images in the x and y directions are thus given as nx′ and ny′ as in Eq. 2.4.
$n_x' = \mathrm{Floor}[n_x / d_x], \quad n_y' = \mathrm{Floor}[n_y / d_y]$ Eq. 2.4
in which the function “Floor[ ]” returns the greatest integer less than or equal to its argument. Typical values for dx and dy are dx=dy=4.
Eq. 2.3 uses the pre-processed image G′ from Eq. 2.1 as the image from which the downsampled image G″ is derived. This is a particular example presented to be concrete in our description and not intending to limit the scope of the present invention. Although downsampling is almost always preceded by filtering to avoid aliasing, downsampling can be performed on an image with or without pre-filtering.
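As an illustration of the downsampling operator DS, the following Python sketch keeps every dx-th column and dy-th row of an image stored as a list of rows. The function name is hypothetical, and the choice of starting column is an assumption: counting is taken to be 1-based so that the first retained column is column dx, which makes the output width equal to the floor of nx/dx.

```python
def downsample(image, dx, dy):
    """Keep every dx-th column and dy-th row of a row-major image.

    Assumption: columns and rows are counted from 1, so the first
    retained column is column dx and the first retained row is row dy.
    """
    return [row[dx - 1::dx] for row in image[dy - 1::dy]]

# Hypothetical 7-row by 10-column image whose pixel value encodes its position.
image = [[10 * r + c for c in range(10)] for r in range(7)]
small = downsample(image, dx=4, dy=3)
```

In practice the image would normally be pre-filtered first, as the text notes, to avoid aliasing.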
The image G, G′ or G″ can be cropped to a rectangle of interest (ROI). For definiteness, we describe cropping the G″ image having dimensions nx′ and ny′. It is convenient to describe the ROI by the pixel coordinates of its lower left corner {xLL, yLL} and upper right corner {xUR, yUR} respectively. Cropping is conveniently performed by deleting from the image rows 1 through (yLL−1) inclusive, and rows (yUR+1) through ny′ inclusive, as well as columns 1 through (xLL−1) inclusive, and columns (xUR+1) through nx′ inclusive. The dimensions of the new, cropped image are thus
nx″=xUR−xLL+1
ny″=yUR−yLL+1 Eq. 2.5
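The cropping rule and the resulting dimensions of Eq. 2.5 can be checked with a short Python sketch. The function name is hypothetical, and the image is assumed to be a list of rows indexed so that row 1 of the text is the first list element.

```python
def crop(image, x_ll, y_ll, x_ur, y_ur):
    """Keep rows y_ll..y_ur and columns x_ll..x_ur (1-based, inclusive)."""
    return [row[x_ll - 1:x_ur] for row in image[y_ll - 1:y_ur]]

# Hypothetical 6-row by 5-column image whose pixel value encodes its position.
image = [[10 * r + c for c in range(5)] for r in range(6)]
roi = crop(image, x_ll=2, y_ll=2, x_ur=4, y_ur=5)
```

The cropped image has xUR−xLL+1 columns and yUR−yLL+1 rows, matching Eq. 2.5.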
If the pre-processing techniques are used, singly or in combination, the resulting output images (test and reference) are considered the input images to the other image processing procedures described herein. New image dimensions (if present) should also be used.
If a reference image is not readily available, it is convenient in some embodiments of the present invention to create one by deleting the target or structural component from a copy of the test image. If the target is confined to a local region on an otherwise uniform image with graylevel G0, then it is convenient in some embodiments of the present invention to create a reference image as a uniform image having the same size as the test image with a graylevel also equal to G0. Typical images are depicted as 100 in
If a reference image is not available, some embodiments of the present invention obtain a reference image by processing the test image, for example, convolving the test image with a reference filter, RF(x,y). It is advantageous in some embodiments to pre-process the test image pursuant to one or more of the pre-processing techniques described herein (or others known in the field) before application of the reference filter, that is, convolve RF with G, G′, G″ or equivalents, among others.
In some embodiments, it is convenient to create a reference image by smoothing the test image and thereby suppress from the test image the signals whose visibility is of interest. For example, smoothing can conveniently be carried out with a Gaussian reference filter having the form given by Eq. 2.6.
“rscale” is a parameter conveniently chosen to be 2 degrees.
The reference image is then created by convolving the test image with the reference filter, Eq. 2.6, either by employing conventional convolution (e.g., Eq. 5a, 5b, 5c) or, advantageously according to some embodiments of the present invention, using “confined convolution,” denoted by a “confined convolution operator” $\otimes_c$ as applied in Eq. 2.7.
$G'''(x,y) = RF(x,y) \otimes_c G''(x,y)$ Eq. 2.7
Eq. 2.7 depicts the example in which the pre-processed image G″ is convolved by confined convolution to produce a reference image G″′, understanding that pre-processing the test image is optional and conventional or other forms of convolution can be employed.
Confined convolution offers some advantages in image processing. In standard cyclic convolution, the edges of the image are considered to be connected. Thus, image content close to one edge of the image may be spread over to the opposite edge, which is sometimes called the “wrap-around problem.” Confined convolution is a form of convolution which avoids the wrap-around problem by, in effect, disconnecting the opposing edges of the image.
Confined convolution makes use of a “Pad-Convolve-Crop” (PCC) operator. The operands of the PCC operator are a general image function, I(x,y), and a kernel K(x,y) in which the kernel has kx columns and ky rows. The image I(x,y) is augmented or “padded” with rows and columns containing entries having a value of 0, such that the padded image has kx additional columns (of all 0's) and ky additional rows (of all 0's) in comparison with I(x,y). This padded I(x,y) is convolved with the kernel K(x,y). The image resulting from this convolution is then restored to the original image size by removing the added ky rows and kx columns. This sequence of operations defines the PCC operator operating on K and I, denoted as PCC(K(x,y),I(x,y)).
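The confined convolution of K(x,y) with I(x,y) is then given by Eq. 2.8.
$K(x,y) \otimes_c I(x,y) = \frac{\mathrm{PCC}(K(x,y),\, I(x,y))}{\mathrm{PCC}(K(x,y),\, 1(x,y))}$ Eq. 2.8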
in which 1(x,y) is an image (array) all of whose entries=1 and which has the same number of rows and columns as the (unpadded) image I(x,y).
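The Pad-Convolve-Crop operator and the confined convolution built on it can be sketched in plain Python as follows. This is an illustrative sketch under stated assumptions, not the patented implementation: the function names are hypothetical, the kernel origin is assumed to be its centre sample, and treating out-of-range pixels as zero is used as an equivalent of explicit zero padding followed by cropping. The normalisation divides by the PCC of the kernel with an all-ones image, as the description of 1(x,y) suggests.

```python
def pcc(kernel, image):
    """Pad-Convolve-Crop: zero-padded convolution returned at the image size."""
    ny, nx = len(image), len(image[0])
    ky, kx = len(kernel), len(kernel[0])
    cy, cx = ky // 2, kx // 2  # assumed kernel origin: the centre sample
    out = [[0.0] * nx for _ in range(ny)]
    for y in range(ny):
        for x in range(nx):
            acc = 0.0
            for j in range(ky):
                for i in range(kx):
                    yy, xx = y - (j - cy), x - (i - cx)
                    # Pixels outside the image count as zero (the padding).
                    if 0 <= yy < ny and 0 <= xx < nx:
                        acc += kernel[j][i] * image[yy][xx]
            out[y][x] = acc
    return out

def confined_convolve(kernel, image):
    """PCC(K, I) divided pixel-by-pixel by PCC(K, 1)."""
    ones = [[1.0] * len(image[0]) for _ in image]
    num = pcc(kernel, image)
    den = pcc(kernel, ones)  # positive when the kernel centre is positive
    return [[n / d for n, d in zip(rn, rd)] for rn, rd in zip(num, den)]

# Hypothetical check: a uniform image and a 3x3 box kernel.
flat = [[5.0] * 5 for _ in range(4)]
box = [[1.0] * 3 for _ in range(3)]
plain = pcc(box, flat)
confined = confined_convolve(box, flat)
```

The plain PCC result is darker near the edges because the padding zeros enter the sum, whereas the confined result leaves a uniform image uniform. Nothing wraps around from the opposite edge in either case.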
The reference and test images (optionally, following pre-processing) are converted from a grayscale format to local luminance contrast image. This conversion is depicted schematically as “Contrast” 101 in
This transformation from grayscale G(x,y) to a luminance image or luminance L(x,y) is advantageously performed by a gamma function “Gamma” as in Eq. 3.
L(x,y)=Gamma[G(x,y)] Eq. 3
The particular form and parameters used for the Gamma function will depend on the particular characteristics of the device displaying the test and reference images. A typical version is Eq. 4 in which the luminance L(x,y) is given by:
$L(x,y) = L_{max} \left( G(x,y) / G_{max} \right)^{\gamma}$ Eq. 4
in which Lmax is the maximum possible luminance in the image, Gmax is the corresponding maximum grayscale value, and γ is the gamma exponent of the display, approximately correcting for nonlinearities in the luminance characteristics of the display. A typical value for γ is γ=2.2. Eqs. 3 and 4 are applied to both test and reference images.
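The grayscale-to-luminance conversion of Eq. 4 is simple enough to show directly. In this Python sketch the function name is hypothetical, γ=2.2 and Lmax=100 cd/m2 are the typical values quoted in the text, and Gmax=255 is an assumption corresponding to an 8-bit display.

```python
def gamma_luminance(g, l_max=100.0, g_max=255.0, gamma=2.2):
    """Eq. 4: luminance in cd/m2 from a grayscale value.

    g_max=255 is an assumed 8-bit maximum; l_max and gamma are the
    typical values given in the text.
    """
    return l_max * (g / g_max) ** gamma

def luminance_image(gray):
    """Apply Eq. 4 to every pixel of a row-major grayscale image."""
    return [[gamma_luminance(g) for g in row] for row in gray]

# Hypothetical 2x2 grayscale image.
lum = luminance_image([[0, 128], [200, 255]])
```

The same function would be applied to both the test and the reference image.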
A local luminance filter is employed having a luminance filter function LF(x,y). It is then convenient to introduce a local mean luminance reference image LL(x,y), obtained by the convolution of the reference luminance image Lreference(x,y) with the luminance filter function, by Eq. 5a
$LL(x,y) = LF(x,y) \otimes L_{reference}(x,y)$ Eq. 5a
in which $\otimes$ denotes convolution of the two functions, as defined in known texts in the field, for example “Fourier Analysis and Imaging” by Ronald N. Bracewell, (Kluwer Academic/Plenum Publishers, 2003), pp. 174-179, incorporated herein by reference. The convolution can be expressed in continuous and discrete forms as in Eq. 5b and 5c respectively.
$LF(x,y) \otimes L_{reference}(x,y) = \int\!\!\int LF(\tau,\omega)\, L_{reference}(x-\tau,\, y-\omega)\, d\tau\, d\omega$ Eq. 5b
where the integrals extend over the domain in which LF(τ,ω) is not zero. In discrete form the convolution is given by Eq. 5c.
$LF(x,y) \otimes L_{reference}(x,y) = \sum_{i} \sum_{j} LF(i,j)\, L_{reference}(x-i,\, y-j)$ Eq. 5c
In some embodiments of the present invention, it is convenient to use the luminance filter function LF(x,y) given by Eq. 6.
in which lscale is a parameter to be chosen. If lscale→+∞, this corresponds to an LL that is constant and equal to the average luminance over the image.
The average (MEAN) luminance, Lmean, is given by a numerical average of the luminance over all pixels in the x and y directions, Eq. 7.
$L_{mean} = \frac{1}{n_x n_y} \sum_{x=1}^{n_x} \sum_{y=1}^{n_y} L(x,y)$ Eq. 7
A typical value for Lmean is 40 candelas per sq. meter (40 cd/m2).
The contrast or contrast image of each pixel, C(x,y) is then given by Eq. 8 applied to both test and reference luminance images Ltest(x, y) and Lreference(x, y) respectively.
For the particular embodiments described thus far, Lmax plays no apparent role since it appears as a multiplicative factor in both L (Eq. 4) and LL (through Lreference, Eqs. 4 and 5), hence canceling from Eq. 8, under the typically reasonable presumption that both test and reference images have the same maximum possible luminance, Lmax. However, it is convenient to retain Lmax in the equations since it simplifies the application of the equations in other embodiments of the present invention in which Lmax and/or Lmean may play a role in determining parameters of the process. A typical value for Lmax is 100 cd/m2.
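The contrast step can be illustrated for the limiting case lscale→+∞, in which the local mean luminance image is constant and equal to the mean reference luminance. Eq. 8 is not reproduced in this text, so the Python sketch below assumes the conventional form C = L/LL − 1, a ratio in which Lmax cancels as the paragraph above describes. The function names are hypothetical.

```python
def mean_luminance(lum):
    """Numerical average of the luminance over all pixels."""
    n = sum(len(row) for row in lum)
    return sum(sum(row) for row in lum) / n

def contrast_image(lum, local_mean):
    """Assumed form of Eq. 8: C = L / LL - 1, pixel by pixel."""
    return [[l / ll - 1.0 for l, ll in zip(rl, rm)]
            for rl, rm in zip(lum, local_mean)]

# Hypothetical data: a uniform 40 cd/m2 reference and a test image
# with one pixel raised to 44 cd/m2.
reference = [[40.0] * 3 for _ in range(3)]
test = [row[:] for row in reference]
test[1][1] = 44.0

l_mean = mean_luminance(reference)
ll = [[l_mean] * 3 for _ in range(3)]  # constant LL (lscale -> infinity)
c_test = contrast_image(test, ll)
c_reference = contrast_image(reference, ll)
```

Note that both contrast images are computed against the local mean of the reference image, as the text specifies.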
Following the construction of test and reference contrast functions via Eq. 8, both test and reference images are typically passed through a Contrast Sensitivity Filter (CSF), 102 in
$F(x,y) = \mathrm{DFT}^{-1}\left[\, CSF(u,v) \cdot \mathrm{DFT}[C(x,y)] \,\right]$ Eq. 9
in which C(x,y) is the contrast function of the image from Eq. 8 and F(x,y) is the filtered image.
The Discrete Fourier Transform and the Inverse Discrete Fourier Transform, DFT[ ] and DFT−1[ ], are conventional operations in the field of digital signal processing and described in many texts, for example, the text by Bracewell, supra at pp. 167-168, incorporated herein by reference.
CSF(u,v) is the discrete version of a Contrast Sensitivity Filter in the frequency domain, and u and v are horizontal and vertical frequency indices respectively in units of cycles/width and cycles/height.
The discrete, frequency domain version of the Contrast Sensitivity filter, CSF(u,v) is conveniently given by the product of a radial contrast sensitivity function, RCSF(u,v), and an oblique effect contrast sensitivity filter, OEF(u,v), as expressed in Eq. 10.
CSF(u,v)=RCSF(u,v)OEF(u,v) Eq. 10
In some embodiments of the present invention it is convenient to choose a radial function RCSF having the form given in Eq. 11.
in which “sech” is the hyperbolic secant function, “gain”, “loss”, f0, f1 and p are parameters. Typical values for these parameters are as follows:
In some embodiments of the present invention, it is convenient to choose an oblique filter, OEF having the form given in Eq. 12.
in which “corner” and “slope” are parameters. Typical values for “corner” and “slope” are corner=3.481 and slope=13.57149.
Following processing of both the test image and the reference image by CSF, 102, the resulting filtered images are subtracted pixel-by-pixel, 103. The result is the difference image D(x,y) of Eq. 13.
D(x,y)=Ftest(x,y)−Freference(x,y) Eq. 13
In some embodiments of the present invention, it is advantageous to create a mask image, M(x,y), from the filtered reference image Freference(x,y). In such embodiments, the absolute value of the filtered reference image is raised to a power “a”, convolved with a masking filter MF(x,y), and added to the constant 1, and the b'th root of the resulting expression is computed, as in Eq. 14.
$M(x,y) = \left( 1 + MF(x,y) \otimes \left| F_{reference}(x,y) \right|^{a} \right)^{1/b}$ Eq. 14
in which the convolution operator $\otimes$ indicates discrete convolution.
In some embodiments, it is advantageous to choose a=b=2 in Eq. 14, resulting in a mask image M(x,y) given by Eq. 15.
$M(x,y)=\sqrt{1+MF(x,y)\otimes F_{reference}^{2}(x,y)}$ Eq. 15
Furthermore, it is advantageous in some embodiments of the present invention to choose the masking filter MF(x,y) to have the form of Eq. 16
in which “mgain” and “mscale” are parameters. Typical choices for mgain and mscale are mgain=0.2 and mscale=0.1.
In some embodiments of the present invention, the difference image D(x,y) is divided by the masking image to yield a masked difference image MD(x,y) according to Eq. 17.
For those embodiments in which a mask image is not employed, the masked difference image is simply the difference image. Also, when a mask image is not employed, the subtract operation 103 can optionally precede the CSF 102.
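By way of illustration and not limitation, the difference and masking steps described above can be sketched in Python with NumPy. The sketch follows the prose description of the mask (absolute value raised to the power a, convolved with MF, added to 1, b'th root taken) and divides the difference image by the result. The function names, the zero-padded treatment of image edges in the helper conv2_same, and the choice to pass MF(x,y) as a precomputed kernel array are assumptions; the kernel is not derived here from Eq. 16.

```python
import numpy as np

def conv2_same(image, kernel):
    """Linear 2-D convolution via FFT, cropped to the image size.

    Pixels outside the image are treated as zero (an assumption).
    """
    ih, iw = image.shape
    kh, kw = kernel.shape
    fh, fw = ih + kh - 1, iw + kw - 1
    product = np.fft.rfft2(image, (fh, fw)) * np.fft.rfft2(kernel, (fh, fw))
    full = np.fft.irfft2(product, (fh, fw))
    top, left = (kh - 1) // 2, (kw - 1) // 2
    return full[top:top + ih, left:left + iw]

def mask_image(f_ref, mf, a=2.0, b=2.0):
    """M = (1 + MF conv |F_ref|^a)^(1/b); a = b = 2 gives Eq. 15."""
    energy = conv2_same(np.abs(f_ref) ** a, mf)
    return (1.0 + energy) ** (1.0 / b)

def masked_difference(f_test, f_ref, mf=None):
    """Difference image (Eq. 13), divided by the mask when one is used."""
    d = f_test - f_ref
    if mf is None:
        return d            # no mask: MD(x,y) is simply D(x,y)
    return d / mask_image(f_ref, mf)
```

With a reference image of zeros the mask is 1 everywhere and the masked difference equals the plain difference, which matches the no-mask case described above.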
At this “boost” stage, 106 in
In some embodiments, it is advantageous to use a window function W(x,y) as given by Eq. 19
in which “wscale” is a parameter, advantageously chosen to be approximately 1.013 in some embodiments.
It is advantageous in some embodiments of the present invention to display the complete JND(x,y) image, 107 in
The next stage in the process combines or “pools” the values of JND(x,y) of the pixels in the x and y directions to produce a single value of JND. It is convenient to use a Minkowski summation to effect this pooling with a parameter ψ as exponent, as given in Eq. 20.
The number JND of Eq. 20 is the desired numerical value characterizing the Spatial Standard Observer.
In some embodiments, it is advantageous to let ψ→∞, in which case Eq. 20 reduces to Eq. 21.
JND=Max[JND(x,y)] Eq. 21
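By way of illustration and not limitation, Minkowski pooling and its limiting case can be sketched in Python with NumPy. The sketch uses the plain form (Σ|JND(x,y)|^ψ)^(1/ψ); the use of absolute values and the omission of any normalizing constants that Eq. 20 may contain are assumptions, as is the function name pool_jnd.

```python
import numpy as np

def pool_jnd(jnd_map, psi):
    """Pool a JND(x,y) image into one number with Minkowski exponent psi."""
    v = np.abs(np.asarray(jnd_map, dtype=float))
    peak = float(v.max())
    if np.isinf(psi) or peak == 0.0:
        return peak                       # psi -> infinity: Eq. 21
    # Factor out the peak so large exponents do not overflow.
    return peak * float(np.sum((v / peak) ** psi) ** (1.0 / psi))

jnd_map = [[3.0, 4.0]]
pooled_2 = pool_jnd(jnd_map, 2.0)         # sqrt(3^2 + 4^2) = 5
pooled_inf = pool_jnd(jnd_map, np.inf)    # maximum value = 4
```

As ψ grows, the pooled value approaches the maximum of JND(x,y), which is the behavior expressed by Eq. 21.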
In some embodiments of the present invention, it is advantageous to apply a non-linear transformation (for example, a power function) to the JND computed from either Eq. 20 or Eq. 21. Thus, whether or not a non-linear transformation is applied to JND, and whether or not border effects are relevant for the particular image(s) under consideration, the Spatial Standard Observer, as characterized by the value of JND, provides an effective visibility metric, able to be computed relatively rapidly.
In some applications, the target or test image (201 in
In some embodiments of the present invention it is advantageous to multiply the contrast images by a spatial border aperture function BA(x,y) between the Contrast and CSF steps, that is, at 120 in the process flow diagram of
CBA(x,y)=C(x,y)BA(x,y) Eq. 22
Then CBA(x,y) is used in place of C(x,y) at the CSF step, Eq. 9.
In some embodiments of the present invention, the border aperture function is advantageously chosen to be:
in which “bgain” and “bscale” are parameters. An example of this function is given in
The use of a border aperture function, BA(x,y) as in Eq. 23, has the advantage of simplicity, but as an approximation, it may not be as accurate as alternative methods. In other embodiments, it is advantageous for the parameters bscale and bgain to depend upon the luminance contrast between the image and the border. Typically, a margin is added to the image such that the enlarged image, image+margin, contains a portion of the border. This enlarged image is then processed as the “image” pursuant to the image processing techniques described herein, typically including the masking component of the processing, 105. The presence of a portion of the border in the enlarged image will tend to produce the appropriate masking effect, tending to reduce visibility of targets or portions of targets near the border.
There are various ways in which an enlarged image can be used to treat border effects. For example, it is convenient to take the width of the border region to be Round[2*mscale/px] and the height to be Round[2*mscale/py], in which mscale is the masking parameter (Eq. 16). “Round[ ]” is a function that returns the integer nearest to its argument. The dimensions of the enlarged image are then given by Eq. 24 as:
width=nx+Round[2*mscale/px]
height=ny+Round[2*mscale/py] Eq. 24
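By way of illustration and not limitation, Eq. 24 can be evaluated with a few lines of Python. The function name enlarged_size is hypothetical, and the sketch assumes that mscale, px and py are expressed in the same angular units, so that each ratio is a count of pixels. The pixel size of 1/60 in the example is an assumed value chosen only for the arithmetic.

```python
def enlarged_size(nx, ny, mscale, px, py):
    """Return (width, height) of the enlarged image per Eq. 24.

    Python's built-in round() returns the nearest integer and resolves
    exact ties to the even neighbor.
    """
    margin_x = round(2 * mscale / px)
    margin_y = round(2 * mscale / py)
    return nx + margin_x, ny + margin_y

# mscale = 0.1 with an assumed pixel size of 1/60 adds 12 pixels per axis.
width, height = enlarged_size(256, 256, 0.1, 1 / 60, 1 / 60)
```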
An advantage of treating border effects with an enlarged image is that it more correctly deals with the dependence of the border masking effect upon the luminance contrast between the border and the (original, unenlarged) image. A possible disadvantage is that this approach requires somewhat more processing to include the masking step.
JND from Eq. 20 (or Eq. 21 for ψ→∞) relates to the percentage of human observers who will notice a difference. For example, images leading to JND having a value around 1 will typically present noticeable differences to about 75% of typical human observers. Images resulting in larger JND values will present noticeable difference to a correspondingly larger percentage of typical human observers, although the precise functional relationship between JND and the percentage of viewers observing differences may not be readily known.
It is advantageous in some embodiments of the present invention to use JND as a measure of different levels of perceptual intensity. That is, larger JND values indicate that a larger percentage of observers will notice a difference. Larger values of JND also typically indicate that a given observer will be more likely to observe more detailed differences. By way of illustration and not limitation, we consider the example of observing a scene through some form of optical instrument, such as a remote viewing device or night vision goggles, among others. A given observer may require an image value of JND1 in order to conclude that some object is present other than natural background. However, a value of JND2>JND1 would be required for the observer to conclude that the object is a military vehicle, and a value of JND3>JND2 would be required to conclude that it is a hostile military vehicle. Thus JND values as determined by the SSO can be a useful measure not only of minimal levels of visibility but also, when more stringently applied, of the probable level of perceptual information obtainable from a given image.
The memory 260 typically comprises different modalities, illustratively semiconductor memory, such as random access memory (RAM), and disk drives. Depending on the embodiment, the memory 260 typically includes an operating system, 280. The operating system 280 may be implemented by any conventional operating system such as UNIX®, WINDOWS®, and LINUX®, among others.
Although various embodiments which incorporate the teachings of the present invention have been shown and described in detail herein, those skilled in the art can readily devise many other varied embodiments that still incorporate these teachings.
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Aug 09 2010 | The United States of America as represented by the Administrator of the National Aeronautics and Space Administration (NASA) | (assignment on the face of the patent) | / |