A method and apparatus for coding an image includes calculation of motion vectors of vertices of a patch in an image being encoded and transmitting information of horizontal and vertical components of the motion vectors of the vertices and information specifying that values of the horizontal and vertical components of a motion vector for each pixel in the patch are integral multiples of 1/d of a distance between adjacent pixels, where d is an integer not less than 2.
|
2. A coder which codes an image by carrying out motion compensation in which all pixels associated with a same patch are not restricted to have a common vector and horizontal and vertical components of a motion vector for each pixel can assume an arbitrary value other than an integral multiple of a distance between adjacent pixels, said coder comprising:
means for estimating motion information of a patch from an original image of a current image and a reference image; and
means for calculating horizontal components of a motion vector for each pixel of a plurality of pixels of a predicted image from the estimated motion information of a patch with limiting of said horizontal and vertical components of a motion vector for each pixel to integer multiples of 1/d, d being an integer not less than 2 of the distance between adjacent pixels.
1. A method of coding an image by carrying out motion compensation in which all pixels associated with a same patch are not restricted to have a common vector and horizontal and vertical components of a motion vector for each pixel can assume an arbitrary value other than an integral multiple of a distance between adjacent pixels, said method comprising the steps of:
estimating motion information of a patch from an original image of a current image and a reference image; and
calculating horizontal and vertical components of a motion vector for each pixel of a plurality of pixels of a predicted image from the estimated motion information of a patch with limiting of said horizontal and vertical components of a motion vector for each pixel to integer multiples of 1/d, d being an integer not less than 2 of the distance between adjacent pixels.
4. The method according to
6. The coder according to
|
This is a continuation of application Ser. No. 09/994,728, filed Nov. 28, 2001 now U.S. Pat. No. 6,542,548; which is a divisional application of application Ser. No. 09/863,428, filed May 24, 2001 now U.S. Pat. No. 6,516,033; which is a divisional of application Ser. No. 09/626,788, filed Jul. 26, 2000, now U.S. Pat. No. 6,285,713; which is a continuation of application Ser. No. 09/364,255, filed Jul. 30, 1999, now U.S. Pat. No. 6,134,271; which is a continuation of application Ser. No. 08/903,199, filed Jul. 15, 1997, now U.S. Pat. No. 5,963,259; which is a continuation of application Ser. No. 08/516,218, filed Aug. 17, 1995, now U.S. Pat. No. 5,684,538.
1. Field of the Invention
The present invention relates to a video coding/decoding system and a video coder and a video decoder used with the same system for implementing a motion compensation method in which all the pixels associated with the same patch are not restricted to have a common motion vector and in which the horizontal and vertical components of a motion vector of a pixel can assume an arbitrary value other than an integral multiple of the distance between adjacent pixels.
2. Description of the Related Art
In the high-efficiency coding and decoding of image sequences, a motion compensation method utilizing the analogy between temporally-proximate frames is well known to have a great advantage in compressing the amount of information.
In
In
A motion compensation method constituting the main stream of the current video coding and decoding techniques depends on the “block matching of half-pixel accuracy” employed by MPEG1 and MPEG2 providing the international standard of video coding/decoding method.
In the “block matching of half-pixel accuracy”, the original image of the current frame to be coded is segmented into a number n of blocks at the motion estimation section 3-1 in
In Equation 1, I(x,y) denotes the original image of the current frame to be coded, and R(x,y) a reference image stored in memory. In this equation, it is assumed that pixels exist at points of which the x and y coordinates are an integer on the original image I and the reference image R. Bi designates the pixels contained in the ith block of the original image I, and Ni the number of pixels contained in the ith block of the original image I. The process of evaluating the prediction error for motion vectors varying from one block to another and searching for a motion vector associated with the smallest prediction error is called the matching. Also, the process of calculating Ei(ui,vi) for all vectors (ui,vi) conceivable within a predetermined search range and searching for the minimum value of the vector is called the full search.
In the motion estimation for the “block matching of half-pixel accuracy”, ui and vi are determined with one half of the distance between adjacent pixels, i.e., ½ as a minimum unit. As a result, (x−ui,y−vi) is not necessarily an integer, and a luminance value of a point lacking a pixel must actually be determined on the reference image R when calculating the prediction error using Equation 1. The process for determining the luminance value of a point lacking a pixel is called the interpolation, and the point where interpolation is effected is referred to as an interpolated point or an intermediate point. A bilinear interpolation is often used as an interpolation process using four pixels around the interpolated point.
When the process of bilinear interpolation is described in a formula, the luminance value R(x+p,y+q) at the interpolated point (x+p,y+q) of the reference image R can be expressed by Equation 2 with the fractional components of the coordinate value of the interpolated point given as p and q (0≦p<1, 0≦q<1).
In the motion estimation by “block matching of half-pixel accuracy”, a two-step search is widely used in which, first, the full-search of single-pixel accuracy is effected for a wide search range to estimate a motion vector approximately, followed by the full search of half-pixel accuracy for a very small range defined by, say, plus/minus a half pixel in horizontal and vertical directions around the motion vector. In the second-step search, a method is frequently used in which the luminance value of an interpolated point on the reference image R is determined in advance. An example of the process according to this method is shown in
On the other hand, assuming that the interpolation calculation is made on a reference image each time of matching, a total of 128 (=16×8, in which 16 is the number of white squares in
As described above, the number of interpolation operations can be reduced by determining the luminance value of the interpolated points on the reference image R in advance by reason of the fact that the same interpolated point on the reference image R is used a plurality of times.
Also, in the “block matching of half-pixel accuracy”, a predicted image is synthesized using the relation of Equation 3 in the synthesis circuits 4-1, 4-2 shown in FIGS. 1A and 1B.
P(x,y)=R(x−ui,y−vi),(x,y)∈Bi(1≦i≦n) (3)
In Equation 3, P(x,y) shows an original image I(x,y) of the current frame to be coded which is predicted by use of the reference image R(x,y) and the motion vector (ui,vi). Also, assuming that the predicted image P is segmented into a number n of blocks corresponding to the original image I, Bi represents a pixel contained in the ith block of the predicted image P.
In the “block matching of half-pixel accuracy”, as described above, the value of (x−ui,y−vi) is not necessarily an integer, and therefore the interpolation process such as the bilinear interpolation using Equation 2 is carried out in synthesizing a predicted image.
The “block matching of half-pixel accuracy” is currently widely used as a motion compensation method. Applications requiring an information compression ratio higher than MPEG1 and MPEG2, however, demand an even more sophisticated motion compensation method. The disadvantage of the “block matching” method is that all the pixels in the same block are required to have the same motion vector.
In order to solve this problem, a motion compensation method allowing adjacent pixels to have different motion vectors has recently been proposed. The “motion compensation based on spatial transformation” which is an example of such a method is briefly explained below.
In the “motion compensation based on spatial transformation”, the relation between the predicted image P and the reference image R in synthesizing a predicted image at the synthesis circuit 4-1, 4-2 in
P(x,y)=R(fi(x,y),gi(x,y)),(x,y)∈Pi(1≦i≦n) (4)
In Equation 4, on the assumption that the predicted image P is segmented into a number n of patches corresponding to the original image I, Pi represents a pixel contained in the ith patch of the predicted image P. Also, the transformation functions fi(x,y) and gi(x,y) represent a spatial correspondence between the predicted image P and the reference image R. The motion vector for a pixel (x,y) in Pi can be represented by (x−fi(x,y),y−gi(x,y)). The predicted image P is synthesized by calculating the transformation functions fi(x,y), gi(x,y) with respect to each pixel in each patch and determining the luminance value of corresponding points in the reference image R in accordance with Equation 4. In the process, (fi(x,y), gi(x,y)) is not necessarily an integer, and therefore the interpolation process such as the bilinear interpolation is performed using Equation 3 as in the case of the “block matching of half-pixel accuracy”.
The “block matching” can be interpreted as a special case of the “motion compensation based on spatial transformation” in which the transformation function is a constant.
Nevertheless, the words “motion compensation based on spatial transformation” as used in the present specification are not assumed to include the “block matching”.
Examples of the transformation functions fi(x,y), gi(x,y) in the “motion compensation based on spatial transformation” include the case using the affine transformation shown in Equation 5 (refer to “Basic Study of Motion Compensation Based on Triangular Patches” by Nakaya, et al., Technical Report of IEICE, IE90-106, H2-03) shown below
fi(x,y)=ai1x+ai2y+ai3
gi(x,y)=ai4x+ai5y+ai6 (5)
the case using the bilinear transformation given in Equation 6 (G. J. Sullivan and R. L. Baker, “Motion compensation for video compression using control grid interpolation”, Proc. ICASSP '91, M9.1, pp.2713-2716, 1991-05) shown below
fi(x,y)=bi1xy+bi2x+bi3y+bi4
gi(x,y)=bi5xy+bi6x+bi7y+bi8 (6)
and the case using the perspective transformation given in Equation 7 (V. Seferdis and M. Ghanbari, “General approach to block-matching motion estimation”, Optical Engineering, vol. 32, no. 7, pp. 1464-1474, 1993-07) shown below
In Equations 5, 6 and 7, aij, bij, cij (j: 1 to 9) designate motion parameters estimated for each patch as motion information at the motion estimation section 3-1 in FIG. 1A. An image identical to the predicted image P produced at the synthesis circuit 4-1 of the video coder 1 can be obtained at the synthesis circuit 4-2 of the video decoder 2 at the receiving end in such a manner that information capable of specifying the motion parameter of the transformation function for each patch in some form or other is transmitted by the video coder 1 as motion information to the video decoder 2 at the receiving end. Assume, for example, that the affine transformation (Equation 5) is used as the transformation function and the patch is triangular in shape. In such a case, six motion parameters can be transmitted directly as motion information. Alternatively, the motion vectors of three vertices of a patch may be transmitted so that six motion parameters indicated by Equation 5 are calculated from the motion vectors of the three vertices at the receiving end. Also, in the case where the bilinear transformation (Equation 6) is used as the transformation function, the employment of a quadrilateral patch makes it possible to transmit the desired one of eight motion parameters and the motion vectors of four vertices of the patch.
The following explanation refers to the case using the affine transformation (Equation 5) as the transformation function. This explanation applies substantially directly with equal effect to the case where other transformations (Equation 6, 7, etc.) are employed.
Even after a transformation function is established, many variations are conceivable for the “motion compensation based on spatial transformation”. An example is shown in FIG. 3. In this case, the motion vector is restricted to continuously change at the patch boundary. First, an original image I202 of the current frame is segmented into a plurality of polygonal patches, thereby constituting a patch-segmented original image I208. The vertices of these patches are called the grid points, each of which is shared by a plurality of patches. A patch 209 in
In the “motion compensation based on spatial transformation”, as in the “block matching”, it is pointed out that the motion estimation based on matching is effective. An example algorithm for motion estimation based on matching is described below. This scheme is called the “hexagonal matching” and is effectively applied to the case where the motion vector continuously changes at the patch boundary. This scheme is configured of two processes:
In process (1), the block matching is applied to a block of a given size containing a grid point, and the motion vector of this block is determined as a coarse motion vector for the grid points existing in the particular block. The object of process (1) is nothing but to determine a coarse motion vector of a grid point and is not always achieved using the block matching. The manner in which process (2) is carried out is shown in FIG. 4.
A typical search range for the refinement algorithm is ±3 pixels in each of horizontal and vertical directions. In such a case, a total of 49 (=7×7) matching operations are performed for each grid point in the polygon 302. Since each patch is involved in the refinement algorithm for three grid points, on the other hand, it follows that a total of 147 (=49×3) evaluations of prediction error is performed for each pixel in a patch. Further, each repetition of this refinement process increases the number of prediction error evaluations correspondingly. Consequently, each time of prediction error evaluation, interpolation computations are carried out for the interpolated points involved on the reference image, thereby enormously increasing the amount of computations.
The problem of interpolation computation in the motion estimation for the “motion compensation based on spatial transformation” is complicated due to the essential difference thereof from the similar problem in the motion estimation for the “block matching at half-pixel accuracy”. In the “motion compensation based on spatial transformation”, even when the horizontal and vertical components of the motion vector of each grid point are restricted to an integral multiple of ½, the horizontal and vertical components of the motion vector of each pixel in each patch are not necessarily an integral multiple of ½. Also, in view of the fact that the components below the decimal point of the motion vector for each pixel in each patch generally can assume an arbitrary value, the luminance value of the same interpolated point on the reference image R is rarely used a plurality of times in the matching operation.
The feature of the “motion compensation based on spatial transformation” is that a numerical operation is required for determining a motion vector for each pixel. In the case where the computation accuracy varies between the transmitting and receiving ends in computing a motion vector (transformation function), a mismatch may occur in which the predicted image P obtained at the synthesis circuit 4-1 of the video coder 1 is different from the predicted image P produced from the synthesis circuit 4-2 of the video decoder 2. This mismatch of the predicted image P has the property of accumulating at the receiving end. Even when there is only a small error for each frame, therefore, the quality of the decoded image output from the video decoding circuit 2 may be seriously affected in the end. This problem is not posed by the “block matching” in which all the pixels in a block follow the same motion vector and this particular motion vector is coded and transmitted directly as motion information.
An example of employing the affine transformation (Equation 5) as a transformation function to cope with this problem is explained. A method of solving such a problem is by enhancing the computation accuracy of Equation 5 sufficiently to reduce the computation error of Equation 5 sufficiently below the quantization step size of the luminance value. A case using this solution is studied below.
Assume, for example, that the luminance value is quantized in 8 bits with the quantization step size of 1 and that the maximum value of the luminance value is 255 (11111111) and the minimum value thereof is 0 (00000000). Also, assume that the luminance values of four adjacent pixels on the reference image P are R(0,0)=0, R(0,1)=0, R(1,0)=255, and R(1,1)=255, respectively. Further, it is assumed that the computation of Equation 5 is carried out to determine fi(x,y) when the horizontal and vertical coordinates of a point on the reference image R corresponding to a pixel P(x,y) on the predicted image P are given by 0<gi(x,y)<1 and 0<fi(x,y)<1, respectively. This condition is hereinafter referred to as the worst condition.
Under this worst condition, a computation error more than 1/255 in magnitude of fi(x,y) always leads to an error of the quantized value of the luminance. For a mismatch to be prevented, therefore, both the video coder 1 and the video decoder 2 must be fabricated in such a manner as to secure the computation error of Equation 5 sufficiently smaller than 1/255. Improving the computation accuracy, however, generally leads to an increased number of digits for internal expression of a numerical value, thereby further complicating the computation process. In the motion compensation process, Equation 5 is computed so frequently that an increased complication of this computation process has a serious adverse effect on the total amount of information processed.
With the “motion compensation based on spatial transformation”, motion estimation based on matching poses the problem of a greatly increased amount of computations required for interpolation of luminance values at points lacking a pixel on the reference image R. A more complicated computation operation is another problem which will be posed if the computation accuracy for synthesizing each predicted image P in the video coder and the video decoder is to be improved to accommodate a mismatch between a predicted image P obtained at the sending end and a predicted image P obtained at the receiving end.
An object of the present invention is to realize a motion estimation process with a small amount of computations by reducing the number of calculations for interpolation of luminance values.
Another object of the invention is to provide a method of reducing the computation accuracy required for computing the transformation function at the time of synthesizing a predicted image P and also preventing the mismatch between the predicted images P attributable to the computation accuracy of the transformation function.
Prior to motion estimation, a high-resolution reference image R′ is prepared for which the luminance value of a point having x and y coordinates equal to an integral multiple of 1/m1 and 1/m2 (m1 and m2 are positive integers) respectively is determined by interpolation on the reference image R. It follows therefore that in the high-resolution reference image R′, pixels exist at points whose x and y coordinate values are an integral multiple of 1/m1 and 1/m2 respectively. In the case where the luminance value of the reference image R at a position having a coordinate value other than an integer becomes required in the process of motion estimation, such a value is approximated by the luminance value of a pixel existing at a position nearest to the particular coordinate in the high-resolution reference image R′. The object of reducing the number of interpolation computations thus is achieved.
In the above-mentioned process for preparing the high-resolution reference image R′, interpolation computations in the number of m1×m2−1 per pixel of the original image I are required. Once the interpolation process for achieving a high resolution is completed, however, the motion estimation process does not require any further computations for interpolation. In the case of the “motion compensation based on spatial transformation” described with reference to the related art above, more than 147 interpolation computations is required for each pixel in the motion estimation. When it is assumed that m1=m2=2, the number of required interpolation computations is not more than three per pixel or about one fiftieth of the conventional requirement. Even when m1=m2=4, the number of requited interpolation computations is only 15, which is as small as about one tenth. The computation amount thus can be reduced remarkably.
Also, assume that the horizontal and vertical components of the motion vector of each pixel used for synthesizing the predicted image P in the video coder and the video decoder are defined to take a value equivalent only to an integral multiple of 1/d1 or 1/d2 (d1 and d2 being integers) respectively of the distance between adjacent pixels. The object of reducing the required computation accuracy of the transformation function and preventing a mismatch is thus achieved.
In the case where the above-mentioned rule on motion vectors is employed, the magnitude of the computation error of the transformation function fi(x,y) always leading to an error of the quantization value of luminance under the “worst condition” described with reference to the related art above is 1/d1. Suppose d1=4, for example, the risk of causing a mismatch of the predicted images under the “worst condition” is maintained substantially at the same level even when the computation accuracy of fi(x,y) is reduced by 6 bits as compared with the proposed solution described above with reference to the related art.
The foregoing and other objects, advantages, manner of operation and novel features of the present invention will be understood from the following detailed description when read in conjunction with the accompanying drawings.
A method of performing the motion estimation operation by improving the resolution of the whole reference image R in a video coder 1 will be explained as a first embodiment. First, the luminance value of a point lacking a pixel on the reference image R is interpolated to form a high-resolution reference numeral R′. Assuming that the bilinear interpolation (Equation 3) is used as an interpolation scheme for the luminance value, the high-resolution reference numeral R′ is given by Equation
where it is assumed that s and t are an integral number and that 0≦s<m1 and 0≦t<m2. On the high-resolution reference image R′, pixels are assumed to exist at points where all of x, y, s and t are an integral number. The points where s=t=0 corresponds originally to pixels existing on the reference image R, and the luminance value of other points can be determined by interpolation.
In the description that follows, an embodiment will be explained with reference to the case in which m1=m2=m (m: positive integral number) for the sake of simplicity.
An example of an video coder 1 utilizing the high-resolution reference image R′ is shown in FIG. 5. The arrows in
An example of the interpolation circuit 405 using the bilinear interpolation (Equation 8) for the interpolation of a luminance value is shown in
In the first embodiment, although the number of interpolation computations is reduced, a memory capable of storing an image four times larger than the reference image R is required as the frame memory 407 for storing the high-resolution reference image R′. In view of this, a second-embodiment is described below, in which although the number of interpolation computations required is increased as compared with the first embodiment, the required memory capacity is reduced.
In the second method, while the required portion of the original image I and the reference image R of the current frame are fetched little by little, the reference image R is interpolated and used for motion estimation. The distance between adjacent pixels is assumed to be unity for both horizontal and vertical directions on the original image I of the current frame and the reference image R. The description below is based on the assumption that the “hexagonal matching” is used for motion estimation, and is centered on a circuit for executing the refinement operation in the “hexagonal matching”. The coarse motion estimation of grid points which constitutes another process for the “hexagonal matching”, as already explained, is carried out by executing the “block matching” for a block containing the grid points.
In the embodiments shown in
Taking into consideration the facility of multiply and divide operations in a circuit, m1 and m2 are preferably a power of 2. With the reduction in the magnitude of m1 and m2, the circuit scale can be reduced. On the other hand, the approximation accuracy of the coordinate (motion vector) for motion estimation is adversely affected, and the prediction error is likely to be inverted in magnitude in the computation of Equation 1. The result of motion estimation thus is distorted, thereby deteriorating the performance of prediction. With the increase of m1 and m2, by contrast, the inverse phenomenon results. Taking the circuit scale into consideration, the m1 or m2 value of 4 or less is desirable. When the performance of prediction is taken into account, however, 2 or more is a desirable value of m1 and m2. Balancing between these two extremes, the appropriate value of m1 and m2 is 2 and 4 respectively.
When motion estimation is carried out using a high-resolution reference image R′ with an image density of m times larger in horizontal and vertical directions, the value of the transformation functions fi(x,y) and gi(x,y) in Equations 5 to 7 is limited to an integer multiple of 1/m. In other words, this indicates that the minimum unit of the transformation function becomes 1/m of the interval between adjacent pixels. This restriction, however, is applied only to the motion estimation, and need not be observed in synthesizing the predicted image P. In the motion compensation based on spatial transformation, on the other hand, in order to prevent a mismatch of predicted images P in the video coder 1 at the sending end and in the video decoder 2 at the receiving end, some standard is required to be established with respect to the computation accuracy of the transformation function for synthesizing the predicted image P. One method of establishing such a standard is by setting a minimum unit of the transformation function for synthesizing the predicted image P as in motion estimation.
In this method, the horizontal and vertical components of the motion vector of each pixel used in synthesizing the predicted image P at the synthesis circuit 4-1 of the video coder 1 and the synthesis circuit 4-2 of the video decoder 2 are specified to assume only a value equal to an integer multiple of 1/d1 and 1/d2 (d1 and d2 are positive integers) respectively of the distance between adjacent pixels. In other words, the synthesis circuits 4-1 and 4-2 are constructed to include means for rounding the computation result of the transformation functions fi(x,y) and gi(x,y) into a value equal to an integer multiple of 1/d1 and a value equal to an integer multiple of 1/d2, respectively.
With reference to the case using the affine transformation (Equation 5) as a transformation function, explanation will be made below about an embodiment of a method in which the computation result of the transformation function is rounded into a value equal to an integer multiple of 1/d1 and 1/d2. For simplicity's sake, it is assumed that d1=d2=d (d: positive integer). It is also assumed that the patch is triangular in shape and that the motion vectors of three vertices of the patch are transmitted as motion information.
The following description deals with the example shown in
In this equation, any dividing operation is not performed and aij (j: 1 to 6) is retained in the form of aij=aji′/Di where both the numerator and denominator are an integer. Then, the computation result of Equation 5 can always be given in the form of a fraction having a numerator and a denominator of an integral number such as fi(x,y)=fi′(x,y)/Di and gi(x,y)=gi′(x,y)/Di. Defining the symbol “//” as representing a dividing operation between integral values (a dividing operation in which he decimal component of the computation result is discarded), it is assumed that
where ki=Di//2. Fi(x,y) and Gi(x y) are the functions for rounding fi(x,y) and gi(x,y) respectively into a value equal to the nearest integer multiple of 1/d.
In the synthesis circuit 4-1 of the video coder 1 and the synthesis circuit 4-2 of the video decoder 2, if Fi(x,y) and Gi(x,y) of Equation 7 are used in place of fi(x,y) and gi(x,y) of Equation 4, the horizontal and vertical components of the motion vector of each pixel can be restricted to assume only a value equal to an integer multiple of 1/d of the distance between adjacent pixels. Also, by using Fi(x,y) and Gi(x,y) for both the sending and the receiving ends, a mismatch of the predicted image P attributable to the error of the transformation function can be prevented in a computation comparatively low in accuracy.
The value d can be either defined as a fixed parameter for the coding/decoding system, or can be determined as a variable by arrangement between the sending and the receiving ends before transmitting the video data. An example procedure for determining the value d by communication between the video coder 1 at the sending end and the video decoder 2 at the receiving end is shown in FIG. 14. First, step 1403 causes the sending end to notify the receiving end by communication that the allowable upper limit of d is 4 due to the hardware restriction of the video coder 1. Then, the receiving end at step 1404 notifies the sending end by communication that the upper limit of d is 2 due to the restriction of the video coder 2. As a result, the sending end decides that the optimum value of d is 2 and gives advice at step 1405 that the video data subsequently transmitted is coded with d as 2. Immediately after this advice, the sending end transmits video data at step 1406. Generally, the larger the value d, the more complicated the system hardware. Consequently, it is considered appropriate that the sending end employs the upper limit value for the sending or receiving end, whichever is lower. For this method to be realized, the video coder 1 and the video decoder 2 are required to have a function capable of accommodating the value of d equal to or lower than their own upper limit respectively.
Considering the facility of multiply and divide operation, a power of 2 is recommendable as the value of d. The larger the value of d, the smaller the prediction error. In spite of this, the synthesizing process for the predicted image P becomes more complicated. Taking the performance of prediction into consideration, the desirable value of d is 2 or more. As a trade-off between the performance of prediction and the complication of the process, an appropriate value of d is specifically 2, 4, 8.
The following-described modifications also obviously are included in the present invention.
According to the present invention, it is possible to reduce the number of computations for interpolation of luminance values in the motion estimation process for a motion compensation scheme in which all the pixels associated with the same patch are not restricted to have a common motion vector but the horizontal and vertical components of the motion vector of pixels can assume an arbitrary value other than an integer multiple of the distance between adjacent pixels.
Further, according to the present invention, the computation accuracy of the transformation function can be reduced while preventing a mismatch of the predicted image in synthesizing a predicted image by a motion compensation scheme in which all the pixels associated with the same patch are not restricted to have a common motion vector and the horizontal and vertical components of the motion vector of pixels can assume an arbitrary value other than an integer multiple of the distance between adjacent pixels. Furthermore, in a method of determining the values of d1 and d2 by arrangement between the sending and receiving ends before transmission of video data, an optimum image quality of a decoded image can be determined in accordance with the performance of the systems at the sending end and the receiving end.
Kimura, Junichi, Nakaya, Yuichiro
Patent | Priority | Assignee | Title |
7680188, | Aug 18 1994 | Hitachi, Ltd. | Video coding method and apparatus for calculating motion vectors of the vertices of a patch of an image and transmitting information of horizontal and vertical components of the motion vectors |
7684487, | Aug 18 1994 | Hitachi, Ltd. | Video coding method and apparatus for calculating motion vectors of the vertices of a patch of an image and transmitting information of horizontal and vertical components of the motion vectors |
8135070, | Aug 18 1994 | Hitachi, Ltd. | Video coding method and apparatus for calculating motion vectors of the vertices of a patch of an image and transmitting information of horizontal and vertical components of the motion vectors |
8155199, | Aug 18 1994 | Hitachi, Ltd. | Video coding method and apparatus for calculating motion vectors of the vertices of a patch of an image and transmitting information of horizontal and vertical components of the motion vectors |
8160146, | Aug 18 1994 | Hitachi, Ltd. | Video coding method and apparatus for calculating motion vectors of the vertices of a patch of an image and transmitting information of horizontal and vertical components of the motion vectors |
8565312, | Apr 24 2009 | Sony Semiconductor Solutions Corporation | Image processing method and image information coding apparatus using the same |
Patent | Priority | Assignee | Title |
4937666, | Dec 04 1989 | SHINGO LIMITED LIABILITY COMPANY | Circuit implementation of block matching algorithm with fractional precision |
5016102, | Apr 27 1989 | Sony Corporation | Motion dependent video signal processing |
5025482, | May 24 1989 | MITSUBISHI DENKI KABUSHIKI KAISHA, | Image transformation coding device with adaptive quantization characteristic selection |
5295201, | Jan 21 1993 | NEC Corporation | Arrangement of encoding motion image signals using motion compensation and orthogonal transformation |
5684538, | Aug 18 1994 | Hitachi, Ltd. | System and method for performing video coding/decoding using motion compensation |
5963259, | Aug 18 1994 | Hitachi, Ltd. | Video coding/decoding system and video coder and video decoder used for the same system |
6134271, | Aug 18 1994 | Hitachi, Ltd. | Video coding/decoding system and video coder and video decoder used for the same system |
6285713, | Aug 18 1994 | Hitachi, LTD | Video coding/decoding system and video coder and video decoder used for the same system |
JP5130581, | |||
JP549023, | |||
JP6500212, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Jan 15 2003 | Hitachi, Ltd. | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
Jan 26 2009 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Feb 02 2009 | ASPN: Payor Number Assigned. |
Feb 02 2009 | RMPN: Payer Number De-assigned. |
Oct 07 2010 | RMPN: Payer Number De-assigned. |
Oct 08 2010 | ASPN: Payor Number Assigned. |
Jan 09 2013 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Mar 17 2017 | REM: Maintenance Fee Reminder Mailed. |
Sep 04 2017 | EXP: Patent Expired for Failure to Pay Maintenance Fees. |
Date | Maintenance Schedule |
Aug 09 2008 | 4 years fee payment window open |
Feb 09 2009 | 6 months grace period start (w surcharge) |
Aug 09 2009 | patent expiry (for year 4) |
Aug 09 2011 | 2 years to revive unintentionally abandoned end. (for year 4) |
Aug 09 2012 | 8 years fee payment window open |
Feb 09 2013 | 6 months grace period start (w surcharge) |
Aug 09 2013 | patent expiry (for year 8) |
Aug 09 2015 | 2 years to revive unintentionally abandoned end. (for year 8) |
Aug 09 2016 | 12 years fee payment window open |
Feb 09 2017 | 6 months grace period start (w surcharge) |
Aug 09 2017 | patent expiry (for year 12) |
Aug 09 2019 | 2 years to revive unintentionally abandoned end. (for year 12) |