An image encoding device includes a first prediction parameter determination section (53) for selecting, for each of prediction units belonging to a first group, a prediction parameter from a basic set; a second prediction parameter determination section (55) for selecting, for each of prediction units belonging to a second group, a prediction parameter from a reduced set which (i) includes at least a part of the prediction parameter(s) selected by the first prediction parameter determination section (53) and (ii) is constituted by a prediction parameter(s), the number of which is not more than the number of prediction parameters included in the basic set; and a prediction parameter encoding section (243) for encoding (i) information indicating which one of prediction parameters is selected by the first prediction parameter determination section (53) and (ii) information indicating which one of prediction parameters is selected by the second prediction parameter determination section (55).

Patent
   RE48074
Priority
Feb 24 2010
Filed
Jun 08 2018
Issued
Jun 30 2020
Expiry
Dec 27 2030
Assg.orig
Entity
Large
0
28
currently ok
4. An image encoding device for encoding a difference between an input image and a prediction image, comprising:
prediction parameter encoding means for encoding a prediction parameter which is applied to each of a plurality of prediction units;
reduced set creating means for creating a reduced set which is a group of prediction parameters; and
second prediction parameter determination means for (i) determining a first prediction parameter which specifies a prediction parameter included in a basic set or (ii) determining a second prediction parameter which specifies a prediction parameter included in a reduced set,
the reduced set including at least one prediction parameter included in a prediction unit region in the vicinity of a target prediction unit selected from the plurality of prediction units,
the number of prediction parameters included in the reduced set being less than the number of prediction parameters included in a basic set,
the prediction parameter encoding means encoding the prediction parameter, per prediction unit, and a flag for designating that (i) the prediction parameter is encoded by use of the first prediction parameter determined by the second prediction parameter determination means or (ii) the prediction parameter is encoded by use of the second prediction parameter determined by the second prediction parameter determination means, and
in a case where the prediction parameters which are identical with each other exist in at least two prediction parameters included in the prediction unit region in the vicinity of the target prediction unit, the prediction parameter encoding means causing the reduced set to include one of the prediction parameters which are identical with each other, and carrying out encoding, of the at least one prediction parameter per prediction unit, the at least one prediction parameter included in the reduced set.
1. An image decoding device for decoding encoded data which is obtained in such a manner that (i) a difference between an input image and a prediction image is encoded for each of a plurality of prediction units, and (ii) a prediction parameter used to generate the prediction image is encoded, the image decoding device comprising:
prediction parameter decoding means for decoding a prediction parameter which is to be applied to each of the plurality of prediction units;
reduced set creating means for creating a reduced set which is a group of prediction parameters; and
second prediction parameter decoding means for (i) decoding a first prediction parameter which specifies a prediction parameter included in a basic set or (ii) decoding a second prediction parameter from a bit sequence having a length determined in accordance with the number of prediction parameters included in the reduced set, the second prediction parameter specifying a prediction parameter included in the reduced set,
the reduced set including at least one prediction parameter included in a prediction unit region in the vicinity of a target prediction unit selected from the plurality of prediction units,
the number of prediction parameters included in the reduced set being less than the number of prediction parameters included in the basic set which is constituted by predetermined prediction parameters,
the prediction parameter decoding means carrying out decoding of the prediction parameter, in accordance with a value of a flag which is included, per prediction unit, in the encoded data, so that (i) the prediction parameter is decoded by use of the first prediction parameter decoded by the second prediction parameter decoding means or (ii) the prediction parameter is decoded by use of the second prediction parameter decoded by the second prediction parameter decoding means, and
in a case where the prediction parameters which are identical with each other exist in at least two prediction parameters included in the prediction unit region in the vicinity of the target prediction unit, the prediction parameter decoding means causing the reduced set to include one of the prediction parameters which are identical with each other, and carrying out decoding, of the at least one prediction parameter per prediction unit, the at least one prediction parameter included in the reduced set.
2. The image decoding device as set forth in claim 1, wherein:
each of the prediction parameters is a motion vector used in motion compensation prediction.
3. The image decoding device as set forth in claim 1,
said second prediction parameter decoding means, in accordance with the value of the flag, (i) decoding a first prediction parameter which specifies a prediction parameter included in a basic set or (ii) decoding a second prediction parameter from a bit sequence having a length determined in accordance with the number of prediction parameters included in the reduced set, the second prediction parameter specifying a prediction parameter included in the reduced set.
5. The image encoding device as set forth in claim 4,
said second prediction parameter determination means, in accordance with the value of the flag, (i) determining a first prediction parameter which specifies a prediction parameter included in a basic set or (ii) determining a second prediction parameter which specifies a prediction parameter included in a reduced set.

b,f,j,n=B,
c,g,k,o=C,
d,h,l,p=D
(Prediction Mode 2)

In a case where the prediction mode thus assigned is the prediction mode 2 (DC prediction), the prediction image generation section 145 generates the pixel values “a” through “p” by use of the following formula.
a~p=ave(A,B,C,D,I,J,K,L)
Here, the “ave ( . . . )” means to obtain an average of elements included in the brackets.
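
The DC prediction described above can be sketched as follows. This is a minimal illustration only: the function name `dc_predict_4x4`, the 4×4 block size, and the round-to-nearest integer average are assumptions that do not appear in the description, which states only that an average of the eight neighboring pixels is used.

```python
def dc_predict_4x4(above, left):
    """Fill a 4x4 block with the average of the neighboring pixels.

    above: the four pixels A, B, C, D located above the block
    left:  the four pixels I, J, K, L located to the left of the block
    """
    neighbors = list(above) + list(left)
    # Integer average rounded to nearest; the rounding rule is an assumption.
    dc = (sum(neighbors) + len(neighbors) // 2) // len(neighbors)
    # Every pixel "a" through "p" receives the same value.
    return [[dc] * 4 for _ in range(4)]


block = dc_predict_4x4(above=[100, 102, 104, 106], left=[98, 99, 101, 103])
```

In this sketch all sixteen predicted pixels share one value, which is why DC prediction suits flat regions with no dominant edge direction.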
(Prediction Mode 4)

In a case where the prediction mode thus assigned is the prediction mode 4, the prediction image generation section 145 generates the pixel values “a” through “p” by use of the following formula.
d=(B+(C×2)+D+2)>>2,
c,h=(A+(B×2)+C+2)>>2,
b,g,l=(M+(A×2)+B+2)>>2,
a,f,k,p=(I+(M×2)+A+2)>>2,
e,j,o=(J+(I×2)+M+2)>>2,
i,n=(K+(J×2)+I+2)>>2,
m=(L+(K×2)+J+2)>>2
Here, “>>” represents a right shift operation. A value of “x>>s” (x and s are arbitrary integers) is equal to a value obtained in such a manner that a fractional part of a value of “x/(2^s)” is rounded down.
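
The formulas for the prediction mode 4 can be sketched as follows. The sketch assumes that the pixels “a” through “p” are arranged in raster order in a 4×4 block (a, b, c, d in the first row), that A through D are the pixels above the block, I through L the pixels to its left, and M the pixel at the upper-left corner; this layout and the function name `predict_mode4_4x4` are assumptions made for illustration.

```python
def predict_mode4_4x4(above, left, corner):
    """Apply the three-tap filter of the prediction mode 4 along each diagonal."""
    A, B, C, D = above
    I, J, K, L = left
    M = corner
    # Reference pixels ordered from bottom-left, through the corner, to top-right.
    ref = [L, K, J, I, M, A, B, C, D]
    pred = [[0] * 4 for _ in range(4)]
    for row in range(4):
        for col in range(4):
            # Pixels on the same diagonal (same col - row) share one value.
            center = 4 + (col - row)
            pred[row][col] = (
                ref[center - 1] + 2 * ref[center] + ref[center + 1] + 2
            ) >> 2
    return pred


pred = predict_mode4_4x4(above=[10, 20, 30, 40], left=[50, 60, 70, 80], corner=0)
```

For example, the pixel “d” (row 0, column 3) is computed from B, C, and D, and the pixel “m” (row 3, column 0) from L, K, and J, in agreement with the formulas above. In Python, `x >> s` on integers rounds toward negative infinity, which matches the rounding-down definition given for “>>”.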

Further, for a prediction mode other than the prediction modes described above, the prediction image generation section 145 can calculate the pixel values “a” through “p” in the same manner as described above.

<Additional Matters Related to Video Decoding Device>

The video decoding device of the present invention is thus described above. Note, however, that the present invention is not limited to the arrangements described above.

(Additional Matter 1)

For example, the second prediction parameter decoding section 45 can determine, in accordance with a flag included in the encoded data #1, whether or not the reduced set RS is used in decoding the prediction parameter #45.

More specifically, for example, in a case where a value of the flag included in the encoded data #1 is 1, the second prediction parameter decoding section 45 decodes the prediction parameter #45 by use of the reduced set RS. On the other hand, in a case where a value of the flag included in the encoded data #1 is 0, the second prediction parameter decoding section 45 decodes the prediction parameter #45 by use of the basic parameter set in place of the reduced set RS, for example.

With the arrangement, it is possible to reduce an amount of a process for decoding the prediction parameter #45.
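
The flag-controlled switching described in this additional matter can be sketched as follows. The sketch assumes that the flag is followed by a fixed-length index whose length depends on the size of the selected set; this bit layout, and the names `index_bits` and `decode_parameter`, are hypothetical and are not taken from the description.

```python
from math import ceil, log2


def index_bits(set_size):
    """Number of bits of a fixed-length index into a set of the given size."""
    return ceil(log2(set_size)) if set_size > 1 else 0


def decode_parameter(bits, basic_set, reduced_set):
    """Decode one prediction parameter from a list of 0/1 values.

    Returns the decoded parameter and the number of bits consumed.
    """
    flag = bits[0]
    # Flag 1 selects the reduced set RS, flag 0 the basic parameter set.
    candidates = reduced_set if flag == 1 else basic_set
    n = index_bits(len(candidates))
    index = 0
    for b in bits[1:1 + n]:
        index = (index << 1) | b
    return candidates[index], 1 + n


basic_set = list(range(9))   # nine prediction modes, 0 through 8
reduced_set = [1, 8, 6]      # example reduced set RS
from_reduced = decode_parameter([1, 1, 0], basic_set, reduced_set)
from_basic = decode_parameter([0, 0, 1, 0, 1], basic_set, reduced_set)
```

With the reduced set the index occupies 2 bits in this sketch, whereas the basic parameter set of nine modes requires 4 bits. The sketch performs no range check, so an index that exceeds the size of the selected set raises an error.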

(Additional Matter 2)

Further, the second prediction parameter decoding section 45 can determine, in accordance with the number of sub blocks included in the macro block, whether or not the reduced set RS is used in decoding the prediction parameter #45.

More specifically, in a case where the number of sub blocks included in the macro block is not less than 16, the second prediction parameter decoding section 45 decodes the prediction parameter #45 by use of the reduced set RS. On the other hand, in a case where the number of sub blocks included in the macro block is less than 16, the second prediction parameter decoding section 45 decodes the prediction parameter #45 by use of the basic parameter set in place of the reduced set RS, for example.

With the arrangement, it is possible to reduce an amount of the process for decoding the prediction parameter #45.
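
The rule based on the number of sub blocks can be expressed as the following sketch. The function name `choose_parameter_set` and the representation of parameter sets as lists are assumptions; the threshold of 16 is the value given in the example above.

```python
def choose_parameter_set(num_sub_blocks, basic_set, reduced_set, threshold=16):
    """Return the set from which the prediction parameter is decoded."""
    # The reduced set RS is used when the number of sub blocks is not
    # less than the threshold; otherwise the basic parameter set is used.
    if num_sub_blocks >= threshold:
        return reduced_set
    return basic_set


basic_set = list(range(9))
reduced_set = [1, 8, 6]
many = choose_parameter_set(16, basic_set, reduced_set)
few = choose_parameter_set(4, basic_set, reduced_set)
```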

(Additional Matter 3)

Further, in the above explanations, the basic parameter set is a set of prediction parameters, as shown in FIG. 5. However, the present invention is not limited to this.

For example, it is possible to have an arrangement in which the prediction parameter decoding section 144 employs a parameter set whose main direction is a horizontal direction, as shown in (a) of FIG. 16, or a parameter set whose main direction is a vertical direction, as shown in (b) of FIG. 16.

More specifically, for example, in a case where there is an edge in the horizontal direction in the macro block, the prediction parameter decoding section 144 uses, as the basic parameter set, the parameter set whose main direction is the horizontal direction, as shown in (a) of FIG. 16. Meanwhile, in a case where there is an edge in the vertical direction in the macro block, the prediction parameter decoding section 144 uses, as the basic parameter set, the parameter set whose main direction is the vertical direction, as shown in (b) of FIG. 16.

Further, in a case where such a plurality of basic parameter sets are selectively used, and the parameter set shown in (a) of FIG. 16 or the parameter set shown in (b) of FIG. 16 is selected as the basic parameter set, the second prediction parameter decoding section 45 decodes the prediction parameter #45 by use of the reduced set RS. In a case where the parameter set shown in FIG. 5 is selected as the basic parameter set, the second prediction parameter decoding section 45 decodes the prediction parameter #45 by use of the parameter set shown in FIG. 5 as the basic parameter set.

With the arrangement, it is possible to use the reduced set RS in accordance with a characteristic of an image in the macro block.

(Video Encoding Device)

The following description deals with a video encoding device (image encoding device) 2 of the present embodiment with reference to FIGS. 10 through 14. FIG. 10 is a block diagram illustrating an arrangement of the video encoding device 2. The video encoding device 2 includes a header information determination section 21, a header information encoding section 22, an MB setting section 23, an MB encoding section 24, a variable-length code multiplexing section 25, an MB decoding section 26, and a frame memory 27 (see FIG. 10).

In short, the video encoding device 2 generates encoded data #1 by encoding an input image #100, and outputs the encoded data #1 thus generated.

The header information determination section 21 determines header information on the basis of the input image #100. The header information thus determined is outputted as header information #21. The header information #21 includes an image size of the input image #100. The header information #21 is supplied to the MB setting section 23, and is also supplied to the header information encoding section 22.

The header information encoding section 22 encodes the header information #21 and outputs the header information #21 thus encoded as encoded header information #22. The encoded header information #22 is supplied to the variable-length code multiplexing section 25.

On the basis of the header information #21, the MB setting section 23 divides the input image #100 into a plurality of macro blocks, and outputs a plurality of macro block images #23 related to the respective plurality of macro blocks. The plurality of macro block images #23 are supplied to the MB encoding section 24 sequentially.
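
The division into macro blocks can be sketched as follows. The sketch assumes a macro block size of 16×16 pixels and an image whose width and height are multiples of that size; neither assumption is stated in this passage, and the function name `split_into_macro_blocks` is hypothetical.

```python
def split_into_macro_blocks(image, mb_size=16):
    """Divide an image (list of pixel rows) into macro blocks in raster order.

    Returns a list of ((top, left), block) pairs.
    """
    height = len(image)
    width = len(image[0])
    blocks = []
    for top in range(0, height, mb_size):
        for left in range(0, width, mb_size):
            block = [row[left:left + mb_size] for row in image[top:top + mb_size]]
            blocks.append(((top, left), block))
    return blocks


image = [[x + y for x in range(48)] for y in range(32)]  # 48 wide, 32 high
macro_blocks = split_into_macro_blocks(image)
```

The image size needed by the loops corresponds to the image size carried in the header information #21.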

The MB encoding section 24 encodes the plurality of macro block images #23 which are sequentially supplied, so as to generate MB encoded data #24. The MB encoded data #24 thus generated is supplied to the variable-length code multiplexing section 25. An arrangement of the MB encoding section 24 will be described later, and therefore an explanation of the arrangement of the MB encoding section 24 is omitted here.

The variable-length code multiplexing section 25 multiplexes the encoded header information #22 and the MB encoded data #24, so as to generate the encoded data #1. Then, the variable-length code multiplexing section 25 outputs the encoded data #1.

The MB decoding section 26 sequentially decodes the MB encoded data #24 corresponding to each of the plurality of macro blocks thus supplied, so as to generate a decoded image #26 corresponding to each of the plurality of macro blocks. Then, the MB decoding section 26 outputs the decoded image #26. The decoded image #26 is supplied to the frame memory 27.

The decoded image #26 thus supplied is stored in the frame memory 27. At the time when a specific macro block is encoded, decoded images corresponding to all macro blocks that precede the specific macro block in the raster scan order have already been stored in the frame memory 27.

(MB Encoding Section 24)

The following description deals with the MB encoding section 24 more specifically with reference to another drawing.

FIG. 11 is a block diagram illustrating an arrangement of the MB encoding section 24. The MB encoding section 24 includes a sub block dividing section 241, a prediction parameter determination section 242, a prediction parameter encoding section 243, a prediction residual generation section 244, a transform coefficient encoding section 245, a prediction residual decoding section 246, a sub block decoded image generation section 247, a prediction image generation section 248, and an MB encoded data generation section 249 (see FIG. 11).

The sub block dividing section 241 divides the macro block image #23 into a plurality of sub blocks. The sub block dividing section 241 sequentially outputs, in a predetermined order, (i) sub block position information #241a indicating a position of each of the plurality of sub blocks in the macro block, which plurality of sub blocks constitute the macro block, and (ii) a sub block image #241b which is image data related to the sub block indicated by the sub block position information #241a.

Note that it is preferable that the sub block dividing section 241 (i) outputs sub block position information #241a related to each of a plurality of sub blocks belonging to a first group (described later) and a sub block image #241b related to each of the plurality of the sub blocks belonging to the first group, and then (ii) outputs sub block position information #241a related to each of a plurality of sub blocks belonging to a second group (described later) and a sub block image #241b related to each of the plurality of sub blocks belonging to the second group. For example, it is preferable to have an arrangement in which the plurality of sub blocks belonging to the first group are first scanned in the raster scan order, and then, the plurality of sub blocks belonging to the second group are scanned in the raster scan order.
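
The preferred output order can be sketched as follows. The checkerboard classification used here is only one hypothetical way of forming the two groups, chosen for illustration, and the names `checkerboard_group` and `sub_block_output_order` are assumptions.

```python
def checkerboard_group(row, col):
    """Hypothetical classification: alternate groups like a checkerboard."""
    return 1 if (row + col) % 2 == 0 else 2


def sub_block_output_order(rows=4, cols=4, group_of=checkerboard_group):
    """List sub block positions: first group in raster order, then second group."""
    raster = [(r, c) for r in range(rows) for c in range(cols)]
    first = [pos for pos in raster if group_of(*pos) == 1]
    second = [pos for pos in raster if group_of(*pos) == 2]
    return first + second


order = sub_block_output_order()
```

Outputting the first group before the second group ensures that every prediction parameter of the first group is already determined when the second group is processed.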

The prediction parameter determination section 242 determines a prediction parameter #242 used to generate a prediction image related to the sub block indicated by the sub block position information #241a, and outputs the prediction parameter #242 thus determined. Further, the prediction parameter encoding section 243 encodes the prediction parameter #242, so as to obtain an encoded prediction parameter #243. Then, the prediction parameter encoding section 243 outputs the encoded prediction parameter #243. An arrangement of the prediction parameter determination section 242 and an arrangement of the prediction parameter encoding section 243 will be described later, and therefore explanations of these arrangements are omitted here.

On the basis of the sub block position information #241a, the prediction residual generation section 244 (i) specifies the sub block which is to be a prediction target, and (ii) generates a prediction residual #244 which is a difference image between the sub block image #241b corresponding to the sub block and a prediction image #248 generated by the prediction image generation section 248.

The transform coefficient encoding section 245 applies, with respect to the prediction residual #244, frequency transform whose size is identical with that of the sub block, so as to generate a transform coefficient of the prediction residual #244.

Further, the transform coefficient encoding section 245 generates a quantization transform coefficient #245a by quantizing the transform coefficient, and then generates a variable-length code by applying a variable-length code method (such as CABAC and CAVLC) with respect to the quantization transform coefficient #245a. After that, the transform coefficient encoding section 245 outputs the variable-length code as encoded data #245b.

The prediction residual decoding section 246 carries out inverse quantization with respect to the quantization transform coefficient #245a, and then generates a decoded residual #246 by applying inverse transform of the frequency transform (inverse frequency transform) with respect to the transform coefficient thus inversely quantized. After that, the prediction residual decoding section 246 outputs the decoded residual #246.

Note that the present invention is not limited to the processes carried out by the prediction residual generation section 244, the transform coefficient encoding section 245, and the prediction residual decoding section 246. For example, the transform coefficient encoding section 245 can omit the frequency transform, and can directly quantize the prediction residual.

The sub block decoded image generation section 247 generates a sub block decoded image #247 by adding the prediction image #248 and the decoded residual #246 to each other, and outputs the sub block decoded image #247.
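
The chain from the prediction residual to the sub block decoded image can be sketched as follows for the variant in which the frequency transform is omitted and the prediction residual is quantized directly. The uniform quantizer with step `qstep` and the function name `encode_and_reconstruct` are assumptions; the description does not specify a quantization rule.

```python
def encode_and_reconstruct(sub_block, prediction, qstep=4):
    """Quantize the prediction residual and rebuild the decoded sub block."""
    # Prediction residual: difference between the sub block and its prediction.
    residual = [
        [s - p for s, p in zip(s_row, p_row)]
        for s_row, p_row in zip(sub_block, prediction)
    ]
    # Quantization (assumed uniform, rounding to the nearest level).
    levels = [[(v + qstep // 2) // qstep for v in row] for row in residual]
    # Inverse quantization yields the decoded residual.
    decoded_residual = [[lv * qstep for lv in row] for row in levels]
    # Decoded image = prediction image + decoded residual.
    reconstructed = [
        [p + d for p, d in zip(p_row, d_row)]
        for p_row, d_row in zip(prediction, decoded_residual)
    ]
    return levels, reconstructed


levels, reconstructed = encode_and_reconstruct(
    sub_block=[[10, 20], [30, 41]],
    prediction=[[8, 8], [32, 32]],
)
```

Because the encoder rebuilds the same decoded image that a decoder would obtain, subsequent predictions on both sides are based on identical reference pixels.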

The prediction image generation section 248 generates a prediction image #248 corresponding to the prediction target sub block, on the basis of the prediction parameter #242, a decoded image #27, and the sub block decoded image #247, and outputs the prediction image #248 thus generated. A method identical with the method of generating the prediction image #145 by the prediction image generation section 145 described above can be applied to a specific method of generating the prediction image #248 by the prediction image generation section 248.

The MB encoded data generation section 249 accumulates (i) the encoded data #245b related to each of the plurality of sub blocks and (ii) the encoded prediction parameter #243 related to each of the plurality of sub blocks, so as to cause these to be integrated with each other per macro block. The MB encoded data generation section 249 thus generates MB encoded data #24 which is encoded data whose unit is a macro block, and then, outputs the MB encoded data #24.

The following description deals with the prediction parameter determination section 242 and the prediction parameter encoding section 243 with reference to another drawing.

(Prediction Parameter Determination Section 242)

FIG. 12 is a block diagram illustrating an arrangement of the prediction parameter determination section 242. The prediction parameter determination section 242 includes a group determination section 51, a switch section 52, a first prediction parameter determination section 53, a reduced set derivation section 54, and a second prediction parameter determination section 55 (see FIG. 12).

The group determination section 51 classifies the sub block indicated by the sub block position information #241a into one of a plurality of groups, and outputs, to the switch section 52, group information #51 indicating a result of the classification.

The group determination section 51 can classify the plurality of sub blocks into the first group and the second group, as explained above with reference to (a) through (f) of FIG. 4, for example.

Further, the group determination section 51 can classify the plurality of sub blocks into a plurality of groups by use of different classification methods for the plurality of macro blocks. For example, the group determination section 51 can classify a plurality of sub blocks constituting a macro block MB1 into two groups as shown in (a) and (b) of FIG. 4, while classifying a plurality of sub blocks constituting a macro block MB2, which is different from the macro block MB1, into two groups as shown in (c) and (d) of FIG. 4. With the arrangement in which different classification methods are used for the plurality of macro blocks as described above, it is preferable that the group determination section 51 outputs a flag indicating which classification method has been used. By transmitting the flag to the video decoding device for decoding the encoded data #1, the video decoding device can identify which one of the classification methods has been used by the group determination section 51.

On the basis of the group information #51, the switch section 52 transmits, to one of the first prediction parameter determination section 53 and the second prediction parameter determination section 55, the sub block image #241b which is image data related to the sub block indicated by the sub block position information #241a.

Specifically, in a case where the group determination section 51 classifies the sub block indicated by the sub block position information #241a into the first group, the switch section 52 transmits the sub block image #241b to the first prediction parameter determination section 53. On the other hand, in a case where the group determination section 51 classifies the sub block indicated by the sub block position information #241a into the second group, the switch section 52 transmits the sub block image #241b to the second prediction parameter determination section 55.

On the basis of the decoded image #27, the sub block decoded image #247, and the sub block image #241b, the first prediction parameter determination section 53 determines (selects) a prediction parameter #53 used to generate a prediction image related to each of the plurality of sub blocks belonging to the first group. Then, the first prediction parameter determination section 53 outputs the prediction parameter #53. Further, the prediction parameter #53 is also supplied to the reduced set derivation section 54.

In a case where, for example, the prediction parameter is an intra prediction mode defined in the H.264/MPEG-4 AVC standard, the first prediction parameter determination section 53 (i) selects, with respect to each of the plurality of sub blocks belonging to the first group, one of prediction modes in the basic parameter set which has been described above with reference to FIG. 5, and (ii) outputs the prediction mode thus selected.

Note that the present invention is not limited to the specific method of determining the specific prediction parameter #53, carried out by the first prediction parameter determination section 53. For example, the first prediction parameter determination section 53 can determine the prediction parameter #53 with respect to each of the plurality of sub blocks belonging to the first group so that a difference between the prediction image of the sub block and the input image #100 becomes smallest. For example, in a case where, with respect to a sub block SB1 belonging to the first group, a difference between a prediction image generated by use of the prediction mode 1 in the basic parameter set and the input image #100 becomes smallest, as compared with a case where another prediction mode in the basic parameter set is used, the first prediction parameter determination section 53 selects the prediction mode 1 with respect to the sub block SB1. In a case where, with respect to a sub block SB2 belonging to the first group, a difference between a prediction image generated by use of the prediction mode 2 in the basic parameter set and the input image #100 becomes smallest, as compared with a case where another prediction mode in the basic parameter set is used, the first prediction parameter determination section 53 selects the prediction mode 2 with respect to the sub block SB2.
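
The selection of the prediction mode that makes the difference smallest can be sketched as follows. The use of the sum of absolute differences (SAD) as the measure of the difference is an assumption, as are the function names; the description does not fix a specific measure.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two blocks of equal size."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(block_a, block_b)
        for a, b in zip(row_a, row_b)
    )


def select_prediction_mode(sub_block, candidate_predictions):
    """Pick the mode whose prediction image is closest to the sub block.

    candidate_predictions maps each prediction mode to its prediction image.
    """
    return min(
        candidate_predictions,
        key=lambda mode: sad(sub_block, candidate_predictions[mode]),
    )


sub_block = [[10, 10], [20, 20]]
candidates = {
    0: [[10, 20], [10, 20]],   # hypothetical prediction image for mode 0
    1: [[10, 10], [20, 21]],   # hypothetical prediction image for mode 1
    2: [[15, 15], [15, 15]],   # hypothetical prediction image for mode 2
}
best_mode = select_prediction_mode(sub_block, candidates)
```

Passing only the prediction images of the modes contained in a smaller candidate set restricts the search to that set without changing the function.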

(a) of FIG. 13 is a view showing an example of a prediction mode which is selected by the first prediction parameter determination section 53 with respect to each of the plurality of sub blocks belonging to the first group, among the plurality of sub blocks constituting the macro block MB. According to the example shown in (a) of FIG. 13, the first prediction parameter determination section 53 selects, with respect to each of the plurality of sub blocks belonging to the first group, one of the prediction mode 1, the prediction mode 6, and the prediction mode 8. In the present example, each of the prediction modes 1, 6, and 8 is supplied to the reduced set derivation section 54 as the prediction parameter #53.

With the operations described above, the reduced set derivation section 54 is supplied with the prediction parameter #53 related to each of the plurality of sub blocks belonging to the first group.

An arrangement of the reduced set derivation section 54 is the same as that of the reduced set derivation section 44 described above. That is, the reduced set derivation section 54 generates the reduced set RS by use of the prediction parameter #53. A method of generating the reduced set RS by the reduced set derivation section 54 is the same as the method of generating the reduced set RS by the reduced set derivation section 44 described above.

Note that, in a case where a plurality of generation methods are employed by the reduced set derivation section 54 selectively, it is preferable that the reduced set derivation section 54 outputs a flag indicating which one of the plurality of generation methods has been selected. By transmitting the flag to the video decoding device for decoding the encoded data #1, it becomes possible for the video decoding device to identify which one of the plurality of generation methods has been employed by the reduced set derivation section 54.

(b) of FIG. 13 is a view showing an example of the reduced set RS, which is generated by the reduced set derivation section 54 in a case where each of the prediction modes shown in (a) of FIG. 13 is supplied as the prediction parameter #53. According to the present example, the reduced set RS is constituted by a prediction mode 1, a prediction mode 8, and a prediction mode 6 (see (b) of FIG. 13).
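
One simple way of obtaining such a reduced set is to collect each distinct prediction mode selected for the first group once. The following sketch makes that assumption; the actual generation method is that of the reduced set derivation section 44, which is not reproduced here, and the input values below are hypothetical rather than taken from FIG. 13.

```python
def derive_reduced_set(first_group_params):
    """Collect each distinct prediction parameter once, in order of appearance."""
    reduced = []
    for param in first_group_params:
        if param not in reduced:
            reduced.append(param)
    return reduced


# Hypothetical prediction modes selected for eight first-group sub blocks.
selected_modes = [1, 1, 8, 6, 1, 8, 6, 6]
reduced_set = derive_reduced_set(selected_modes)
```

The resulting set never contains more entries than the number of distinct modes selected for the first group, so it is not larger than the basic parameter set.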

The second prediction parameter determination section 55 selects, from among prediction parameters included in the reduced set RS, a prediction parameter #55 used to generate a prediction image related to each of the plurality of sub blocks belonging to the second group. Then, the second prediction parameter determination section 55 outputs the prediction parameter #55.

The present invention is not limited to a specific method of determining the prediction parameter #55 by the second prediction parameter determination section 55. For example, the second prediction parameter determination section 55 selects, from among the prediction parameters included in the reduced set RS with respect to each of the plurality of sub blocks belonging to the second group, such a prediction parameter #55 that a prediction image of the sub block can be generated in the most appropriate manner.

(c) of FIG. 13 is a view showing an example of a prediction mode, which is selected from among the prediction modes included in the reduced set RS shown in (b) of FIG. 13 by the second prediction parameter determination section 55 with respect to each of the plurality of sub blocks belonging to the second group, among the plurality of sub blocks constituting the macro block MB. As shown in (c) of FIG. 13, one of the prediction modes 1, 6, and 8, included in the reduced set RS shown in (b) of FIG. 13, is selected with respect to each of the plurality of sub blocks belonging to the second group.

Generally, optimum prediction parameters with respect to a plurality of sub blocks constituting a macro block have a correlation with each other. Accordingly, a prediction parameter selected with respect to each of the plurality of sub blocks belonging to the first group is highly likely to be an optimum prediction parameter with respect to each of the plurality of sub blocks belonging to the second group.

Further, as described above, the second prediction parameter determination section 55 selects, from among the prediction parameters included in the reduced set RS, the prediction parameter #55 with respect to each of the plurality of sub blocks belonging to the second group. Accordingly, it is possible to reduce an encoding amount of the prediction parameters #55, as compared with a case where the reduced set RS is not used.

For example, as described later, in a case where a prediction mode and a flag indicating whether or not the prediction mode is identical with an estimate value are simultaneously encoded, the three prediction modes included in the reduced set RS shown in (b) of FIG. 13 can be encoded with an encoding amount of ceil (log2 (3−1))=1 bit. Meanwhile, in a case where the second prediction parameter determination section 55 selects a prediction parameter from the basic parameter set constituted by 9 prediction modes, without using the reduced set RS, a necessary encoding amount becomes ceil (log2 (9−1))=3 bits. Furthermore, in the above example, the second group is constituted by 8 sub blocks. Accordingly, as compared with the case where the reduced set RS is not used, it is possible to reduce, per macro block, the encoding amount by 3×8−1×8=16 bits, by using the reduced set RS.

Generally, in a case where a prediction parameter and the flag indicating whether or not the prediction parameter is identical with the estimate value are simultaneously encoded, it is possible to reduce, per macro block, the encoding amount by Ngs×(ceil (log2 (Nfs−1))−ceil (log2 (Nrs−1))) bits (where: Nfs is the number of prediction parameters which can be selected with respect to each of the plurality of sub blocks belonging to the first group; Nrs is the number of prediction parameters included in the reduced set RS; and Ngs is the number of sub blocks included in the second group) by use of the reduced set RS.
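
The reduction given by the above expression can be checked numerically with the following sketch. The function names are hypothetical; the arguments `nfs`, `nrs`, and `ngs` correspond to Nfs, Nrs, and Ngs.

```python
from math import ceil, log2


def mode_bits(set_size):
    """Bits needed per parameter: ceil(log2(set_size - 1))."""
    # For a set of one or two parameters no additional bits are needed.
    return ceil(log2(set_size - 1)) if set_size > 2 else 0


def bits_saved_per_macro_block(nfs, nrs, ngs):
    """Ngs x (ceil(log2(Nfs - 1)) - ceil(log2(Nrs - 1)))."""
    return ngs * (mode_bits(nfs) - mode_bits(nrs))


saved = bits_saved_per_macro_block(nfs=9, nrs=3, ngs=8)
```

With Nfs=9, Nrs=3, and Ngs=8, the function reproduces the reduction of 16 bits per macro block derived in the example above.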

As described above, by using the reduced set RS, it is possible to reduce the encoding amount necessary to encode a prediction parameter, without sacrificing encoding efficiency.

(Another Example of Arrangement of Prediction Parameter Determination Section 242)

In the above explanation as to the prediction parameter determination section 242, the reduced set derivation section 54 generates the reduced set RS per macro block. Note, however, that the present invention is not limited to the arrangement.

That is, the prediction parameter determination section 242 can have such an arrangement that (i) the reduced set derivation section 54 generates the reduced set RS per sub block, and (ii) the second prediction parameter determination section 55 determines a prediction parameter with respect to a prediction target sub block on the basis of the reduced set RS generated per sub block.

With the arrangement, the reduced set derivation section 54 carries out the same operations as those of the reduced set derivation section 44, explained in (Step S701) through (Step S703) of (Another example of arrangement of prediction parameter decoding section 144) above. Note, however, that the decoded prediction parameter in (Step S701) through (Step S703) of (Another example of arrangement of prediction parameter decoding section 144) corresponds to the encoded prediction parameter in the present example.

With the arrangement, the reduced set derivation section 54 can generate the reduced set RS per sub block. Further, the second prediction parameter determination section 55 can determine the prediction parameter with respect to the prediction target sub block, on the basis of the reduced set RS generated per sub block.

Generally, a prediction parameter with respect to the prediction target sub block has a correlation with a prediction parameter with respect to another sub block located in the vicinity of the prediction target sub block. Accordingly, a prediction parameter included in the reduced set RS generated by the operations described above is highly likely to be an optimum prediction parameter in prediction of a sub block belonging to the second group. Further, the number of prediction parameters included in the reduced set RS generated by the operations described above is generally smaller than the number of prediction parameters which can be selected with respect to sub blocks included in the first group.

Accordingly, the video encoding device 2 having the arrangement described above can generate encoded data #1 whose encoding amount is small, without sacrificing encoding efficiency.

Note that, in a case where there is no encoded prediction parameter with respect to a plurality of sub blocks included in the proximity sub block region NSR, the second prediction parameter determination section 55 selects a prediction parameter from the basic parameter set, for example.

Furthermore, the reduced set derivation section 54 of the present example can derive the reduced set RS by carrying out operations substantially similar to the operations described in the above (Example 1 of generation of reduced set RS) through (Example 4 of generation of reduced set RS). In this case, however, the “first group” described in the above (Example 1 of generation of reduced set RS) through (Example 4 of generation of reduced set RS) corresponds to the “proximity sub block region NSR” in the present example.

Moreover, in the above explanation, the reduced set RS is used with respect to the second group. Note, however, that the present invention is not limited to this. It is possible to apply the aforementioned operations to all the sub blocks in the macro block. That is, it is possible to have an arrangement in which, on the basis of a reduced set RS generated per sub block, a prediction parameter is determined with respect to each of the sub blocks in the macro block.

With the arrangement, it is possible to reduce an encoding amount of prediction parameters with respect to the sub blocks in the macro block. Accordingly, the video encoding device 2 having the arrangement described above can generate encoded data #1 whose encoding amount is further reduced.

(Prediction Parameter Encoding Section 243)

Next, the following description deals with the prediction parameter encoding section 243 with reference to FIG. 14. FIG. 14 is a block diagram illustrating an arrangement of the prediction parameter encoding section 243. The prediction parameter encoding section 243 includes a group determination section 61, a switch section 62, a first prediction parameter encoding section 63, a reduced set derivation section 64, and a second prediction parameter encoding section 65 (see FIG. 14).

The group determination section 61 has an arrangement which is substantially similar to that of the group determination section 51 described above. That is, the group determination section 61 (i) classifies a sub block indicated by sub block position information #241a into one of a plurality of predetermined groups, and (ii) outputs, to the switch section 62, group information #61 indicating a result of the classification. Note that, the group determination section 61 employs the same classification method as the classification method used by the group determination section 51, so as to classify each sub block into a corresponding one of the plurality of predetermined groups.

On the basis of the group information #61, the switch section 62 transmits a prediction parameter #242 related to a sub block indicated by the sub block position information #241a to one of the first prediction parameter encoding section 63 and the second prediction parameter encoding section 65.

Specifically, in a case where the group determination section 61 classifies the sub block indicated by the sub block position information #241a into the first group, the switch section 62 transmits the prediction parameter #242 to both the first prediction parameter encoding section 63 and the reduced set derivation section 64. On the other hand, in a case where the group determination section 61 classifies the sub block indicated by the sub block position information #241a into the second group, the switch section 62 transmits the prediction parameter #242 to the second prediction parameter encoding section 65.

The first prediction parameter encoding section 63 encodes the prediction parameter #242 related to each of the plurality of sub blocks belonging to the first group, so as to generate an encoded prediction parameter #63. Then, the first prediction parameter encoding section 63 outputs the encoded prediction parameter #63.

Specifically, the first prediction parameter encoding section 63 first sets, as an estimate value with respect to each of the plurality of sub blocks belonging to the first group, a prediction parameter selected with respect to another sub block located in the vicinity of the sub block.

Next, the first prediction parameter encoding section 63 encodes a flag indicating whether or not the prediction parameter selected with respect to the sub block is different from the estimate value. Further, in a case where the prediction parameter selected with respect to the sub block is different from the estimate value, the first prediction parameter encoding section 63 encodes the prediction parameter.

Here, in a case where the prediction parameter with respect to each of the plurality of sub blocks belonging to the first group is selected from the basic parameter set, the prediction parameter can be expressed by a code of 1 bit or a code of 4 bits (including the flag).
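
The 1-bit and 4-bit codes mentioned above can be illustrated as follows. This is only a sketch under stated assumptions and is not necessarily the coding actually employed: the function names are hypothetical, the flag value "1" is assumed to mean "different from the estimate value", and the 8 remaining prediction modes are assumed to be indexed by skipping the estimate value.

```python
NUM_BASIC_MODES = 9  # prediction modes 0 through 8 of the basic parameter set


def encode_first_group_mode(mode: int, estimate: int) -> str:
    """Return '0' (1 bit) if mode equals the estimate, else '1' plus a 3-bit index."""
    if not (0 <= mode < NUM_BASIC_MODES and 0 <= estimate < NUM_BASIC_MODES):
        raise ValueError("mode and estimate must be in the range 0 through 8")
    if mode == estimate:
        return "0"  # flag only
    # Assumption: the 8 modes other than the estimate are numbered 0 through 7.
    remaining = mode if mode < estimate else mode - 1
    return "1" + format(remaining, "03b")  # flag + 3 bits = 4 bits


def decode_first_group_mode(bits: str, estimate: int) -> int:
    """Inverse of encode_first_group_mode for the same estimate value."""
    if bits[0] == "0":
        return estimate
    remaining = int(bits[1:4], 2)
    return remaining if remaining < estimate else remaining + 1


print(encode_first_group_mode(2, 2))  # mode equal to the estimate value
print(encode_first_group_mode(5, 2))  # mode different from the estimate value
```

Because the estimate value is excluded from the index, 8 candidates remain, which is why 3 bits suffice in addition to the flag.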

As described above, by carrying out encoding by use of the estimate value, it is possible to enhance compressibility in encoding the prediction parameter #242 related to each of the plurality of sub blocks belonging to the first group.

Note that the first prediction parameter encoding section 63 can encode the prediction parameter #242 itself, related to each of the plurality of sub blocks belonging to the first group.

The reduced set derivation section 64 generates the reduced set RS by use of the prediction parameter #242 related to each of the plurality of sub blocks belonging to the first group. A method of generating the reduced set RS by the reduced set derivation section 64 is the same as the method of generating the reduced set RS by the reduced set derivation section 44 described above, and therefore explanation of the method of generating the reduced set RS is omitted here.

The second prediction parameter encoding section 65 encodes a prediction parameter which is included in the reduced set RS, that is, the prediction parameter selected with respect to each of the plurality of sub blocks belonging to the second group, so as to generate an encoded prediction parameter #65. Then, the second prediction parameter encoding section 65 outputs the encoded prediction parameter #65.

Specifically, the second prediction parameter encoding section 65 first sets, as an estimate value with respect to each of the plurality of sub blocks belonging to the second group, a prediction parameter selected with respect to another sub block located in the vicinity of the sub block.

In a case where the number Nrs of prediction parameters included in the reduced set RS is 1, the second prediction parameter encoding section 65 does not encode any prediction parameter, and finishes an encoding process of a second prediction parameter.

Next, the second prediction parameter encoding section 65 encodes a flag indicating whether or not the prediction parameter selected with respect to the sub block is different from the estimate value. Further, in a case where the prediction parameter selected with respect to the sub block is different from the estimate value, the second prediction parameter encoding section 65 encodes the prediction parameter.

Here, in a case where Nrs is 2, the second prediction parameter encoding section 65 finishes the encoding process of the second prediction parameter.

In a case where Nrs is not 2, the prediction parameter included in the reduced set RS can be expressed by a code of ceil (log2 (Nrs−1)) bits.
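
The case analysis on Nrs described above can be sketched as follows. The sketch is illustrative only: the function name `encode_second_group_param` is hypothetical, the flag value "1" is assumed to mean "different from the estimate value", and the estimate value is assumed to be a member of the reduced set RS (a case which the description does not address otherwise).

```python
def encode_second_group_param(param, estimate, reduced_set) -> str:
    """Return the bit string for a parameter selected from the reduced set."""
    nrs = len(reduced_set)
    if param not in reduced_set:
        raise ValueError("param must be a member of the reduced set")
    if estimate not in reduced_set:
        # Assumption of this sketch, see the lead-in paragraph.
        raise ValueError("estimate is assumed to be a member of the reduced set")
    if nrs == 1:
        return ""   # Nrs = 1: nothing is encoded
    if param == estimate:
        return "0"  # flag only
    if nrs == 2:
        return "1"  # Nrs = 2: the flag alone identifies the other parameter
    # Nrs > 2: index among the Nrs - 1 parameters other than the estimate,
    # using ceil(log2(Nrs - 1)) bits, computed as (Nrs - 2).bit_length().
    candidates = [p for p in reduced_set if p != estimate]
    width = (nrs - 2).bit_length()
    return "1" + format(candidates.index(param), f"0{width}b")


reduced_set = [0, 1, 4]  # hypothetical reduced set with Nrs = 3
print(encode_second_group_param(4, 0, reduced_set))
```

For Nrs=3, the code length is 1 bit when the selected prediction parameter equals the estimate value, and 1+1=2 bits otherwise.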

Further, the number Nrs of prediction parameters included in the reduced set RS is generally smaller than the number of prediction parameters which can be selected with respect to the plurality of sub blocks belonging to the first group.

Accordingly, by employing the reduced set RS, it is possible to encode a prediction parameter selected with respect to each of the plurality of sub blocks belonging to the second group, while having a reduction in encoding amount.

Further, as described above, by carrying out encoding by use of an estimate value, it is possible to enhance compressibility in encoding the prediction parameter #242 related to each of the plurality of sub blocks belonging to the second group.

Note that it is possible to have an arrangement in which the second prediction parameter encoding section 65 encodes the prediction parameter itself, selected with respect to each of the plurality of sub blocks belonging to the second group.

<Additional Matters Related to Video Encoding Device>

The video encoding device 2 of the present invention is thus explained above. Note, however, that the present invention is not limited to the arrangements described above.

(Additional Matter 1′)

For example, it is possible to have an arrangement in which the second prediction parameter determination section 55 (i) finds a spatial correlation between prediction parameters in the macro block, and (ii), in a case where the spatial correlation thus found is small, determines the prediction parameter #55 without using the reduced set RS. Further, it is preferable to have an arrangement in which, in a case where the second prediction parameter determination section 55 determines the prediction parameter #55 without using the reduced set RS, the video encoding device 2 (i) encodes a flag indicating that the reduced set RS is not used, and (ii) transmits the flag thus encoded to the video decoding device.

With the arrangement, it is possible to determine the prediction parameter #55 without using the reduced set RS, in a case where the spatial correlation between the prediction parameters is small. It is therefore possible to suppress an amount of a process for determining a prediction parameter.

(Additional Matter 2′)

Further, it is possible to have an arrangement in which the second prediction parameter determination section 55 determines, in accordance with the number of sub blocks included in the macro block, whether or not the reduced set RS is used to determine the prediction parameter #55.

More specifically, in a case where the number of sub blocks included in the macro block is not less than 16, the second prediction parameter determination section 55 determines the prediction parameter #55 by use of the reduced set RS. On the other hand, in a case where the number of sub blocks included in the macro block is less than 16, the second prediction parameter determination section 55 determines the prediction parameter #55 by use of the basic parameter set in place of the reduced set RS.

With the arrangement, it is possible to determine the prediction parameter #55 without using the reduced set RS, in a case where the spatial correlation between the prediction parameters is small. It is therefore possible to suppress an amount of a process for determining a prediction parameter.

(Additional Matter 3′)

Moreover, it is possible to have an arrangement in which, in accordance with a characteristic of an image in the macro block, the prediction parameter determination section 242 employs, as the basic parameter set, one of (i) a parameter set whose main direction is a horizontal direction (as shown in (a) of FIG. 16) and (ii) a parameter set whose main direction is a vertical direction (as shown in (b) of FIG. 16).

For example, the prediction parameter determination section 242 can employ, as the basic parameter set, the parameter set whose main direction is the horizontal direction (as shown in (a) of FIG. 16), in a case where there is a horizontal edge in the macro block. On the other hand, the prediction parameter determination section 242 can employ, as the basic parameter set, the parameter set whose main direction is the vertical direction (as shown in (b) of FIG. 16), in a case where there is a vertical edge in the macro block, for example.

Further, it is possible to have an arrangement in which (I), in a case where (i) a plurality of basic parameter sets are used selectively and (ii) the parameter set shown in (a) of FIG. 16 or the parameter set shown in (b) of FIG. 16 is selected as the basic parameter set, the second prediction parameter determination section 55 determines the prediction parameter #55 by use of the reduced set RS, and (II), in a case where the parameter set shown in FIG. 5 is selected as the basic parameter set, the second prediction parameter determination section 55 determines the prediction parameter #55 by use of the basic parameter set in place of the reduced set RS.

With the arrangement, it is possible to determine whether to use the reduced set RS in accordance with a characteristic of an image in the macro block. It is therefore possible to reduce effectively an encoding amount of prediction parameters.

(Data Structure of Encoded Data #1)

The following description deals with a data structure of encoded data #1 generated by the video encoding device 2, with reference to FIG. 15.

FIG. 15 is a view showing a bit stream structure of a bit stream #MBS for each of a plurality of macro blocks of the encoded data #1. As shown in FIG. 15, the bit stream #MBS includes sub block information #SB1 through #SBN (here, N is the number of sub blocks in the macro block), which is information related to sub blocks SB1 through SBN included in the macro block.

Further, as shown in FIG. 15, each sub block information #SBn (1≤n≤N) includes (i) sub block position information #Ln which is information indicating a position of a sub block SBn in the macro block, and (ii) prediction parameter information #Pn indicating a prediction parameter associated with the sub block SBn.

The sub block position information #Ln is referred to in the video decoding device for decoding the encoded data #1, so that the position of the sub block SBn in the macro block is specified. Particularly, in the video decoding device 1 described above, the sub block position information #Ln is information which is referred to, so that the sub block SBn is classified into a group.

The prediction parameter information #Pn is information for specifying, in the video decoding device for decoding the encoded data #1, a prediction parameter associated with the sub block SBn. Particularly, in the video decoding device 1 described above, the prediction parameter information #Pn is information indicating a prediction parameter among prediction parameters which can be selected with respect to a group to which the sub block SBn belongs.

For example, in a case where the sub block SBn belongs to the first group, and the basic parameter set is a prediction parameter set constituted by prediction parameters which can be selected with respect to the first group, the prediction parameter information #Pn is information indicating one of the prediction modes 0 through 8 included in the basic parameter set.

Further, in a case where the sub block SBn belongs to the second group, the prediction parameter information #Pn is information indicating a prediction parameter among prediction parameters included in the reduced set RS.

Furthermore, in a case where the encoded data #1 is data generated by encoding, simultaneously, a prediction mode and a flag indicating whether or not the prediction mode is identical with an estimate value, the prediction parameter information #Pn can be represented by a code of ceil (log2 (N−1)) bits. Here, N is the number of prediction parameters which can be selected with respect to the group to which the sub block SBn belongs.

Generally, the number of prediction parameters included in the reduced set RS is smaller than the number of prediction parameters which can be selected with respect to the first group. Accordingly, an encoding amount of the prediction parameters included in the encoded data #1 is small in a case where the reduced set RS is used, as compared with a case where the reduced set RS is not used.

Specifically, in a case where the encoded data #1 is data generated by encoding, simultaneously, a prediction mode and a flag indicating whether or not the prediction mode is identical with an estimate value, an encoding amount of the prediction parameter information #Pn is smaller by ceil (log2 (Nfs−1))−ceil (log2 (Nrs−1)) bits, as compared with a case where the reduced set RS is not used. Here, Nfs is the number of prediction parameters which can be selected with respect to each of the plurality of sub blocks belonging to the first group, and Nrs is the number of prediction parameters included in the reduced set RS.

As described above, by employing the reduced set RS, it is possible to reduce an encoding amount of the encoded data #1.

Note that the encoded data #1 may not include the sub block position information #Ln. For example, by setting, in advance, a sub block scanning order so that the sub block scanning order is used in both the encoding and the decoding in common, it becomes possible to determine a position of a sub block on the basis of information indicating a number of the sub block in the encoded data #1 in the sub block scanning order.
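
How a position can be recovered from a number in a shared scanning order may be sketched as follows. The raster scanning order, the 4×4 arrangement of sub blocks, and the sub block size of 4×4 pixels are assumptions made only for illustration; the description merely requires that the encoding and the decoding use the same order. The function name is hypothetical.

```python
def sub_block_position(index: int, blocks_per_row: int = 4, block_size: int = 4) -> tuple[int, int]:
    """Return the (x, y) pixel offset of the sub block within the macro block.

    Assumes a raster scan: left to right, then top to bottom.
    """
    if index < 0:
        raise ValueError("index must be non-negative")
    row, col = divmod(index, blocks_per_row)
    return (col * block_size, row * block_size)


# The sixth sub block in the scanning order (index 5) lies in row 1, column 1.
print(sub_block_position(5))
```

Since both sides derive the position from the same rule, the sub block position information #Ln need not be transmitted.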

<Example of Application to Another Prediction Parameter>

In the above explanation, the prediction mode in the intra prediction is mainly used as the prediction parameter, as an example. Note, however, that the present invention is not limited to this. The present invention is also applicable to another parameter generally used in generation of a prediction image in an encoding process/decoding process of a video.

The following description deals with (i) an example in which the present invention is applied to a case where a motion vector in motion compensation prediction is used as the prediction parameter and (ii) an example in which the present invention is applied to a case where a weighting factor in luminance compensation prediction is used as the prediction parameter.

(Example of Application to Motion Vector)

In the motion compensation prediction, a position in a region on a decoded image used in prediction of a prediction target sub block is represented by a prediction parameter called “motion vector”.

The motion vector is selected from a prediction parameter set in which the number of elements depends on a size of an image and an accuracy (compensation accuracy). For example, in a case where W is a width (the number of pixels in a horizontal direction) of an image, H is a height (the number of pixels in a vertical direction) of the image, and a compensation accuracy of the image is 0.25 pixel, a motion vector V is selected from a prediction parameter set S defined by the following formula.
S≡{V|V=((N/4),(M/4))}

(where: N is an integer satisfying an inequality of 0≤N<4W; and M is an integer satisfying an inequality of 0≤M<4H).

In a case where the prediction parameter is such a motion vector, the first prediction parameter determination section 53 determines a motion vector for each of the plurality of sub blocks belonging to the first group, and assigns the motion vector to the sub block. Further, the first prediction parameter decoding section 43 can decode the motion vector for each of the plurality of sub blocks belonging to the first group.

Each of the reduced set derivation sections 44 and 54 can generate the reduced set RS by use of the motion vector assigned to each of the plurality of sub blocks belonging to the first group. Furthermore, each of the reduced set derivation sections 44 and 54 can generate the reduced set RS by use of motion vectors assigned to sub blocks in the vicinity of the prediction target sub block, as described above.

As to each of the other sections of the video decoding device 1 and the video encoding device 2, it is possible to reduce an encoding amount of the prediction parameters included in the encoded data #1 by carrying out the same operations as those of the aforementioned arrangement employing a prediction mode as the prediction parameter.

Note that, in a case where the prediction parameter is a motion vector, it is possible to have, for example, an arrangement in which each of the reduced set derivation sections 44 and 54 adds, to the reduced set RS, such a motion vector that a norm of a difference vector between the motion vector and a motion vector assigned to a sub block in the vicinity of the prediction target sub block is not more than a certain value.

Generally, in a case where the prediction parameter is a motion vector, the prediction parameter set includes a large number of motion vectors. For example, in a case where W=2000 and H=1000, the number of motion vectors included in the prediction parameter set S described above is 8000×4000.

Meanwhile, the number of motion vectors accumulated in the reduced set RS has an upper limit which is equal to (i) the number of sub blocks belonging to the first group, or (ii) the number of sub blocks in the vicinity of the prediction target sub block. For example, even in a case where the sub blocks included in the proximity sub block region NSR shown in (b) of FIG. 8 are used, the number of motion vectors accumulated in the reduced set RS is only 24.
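
The difference in size between the prediction parameter set S and the reduced set RS can be made concrete as follows. This is only a sketch. The function names are hypothetical, and the plain fixed-length index of ceil(log2(n)) bits used for comparison is an assumption of this sketch, not a code defined in the present description.

```python
def motion_vector_set_size(width: int, height: int, steps_per_pixel: int = 4) -> int:
    """Number of vectors (N/4, M/4) with 0 <= N < 4W and 0 <= M < 4H."""
    return (steps_per_pixel * width) * (steps_per_pixel * height)


def fixed_length_bits(num_elements: int) -> int:
    """ceil(log2(n)) for n >= 1, computed with integer arithmetic."""
    return (num_elements - 1).bit_length()


full = motion_vector_set_size(2000, 1000)  # 8000 x 4000 elements
reduced = 24                               # upper limit in the NSR example
print(full, fixed_length_bits(full))
print(reduced, fixed_length_bits(reduced))
```

Under this assumption, an element of S needs 25 bits whereas an element of a reduced set RS of 24 motion vectors needs 5 bits.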

Accordingly, depending on a characteristic of a target image of the encoding/decoding, the number of motion vectors accumulated in the reduced set RS might not be sufficient in a case where the reduced set RS is generated only with the motion vectors assigned to the sub blocks in the vicinity of the prediction target sub block. This might cause a reduction in prediction accuracy, and an increase in encoding amount of residual data. By adding, to the reduced set RS, such a motion vector that a norm of a difference vector between the motion vector and another motion vector assigned to a sub block located in the vicinity of the prediction target sub block is not more than a certain value, it is possible to reduce an encoding amount of the prediction parameters without having an increase in encoding amount of residual data.
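
The addition of motion vectors located near the already assigned motion vectors may be sketched as follows. The sketch is illustrative only: motion vectors are represented as integer pairs (N, M) in quarter-pixel units, the Euclidean norm is assumed as the norm, the function name is hypothetical, and clipping to the range 0≤N<4W, 0≤M<4H is omitted for brevity.

```python
def extend_reduced_set(neighbor_mvs, max_norm_qpel: int):
    """Return the neighbours' vectors plus every quarter-pel vector whose
    difference from some neighbour vector has a norm not more than max_norm_qpel.

    neighbor_mvs: iterable of (N, M) integer pairs in quarter-pixel units.
    """
    reduced, seen = [], set()
    limit_sq = max_norm_qpel * max_norm_qpel
    for n, m in neighbor_mvs:
        for dn in range(-max_norm_qpel, max_norm_qpel + 1):
            for dm in range(-max_norm_qpel, max_norm_qpel + 1):
                # Compare squared norms to stay in integer arithmetic.
                if dn * dn + dm * dm <= limit_sq:
                    candidate = (n + dn, m + dm)
                    if candidate not in seen:
                        seen.add(candidate)
                        reduced.append(candidate)
    return reduced


# One neighbouring vector (2.0, 1.0) pixels = (8, 4) quarter-pel units,
# extended by vectors within a quarter pixel of it.
print(extend_reduced_set([(8, 4)], max_norm_qpel=1))
```

The threshold controls the trade-off: a larger value admits more candidate motion vectors and hence a larger reduced set RS.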

(Example of Application to Weighting Factor in Luminance Compensation Prediction)

In luminance compensation prediction, a luminance of the prediction target sub block is predicted by use of a value obtained by multiplying, by a weighting factor, each of a plurality of luminances of reference pictures, which are referred to in motion compensation prediction of a prediction target sub block.

The present invention can be applied to a case where the weighting factor is used as a prediction parameter. For example, each of the reduced set derivation sections 44 and 54 can generate the reduced set RS on the basis of the weighting factor assigned to a sub block located in the vicinity of the prediction target sub block.

Note that, in a case where the present invention is applied to the weighting factor, (i) a plurality of values, each of which is used as the weighting factor, may be associated with, respectively, a plurality of representative values, a total number of which is set in advance, and (ii) the plurality of representative values may be used as prediction parameters.

For example, in a case where a value W of a weighting factor can be one of real values (or values having a predetermined digit number) satisfying an inequality of 0≤W≤1, all values W of the weighting factor satisfying an inequality of (n/X)≤W≤((n+1)/X) are associated with a representative value wn, and the representative values wn are used as prediction parameters. Here, X is a natural number, and n is an integer satisfying an inequality of 0≤n≤X−1. With such association, all the real values satisfying an inequality of 0≤W≤1 can be mapped onto a set of representative values wn, a total number of which is X. Further, a specific value of X is set so that the encoding amount of the encoded data becomes small, for example.

As described above, by limiting the number of elements used as prediction parameters within a predetermined range, it becomes possible to reduce the encoding amount of the encoded data, as compared with a case where the weighting factor itself is used as a prediction parameter.
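
The association of weighting factor values with X representative values may be sketched as follows. The sketch rests on assumptions not stated in the description: a value lying exactly on a boundary between two intervals is assigned to the upper interval (except W=1, which is assigned to the last interval), and the midpoint of each interval is used as its representative value. The function names are hypothetical.

```python
def weight_interval_index(w: float, x: int) -> int:
    """Return n (0 <= n <= X - 1) such that n/X <= W <= (n + 1)/X."""
    if x < 1:
        raise ValueError("X must be a natural number")
    if not 0.0 <= w <= 1.0:
        raise ValueError("W must satisfy 0 <= W <= 1")
    # min() keeps W = 1 inside the last interval.
    return min(int(w * x), x - 1)


def representative_value(n: int, x: int) -> float:
    """Assumption: the midpoint of the n-th interval serves as its representative."""
    return (n + 0.5) / x


n = weight_interval_index(0.26, 4)
print(n, representative_value(n, 4))
```

With X=4, for example, only four prediction parameters remain, so that the index n instead of the real value W is subject to the encoding.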

(Additional Matters)

An image encoding device of the present invention, for encoding a difference between an input image and a prediction image, includes: classification means for (i) dividing a prediction image into a plurality of unit regions and (ii) classifying a plurality of prediction units included in each of the plurality of unit regions into a first group and a second group; first selection means for selecting, for each of a plurality of prediction units belonging to the first group, a prediction parameter designating how to generate a prediction image, the first selection means selecting the prediction parameter from a basic set constituted by predetermined prediction parameters; second selection means for selecting, for each of a plurality of prediction units belonging to the second group, a prediction parameter designating how to generate a prediction image, the second selection means selecting the prediction parameter from a reduced set which (i) includes at least a part of the prediction parameter(s) selected by the first selection means and (ii) is constituted by a prediction parameter(s), the number of which is not more than the number of prediction parameters included in the basic set; and prediction parameter encoding means for encoding (i) information indicating which one of prediction parameters is selected, for each of the plurality of prediction units belonging to the first group, by the first selection means and (ii) information indicating which one of prediction parameters is selected, for each of the plurality of prediction units belonging to the second group, by the second selection means.

According to the image encoding device having the arrangement described above, for each of the plurality of prediction units belonging to the second group, a prediction parameter designating how to generate a prediction image is selected from the reduced set which (i) includes at least a part of the prediction parameter(s) selected by the first selection means for each of the plurality of prediction units belonging to the first group which is included in the unit region in which the second group is included, and (ii) is constituted by a prediction parameter(s), the number of which is not more than the number of prediction parameters included in the basic set. The information indicating which one of prediction parameters is selected by the second selection means is encoded.

Here, generally, a prediction parameter with respect to each of the plurality of prediction units has a correlation with a prediction parameter with respect to another prediction unit located in the vicinity of the prediction unit. For this reason, the prediction parameter selected with respect to each of the plurality of prediction units belonging to the first group is highly likely to be an optimum prediction parameter with respect to each of the plurality of prediction units belonging to the second group. That is, the parameter selected from the reduced set is highly likely to be an optimum prediction parameter with respect to each of the plurality of prediction units belonging to the second group. Accordingly, with the arrangement described above, it is possible to encode a prediction parameter without having a reduction in encoding efficiency.

Further, with the arrangement described above, the reduced set includes at least a part of the prediction parameter(s) selected by the first selection means, and is constituted by prediction parameters, the number of which is not more than the number of prediction parameters included in the basic set. Accordingly, for each of the plurality of prediction units belonging to the second group, it is possible to have a reduction in encoding amount of information indicating which one of prediction parameters has been selected.

According to the arrangement described above, it is therefore possible to have a reduction in encoding amount of information designating a prediction parameter, without having a reduction in encoding efficiency.

Furthermore, it is preferable that the reduced set is constituted by only prediction parameters that (i) are selected by the first selection means and (ii) are different from each other.

According to the arrangement, it is possible to further reduce the number of prediction parameters included in the reduced set. Accordingly, it is possible to further reduce an encoding amount. More specifically, prediction parameters generally have a spatial correlation with each other, so that distribution of optimum prediction parameters with respect to a plurality of prediction units in a certain region becomes inhomogeneous. For this reason, it is highly likely that, in the certain region, only a part of various prediction parameters included in the basic set is used. Accordingly, the number of prediction parameters in a certain region, selected by the first selection means, is highly likely to be smaller than the number of prediction parameters included in the basic set. For this reason, by employing, as the reduced set, a group of all the prediction parameters which (i) are selected by the first selection means and (ii) are different from each other, it is possible to reduce the number of elements of the reduced set.
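
The construction of a reduced set from only mutually different prediction parameters may be sketched as follows. The function name and the example values are hypothetical, and keeping the order of first occurrence is a choice of this sketch.

```python
def derive_reduced_set(first_group_params):
    """Return the distinct prediction parameters selected for the first group,
    in order of first occurrence."""
    # dict.fromkeys removes duplicates while preserving insertion order.
    return list(dict.fromkeys(first_group_params))


# Hypothetical prediction modes selected for 8 prediction units of the first group.
selected = [1, 1, 6, 1, 8, 6, 1, 1]
print(derive_reduced_set(selected))
```

In this hypothetical case, only 3 of the 9 prediction modes of the basic set appear in the first group, so the reduced set has 3 elements.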

Moreover, it is preferable that each of the plurality of unit regions is a unit used in encoding carried out by the image encoding device.

According to the arrangement, the generation process of the reduced set is carried out only once per encoding unit (e.g., each macro block defined in the H.264/MPEG-4 AVC standard). Accordingly, it is possible to reduce an amount of the process for generating the reduced set.

Further, the image encoding device of the present invention is preferably arranged such that the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group are arranged to form a checkered pattern (checker board pattern).

Generally, a prediction parameter with respect to each of a plurality of prediction units has a correlation with a prediction parameter with respect to another prediction unit located in the vicinity of the prediction unit.

According to the arrangement, the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group are arranged to form the checkered pattern (checker board pattern). Accordingly, a prediction parameter selected with respect to each of the plurality of prediction units belonging to the first group is highly likely to be an optimum prediction parameter with respect to each of the plurality of prediction units belonging to the second group.

According to the arrangement, it is thus possible to reduce an encoding amount necessary for the encoding of a prediction parameter, without sacrificing encoding efficiency.
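
A checkered classification of this kind can be sketched as below. The sketch assumes that the prediction units of a unit region form a rectangular grid and that the units whose row and column indices sum to an even number form the first group; which parity is assigned to which group, and the name `classify_checkerboard`, are assumptions made for illustration.

```python
from typing import List, Tuple

Position = Tuple[int, int]


def classify_checkerboard(rows: int, cols: int) -> Tuple[List[Position], List[Position]]:
    """Split a rows x cols grid of prediction units into two groups.

    Units with an even (row + col) go to the first group and the others to the
    second group, so every edge-adjacent neighbour of a second-group unit
    belongs to the first group.
    """
    first: List[Position] = []
    second: List[Position] = []
    for r in range(rows):
        for c in range(cols):
            if (r + c) % 2 == 0:
                first.append((r, c))
            else:
                second.append((r, c))
    return first, second


# A hypothetical unit region made of 4 x 4 prediction units.
first_group, second_group = classify_checkerboard(4, 4)
print(len(first_group), len(second_group))  # 8 8
```

With this layout each prediction unit of the second group is surrounded on its sides by prediction units of the first group, which is why the parameters selected for the first group are likely to suit the second group as well.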

Furthermore, each of the prediction parameters can be a prediction parameter designating a prediction mode in intra prediction.

According to the arrangement, it is possible to reduce an encoding amount necessary for encoding of a prediction mode in intra prediction, without sacrificing encoding efficiency.

Moreover, it is preferable that the reduced set (i) includes the prediction parameter(s) selected by the first selection means and (ii) includes at least one of a vertical direction prediction mode in the intra prediction, a horizontal direction prediction mode in the intra prediction, and a DC prediction mode in the intra prediction, and, for each of the plurality of prediction units belonging to the second group, the second selection means selects, from the reduced set, the prediction parameter designating how to generate a prediction image.

Generally, the vertical direction prediction mode, the horizontal direction prediction mode, and the DC prediction mode are highly frequently selected in the intra prediction.

According to the arrangement, the second selection means selects, for each of the plurality of prediction units belonging to the second group, a prediction parameter designating how to generate a prediction image, and selects the prediction parameter from the reduced set which (i) includes the prediction parameter(s) selected by the first selection means and (ii) includes at least one of the vertical direction prediction mode in the intra prediction, the horizontal direction prediction mode in the intra prediction, and the DC prediction mode in the intra prediction. Accordingly, it is possible to reduce an encoding amount without having a reduction in prediction accuracy in generation of a prediction image for each of the plurality of prediction units belonging to the second group.
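
A reduced set of this kind can be sketched as below. The mode numbers follow the H.264/MPEG-4 AVC Intra 4x4 numbering (0: vertical, 1: horizontal, 2: DC) and are used here only as labels. Adding all three modes, rather than just one of them, and the function name `derive_reduced_set_with_defaults` are assumptions of this sketch.

```python
from typing import Iterable, List, Sequence

# Mode labels borrowed from the H.264 Intra 4x4 numbering.
VERTICAL, HORIZONTAL, DC = 0, 1, 2


def derive_reduced_set_with_defaults(
    first_group_params: Iterable[int],
    defaults: Sequence[int] = (VERTICAL, HORIZONTAL, DC),
) -> List[int]:
    """Distinct first-group parameters followed by any missing default modes."""
    reduced: List[int] = []
    for param in list(first_group_params) + list(defaults):
        if param not in reduced:
            reduced.append(param)
    return reduced


# Hypothetical first-group selections: mode 5 twice, then the vertical mode.
print(derive_reduced_set_with_defaults([5, 5, VERTICAL]))  # [5, 0, 1, 2]
```

Even when the first group happens to use few modes, the frequently selected modes remain available to the second group.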

Further, it is possible that the second selection means selects the prediction parameter in such a manner that (i), in a case where the number of a plurality of prediction units in each of the plurality of unit regions, constituted by the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group, is not less than a predetermined threshold value, the second selection means selects a prediction parameter from the reduced set, and (ii), in a case where said number is less than the predetermined threshold value, the second selection means selects a prediction parameter from the basic set.

According to the arrangement, it is possible to select a prediction parameter from the basic set in a case where the number of prediction units in the unit region is less than the predetermined threshold value. It is therefore possible to reduce an amount of a process for selecting a prediction parameter.
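
The switching described above can be sketched as follows. The sketch assumes that the derivation of the reduced set is passed in as a callable, so that it is invoked only when the number of prediction units reaches the threshold value; the names `choose_candidate_set` and `derive_reduced` are hypothetical.

```python
from typing import Callable, List, Sequence


def choose_candidate_set(
    basic_set: Sequence[int],
    num_units: int,
    threshold: int,
    derive_reduced: Callable[[], List[int]],
) -> List[int]:
    """Return the set from which second-group parameters are selected.

    The reduced set is derived only when the unit region holds at least
    `threshold` prediction units; otherwise the basic set is used as is.
    """
    if num_units >= threshold:
        return derive_reduced()
    return list(basic_set)


basic = list(range(9))  # hypothetical basic set of nine parameters
print(choose_candidate_set(basic, 16, 4, lambda: [2, 0, 1]))  # [2, 0, 1]
print(choose_candidate_set(basic, 1, 4, lambda: [2, 0, 1]))   # [0, 1, ..., 8]
```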

The image encoding device of the present invention can be arranged such that the basic set is capable of being set per unit region, and the second selection means selects the prediction parameter in such a manner that (i), in a case where the basic set, which is set with respect to each of the plurality of unit regions, constituted by the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group, satisfies a specific condition, the second selection means selects a prediction parameter from the reduced set, and (ii), in a case where the basic set does not satisfy the specific condition, the second selection means selects a prediction parameter from the basic set.

According to the arrangement, it is possible that the basic set is capable of being set per unit region, and the second selection means selects the prediction parameter in such a manner that (i), in a case where the basic set, which is set with respect to each of the plurality of unit regions, constituted by the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group, satisfies a specific condition, the second selection means selects a prediction parameter from the reduced set, and (ii), in a case where the basic set does not satisfy the specific condition, the second selection means selects a prediction parameter from the basic set. Accordingly, it is possible to reduce an encoding amount while reducing an amount of a process for selecting a prediction parameter.

Moreover, an image encoding device of the present invention can be expressed as an image encoding device for encoding a difference between an input image and a prediction image, including: selection means for selecting, for each of a plurality of prediction units, a prediction parameter designating how to generate a prediction image, the selection means selecting the prediction parameter from a reduced set including at least a part of a prediction parameter(s) designating how to generate a prediction image(s) for a prediction unit(s) which (i) is located in the vicinity of a corresponding one of the plurality of prediction units and (ii) has been encoded; and prediction parameter encoding means for encoding, for each of the plurality of prediction units, information indicating which one of prediction parameters has been selected by the selection means.

Generally, a prediction parameter with respect to each of a plurality of prediction units has a correlation with a prediction parameter with respect to another prediction unit located in the vicinity of the prediction unit. Accordingly, the reduced set is highly likely to include an optimum prediction parameter in generation of a prediction image of each of the plurality of prediction units. Further, the reduced set is constituted by at least a part of the prediction parameter(s) with respect to the prediction unit(s) which is located in the vicinity of a corresponding one of the plurality of prediction units. Accordingly, the number of prediction parameters included in the reduced set is smaller than the number of prediction parameters included in a parameter set constituted by prediction parameters with respect to prediction units other than each of the plurality of prediction units.

According to the image encoding device of the present invention, having the arrangement described above, it is therefore possible to generate encoded data whose encoding amount is small, without sacrificing encoding efficiency.
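
A reduced set built from neighbouring, already-encoded prediction units can be sketched as below. Which prediction units count as being "in the vicinity" is an assumption of the sketch (here the left, upper, upper-left, and upper-right units, which precede the target in raster order). The name `neighbour_reduced_set` and the dictionary representation are likewise hypothetical.

```python
from typing import Dict, List, Tuple

Position = Tuple[int, int]


def neighbour_reduced_set(encoded: Dict[Position, int], row: int, col: int) -> List[int]:
    """Distinct parameters of already-encoded units around (row, col).

    `encoded` maps the position of each already-encoded prediction unit to the
    parameter selected for it. Neighbours that are missing (outside the
    picture or not yet encoded) are skipped.
    """
    reduced: List[int] = []
    for pos in ((row, col - 1), (row - 1, col), (row - 1, col - 1), (row - 1, col + 1)):
        param = encoded.get(pos)
        if param is not None and param not in reduced:
            reduced.append(param)
    return reduced


# Hypothetical parameters of units encoded before the unit at (1, 1).
encoded = {(0, 0): 3, (0, 1): 3, (0, 2): 1, (1, 0): 0}
print(neighbour_reduced_set(encoded, 1, 1))  # [0, 3, 1]
```

The sketch returns an empty list when no neighbour is available; how such a case is handled (for example, by falling back to another set) is left open here.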

Further, it is preferable that the prediction parameter encoding means employs, as a code indicating which one of prediction parameters is selected by the second selection means, a code having a length shorter than that of a code indicating which one of prediction parameters is used by the first selection means.

According to the arrangement, the prediction parameter encoding means can employ, as a code indicating which one of prediction parameters is selected by the second selection means, a code having a length shorter than that of a code indicating which one of prediction parameters is used by the first selection means. Accordingly, it is possible to encode, for each of the plurality of prediction units belonging to the second group, by use of the code having a short length, information indicating which one of prediction parameters is selected.
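
The difference in code length can be illustrated with a simple fixed-length binary code. The specification does not fix the code to be used, so the fixed-length code and the names `fixed_length_bits` and `encode_index` are assumptions of this sketch.

```python
from typing import List


def fixed_length_bits(set_size: int) -> int:
    """Number of bits needed to index `set_size` elements with a fixed-length code."""
    if set_size < 1:
        raise ValueError("set must contain at least one element")
    return (set_size - 1).bit_length()


def encode_index(param: int, candidates: List[int]) -> str:
    """Encode the position of `param` in `candidates` as a bit string."""
    bits = fixed_length_bits(len(candidates))
    index = candidates.index(param)
    return format(index, "b").zfill(bits) if bits else ""


basic_set = list(range(9))   # hypothetical basic set: 9 parameters -> 4 bits
reduced_set = [2, 0, 1]      # hypothetical reduced set: 3 parameters -> 2 bits

print(encode_index(2, basic_set))    # 0010
print(encode_index(2, reduced_set))  # 00
```

Under these assumptions the same parameter costs four bits when selected from the basic set and two bits when selected from the reduced set.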

Furthermore, it is preferable that the reduced set (i) includes the prediction parameter(s) selected by the first selection means, and (ii) is constituted by a prediction parameter(s), the number of which is 2^n+1 (n is a natural number), and, for each of the plurality of prediction units belonging to the second group, the second selection means selects, from the reduced set, the prediction parameter designating how to generate a prediction image.

Generally, in a case where a group of elements, the number of which is 2^n+1 (n is a natural number), is encoded, compression efficiency is improved as compared with a case where a group of elements, the number of which is not 2^n+1, is encoded.

According to the arrangement, it is possible to encode, for each of the plurality of prediction units belonging to the second group, information indicating which one of prediction parameters is selected from the reduced set that (i) includes the prediction parameter(s) selected by the first selection means, and (ii) is constituted by prediction parameters, the number of which is 2^n+1 (n is a natural number). Accordingly, it is possible to have an additional effect of enhancing compression efficiency in encoding a prediction parameter.
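
One way to see why such an element count is convenient is sketched below, reading the count as 2^n + 1. The sketch assumes a hypothetical code in which a one-bit flag indicates the first element of the set and, otherwise, the remaining 2^n elements are indexed with exactly n bits, so that no n-bit code word is wasted. This scheme, the treatment of n as at least 1, and the names `flag_plus_index_bits` and `encode_with_flag` are assumptions and are not stated in the specification.

```python
from typing import List, Optional


def flag_plus_index_bits(set_size: int) -> Optional[int]:
    """Return n if set_size == 2**n + 1 for some n >= 1, otherwise None."""
    rest = set_size - 1
    if rest >= 2 and rest & (rest - 1) == 0:
        return rest.bit_length() - 1
    return None


def encode_with_flag(param: int, candidates: List[int]) -> str:
    """Encode `param` as '1' (first element) or '0' followed by an n-bit index."""
    n = flag_plus_index_bits(len(candidates))
    if n is None:
        raise ValueError("candidate count is not of the form 2**n + 1")
    index = candidates.index(param)
    if index == 0:
        return "1"
    return "0" + format(index - 1, "b").zfill(n)


reduced_set = [2, 0, 1, 7, 8]  # hypothetical reduced set with 2**2 + 1 elements
print(encode_with_flag(2, reduced_set))  # 1
print(encode_with_flag(7, reduced_set))  # 010
```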

Further, an image decoding device of the present invention, for decoding encoded data which is obtained in such a manner that (i) a difference between an original image and a prediction image is encoded for each of a plurality of prediction units, and, simultaneously, (ii) selection information is encoded, the selection information indicating which one of a plurality of prediction parameters, each designating how to generate a prediction image, is selected for each of the plurality of prediction units, includes: classification means for classifying a plurality of prediction units included in each of a plurality of unit regions constituting the prediction image into a first group and a second group; first selection means for selecting, for each of the plurality of prediction units belonging to the first group, a prediction parameter designating how to generate a prediction image, the first selection means selecting, by referring to selection information for each of a plurality of prediction units belonging to the first group, the prediction parameter from a basic set constituted by predetermined prediction parameters; and second selection means for selecting, for each of the plurality of prediction units belonging to the second group, a prediction parameter designating how to generate a prediction image, the second selection means selecting, by referring to selection information for each of a plurality of prediction units belonging to the second group, the prediction parameter from a reduced set which (i) includes at least a part of the prediction parameter(s) selected by the first selection means and (ii) is constituted by a prediction parameter(s), the number of which is not more than the number of the predetermined prediction parameters included in the basic set.

According to the image decoding device having the arrangement described above, it is possible to select, for each of the plurality of prediction units belonging to the second group, a prediction parameter designating how to generate a prediction image, from the reduced set which (i) includes at least a part of the prediction parameter(s) selected by the first selection means for each of the plurality of prediction units belonging to the first group which is included in the unit region to which the second group belongs, and (ii) is constituted by a prediction parameter(s), the number of which is not more than the number of prediction parameters included in the basic set.

Here, generally, a prediction parameter with respect to each of the prediction units has a correlation with a prediction parameter with respect to another prediction unit located in the vicinity of the prediction unit. Accordingly, the prediction parameter selected for each of the plurality of prediction units belonging to the first group is highly likely to be an optimum prediction parameter with respect to each of the plurality of prediction units belonging to the second group. With the arrangement described above, it is therefore possible to decode, without having a reduction in encoding efficiency, the prediction parameter from selection information having a reduction in encoding amount.
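
The decoding order implied by this arrangement can be sketched as follows. The sketch assumes that the selection information has already been parsed into one integer index per prediction unit, and that the reduced set consists of the distinct first-group parameters in order of first appearance. The name `decode_region` and the data layout are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Position = Tuple[int, int]


def decode_region(
    indices: Dict[Position, int],
    basic_set: Sequence[int],
    first_group: Sequence[Position],
    second_group: Sequence[Position],
) -> Dict[Position, int]:
    """Recover the prediction parameter of every prediction unit in a unit region."""
    params: Dict[Position, int] = {}

    # First group: each index refers to the basic set.
    for pos in first_group:
        params[pos] = basic_set[indices[pos]]

    # Reduced set: distinct parameters decoded for the first group.
    reduced: List[int] = []
    for pos in first_group:
        if params[pos] not in reduced:
            reduced.append(params[pos])

    # Second group: each index refers to the (smaller) reduced set.
    for pos in second_group:
        params[pos] = reduced[indices[pos]]
    return params


# Hypothetical 2 x 2 unit region with a checkered grouping.
first = [(0, 0), (1, 1)]
second = [(0, 1), (1, 0)]
indices = {(0, 0): 4, (1, 1): 1, (0, 1): 0, (1, 0): 1}
decoded = decode_region(indices, list(range(9)), first, second)
print(decoded[(0, 1)], decoded[(1, 0)])  # 4 1
```

Because the reduced set is rebuilt from parameters that the decoder has already recovered, no additional information describing the reduced set needs to be carried in this sketch.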

Furthermore, it is preferable that the reduced set includes only prediction parameters that (i) are selected by the first selection means and (ii) are different from each other.

According to the arrangement, it is possible to further reduce the number of prediction parameters included in the reduced set. Accordingly, it is possible to further reduce an encoding amount.

Moreover, it is preferable that each of the plurality of unit regions is a unit used in decoding carried out by the image decoding device.

According to the arrangement, it is possible to carry out a process for generating the reduced set only once per decoding unit (e.g., each macro block defined in the H.264/MPEG-4 AVC standard). Accordingly, it is possible to reduce an amount of the process for generating the reduced set.

Further, the image decoding device of the present invention is preferably arranged such that the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group are arranged to form a checkered pattern (checker board pattern).

Generally, a prediction parameter with respect to each of a plurality of prediction units has a correlation with a prediction parameter with respect to another prediction unit located in the vicinity of the prediction unit.

According to an image encoding device having an arrangement corresponding to the arrangement described above, the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group are arranged to form the checkered pattern (checker board pattern). Accordingly, it is possible to select an optimum prediction parameter with respect to each of the plurality of prediction units belonging to the second group. According to the image encoding device having the arrangement corresponding to the arrangement described above, it is therefore possible to reduce an encoding amount necessary for the encoding of a prediction parameter, without sacrificing encoding efficiency.

According to the image decoding device having the arrangement described above, it is possible to decode the encoded data whose encoding amount is reduced in the manner described above.

Furthermore, the image decoding device of the present invention can be arranged such that each of the prediction parameters is a prediction parameter designating a prediction mode in intra prediction.

According to the arrangement described above, it is possible to decode, without sacrificing encoding efficiency, such encoded data that, in the intra prediction, an encoding amount of prediction modes is reduced.

Moreover, it is preferable that the reduced set (i) includes the prediction parameter(s) selected by the first selection means and (ii) includes at least one of a vertical direction prediction mode in the intra prediction, a horizontal direction prediction mode in the intra prediction, and a DC prediction mode in the intra prediction, and for each of the plurality of prediction units belonging to the second group, the second selection means selects, from the reduced set, the prediction parameter designating how to generate a prediction image.

Generally, the vertical direction prediction mode, the horizontal direction prediction mode, and the DC prediction mode are highly frequently selected in the intra prediction.

According to an image encoding device having an arrangement corresponding to the arrangement described above, the second selection means selects, for each of the plurality of prediction units belonging to the second group, a prediction parameter designating how to generate a prediction image, and selects the prediction parameter from the reduced set which (i) includes the prediction parameter(s) selected by the first selection means and (ii) includes at least one of the vertical direction prediction mode in the intra prediction, the horizontal direction prediction mode in the intra prediction, and the DC prediction mode in the intra prediction. Accordingly, it is possible to reduce an encoding amount without having a reduction in prediction accuracy in generation of a prediction image for each of the plurality of prediction units belonging to the second group.

According to the image decoding device having the arrangement described above, it is possible to decode the encoded data whose encoding amount is reduced in the manner described above.

Further, the image decoding device of the present invention can be arranged such that the second selection means selects the prediction parameter in such a manner that (i), in a case where the number of a plurality of prediction units in each of the plurality of unit regions, constituted by the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group, is not less than a predetermined threshold value, the second selection means selects a prediction parameter from the reduced set, and (ii), in a case where said number is less than the predetermined threshold value, the second selection means selects a prediction parameter from the basic set.

According to the arrangement, it is possible to select a prediction parameter from the basic set in a case where the number of prediction units in the unit region is less than the predetermined threshold value. It is therefore possible to reduce an amount of a process for selecting a prediction parameter.

Furthermore, the image decoding device of the present invention can be arranged such that the basic set is capable of being set per unit region, and the second selection means selects the prediction parameter in such a manner that (i), in a case where the basic set, which is set with respect to each of the plurality of unit regions, constituted by the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group, satisfies a specific condition, the second selection means selects a prediction parameter from the reduced set, and (ii), in a case where the basic set does not satisfy the specific condition, the second selection means selects a prediction parameter from the basic set.

According to the arrangement, it is possible that the basic set is capable of being set per unit region, and the second selection means selects the prediction parameter in such a manner that (i), in a case where the basic set, which is set with respect to each of the plurality of unit regions, constituted by the plurality of prediction units belonging to the first group and the plurality of prediction units belonging to the second group, satisfies a specific condition, the second selection means selects a prediction parameter from the reduced set, and (ii), in a case where the basic set does not satisfy the specific condition, the second selection means selects a prediction parameter from the basic set. Accordingly, it is possible to decode the encoded data whose encoding amount is reduced, while reducing an amount of a process for selecting a prediction parameter.

Moreover, an image decoding device of the present invention can be expressed as an image decoding device for decoding encoded data which is obtained in such a manner that (i) a difference between an input image and a prediction image is encoded for each of a plurality of prediction units, and, simultaneously, (ii) selection information is encoded, the selection information indicating which one of a plurality of prediction parameters, each designating how to generate a prediction image, is selected for each of the plurality of prediction units, including: selection means for selecting, for each of a plurality of prediction units, a prediction parameter designating how to generate a prediction image, the selection means selecting the prediction parameter from a reduced set including at least a part of a prediction parameter(s) designating how to generate a prediction image(s) for a prediction unit(s) which (i) is located in the vicinity of a corresponding one of the plurality of prediction units and (ii) has been decoded.

Generally, a prediction parameter with respect to each of a plurality of prediction units has a correlation with a prediction parameter with respect to another prediction unit located in the vicinity of the prediction unit. Accordingly, the reduced set is highly likely to include an optimum prediction parameter in generation of a prediction image of each of the plurality of prediction units. Further, the reduced set is constituted by at least a part of the prediction parameter(s) with respect to the prediction unit(s) which is located in the vicinity of a corresponding one of the plurality of prediction units. Accordingly, the number of prediction parameters included in the reduced set is smaller than the number of prediction parameters included in a parameter set constituted by prediction parameters with respect to prediction units other than each of the plurality of prediction units.

According to an image encoding device having an arrangement corresponding to the arrangement described above, it is therefore possible to generate encoded data whose encoding amount is small, without sacrificing encoding efficiency.

The image decoding device having the arrangement described above can decode the encoded data whose encoding amount is small as described above.

Further, it is preferable that the reduced set (i) includes the prediction parameter(s) selected by the first selection means, and (ii) is constituted by prediction parameters, the number of which is 2^n+1 (n is a natural number), and, for each of the plurality of prediction units belonging to the second group, the second selection means selects, from the reduced set, the prediction parameter designating how to generate a prediction image.

Generally, in a case where a group of elements, the number of which is 2^n+1 (n is a natural number), is encoded, compression efficiency is improved as compared with a case where a group of elements, the number of which is not 2^n+1, is encoded.

According to the arrangement described above, it is possible to decode the encoded data having such high compression efficiency.

Furthermore, a data structure of the present invention, for encoded data which is obtained in such a manner that (i) a difference between an input image and a prediction image is encoded for each of a plurality of prediction units, and, simultaneously, (ii) selection information is encoded, the selection information indicating which one of a plurality of prediction parameters, each designating how to generate a prediction image, is selected for each of the plurality of prediction units, includes: selection information which is referred to in an image decoding device for decoding the encoded data, so as to select, for each of the plurality of prediction units, a prediction parameter designating how to generate a prediction image, the prediction parameter being selected from a reduced set which includes at least a part of a prediction parameter(s) designating how to generate a prediction image(s) for a prediction unit(s) which (i) is located in the vicinity of a corresponding one of the plurality of prediction units and (ii) has been decoded.

Generally, a prediction parameter with respect to each of a plurality of prediction units has a correlation with a prediction parameter with respect to another prediction unit located in the vicinity of the prediction unit. Accordingly, the reduced set is highly likely to include an optimum prediction parameter in generation of a prediction image of each of the plurality of prediction units. Further, the reduced set is constituted by at least a part of the prediction parameter(s) with respect to the prediction unit(s) which is located in the vicinity of a corresponding one of the plurality of prediction units. Accordingly, the number of prediction parameters included in the reduced set is smaller than the number of prediction parameters included in a parameter set constituted by prediction parameters with respect to prediction units other than each of the plurality of prediction units.

Accordingly, the encoded data having the structure described above is the encoded data whose encoding amount is reduced without sacrificing encoding efficiency.

The present invention is not limited to the description of the embodiments above, but may be altered by a skilled person within the scope of the claims. An embodiment based on a proper combination of technical means disclosed in different embodiments is encompassed in the technical scope of the present invention.

The present invention is suitably applicable to an image encoding device for generating encoded data by encoding an image, and an image decoding device for decoding the encoded data generated by such an image encoding device.

  • 1: Video decoding device
  • 14: MB decoding section
  • 144: Prediction parameter decoding section
  • 41: Group determination section (classification means)
  • 42: Switch section
  • 43: First prediction parameter decoding section (first selection means)
  • 44: Reduced set derivation section
  • 45: Second prediction parameter decoding section (second selection means)
  • 2: Video encoding device
  • 24: MB encoding section
  • 242: Prediction parameter determination section
  • 51: Group determination section (classification means)
  • 52: Switch section
  • 53: First prediction parameter determination section (first selection means)
  • 54: Reduced set derivation section
  • 55: Second prediction parameter determination section (second selection means)
  • 243: Prediction parameter encoding section (prediction parameter encoding means)
  • 248: Prediction image generation section

Yamamoto, Tomoyuki

Executed on | Assignor | Assignee | Conveyance | Frame/Reel/Doc
May 23 2018 | Sharp Kabushiki Kaisha | VELOS MEDIA, LLC | ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS) | 0463100357
Jun 08 2018 | | VELOS MEDIA, LLC | (assignment on the face of the patent) |
Date Maintenance Fee Events
Jun 08 2018: BIG: Entity status set to Undiscounted (note the period is included in the code).
Jun 08 2021: M1551: Payment of Maintenance Fee, 4th Year, Large Entity.
Jun 08 2021: M1554: Surcharge for Late Payment, Large Entity.


Date Maintenance Schedule
Jun 30 2023: 4 years fee payment window open
Dec 30 2023: 6 months grace period start (w surcharge)
Jun 30 2024: patent expiry (for year 4)
Jun 30 2026: 2 years to revive unintentionally abandoned end. (for year 4)
Jun 30 2027: 8 years fee payment window open
Dec 30 2027: 6 months grace period start (w surcharge)
Jun 30 2028: patent expiry (for year 8)
Jun 30 2030: 2 years to revive unintentionally abandoned end. (for year 8)
Jun 30 2031: 12 years fee payment window open
Dec 30 2031: 6 months grace period start (w surcharge)
Jun 30 2032: patent expiry (for year 12)
Jun 30 2034: 2 years to revive unintentionally abandoned end. (for year 12)