Offset temporal motion vector predictor (TMVP)

Offset temporal motion vector predictor (TMVP)
RE50091

An improved method for temporal motion vector prediction for inter block HEVC is provided that relies on a block translational model. The method adds an offset to a temporal motion vector predictor (TMVP) to improve prediction accuracy. The method first designates a current prediction block as an area for motion compensation where all the pixels inside the prediction block perform identical translation temporally using motion vectors MVs. A coordinate offset is then derived for a current prediction block from the MVs of its spatially neighboring blocks. The offset TMVP is then defined for the current prediction block as the MV of an offset block which is in the geometrical location of the current prediction block coordinate plus the coordinate offset in a specified temporal reference picture. The offset TMVP is then used to code MVs.

PTO Wrapper PDF
Dossier Espace Google

Patent RE50091
Priority Oct 05 2016
Filed Sep 28 2021
Issued Aug 20 2024
Expiry Oct 05 2037
Inventors Wang, Limin
Assg.orig ARRIS ENTE…
Assg.curr ARRIS ENTE…
Entity Large
Referenced by 0
References 6
Maint.: currently ok

CLAIM FOR PRIORITY
BRIEF DESCRIPTION OF…
DETAILED DESCRIPTION

0. 7. A method of temporal motion vector prediction for inter block coding of video within a bitstream that relies on a block based translational model, the method comprising:

(a) designating a current prediction block as an area of said video for motion compensation where all the samples defined by the current prediction block perform identical translation temporally using at least one motion vector;

(b) deriving a coordinate offset for the current prediction block from at least one motion vector of a set of spatially neighboring blocks relative to the current prediction block;

(c) wherein a selection of said set of spatially neighboring blocks is based upon a merge candidate list for said current prediction block;

(d) wherein said merge candidate list includes a left neighboring block (L), and an above neighboring block (A), where said merge candidate list includes an ordered list of candidate neighboring blocks, where said left neighboring block (L) is directly left of and aligned with the current prediction block, where said above neighboring block (A) is directly above of and aligned with the current prediction block;

(e) wherein said left neighboring block (L), and said above neighboring block (A), are included within two of a first position, a second position, and a third position of the ordered list of candidate neighboring bocks of said merge candidate list, where said left neighboring block (L) is included within one of said first position, the second position, and third position, where said above neighboring block (A) is included within a different one of said first position, the second position, and third position than said left neighboring block (L);

(f) defining an offset temporal motion vector predictor (TMVP) for the current prediction block as the coordinate offset based upon the at least one motion vector of an offset block selected from the merge candidate list further based upon another coordinate offset in a specified temporal reference picture that is in a temporally different picture than said current prediction block; and

(g) decoding said current prediction block based upon said offset TMVP.

0. 1. A method of temporal motion vector prediction for inter block coding in High Efficiency video Coding (HEVC) that relies on a block based translational model, the method comprising:

designating a current prediction block as an area for motion compensation using HEVC where all the pixels inside the current prediction block perform identical translation temporally using either one or more motion vectors MVs;

deriving a coordinate offset for the current prediction block from the MVs of its spatially neighboring blocks;

defining an offset of a temporal motion vector predictor (TMVP) for the current prediction block as the MV of an offset block which is in the geometrical location of the current prediction block coordinate plus the coordinate offset in a specified temporal reference picture; and

using the offset TMVP to code MVs,

wherein the motion vectors of neighboring prediction blocks to the current prediction block are used to calculate the offset for the TMVP,

wherein the neighboring prediction blocks located in a first three positions in a merge candidate list for the current prediction block are used in calculating the offset for the TMVP, wherein the three neighboring prediction blocks comprise a left (L), an above (A), and an above-left (AL),

wherein with the three neighboring prediction blocks, the offset for the TMVP for the current prediction block is derived as median of motion vectors of these neighbors, as follows:

dx=median (Lx, ALx, Ax)

dy=median (Ly, ALy, Ay)

wherein Lx, ALx, Ax are the x component of motion vectors of left neighbor, Above-left neighbor, and Above neighbor, respectively, and

wherein Ly, ALy, Ay are the y component of left neighbor, Above-left neighbor, and Above neighbor, respectively.

0. 2. The method of claim 1, wherein in the step of defining the offset TMVP, the offset TMVP is further defined by the steps comprising:

assuming that the current prediction block is at the position of coordinate (x, y) in the current picture; and

adding a coordinate offset of (dx, dy) to the coordinate (x,y) to provide the offset TMVP (x′,y′) as

(x′, y′) =(x, y) +(dx, dy) =(x+dx, y+dy).

0. 3. The method of claim 1, wherein syntax elements are required in coding a bitstream to indicate values for the offset TMVP.

0. 4. The method of claim 1, wherein one offset is shared for multiple prediction blocks or coded with coarser granularity than the final fractional motion vector accuracy.

0. 5. The method of claim 1, wherein motion vectors of neighboring prediction blocks are normalized to compensate for the difference in temporal distances between references used among the neighboring prediction blocks.

0. 6. The method of claim 1, wherein the step of using MV prediction with the TMVP to code MVs, comprises:

providing a TMVP offset mode that can be turned on or off, wherein in the on mode the offset TMVP with the added offset is used to code MVs, and in the off mode a TMVP without the added offset is used to code MVs, wherein an explicit signal is provided for turn on and turn off of the offset TMVP mode using a flag provided at a CU, slice or sequence level.

CLAIM FOR PRIORITY

In an explicit approach for encoding using the TMVP offset according to embodiments of the present invention, syntax elements expressing the offset are used. One offset can be shared for multiple prediction blocks.

In an implicit approach for encoding using the TMVP offset according to embodiments of the present invention, motion vectors of neighboring prediction blocks to the current prediction block are used to calculate the offset for the TMVP. In one example, the neighboring prediction blocks located in a first three positions in a merge candidate list for the current prediction block are used in calculating the TMVP offset. In another example, the three neighboring prediction blocks, the left (L), the above (A), and the above-left (AL), are used for computing the TMVP offset. One possible example for calculating the TMVP offset for the current prediction block is to use a median of motion vectors of these neighbors, as follows:
dx=median(Lx,ALx,Ax)
dy=median(Ly,ALy,Ay)

wherein Lx, ALx, Ax are the x component of motion vectors of Left neighbor, Above-left neighbor, and Above neighbor, respectively, and

wherein Ly, ALy, Ay are the y component of Left neighbor, Above-left neighbor, and Above neighbor, respectively.

An added TMVP offset mode can be used in one embodiment of the present invention that can be turned on and off with either the explicit or implicit means. Implicit signaling for turn on and turn off of the offset TMVP mode can be based on coding information of neighboring blocks. Explicit signaling for turn on and turn off in the offset TMVP mode can be thru a flag at a CU, slice or sequence level.

BRIEF DESCRIPTION OF THE DRAWINGS

Further details of the present invention are explained with the help of the attached drawings in which:

FIG. 1 shows a simplified system for encoding and decoding video according to embodiments of the present invention;

FIG. 2 illustrates video pictures with reference unit blocks for motion estimation and compensation;

FIG. 3 is a flowchart showing steps of a method of adding an offset to a TMVP according to embodiments of the present invention;

FIG. 4 is a flowchart showing specifics of the steps to further define the offset TMVP;

FIG. 5 is a flowchart showing calculation of the TMVP offset using the neighboring prediction blocks located in a first three positions in a merge candidate list for the current prediction block; and

FIG. 6 illustrates the three neighboring prediction blocks used to calculate the TMVP offset in FIG. 5.

DETAILED DESCRIPTION

FIG. 1 shows a simplified system for encoding and decoding video according to embodiments of the present invention. System includes an encoder 102 and a decoder 104. Encoder 102 and decoder 104 may use a video coding standard to encode and decode video, such as HEVC. Specifically, encoder 102 and decoder 104 may use syntax elements from the HEVC range extension. Also, other elements of encoder 102 and decoder 104 may be appreciated.

Encoder 102 and decoder 104 perform temporal prediction through motion estimation and motion compensation. Motion estimation is a process of determining a motion vector (MV) for a current unit of video. For example, the motion estimation process searches for a best match prediction for a current unit block of video (e.g., a prediction block) over reference pictures. The best match prediction is described by the motion vector and associated reference picture ID. Also, a reference unit in a B picture may have up to two motion vectors that point to a previous reference unit in a previous picture and a subsequent reference unit in a subsequent reference picture in the picture order. Motion compensation is then performed by subtracting a reference unit pointed to by the motion vector from the current unit of video. In the case of bi-prediction, the two motion vectors point to two reference units, which can be combined to form a combined bi-directional reference unit.

To perform motion estimation and compensation, encoder 102 and decoder 104 include motion estimation and compensation blocks 104-1 and 104-2, respectively. For bi-directional prediction, the motion estimation and compensation blocks 104-1 and 104-2 can use a combined bi-directional reference unit in the motion compensation process for the current unit. Syntax elements are further used in the motion prediction process.

For the encoder 102 and decoder 104 of FIG. 1, embodiments of the present invention contemplate that software to enable them to perform functions described to follow for the present invention is provided in a memory. The encoder 102 and decoder 104 are further contemplated to include one or more processors that function in response to executable code stored in the memory to cause the processor to perform the functions described.

FIG. 2 illustrates video pictures with reference unit blocks for motion estimation and compensation. The video includes a number of pictures 200-1-200-5. A current picture is shown at 200-3 and includes a current unit of video 202-1. Current unit 202-1 may be bi-predicted using reference unit blocks from reference pictures in other pictures, such as a previous picture 200-1 in the picture order and a subsequent picture 200-5 in the picture order. Picture 200-1 includes a reference unit 202-2 and picture 200-5 includes a reference unit block 202-3, both of which can be used to predict current unit block 202-1.

Once the current picture is established and the reference unit blocks are determined, motion estimation and compensation block 104-1 can determine motion vectors that represent the location of reference unit blocks 202-2 and 202-3 with respect to current unit block 202-1. Then, motion estimation and compensation block 104-1 calculates a difference between the combined reference unit block and the current unit block 202-1. Encoder 102 outputs the motion vectors in an encoded bitstream that is sent to decoder 104.

Decoder 104 receives the encoded bitstream and can reconstruct the pictures of the video. Decoder 104 may reconstruct reference unit blocks 202-2 and 202-3 from the encoded bitstream prior to decoding current unit block 202-1. Also, decoder 104 decodes the motion vectors for current unit block 202-1. Then, in decoder 104, motion estimation and compensation block 104-2 are used to reconstruct the current unit block 202-1. The motion estimation and compensation block 104-2 uses the motion vectors to locate reconstructed reference unit blocks 202-2 and 202-3 and reconstruct the current unit block 202-1.

Motion vector prediction is used in motion vector coding process to exploit correlation between the coding motion vector and its selected predictor. Due to the characteristics of natural video, object generally moves in a smooth, linear trajectory from frame to frame. This behavior makes the motion vector of the temporally collocated block as a powerful motion vector predictor for a current block, and it is hence used in HEVC motion vector coding.

In HEVC, for a current prediction block in a current picture, the motion vector of its temporal collocated prediction block, which is in the same geometrical location in a specified temporal reference picture as the current prediction block in the current picture, is defined as the temporal motion vector predictor (TMVP) for the current prediction block. Specifically, the collocated block has the same spatial coordinate (x,y) in the reference picture as the current prediction block (x,y) in the current picture. The collocated position can however be suboptimal when there is significant object motion between the frames. In such case, the collocated position may represent a different object and its motion vector is not a useful TMVP.

Accordingly, embodiments of the present invention introduce a way to improve TMVP effectiveness, especially when there are a lot of movements between the frames. Instead of using the same coordinate in the reference picture as the coding prediction block in the current picture, embodiments of the present invention add a coordinate offset to the coordinate for the TMVP location.

FIG. 3 is a flowchart showing steps of a method of adding an offset to a TMVP according to embodiments of the present invention. The method uses a block translational model and begins in step 300 with designating a current prediction block as an area for motion compensation where all the pixels inside the prediction block perform identical translation temporally using either one or more motion vectors (MVs). In the next step 302, a coordinate offset is derived for the current prediction block from the MVs of its spatially neighboring blocks. In step 304, the offset TMVP is defined for the current prediction block as the MV of an offset block which is in the geometrical location of the current prediction block coordinate plus a coordinate offset computed in step 302 in a specified temporal reference picture. Finally, in step 306 the MV prediction with the offset TMVP is used to code MVs.

FIG. 4 is a flowchart showing specifics of the steps to further define the offset TMVP. First in step 400, an assumption is made that the current prediction block is at the position of coordinate (x, y) in the current picture. Next, in step 402 a coordinate offset of (dx, dy) is added to the coordinate (x,y) to define the offset TMVP (x′, y′) as follows:
(x′,y′)=(x,y)+(dx,dy)=(x+dx,y+dy).

The TMVP offset (dx, dy) can be determined explicitly or implicitly. Details of the two approaches are described to follow.

For explicit approach, syntax elements in coding bitstream can be used to indicate TMVP offset values. To reduce the overhead bits, one offset may be shared for multiple prediction blocks and coded with coarser granularity than the final fractional motion vector accuracy.

For implicit approach, a motion vector derivation method is specified so that decoder can repeat the same process and be able to regenerate the same TMVP offset. In this simplified approach, the motion vectors of neighboring prediction blocks are used to calculate the offset for TMVP. Motion vectors of neighboring prediction blocks in this approach are normalized to compensate for the difference in temporal distances between references used among these prediction blocks.

FIG. 5 is a flowchart showing calculation of the coordinate offset using the neighboring prediction blocks. FIG. 6 illustrates the three neighboring prediction blocks used to calculate the coordinate offset for the current block (C) in FIG. 5. The prediction blocks in FIG. 6 include the above left (AL), above (A), above right (AR), left (L), right (R), below left (BL), below (B), and below right (AR).

In a first step 500 of FIG. 5, the three neighboring prediction blocks are used for the current prediction block to calculate the offset TMVP. The three neighboring prediction blocks include the left (L) 601, the above (A) 602, and the above-left (AL) 603. Next, in step 502, the current prediction block is derived as a median of motion vectors of these neighbors, as follows:
dx=median(Lx,ALx,Ax)
dy=median(Ly,ALy,Ay)

wherein Lx, ALx, Ax are the x component of motion vectors of Left neighbor, Above-left neighbor, and Above neighbor, respectively, and

wherein Ly, ALy, Ay are the y component of Left neighbor, Above-left neighbor, and Above neighbor, respectively.

Although the present invention has been described above with particularity, this was merely to teach one of ordinary skill in the art how to make and use the invention. Many additional modifications will fall within the scope of the invention as that scope is defined by the following claims.

INVENTORS:

Wang, Limin, Panusopone, Krit, Hong, Seungwook

THIS PATENT IS REFERENCED BY THESE PATENTS:

Patent

Priority

Assignee

Title

THIS PATENT REFERENCES THESE PATENTS:

Patent	Priority	Assignee	Title
10200709,	Mar 16 2012	Qualcomm Incorporated	High-level syntax extensions for high efficiency video coding
20130003851,
20130195182,
20170347096,
20180084260,
20180184110,

ASSIGNMENT RECORDS Assignment records on the USPTO

/////////////////////

Executed on	Assignor	Assignee	Conveyance	Frame	Reel	Doc
Nov 13 2017	HONG, SEONGWOOK	ARRIS ENTERPRISES LLC	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	058709	0760	pdf
Nov 13 2017	WANG, LIMIN	ARRIS ENTERPRISES LLC	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	058709	0760	pdf
Nov 14 2017	PANUSOPONE, KRIT	ARRIS ENTERPRISES LLC	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	058709	0760	pdf
Sep 28 2021		ARRIS ENTERPRISES LLC	(assignment on the face of the patent)
Mar 07 2022	COMMSCOPE, INC OF NORTH CAROLINA	JPMORGAN CHASE BANK, N A	ABL SECURITY AGREEMENT	059350	0743	pdf
Mar 07 2022	CommScope Technologies LLC	JPMORGAN CHASE BANK, N A	ABL SECURITY AGREEMENT	059350	0743	pdf
Mar 07 2022	ARRIS ENTERPRISES LLC	JPMORGAN CHASE BANK, N A	ABL SECURITY AGREEMENT	059350	0743	pdf
Mar 07 2022	COMMSCOPE, INC OF NORTH CAROLINA	WILMINGTON TRUST	SECURITY INTEREST SEE DOCUMENT FOR DETAILS	059710	0506	pdf
Mar 07 2022	CommScope Technologies LLC	WILMINGTON TRUST	SECURITY INTEREST SEE DOCUMENT FOR DETAILS	059710	0506	pdf
Mar 07 2022	ARRIS ENTERPRISES LLC	WILMINGTON TRUST	SECURITY INTEREST SEE DOCUMENT FOR DETAILS	059710	0506	pdf
Mar 07 2022	COMMSCOPE, INC OF NORTH CAROLINA	JPMORGAN CHASE BANK, N A	TERM LOAN SECURITY AGREEMENT	059350	0921	pdf
Mar 07 2022	CommScope Technologies LLC	JPMORGAN CHASE BANK, N A	TERM LOAN SECURITY AGREEMENT	059350	0921	pdf
Mar 07 2022	ARRIS ENTERPRISES LLC	JPMORGAN CHASE BANK, N A	TERM LOAN SECURITY AGREEMENT	059350	0921	pdf
Dec 17 2024	OUTDOOR WIRELESS NETWORKS LLC	APOLLO ADMINISTRATIVE AGENCY LLC	SECURITY INTEREST SEE DOCUMENT FOR DETAILS	069889	0114	pdf
Dec 17 2024	COMMSCOPE INC , OF NORTH CAROLINA	APOLLO ADMINISTRATIVE AGENCY LLC	SECURITY INTEREST SEE DOCUMENT FOR DETAILS	069889	0114	pdf
Dec 17 2024	CommScope Technologies LLC	APOLLO ADMINISTRATIVE AGENCY LLC	SECURITY INTEREST SEE DOCUMENT FOR DETAILS	069889	0114	pdf
Dec 17 2024	ARRIS ENTERPRISES LLC	APOLLO ADMINISTRATIVE AGENCY LLC	SECURITY INTEREST SEE DOCUMENT FOR DETAILS	069889	0114	pdf
Dec 17 2024	JPMORGAN CHASE BANK, N A , AS COLLATERAL AGENT	CommScope Technologies LLC	RELEASE OF SECURITY INTEREST AT REEL FRAME 059350 0921	069743	0704	pdf
Dec 17 2024	JPMORGAN CHASE BANK, N A , AS COLLATERAL AGENT	COMMSCOPE, INC OF NORTH CAROLINA	RELEASE OF SECURITY INTEREST AT REEL FRAME 059350 0921	069743	0704	pdf
Dec 17 2024	JPMORGAN CHASE BANK, N A , AS COLLATERAL AGENT	ARRIS ENTERPRISES LLC F K A ARRIS ENTERPRISES, INC	RELEASE OF SECURITY INTEREST AT REEL FRAME 059350 0921	069743	0704	pdf
Dec 17 2024	RUCKUS IP HOLDINGS LLC	APOLLO ADMINISTRATIVE AGENCY LLC	SECURITY INTEREST SEE DOCUMENT FOR DETAILS	069889	0114	pdf

MAINTENANCE FEES AND DATES: Maintenance records on the USPTO

Date	Maintenance Fee Events
Sep 28 2021	BIG: Entity status set to Undiscounted (note the period is included in the code).

Date	Maintenance Schedule
Aug 20 2027	4 years fee payment window open
Feb 20 2028	6 months grace period start (w surcharge)
Aug 20 2028	patent expiry (for year 4)
Aug 20 2030	2 years to revive unintentionally abandoned end. (for year 4)
Aug 20 2031	8 years fee payment window open
Feb 20 2032	6 months grace period start (w surcharge)
Aug 20 2032	patent expiry (for year 8)
Aug 20 2034	2 years to revive unintentionally abandoned end. (for year 8)
Aug 20 2035	12 years fee payment window open
Feb 20 2036	6 months grace period start (w surcharge)
Aug 20 2036	patent expiry (for year 12)
Aug 20 2038	2 years to revive unintentionally abandoned end. (for year 12)