System and method for three-dimensional video coding

System and method for three-dimensional video coding
US7650036

systems and methods are provided for receiving and encoding 3d video. The receiving method comprises: accepting a bitstream with a current video frame encoded with two interlaced fields, in a MPEG2, MPEG4, or H.264 standard; decoding a current frame top field; decoding a current frame bottom field; and, presenting the decoded top and bottom fields as a 3d frame image. In some aspects, the method presents the decoded top and bottom fields as a stereo-view image. In other aspects, the method accepts 2D selection commands in response to a trigger such as receiving a supplemental enhancement information (SEI) message, an analysis of display capabilities, manual selection, or receiver system configuration. Then, only one of the current frame interlaced fields is decoded, and a 2D frame image is presented.

PTO Wrapper PDF
Dossier Espace Google

Patent 7650036
Priority Oct 16 2003
Filed Apr 02 2004
Issued Jan 19 2010
Expiry Dec 07 2026 Extension 979 days
Inventors Sun, Shijun
Assg.orig Sharp Labo…
Assg.curr Sharp Kabu…
Entity Large
Referenced by 66
References 23
Maint.: EXPIRED

RELATED APPLICATIONS
BACKGROUND OF THE IN…
SUMMARY OF THE INVEN…
BRIEF DESCRIPTION OF…
DETAILED DESCRIPTION…
Functional Descripti…

26. A three-dimensional (3d) video encoding system, the system comprising:

an encoder having an input to accept a current 3d video image, including a first view of the image and a second, 3d, view of the image, the encoder encoding the first view as a frame top field and the second view as the frame bottom field, interlaced in a single first video frame, and the encoder having a channel-connected output to supply a first video frame bitstream; and,

wherein the encoder transmits a supplemental enhancement information (SEI) 3d option message with the first video frame, to trigger decoding selected from a group consisting of single field two-dimensional (2D) decoding and top and bottom field 3d decoding in response to receiver capabilities.

18. A three-dimensional (3d) video receiver system, the system comprising:

a decoder having an input connected to a channel to accept a bitstream with a single first video frame encoded with two interlaced fields and an output to supply a top field and a bottom field, both decoded from the first video frame; and,

a display having an input to accept the decoded fields, the display visually presenting the decoded top and bottom fields as a 3d frame image;

wherein the decoder receives a supplemental enhancement information (SEI) 3d content message with the first video frame, analyzes display capabilities, and, if non-3d display capabilities are detected, decodes only one of the first frame interlaced fields in response to the 3d option SEI message; and,

wherein the display visually presents a two-dimensional (2D) image.

35. A three-dimensional (3d) video decoder, the decoder comprising:

an input connected to a channel to accept a single first video frame bitstream encoded with two interlaced fields and a supplemental enhancement information (SEI) 3d content message with the first video frame;

a two-dimensional (2D) decision unit to analyze the SEI 3d content message, and if non-3d display capabilities are detected in an associated display, decoding only one of the first frame interlaced fields in response to the 3d option SEI message;

a 3d decision unit to analyze the SEI 3d content message, and if 3d display capabilities are detected in the associated display, decoding first frame interlaced top and bottom fields in response to the 3d option SEI message; and,

an output to supply at least one of the first video frame top field and the first video frame bottom field.

9. A method for encoding three-dimensional (3d) video in a transmitter system, the method comprising:

in a transmitter encoder:

accepting an electromagnetic waveform representing a 3d video image, including a first view of the image and a second view of the image;

encoding the first view as a top field in a single first video frame;

encoding the second view as a bottom field in the first video frame; and,

transmitting an electromagnetic waveform into a channel representing a first video frame bitstream having the top field interlaced with the bottom field in the single first video frame, and a supplemental enhancement information (SEI) 3d option message with the first video frame to trigger decoding selected from a group consisting of single field two-dimensional (2D) decoding, and top and bottom field 3d decoding in a receiver decoder, depending on receiver capabilities.

1. A method for receiving three-dimensional (3d) video in a receiver system, the method comprising:

in a receiver decoder:

accepting an electromagnetic waveform representing a bitstream with two interlaced fields, both encoded in a single first video frame, with a supplemental enhancement information (SEI) 3d content message;

analyzing display capabilities in response to being triggered by the SEI 3d content message;

if 3d display abilities are detected, decoding a first frame top field from the first video frame and decoding a first frame bottom field from the first video frame;

if non-3d display capabilities are detected, decoding only one of the first video frame interlaced fields; and,

a display presenting an electromagnetic waveform representing the decoded top and bottom fields as a 3d frame image if 3d capable, and presenting a two-dimensional (2D) frame image if non-3d capable.

2. The method of claim 1 wherein accepting a bitstream with first video frame encoded with two interlaced fields includes accepting the bitstream in a standard selected from the group including Motion Pictures Expert group-2 (MPEG2), MPEG4, and ITU-T H.264 standards.

3. The method of claim 1 wherein presenting the decoded top and bottom fields as a 3d frame image includes presenting the decoded top and bottom fields as a stereo-view image.

4. The method of claim 1 further comprising:

prior to accepting the first video frame, accepting a first encoded video frame;

deriving a predictive first frame top field;

deriving a predictive first frame bottom field;

wherein decoding the first video frame top field includes decoding the first video frame top field in response to the predictive first frame top field; and,

wherein decoding the first video frame bottom field includes decoding the first video frame bottom field in response to the predictive first frame bottom field.

5. The method of claim 1 further comprising:

prior to accepting the first video frame, accepting a first encoded video frame;

deriving a predictive first frame first field;

wherein decoding the first video frame top field includes decoding the first video frame top field in response to the predictive first frame first field; and,

wherein decoding the first video frame bottom field includes decoding the first video frame bottom field in response to the predictive first frame first field.

6. The method of claim 5 wherein deriving a predictive first frame first field includes deriving a predictive first frame top field.

7. The method of claim 5 wherein deriving a predictive first frame first field includes deriving a predictive first frame bottom field.

8. The method of claim 1 further comprising:

simultaneous with the presentation of the 3d image, presenting a 2D image in response to using one of the decoded first frame interlaced fields.

10. The method of claim 9 wherein transmitting the first video frame bitstream having the top field interlaced with the bottom field includes transmitting the bitstream in a standard selected from the group including Motion Pictures Expert group-2 (MPEG2), MPEG4, and ITU-T H.264 standards.

11. The method of claim 9 wherein accepting a current 3d video image, including a first view of the image and a second, 3d, view of the image includes accepting a first and second view of a stereo image.

12. The method of claim 9 further comprising:

accepting a 2D command responsive to a trigger selected from the group including an analysis of receiver capabilities and the channel bandwidth; and,

transmitting the 2D command to a receiver.

13. The method of claim 12 further comprising:

transmitting only one of the fields from the first video view frame.

14. The method of claim 9 further comprising: tuviste

prior to accepting the current video image, accepting a first video image;

encoding a first image top field;

encoding a first image bottom field;

wherein encoding the current frame top field includes encoding the current frame top field in response to the first image top field; and,

wherein encoding the current frame bottom field includes encoding the first video frame bottom field in response to the first frame bottom field.

15. The method of claim 9 further comprising:

prior to accepting the current image, accepting a first video image;

encoding a first image first field;

wherein encoding the current frame top field includes encoding the current frame top field in response to the first image first field; and,

wherein encoding the current frame bottom field includes encoding the current frame bottom field in response to the first image first field.

16. The method of claim 15 wherein encoding a first image first field includes encoding a first image top field.

17. The method of claim 15 wherein encoding a first image first field includes encoding a first image bottom field.

19. The system of claim 18 wherein the decoder accepts the bitstream in a standard selected from the group including Motion Pictures Expert group-2 (MPEG2), MPEG4, and ITU-T H.264 standards.

20. The system of claim 18 wherein the display visually presents the decoded top and bottom fields as a stereo-view image.

21. The system of claim 18 wherein the decoder, prior to accepting the current frame, accepts a first encoded video frame, derives a predictive first frame top field, derives a predictive first frame bottom field, decodes the current frame top field in response to the predictive first frame top field, and decodes the current frame bottom field in response to the predictive first frame bottom field.

22. The system of claim 18 wherein the decoder, prior to accepting the first video frame, accepts a first encoded video frame, derives a predictive first frame first field, decodes the first video frame top field in response to the predictive first frame first field, and decodes the first video frame bottom field in response to the predictive first frame first field.

23. The system of claim 22 wherein the decoder derives a predictive first frame top field.

24. The system of claim 22 wherein the decoder derives a predictive first frame bottom field.

25. The system of claim 18 wherein the display, as a selected alternative to the presentation of the 3d image, presents a 2D image in response to using only one of the decoded first video frame interlaced fields.

27. The system of claim 26 wherein the encoder transmits the bitstream in a standard selected from the group including Motion Pictures Expert group-2 (MPEG2), MPEG4, and ITU-T H.264 standards.

28. The system of claim 26 wherein the encoder accepts a first and second view of a stereo image.

29. The system of claim 26 wherein the encoder transmits a 2D command responsive to a trigger selected from the group including an analysis of connected receiver capabilities and the channel bandwidth.

30. The system of claim 29 wherein the encoder encodes and transmits only one of the fields from the first video frame.

31. The system of claim 26 wherein the encoder, prior to accepting the current video image, accepts a first video image, encodes a first image top field, encodes a first image bottom field, encodes the current frame top field in response to the first image top field, and encodes the first video frame bottom field in response to the first image bottom field.

32. The system of claim 26 wherein the encoder, prior to accepting the current image, accepts a first video image, encodes a first image first field, encodes the first video frame top field in response to the first image first field, and encodes the first video frame bottom field in response to the first image first field.

33. The system of claim 32 wherein the first image first field is a first image top field.

34. The system of claim 32 wherein the first image first field is a first image bottom field.

RELATED APPLICATIONS

This application claims the benefit of a provisional application entitled, SYSTEMS AND METHODS FOR STEREO VIDEO CODING, invented by Lei et al., Ser. No. 60,512,155, filed Oct. 16, 2003.

This application claims the benefit of a provisional application entitled, SYSTEMS AND METHODS FOR STEREO VIDEO CODING, invented by Lei et al., Ser. No. 60,519,482, filed Nov. 13, 2003.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention generally relates to video compression and, more particularly, to a system and method of encoding/decoding compressed video for three-dimensional and stereo viewing.

2. Description of the Related Art

Conventional video compression techniques typically handle three-dimensional (3D), or stereo-view video, in units of a frame. The most straightforward method is to code two views separately, as independent video sequences. This straightforward method, however, suffers from poor coding efficiency. It also has higher complexity because it needs to encode/decoder, multiplex/demultiplex, and synchronize two bitstreams. To reduce the complexity of bitstream handling, synchronized frames from each view can also be grouped together to form a composite frame, and then coded into one single bitstream. This composite-frame method still suffers from poor coding efficiency. It also loses a view-scalable functionality, i.e., decoder can choose to decode and display only one view.

Alternately, as noted in U.S. patent application 20020009137, one view can be coded into a base layer bitstream, and the other view into an enhancement layer. This layer approach not only has a better coding efficiency, but it also preserves the view-scalable functionality. However, this method still has higher complexity due to its needs to handle multiple bitstreams (base-layer and enhanced-layer bitstreams).

It would be advantageous if compressed 3D video could be communicated with greater efficiency.

It would be advantageous if only one view of a compressed 3D or stereo-view could be decoded to permit viewing on legacy 2D displays.

SUMMARY OF THE INVENTION

The present invention treats 3D video frames as interlaced materials. Therefore, a 3D view can be coded using existing interlace-coding methods, such as those in H.264, which enable better compression. Further, the invention supports a scalable coding (two-dimensional view) feature with minimal restrictions on the encoder side. The scalable decoding option can be signaled in a simple SEI message, for example.

Accordingly, a method is provided for receiving 3D video. The method comprises: accepting a bitstream with a current video frame encoded with two interlaced fields, in a Motion Pictures Expert Group-2 (MPEG2), MPEG4, or ITU-T H.264 standard; decoding a current frame top field; decoding a current frame bottom field; and, presenting the decoded top and bottom fields as a 3D frame image. In some aspects, the method presents the decoded top and bottom fields as a stereo-view image.

In some aspects, the method accepts 2D selection commands in response to a trigger such as receiving a supplemental enhancement information (SEI) message. Other triggers include an analysis of display capabilities, manual selection, or receiver system configuration. Then, only one of the current frame interlaced fields is decoded, and a 2D frame image is presented.

In one aspect of the method, a first encoded video frame is accepted prior to accepting the current frame. Then, the method: derives a predictive first frame top field; and, derives a predictive first frame bottom field. Then, the current frame top and bottom fields are decoded in response to the predictive first frame top field and predictive first frame bottom field, respectively.

Likewise, a method is providing for encoding 3D video, comprising: accepting a current 3D video image, including a first view of the image and a second, 3D, view of the image; encoding the first view as a frame top field; encoding the second view as the frame bottom field; and, transmitting a bitstream with a current video frame, having the top field interlaced with the bottom field, into a channel.

Additional details of the above-described methods, and 3D video encoder and receiver systems are provided below.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic block diagram of the present invention 3D video receiver system.

FIG. 2 is a schematic block diagram of the present invention 3D video encoding system.

FIG. 3 illustrates the present invention 3D view field interlacing process.

FIG. 4 is a graph illustrating a comparison of coding performance.

FIG. 5 is a flowchart illustrating the present invention method for receiving 3D video.

FIG. 6 is a flowchart illustrating the present invention method for encoding 3D video.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

FIG. 1 is a schematic block diagram of the present invention 3D video receiver system. The system 100 comprises a decoder 102 having an input connected to a channel on line 104 to accept a bitstream with a current video frame encoded with two interlaced fields. For example, line 104 may be connected to the Internet, a satellite receiver, or a digital cable network. The decoder 102 accepts the bitstream in a standard such as MPEG2, MPEG4, or ITU-T H.264. The decoder 102 has an output on line 106 to supply a decoded current frame top field and current frame bottom field. A display 108 has an input to accept the decoded fields on line 106. The display 108 visually presents the decoded top and bottom fields as a 3D frame image. For example, the display 108 can be a high-definition TV. In other aspects of the system, the display 108 visually presents the decoded top and bottom fields as a stereo-view image.

Generally, the display 108 may visually presents a two-dimensional (2D) image in response to using only one of the decoded current frame interlaced fields. Also, as a selected alternative to the presentation of the 3D image, the display 108 may present a 2D image in response to using only one of the decoded current frame interlaced fields. For example, a user may manually select to view a 2D image, even if a 3D image is available.

In other aspects, the decoder 102 may analyze the display capabilities and decode only one of the current frame interlaced fields, if non-3D display capabilities are detected. For example, the decoder may detect that display 108 is a legacy television. In this circumstance, the display 108 visually presents a two-dimensional (2D) image.

In some aspects, the decoder 102 receives a supplemental enhancement information (SEI) 3D content message with the current video frame. There are many types of conventional SEI messages. The 3D content SEI message is a message acts as a signal that the referenced frame(s) include 3D content organized as top and bottom fields in a frame. The 3D content SEI messages may trigger the decoder 102 to analyze display capabilities. This analysis may be a result of a query directed to display 108, or a result of accessing pre-configured information in memory concerning display capabilities. If non-3D display capabilities are detected, the decoder may elect to decode only one of the current frame interlaced fields in response to the 3D option SEI message. Since only one field is supplied by the decoder 102, the display 108 visually presents a two-dimensional (2D) image. Note, the decoder 102 may still provide both fields of a 3D view to a display 108, even if the display is not enabled to present a 3D image.

In some aspects, the decoder 102 includes a 2D decision unit 110 to supply 2D selection commands on line 112. The decoder 102 decodes only one of the current frame interlaced fields in response to the 2D selection commands. In response, the display 108 visually presents a 2D image. The decoder 2D decision unit 110 supplies 2D selection commands in response to a trigger such as receiving an SEI message on line 104. The trigger may be an analysis of display capabilities. For example, capabilities may be explored in communications with the display on line 106. In other aspects, the trigger may be responsive to a manual selection made by the user and received on line 114. In another aspect, the trigger can be responsive to the receiver system configuration stored in memory 116.

The organization of top and bottom fields as complementary 3D views is compatible with predictive encoding and decoding processes. With respect to MPEG standards, intra-coded frames (I-frames) are used to carry information that can be used as a foundation in a series of subsequent frames. With respect to H.264, the predictive frame is called an independent decoder refresh (IDR) picture. In some aspects, the decoder accepts a first encoded video frame prior to accepting the current frame. The decoder 102 derives a predictive first frame top field and a predictive first frame bottom field from the first frame. Then, the decoder 102 decodes the current frame top field in response to the predictive first frame top field. Likewise, the current frame bottom field is decoded in response to the predictive first frame bottom field.

Alternately, the decoder 102 derives a predictive first frame first field from the first frame. The first field may be either a top field or a bottom field. The decoder 102 decodes the current frame top field in response to the predictive first frame first field, and decodes the current frame bottom field in response to the predictive first frame first field.

FIG. 2 is a schematic block diagram of the present invention 3D video encoding system. The system 200 comprises an encoder 202 having an input on line 204 to accept a current 3D video image, including a first view of the image and a second, 3D, view of the image. In some aspects, the encoder 202 accepts a stereo image. The encoder 202 encodes the first view as a frame top field and encodes the second view as the frame bottom field. The encoder 202 has a channel-connected output on line 104 to supply a bitstream with a current video frame, having the top field interlaced with the bottom field. The encoder 202 transmits the bitstream in a standard such as MPEG2, MPEG4, or H.264.

In one aspect of the system, the encoder 202 transmits a 2D command responsive to a trigger such as an analysis of connected receiver capabilities or the channel bandwidth. The analysis of receiver capabilities may occur as a result of accessing a memory 212 holding a record of receiver capabilities. For example, record may show that a connected receiver, or group of receivers, lacks 3D display capability. The analysis of channel bandwidth may be made as a result of accessing the memory 212, or a result of receiving a real-time measurement of bandwidth. In some circumstances the bandwidth may be small enough that the transmission of both fields is impractical. For these, and potentially other reasons, the encoder 202 may elect to encode and transmit only one of the fields from the current view frame.

In another aspect, the encoder 202 may transmit an SEI 3D optional message, to signal 3D views available, to describe how 3D views are mapped into interlaced fields, and to describe dependency of each field.

In another aspect, the encoder 202 may transmit an SEI 3D option message with the current video frame, to trigger optional single field two-dimensional (2D) decoding. For example, if 2D receiver capabilities are discovered, the encoder 202 may transmit the SEI 3D option message, along with only one of fields.

Prior to accepting the current video image, the encoder 202 may accept a first video image, and encode a first image top field, as well as a first image bottom field. For example, the first image top and bottom fields may be used to form either an I-frame (MPEG) or an IDR picture (H.264). Then, the encoder 204 encodes the current frame top field in response to the first image top field, and encodes the current frame bottom field in response to the first frame bottom field.

Alternately, a single field can be used to generate subsequent top and bottom fields. That is, the encoder 204, prior to accepting the current image, may accept a first video image and encode a first image first field. The first image first field may be either a top or bottom field. Then, the encoder 204 encodes the current frame top field in response to the first image first field, and encodes the current frame bottom field in response to the first image first field.

Functional Description

FIG. 3 illustrates the present invention 3D view field interlacing process. Option 1 shows the left and right (stereo or 3D) views. Option 2 shows the views as a composite video frame.

Instead of treating stereo-view video frames as separate frames or a composite video frame, the present invention considers the sequence as interlaced materials. For example, as illustrated in option 3, the left view picture is the top field and the right view is the bottom field. It is straightforward to code the interlaced video frames using existing interlaced coding methods in different video coding standards, for example, but not limited to, MPEG2, MPEG4, and H.264. The use of these standards enables better compression and bitstream handling.

Using the interlaced coding methods of the above-mentioned video coding standards, a scalable decoding option can be supported with minimal restrictions on the encoder side. The scalable option means that at least one view (or field) can be decoded independently, without referring to bitstream of the other view (or field). This option permits decoder and encoder devices to be used with legacy devices that do not support 3D display functionality. To enable this scalable coding option, all pictures are coded in field-picture mode. At least one field (either top or bottom) is self-contained; and for a self-contained field, the corresponding field pictures can only use previously coded field pictures with the same parity as reference for motion compensation.

Here is a very brief summary of relevant H.264 coding tools. H.264 is the latest international video coding standard. Relative to prior video coding methods, some new inter-frame prediction options have been designed to enhance the prediction flexibility and accuracy. H.264 permits multiple reference pictures to be used for inter prediction. That is, more than one prior coded picture can be used as references for inter prediction. To allow for better handling of interlaced materials, H.264 permits a video frame to be coded as either a frame picture, or two field pictures. The choice between these two options is referred to as picture-level adaptive-frame-field (PAFF) coding. This idea can be extended to the macroblock level, to enable the option of Macroblock-level adaptive frame-field (MBAFF) coding.

FIG. 4 is a graph illustrating a comparison of coding performance. Experiments were conducted to evaluate the coding performance using the H.264 verification model JM7.3 software. The encoding parameters are: 5 reference frames, +/− 32 pixel motion search range, 15-frame group of pictures (GOP) with IPPP (an Intra-frame followed by 14 P pictures), RD-optimization (a JM software encoding option), CAVLC (context adaptive variable length coding—an entropy option in the H.264 standard), and a fixed quantization parameter (QP) of 28/32/36. A 300-frame stereo sequence was coded in the following three settings:

- 1. Full frame coding (as option 2 in FIG. 3);
- 2. PAFF coding (as option 3 in FIG. 3, with no restriction on reference pictures); and
- 3. Scalable field coding (as option 3 in FIG. 3, with restriction on reference pictures).

The coding performances are shown in FIG. 4. The scalable field coding (setting #3) and PAFF coding (setting #3) essentially overlap. Both setting #2 and setting #3 have much better coding performance than setting #1. The difference is roughly 0.8 dB at relatively high quality side. The higher the peak signal-to-noise ratio (PSNR) values, the better the coding performance. So, for example, setting #2 is better than setting #1 (by 0.8 dB at the high bitrate side). Compared to setting #2, setting #3 has very little overhead, less than 0.1 dB.

FIG. 5 is a flowchart illustrating the present invention method for receiving 3D video. Although the method is depicted as a sequence of numbered steps for clarity, no order should be inferred from the numbering unless explicitly stated. It should be understood that some of these steps may be skipped, performed in parallel, or performed without the requirement of maintaining a strict order of sequence. The method starts at Step 500.

Step 502 accepts a bitstream with a current video frame encoded with two interlaced fields. For example, the bitstream is a standard such as MPEG2, MPEG4, or H.264. Step 504 decodes a current frame top field. Step 506 decodes a current frame bottom field. Step 508 presents the decoded top and bottom fields as a 3D frame image. In some aspects, Step 508 presents the decoded top and bottom fields as a stereo-view image. In one aspect of the method, Step 503 accepts a 2D selection command. For example, the 2D selection command may be accepted in response to a trigger such as a supplemental enhancement information (SEI) message, an analysis of display capabilities, manual selection, or receiver system configuration. Then, only one of the current frame interlaced fields is used in response to the 2D selection commands. That is, either Step 504 or Step 506 is preformed. As shown, Step 504 is performed (Step 506 is bypassed). Step 510 presents a 2D frame image. Alternately, both fields may be decoded but a 2D frame image is presented in Step 510 in response to using only one of the decoded current frame interlaced fields. In one aspect of the method, simultaneous with the presentation of the 3D image (Step 508), Step 510 presents a 2D image in response to using one of the decoded current frame interlaced fields. The simultaneous presentation of 2D and 3D images may represent that either the 2D or 3D view may be selected.

In another aspect of the method, Step 503 is organized into substeps, not shown. Step 503a receives a (SEI) 3D content message with the current video frame. Step 503b analyzes display capabilities. If non-3D display capabilities are detected, only one of the current frame interlaced fields is decoded. That is, either Step 504 or Step 506 is performed. Then, Step 510 presents a 2D frame image.

In another aspect, Step 503 accepts an SEI 3D optional message, to signal 3D views available, to describe how 3D views are mapped into interlaced fields, and to describe the dependency of each field. In another aspect, Step 501a accepts a first encoded video frame prior to accepting the current frame. Step 501b derives a predictive first frame top field. Step 501c derives a predictive first frame bottom field. Then, decoding the current frame top field (Step 504) includes decoding the current frame top field in response to the predictive first frame top field. Likewise, decoding a current frame bottom field (Step 506) includes decoding the current frame bottom field in response to the predictive first frame bottom field.

Alternately, but not shown, Step 501b derives a predictive first frame first field, either a top field or a bottom field. In this aspect Step 501c is bypassed. Then, Step 504 decodes the current frame top field in response to the predictive first frame first field. Step 506 decodes the current frame bottom field in response to the predictive first frame first field.

FIG. 6 is a flowchart illustrating the present invention method for encoding 3D video. The method starts at Step 600. Step 602 accepts a current 3D video image, including a first view of the image and a second, 3D, view of the image. In one aspect, Step 602 accepts a first and second view of a stereo image. Step 604 encodes the first view as a frame top field. Step 606 encodes the second view as the frame bottom field. Step 608 transmits a bitstream with a current video frame, having the top field interlaced with the bottom field, into a channel. For example, Step 608 transmits the bitstream in a standard such as MPEG2, MPEG4, or ITU-T H.264.

In one aspect, Step 607 accepts a 2D command responsive to a trigger such as an analysis of receiver capabilities or the channel bandwidth. Then, Step 610 transmits the 2D command to a receiver. In one aspect, Step 610 transmits a supplemental enhancement information (SEI) 3D option message with the current video frame to trigger optional single field two-dimensional (2D) decoding. In another aspect, Step 612 transmits only one of the fields from the current view frame, if the 2D command is transmitted in Step 610.

In one aspect, Step 601a accepts a first video image prior to accepting the current video image. Step 601b encodes a first image top field. Step 601c encodes a first image bottom field. For example, an I-frame may be encoded for MPEG standard transmissions. Then, Step 604 encodes the current frame top field in response to the first image top field, and Step 606 encodes the current frame bottom field in response to the first frame bottom field.

Alternately, Step 601b encodes a first image first field, either a top field or a bottom field, and Step 601c is bypassed. Then, Step 604 encodes the current frame top field in response to the first image first field, and Step 606 encodes the current frame bottom field in response to the first image first field.

Systems and methods for 3D encoding and decoding have been provided. Examples have been given as to how the processes may be scaled for 2D applications. Examples have also been given for how the processes may be enabled with predictive coding. However, the present invention is not limited to merely these examples. Other variations and embodiments of the invention will occur to those skilled in the art.

INVENTORS:

Sun, Shijun, Lei, Shawmin

THIS PATENT IS REFERENCED BY THESE PATENTS:

Patent	Priority	Assignee	Title
10015467,	Dec 18 2008	LG Electronics Inc.	Digital broadcasting reception method capable of displaying stereoscopic image, and digital broadcasting reception apparatus using same
10051226,	Nov 05 2009	Sony Corporation	Transmitter for enabling switching involving a 3D video signal
10368052,	May 24 2011	ADEIA MEDIA HOLDINGS LLC	Dynamic distribution of three-dimensional content
10419753,	Sep 12 2016	Renesas Electronics Corporation	Semiconductor device, moving image processing system, method of controlling semiconductor device
10855736,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming using block partitioning or request controls for improved client-side handling
11122253,	May 24 2011	ADEIA MEDIA HOLDINGS LLC	Dynamic distribution of multi-dimensional multimedia content
11477253,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming system using signaling or block creation
11743317,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming using block partitioning or request controls for improved client-side handling
11770432,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming system for handling low-latency streaming
11991340,	May 24 2011	ADEIA MEDIA HOLDINGS LLC	Dynamic distribution of content
12155715,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming using block partitioning or request controls for improved client-side handling
8363724,	Jul 11 2006	InterDigital VC Holdings, Inc	Methods and apparatus using virtual reference pictures
8520057,	Nov 04 2008	Electronics and Telecommunications Research Institute	Method and system for transmitting/receiving 3-dimensional broadcasting service
8743178,	Jan 05 2010	Dolby Laboratories Licensing Corporation	Multi-view video format control
8806050,	Aug 10 2010	Qualcomm Incorporated	Manifest file updates for network streaming of coded multimedia data
8887020,	Oct 06 2003	Qualcomm Incorporated	Error-correcting multi-stage code generator and decoder for communication systems having single transmitters or multiple transmitters
8917314,	Nov 04 2008	Electronics and Telecommunications Research Institute	Apparatus and method for synchronizing stereoscopic image, and apparatus and method for providing stereoscopic image based on the same
8918533,	Jul 13 2010	Qualcomm Incorporated	Video switching for streaming video data
8948247,	Apr 14 2009	Futurewei Technologies, Inc.	System and method for processing video files
8958375,	Feb 11 2011	Qualcomm Incorporated	Framing for an improved radio link protocol including FEC
9097903,	Jun 16 2009	LG Electronics Inc	3D display device and selective image display method thereof
9131247,	Oct 19 2005	INTERDIGITAL MADISON PATENT HOLDINGS	Multi-view video coding using scalable video coding
9136878,	May 09 2005	Qualcomm Incorporated	File download and streaming system
9136983,	Feb 13 2006	Qualcomm Incorporated	Streaming and buffering using variable FEC overhead and protection periods
9171354,	Dec 13 2012	Samsung Electronics Co., Ltd.	Image processing apparatus and method for enhancing the quality of an image
9178535,	Apr 16 2007	Qualcomm Incorporated	Dynamic stream interleaving and sub-stream based delivery
9185439,	Jul 15 2010	Qualcomm Incorporated	Signaling data for multiplexing video components
9191151,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming using cooperative parallel HTTP and forward error correction
9209934,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming using cooperative parallel HTTP and forward error correction
9225961,	May 13 2010	Qualcomm Incorporated	Frame packing for asymmetric stereo video
9236885,	Oct 05 2002	Qualcomm Incorporated	Systematic encoding and decoding of chain reaction codes
9236887,	May 07 2004	Qualcomm Incorporated	File download and streaming system
9236976,	Dec 21 2001	Qualcomm Incorporated	Multi stage code generator and decoder for communication systems
9237101,	Sep 12 2007	Qualcomm Incorporated	Generating and communicating source identification information to enable reliable communications
9240810,	Jun 11 2002	Qualcomm Incorporated	Systems and processes for decoding chain reaction codes through inactivation
9246633,	Sep 23 1998	Qualcomm Incorporated	Information additive code generator and decoder for communication systems
9253233,	Aug 31 2011	Qualcomm Incorporated	Switch signaling methods providing improved switching between representations for adaptive HTTP streaming
9264069,	May 10 2006	Qualcomm Incorporated	Code generator and decoder for communications systems operating using hybrid codes to allow for multiple efficient uses of the communications systems
9270299,	Feb 11 2011	Qualcomm Incorporated	Encoding and decoding using elastic codes with flexible source block mapping
9270414,	Feb 21 2006	Qualcomm Incorporated	Multiple-field based code generator and decoder for communications systems
9281847,	Feb 27 2009	Qualcomm Incorporated	Mobile reception of digital video broadcasting—terrestrial services
9288010,	Oct 22 2010	Qualcomm Incorporated	Universal file delivery methods for providing unequal error protection and bundled file delivery services
9288473,	Mar 09 2011	Fujitsu Limited	Creating apparatus and creating method
9294226,	Mar 26 2012	Qualcomm Incorporated	Universal object delivery and template-based file delivery
9319448,	Aug 10 2010	Qualcomm Incorporated	Trick modes for network streaming of coded multimedia data
9380096,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming system for handling low-latency streaming
9386064,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming using URL templates and construction rules
9386293,	Apr 12 2010	KONINKLIJKE PHILIPS N V	Method for generating and rebuilding a stereoscopic-compatible video stream and related coding and decoding devices
9419749,	Aug 19 2009	Qualcomm Incorporated	Methods and apparatus employing FEC codes with permanent inactivation of symbols for encoding and decoding processes
9420259,	May 24 2011	ADEIA MEDIA HOLDINGS LLC	Dynamic distribution of three-dimensional content
9432433,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming system using signaling or block creation
9456015,	Aug 10 2010	Qualcomm Incorporated	Representation groups for network streaming of coded multimedia data
9516294,	Dec 18 2008	LG Electronics Inc.	Digital broadcasting reception method capable of displaying stereoscopic image, and digital broadcasting reception apparatus using same
9596447,	Jul 21 2010	Qualcomm Incorporated	Providing frame packing type information for video coding
9602802,	Jul 21 2010	Qualcomm Incorporated	Providing frame packing type information for video coding
9628536,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming using cooperative parallel HTTP and forward error correction
9628823,	Jan 09 2006	INTERDIGITAL MADISON PATENT HOLDINGS	Method and apparatus for providing reduced resolution update mode for multi-view video coding
9681117,	Oct 28 2010	LG Electronics Inc	Receiver apparatus and method for receiving a three-dimensional broadcast signal in a mobile environment
9787966,	Sep 04 2012	Industrial Technology Research Institute	Methods and devices for coding interlaced depth data for three-dimensional video content
9838666,	Aug 12 2011	Renesas Electronics Corporation	Video decoding device and image display device
9843844,	Oct 05 2011	Qualcomm Incorporated	Network streaming of media data
9869875,	Jun 16 2009	LG Electronics Inc.	3D display device and selective image display method thereof
9876607,	Aug 19 2009	Qualcomm Incorporated	Methods and apparatus employing FEC codes with permanent inactivation of symbols for encoding and decoding processes
9917874,	Sep 22 2009	Qualcomm Incorporated	Enhanced block-request streaming using block partitioning or request controls for improved client-side handling
9924152,	Dec 18 2008	LG Electronics Inc.	Digital broadcasting reception method capable of displaying stereoscopic image, and digital broadcasting reception apparatus using same
RE43741,	Oct 05 2002	Qualcomm Incorporated	Systematic encoding and decoding of chain reaction codes

THIS PATENT REFERENCES THESE PATENTS:

Patent	Priority	Assignee	Title
5012326,	Aug 23 1988	Kabushiki Kaisha Toshiba	Television signal transmitting and receiving system
5633682,	Oct 22 1993	Sony Corporation	Stereoscopic coding system
5767898,	Jun 23 1994	Sanyo Electric Co., Ltd.	Three-dimensional image coding by merger of left and right images
6081270,	Jan 27 1997	ACTIVISION PUBLISHING, INC	Method and system for providing an improved view of an object in a three-dimensional environment on a computer display
6449003,	Dec 18 1996	Fujitsu Siemens Computer GmbH	Method and circuit for converting the image format of three-dimensional electronic images produced with line polarization
6515663,	Mar 19 1999	AsusTek Computer Inc.	Apparatus for and method of processing three-dimensional images
6574423,	Feb 28 1996	Panasonic Corporation	High-resolution optical disk for recording stereoscopic video, optical disk reproducing device, and optical disk recording device
6765568,	Jun 12 2000	VREX, INC	Electronic stereoscopic media delivery system
6784891,	Feb 28 2001	MAXELL HOLDINGS, LTD ; MAXELL, LTD	Image display system
7254265,	Apr 01 2000	PHOENIX 3D, INC	Methods and systems for 2D/3D image conversion and optimization
20020009137,
20020047835,
20020122585,
20030095177,
20040022418,
20040032980,
20040096109,
20040126020,
20040218816,
20060013490,
20060177123,
EP639031,
WO3056843,

ASSIGNMENT RECORDS Assignment records on the USPTO

////

Executed on	Assignor	Assignee	Conveyance	Frame	Reel	Doc
Mar 30 2004	LEI, SHAWMIN	Sharp Laboratories of America, Inc	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	015184	0197	pdf
Mar 30 2004	SUN, SHIJUN	Sharp Laboratories of America, Inc	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	015184	0197	pdf
Apr 02 2004		Sharp Laboratories of America, Inc.	(assignment on the face of the patent)
Feb 16 2010	SHARP LABORATORIES OF AMERICA INC	Sharp Kabushiki Kaisha	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	023937	0526	pdf

MAINTENANCE FEES AND DATES: Maintenance records on the USPTO

Date	Maintenance Fee Events
Aug 30 2013	REM: Maintenance Fee Reminder Mailed.
Nov 25 2013	M1551: Payment of Maintenance Fee, 4th Year, Large Entity.
Nov 25 2013	M1554: Surcharge for Late Payment, Large Entity.
Oct 24 2014	ASPN: Payor Number Assigned.
Sep 04 2017	REM: Maintenance Fee Reminder Mailed.
Feb 19 2018	EXP: Patent Expired for Failure to Pay Maintenance Fees.

Date	Maintenance Schedule
Jan 19 2013	4 years fee payment window open
Jul 19 2013	6 months grace period start (w surcharge)
Jan 19 2014	patent expiry (for year 4)
Jan 19 2016	2 years to revive unintentionally abandoned end. (for year 4)
Jan 19 2017	8 years fee payment window open
Jul 19 2017	6 months grace period start (w surcharge)
Jan 19 2018	patent expiry (for year 8)
Jan 19 2020	2 years to revive unintentionally abandoned end. (for year 8)
Jan 19 2021	12 years fee payment window open
Jul 19 2021	6 months grace period start (w surcharge)
Jan 19 2022	patent expiry (for year 12)
Jan 19 2024	2 years to revive unintentionally abandoned end. (for year 12)