A method and device of encoding both a hdr picture (Ihdr) and a first sdr picture (ISDR1) obtained from said hdr picture, in at least one bitstream (F1, F2, F3, F4). The method comprises: —obtaining (210) a second sdr picture (ISDR2) by tone-mapping the hdr picture (Ihdr); —obtaining (230) a color mapping function (CMF) that allows the mapping of the colors of the second sdr picture (ISDR2) onto the colors of a third sdr picture (ISDR3) obtained (220) from the first sdr picture (ISDR1); —encoding (240), in a bitstream, an information (INF) representative of the color mapping function; and —encoding (260), in a bitstream, a fourth sdr picture (ISDR4) obtained (250) from the first sdr picture (ISDR1). The present disclosure further relates to a method and device of decoding.

Patent
   11006151
Priority
Jun 30 2015
Filed
Jun 27 2016
Issued
May 11 2021
Expiry
Jun 27 2036
Assg.orig
Entity
Large
0
46
window open
1. A method of encoding both a high dynamic range (hdr) picture and a first standard dynamic range (sdr) picture obtained from said hdr picture, in at least one bitstream, the method comprising:
obtaining a second sdr picture by tone mapping the hdr picture;
obtaining a color mapping function that allows a mapping of the colors of the second sdr picture onto colors of a third sdr picture obtained from the first sdr picture;
obtaining a fourth sdr picture by applying said color mapping function onto the colors of the second sdr picture;
encoding, in a bitstream, an information representative of the color mapping function; and
encoding, in a bitstream, said fourth sdr picture.
9. A device for encoding both a high dynamic range (hdr) picture and a first standard dynamic range (sdr) picture obtained from said hdr picture, in at least one bitstream, wherein the device comprises a processor configured to:
obtain a second sdr picture by tone-mapping the hdr picture;
obtain a color mapping function that allows a mapping of the colors of a second sdr picture onto the colors of a third sdr picture obtained from the first sdr picture;
obtain a fourth sdr picture by applying said color mapping function onto the colors of the second sdr picture;
encode, in a bitstream, an information representative of the color mapping function; and
encode, in a bitstream, said fourth sdr picture.
17. A non-transitory processor readable medium having stored therein instructions for causing a processor to perform a method of encoding both a high dynamic range (hdr) picture and a first standard dynamic range (sdr) picture obtained from said hdr picture, in at least one bitstream, the method comprising:
obtaining a second sdr picture by tone mapping the hdr picture;
obtaining a color mapping function that allows a mapping of the colors of the second sdr picture onto colors of a third sdr picture obtained from the first sdr picture;
obtaining a fourth sdr picture by applying said color mapping function onto the colors of the second sdr picture;
encoding, in a bitstream, an information representative of the color mapping function; and
encoding, in a bitstream, said fourth sdr picture.
6. A method of decoding a high dynamic range (hdr) picture from at least one bitstream comprising:
obtaining a decoded first standard dynamic range (sdr) picture by decoding a bitstream;
obtaining an information representative of a color mapping function by decoding a bitstream;
obtaining a decoded second sdr picture by applying a function based on the information representative of the color mapping function to the colors of the decoded first sdr picture; and
obtaining a decoded hdr picture by applying an inverse-tone-mapping to the decoded second sdr picture wherein the obtaining of a decoded hdr picture by applying an inverse-tone-mapping to the decoded second sdr picture comprises:
decoding inverse tone mapping information;
obtaining the decoded hdr picture by applying the inverse tone mapping information the second sdr picture.
13. A device for decoding a high dynamic range (hdr) picture from at least one bitstream, wherein the device comprises a processor configured to:
obtain a decoded first standard dynamic range (sdr) picture by decoding a bitstream;
obtain an information representative of a color mapping function by decoding a bitstream;
obtain a decoded second sdr picture by applying a function based on the information representative of the color mapping function to the colors of the decoded first sdr picture; and
obtain a decoded hdr picture by applying an inverse-tone-mapping to the decoded second sdr picture wherein the obtaining of a decoded hdr picture by applying an inverse-tone-mapping to the decoded second sdr picture comprises:
decode inverse tone mapping information;
obtain the decoded hdr picture by applying the inverse tone mapping information to the second sdr picture.
19. A non-transitory processor readable medium having stored therein instructions for causing a processor to perform a method of decoding a high dynamic range (hdr) picture, comprising:
obtaining a decoded first standard dynamic range (sdr) picture by decoding a bitstream;
obtaining an information representative of a color mapping function by decoding a bitstream;
obtaining a decoded second sdr picture by applying a function based on the information representative of the color mapping function to the colors of the decoded first sdr picture; and
obtaining a decoded hdr picture by applying an inverse-tone-mapping to the decoded second sdr picture wherein the obtaining of a decoded hdr picture by applying an inverse-tone-mapping to the decoded second sdr picture comprises:
decoding inverse tone mapping information;
obtaining the decoded hdr picture by applying the inverse tone mapping information to the second sdr picture.
2. The method of claim 1, wherein the third picture is the first sdr picture.
3. The method of claim 1, wherein the third sdr picture is a decoded version of the encoded first sdr picture.
4. The method according to claim 1, wherein the obtaining of a second sdr picture by tone mapping the hdr picture comprises obtaining inverse tone mapping information from the hdr picture, applying this inverse tone mapping information to the hdr picture and encoding this information in a bitstream.
5. The method according to claim 4, wherein the inverse tone mapping information is a backlight picture that is applied to the hdr picture by dividing, pixel by pixel, the hdr picture by the backlight picture to obtain a second sdr picture.
7. The method of claim 6, wherein the obtaining of a decoded hdr picture by applying an inverse-tone-mapping to the decoded second sdr picture comprises:
obtaining a first component by applying a non-linear function on a luminance component, obtained from the bitstream, in order that a dynamic of said first component is increased compared to the dynamic of the luminance component;
obtaining at least one color component from said first component, two chrominance components obtained from the bitstream and from a factor that depends on the luminance component; and
the decoded picture is obtained by combining together said at least one color component.
8. The method of claim 6, wherein the inverse tone mapping information is a backlight picture and applying the inverse tone mapping information to the sdr comprises multiplying the second sdr picture by the backlight picture.
10. The device of claim 9, wherein the third picture is the first sdr picture.
11. The device according to claim 9, wherein, to obtain the second sdr picture by tone mapping the hdr picture, the processor is configured to obtain inverse tone mapping information from the hdr picture; apply the inverse tone mapping information to the hdr picture; and encode the inverse tone mapping information in a bitstream.
12. The device of claim 11, wherein the inverse tone mapping information is a backlight picture that is applied to the hdr picture by dividing, pixel by pixel, the hdr picture by the backlight picture to obtain a second sdr picture.
14. The device of claim 9, wherein the third sdr picture is a decoded version of the encoded first sdr picture.
15. The device of claim 13, wherein the obtaining of a decoded hdr picture by applying an inverse-tone-mapping to the decoded second sdr picture comprises:
obtaining a first component by applying a non-linear function on a luminance component, obtained from the bitstream, in order that a dynamic of said first component is increased compared to the dynamic of the luminance component;
obtaining at least one color component from said first component, two chrominance components obtained from the bitstream and from a factor that depends on the luminance component; and
the decoded picture is obtained by combining together said at least one color component.
16. The device of claim 8, wherein the inverse tone mapping information is a backlight picture and applying the inverse tone mapping information to the second sdr picture comprises multiplying the second sdr picture by the backlight picture.
18. The non-transitory processor readable medium of claim 11 wherein the obtaining of a second sdr picture by tone mapping the hdr picture comprises obtaining inverse tone mapping information from the hdr picture, applying this inverse tone mapping information to the hdr picture and encoding this information in a bitstream.
20. The non-transitory processor readable medium of claim 19, wherein the inverse tone mapping information is a backlight picture and applying the inverse tone mapping information to the second sdr picture comprises multiplying the second sdr picture by the backlight picture.

This application claims the benefit, under 35 U.S.C. § 365 of International Application PCT/EP2016/064837, filed Jun. 27, 2016, which was published in accordance with PCT Article 21(2) on Jan. 5, 2017, in English, and which claims the benefit of European Patent Application No. 15306048.8, filed on Jun. 30, 2015.

The present disclosure generally relates to picture/video encoding and decoding.

The present section is intended to introduce the reader to various aspects of art, which may be related to various aspects of the present principles that are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present principles. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.

In the following, a picture contains one or several arrays of samples (pixel values) in a specific picture/video format which specifies all information relative to the pixel values of a picture (or a video) and all information which may be used by a display and/or any other device to visualize and/or decode a picture (or video) for example. A picture comprises at least one component, in the shape of a first array of samples, usually a luma (or luminance) component, and, possibly, at least one other component, in the shape of at least one other array of samples, usually a color component. Or, equivalently, the same information may also be represented by a set of arrays of color samples, such as the traditional tri-chromatic RGB representation.

A pixel value is represented by a vector of C values, where C is the number of components. Each value of a vector is represented with a number of bits which defines a maximal dynamic range of the pixel values.

Standard-Dynamic-Range pictures (SDR pictures) are color pictures whose luminance values are represented with a limited dynamic usually measured in power of two or f-stops. SDR pictures have a dynamic range, also called a dynamic in the following, around 10 f-stops, i.e. a ratio 1000 between the brightest pixels and the darkest pixels in the linear domain, and are coded with a limited number of bits (most often 8 or 10 in HDTV (High Definition Television systems) and UHDTV (Ultra-High Definition Television systems) in a non-linear domain, for instance by using the ITU-R BT.709 OETF (Optico-Electrical-Transfer-Function) (Rec. ITU-R BT.709-5, April 2002) or ITU-R BT.2020 OETF (Rec. ITU-R BT.2020-1, June 2014) to reduce the dynamic. This limited non-linear representation does not allow correct rendering of small signal variations, in particular in dark and bright luminance ranges. In High-Dynamic-Range pictures (HDR pictures), the signal dynamic is much higher (up to 20 f-stops, a ratio one million between the brightest pixels and the darkest pixels) and a new non-linear representation is needed in order to maintain a high accuracy of the signal over its entire range. In HDR pictures, raw data are usually represented in floating-point format (either 32-bit or 16-bit for each component, namely float or half-float), the most popular format being openEXR half-float format (16-bit per RGB component, i.e. 48 bits per pixel) or in integers with a long representation, typically at least 16 bits.

A color gamut is a certain complete set of colors. The most common usage refers to a set of colors which can be accurately represented in a given circumstance, such as within a given color space or by a certain output device.

A color gamut is sometimes defined by RGB primaries provided in the CIE1931 color space chromaticity diagram and a white point as illustrated in FIG. 1.

It is common to define primaries in the so-called CIE1931 color space chromaticity diagram. This is a two dimensional diagram (x,y) defining the colors independently on the luminance component. Any color XYZ is then projected in this diagram using the transform:

{ x = X X + Y + Z y = Y X + Y + Z
The z=1−x−y component is also defined but carry no extra information.

A gamut is defined in this diagram by the triangle whose vertices are the set of (x,y) coordinates of the three primaries RGB. The white point W is another given (x,y) point belonging to the triangle, usually close to the triangle center.

A color volume is defined by a color space and a dynamic range of the values represented in said color space.

For example, a color gamut is defined by a RGB ITU-R Recommendation BT.2020 color space for UHDTV. An older standard, ITU-R Recommendation BT.709, defines a smaller color gamut for HDTV. In SDR, the dynamic range is defined officially up to 100 nits (candela per square meter) for the color volume in which data are coded, although some display technologies may show brighter pixels.

As explained extensively in “A Review of RGB Color Spaces” by Danny Pascale, a change of gamut, i.e. a transform that maps the three primaries and the white point from a gamut to another, can be performed by using a 3×3 matrix in the linear RGB color space. Also, a change of space from XYZ to RGB is performed by a 3×3 matrix. As a consequence, regardless of whether RGB or XYZ is the color space, a change of gamut can be performed by a 3×3 matrix. For example, a gamut change from BT.2020 linear RGB to BT.709 XYZ can be performed by a 3×3 matrix.

High Dynamic Range pictures (HDR pictures) are color pictures whose luminance values are represented with a HDR dynamic that is higher than the dynamic of a SDR picture.

The HDR dynamic is not yet defined by a standard but one may expect a dynamic range up to a few thousands nits. For instance, a HDR color volume is defined by a RGB BT.2020 color space and the values represented in said RGB color space belong to a dynamic range from 0 to 4000 nits. Another example of HDR color volume is defined by a RGB BT.2020 color space and the values represented in said RGB color space belong to a dynamic range from 0 to 1000 nits.

Color-grading a picture (or a video) is a process of altering/enhancing the colors of the picture (or the video). Usually, color-grading a picture involves a change of the color volume (color space and/or dynamic range) or a change of the color gamut relative to this picture. Thus, two different color-graded versions of a same picture are versions of this picture whose values are represented in different color volumes (or color gamut) or versions of the picture whose at least one of their colors has been altered/enhanced according to different color grades. This may involve user interactions.

For example, in cinematographic production, a picture (of a video) is captured using tri-chromatic cameras into RGB color values composed of 3 components (Red, Green and Blue). The RGB color values depend on the tri-chromatic characteristics (color primaries) of the sensor.

A HDR color-graded version of the captured picture (or video) is then obtained in order to get theatrical renders (using a specific theatrical grade). Typically, the values of the first color-graded version of the captured picture (or video) are represented according to a standardized YUV format such as BT.2020 which defines parameter values for UHDTV.

The YUV format is typically performed by applying a non-linear function, so called Optical Electronic Transfer Function (OETF) on the linear RGB components to obtain non-linear components R′G′B′, and then applying a color transform (usually a 3×3 matrix) on the obtained non-linear R′G′B′ components to obtain the three components YUV. The first component Y is a luminance component and the two components U,V are chrominance components.

Then, a Colorist, usually in conjunction with a Director of Photography, performs a control on the color values of the first color-graded version of the captured picture (or video) by fine-tuning/tweaking some color values in order to instill an artistic intent.

A SDR color-graded version of the captured picture is also obtained to get home release renders (using specific home, Blu-Ray Disk/DVD grade). Typically, the values of the second color-graded version of the captured picture are represented according to a standardized YUV format such as ITU-R Recommendation BT.601 (Rec. 601) which defines studio encoding parameters of Standard Digital Television for standard 4:3 and wide-screen 16:9 aspect ratios, or ITU-R Recommendation BT.709 which defines parameter values for High Definition Television systems (HDTV).

Obtaining such a SDR color-graded version of the captured picture usually comprises shrinking the color volume of the first color-graded version of the captured picture (for example RGB BT.2020 1000 nits modified by the Colorist) in order that the second color-graded version of the captured picture belong to a second color volume (RGB BT.709 1000 nits for example). This is an automatic step which uses a color mapping function (CMF) (for example for mapping of RGB BT.2020 format to RGB BT.709) usually approximated by a three dimensional look-up-table (also called 3D LUT). Note that all the considered YUV formats are characterized with the Color primaries parameters that allow defining any RGB-to-YUV and YUV-to-RGB color mappings.

Then, a Colorist, usually in conjunction with a Director of Photography, performs a control on the color values of the second color-graded version of the captured picture by fine-tuning/tweaking some color values in order to instill the artistic intent in the home release.

The problem to be solved is the distribution of both the HDR color-graded version and the SDR color-graded version of the captured picture (or video), i.e. the distribution of a compressed HDR picture (or video) representative of a color-graded version of a captured picture (or video) while, at the same time, distributing an associated SDR picture (or video) representative of a color-graded SDR version of said captured picture (or video) for backward compatibility with legacy SDR displays for example. Said associated SDR picture (or video) is sometimes called an imposed SDR picture (video).

A straightforward solution is simulcasting both these HDR and SDR color graded pictures (or videos) on a distribution infrastructure. The drawback of this solution is to virtually double the needed bandwidth compared to a legacy infrastructure adapted to broadcast a SDR picture (or video) such as HEVC main 10 profile (“High Efficiency Video Coding”, SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS, Recommendation ITU-T H.265, Telecommunication Standardization Sector of ITU, October 2014).

Using a legacy distribution infrastructure is a requirement to accelerate the emergence of the distribution of HDR pictures (or video). Also, the bitrate shall be minimized while ensuring good quality of both the HDR and SDR pictures (or videos).

The following presents a simplified summary of the disclosure in order to provide a basic understanding of some aspects of the disclosure. This summary is not an extensive overview of the disclosure. It is not intended to identify key or critical elements of the disclosure. The following summary merely presents some aspects of the disclosure in a simplified form as a prelude to the more detailed description provided below.

The present principles set out to remedy at least one of the drawbacks of the prior art with a method of encoding both a HDR picture and a first SDR picture obtained from said HDR picture, in at least one bitstream, the method comprising:

the second HDR picture is obtained by combining together the luminance component and the two chrominance components.

In accordance with an example of the principles, the third and fourth SDR pictures are the first SDR picture.

In accordance with an example of the principles, the fourth SDR picture is the first SDR picture and the third SDR picture is a decoded version of the encoded first SDR picture.

In accordance with an example of the principles, the third SDR picture is the first SDR picture and the fourth SDR picture is obtained by applying the color mapping function onto the colors of the second SDR picture.

In accordance with an example of the principles, the third SDR picture is a decoded version of the encoded first SDR picture and the fourth SDR picture is obtained by applying the color mapping function onto the colors of the second SDR picture.

According to another of their aspects, the present principles relate to a method of decoding a HDR picture from at least one bitstream comprising:

the decoded picture is obtained by combining together said at least one color component.

According to another of their aspects, the present principles relate to a device of encoding both a HDR picture and a first SDR picture obtained from said HDR picture, in at least one bitstream, characterized in that the device comprises a processor configured to:

According to another of their aspects, the present principles relate to a device of decoding a HDR picture from at least one bitstream, characterized in that the device comprises a processor configured to:

the decoded picture is obtained by combining together said at least one color component.

According to other of their aspects, the present principles relate to a device comprising a processor configured to implement the above method, a computer program product comprising program code instructions to execute the steps of the above method when this program is executed on a computer, a processor readable medium having stored therein instructions for causing a processor to perform at least the steps of the above method, and a non-transitory storage medium carrying instructions of program code for executing steps of the above method when said program is executed on a computing device.

The specific nature of the disclosure as well as other objects, advantages, features and uses of the disclosure will become evident from the following description of embodiments taken in conjunction with the accompanying drawings.

In the drawings, an embodiment of the present disclosure is illustrated. It shows:

FIG. 1 shows examples of CIE1931 color space chromaticity diagram;

FIG. 2 shows a block diagram of the steps of a method for encoding both a HDR picture and a SDR picture in accordance with the present principles;

FIG. 3 shows a diagram of the steps of a method for decoding a HDR picture IHDR and a SDR picture ISDR1 in accordance with an example of the present principles;

FIG. 4 shows a diagram of the steps of an example of the method for encoding both the HDR picture IHDR and the first SDR picture ISDR1 as described in relation with FIG. 2;

FIG. 5 shows a diagram of the steps of a method for encoding both a HDR picture and a SDR picture in accordance with a variant of FIG. 4.

FIG. 6 shows a diagram of the steps of an example of the method for encoding both the HDR picture IHDR and the first SDR picture ISDR1 as described in relation with FIG. 2;

FIG. 7 shows a diagram of the steps of a method for encoding both a HDR picture and a SDR picture in accordance with a variant of FIG. 6;

FIG. 8a-d show diagrams of the sub-steps of the step 210 in accordance with examples of the present principles;

FIG. 9 shows a diagram of the steps of a method for decoding both a HDR picture and a SDR picture in accordance with an example of the present principles.

FIG. 10a-c show diagrams of the sub-steps of the step 210 in accordance with examples of the present principles;

FIG. 11a-d show diagrams of the steps of a method of decoding a HDR picture and a SDR picture from at least one bitstream in accordance with an example of the present principles;

FIG. 12 shows an example of an architecture of a device in accordance with an example of present principles; and

FIG. 13 shows two remote devices communicating over a communication network in accordance with an example of present principles;

Similar or same elements are referenced with the same reference numbers.

The present disclosure will be described more fully hereinafter with reference to the accompanying figures, in which embodiments of the disclosure are shown. This disclosure may, however, be embodied in many alternate forms and should not be construed as limited to the embodiments set forth herein. Accordingly, while the disclosure is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that there is no intent to limit the disclosure to the particular forms disclosed, but on the contrary, the disclosure is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the disclosure as defined by the claims.

The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the disclosure. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises”, “comprising,” “includes” and/or “including” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. Moreover, when an element is referred to as being “responsive” or “connected” to another element, it can be directly responsive or connected to the other element, or intervening elements may be present. In contrast, when an element is referred to as being “directly responsive” or “directly connected” to other element, there are no intervening elements present. As used herein the term “and/or” includes any and all combinations of one or more of the associated listed items and may be abbreviated as“/”.

It will be understood that, although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another. For example, a first element could be termed a second element, and, similarly, a second element could be termed a first element without departing from the teachings of the disclosure.

Although some of the diagrams include arrows on communication paths to show a primary direction of communication, it is to be understood that communication may occur in the opposite direction to the depicted arrows.

Some embodiments are described with regard to block diagrams and operational flowcharts in which each block represents a circuit element, module, or portion of code which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that in other implementations, the function(s) noted in the blocks may occur out of the order noted. For example, two blocks shown in succession may, in fact, be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending on the functionality involved.

Reference herein to “an example”, “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment or an example can be included in at least one implementation of the disclosure. The appearances of the phrase “in one embodiment”, “according to an embodiment” “in one example” or “in accordance with an example” in various places in the specification are not necessarily all referring to the same embodiment or example, nor are separate or alternative embodiments or examples necessarily mutually exclusive of other embodiments or examples.

Reference numerals appearing in the claims are by way of illustration only and shall have no limiting effect on the scope of the claims.

While not explicitly described, the present embodiments and variants may be employed in any combination or sub-combination.

The present principles is described for encoding/decoding a picture but extends to the encoding/decoding of a sequence of pictures (video) because each picture of the sequence is sequentially encoded/decoded as described below.

FIG. 2 shows a diagram of the steps of a method for encoding both a HDR picture IHDR and a SDR picture ISDR1 in accordance with the present principles.

The HDR picture IHDR is a color-graded version of a captured picture (or video) according to a first grade, and the first SDR picture ISDR1 is a color-graded version of said captured picture (or video) according to a second grade as explained above. The constraint on this encoding method is that the color-grade of the SDR picture ISDR1 shall be rendered at the decoder or at least a SDR picture having a visual content very close to the visual content of SDR picture ISDR1 in order to preserve the artist intent.

In step 210, a module TM obtains a second SDR picture ISDR2 by tone-mapping the HDR picture IHDR.

The term ‘tone-mapping’ means any approach that reduces the dynamic range of the HDR picture IHDR to a targeted dynamic range. Examples of tone-mapping approaches are given in FIG. 8a-d, 9, 10a-d but the present disclosure is not limited to a specific tone-mapping approach.

In step 220, a module SDR1-to-SDR3 obtains a third SDR picture ISDR3 from the first SDR picture ISDR1.

In step 230, a module CM obtains a color mapping function CMF that allows the mapping of the colors of the second SDR picture ISDR2 onto the colors of the third SDR picture ISDR3 in order to minimize the differences between the second SDR picture ISDR2 and the third SDR picture ISDR3.

For example, the color mapping function is obtained by minimizing a mean square error calculated by subtracting the pixel values of the third SDR picture ISDR3 from the pixels of the SDR picture ISDR2. An example of color mapping function is given by the standard HEVC with color remapping information SEI message (Annex. D.2.32). The present disclosure is not limited to a specific color mapping function but extend to any kind of mapping function.

In step 240, an encoder ENC1 encodes, in a bitstream F1, an information INF representative of the color mapping function CMF.

According to an embodiment of the method, the information INF is an index allowing to retrieve the color mapping function CMF from a list of color mapping functions.

According to an embodiment of the method, the information INF represent parameters of the color mapping function CMF.

In step 250, a module SDR1-to-SDR4 obtains a fourth SDR picture ISDR4 from the first SDR picture ISDR1.

In step 260, an encoder ENC2 encodes the fourth SDR picture ISDR4 in a bitstream F2.

FIG. 3 shows a diagram of the steps of a method for decoding a HDR picture IHDR and a SDR picture ISDR1 in accordance with an example of the present principles.

In step 310, a decoder DEC2 obtains a decoded SDR picture, called a decoded fourth SDR picture ISDR4, by decoding a bitstream F2.

In step 320, a module SDR4-to-SDR1 obtains a decoded first SDR picture ISDR1 from the decoded fourth SDR picture ISDR4.

In step 220, the module SDR1-to-SDR3 obtains a decoded third SDR picture ISDR3 from the decoded first SDR picture ISDR1.

In step 330, a decoder DEC1 obtains an information INF representative of a color mapping function CMF by decoding at least partially a bitstream F1.

According to a variant, the information INF is representative of the inverse of the color mapping function CMF.

In step 340, a module AP−1 obtains a decoded second SDR picture ISDR2 by applying the inverse CMF−1 of the color mapping function CMF to the colors of the decoded third SDR picture ISDR3.

In step 350, a module ITM obtains a decoded HDR picture IHDR by applying an inverse-tone-mapping to the decoded second SDR picture ISDR2.

The inverse-tone-mapping is the inverse of the tone-mapping used in step 210 in FIG. 2.

FIG. 4 shows a diagram of the steps of an example of the method for encoding both the HDR picture IHDR and the first SDR picture ISDR1 as described in relation with FIG. 2.

The modules SDR1-to-SDR3 and SDR1-to-SDR4 are configured in order that the SDR pictures ISDR3 and ISDR4 equal the SDR picture ISDR1.

In other words, those modules do not implement any method.

In step 230, the color mapping function CMF is then obtained to allow the mapping of the colors of the second SDR picture ISDR2 onto the colors of the first SDR picture ISDR1, and in step 260, the first SDR picture ISDR1 is directly encoded by the encoder ENC2.

According to this example of the present principles, the first SDR picture ISDR1 as color-graded by the colorist, is thus directly available by decoding the bitstream F2. The artist intent is thus preserved when the decoded first SDR picture ISDR1 is displayed.

FIG. 5 shows a diagram of the steps of a method for encoding both a HDR picture IHDR and a SDR picture ISDR1 in accordance with a variant of FIG. 4.

The module SDR1-to-SDR4 is configured in order that the fourth SDR picture ISDR4 is the first SDR picture ISDR1. The first SDR picture ISDR1 as color-graded by the colorist, is thus directly available by decoding the bitstream F2. The artist intent is thus preserved when the decoded first picture ISDR1 is displayed.

The module SDR1-to-SDR3 is configured to encode the first SDR picture ISDR1 by using the encoder ENC2, and to obtain the third SDR picture ISDR3 by decoding the encoded first SDR picture ISDR1 according to a decoder DEC2 (step 310).

In step 230, the color mapping function CMF is then obtained to allow the mapping of the colors of the SDR picture ISDR2 onto the colors of the decoded version of the encoded first SDR picture ISDR1.

Determining the color mapping function CMF from the decoded version of the encoded first SDR picture ISDR1 rather than from the first SDR picture ISDR1, leads to a decoded second SDR picture ISDR2 (obtained at the decoding side) whose the content is closer to the content of the second SDR picture ISDR2 used at the encoding side. Then, the decoded HDR picture, obtained from the decoded second SDR picture ISDR2 and the color mapping function determined from said decoded second SDR picture ISDR2, at the encoding side, has a visual content closer to the visual content of the original HDR picture, improving the performance of the HDR encoding/decoding scheme of FIG. 4.

FIG. 6 shows a diagram of the steps of an example of the method for encoding both the HDR picture IHDR and the first SDR picture ISDR1 as described in relation with FIG. 2;

The module SDR1-to-SDR3 is configured in order that the SDR pictures ISDR3 is the SDR picture ISDR1.

In step 230, the color mapping function CMF is then obtained to allow the mapping of the colors of the second SDR picture ISDR2 onto the colors of a first SDR picture ISDR1.

The module SDR1-to-SDR4 comprises a module AP (step 610) to obtain the fourth SDR picture ISDR4 by applying the color mapping function CMF (obtained from the SDR picture ISDR1) onto the colors of the second SDR picture ISDR2.

The content of the fourth SDR picture ISDR4 is thus close to the content of the first SDR picture ISDR1 because the color mapping function CMF is determined in order to minimize the differences between these two pictures.

FIG. 7 shows a diagram of the steps of a method for encoding both a HDR picture IHDR and a SDR picture ISDR1 in accordance with a variant of FIG. 6.

The module SDR1-to-SDR3 is configured to encode (step 260) the first SDR picture ISDR1 by using the encoder ENC2, and to obtain the third SDR picture ISDR3 by decoding the encoded first SDR picture ISDR1 according to the decoder DEC2 (step 310).

Determining the color mapping function CMF from the decoded version of the encoded first SDR picture ISDR1 rather than from the first SDR picture ISDR1, leads to a decoded second SDR picture ISDR2 (obtained at the decoding side) whose the content is closer to the content of the second SDR picture ISDR2 used at the encoding side. Then, the decoded HDR picture, obtained from the decoded second SDR picture ISDR2 and the color mapping function determined from said decoded second SDR picture ISDR2, at the encoding side, has a visual content closer to the visual content of the original HDR picture, improving the performance of the HDR encoding/decoding scheme of FIG. 6.

According to an example of the present principles, in step 210, the module TM applies a tone-mapping operator onto the HDR picture IHDR in order to reduce the dynamic range of the luminance of the HDR picture IHDR to a target dynamic range.

The invention is not limited to any specific tone-mapping operator. This single condition is that the tone-mapping operator shall be reversible. For example, the tone-mapping operator defined by Reinhard may be used (Reinhard, E., Stark, M., Shirley, P., and Ferwerda, J., \Photographic tone reproduction for digital images,” ACM Transactions on Graphics 21 (July 2002)), or defined by Boitard, R., Bouatouch, K., Cozot, R., Thoreau, D., & Gruson, A. (2012). Temporal coherency for video tone mapping. In A. M. J. van Eijk, C. C. Davis, S. M. Hammel, & A. K. Majumdar (Eds.), Proc. SPIE 8499, Applications of Digital Image Processing (p. 84990D-84990D-10)).

FIG. 8a-d show diagrams of the sub-steps of the step 210 in accordance with examples of the present principles.

As illustrated in FIG. 8a, the module TM comprises a module BAM configured to obtain a backlight picture Ba from the HDR picture IHDR (step 2101).

According to an embodiment of step 2101, illustrated in FIG. 8b, the module BAM comprises a module BI which obtains the backlight picture Ba from the luminance component L of the HDR picture IHDR.

When the HDR picture IHDR belongs to a RGB color space, the luminance component L is obtained, for instance in the 709 color gamut, by a linear combination which is given by:
L=0.2127·R+0.7152·G+0.0722·B

According to an embodiment, the backlight picture Ba is determined as being a weighted linear combination of shape functions ψi given by:
Ba=Σiaiψi  (1)
with ai being weighting coefficients.

Thus, determining a backlight picture Ba from a luminance component L consists in finding optimal weighting coefficients (and potentially also optimal shape functions if not known beforehand) in order that the backlight picture Ba fits the luminance component L.

There are many well-known methods to find the weighting coefficients ai. For example, one may use a least mean square error method to minimize the mean square error between the backlight picture Ba and the luminance component L.

It may be noted that the shape functions may be the true physical response of a display backlight (made of LED's for instance, each shape function then corresponding to the response of one LED) or may be a pure mathematical construction in order to fit the luminance component at best.

According to a variant of this embodiment, illustrated in FIG. 8c, the module BAM further comprises a module BM which modulate the backlight picture Ba (given by equation (1)) with a mean luminance value Lmean of the HDR picture IHDR obtained by the means of a module HL.

According to an example, the module HL is configured to calculate the mean luminance value Lmean over the whole luminance component L.

According to an example, the module HL is configured to calculate the mean luminance value Lmean by

L mean = E ( L β ) 1 β

with β being a coefficient less than 1 and E(X) the mathematical expectation value (mean) of the luminance component L.

This last example is advantageous because it avoids that the mean luminance value Lmean be influenced by a few pixels with extreme high values which usually leads to very annoying temporal mean brightness instability when the HDR picture IHDR belongs to a sequence of images.

The invention is not limited to a specific embodiment for calculating the mean luminance value Lmean.

According to a variant, illustrated in FIG. 8d, a module N normalizes the backlight image Ba (given by equation (1)) by its mean value E(Ba) such that one gets a backlight picture Bagray (having a mid-grey equals to 1) for the HDR picture (or for all HDR pictures if the HDR picture belongs to a sequence or group of pictures):

Ba gray = Ba E ( Ba )

Then, the module BM is configured to modulate the backlight picture Bagray with the mean luminance value Lmean of the HDR picture IHDR, by using the following relation
Bamod≈cstmod·Lmeanα·Bagray  (2)
with cstmod being a modulation coefficient and a being another modulation coefficient less than 1, typically ⅓. For example, cstmod≈1.7 for a backlight picture is obtained by least means squares.

Practically, by linearity, all operations to modulate the backlight picture apply to the backlight coefficients ai as a correcting factor which transforms the coefficients ai into new coefficients ãl such that one gets

Ba mod = i a i ~ ψ i

The present disclosure is not limited to any way to obtain a backlight picture Ba from the HDR picture IHDR.

In step 2102, in FIG. 8a, the second SDR picture ISDR2 is obtained by dividing, pixel by pixel, the HDR picture IHDR by the backlight picture Ba.

In step 2103, an encoder ENC3 encodes the backlight picture Ba in a bitstream F3.

Dividing the HDR picture IHDR by the backlight picture Ba reduces the dynamic range of the HDR picture. A method as described in relation with FIG. 8a-d may thus be considered as being a tone-mapping of the HDR picture IHDR.

FIG. 9 shows a diagram of the steps of a method for decoding both a HDR picture and a SDR picture in accordance with an example of the present principles.

This example allows getting both a HDR picture and a SDR picture when those pictures have been previously encoded by a method as described in relation with FIGS. 8a-d.

The module ITM, in step 350, comprises a decoder DEC3 which obtains a decoded backlight picture Ba by decoding a bitstream F3 (step 3501). In step 3502, the decoded HDR picture IHDR is obtained by multiplying the second SDR picture ISDR2 by the decoded backlight picture Ba.

Multiplying the second SDR picture ISDR2 by the decoded backlight picture Ba increases the dynamic range of the resulting HDR picture compared to the second SDR picture ISDR2, i.e such multiplying may be considered as being inverse-tone-mapping.

FIG. 10a-c show diagrams of the sub-steps of the step 210 in accordance with examples of the present principles.

In this example, the HDR picture IHDR is considered as having three color components Ec (c=1, 2 or 3) in which the pixel values of the HDR picture IHDR are represented.

The present disclosure is not limited to any color space in which the three components Ec are represented but extends to any color space such as RGB, CIELUV, XYZ, CIELab, etc.

Basically, a luminance component L and two chrominance components C1 and C2 are determined from the three color components Ec of the HDR picture IHDR. The luminance and chrominance components form a SDR color picture whose pixel values are represented in the color space (L, C1, C2). Said SDR color picture is viewable by a legacy SDR display, i.e. has a sufficient visual quality in order to be viewed by a legacy SDR display.

In step 100a, a module IC obtains a component Y that represents the luminance of the HDR picture IHDR by linearly combining together the three components Ec:

Y = A 1 [ E 1 E 2 E 3 ]

where A1 is the first row of a 3×3 matrix A that defines a color space transforms from the (E1, E2, E3) color space to a color space (Y, C1, C2).

In step 130a, a module BMM obtains a module value Bm from the component Y.

According to an example of the step 130a, the modulation value Bm is an average, median, min or max value of the pixel values of the component Y. These operations may be performed in the linear HDR luminance domain Ylin or in a non-linear domain like ln(Y) or Yγ with γ<1.

In step 110a, a module FM obtains the luminance component L by applying a non-linear function f on the component Y:
L=f(Bm,Y)  (3)

Applying the non-linear function f on the component Y reduces its dynamic range. In other terms, the dynamic of the luminance component L is reduced compared to the dynamic of the component Y.

Basically the dynamic range of the component Y is reduced in order that the luminance values of the component L are represented by using 10 bits.

According to an embodiment, the component Y is divided by the modulation value Bm before applying the non-linear function f:
L=f(Y/Bm)  (4)

According to an embodiment, the non-linear function f is a gamma function:
L=B·Y1γ

where Y1 equals either Y or Y/Ba according to the embodiments of eq. (3) or (4), B is a constant value, γ is a parameter (real value strictly below 1).

According to an example, the non-linear function f is a S-Log function:
L=a·ln(Y1+b)+c
where a, b and c are parameters (real values) of a S Log curve determined such that f(0) and f(1) are invariant, and the derivative of the S Log curve is continuous in 1 when prolonged by a gamma curve below 1. Thus, a, b and c are functions of the parameter γ. Typical values are shown in Table 1.

TABLE 1
Γ a B c
1/2.0 0.6275 0.2550 0.8575
1/2.4 0.4742 0.1382 0.9386
1/2.8 0.3861 0.0811 0.9699

In an advantageous embodiment, a value of γ close to 1/2.5 is efficient in terms of HDR compression performance as well as good viewability of the obtained SDR luma. Thus, the 3 parameters may advantageously take the following values: a=0.44955114, b=0.12123691, c=0.94855684.

According to an example, the non-linear function f is either a gamma correction or a S Log correction according to the pixel values of the component Y.

Applying a gamma correction on the component Y, pulls up the dark regions but does not lower enough high lights to avoid burning of bright pixels.

Then, according to an embodiment, the module FM applies either the gamma correction or the S Log correction according to the pixel values of the component Y. An information data Inf may indicate whether either the gamma correction or Slog correction applies.

For example, when the pixel value of the component Y is below a threshold (equal to 1), then the gamma correction is applied and otherwise the S Log correction is applied.

According to an example, when the method is used to encode several HDR pictures belonging to a sequence of pictures, a modulation value Bm is determined for every HDR picture, a Group of Pictures (GOP) or for a part of a HDR picture such as, but not limited to, a slice or a Transfer Unit as defined in HEVC.

According to an embodiment, the value Bm and/or the parameters of the non-linear function f (such as a, b, c or γ) and/or the information data Inf is (are) stored in a local or remote memory and/or added into a bitstream F3.

In step 120a, at least one color component EC (c=1, 2, 3) is obtained from the HDR picture IHDR. A color component Ec may be obtained directly from a local or a remote memory or by applying a color transform on the HDR picture IHDR.

In step 140a, an intermediate color component E′c (c=1, 2 or 3) is obtained by scaling each color component Ec by a factor r(L) that depends on the luminance component L:

{ E 1 ( i ) = E 1 ( i ) * r ( L ( i ) ) E 2 ( i ) = E 2 ( i ) * r ( L ( i ) ) E 3 ( i ) = E 3 ( i ) * r ( L ( i ) )

where r(L(i)) is a factor (real value), determined by the module RM (step 150a), that depends on the value of a pixel i of the component L, E′c(i) is the value of the pixel i of the intermediate color component E′c, and Ec (i) is the value of the pixel i of the color component Ec.

Scaling by a factor means multiplying by said factor or dividing by the inverse of said factor.

Scaling each color component Ec by the factor r(L) that depends on the luminance component L preserves the hue of the colors of the HDR picture IHDR.

According to an example of the step 150a, the factor r(L) is the ratio of the luminance component L over the component Y:

r ( L ( i ) ) = L ( i ) Y ( i )

with Y(i) being the value of a pixel i of the component Y. Actually, the value Y(i) of a pixel of the component Y depends non-ambiguously on the value L(i) of a pixel of the luminance component L, such that the ratio can be written as a function of L(i) only.

This example is advantageous because scaling each color component Ec by the factor r(L) that further depends on the component Y preserves the hue of the colors of the HDR picture IHDR and thus improves the visual quality of the decoded color picture.

More precisely, in colorimetry and color theory, colorfulness, chroma, and saturation refer to the perceived intensity of a specific color. Colorfulness is the degree of difference between a color and gray. Chroma is the colorfulness relative to the brightness of another color that appears white under similar viewing conditions. Saturation is the colorfulness of a color relative to its own brightness.

A highly colorful stimulus is vivid and intense, while a less colorful stimulus appears more muted, closer to gray. With no colorfulness at all, a color is a “neutral” gray (a picture with no colorfulness in any of its colors is called grayscale). Any color can be described from its colorfulness (or chroma or saturation), lightness (or brightness), and hue.

The definition of the hue and saturation of the color depends on the color space used to represent said color.

For example, when a CIELUV color space is used, the saturation suv is defined as the ratio between the chroma Cu over the luminance L*.

s uv = C uv * L * = u * 2 + v * 2 L *

The hue is then given by

h uv = arctan v * u *

According to another example, when a CIELAB color space is used, the saturation is defined as the ratio of the chroma over the luminance:

s ab = C ab * L * = a * 2 + b * 2 L *

The hue is then given by

h ab = arctan b * a *

These equations are a reasonable predictor of saturation and hue that are in agreement with the human perception of saturation, and demonstrate that adjusting the brightness in CIELAB (or CIELUV) color space while holding the angle a*/b* (or u*/V) fixed does affect the hue and thus the perception of a same color. In step 140a, scaling the color components Ec by a same factor preserves this angle, thus the hue.

Now let us consider that the HDR picture IHDR is represented in the CIELUV color space and a second SDR picture ISDR2 that is formed by combining together the luminance component L, whose dynamic range is reduced compared to the dynamic range of the luminance of the HDR picture IHDR (step 110a), and two chrominance components U (=C1) and V (=C2) of the CIELUV color space. The colors of the second SDR picture ISDR2 are thus differently perceived by a human being because the saturation and the hue of the colors changed. The method described in relation with FIG. 10a determines the chrominance components C1 and C2 of the second SDR picture ISDR2 in order that the hue of the colors of the second SDR picture ISDR2 best match the hue of the colors of the HDR picture IHDR.

According to an example of the step 150a, the factor r(L) is given by:

r ( L ( i ) ) = max { 5 , L ( i ) } 2048 max { 0.01 , Y ( i ) }

This last embodiment is advantageous because it prevents the factor from going to zero for very dark pixels, i.e. allows the ratio to be invertible regardless of the pixel value.

In step 160a, the two chrominance components C1, C2 are obtained from said at least one intermediate color components E′c.

According to an embodiment of the step 160a, illustrated in FIG. 10b, at least one intermediate component Dc (c=1, 2 or 3) is obtained by applying (step 161b) an OETF on every intermediate color component (E′c):

{ D 1 = OETF ( E 1 ) D 2 = OETF ( E 2 ) D 3 = OETF ( E 3 )
For example, the OETF is defined by the ITU-R recommendation BT.709 or BT.2020 and stated as follows

D c = OETF ( E c ) = { 4.5 E c E c < 0.018 1.099 E c ′0 .45 - 0.099 E c 0.018 .

This embodiment allows a reduction of the dynamic range according to a specific OETF but leads to a complex decoding process as detailed later.

According to a variant of this example, illustrated in FIG. 10c, the OETF is approximated by a square root, i.e. at least one intermediate component Dc (c=1, 2 or 3) is obtained by taking the square-root (step 161c) of every intermediate color component (E′c):

{ D 1 = E 1 D 2 = E 2 D 3 = E 3

This variant is advantageous because it provides a good approximation of the OETF defined by the ITU-R recommendation BT.709 or BT.2020 and leads to a low complexity decoder.

According to another variant, the OETF is approximated by a cubic-root, i.e. at least one intermediate component Dc (c=1, 2 or 3) is obtained by taking the cubic-root of every intermediate color component (E′c):

{ D 1 = E 1 3 D 2 = E 2 3 D 3 = E 3 3 ,

This variant is advantageous because it provides a good approximation of the OETF defined by the ITU-R recommendation BT.709 or BT.2020 but it leads to a somewhat more complex decoder than the decoder obtains when the OETF is approximated by a square-root.

In step 162b, a module LC1 obtains the two chrominance components C1 and C2 by linearly combining the three intermediate components Dc:

[ C 1 C 2 ] = [ A 2 A 3 ] [ D 1 D 2 D 3 ]

where A2 and A3 are the second and third rows of the 3×3 matrix A.

In step 170a, as illustrated in FIG. 10a, a module COM obtains the second SDR picture ISDR2 by combining together the luminance component L and the chrominance components C1 and C2.

FIG. 11a-d show diagrams of the steps of a method of decoding a HDR picture and a SDR picture from at least one bitstream in accordance with an example of the present principles.

In step 111a, a module DECOMB obtains a luminance component L and two chrominance components C1, C2 from the second SDR picture ISDR2.

In step 113a, a module IFM obtains a first component Y by applying a non-linear function f−1 on the luminance component L in order that the dynamic of the first component Y is increased compared to the dynamic of the luminance component L:
Y=f−1(Ba,L)  (5)

The non-linear function f−1 is the inverse of the non-linear function f (step 110a).

Thus, the examples of the function f−1 are defined according to the examples of the function f.

According to an example, the value Bm and/or the parameters of the non-linear function f−1 (such as a, b, c or γ) and/or the information data Inf is (are) obtained from a local or remote memory (for example a Look-Up-Table) and/or from a bitstream F3 as illustrated in FIG. 11a.

According to an embodiment, the luminance component L is multiplied by the modulation value Bm after having applied the non-linear function f−1:
Y=Bm*ƒ−1(L)  (6)

According to an example, the non-linear function f−1 is the inverse of a gamma function.

The component Y is then given by:

Y 1 = L 1 / γ B

where Y1 equals Y or Y/Bm according to the embodiments of eq. (5) or (6), B is a constant value, γ is a parameter (real value strictly below 1).

According to an embodiment, the non-linear function f−1 is the inverse of a S-Log function. The component Y1 is then given by:

Y 1 = exp ( L - c a ) - b

According to an embodiment, the non-linear function f is the inverse of either a gamma correction or a S Log correction according to the pixel values of the component Y. This is indicated by the information data Inf.

In step 112a, a module ILC obtains at least one color component Ec from the first component Y, the two chrominance component C1, C2, and from a factor r(L) that depends on the luminance component L. The decoded HDR picture IHDR is then obtained by combining together said at least one color component Ec.

The factor r(L) may be obtained either from a local or remote memory (such a Look-Up-Table) or from a bitstream.

When a general OETF is applied on every intermediate color component E′c (step 161b in FIG. 10b), the intermediate components Dc are related to the component Y, the two chrominance components C1, C2 and the factor r(L):

Y = A 1 [ E 1 E 2 E 3 ] = A 1 [ E 1 E 2 E 3 ] / r ( L ) = A 1 [ EOTF ( D 1 ) EOTF ( D 2 ) EOTF ( D 3 ) ] / r ( L ) ( 7 a )
and

[ C 1 C 2 ] = [ A 2 A 3 ] [ D 1 D 2 D 3 ] ( 7 b )

where EOTF (Electro-Optical Trans Function) is the inverse of OETF applied in step 161b.

Equation (7b) provides

{ D 2 = ϑ 2 D 1 + L 2 ( C 1 , C 2 ) D 3 = ϑ 3 D 1 + L 3 ( C 1 , C 2 ) ( 8 )
where OETF(Ec)=Dc, ∂i are constants depending on the matrix A and Li are linear functions also depending on the matrix A. Then, equation (7a) becomes:
r(L)*Y=A11EOTF(D1)+A12EOTF(D2)+A13EOTF(D3)  (9)
and then
r(L)*Y=A11EOTF(D1)+A12EOTF(∂2D1+L2(C1,C2))+A13EOTF(∂3D1+L3(C1,C2)  (10)

Equation (10) is an implicit equation on D1 only. Depending on the expression of the EOTF, equation (10) can be more or less solved simply. Once solved, D1 is obtained, D2, D3 are deduced from D1 by equation (8). Then the intermediate color component E′c are obtained by applying the EOTF on the three obtained intermediate components Dc, i.e. E′c=EOTF(Dc).

In this general case, i.e. when a general OETF (does not have any specific property) is applied on each intermediate color component E′c, there exist no analytic solution to equation (10). For instance when the OETF is the ITU-R BT.709/2020 OETF, the equation (10) may be solved numerically by using the so-called Newton's method or any other numerical method to find the root of a regular function. However, this leads to highly complex decoders.

In this general case, according to a first example of the step 112a, illustrated in FIG. 11b, in step 1121a, a module ILEC obtains three intermediate color component E′c from the first component Y, the two chrominance component C1, C2 and the factor r(L) as above explained. In step 1122a, the three color components Ec are obtained by scaling each intermediate color component E′c by the factor r(L):
Ec(i)=E′c(i)/r(L(i))

where r(L(i)) is the factor given by step 150a that depends on the value of a pixel i of the component L (output of step 111a), E′c(i) is the value of the pixel i of an intermediate color component E′c, and Ec (i) is the value of the pixel i of the color component Ec.

Actually this order step 1121a before step 1122a is the inverse of the order step 161b followed by step 162b of the encoding method (FIG. 10b).

According to a variant of this first example, the OEFT is a square root function and the EOTF is then a square function.

According to another variant of this first example, the OEFT is a cubic root function and the EOTF is then a cubic function.

When the OETF used in step 161b, fulfills the commutation condition, namely
OETF(x*y)=OETF(x)*OETF(y),

the component Y and the color components Ec are related by:

Y = A 1 [ E 1 E 2 E 3 ] = A 1 [ EOTF ( F 1 ) EOTF ( F 2 ) EOTF ( F 3 ) ] ( 11 )
where Fc are components equal to OETF(Ec) and

[ C 1 C 2 ] = [ C 1 C 2 ] / OETF ( r ( L ) ) = [ A 2 A 3 ] [ D 1 D 2 D 3 ] / OETF ( r ( L ) ) = [ A 2 A 3 ] [ OETF ( E 1 ) OETF ( E 2 ) OETF ( E 3 ) ] / OETF ( r ( L ) ) ,

such that the commutation condition provides

[ C 1 C 2 ] = [ A 2 A 3 ] [ OETF ( E 1 / r ( L ) ) OETF ( E 2 / r ( L ) ) OETF ( E 3 / r ( L ) ) ] = [ A 2 A 3 ] [ OETF ( E 1 ) OETF ( E 2 ) OETF ( E 3 ) ] = [ A 2 A 3 ] [ F 1 F 2 F 3 ] ( 12 )

Equation (11) provides

{ F 2 = ϑ 2 F 1 + L 2 ( C 1 , C 2 ) F 3 = ϑ 3 F 1 + L 3 ( C 1 , C 2 )
where ∂i are constants depending on the matrix A and Li are linear functions also depending on the matrix A.

Then, equation (11) becomes:
Y=A11EOTF(F1)+A12EOTF(F2)+A13EOTF(F3)  (13)
and then
Y=A11EOTF(F1)+A12EOTF(∂2F1+L2(C′1C′2))+A13EOTF(∂3F1+L3(C′1,C′2)  (14)

When the OETF fulfills the commutation conditions, according to a second example of the step 112a, illustrated in FIG. 11c, in step 1121c, two intermediate components C′1 and C′2 are obtained by scaling the two chrominance components C1 and C2 by the factor OEFT(r(L(i))) where OETF is the function used in step 161b in FIG. 10b:

C 1 ( i ) = C 1 ( i ) OETF ( r ( L ( i ) ) ) C 2 ( i ) = C 2 ( i ) OETF ( r ( L ( i ) ) )

where r(L(i)) is the factor given by step 150a that depends on the value of a pixel i of the component L (output of step 111a), C′1(i), C′2(i) is respectively the value of the pixel i of the component C′1 and C′2, C1 (i), C2 (i) is respectively the value of the pixel i of the component C1 and C2.

In step 1122c, a module ILEC obtains the three color components Ec from the first component Y and the two intermediate chrominance components C′1, C′2 as above explained.

According to a variant of this second example, the OEFT is a square root function and the EOTF is then a square function. Then, in step 1122c, the two intermediate components C′1 and C′2 are obtained by scaling the two chrominance components C1 and C2 by the factor √{square root over (r(L(i)))}

C 1 ( i ) = C 1 ( i ) OETF ( r ( L ( i ) ) ) = C 1 ( i ) r ( L ( i ) ) C 2 ( i ) = C 2 ( i ) OETF ( r ( L ( i ) ) ) = C 2 ( i ) r ( L ( i ) )

Equation(11) becomes:

Y = A 1 [ E 1 E 2 E 3 ] = A 1 [ F 1 2 F 2 2 F 3 2 ] and [ C 1 C 2 ] = [ C 1 C 2 ] / r ( L ) = [ A 2 A 3 ] [ D 1 D 2 D 3 ] / r ( L ) = [ A 2 A 3 ] [ E 1 E 2 E 3 ] / r ( L ) ( 14 )
such that the commutation provides

[ C 1 C 2 ] = [ A 2 A 3 ] [ E 1 / r ( L ) E 2 / r ( L ) E 2 / r ( L ) ] = [ A 2 A 3 ] [ E 1 E 2 E 3 ] = [ A 2 A 3 ] [ F 1 F 2 F 3 ] ( 15 )

Equation (14) becomes:
Y=A11F12+A12F22+A13F32  (16) and
Y=A11F12+A12(∂2F1+L2(C′1,C′2))2+A13(∂3F1+L3(C′1,C′2))2   (17)

Equation (17) is a second order equation that may be solved analytically. This analytic solution leads to a specific embodiment of the step 1122c as illustrated in FIG. 11d. This embodiment is advantageous because it allows an analytic expression of the EOTF (inverse of the OETF), and thus of the decoded components of the HDR picture. Moreover, the EOTF is then the square function that is a low complexity process at the decoding side.

In step 11221c, a module SM obtains a second component S by combining together the two intermediate chrominance components C′1, C′2 and the first component Y:
S=√{square root over (Y+k0C′12+k1C′22+k2C′1C′2)}

where k0, k1 and k2 parameters values and C′c2 means the square of a component C′c (c=1 or 2).

In step 11222c, a module LC2 obtains the three solver components Fc by linearly combining together the intermediate chrominance component C′1, C′2 and a second component S:

[ F 1 F 2 F 3 ] = C [ S C 1 C 2 ]

where C is a 3×3 matrix defined as the inverse of the matrix A.

In step 11223c, the three color components Ec are obtained by taking the square of each intermediate color components (Dc):

[ E 1 E 2 E 3 ] = [ EOTF ( F 1 ) EOTF ( F 2 ) EOTF ( F 3 ) ] = [ ( F 1 ) 2 ( F 2 ) 2 ( F 3 ) 2 ]

The matrix A determines the transform of the HDR picture IHDR to be encoded from the color space (E1, E2, E3), in which the pixel values of the HDR picture to be encoded are represented, to the color space (Y, C1, C2).

Such a matrix depends on the gamut of the HDR picture IHDR to be encoded.

For example, when the HDR picture to be encoded is represented in the BT709 gamut as defined by ITU-R Rec. 709, the matrix A is given by:

A = [ 0.2126 0.7152 0.0722 - 0.1146 - 0.3854 0.5 0.5 - 0.4541 0.0459 ]

and the matrix C is given by:

C = [ 1 0 1.5748 1 - 0.1874 - 0.4681 1 1.8556 0 ]

According to a variant of this second embodiment, the OETF is a cubic root function and the EOTF is then a cubic function. Then, in step 1121c in

FIG. 11c, the two intermediate components C′1 and C′2 may then be obtained by scaling the two chrominance components C1 and C2 by the factor 3√{square root over (r(L(i))}:

C 1 ( i ) = C 1 ( i ) r ( L ( i ) 3 C 2 ( i ) = C 2 ( i ) r ( L ( i ) 3 :

The EOTF is then a cubic function thus leading to an equation (17) on F1 being a more complex third order equation which can be solved analytically by the so-called Cardano's method.

Very complex analytic solutions also exist for the fourth order equation (Ferrari's method), but not anymore for any order higher or equal to five as stated by the AbelRuffini theorem.

The decoder DEC1 (respectively DEC2, DEC3) is configured to decode data which have been encoded by the encoder ENC1 (respectively ENC2, ENC3). The encoder ENC1 and/or ENC2 and/or ENC3 (and decoder DEC1 and/or DEC2 and/or DEC3) may be block-based processing.

The encoders ENC1 and/or ENC2 and/or ENC3 (and decoder DEC1 and/or DEC2 and/or DEC3) is not limited to a specific encoder (decoder).

According to an embodiment, the encoder ENC1 is configured to encode the information INF in a SEI message such as the color remapping information SEI message as defined in the HEVC standard (Annex D.2.32).

According to an embodiment, the encoder ENC3 is configured to encode the backlight picture Ba as an auxiliary picture or by using frame packing (Annex D.2.16) as described in the HEVC standard, or to encode the weighting coefficients and possibly the shape functions in a SEI message (HEVC standard, Annex D1).

According to an embodiment, the decoder DEC3 is configured to . . . the decoded backlight picture Ba obtained from an auxiliary picture or a packed frame encoded in the bitstream F1 as described in the HEVC standard, or is obtained from weighting coefficients and possibly shape functions obtained from a SEI message in the bitstream F1.

The encoder ENC1 and/or ENC2 (and decoder DEC1 and/or DEC2) is not limited to a specific encoder which may be, for example, an image/video coder with loss like JPEG, JPEG2000, MPEG2, HEVC recommendation or H264/AVC recommendation (“Advanced video coding for generic audiovisual Services”, SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS, Recommendation ITU-T H.264, Telecommunication Standardization Sector of ITU, February 2014)).

The bitstreams F1, F2, F3 may be multiplexed together to form a single bitstream.

On FIG. 1-11d, the modules are functional units, which may or not be in relation with distinguishable physical units. For example, these modules or some of them may be brought together in a unique component or circuit, or contribute to functionalities of a software. A contrario, some modules may potentially be composed of separate physical entities. The apparatus which are compatible with the disclosure are implemented using either pure hardware, for example using dedicated hardware such ASIC or FPGA or VLSI, respectively «Application Specific Integrated Circuit», «Field-Programmable Gate Array», «Very Large Scale Integration», or from several integrated electronic components embedded in a device or from a blend of hardware and software components.

FIG. 12 represents an exemplary architecture of a device 1200 which may be configured to implement a method described in relation with FIG. 1-11d.

Device 1200 comprises following elements that are linked together by a data and address bus 1201:

In accordance with an example, the battery 1206 is external to the device. In each of mentioned memory, the word «register» used in the specification can correspond to area of small capacity (some bits) or to very large area (e.g. a whole program or large amount of received or decoded data). The ROM 1203 comprises at least a program and parameters. The ROM 1203 may store algorithms and instructions to perform techniques in accordance with present principles. When switched on, the CPU 1202 uploads the program in the RAM and executes the corresponding instructions.

RAM 1204 comprises, in a register, the program executed by the CPU 1202 and uploaded after switch on of the device 1200, input data in a register, intermediate data in different states of the method in a register, and other variables used for the execution of the method in a register.

The implementations described herein may be implemented in, for example, a method or a process, an apparatus, a software program, a data stream, or a signal. Even if only discussed in the context of a single form of implementation (for example, discussed only as a method or a device), the implementation of features discussed may also be implemented in other forms (for example a program). An apparatus may be implemented in, for example, appropriate hardware, software, and firmware. The methods may be implemented in, for example, an apparatus such as, for example, a processor, which refers to processing devices in general, including, for example, a computer, a microprocessor, an integrated circuit, or a programmable logic device. Processors also include communication devices, such as, for example, computers, cell phones, portable/personal digital assistants (“PDAs”), and other devices that facilitate communication of information between end-users.

In accordance with an example of encoding or an encoder, the HDR or SDR picture is obtained from a source. For example, the source belongs to a set comprising:

In accordance with an example of the decoding or a decoder, the decoded SDR or HDR picture is sent to a destination; specifically, the destination belongs to a set comprising:

In accordance with examples of encoding or encoder, the bitstream F1, F2 and/or F3 are sent to a destination. As an example, one of bitstreams F1, F2 and F3 or both bitstreams are stored in a local or remote memory, e.g. a video memory (1204) or a RAM (1204), a hard disk (1203). In a variant, one or both bitstreams are sent to a storage interface (1205), e.g. an interface with a mass storage, a flash memory, ROM, an optical disc or a magnetic support and/or transmitted over a communication interface (1205), e.g. an interface to a point to point link, a communication bus, a point to multipoint link or a broadcast network.

In accordance with examples of decoding or decoder, the bitstream F1, F2 and/or F3 is obtained from a source. Exemplarily, the bitstream is read from a local memory, e.g. a video memory (1204), a RAM (1204), a ROM (1203), a flash memory (1203) or a hard disk (1203). In a variant, the bitstream is received from a storage interface (1205), e.g. an interface with a mass storage, a RAM, a ROM, a flash memory, an optical disc or a magnetic support and/or received from a communication interface (1205), e.g. an interface to a point to point link, a bus, a point to multipoint link or a broadcast network.

In accordance with examples, device 1200 being configured to implement an encoding method described in relation with one of the FIGS. 2, 4-8d, 10a-c, belongs to a set comprising:

In accordance with examples, device 1200 being configured to implement a decoding method described in relation with one of the FIGS. 3, 9 11a-d, belongs to a set comprising:

According to an embodiment illustrated in FIG. 13, in a transmission context between two remote devices A and B over a communication network NET, the device A comprises a processor in relation with memory RAM and ROM which are configured to implement a method for encoding a picture as described in relation with one of the FIGS. 2, 4-8d, 10a-c, and the device B comprises a processor in relation with memory RAM and ROM which are configured to implement which are configured to implement a method for decoding as described in relation with one of the FIGS. 3, 9 11a-d.

In accordance with an example, the network is a broadcast network, adapted to broadcast still pictures or video pictures from device A to decoding devices including the device B.

Implementations of the various processes and features described herein may be embodied in a variety of different equipment or applications. Examples of such equipment include an encoder, a decoder, a post-processor processing output from a decoder, a pre-processor providing input to an encoder, a video coder, a video decoder, a video codec, a web server, a set-top box, a laptop, a personal computer, a cell phone, a PDA, and any other device for processing a picture or a video or other communication devices. As should be clear, the equipment may be mobile and even installed in a mobile vehicle.

Additionally, the methods may be implemented by instructions being performed by a processor, and such instructions (and/or data values produced by an implementation) may be stored on a computer readable storage medium. A computer readable storage medium can take the form of a computer readable program product embodied in one or more computer readable medium(s) and having computer readable program code embodied thereon that is executable by a computer. A computer readable storage medium as used herein is considered a non-transitory storage medium given the inherent capability to store the information therein as well as the inherent capability to provide retrieval of the information therefrom. A computer readable storage medium can be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. It is to be appreciated that the following, while providing more specific examples of computer readable storage mediums to which the present principles can be applied, is merely an illustrative and not exhaustive listing as is readily appreciated by one of ordinary skill in the art: a portable computer diskette; a hard disk; a read-only memory (ROM); an erasable programmable read-only memory (EPROM or Flash memory); a portable compact disc read-only memory (CD-ROM); an optical storage device; a magnetic storage device; or any suitable combination of the foregoing.

The instructions may form an application program tangibly embodied on a processor-readable medium.

Instructions may be, for example, in hardware, firmware, software, or a combination. Instructions may be found in, for example, an operating system, a separate application, or a combination of the two. A processor may be characterized, therefore, as, for example, both a device configured to carry out a process and a device that includes a processor-readable medium (such as a storage device) having instructions for carrying out a process. Further, a processor-readable medium may store, in addition to or in lieu of instructions, data values produced by an implementation.

As will be evident to one of skill in the art, implementations may produce a variety of signals formatted to carry information that may be, for example, stored or transmitted. The information may include, for example, instructions for performing a method, or data produced by one of the described implementations. For example, a signal may be formatted to carry as data the rules for writing or reading the syntax of a described embodiment, or to carry as data the actual syntax-values written by a described embodiment. Such a signal may be formatted, for example, as an electromagnetic wave (for example, using a radio frequency portion of spectrum) or as a baseband signal. The formatting may include, for example, encoding a data stream and modulating a carrier with the encoded data stream. The information that the signal carries may be, for example, analog or digital information. The signal may be transmitted over a variety of different wired or wireless links, as is known. The signal may be stored on a processor-readable medium.

A number of implementations have been described. Nevertheless, it will be understood that various modifications may be made. For example, elements of different implementations may be combined, supplemented, modified, or removed to produce other implementations. Additionally, one of ordinary skill will understand that other structures and processes may be substituted for those disclosed and the resulting implementations will perform at least substantially the same function(s), in at least substantially the same way(s), to achieve at least substantially the same result(s) as the implementations disclosed. Accordingly, these and other implementations are contemplated by this application.

Lasserre, Sebastien, Leleannec, Fabrice, Olivier, Yannick, Lopez, Patrick, Bordes, Philippe, Touze, David

Patent Priority Assignee Title
Patent Priority Assignee Title
10390027, Jan 30 2015 InterDigital VC Holdings, Inc Method and apparatus of encoding and decoding a color picture
6078357, Aug 05 1996 MATSUSHITA ELECTRIC INDUSTRIAL CO , LTD Image mixing circuit
7106352, Mar 03 2003 Oracle America, Inc Automatic gain control, brightness compression, and super-intensity samples
7558436, Jul 20 2006 PECO, INC Image dynamic range control for visual display
8014445, Feb 24 2006 Sharp Kabushiki Kaisha Methods and systems for high dynamic range video coding
8731287, Apr 14 2011 Dolby Laboratories Licensing Corporation Image prediction based on primary color grading model
9480434, Aug 26 2011 Koninklijke Philips Electronics N V Distortion reduced signal detection
9584811, Jun 17 2013 Dolby Laboratories Licensing Corporation Adaptive reshaping for layered coding of enhanced dynamic range signals
20020171663,
20070091213,
20080175495,
20090167955,
20100066762,
20100103200,
20100166301,
20110194618,
20130108183,
20130188696,
20140037206,
20140086321,
20140210847,
20140247870,
20140327822,
20150003749,
20150016735,
20150221280,
20150358646,
20160134872,
20160253792,
CN103503429,
CN103843058,
CN105324997,
EP2890129,
EP2958327,
JP11313338,
JP2002204373,
RU2504011,
WO2010105036,
WO2012122426,
WO2012142589,
WO2014009844,
WO2014077827,
WO2014204865,
WO2013103522,
WO2014128586,
WO2015097118,
/////////
Executed onAssignorAssigneeConveyanceFrameReelDoc
Jun 27 2016INTERDIGITAL MADISON PATENT HOLDINGS SAS(assignment on the face of the patent)
Jun 27 2016BORDES, PHILIPPEThomson LicensingASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0489940485 pdf
Jun 27 2016TOUZE, DAVIDThomson LicensingASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0489940485 pdf
Jun 30 2016OLIVIER, YANNICKThomson LicensingASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0489940485 pdf
Jul 04 2016LASSERRE, SEBASTIANThomson LicensingASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0489940485 pdf
Jul 07 2016LE LEANNEC, FABRICEThomson LicensingASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0489940485 pdf
Dec 12 2017LOPEZ, PATRICKThomson LicensingASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0489940485 pdf
Jul 30 2018Thomson LicensingInterDigital VC Holdings, IncASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0490070001 pdf
Feb 04 2020InterDigital VC Holdings, IncInterDigital Madison Patent Holdings, SASASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0530640254 pdf
Date Maintenance Fee Events
Dec 30 2017BIG: Entity status set to Undiscounted (note the period is included in the code).


Date Maintenance Schedule
May 11 20244 years fee payment window open
Nov 11 20246 months grace period start (w surcharge)
May 11 2025patent expiry (for year 4)
May 11 20272 years to revive unintentionally abandoned end. (for year 4)
May 11 20288 years fee payment window open
Nov 11 20286 months grace period start (w surcharge)
May 11 2029patent expiry (for year 8)
May 11 20312 years to revive unintentionally abandoned end. (for year 8)
May 11 203212 years fee payment window open
Nov 11 20326 months grace period start (w surcharge)
May 11 2033patent expiry (for year 12)
May 11 20352 years to revive unintentionally abandoned end. (for year 12)