There is provided an apparatus for detecting a quality defect in a video sequence, the apparatus comprising: a receiver arranged to receive a video bitstream representing a video sequence; and a defect module arranged to determine a measure of quantization parameter for each picture of the video sequence, the defect module further arranged to identify a quality defect when an abrupt change in the measure of quantization parameter occurs.
|
1. A method for detecting a quality defect in a video sequence, the method comprising:
receiving, by a receiver, an encoded video bitstream representing a video sequence;
determining, by the receiver, a measure of quantization parameter (qp) for each picture of the video sequence; and
identifying, by the receiver, without decoding the encoded video bitstream, a quality defect when an abrupt change in the measure of qp occurs, wherein the abrupt change in the measure of qp is detected when a first picture in the video sequence has a measure of qp that i) is greater than a measure of qp for a preceding picture by more than a first threshold t1 and ii) is greater than a measure of qp for a subsequent picture by more than a second threshold t2.
20. An apparatus for detecting a quality defect in a video sequence, the apparatus comprising:
a receiver arranged to receive an encoded video bitstream representing a video sequence; and
a defect module arranged to determine a measure of quantization parameter for each picture of the video sequence, the defect module further arranged to identify, without decoding the encoded video bitstream, a quality defect when an abrupt change in the measure of qp occurs, wherein the abrupt change in the measure of qp is detected when a first picture in the video sequence has a measure of qp that i) is greater than a measure of qp for a preceding picture by more than a first threshold t1 and ii) is greater than a measure of qp for a subsequent picture by more than a second threshold t2.
26. A method comprising:
receiving a video bitstream representing a video sequence;
determining a measure of quantization parameter (qp) for each picture of the video sequence;
identifying a quality defect when an abrupt change in the measure of qp occurs, wherein the abrupt change in the measure of qp is detected when a first picture has a measure of qp that i) is less than, by a threshold amount, a measure of qp of a first predetermined number of preceding pictures and ii) is less than, by the threshold amount, a measure of qp of a second predetermined number of subsequent pictures;
determining a qp range for the video sequence, wherein determining the qp range for the video sequence comprises calculating QPmax−QPmin, wherein QPmax is the maximum qp value for the video sequence and QPmin is the minimum qp value for the video sequence; and
calculating a quality score for the video sequence using a quality model, the determined qp range, and a defect indicator indicating that the quality defect has been identified.
27. A detector for detecting a quality defect in a video sequence, the detector comprising:
a receiver for receiving an encoded video bitstream representing a video sequence; and
a processor coupled to the receiver, the processor being configured to:
determine an average quantization parameter (qp) for a first picture of the video sequence;
determine an average quantization parameter (qp) for a second picture of the video sequence, wherein the first picture precedes the second picture in the video sequence;
determine an average quantization parameter (qp) for a third picture of the video sequence, wherein the third picture precedes the first picture in the video sequence;
compare the average qp for the first picture to the average qp for the second picture to determine whether the average qp for the first picture is greater than the average qp for the second picture by at least a threshold amount;
compare the average qp for the first picture to the average qp for the third picture to determine whether the average qp for the first picture is greater than the average qp for the third picture by at least a threshold amount;
produce a defect detection indication as a result of determining that: i) the average qp for the first picture is greater than the average qp for the second picture by at least the threshold amount and ii) the average qp for the first picture is greater than the average qp for the third picture by at least the threshold amount;
determine a qp range for the video sequence by calculating QPmax−QPmin, wherein QPmax is the maximum qp value for the video sequence and QPmin is the minimum qp value for the video sequence; and
calculate a quality score for the video sequence using: i) a quality model, ii) the determined qp range, and iii) the defect detection indication.
2. The method of
determining if each picture of the video sequence is an intra-coded picture; and
identifying a quality defect as intra-picture flicker when the abrupt change in the measure of the quantization parameter (qp) occurs such that the first picture is an intra-coded picture and the preceding picture and the subsequent picture are both inter-coded pictures.
3. The method of
4. The method of
5. The method of
7. The method of
determining an average motion vector; and
generating a quality score based on identifying the quality defect and the average motion vector.
8. The method of
9. The method of
10. The method of
11. The method of
12. The method of
13. The method of
14. The method of
15. The method of
16. The method of
17. The method of
determining a qp range for the video sequence, wherein determining the qp range for the video sequence comprises calculating QPmax−QPmin, wherein QPmax is the maximum qp value for the video sequence and QPmin is the minimum qp value for the video sequence; and
calculating a quality score for the video sequence using a quality model, the determined qp range, and a defect indicator indicating that the quality defect has been identified.
18. The method of
obtaining a motion indication indicative of the motion of the video sequence; and
calculating a quality score for the video sequence using a quality model, the obtained motion indication, and a defect indicator indicating that the quality defect has been identified.
19. The method of
the method further comprises determining a qp range for the video sequence, wherein determining the qp range for the video sequence comprises calculating QPmax−QPmin, wherein QPmax is the maximum qp value for the video sequence and QPmin is the minimum qp value for the video sequence, and
the step of calculating the quality score for the video sequence comprises calculating the quality score using the quality model, the obtained motion indication, the determined qp range, and the defect indicator indicating that the quality defect has been identified.
21. The apparatus of
a picture type determining module arranged to determine whether each picture of the video sequence is an intra-coded picture; and
wherein the defect module is further arranged to identify a quality defect as an intra-picture flicker when the abrupt change in the measure of quantization parameter occurs such that the first picture is an intra-coded picture and the preceding picture and the subsequent picture are both inter-coded pictures.
22. The apparatus of
23. The apparatus of
the level of spatial detail in a picture or scene in the video sequence; and
the amount of movement in the picture or scene.
24. The apparatus of
25. A non-transitory computer-readable medium, carrying instructions, which, when executed by computer logic, causes said computer logic to carry out the method of
28. The detector of
the method further comprises determining an average motion for the first picture, and
the processor is configured to calculate the quality score using: i) the quality model, ii) the determined qp range, iii) the defect detection indication, and iv) the determined average motion.
|
This application is a 35 U.S.C. §371 National Phase Entry Application from PCT/EP2011/072288, filed Dec. 9, 2011, designating the United States, the disclosure of which is incorporated herein in its entirety by reference.
The present application relates to a method for detecting a quality defect in a video sequence; an apparatus for detecting a quality defect in a video sequence; and a computer-readable medium.
Block-based coding is the dominant video encoding technology with codec standards such as H.263, MPEG-4 Visual, MPEG-4 AVC (a.k.a. H.264) and the emerging HEVC/H.265 standard being developed in JCT-VC. These codecs use different types of pictures (which employ different types of prediction) to compress the video as efficiently as possible. An intra-coded picture (I-picture) may only be predicted spatially from areas within the picture itself. A predictive picture (P-picture) is predicted from previously coded I or P pictures. A bidirectional predictive pictures (B-picture) is predicted from both previous and/or subsequent pictures.
The result of predicting a particular picture from other pictures is likely to produce a result which is different to the original version of the particular picture. The differences between the prediction of a picture and the original picture are called the picture's residual data. The residual data is transmitted or stored together with the prediction instructions in order to allow the original picture to be accurately reconstructed. In order to make the residual data more compact the data is transformed into the frequency domain. In H.264/AVC the Hadamard transform and a transform similar to a Discrete Cosine Transform are used on 4×4 or 2×2 (chroma) blocks for this purpose. Once the residuals have been transformed a quantization is performed on the transformed data to limit the data that needs to be transmitted or stored. It is in this step that the actual lossy compression takes place. The level of quantization is determined by the quantization parameter (QP) and a corresponding look-up-table. The QP is set at picture level but can also be altered at macro-block (MB) level.
A higher QP means that the residual data are more coarsely quantized, so less detail of the residual data is captured and so the reconstructed picture will match the original picture less well. A lower QP means that more detail of the residual data is captured and so the reconstructed picture will match the original picture well.
It is fairly expensive in terms of bits for an encoder to alter the QP at MB level and so the majority of changes in QP occur between pictures. When the quantization has been performed the quantized transformed residuals are encoded using entropy coding. In H.264/AVC one of the entropy coding algorithms, namely CAVLC (Context Adaptive Variable Length Coding) or CABAC (Context Adaptive Binary Arithmetic Coding), is used for this.
To increase error resilience in error prone networks I-pictures are usually inserted periodically to refresh the video. I-pictures are also inserted periodically to allow for random access and channel switching. Where the forced intra pictures should be inserted in time is not defined by the video coding standard, but is up to the encoder to decide. Typically, video coding standards define the video bitstream syntax and the decoding process, but do not define the encoding process. In other words, the method by which the video sequence is encoded is not standardized, whereas the output of the encoding process is.
To ensure the end-to-end quality of video over fixed and mobile networks network operators and broadcast vendors can utilize objective video quality models. Objective video quality models are mathematical models that approximate results of subjective quality assessment, but are based on criteria and metrics that can be measured objectively and automatically evaluated by a computer program.
The performance of an objective video quality model is evaluated by computing a metric between the objective score generated by the objective video quality model and subjective test results. This metric can be, for example, the correlation between subjective and objective data or the mean squared error. A subjective test result may comprise a mean opinion score (MOS) obtained from the opinions of a plurality of human test subjects.
Perceptual models may be considered to be a subset of objective video quality models. Whereas an objective video quality model can refer to any automated quality assessment method, a perceptual model attempts to determine to what extent any quality defects would be perceived by a viewer. Perceptual models can utilize the pixel information in the decoded video sequence, and in the case of full-reference models the reference signal may also be used to predict the degradation of the processed video. A big disadvantage of perceptual models is that they are usually computational demanding and not very suitable for deployment on a large scale for network monitoring purposes.
An alternative approach to perceptual quality models that is more light-weight than a full-reference model is to use network layer protocol headers as input for quality estimation of a transmitted video. This approach makes the model very efficient to implement and use, but the quality estimation of the transmitted video will be rather coarse. Therefore a video bitstream quality model may also be implemented. This model takes the encoded elementary stream as input in addition to network protocol headers and has the advantage that it will be fairly light-weight and yet has the potential of getting a better estimate of the quality of the video than one just using network layer protocol headers. Such a video bitstream quality model may operate in two modes, one mode where full decoding of the bitstream is allowed and another, lower complexity, mode where full decoding is not allowed (such that pixel information cannot be used).
“Modeling the impact of frame rate and quantization stepsizes and their temporal variations on perceptual video quality: a review of recent works” by Y-F. Qu, Z. Ma, and Y. Wang, and published in Information Sciences and Systems (CISS), 2010 44th Annual Conference, 17-19 Mar. 2010, describes the effect of frame rate and quantization stepsize as well as the temporal variation of the frame rate on the perceptual quality.
“Evaluation of temporal variation of video quality in packet loss networks” by C. Yim and A. Bovik, and published in Signal Processing: Image Communication 26 (2011) describes the effect that variations in the temporal quality of videos have on global video quality.
There is thus a need for quality defect detection that does not require decoding of the encoded video bitstream. Such a quality defect detection may be suitable for implementation with a quality model.
It has been recognized that abrupt changes in the Quantization Parameter (QP) of a block-coded video stream can give rise to notable quality defects in a decoded video sequence. A method and apparatus for detecting such defects is described which does not require the decoding of the encoded video sequence from the video bitstream. The estimation of the visual impact of the change in QP can be used in conjunction with other model parameters in a video bitstream quality model to estimate the overall quality of a video bitstream. The solution described herein thus improves the accuracy of the estimation. This allows an improved video bitstream quality model without the computational overhead of decoding the video sequence from the video bitstream.
Accordingly, there is provided a method for detecting a quality defect in a video sequence. The method comprises receiving a video bitstream representing a video sequence. The method further comprises determining a measure of quantization parameter for each picture of the video sequence. The method further comprises identifying a quality defect when an abrupt change in quantization parameter occurs.
Certain quality defects can be identified in a video bitstream representing a video sequence without decoding the video sequence from the video bitstream. This means quality defects can be identified in the video bitstream at network locations without a decoder, which allows for more accurate quality estimation at nodes throughout the distribution network.
The method may further comprise determining if each picture of the video sequence is an intra-coded picture, and identifying a quality defect as intra-picture flicker when an abrupt change in quantization parameter occurs for an intra-coded picture.
Intra-picture flicker is one type of quality defect that is caused by an abrupt change in quantization parameter. Intra-picture flicker may be identified when an intra coded picture coincides with a peak in quantization parameter. Intra-picture flicker may be identified when an abrupt change in quantization parameter occurs between an intra-coded picture and its neighbouring inter-coded pictures.
The method may be used in a quality estimation model to determine a perceptual quality of the video sequence represented by the video bitstream.
There is further provided an apparatus for detecting a quality defect in a video sequence, the apparatus comprising a receiver and a defect module. The receiver is arranged to receive a video bitstream representing a video sequence. The defect module is arranged to determine a measure of quantization parameter for each picture of the video sequence, and is further arranged to identify a quality defect when an abrupt change in the measure of quantization parameter occurs.
The apparatus may further comprise a picture type determining module arranged to determining whether each picture of the video sequence is an intra-coded picture. The defect module may be further arranged to identify a quality defect as an intra-picture flicker when an abrupt change in the measure of quantization parameter occurs for an intra-coded picture.
The apparatus may further comprise a quality module arranged to receive the video bitstream and a defect indication from the defect module, the quality module further arranged to apply a quality model to estimate a quality of video reconstructed from the video bitstream.
There is further provided a computer-readable medium, carrying instructions, which, when executed by computer logic, causes said computer logic to carry out any of the methods defined herein.
A method and apparatus for detecting a quality defect in a video bitstream will now be described, by way of example only, with reference to the accompanying drawings, in which:
One example of a quality defect caused by an abrupt change in quantization parameter (QP) creating a perceptible quality defect is intra-picture flicker. Intra-picture flicker is a coding artifact that is usually due to the bit rate control of the encoder not being able to keep its requirements of having an almost constant quality given a certain bitrate. Bit rate control in video coding is a complex task and is not a part of any standard. It is up to the developer of the encoder to create a rate control that allocates the available bit budget optimally over the pictures of the sequence to maximize the perceived quality. Rate control requirements can be very different. Rate controls for video-calling are often strict to keep the delay of the video low. This sometimes means keeping to a certain bit budget for each picture. One-way video, like broadcasted mobile TV, usually has more relaxed bit rate control requirements with a bit budget for a certain time period, e.g. 1-2 seconds. More flexibility in the rate of encoded video can be provided by implementing statistical multiplexing and distributing a fixed amount of bandwidth across a plurality of parallel video sequences in response to the encoding demands of each sequence. In some encoding arrangements the encoder does not have access to the subsequent pictures and it must select the encoding parameters on-the-fly (which is also known as one-pass encoding). For these and other reasons, during encoding the bitrate may not be optimally allocated between the pictures of the video sequence.
Intra-picture flicker arises when periodic intra-pictures are used when encoding a scene. The artifact is mainly visible at medium to low bitrates, where the requirements of the rate control can be more difficult to meet. However, the artifact may also be visible even if high bitrates are used. Intra-picture flicker is also more apparent for static scenes and where the inter (P- and B-) pictures are relatively inexpensive to encode in terms of bits (but the intra-pictures still are expensive) and the artifact is not masked by motion within the scene.
A typical intra-picture flicker artifact is illustrated by
The content of video sequence A is of a relatively static nature and depicts grazing cattle. The sequence has been encoded and decoded using JM, which is the reference encoder for H.264/AVC. The pictures 199 and 201 have been encoded as inter-pictures with a relatively low QP. The middle picture 200 has been encoded as an intra-picture with a relatively high QP due to the restrictions of the rate control and the fact that intra-pictures are much more expensive to encode than inter-pictures. Looking at the pictures played out in a sequence will show an obvious and annoying flicker, where the spatial detail within the image disappears at picture 200, but then re-appears at picture 201.
Such intra-picture flicker can be seen in video bitstreams from a variety of encoders. It is not a part of the encoder bit-rate control strategy, but rather a failure to follow the strategy. Since video codec standards do not cover the behaviour of the encoder, but rather the syntax of the bitstream to be decoded by the decoder, there are many different encoding solutions out there. Some encoding solutions manage the intra-picture flickering problem better than others.
Detecting such a defect is particularly important when statistical multiplexing is used and an easy-to-encode channel does not get enough bits to handle the periodic intra-frames correctly. One could also expect the problem to look worse at lower resolutions (e.g. the mobile TV case), when the available bitrate is less, and the encoding complexity per frame has less variation. For instance, having a periodic intra period of 0.5 seconds at a “medium” bitrate for HDTV is usually ok, while using a periodic intra period of 0.5 seconds at a “medium” bitrate for mobile TV (e.g. 320×240 pixels) would very likely result in regular intra-picture flickering.
Presented herein is a method and apparatus for detecting when this quality dip appears by monitoring the QP value. If the QP value is increased by a certain amount between an inter-picture and a following intra-picture and decreased a certain amount for the next-coming inter picture, then an intra-picture flicker is detected. In a preferred embodiment the QP is averaged over each picture before the comparison is made. In another embodiment, the QP difference is calculated per macro block and then the difference is averaged over all macro-blocks.
The plot of
if (picn == inter and picn+1 == intra and QPn+1 − QPn >
threshold t1) {
if (picn+1 == scene_cut)
drop in encoding quality in new scene detected
else if (QPn+1 − QPn+2 > threshold t2) {
intra picture flicker detected
} else {
drop in encoding quality detected
}
}
The high QP peaks a few pictures after the periodic intra pictures do not affect quality that much for the same reason described for sequence A: inter-pictures are predicted from previous (good quality) pictures and a decrease in fidelity of the residuals will not do much to the perceived quality over a few inter-pictures.
The range in QP over a sequence also affects the distribution of the quality of a sequence. The span in quality can be calculated as:
QPrange=QPmax−QPmin
Intra-picture flicker is just one example of an abrupt change in QP causing a quality defect in an encoded video sequence.
Intra-picture flicker as described above occurs when a peak as shown in
A trough as shown in
Even a step up as illustrated in
Similarly, a step down as illustrated in
It has been shown how an intra-picture flicker is detected within a video bitstream without fully decoding the video bitstream. It has also been shown how the general case of a sudden change in QP is detected. This information can be used to improve the accuracy of the quality estimation from a video bitstream quality model.
A video bitstream quality model may further take into account one or more of the following:
One or more of these details may be included in the calculation of the quality estimation of a video bitstream. How the total quality model is constructed is not in the scope of this document, but it could for instance be a linear model like an additive model or a non-linear model like a multiplicative model.
For example, the spatial detail in the picture may be determined using frequency analysis such as Fourier analysis or a discrete cosine transform. The magnitude and number of high frequency components gives a measure of the spatial detail. Since block-based video codecs transform image and residual data into the frequency domain, it is possible to obtain a measure of the spatial detail of a picture by parsing out from the video bitstream the coefficients relating to the frequency components. Such coefficients are often called AC coefficients. The measure of the spatial detail can be used by a quality model to determine the impact of the quality defect on the quality score. Where the measure of spatial detail is high then the quality defect will have a greater impact on the quality score.
As a further example, a measure of the amount of movement in the picture may be determined by the average magnitude of the motion vectors for macroblocks in the picture. The measure of the amount of movement may be used by a quality model to determine the impact of the quality defect on the quality score. Where the measure of the amount of movement is high then the quality defect will have a lower impact on the quality score.
An advantage of the method and apparatus described herein is that these allow detection of intra-picture flicker and similar quality defects without decoding the bitstream. This means that sudden drops in encoding quality are detected. The estimate of the visual impact of these quality defects can be used to improve the accuracy of the estimated quality from a video bitstream quality model. Further, because the video bitstream quality model does not need to decode the bitstream to determine the quality, video quality monitoring can be performed at parts of the distribution network where decoding the video bitstream would be too costly, in terms of processing resources, power consumption, physical space, etc.
It will be apparent to the skilled person that the exact order and content of the actions carried out in the method described herein may be altered according to the requirements of a particular set of execution parameters. Accordingly, the order in which actions are described and/or claimed is not to be construed as a strict limitation on order in which actions are to be performed.
Further, while examples have been given in the context of particular communications standards, these examples are not intended to be the limit of the communications standards to which the disclosed method and apparatus may be applied. For example, while specific examples have been given in the context of H.264, the principles disclosed herein can also be applied to MPEG-2, H.263, MPEG-4, HEVC/H.265 or other codec, and indeed any video codec using quantization of transform coefficients.
Pettersson, Martin, Argyropoulos, Savvas
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
7092448, | May 24 2002 | UNILOC 2017 LLC | Method and system for estimating no-reference objective quality of video data |
7869517, | Dec 06 2002 | British Telecommunications public limited company | Video quality measurement |
20020071492, | |||
20020126258, | |||
20030219070, | |||
20070201752, | |||
20080304562, | |||
20100027621, | |||
20100315557, | |||
20110013694, | |||
20110052084, | |||
20110228859, | |||
CN101978700, | |||
CN1656823, | |||
EP2252073, | |||
EP2373049, | |||
JP2002325254, | |||
JP2006509435, | |||
JP2009260940, | |||
JP2010011112, | |||
JP2011199380, | |||
KR1020100126397, | |||
WO2004054274, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Dec 09 2011 | Telefonaktiebolaget LM Ericsson (publ) | (assignment on the face of the patent) | / | |||
Dec 12 2011 | PETTERSSON, MARTIN | TELEFONAKTIEBOLAGET L M ERICSSON PUBL | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034178 | /0550 | |
Apr 17 2014 | ARGYROPOULOS, SAVVAS | Deutsche Telekom AG | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034178 | /0555 |
Date | Maintenance Fee Events |
Sep 30 2020 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Dec 20 2024 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Date | Maintenance Schedule |
Jun 20 2020 | 4 years fee payment window open |
Dec 20 2020 | 6 months grace period start (w surcharge) |
Jun 20 2021 | patent expiry (for year 4) |
Jun 20 2023 | 2 years to revive unintentionally abandoned end. (for year 4) |
Jun 20 2024 | 8 years fee payment window open |
Dec 20 2024 | 6 months grace period start (w surcharge) |
Jun 20 2025 | patent expiry (for year 8) |
Jun 20 2027 | 2 years to revive unintentionally abandoned end. (for year 8) |
Jun 20 2028 | 12 years fee payment window open |
Dec 20 2028 | 6 months grace period start (w surcharge) |
Jun 20 2029 | patent expiry (for year 12) |
Jun 20 2031 | 2 years to revive unintentionally abandoned end. (for year 12) |