NZ745288B2 - Decoding video data using a two-level multi-type-tree framework - Google Patents
Decoding video data using a two-level multi-type-tree framework Download PDFInfo
- Publication number
- NZ745288B2 NZ745288B2 NZ745288A NZ74528817A NZ745288B2 NZ 745288 B2 NZ745288 B2 NZ 745288B2 NZ 745288 A NZ745288 A NZ 745288A NZ 74528817 A NZ74528817 A NZ 74528817A NZ 745288 B2 NZ745288 B2 NZ 745288B2
- Authority
- NZ
- New Zealand
- Prior art keywords
- tree
- region
- prediction
- decoding
- nodes
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
- H04N19/45—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder performing compensation of the inverse transform mismatch, e.g. Inverse Discrete Cosine Transform [IDCT] mismatch
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/583—Motion compensation with overlapping blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/593—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/96—Tree coding, e.g. quad-tree coding
Abstract
example device for decoding video data includes a video decoder configured to decode one or more syntax elements at a region-tree level of a region-tree of a tree data structure for a coding tree block (CTB) of video data, the region-tree having one or more region-tree nodes including region-tree leaf and non-leaf nodes, each of the region-tree non-leaf nodes having at least four child region-tree nodes, decode one or more syntax elements at a prediction-tree level for each of the region-tree leaf nodes of one or more prediction trees of the tree data structure for the CTB, the prediction trees each having one or more prediction-tree leaf and non-leaf nodes, each of the prediction-tree non-leaf nodes having at least two child prediction-tree nodes, each of the prediction leaf nodes defining respective coding units (CUs), and decode video data for each of the CUs.
Claims (27)
1. A method of decoding video data, the method comprising: decoding one or more syntax elements at a region-tree level of a -tree of a tree data structure for a coding tree block (CTB) of video data, the region-tree having one or more region-tree nodes including zero or more region-tree non-leaf nodes and one or more region-tree leaf nodes, each of the region-tree af nodes having four child region-tree nodes; determining, using the syntax elements at the -tree level, how the regiontree nodes are split into the child region-tree nodes; decoding one or more syntax ts at a prediction-tree level for each of the region-tree leaf nodes of one or more prediction trees of the tree data structure for the CTB, the tion trees each having root nodes corresponding to one or more of the region-tree leaf nodes and one or more prediction-tree nodes including zero or more prediction-tree af nodes and one or more prediction-tree leaf nodes, each of the prediction-tree non-leaf nodes having either two child prediction tree nodes or three child prediction tree nodes obtained using a center-side triple partitioning, with at least one prediction-tree non-leaf node having three child prediction tree nodes obtained using a center-side triple ioning, each of the prediction-tree leaf nodes defining respective coding units (CUs); determining, using the syntax elements at the prediction-tree level, how the prediction-tree nodes are split into the child prediction-tree nodes; and decoding video data, including prediction data and transform data, for each of the CUs based at least in part on the syntax elements at the region-tree level and the syntax ts at the prediction-tree level, wherein the prediction data indicates a prediction mode for forming a predicted block for a corresponding one of the CUs, and wherein the transform data includes transform coefficients representing transformed residual data for the corresponding one of the CU’s.
2. The method of claim 1, wherein decoding the syntax elements at the -tree level and decoding the syntax elements at the prediction-tree level comprises decoding one or more no-further-split tree types for at least one of the region-tree level or the prediction-tree level, wherein a no-further-split tree type means that no further splitting is permitted.
3. The method of claim 1 or 2, further comprising decoding data representing a maximum region-tree depth for the -tree level.
4. The method of claim 3, further sing decoding data representing a maximum prediction-tree depth for the prediction-tree level.
5. The method of claim 4, wherein a sum of the maximum region-tree depth and the maximum prediction-tree depth is less than a m total depth value.
6. The method of any one of claims 3 to 5, wherein decoding the data representing the maximum region-tree depth comprises decoding the data representing the maximum region-tree depth from one or more of a ce parameter set (SPS), a picture parameter set (PPS), or a slice header.
7. The method of any one of claims 1 to 6, further comprising decoding one syntax element that jointly represents both a maximum region-tree depth for the region-tree level and a maximum prediction-tree depth for the prediction-tree level.
8. The method of claim 7, wherein decoding the one syntax element that jointly represents the maximum region-tree depth and the maximum prediction-tree depth comprises decoding the one syntax element from one or more of a sequence parameter set (SPS), a picture parameter set (PPS), or a slice header.
9. The method of any one of claims 1 to 8, further comprising inferring that at least one node, comprising at least one of the region-tree nodes or at least one of the prediction-tree nodes, is split without decoding split data for the node.
10. The method of claim 9, wherein inferring comprises inferring that the at least one node is split based on a block to which the node ponds crossing at least one of picture boundary, a slice boundary, or a tile boundary.
11. The method of any one of claims 1 to 10, further comprising decoding data indicating whether the -tree depth and the prediction-tree depth overlap with each other.
12. The method of any one of claims 1 to 11, wherein the video data for each of the CUs comprises one or more of a skip flag, a merge index, a syntax element representing whether the CU is predicted using inter-mode or intra-mode, mode prediction information, motion information, transform information, residual information, or quantization information.
13. The method of any one of claims 1 to 12, wherein decoding the one or more syntax elements at the region-tree level comprises decoding one or more syntax elements representing splitting ation for the prediction trees before decoding the video data for each of the CUs.
14. The method of claim 13, r comprising determining the number of CUs using the ing information before decoding the video data for each of the CUs.
15. The method of any one of claims 1 to 14, wherein decoding the syntax elements at the region-tree level comprises decoding one or more syntax elements representing one or more d coding tools at the region-tree level.
16. The method of claim 15, further comprising applying the coding tools across boundaries of the CUs when the boundaries of the CUs are within a common region as indicated by one of the region-tree leaf nodes of the region-tree.
17. The method of claim 15 or 16, wherein ng the data representing one or more of the enabled coding tools comprises decoding overlapped block motion compensation (OBMC) mode information in each of the region-tree leaf nodes representing whether OBMC is enabled for blocks of the video data corresponding to the one of the region-tree leaf nodes.
18. The method of claim 15 or 16, wherein decoding the data representing one or more of the enabled coding tools comprises decoding overlapped transforms information in each of the region-tree leaf nodes representing whether overlapped transforms are d for blocks of the video data corresponding to the one of the -tree leaf nodes, wherein overlapped orms comprises a coding tool for which a orm block is permitted to overlap a boundary between two prediction blocks of a region corresponding to the one of the region-tree leaf nodes.
19. The method of claim 15 or 16, wherein decoding the data representing one or more of the enabled coding tools ses decoding data in one or more of the regiontree leaf nodes indicating whether all of the CUs within a region corresponding to the region-tree leaf node are coded using one of skip mode, merge mode, intra-mode, intermode , or frame-rate upconversion (FRUC) mode, and wherein when all of the CUs within one of the regions are coded using a common mode, the method further comprises preventing decoding of mode information for the CUs at the CU level.
20. The method of any one of claims 1 to 19, n decoding the syntax elements at the -tree level comprises decoding at least one of sample adaptive offset (SAO) parameters or adaptive loop filter (ALF) parameters.
21. The method of any one of claims 1 to 20, further sing decoding one or more center-side triple tree syntax elements in at least one of the region-tree level or the prediction-tree level.
22. The method of any one of claims 1 to 21, further comprising calculating respective quantization parameters (QPs) for each of the CUs, wherein calculating the respective QPs ses determining base QPs for each of the region-tree leaf nodes and calculating the respective QPs based on the base QPs of the corresponding regiontree leaf nodes of the CUs.
23. The method of any one of claims 1 to 22, further comprising encoding the video data prior to decoding the video data.
24. A device for decoding video data, the device comprising: a memory configured to store video data; and a processor implemented in circuitry and configured to carry out the method of any one of claims 1 to 23.
25. The device of claim 24, further comprising at least one of: a display configured to display the decoded video data; or a camera configured to capture the video data.
26. The device of claim 24 or 25, wherein the device ses one or more of a camera, a computer, a mobile device, a broadcast er device, or a set-top box.
27. A computer-readable storage medium having stored thereon instructions that, when executed, cause a processor to carry out the method of any one of claims 1 to 23. SOURCE DEVICE DESTINATION DEVICE 12 14 VIDEO SOURCE DISPLAY DEVICE 18 32 VIDEO VIDEO ENCODER DECODER 20 30 OUTPUT 16 INPUT ACE INTERFACE 22 28 WO 65375 s
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201662311248P | 2016-03-21 | 2016-03-21 | |
| US201662401016P | 2016-09-28 | 2016-09-28 | |
| US15/463,398 US11223852B2 (en) | 2016-03-21 | 2017-03-20 | Coding video data using a two-level multi-type-tree framework |
| PCT/US2017/023351 WO2017165375A1 (en) | 2016-03-21 | 2017-03-21 | Decoding video data using a two-level multi-type-tree framework |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| NZ745288A NZ745288A (en) | 2024-08-30 |
| NZ745288B2 true NZ745288B2 (en) | 2024-12-03 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| RU2018133028A (en) | VIDEO ENCODING USING THE INFRASTRUCTURE OF A TWO-LEVEL MULTI-TYPE TREE | |
| AU2023202344B2 (en) | Position dependent intra prediction combination extended with angular modes | |
| US12206864B2 (en) | Intra filtering flag in video coding | |
| US10805641B2 (en) | Intra filtering applied together with transform processing in video coding | |
| US10382795B2 (en) | Method of video coding using binary tree block partitioning | |
| US10375393B2 (en) | Method of video coding using binary tree block partitioning | |
| AU2014408228B2 (en) | Rules for intra-picture prediction modes when wavefront parallel processing is enabled | |
| KR102384090B1 (en) | Determining quantization parameter (qp) values and delta qp values for palette coded blocks in video coding | |
| US9924175B2 (en) | Determining application of deblocking filtering to palette coded blocks in video coding | |
| US20170048552A1 (en) | Method and apparatus for determining residue transform tree representation | |
| US12477107B2 (en) | Encoding and decoding a picture | |
| TWI859804B (en) | Video encoder, video decoder, methods for encoding and decoding and video data stream for realizing advanced video coding concepts | |
| BR112015025393B1 (en) | SYSTEM, METHOD AND COMPUTER READABLE MEDIUM | |
| RU2016125260A (en) | Prediction of the block vector in the encoding / decoding of video and images | |
| MX2025008499A (en) | Prediction mode signaling in video coding | |
| WO2020142175A8 (en) | Tile group signaling in video coding | |
| CN116567207A (en) | Method and apparatus for intra prediction | |
| US10432975B2 (en) | Method for encoding/decoding image and device for same | |
| NZ745288B2 (en) | Decoding video data using a two-level multi-type-tree framework | |
| US20250220184A1 (en) | Encoding method, decoding method, and bitstream | |
| BR112017004041B1 (en) | METHOD IMPLEMENTED BY COMPUTER, COMPUTER READABLE MEDIA AND COMPUTER SYSTEM |