JP4369318B2

JP4369318B2 - ENCODING MODE DETERMINING DEVICE, IMAGE ENCODING DEVICE, ENCODING MODE DETERMINING METHOD, AND ENCODING MODE DETERMINING PROGRAM

Info

Publication number: JP4369318B2
Application number: JP2004213786A
Authority: JP
Inventors: 陽司能登屋; 眞也角野
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 2003-07-24
Filing date: 2004-07-22
Publication date: 2009-11-18
Anticipated expiration: 2024-07-22
Also published as: JP2005057750A

Description

本発明は、符号化モード決定装置、画像符号化装置、符号化モード決定方法、および符号化モード決定プログラムに関する。 The present invention relates to an encoding mode determination device, an image encoding device, an encoding mode determination method, and an encoding mode determination program.

マルチメディア・インターネット時代のキーテクノロジーとして、ＭＰＥＧ−４が注目を集めている。ＭＰＥＧ−４では、移動体通信、インターネットなどの応用領域に対応するため、ＭＰＥＧ−１／２に比べ、符号化効率改善などに特徴を有している（例えば、非特許文献１参照。）。 MPEG-4 is attracting attention as a key technology in the multimedia Internet era. MPEG-4 is characterized by improved coding efficiency and the like compared to MPEG-1 / 2 in order to deal with application areas such as mobile communication and the Internet (see Non-Patent Document 1, for example).

ＭＰＥＧ−４では、新しい高能率の符号化方式として、ＡＶＣと呼ばれる方式が策定されている。ＡＶＣはＩＳＯＭＰＥＧ−４Ｐａｒｔ１０ＡｄｖａｎｃｅｄＶｉｄｅｏＣｏｄｉｎｇまたはＩＴＵ−ＴＨ．２６４と呼ばれている符号化方式である。
この方式は、動き推定やＤＣＴを４×４ピクセルの画像ブロックでも可能とし、動き推定のための画像を複数のピクチャから選択するなどして符号化効率の向上を図っている。ＡＶＣは、これまでの符号化方式に利用されていた技術を取り入れた高機能な符号化方式となっているため、応用領域に適応した最適な利用が課題となっている。 In MPEG-4, a method called AVC is formulated as a new high-efficiency encoding method. AVC is ISO MPEG-4 Part 10 Advanced Video Coding or ITU-T H.264. It is an encoding method called H.264.
This method enables motion estimation and DCT even with an image block of 4 × 4 pixels, and attempts to improve coding efficiency by selecting an image for motion estimation from a plurality of pictures. Since AVC is a high-performance encoding method that incorporates techniques used in conventional encoding methods, optimal use adapted to application areas has become a problem.

例えば、ＡＶＣ策定以前のＭＰＥＧ−４では、マクロブロック単位の符号化モードの候補（パーティションサイズ、予測方向、ダイレクトモードなど）の組み合わせ数が比較的少なく、符号化の際にこれらの候補を網羅して最適な符号化モードの探索を行ってもエンコーダでの処理量の負担は大きくなかった。 For example, in MPEG-4 before AVC development, the number of combinations of encoding mode candidates (partition size, prediction direction, direct mode, etc.) in units of macroblocks is relatively small, and these candidates are covered during encoding. Even when searching for the optimum coding mode, the processing load on the encoder was not large.

一方、ＡＶＣでは、図２５に示す様に、１６×１６ピクセル（以下、１６×１６という）のマクロブロックを１６×１６，１６×８，８×１６，８×８のマクロブロックパーティション（以下、小ブロックという）に分割可能である。また８×８ピクセルの小ブロックを８×８，８×４，４×８，４×４のサブマクロブロックパーティションに分割可能である。 On the other hand, in AVC, as shown in FIG. 25, a 16 × 16 pixel (hereinafter referred to as 16 × 16) macroblock is converted into a 16 × 16, 16 × 8, 8 × 16, and 8 × 8 macroblock partition (hereinafter referred to as “16 × 16”). Can be divided into small blocks). A small block of 8 × 8 pixels can be divided into 8 × 8, 8 × 4, 4 × 8, and 4 × 4 sub-macroblock partitions.

以下、１６×１６に分割された１つの小ブロックを小ブロックＳｂ１と、１６×８に分割された２つの小ブロックを小ブロックＳｂ２，Ｓｂ３と、８×１６に分割された２つの小ブロックを小ブロックＳｂ４，Ｓｂ５と、８×８に分割された４つの小ブロックを小ブロックＳｂ６〜Ｓｂ９とする。 Hereinafter, one small block divided into 16 × 16 is divided into a small block Sb1, two small blocks divided into 16 × 8 into small blocks Sb2 and Sb3, and two small blocks divided into 8 × 16 into two small blocks. The small blocks Sb4 and Sb5 and the four small blocks divided into 8 × 8 are referred to as small blocks Sb6 to Sb9.

また、ＡＶＣでは、図２６に示す様に、それぞれの小ブロックＳｂ１〜Ｓｂ９について、参照ピクチャを参照して動き推定を行うことが可能である。これは、それぞれのサブマクロブロックパーティションについても同様である。さらに、ＡＶＣでは、図２７に示す様に、符号化ピクチャに対して時間的に前の参照ピクチャを参照する前方向予測（図２７（ａ）参照）、符号化ピクチャに対して時間的に後の参照ピクチャを参照する後方向予測（図２７（ｂ）参照）、あるいは符号化ピクチャに対して双方向の参照ピクチャを参照する双方向予測（図２７（ｃ）参照）などのインター予測を行うことが可能である。 In AVC, as shown in FIG. 26, motion estimation can be performed for each of the small blocks Sb1 to Sb9 with reference to a reference picture. The same applies to each sub macroblock partition. Further, in AVC, as shown in FIG. 27, forward prediction (refer to FIG. 27A) referring to a reference picture temporally preceding the coded picture, and temporally subsequent to the coded picture. Inter prediction such as backward prediction referring to the reference picture (see FIG. 27B) or bidirectional prediction referring to the bidirectional reference picture with respect to the encoded picture (see FIG. 27C) is performed. It is possible.

〈従来のエンコーダによる処理〉
これらの符号化モードを網羅する従来のエンコーダの処理を図２８および図２９を用いて説明する。
従来のエンコーダでは、画像ブロックを複数の分割方法候補で分割した小ブロックの全てについて動き推定を行う。さらに、小ブロック毎の参照ピクチャの選択および画像ブロックの分割方法の選択を行い、選択された分割方法を用いた符号化を行う。 <Processing by conventional encoder>
The processing of the conventional encoder covering these coding modes will be described with reference to FIGS.
In a conventional encoder, motion estimation is performed for all small blocks obtained by dividing an image block by a plurality of division method candidates. Further, a reference picture for each small block and an image block dividing method are selected, and encoding using the selected dividing method is performed.

ここで、小ブロック毎の参照ピクチャの選択および画像ブロックの分割方法の選択に際して、符号化コストという量が用いられる。符号化コストとは、画質劣化度（小ブロックと予測画像との絶対差分和）と動き情報（動きベクトルあるいは差分動きベクトルなど）の符号量との和で表される量であり、画像ブロック単位の符号化コストが小さいほど、画像ブロックの符号化効率が良いことを示している。なお、絶対差分和以外にも、差分の二乗和や、差分のアダマール変換やＤＣＴ変換後の誤差の絶対値和などが用いられることがある。 Here, an amount of coding cost is used when selecting a reference picture for each small block and selecting an image block division method. The coding cost is an amount represented by the sum of the image quality degradation degree (absolute difference sum between a small block and a predicted image) and the code amount of motion information (such as a motion vector or a difference motion vector), and is an image block unit. The smaller the encoding cost is, the better the encoding efficiency of the image block is. In addition to the sum of absolute differences, a sum of squares of differences, an absolute value sum of errors after difference Hadamard transform, or DCT transform may be used.

図２８は、小ブロックのそれぞれに対する動き推定の処理フローを示すブロック図である。１６×１６の画像ブロックを分割したＭ×Ｎ（（Ｍ，Ｎ）＝（１６，１６），（１６，８），（８，１６），（８，８））の小ブロックのそれぞれに対して、図２８の処理が行われる。図２８に示す動き推定の処理フローは、小ブロックについてのフルペル予測ステップＳ３００と、サブペル予測ステップＳ３０１と、参照方向選択ステップＳ３０２とを備えている。 FIG. 28 is a block diagram showing a process flow of motion estimation for each of the small blocks. For each of the small blocks of M × N ((M, N) = (16,16), (16,8), (8,16), (8,8)) obtained by dividing the 16 × 16 image block) Thus, the process of FIG. 28 is performed. The motion estimation processing flow shown in FIG. 28 includes a full-pel prediction step S300, a sub-pel prediction step S301, and a reference direction selection step S302 for a small block.

フルペル予測ステップＳ３００は、Ｍ×Ｎの小ブロックに対して、前方向予測および後方向予測による整数画素精度の動き推定を行う（ステップＳ３０５，Ｓ３０６）。具体的には、整数画素精度で、決められた探索範囲内（例えば±３２）の動き推定を行う。すなわち、探索範囲内で、符号化コストを最小とする動きベクトル（以下、ＭＶという）０ｆおよびＭＶ０ｂを検出する。 The full-pel prediction step S300 performs motion estimation with integer pixel accuracy by forward prediction and backward prediction for M × N small blocks (steps S305 and S306). Specifically, motion estimation within a determined search range (for example, ± 32) is performed with integer pixel accuracy. That is, motion vectors (hereinafter referred to as MV) 0f and MV0b that minimize the coding cost are detected within the search range.

サブペル予測ステップＳ３０１は、Ｍ×Ｎの小ブロックに対して、前方向予測、後方向予測および双方向予測による非整数画素精度の動き推定を行う（ステップＳ３０７〜Ｓ３０９）。ＡＶＣのインター予測では、１／２画素精度や１／４画素精度といった非整数画素精度で動き推定を行うことができる。そこで、非整数画素精度の参照ピクチャをフィルタを用いて生成し、生成された参照ピクチャに対して動き推定が行われる。 The sub-pel prediction step S301 performs motion estimation with non-integer pixel accuracy by forward prediction, backward prediction, and bidirectional prediction for M × N small blocks (steps S307 to S309). In AVC inter prediction, motion estimation can be performed with non-integer pixel accuracy such as 1/2 pixel accuracy or 1/4 pixel accuracy. Therefore, a reference picture with non-integer pixel precision is generated using a filter, and motion estimation is performed on the generated reference picture.

前方向予測ステップＳ３０７では、２段階の動きベクトル探索により、ＭＶ２ｆが検出される。具体的には、フルペル予測ステップＳ３００で検出されたＭＶ０ｆを中心として、周囲８近傍の１／２画素（または１／４画素）と中心のＭＶ０ｆを含めた９点の中から、符号化コストを最小とするＭＶ１ｆ（図示せず）が求められる。さらに、ＭＶ１ｆを中心として、周囲８近傍の１／２画素（または１／４画素）と中心のＭＶ１ｆを含めた９点の中から、符号化コストを最小とするＭＶ２ｆが求められる。なお、フルペル予測では整数画素精度の動き推定を行うと書いたが、間引き画素、例えば、水平に1画素間引く、などした場合でも、本発明のモード選択の方法が適用可能である。 In the forward prediction step S307, MV2f is detected by a two-step motion vector search. Specifically, the encoding cost is selected from 9 points including ½ pixel (or ¼ pixel) in the vicinity of 8 surroundings and MV0f in the center with MV0f detected in the full-pel prediction step S300 as the center. The minimum MV1f (not shown) is determined. Further, MV2f that minimizes the coding cost is obtained from 9 points including ½ pixel (or ¼ pixel) in the vicinity of the surrounding 8 and MV1f at the center with MV1f as the center. In the full-pel prediction, it is written that motion estimation with integer pixel accuracy is performed, but the mode selection method of the present invention can be applied even when thinned pixels, for example, one pixel is thinned horizontally.

後方向予測ステップＳ３０８でも、前方向予測ステップＳ３０７と同様に、フルペル予測ステップＳ３００で検出されたＭＶ０ｂから、ＭＶ２ｂが検出される。
双方向予測ステップＳ３０９は、２枚の参照ピクチャを参照するため、処理量が多い。そこで、前方向予測ステップＳ３０７および後方向予測ステップＳ３０８で検出されたＭＶ２ｆおよびＭＶ２ｂを利用した予測が行われる。具体的には、ＭＶ２ｆおよびＭＶ２ｂが示す参照ピクチャ上の参照領域を平均したものを予測画像として用いる。 Also in the backward prediction step S308, MV2b is detected from MV0b detected in the full-pel prediction step S300, as in the forward prediction step S307.
The bidirectional prediction step S309 has a large amount of processing because it refers to two reference pictures. Therefore, prediction using MV2f and MV2b detected in forward prediction step S307 and backward prediction step S308 is performed. Specifically, an average of the reference areas on the reference picture indicated by MV2f and MV2b is used as the predicted image.

また、前方向予測ステップＳ３０７、後方向予測ステップＳ３０８および双方向予測ステップＳ３０９では、それぞれの符号化コストＣ０，Ｃ１およびＣ２が導出される。
参照方向選択ステップＳ３０２は、符号化コストＣ０〜Ｃ２のうち最小の符号化コストを有する方向を小ブロックの参照方向として選択するとともに、最小の符号化コストを出力する。 Also, in the forward prediction step S307, the backward prediction step S308, and the bidirectional prediction step S309, the respective encoding costs C0, C1, and C2 are derived.
The reference direction selection step S302 selects the direction having the minimum encoding cost among the encoding costs C0 to C2 as the reference direction of the small block and outputs the minimum encoding cost.

図２９は、画像ブロックについての動き推定の処理フローを示すブロック図である。図２９の画像ブロックについての動き推定の処理フローは、１６×１６の画像ブロックを４種類の分割方法候補により分割したＭ×Ｎ（（Ｍ，Ｎ）＝（１６，１６），（１６，８），（８，１６），（８，８））の小ブロックのそれぞれに対する動き推定を行う動き推定ステップＳ３１５と、小ブロックごとの動き推定の結果に基づいて、画像ブロックの符号化コストを分割方法候補毎に導出する符号化コスト換算ステップＳ３１６と、分割方法候補毎に導出された画像ブロックの符号化コストから、最良の分割方法を選択する分割方法選択ステップＳ３１７とを備えている。 FIG. 29 is a block diagram illustrating a process flow of motion estimation for an image block. The motion estimation processing flow for the image block of FIG. 29 is obtained by dividing M × N ((M, N) = (16, 16), (16, 8) by dividing a 16 × 16 image block by four types of division method candidates. ), (8, 16), (8, 8)) motion estimation step S315 for performing motion estimation for each of the small blocks, and the coding cost of the image block is divided based on the result of motion estimation for each small block. An encoding cost conversion step S316 derived for each method candidate, and a division method selection step S317 for selecting the best division method from the image block coding costs derived for each division method candidate are provided.

動き推定ステップＳ３１５は、図２８を用いて説明した小ブロックに対する動き推定の処理フローに対応する小ブロック動き推定ステップＳ３２０〜Ｓ３２３を有している。ここで、図２９では、小ブロック動き推定ステップＳ３２１〜Ｓ３２３の処理ブロックは複数の矢印により接続されている。例えば、１６×８の小ブロック動き推定ステップＳ３２１では、それぞれの処理ブロックは２本の矢印により接続されている。これは、それぞれの処理が、１６×１６の画像ブロックを１６×８に分割する２つの小ブロックＳｂ２，Ｓｂ３に対して行われることを示している。同様に、８×１６の小ブロック動き推定ステップＳ３２２においては、それぞれの処理ブロックは、２本の矢印により接続されており、８×８の小ブロック動き推定ステップＳ３２３においては、それぞれの処理ブロックは、４本の矢印により接続されている。それぞれの処理ブロックの処理の内容は、図２８で説明したのと同様であるため、ここでは説明を省略する。 The motion estimation step S315 includes small block motion estimation steps S320 to S323 corresponding to the process flow of motion estimation for the small block described with reference to FIG. Here, in FIG. 29, the processing blocks of the small block motion estimation steps S321 to S323 are connected by a plurality of arrows. For example, in the 16 × 8 small block motion estimation step S321, each processing block is connected by two arrows. This indicates that each process is performed on two small blocks Sb2 and Sb3 that divide a 16 × 16 image block into 16 × 8. Similarly, in the 8 × 16 small block motion estimation step S322, each processing block is connected by two arrows, and in the 8 × 8 small block motion estimation step S323, each processing block is Connected by four arrows. Since the contents of the processing of each processing block are the same as those described with reference to FIG. 28, description thereof is omitted here.

符号化コスト換算ステップＳ３１６は、ＭＢコスト換算ステップＳ３２５〜Ｓ３２８を有している。ＭＢコスト換算ステップＳ３２５〜Ｓ３２８は、小ブロック動き推定ステップＳ３２０〜３２３により出力された小ブロック毎の符号化コストを合計し、画像ブロックの符号化コストを分割方法候補毎に導出する。 The encoding cost conversion step S316 includes MB cost conversion steps S325 to S328. In MB cost conversion steps S325 to S328, the encoding costs for the small blocks output in the small block motion estimation steps S320 to 323 are totaled, and the encoding cost of the image block is derived for each division method candidate.

分割方法選択ステップＳ３１７は、ＭＢコスト換算ステップＳ３２５〜Ｓ３２８が導出した分割方法候補毎の符号化コストのうち、最小の符号化コストを示す分割方法候補を画像ブロックに適用する分割方法として選択する。
また、ＡＶＣでは、図３０に示す様に、２つの画像ブロック７１，７２からなる画像ブロックペア７３という概念を導入しており、画像ブロックペア７３を単位として、フィールド予測およびフレーム予測を適応的に切り換えることが可能である。例えば、フィールド予測の場合、フィールド構造ブロック７５，７６のそれぞれに対して動き推定が行われる。フレーム予測の場合、フレーム構造ブロック７７，７８のそれぞれについて動き推定が行われる。 In the division method selection step S317, among the encoding costs for each of the division method candidates derived by the MB cost conversion steps S325 to S328, the division method candidate indicating the minimum encoding cost is selected as the division method to be applied to the image block.
In addition, as shown in FIG. 30, AVC introduces the concept of an image block pair 73 composed of two image blocks 71 and 72, and adaptively performs field prediction and frame prediction using the image block pair 73 as a unit. It is possible to switch. For example, in the case of field prediction, motion estimation is performed for each of the field structure blocks 75 and 76. In the case of frame prediction, motion estimation is performed for each of the frame structure blocks 77 and 78.

また、画像ブロックペア７３の符号化モードは、符号化ピクチャ構造の２種類（フィールド・フレーム）および符号化予測方式（イントラ・インター予測）の２種類で、合計４種類ある。従来は、これらの全ての組み合わせを考慮していたため、処理量多いと言う問題があった。特にイントラ予測の処理負担が大きかった。 In addition, the coding mode of the image block pair 73 includes two types, that is, two types (field / frame) of the coded picture structure and a coding prediction scheme (intra / inter prediction), for a total of four types. Conventionally, since all these combinations are considered, there is a problem that the amount of processing is large. In particular, the processing load for intra prediction was large.

ここで、従来の符号化モード決定について説明する。ＡＶＣより前のＣｏｄｅｃではＭＢｐａｉｒ（大ブロック）という概念は無く、ＭＢ（中ブロック）の種類として、フィールド／フレームがあった。そして、イントラ／インターとフィールド／フレームの４通りを網羅するのが一般的だった。図３１に示すように、動き推定ステップＳ８１と、ピクチャ構造及び符号化予測方式決定ステップＳ８２とから構成されている。推定ステップＳ８１は、第１〜第６推定ステップＳ８１１〜Ｓ８１６とを有している。第１推定ステップＳ８１１は、フレーム構造ブロックに対してインター予測を行う。第２推定ステップＳ８１２は、フレーム構造ブロックに対してイントラ予測を行う。第３推定ステップＳ８１３は、フィールド構造トップＭＢに対してインター予測を行う。第４推定ステップＳ８１４は、フィールド構造のボトムフィールドに対してインター予測を行う。第３推定ステップＳ８１３によって導出された符号化コストと、第４推定ステップＳ８１４によって導出された符号化コストは合計されて、フィールド構造ブロックに対してインター予測して導出された符号化コストが得られる。第５推定ステップＳ８１５は、フィールド構造のトップフィールドに対してイントラ予測を行う。第６推定ステップＳ８１６は、フィールド構造のボトムフィールドに対してイントラ予測を行う。第５推定ステップＳ８１５によって導出された符号化コストと、第６推定ステップＳ８１６によって導出された符号化コストは合計されて、フィールド構造ブロックに対してイントラ予測して導出された符号化コストが得られる。 Here, conventional coding mode determination will be described. In Codec before AVC, there is no concept of MBpair (large block), and there is field / frame as a type of MB (medium block). And it was common to cover four types of intra / inter and field / frame. As shown in FIG. 31, it consists of a motion estimation step S81 and a picture structure and coding prediction method determination step S82. The estimation step S81 includes first to sixth estimation steps S811 to S816. The first estimation step S811 performs inter prediction on the frame structure block. In the second estimation step S812, intra prediction is performed on the frame structure block. In the third estimation step S813, inter prediction is performed on the field structure top MB. In the fourth estimation step S814, inter prediction is performed on the bottom field of the field structure. The coding cost derived by the third estimation step S813 and the coding cost derived by the fourth estimation step S814 are added together to obtain the coding cost derived by inter prediction on the field structure block. . In the fifth estimation step S815, intra prediction is performed on the top field of the field structure. In the sixth estimation step S816, intra prediction is performed on the bottom field of the field structure. The coding cost derived by the fifth estimation step S815 and the coding cost derived by the sixth estimation step S816 are added together to obtain the coding cost derived by intra prediction on the field structure block. .

ピクチャ構造及び符号化予測方式決定ステップＳ８２は、前記４種類の符号化コストのうち、最小となるものを選択する。
以上までが従来技術であるが、そのような考えを単純にＡＶＣに適用すると考えると、図３２のような処理が想定される。図３２では、処理全体は、動き推定ステップＳ８１’と、符号化予測方式決定ステップＳ８３と、ＭＢペアのピクチャ構造決定ステップＳ８２’とから構成されている。 The picture structure and coding prediction method determination step S82 selects the smallest of the four types of coding costs.
The above is the prior art, but when such an idea is simply applied to AVC, a process as shown in FIG. 32 is assumed. In FIG. 32, the entire process includes a motion estimation step S81 ′, a coding prediction method determination step S83, and an MB pair picture structure determination step S82 ′.

動き推定ステップＳ８１’は、第１〜第８推定ステップＳ８１１’〜Ｓ８１８’を備えている。第１推定ステップＳ８１１’はフレーム構造トップＭＢ７７に対してインター予測を行い、第２推定ステップＳ８１２’はフレーム構造トップＭＢ７７に対してイントラ予測を行う。第３推定ステップＳ８１３’はフレーム構造ボトムＭＢ７８に対してインター予測を行い、第４推定ステップＳ８１４’はフレーム構造ボトムＭＢ７８に対してイントラ予測を行う。第５推定ステップＳ８１５’はフィールド構造トップＭＢ７５に対してインター予測を行い、第６推定ステップＳ８１６’はフィールド構造トップＭＢ７５に対してイントラ予測を行う。第７推定ステップＳ８１７’はフィールド構造ボトムＭＢ７６に対してインター予測を行い、第８推定ステップＳ８１８’はフィールド構造ボトムＭＢ７６に対してイントラ予測を行う。 The motion estimation step S81 'includes first to eighth estimation steps S811' to S818 '. The first estimation step S811 'performs inter prediction on the frame structure top MB77, and the second estimation step S812' performs intra prediction on the frame structure top MB77. The third estimation step S813 'performs inter prediction on the frame structure bottom MB78, and the fourth estimation step S814' performs intra prediction on the frame structure bottom MB78. The fifth estimation step S815 'performs inter prediction on the field structure top MB75, and the sixth estimation step S816' performs intra prediction on the field structure top MB75. The seventh estimation step S817 'performs inter prediction on the field structure bottom MB76, and the eighth estimation step S818' performs intra prediction on the field structure bottom MB76.

符号化予測方式決定ステップＳ８３は、第１〜第４予測方式決定ステップＳ８３１〜Ｓ８３４を備えている。第１予測方式決定ステップＳ８３１は、第１推定ステップＳ８１１’及び第２推定ステップＳ８１２’の符号化コストを比較して、フレーム構造トップＭＢ７７に対するイントラ／インターを選択する。第２予測方式決定ステップＳ８３２は、第３予測ステップＳ８１３’及び第４予測ステップＳ８１４’の符号化コストを比較して、フレーム構造ボトムＭＢ７８に対するイントラ／インターを選択する。イントラ／インターが選択されたフレーム構造のトップＭＢ７７とボトムＭＢ７８の符号化コストは合計され、フレーム構造ブロックペア７７，７８の符号化コストが得られる。第３予測方式決定ステップＳ８３３は、第５推定ステップＳ８１５’及び第６推定ステップＳ８１６’の符号化コストを比較して、フィールド構造トップＭＢ７５に対するイントラ／インターを選択する。第４予測方式決定ステップＳ８３４は、第７推定ステップＳ８１７’及び第８推定ステップＳ８１８’の符号化コストを比較して、フィールド構造ボトムＭＢ７６に対するイントラ／インターを選択する。イントラ／インターが選択されたフィールド構造のトップＭＢ７５とボトムＭＢ７６の符号化コストは合計され、フィールド構造ブロックペア７５，７６の符号化コストが得られる。 The encoding prediction method determination step S83 includes first to fourth prediction method determination steps S831 to S834. The first prediction method determination step S831 compares the coding costs of the first estimation step S811 'and the second estimation step S812', and selects intra / inter for the frame structure top MB77. The second prediction method determination step S832 compares the coding costs of the third prediction step S813 'and the fourth prediction step S814', and selects intra / inter for the frame structure bottom MB78. The coding costs of the top MB 77 and the bottom MB 78 of the frame structure for which intra / inter is selected are added up, and the coding cost of the frame structure block pair 77 and 78 is obtained. The third prediction method determination step S833 selects the intra / inter for the field structure top MB75 by comparing the coding costs of the fifth estimation step S815 'and the sixth estimation step S816'. The fourth prediction method determination step S834 compares the encoding costs of the seventh estimation step S817 'and the eighth estimation step S818', and selects intra / inter for the field structure bottom MB76. The coding costs of the top MB 75 and the bottom MB 76 of the field structure for which intra / inter is selected are summed, and the coding cost of the field structure block pair 75 and 76 is obtained.

ピクチャ構造決定ステップＳ８２’は、フレーム構造ブロックペア７７，７８の符号化コストとフィールド構造ブロックペア７５，７６の符号化コストとを比較し、画像ブロックペア７３（７１，７２）のフィールド／フレームを決定する。
以上の処理では、イントラ予測とインター予測の両方でフィールド／フレームそれぞれのコスト計算を行うため、インター予測とイントラ予測のいずれか一方のみで圧縮率が向上する画像の場合であっても、圧縮率が最良となるように符号化ピクチャ構造と符号化予測方式を決定できる。しかし、その一方でイントラ予測の回数が多いため、処理量が膨大になる。
三木弼一編著，「ＭＰＥＧ−４のすべて」，初版，（株）工業調査会，１９９８年９月３０日，ｐ．３７−５８ The picture structure determination step S82 ′ compares the coding cost of the frame structure block pair 77 and 78 with the coding cost of the field structure block pair 75 and 76, and determines the field / frame of the image block pair 73 (71, 72). decide.
In the above processing, since the cost calculation for each field / frame is performed in both intra prediction and inter prediction, the compression rate is improved even in the case of an image in which the compression rate is improved by only one of inter prediction and intra prediction. Therefore, the coded picture structure and the coded prediction method can be determined so as to be the best. However, on the other hand, since the number of intra predictions is large, the processing amount becomes enormous.
Edited by Junichi Miki, “All about MPEG-4”, first edition, Industrial Research Co., Ltd., September 30, 1998, p. 37-58

以上のように、ＡＶＣでは、マクロブロック（ペア）毎に符号化モードの候補が膨大であり、全ての候補を網羅して符号化効率の高い符号化モードを探索すると、エンコーダの処理量の負担が大きくなる。
そこで、本発明では、より少ない処理量で適切な符号化モードの選択を可能とさせる符号化モード決定装置、画像符号化装置、符号化モード決定方法、および符号化モード決定プログラムを提供することを課題とする。 As described above, in AVC, there are an enormous number of encoding mode candidates for each macroblock (pair), and when searching for an encoding mode with high encoding efficiency covering all candidates, the burden of the processing amount of the encoder is reduced. Becomes larger.
Therefore, the present invention provides an encoding mode determination device, an image encoding device, an encoding mode determination method, and an encoding mode determination program that enable selection of an appropriate encoding mode with a smaller amount of processing. Let it be an issue.

請求項１に記載の符号化モード決定装置では、画像ブロックの符号化モードを複数候補の中から少なくとも１つに決定する装置であって、簡易動き推定部と、符号化モード選択部と、複雑動き推定部と、符号化モード決定部とを備えている。簡易動き推定部は、各符号化モードによってそれぞれ得られる画像ブロックのパーティションである小ブロックに対する簡易な動き推定に基づいて、各符号化モードの符号化コストを導出する。符号化モード選択部は、簡易動き推定部によって導出された符号化コストに基づいて、複数の符号化モードから一部の符号化モードを選択する。複雑動き推定部は、一部の符号化モードの少なくとも一部の符号化モードによって得られる小ブロックに対する複雑な動き推定に基づいて、各符号化モードの符号化コストを導出する。符号化モード決定部は、複雑動き推定部によって導出された符号化コストに基づいて、画像ブロックの符号化モードを決定する。 The encoding mode determining apparatus according to claim 1 is an apparatus that determines at least one encoding mode of an image block from a plurality of candidates, and includes a simple motion estimation unit, an encoding mode selection unit, A motion estimation unit and an encoding mode determination unit are provided. The simple motion estimation unit derives the encoding cost of each encoding mode based on simple motion estimation for a small block that is a partition of an image block obtained by each encoding mode. The coding mode selection unit selects a part of the coding modes from the plurality of coding modes based on the coding cost derived by the simple motion estimation unit. The complex motion estimation unit derives the coding cost of each coding mode based on the complex motion estimation for the small block obtained by at least some of the coding modes. The encoding mode determination unit determines the encoding mode of the image block based on the encoding cost derived by the complex motion estimation unit.

ここで、複雑な動き推定とは、簡易な動き推定よりも複雑な動き推定のことである（以下、同じ）。例えば、複雑な動き推定とは、整数画素精度の簡易な動き推定に対するより詳細な精度（例えば、１／２画素精度、１／４画素精度などといった非整数画素精度）での動き推定、非整数画素の簡易な動き推定に対するより詳細な精度での動き推定、縮小画像（画素情報の間引かれた画像）を参照する簡易な動き推定に対するより詳細な画像を参照する動き推定などである。 Here, complex motion estimation refers to motion estimation that is more complicated than simple motion estimation (the same applies hereinafter). For example, complex motion estimation is motion estimation with a more detailed accuracy (eg, non-integer pixel accuracy such as ½ pixel accuracy, ¼ pixel accuracy, etc.), non-integer than simple motion estimation with integer pixel accuracy For example, motion estimation with more detailed accuracy with respect to simple motion estimation of pixels, motion estimation with reference to a more detailed image with respect to simple motion estimation with reference to a reduced image (image with thinned pixel information), and the like.

符号化コストは、例えば、画質劣化度（小ブロックと動き推定における参照ピクチャとの絶対差分和）と動き情報（動きベクトルあるいは差分動きベクトルなど）の符号量との和で表される。符号化モードとは、例えば、小ブロックの分割方法や、小ブロックの動き推定の際のピクチャ参照方向や、小ブロックの符号化ピクチャ構造などである。 The encoding cost is represented, for example, by the sum of the image quality degradation level (absolute difference sum between the small block and the reference picture in motion estimation) and the code amount of motion information (such as a motion vector or a difference motion vector). The coding mode is, for example, a small block division method, a picture reference direction in motion estimation of a small block, a coded picture structure of a small block, or the like.

この装置では、簡易動き推定部により得られた符号化コストから符号化モード選択部が符号化モードの絞り込みを行う。さらに、絞り込んだ符号化モードの小ブロックに対して、複雑動き推定部が複雑な動き推定を行う。ここで、複雑な動き推定は、例えば、フィルタを適用する必要があるなどの理由により、簡易な動き推定に比して処理量が多いが、この装置では、符号化モードの決定に際して全ての小ブロックについて複雑な動き推定を行う必要が無い。このため、複雑な動き推定の回数を削減でき、符号化モード決定の処理量を低減することが可能となる。また、必要な小ブロックには複雑な動き推定を行うため、適切な符号化効率の符号化モードを決定することが可能となる。 In this apparatus, the encoding mode selection unit narrows down encoding modes from the encoding cost obtained by the simple motion estimation unit. Further, the complex motion estimator performs complex motion estimation on the narrow blocks of the encoded mode. Here, the complicated motion estimation has a larger processing amount than the simple motion estimation because, for example, it is necessary to apply a filter. However, in this apparatus, all small motion estimations are performed when determining the coding mode. There is no need to perform complex motion estimation for blocks. For this reason, the number of times of complicated motion estimation can be reduced, and the processing amount for determining the coding mode can be reduced. In addition, since a complicated motion estimation is performed for a necessary small block, it is possible to determine an encoding mode with appropriate encoding efficiency.

請求項２に記載の符号化モード決定装置では、請求項１において、簡易動き推定部は、各符号化モードの符号化コストを導出する際に、各符号化モードによって得られる小ブロックごとに複数のピクチャ参照方向の簡易な動き推定を行って符号化コストを算出し、次に各小ブロックごとに符号化コストが低いピクチャ参照方向を選択し、次に選択したピクチャ参照方向に関する全ての小ブロックの符号化コストを各分割方法候補ごとに合計して、各分割方法候補ごとの符号化モードの符号化コストを導出する。 In the encoding mode determination apparatus according to claim 2, in the first aspect, when the simple motion estimation unit derives the encoding cost of each encoding mode, a plurality of simple motion estimation units are provided for each small block obtained by each encoding mode. To calculate a coding cost by performing simple motion estimation in the picture reference direction, and then select a picture reference direction with a low coding cost for each small block, and then select all the small blocks related to the selected picture reference direction. Are summed for each division method candidate to derive the encoding mode coding cost for each division method candidate.

この装置では、簡易動き推定部が各小ブロックごとに符号化コストが低いピクチャ参照方向を選択しているため、各分割方法候補ごとの符号化モードにおいて最も符号化コストが小さい小ブロックの組み合わせが可能となる。
請求項３に記載の符号化モード決定装置では、請求項１において、簡易動き推定部は、各符号化モードの符号化コストを導出する際に、各符号化モードによって得られる小ブロックごとに複数のピクチャ参照方向の簡易な動き推定を行って符号化コストを算出し、次に小ブロックの各ピクチャ参照方向ごとの符号化コストを画像ブロック単位に換算して、各分割候補の各ピクチャ参照方向ごとの符号化モードの符号化コストを導出する。 In this apparatus, since the simple motion estimation unit selects a picture reference direction with a low encoding cost for each small block, the combination of small blocks with the lowest encoding cost in the encoding mode for each division method candidate is selected. It becomes possible.
According to a third aspect of the present invention, there is provided a coding mode determining apparatus according to the first aspect, wherein the simple motion estimation unit includes a plurality of simple motion estimation units for each small block obtained by each coding mode when deriving the coding cost of each coding mode. The encoding cost is calculated by performing simple motion estimation in the picture reference direction of each picture, and then the coding cost for each picture reference direction of the small block is converted into image block units, and each picture reference direction of each division candidate is calculated. The encoding cost of each encoding mode is derived.

この装置では、簡易動き推定部が小ブロックのピクチャ参照方向ごとの符号化コストを画像ブロック単位に換算して符号化モードを導出するため、一つの小ブロックにおいて異なるピクチャ参照方向の符号化モードも符号化モード選択部の対象となる。
請求項４に記載の符号化モード決定装置では、請求項２又は３において、簡易動き推定部の複数のピクチャ参照方向の簡易な動き推定は、時間的に前方向のピクチャを参照する前方向予測と、時間的に後方向のピクチャを参照する後方向予測のみを含む。すなわち、この装置は双方向予測を行わない。なお、前方向予測と後方向予測は、それぞれ、同一方向における複数枚のピクチャを参照する複数予測を含む（以下、同じ）。 In this apparatus, since the simple motion estimation unit derives the encoding mode by converting the encoding cost for each picture reference direction of the small block into the image block unit, the encoding mode of different picture reference directions in one small block is also included. This is a target of the encoding mode selection unit.
The encoding mode determination apparatus according to claim 4, wherein the simple motion estimation in the plurality of picture reference directions of the simple motion estimation unit according to claim 2 or 3 is performed by forward prediction with reference to a temporally forward picture. And backward prediction that refers to the backward picture in time. That is, this device does not perform bi-directional prediction. Note that forward prediction and backward prediction each include multiple predictions that refer to multiple pictures in the same direction (hereinafter the same).

この装置では、簡易動き推定部は、前方向予測と後方向予測のみを行う。双方向予測を行わないため、処理量を削減でき、簡易な動き推定の処理時間を短縮できる。
請求項５に記載の符号化モード決定装置では、請求項２又は３において、簡易動き推定部の複数のピクチャ参照方向の簡易な動き推定は、時間的に前方向のピクチャを参照する前方向予測と、時間的に後方向のピクチャを参照する後方向予測と、時間的に双方向のピクチャを参照する双方向予測とを含む。 In this apparatus, the simple motion estimation unit performs only forward prediction and backward prediction. Since bidirectional prediction is not performed, the amount of processing can be reduced, and the processing time for simple motion estimation can be shortened.
The encoding mode determination apparatus according to claim 5, wherein the simple motion estimation in the plurality of picture reference directions of the simple motion estimation unit according to claim 2 or 3 is performed by forward prediction with reference to a temporally forward picture. And backward prediction that refers to temporally backward pictures and bidirectional prediction that refers to temporally bidirectional pictures.

この装置では、双方向予測を行うため、簡易な動き推定の精度を向上させることが可能となる。このため、より適切な符号化モードを選択することが可能となる。
請求項６に記載の符号化モード決定装置では、請求項２又は３において、簡易動き推定部の複数のピクチャ参照方向の簡易な動き推定は、時間的に前方向のピクチャを参照する前方向予測と、時間的に後方向のピクチャを参照する後方向予測とを含む。簡易動き推定部は、前方向予測と後方向予測とに基づいて、時間的に双方向のピクチャを参照する双方向予測を行った場合の符号化コストを導出する。例えば、前方向予測の符号化コストと後方向予測の符号化コストとが近い値の場合には、双方向予測の符号化コストは、それらの符号化コストの小さい方の値よりも少しだけ小さい値である、などと推定される。 Since this apparatus performs bi-directional prediction, it is possible to improve the accuracy of simple motion estimation. For this reason, it becomes possible to select a more appropriate encoding mode.
The coding mode determination apparatus according to claim 6, wherein the simple motion estimation in the plurality of picture reference directions of the simple motion estimation unit according to claim 2 or 3 is performed by forward prediction with reference to a temporally forward picture. And backward prediction with reference to the backward picture in time. The simple motion estimator derives the coding cost when bi-directional prediction with reference to bi-directional pictures is performed based on forward prediction and backward prediction. For example, when the encoding cost of forward prediction and the encoding cost of backward prediction are close to each other, the encoding cost of bi-directional prediction is slightly smaller than the value of the smaller encoding cost. It is estimated that it is a value.

この装置では、双方向予測の予測結果を推定するため、簡易動き推定部では、双方向予測を行う必要はなく、処理量を低減することが可能となる。また、予測結果を簡易動き推定部による符号化コストに反映させることにより、双方向予測を行った場合と同様の効果を簡易に得ることが可能となる。このため、符号化効率を簡易に向上させることが可能となる。 In this apparatus, since the prediction result of bidirectional prediction is estimated, the simple motion estimation unit does not need to perform bidirectional prediction, and the processing amount can be reduced. In addition, by reflecting the prediction result on the coding cost by the simple motion estimation unit, it is possible to easily obtain the same effect as when bi-directional prediction is performed. For this reason, encoding efficiency can be easily improved.

請求項７に記載の符号化モード決定装置では、請求項１〜６のいずれかにおいて、複雑動き推定部は、簡易動き推定部における簡易な動き推定に基づいて、複雑な動き推定の際のピクチャ参照方向を決定する。複雑動き推定部では、決定された参照方向のピクチャを参照して動き推定を行う。すなわち、前方向予測又は後方向予測を実行可能な場合であっても、常に全ての方向の動き推定を行う必要が無くなる。 The coding mode determining apparatus according to claim 7, wherein the complex motion estimator is a picture for complex motion estimation based on simple motion estimation in the simple motion estimator. Determine the reference direction. The complex motion estimation unit performs motion estimation with reference to the determined picture in the reference direction. That is, even when forward prediction or backward prediction can be performed, it is not necessary to always perform motion estimation in all directions.

この装置では、必要な参照方向を参照して複雑な動き推定を実行することが可能となる。このため、複雑な動き推定の処理量を削減でき、複雑な動き推定の処理時間を短縮可能となる。
請求項８に記載の符号化モード決定装置では、請求項７において、複雑動き推定部は、簡易動き推定部における小ブロックに対する簡易な動き推定の結果、前方向予測と後向予測の符号化コストが概ね同じ場合は両方を選択し、異なる場合は符号化コストが小さい方のみを選択する。 In this apparatus, it is possible to execute complicated motion estimation with reference to a necessary reference direction. For this reason, the amount of processing of complicated motion estimation can be reduced, and the processing time of complicated motion estimation can be shortened.
The coding mode determining apparatus according to claim 8, wherein the complex motion estimation unit according to claim 7 is a coding result of forward prediction and backward prediction as a result of simple motion estimation for a small block in the simple motion estimation unit. If both are substantially the same, both are selected, and if they are different, only the one with the lower coding cost is selected.

この装置では、前方向予測と後向予測の符号化コストが概ね同じ場合は両方を選択し、さらに双方向予測を行うことができる。また、異なる場合は符号化コストが小さい方のみを選択する。これは、一方の符号化コストが大きい場合は、双方向予測で符号化コストが小さくなることが期待できないからである。 In this apparatus, when the encoding costs of forward prediction and backward prediction are substantially the same, both can be selected and further bi-directional prediction can be performed. If they are different, only the one with the lower coding cost is selected. This is because when one coding cost is large, it cannot be expected that the coding cost is reduced by bidirectional prediction.

請求項９に記載の符号化モード決定装置では、請求項１〜８のいずれかにおいて、複雑動き推定部は、簡易動き推定部における小ブロックに対する簡易な動き推定に基づいて、一部の符号化モードからさらに少なくとも一部の符号化モードを選択する。複雑動き推定部は、小ブロックに対する簡易な動き推定に基づいて、一部の符号化モードのうちの少なくとも一部の符号化モードを選択する。 The coding mode determining apparatus according to claim 9, wherein the complex motion estimator is configured to perform partial coding based on simple motion estimation for a small block in the simple motion estimator. At least some coding modes are further selected from the modes. The complex motion estimation unit selects at least some of the coding modes from among some of the coding modes based on simple motion estimation for the small block.

この装置では、異なる符号化モードから選択された一部の符号化モードの全部について、複雑な動き推定を行う必要がなく、処理量を削減できる。また、処理量を一定に保つように一部の符号化モードのうちの少なくとも一部の符号化モードを選択することも可能となる。 In this apparatus, it is not necessary to perform complicated motion estimation for all of some of the encoding modes selected from different encoding modes, and the processing amount can be reduced. It is also possible to select at least some of the coding modes among some of the coding modes so as to keep the processing amount constant.

請求項１０に記載の符号化モード決定装置では、請求項９において、複雑動き推定部は、各符号化モードを符号化コストが低い順に選択していき、選択した符号化モードの符号化コストの和が処理余裕量を超える直前に選択を打ち切る。
この装置では、複雑動き推定部は符号化モード選択部によって選択された符号化モードの全てを選択しないこともあり得るが、その場合でも符号化コストが低い符号化モードは選択されているため問題が少ない。 In the coding mode determination device according to claim 10, in claim 9, the complex motion estimation unit selects each coding mode in ascending order of coding cost, and the coding cost of the selected coding mode is determined. The selection is aborted immediately before the sum exceeds the processing allowance.
In this apparatus, the complex motion estimation unit may not select all of the encoding modes selected by the encoding mode selection unit, but even in that case, an encoding mode with a low encoding cost is selected. Less is.

請求項１１に記載の符号化モード決定装置では、請求項１〜１０において、簡易動き推定部あるいは複雑動き推定部は、動き推定処理の処理量がほぼ一定に保たれるように簡易動き推定あるいは複雑動き推定における動き推定方式を変化させる。
例えば、符号化モード決定装置は、画像ブロックにより構成される画像の画像属性に応じて、動き推定方式を変化させる。ここで、画像属性とは、例えば、画像のサイズや、画像の符号化方式（ピクチャタイプ〔Ｉピクチャ、Ｐピクチャ、Ｂピクチャ〕など）や、画像のフォーマット（走査方式〔プログレッシブ、インターレース〕、色差フォーマットなど）や、画像の動き量などである。 In the encoding mode determination apparatus according to claim 11, in any one of claims 1 to 10, the simple motion estimation unit or the complex motion estimation unit may perform simple motion estimation or a motion estimation process so that a processing amount of the motion estimation process is maintained substantially constant. Change the motion estimation method in complex motion estimation.
For example, the encoding mode determination apparatus changes the motion estimation method according to the image attribute of an image configured by image blocks. Here, the image attributes include, for example, the image size, the image encoding method (picture type [I picture, P picture, B picture], etc.), the image format (scanning method [progressive, interlaced], color difference). Format) and the amount of motion of the image.

動き推定方式とは、例えば、参照するピクチャの枚数・方向、動き推定を行うパーティションサイズのバリエーション、動きの探索範囲などである。
この装置では、適切な処理量の処理が実現され、装置の可動率が向上する。さらに付随的な効果として、より適切な動き推定を行うことが可能となる。 The motion estimation method includes, for example, the number and direction of referenced pictures, a variation in partition size for performing motion estimation, a motion search range, and the like.
In this apparatus, processing with an appropriate processing amount is realized, and the mobility of the apparatus is improved. As an additional effect, more appropriate motion estimation can be performed.

請求項１２に記載の符号化モード決定装置では、請求項１〜１１において、簡易な動き推定は、整数画素精度の動き推定であり、複雑な動き推定は、非整数画素精度の動き推定である。
この装置では、簡易動き推定部により得られた符号化コストから符号化モード選択部が符号化モードの絞り込みを行う。さらに、絞り込んだ符号化モードの小ブロックに対して、複雑動き推定部が非整数画素精度の動き推定を行う。ここで、非整数画素精度の動き推定はフィルタを適用する必要があって整数画素精度の動き推定に比して処理量が多いが、この装置では、符号化モードの決定に際して全ての小ブロックについて非整数画素精度の動き推定を行う必要が無い。このため、非整数画素精度の動き推定の回数を削減でき、符号化モード決定の処理量を低減することが可能となる。また、必要な小ブロックには非整数画素精度の動き推定を行うため、適切な符号化効率の符号化モードを決定することが可能となる。 The encoding mode determining apparatus according to claim 12, wherein the simple motion estimation is integer pixel precision motion estimation, and the complex motion estimation is non-integer pixel precision motion estimation. .
In this apparatus, the encoding mode selection unit narrows down encoding modes from the encoding cost obtained by the simple motion estimation unit. Further, the complex motion estimator performs motion estimation with non-integer pixel accuracy for the small blocks in the narrowed encoding mode. Here, the motion estimation with non-integer pixel accuracy needs to apply a filter, and the processing amount is larger than the motion estimation with integer pixel accuracy. However, in this apparatus, when determining the coding mode, There is no need to perform motion estimation with non-integer pixel accuracy. For this reason, the number of times of motion estimation with non-integer pixel precision can be reduced, and the processing amount for determining the coding mode can be reduced. In addition, since motion estimation with non-integer pixel accuracy is performed for the necessary small blocks, it is possible to determine an encoding mode with appropriate encoding efficiency.

請求項１３に記載の集積回路は、請求項１〜１２のいずれかに記載の符号化モード決定装置を含む。
この集積回路では、請求項１〜１２のいずれかに記載の符号化モード決定装置と同様の効果を得ることができる。 An integrated circuit according to a thirteenth aspect includes the coding mode determining apparatus according to any one of the first to twelfth aspects.
In this integrated circuit, it is possible to obtain the same effect as that of the coding mode determining apparatus according to any one of claims 1 to 12.

請求項１４に記載の画像符号化装置は、請求項１〜１２のいずれかに記載の符号化モード決定装置と、符号化装置とを備えている。符号化装置は、符号化モード決定装置が決定する画像ブロックの符号化モードに基づいて、画像ブロックの符号化を行う。
この画像符号化装置では、符号化モードの決定に際して、全てのパーティションについて複雑な動き推定を行う必要が無い。このため、複雑な動き推定の回数を削減でき、符号化モード決定の処理量を低減することが可能となる。また、必要なパーティションには複雑な動き推定を行うため、適切な符号化効率の符号化モードを決定し、符号化を行うことが可能となる。 An image coding apparatus according to a fourteenth aspect includes the coding mode determining apparatus according to any one of the first to twelfth aspects, and a coding apparatus. The encoding device encodes the image block based on the encoding mode of the image block determined by the encoding mode determination device.
In this image encoding apparatus, it is not necessary to perform complicated motion estimation for all partitions when determining the encoding mode. For this reason, the number of times of complicated motion estimation can be reduced, and the processing amount for determining the coding mode can be reduced. In addition, since complicated motion estimation is performed on the necessary partitions, it is possible to determine an encoding mode with appropriate encoding efficiency and perform encoding.

請求項１５に記載の集積回路は、請求項１４に記載の画像符号化装置を含む。
この集積回路では、請求項１４に記載の画像符号化装置と同様の効果を得ることができる。
請求項１６に記載の符号化モード決定装置は、画像ブロックの符号化モードを決定する装置であって、インター予測部と、符号化ピクチャ構造決定部と、イントラ予測部と、符号化予測方式決定部とを備えている。インター予測部は、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックの各ブロックについてインター予測を行って、符号化コストを導出する。符号化ピクチャ構造決定部は、インター予測部による符号化コストに基づいて、画像ブロックの符号化ピクチャ構造を決定する。イントラ予測部は、決定された符号化ピクチャ構造を有する各ブロックについてイントラ予測を行って、符号化コストを導出する。符号化予測方式決定部は、インター予測による符号化コストとイントラ予測による符号化コストを比較して、決定された符号化ピクチャ構造を有する画像ブロックの各ブロックに対する符号化予測方式を決定する。ここで、符号化ピクチャ構造とは、画像ブロックを符号化する際のピクチャ構造であり、フィールド構造又はフレーム構造を意味している。符号化予測方式とは、画像ブロックを符号化する際のインター予測あるいはイントラ予測を意味している。 An integrated circuit according to a fifteenth aspect includes the image encoding device according to the fourteenth aspect.
In this integrated circuit, the same effect as that of the image encoding device according to claim 14 can be obtained.
The coding mode determination device according to claim 16, wherein the coding mode determination device determines a coding mode of an image block, and includes an inter prediction unit, a coded picture structure determination unit, an intra prediction unit, and a coding prediction method determination. Department. The inter prediction unit performs inter prediction on each block of the field structure block and the frame structure block of the image block, and derives an encoding cost. The encoded picture structure determining unit determines the encoded picture structure of the image block based on the encoding cost by the inter prediction unit. The intra prediction unit performs intra prediction on each block having the determined encoded picture structure, and derives an encoding cost. The encoding prediction method determination unit determines the encoding prediction method for each block of the image block having the determined encoded picture structure by comparing the encoding cost based on inter prediction and the encoding cost based on intra prediction. Here, the encoded picture structure is a picture structure when an image block is encoded, and means a field structure or a frame structure. The encoding prediction method means inter prediction or intra prediction when an image block is encoded.

ここで、フィールド構造ブロックとは、例えば、画像ブロックの奇数ラインの集合により構成されるブロックと偶数ラインの集合により構成されるブロックとを含んでいる（以下、同じ）。フレーム構造ブロックとは、例えば、画像ブロックのラインを順次含むブロックにより構成されている（以下、同じ）。 Here, the field structure block includes, for example, a block constituted by a set of odd lines of an image block and a block constituted by a set of even lines (hereinafter the same). The frame structure block is composed of, for example, blocks that sequentially include image block lines (hereinafter the same).

この装置では、イントラ予測部は符号化ピクチャ構造決定部によって決定された符号化ピクチャ構造の各ブロックについてのみイントラ予測を行うため、イントラ予測部はフィールド構造ブロックおよびフレーム構造ブロックの全てについてイントラ予測を行う必要がない。このように処理負荷の高いイントラ予測の回数を減らすことができるため、画像ブロックの符号化予測方式を決定するための処理負荷を削減できる。 In this apparatus, since the intra prediction unit performs intra prediction only for each block of the encoded picture structure determined by the encoded picture structure determination unit, the intra prediction unit performs intra prediction for all of the field structure block and the frame structure block. There is no need to do it. Since the number of intra predictions with a high processing load can be reduced in this way, the processing load for determining the coding prediction method for an image block can be reduced.

請求項１７に記載の符号化モード決定装置では、請求項１６において、インター予測部は、フレーム構造ブロックの各ブロックの符号化コストを合計してフレーム構造ブロックの符号化コストを導出し、フィールド構造ブロックの各ブロックの符号化コストを合計してフィールド構造ブロックの符号化コストを導出する。 The coding mode determination apparatus according to claim 17, wherein the inter prediction unit derives the coding cost of the frame structure block by summing up the coding costs of the blocks of the frame structure block, and calculates the field structure. The coding cost of the field structure block is derived by summing up the coding cost of each block of the block.

この装置では、インター予測部は各ピクチャ構造ごとの各ブロックの符号化コストを導出して合計することで、各ピクチャ構造ごとの符号化コストを導出する。
請求項１８に記載の符号化モード決定装置では、請求項１７において、イントラ予測部は、決定された符号化ピクチャ構造を有する各ブロックについてイントラ予測を行って符号化コストを導出する。符号化予測方式決定部は、決定された符号化ピクチャ構造を有する各ブロックについて、インター予測部で導出された符号化コストとイントラ予測部で導出された符号化コストとを比較し、各ブロックごとに符号化予測方式を決定する。 In this apparatus, the inter prediction unit derives the coding cost for each picture structure by deriving and summing the coding costs of each block for each picture structure.
In an encoding mode determining apparatus according to an eighteenth aspect, in the seventeenth aspect, the intra prediction unit performs intra prediction on each block having the determined encoded picture structure to derive an encoding cost. The coding prediction method determination unit compares the coding cost derived by the inter prediction unit and the coding cost derived by the intra prediction unit for each block having the determined coded picture structure, and The encoding prediction method is determined.

この装置では、イントラ予測部は、決定された符号化ピクチャ構造を有する各ブロックについてイントラ予測を行って符号化コストを導出するため、フィールド構造ブロックおよびフレーム構造ブロックの全てについてイントラ予測を行う必要がない。このように処理負荷の高いイントラ予測の回数を減らすことができるため、画像ブロックの符号化予測方式を決定するための処理負荷を削減でき、さらに符号化装置全体の処理量を低減することが可能となる。 In this apparatus, since the intra prediction unit performs intra prediction for each block having the determined coded picture structure to derive the coding cost, it is necessary to perform intra prediction for all of the field structure block and the frame structure block. Absent. Since the number of intra predictions with a high processing load can be reduced in this way, the processing load for determining the coding prediction method of the image block can be reduced, and further the processing amount of the entire encoding device can be reduced. It becomes.

請求項１９に記載の符号化モード決定装置では、請求項１６〜１８のいずれかにおいて、画像ブロックは、２つの正方ブロックから構成されているブロックペアである。
この装置では、ブロックペアが２つの正方ブロックから構成されており、フィールド構造ブロックとフレーム構造ブロックとのそれぞれを正方のブロックとして処理することが可能となる。 In the encoding mode determining apparatus according to a nineteenth aspect, in any one of the sixteenth to eighteenth aspects, the image block is a block pair composed of two square blocks.
In this apparatus, the block pair is composed of two square blocks, and each of the field structure block and the frame structure block can be processed as a square block.

請求項２０に記載の集積回路は、請求項１６〜１９のいずれかに記載の符号化モード決定装置を含む。
この集積回路では、請求項１６〜１９のいずれかに記載の符号化モード決定装置と同様の効果を得ることができる。 An integrated circuit according to a twentieth aspect includes the coding mode determining apparatus according to any one of the twelfth to nineteenth aspects.
In this integrated circuit, it is possible to obtain the same effect as that of the coding mode determining apparatus according to any one of claims 16 to 19.

請求項２１に記載の画像符号化装置は、請求項１６〜１９のいずれかに記載の符号化モード決定装置と、符号化モード決定装置が決定する画像ブロックの符号化モードに基づいて、画像ブロックの符号化を行う符号化装置とを備える。
この装置では、イントラ予測部はピクチャ構造決定部によって決定されたピクチャ構造のブロックについてのみイントラ予測を行うため、イントラ予測部はフィールド構造ブロックおよびフレーム構造ブロックの全てについてイントラ予測を行う必要がない。このように処理負荷の高いイントラ予測の回数を減らすことができるため、画像ブロックの符号化予測方式を決定するための処理負荷を削減できる。 An image encoding device according to claim 21 is based on the encoding mode determining device according to any of claims 16 to 19 and an image block encoding mode determined by the encoding mode determining device. And a coding device that performs coding of the above.
In this apparatus, since the intra prediction unit performs intra prediction only for the block having the picture structure determined by the picture structure determining unit, the intra prediction unit does not need to perform intra prediction for all of the field structure block and the frame structure block. Since the number of intra predictions with a high processing load can be reduced in this way, the processing load for determining the coding prediction method for an image block can be reduced.

請求項２２に記載の集積回路は、請求項２１に記載の画像符号化装置を含む。
この集積回路では、請求項２１に記載の画像符号化装置と同様の効果を得ることができる。
請求項２３に記載の符号化モード決定装置は、画像ブロックの符号化モードを決定する装置であって、簡易動き推定部と、符号化ピクチャ構造決定部とを備えている。簡易動き推定部は、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックのそれぞれのブロックに対して簡易な動き推定によって符号化コストを導出する。符号化ピクチャ構造決定部は、簡易動き推定部による符号化コストに基づいて、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックの符号化コストを比較し、符号化ピクチャ構造を決定する。 An integrated circuit according to a twenty-second aspect includes the image encoding device according to the twenty-first aspect.
In this integrated circuit, the same effect as that of the image encoding device according to claim 21 can be obtained.
A coding mode determination apparatus according to a twenty-third aspect is an apparatus for determining a coding mode of an image block, and includes a simple motion estimation unit and a coded picture structure determination unit. The simple motion estimation unit derives an encoding cost by simple motion estimation for each of the field structure block and the frame structure block of the image block. The encoded picture structure determining unit determines the encoded picture structure by comparing the encoding costs of the field structure block and the frame structure block of the image block based on the encoding cost of the simple motion estimation unit.

この装置では、簡易な動き推定に基づいて、画像ブロックの符号化モード（具体的には、符号化ピクチャ構造）を決定する。このため、符号化モードを決定するための処理量を軽減することが可能となる。
請求項２４に記載の符号化モード決定装置では、請求項２３において、簡易動き推定部は、各ブロックに対して簡易なインター予測と簡易なイントラ予測を行い、次に簡易なインター予測の符号化コストと簡易なイントラ予測の符号化コストを比較し各ブロックごとに簡易なインター予測と簡易なイントラ予測のいずれかを選択し、さらに各ピクチャ構造ごとのブロックの符号化コストを合計してフレーム構造ブロック及びフィールド構造ブロックの符号化コストを導出する。 In this apparatus, an encoding mode (specifically, an encoded picture structure) of an image block is determined based on simple motion estimation. For this reason, it is possible to reduce the processing amount for determining the encoding mode.
In the coding mode determination apparatus according to Claim 24, in Claim 23, the simple motion estimation unit performs simple inter prediction and simple intra prediction on each block, and then performs simple inter prediction encoding. Compare the cost and the coding cost of simple intra prediction, select either simple inter prediction or simple intra prediction for each block, and then add the block coding cost for each picture structure to the frame structure Deriving coding costs for blocks and field structure blocks.

この装置では、簡易動き推定部がインター予測とイントラ予測を用いてフレーム構造ブロック及びフィールド構造ブロックの符号化コストを導出するため、インター予測又はイントラ予測のいずれかで圧縮率が向上する画像ブロックの場合でも圧縮率が最良となるような符号化ピクチャ構造を決定できる。 In this apparatus, since the simple motion estimation unit derives the coding cost of the frame structure block and the field structure block using inter prediction and intra prediction, an image block whose compression rate is improved by either inter prediction or intra prediction is determined. Even in this case, it is possible to determine a coded picture structure that provides the best compression rate.

請求項２５に記載の符号化モード決定装置では、請求項２４において、簡易なインター予測は、整数画素精度のインター予測である。
この装置では、簡易動き推定部では、整数画素精度のインター予測と簡易なイントラ予測とを行うことできる。 In the coding mode determining apparatus according to a twenty-fifth aspect, in the twenty-fourth aspect, the simple inter prediction is inter prediction with integer pixel accuracy.
In this apparatus, the simple motion estimation unit can perform inter prediction with integer pixel accuracy and simple intra prediction.

請求項２６に記載の符号化モード決定装置は、請求項２３〜２５のいずれかにおいて、画像ブロックは、２つの正方ブロックから構成されているブロックペアである。
この装置では、ブロックペアが２つの正方ブロックから構成されており、フィールド構造ブロックとフレーム構造ブロックとのそれぞれを正方のブロックとして処理することが可能となる。 According to a twenty-sixth aspect of the present invention, in any one of the twenty-third to twenty-fifth aspects, the image block is a block pair composed of two square blocks.
In this apparatus, the block pair is composed of two square blocks, and each of the field structure block and the frame structure block can be processed as a square block.

請求項２７に記載の集積回路は、請求項２３〜２６のいずれかに記載の符号化モード決定装置を含む。
この集積回路により、請求項２３〜２６のいずれかに記載の符号化モード決定装置と同様の効果を得ることができる。 An integrated circuit according to a twenty-seventh aspect includes the coding mode determining device according to any one of the twenty-third to twenty-sixth aspects.
With this integrated circuit, an effect similar to that of the coding mode determining apparatus according to any one of claims 23 to 26 can be obtained.

請求項２８に記載の画像符号化装置は、請求項２３〜２６のいずれかに記載の符号化モード決定装置と、符号化モード決定装置によって決定される符号化ピクチャ構造の画像ブロックに対して複雑な動き推定を行う複雑動き推定部と、複雑動き推定部による予測結果に基づいて、画像ブロックの符号化を行う符号化部とを備える。 An image encoding device according to a twenty-eighth aspect is complex with respect to the coding mode determining device according to any one of the twenty-third to twenty-sixth aspects and an image block having a coded picture structure determined by the coding mode determining device. A complex motion estimation unit that performs accurate motion estimation and an encoding unit that encodes an image block based on a prediction result by the complex motion estimation unit.

この装置では、複雑な動き推定によって画像ブロックの符号化を行うため、圧縮効率が向上する。しかも、ここでは、符号化モード決定装置によって決定された符号化ピクチャ構造の画像ブロックに対してのみ複雑な動き推定を行うため、従来より複雑な動き推定の回数を減らすことができる。 In this apparatus, since the image block is encoded by complicated motion estimation, the compression efficiency is improved. In addition, here, since the complicated motion estimation is performed only on the image block having the coded picture structure determined by the coding mode determining apparatus, the number of times of motion estimation more complicated than before can be reduced.

請求項２９に記載の画像符号化装置では、請求項２８において、複雑予測部は、決定された符号化ピクチャ構造を有する各ブロックに対して、複雑なインター予測又は複雑なイントラ予測を行う。
この装置では、インター予測又はイントラ予測のいずれかで圧縮率が向上する画像ブロックに対しても、圧縮効率を向上させることができる。 In an image coding apparatus according to a twenty-ninth aspect, in the twenty-eighth aspect, the complex prediction unit performs complex inter prediction or complex intra prediction on each block having the determined coded picture structure.
In this apparatus, the compression efficiency can be improved even for an image block whose compression rate is improved by either inter prediction or intra prediction.

請求項３０に記載の画像符号化装置では、請求項２９において、複雑なインター予測は、非整数画素精度のインター予測である。
この装置では、非整数画素精度のインター予測を用いて複雑なインター予測を行うことが可能となる。 In an image encoding device according to a thirty-third aspect, in the twenty-ninth aspect, the complicated inter prediction is inter prediction with non-integer pixel accuracy.
In this apparatus, it is possible to perform complex inter prediction using inter prediction with non-integer pixel accuracy.

請求項３１に記載の集積回路は、請求項２８〜３０のいずれかに記載の画像符号化装置を含む。
この集積回路により、請求項２８〜３０のいずれかに記載の画像符号化装置と同様の効果を得ることができる。 An integrated circuit according to a thirty-first aspect includes the image encoding device according to any one of the twenty-eighth to thirty-third aspects.
With this integrated circuit, it is possible to obtain the same effect as that of the image encoding device according to any one of claims 28 to 30.

請求項３２に記載の符号化モード決定方法は、画像ブロックの符号化モードを複数候補の中から少なくとも１つに決定する符号化モード決定方法であって、簡易動き推定ステップと、符号化モード選択ステップと、複雑動き推定ステップと、符号化モード決定ステップとを備えている。簡易動き推定ステップは、各符号化モードによってそれぞれ得られる画像ブロックのパーティションである小ブロックに対する簡易な動き推定に基づいて、各符号化モードの符号化コストを導出する。符号化モード選択ステップは、簡易動き推定ステップによって導出された符号化コストに基づいて、複数の符号化モードから一部の符号化モードを選択する。複雑動き推定ステップは、一部の符号化モードの少なくとも一部の符号化モードによって得られる小ブロックに対する複雑な動き推定に基づいて、各符号化モードの符号化コストを導出する。符号化モード決定ステップは、複雑動き推定ステップによって導出された符号化コストに基づいて、画像ブロックの符号化モードを決定する。 The encoding mode determination method according to claim 32, wherein the encoding mode determination method determines an encoding mode of an image block from at least one of a plurality of candidates, and includes a simple motion estimation step and an encoding mode selection. A step, a complex motion estimation step, and a coding mode determination step. The simple motion estimation step derives the coding cost of each coding mode based on simple motion estimation for a small block which is a partition of an image block obtained by each coding mode. The encoding mode selection step selects some encoding modes from the plurality of encoding modes based on the encoding cost derived by the simple motion estimation step. The complex motion estimation step derives the coding cost of each coding mode based on the complex motion estimation for the small block obtained by at least some of the coding modes. The encoding mode determination step determines the encoding mode of the image block based on the encoding cost derived by the complex motion estimation step.

この方法では、簡易動き推定ステップにより得られた符号化コストから符号化モード選択ステップが符号化モードの絞り込みを行う。さらに、絞り込んだ符号化モードの小ブロックに対して、複雑動き推定ステップが複雑な動き推定を行う。ここで、複雑な動き推定は、例えば、フィルタを適用する必要があるなどの理由により、簡易な動き推定に比して処理量が多いが、この方法では、符号化モードの決定に際して全ての小ブロックについて複雑な動き推定を行う必要が無い。このため、複雑な動き推定の回数を削減でき、符号化モード決定の処理量を低減することが可能となる。また、必要な小ブロックには複雑な動き推定を行うため、適切な符号化効率の符号化モードを決定することが可能となる。 In this method, the encoding mode selection step narrows down the encoding mode from the encoding cost obtained by the simple motion estimation step. Further, the complicated motion estimation step performs a complicated motion estimation on the narrow blocks of the narrowed coding mode. Here, the complicated motion estimation has a larger processing amount than the simple motion estimation because, for example, it is necessary to apply a filter. However, in this method, all of the small motion estimations are performed when determining the coding mode. There is no need to perform complex motion estimation for blocks. For this reason, the number of times of complicated motion estimation can be reduced, and the processing amount for determining the coding mode can be reduced. In addition, since a complicated motion estimation is performed for a necessary small block, it is possible to determine an encoding mode with appropriate encoding efficiency.

請求項３３に記載の符号化モード決定方法は、画像ブロックの符号化モードを決定する符号化モード決定方法であって、インター予測ステップと、符号化ピクチャ構造決定ステップと、イントラ予測ステップと、符号化予測方式決定ステップとを備えている。インター予測ステップは、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックの各ブロックについてインター予測を行って、符号化コストを導出する。符号化ピクチャ構造決定ステップは、インター予測ステップによる符号化コストに基づいて、画像ブロックの符号化ピクチャ構造を決定する。イントラ予測ステップは、決定された符号化ピクチャ構造を有する各ブロックについてイントラ予測を行って、符号化コストを導出する。符号化予測方式決定ステップは、インター予測による符号化コストとイントラ予測による符号化コストを比較して、決定された符号化ピクチャ構造を有する画像ブロックの各ブロックに対する符号化予測方式を決定する。 The encoding mode determination method according to claim 33, wherein the encoding mode determination method determines an encoding mode of an image block, and includes an inter prediction step, an encoded picture structure determination step, an intra prediction step, A prediction method determining step. In the inter prediction step, inter prediction is performed on each of the field structure block and the frame structure block of the image block to derive the coding cost. The encoded picture structure determination step determines the encoded picture structure of the image block based on the encoding cost of the inter prediction step. In the intra prediction step, intra prediction is performed on each block having the determined encoded picture structure to derive an encoding cost. The encoding prediction scheme determination step determines the encoding prediction scheme for each block of the image block having the determined encoded picture structure by comparing the encoding cost by inter prediction and the encoding cost by intra prediction.

この方法では、イントラ予測ステップは符号化ピクチャ構造決定ステップによって決定された符号化ピクチャ構造の各ブロックについてのみイントラ予測を行うため、イントラ予測ステップはフィールド構造ブロックおよびフレーム構造ブロックの全てについてイントラ予測を行う必要がない。このように処理負荷の高いイントラ予測の回数を減らすことができるため、画像ブロックの符号化予測方式を決定するための処理負荷を削減できる。 In this method, since the intra prediction step performs intra prediction only for each block of the coded picture structure determined by the coded picture structure determining step, the intra prediction step performs intra prediction for all of the field structure block and the frame structure block. There is no need to do it. Since the number of intra predictions with a high processing load can be reduced in this way, the processing load for determining the coding prediction method for an image block can be reduced.

請求項３４に記載の符号化モード決定方法は、画像ブロックの符号化モードを決定する符号化モード決定方法であって、簡易動き推定ステップと、符号化ピクチャ構造決定ステップとを備えている。簡易動き推定ステップは、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックのそれぞれのブロックに対して簡易な動き推定によって符号化コストを導出する。符号化ピクチャ構造決定ステップは、簡易動き推定ステップによる符号化コストに基づいて、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックの符号化コストを比較し、符号化ピクチャ構造を決定する。 An encoding mode determination method according to a thirty-fourth aspect is an encoding mode determination method for determining an encoding mode of an image block, and includes a simple motion estimation step and an encoded picture structure determination step. The simple motion estimation step derives an encoding cost by simple motion estimation for each of the field structure block and the frame structure block of the image block. The coded picture structure determination step compares the coding costs of the field structure block and the frame structure block of the image block based on the coding cost of the simple motion estimation step, and determines the coded picture structure.

この方法では、簡易な動き推定に基づいて、画像ブロックの符号化モード（具体的には、符号化ピクチャ構造）を決定する。このため、符号化モードを決定するための処理量を軽減することが可能となる。
請求項３５に記載の符号化モード決定プログラムは、コンピュータに以下の方法を行わさせる。符号化モード決定方法は、画像ブロックの符号化モードを複数候補の中から少なくとも１つに決定する符号化モード決定方法であって、簡易動き推定ステップと、符号化モード選択ステップと、複雑動き推定ステップと、符号化モード決定ステップとを備えている。簡易動き推定ステップは、各符号化モードによってそれぞれ得られる画像ブロックのパーティションである小ブロックに対する簡易な動き推定に基づいて、各符号化モードの符号化コストを導出する。符号化モード選択ステップは、簡易動き推定ステップによって導出された符号化コストに基づいて、複数の符号化モードから一部の符号化モードを選択する。複雑動き推定ステップは、一部の符号化モードの少なくとも一部の符号化モードによって得られる小ブロックに対する複雑な動き推定に基づいて、各符号化モードの符号化コストを導出する。符号化モード決定ステップは、複雑動き推定ステップによって導出された符号化コストに基づいて、画像ブロックの符号化モードを決定する。 In this method, a coding mode (specifically, a coded picture structure) of an image block is determined based on simple motion estimation. For this reason, it is possible to reduce the processing amount for determining the encoding mode.
An encoding mode determination program according to a thirty-fifth aspect causes a computer to perform the following method. The encoding mode determination method is an encoding mode determination method that determines at least one encoding mode of an image block from a plurality of candidates, and includes a simple motion estimation step, an encoding mode selection step, and a complex motion estimation And a coding mode determination step. The simple motion estimation step derives the coding cost of each coding mode based on simple motion estimation for a small block which is a partition of an image block obtained by each coding mode. The encoding mode selection step selects some encoding modes from the plurality of encoding modes based on the encoding cost derived by the simple motion estimation step. The complex motion estimation step derives the coding cost of each coding mode based on the complex motion estimation for the small block obtained by at least some of the coding modes. The encoding mode determination step determines the encoding mode of the image block based on the encoding cost derived by the complex motion estimation step.

このプログラムでは、簡易動き推定ステップにより得られた符号化コストから符号化モード選択ステップが符号化モードの絞り込みを行う。さらに、絞り込んだ符号化モードの小ブロックに対して、複雑動き推定ステップが複雑な動き推定を行う。ここで、複雑な動き推定は、例えば、フィルタを適用する必要があるなどの理由により、簡易な動き推定に比して処理量が多いが、このプログラムでは、符号化モードの決定に際して全ての小ブロックについて複雑な動き推定を行う必要が無い。このため、複雑な動き推定の回数を削減でき、符号化モード決定の処理量を低減することが可能となる。また、必要な小ブロックには複雑な動き推定を行うため、適切な符号化効率の符号化モードを決定することが可能となる。 In this program, the encoding mode selection step narrows down the encoding mode from the encoding cost obtained by the simple motion estimation step. Further, the complicated motion estimation step performs a complicated motion estimation on the narrow blocks of the narrowed coding mode. Here, the complicated motion estimation has a larger processing amount than the simple motion estimation because, for example, it is necessary to apply a filter. However, in this program, all small motion estimations are performed when determining the encoding mode. There is no need to perform complex motion estimation for blocks. For this reason, the number of times of complicated motion estimation can be reduced, and the processing amount for determining the coding mode can be reduced. In addition, since a complicated motion estimation is performed for a necessary small block, it is possible to determine an encoding mode with appropriate encoding efficiency.

請求項３６に記載の符号化モード決定プログラムは、コンピュータに以下の方法を行わさせる。符号化モード決定方法は、画像ブロックの符号化モードを決定する符号化モード決定方法であって、インター予測ステップと、符号化ピクチャ構造決定ステップと、イントラ予測ステップと、符号化予測方式決定ステップとを備えている。インター予測ステップは、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックの各ブロックについてインター予測を行って、符号化コストを導出する。符号化ピクチャ構造決定ステップは、インター予測ステップによる符号化コストに基づいて、画像ブロックの符号化ピクチャ構造を決定する。イントラ予測ステップは、決定された符号化ピクチャ構造を有する各ブロックについてイントラ予測を行って、符号化コストを導出する。符号化予測方式決定ステップは、インター予測による符号化コストとイントラ予測による符号化コストを比較して、決定された符号化ピクチャ構造を有する画像ブロックの各ブロックに対する符号化予測方式を決定する。 A coding mode determination program according to a thirty-sixth aspect causes a computer to perform the following method. The encoding mode determination method is an encoding mode determination method for determining an encoding mode of an image block, and includes an inter prediction step, an encoded picture structure determination step, an intra prediction step, an encoding prediction method determination step, It has. In the inter prediction step, inter prediction is performed on each of the field structure block and the frame structure block of the image block to derive the coding cost. The encoded picture structure determination step determines the encoded picture structure of the image block based on the encoding cost of the inter prediction step. In the intra prediction step, intra prediction is performed on each block having the determined encoded picture structure to derive an encoding cost. The encoding prediction scheme determination step determines the encoding prediction scheme for each block of the image block having the determined encoded picture structure by comparing the encoding cost by inter prediction and the encoding cost by intra prediction.

このプログラムでは、イントラ予測ステップは符号化ピクチャ構造決定ステップによって決定された符号化ピクチャ構造の各ブロックについてのみイントラ予測を行うため、イントラ予測ステップはフィールド構造ブロックおよびフレーム構造ブロックの全てについてイントラ予測を行う必要がない。このように処理負荷の高いイントラ予測の回数を減らすことができるため、画像ブロックの符号化予測方式を決定するための処理負荷を削減できる。 In this program, since the intra prediction step performs intra prediction only for each block of the coded picture structure determined by the coded picture structure determining step, the intra prediction step performs intra prediction for all of the field structure block and the frame structure block. There is no need to do it. Since the number of intra predictions with a high processing load can be reduced in this way, the processing load for determining the coding prediction method for an image block can be reduced.

請求項３７に記載の符号化モード決定プログラムは、コンピュータに以下の方法を行わさせる。符号化モード決定方法は、画像ブロックの符号化モードを決定する符号化モード決定方法であって、簡易動き推定ステップと、符号化ピクチャ構造決定ステップとを備えている。簡易動き推定ステップは、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックのそれぞれのブロックに対して簡易な動き推定によって符号化コストを導出する。符号化ピクチャ構造決定ステップは、簡易動き推定ステップによる符号化コストに基づいて、画像ブロックのフィールド構造ブロックおよびフレーム構造ブロックの符号化コストを比較し、符号化ピクチャ構造を決定する。 An encoding mode determination program according to a thirty-seventh aspect causes a computer to perform the following method. The encoding mode determination method is an encoding mode determination method for determining the encoding mode of an image block, and includes a simple motion estimation step and an encoded picture structure determination step. The simple motion estimation step derives an encoding cost by simple motion estimation for each of the field structure block and the frame structure block of the image block. The coded picture structure determination step compares the coding costs of the field structure block and the frame structure block of the image block based on the coding cost of the simple motion estimation step, and determines the coded picture structure.

このプログラムでは、簡易な動き推定に基づいて、画像ブロックの符号化モード（具体的には、符号化ピクチャ構造）を決定する。このため、符号化モードを決定するための処理量を軽減することが可能となる。 In this program, an encoding mode (specifically, an encoded picture structure) of an image block is determined based on simple motion estimation. For this reason, it is possible to reduce the processing amount for determining the encoding mode.

本発明では、より少ない処理量で適切な符号化モードの選択を可能とさせる符号化モード決定装置、画像符号化装置、符号化モード決定方法、および符号化モード決定プログラムを提供することができる。 According to the present invention, it is possible to provide an encoding mode determination device, an image encoding device, an encoding mode determination method, and an encoding mode determination program that enable selection of an appropriate encoding mode with a smaller processing amount.

［第１実施形態］
図１〜図１５を用いて、本発明の第１実施形態としてのエンコーダについて説明する。
図１は、本発明の第１実施形態としてのエンコーダ１の構造を説明するブロック図である。エンコーダ１は、例えば、入力画像信号３０をＭＰＥＧ−４符号化し、符号化画像信号３１として出力する画像符号化装置であり、パーソナルコンピュータ（ＰＣ）、携帯電話などにおいて備えられる。 [First Embodiment]
An encoder as a first embodiment of the present invention will be described with reference to FIGS.
FIG. 1 is a block diagram illustrating the structure of an encoder 1 as a first embodiment of the present invention. The encoder 1 is, for example, an image encoding device that MPEG-4 encodes an input image signal 30 and outputs the encoded image signal 31 as an encoded image signal 31, and is provided in a personal computer (PC), a mobile phone, or the like.

〈エンコーダ１の構成〉
図１に示すエンコーダ１は、入力画像信号３０のイントラ予測を行うイントラ予測部２と、入力画像信号３０のインター予測を行うインター予測部３と、イントラ予測およびインター予測の予測結果を切り換える切換部４と、切換部４の出力を符号化して符号化画像信号３１を出力する符号化部５と、入力画像信号３０のローカルデコード信号３２を作成する参照画像作成部６とを備えている。 <Configuration of encoder 1>
An encoder 1 shown in FIG. 1 includes an intra prediction unit 2 that performs intra prediction of an input image signal 30, an inter prediction unit 3 that performs inter prediction of the input image signal 30, and a switching unit that switches between prediction results of intra prediction and inter prediction. 4, an encoding unit 5 that encodes the output of the switching unit 4 and outputs an encoded image signal 31, and a reference image generation unit 6 that generates a local decode signal 32 of the input image signal 30.

イントラ予測部２は、入力画像信号３０を画像ブロック毎にイントラ予測し、イントラ予測画像との差分信号を切換部４に出力する。
インター予測部３は、入力画像信号３０を第１の入力とし、ローカルデコード信号３２を第２の入力として、インター予測結果を切換部４に出力する。さらに、インター予測部３は、インター予測結果のうち、動きベクトルなど符号化にかかる情報を第２の出力として符号化部５に出力する。 The intra prediction unit 2 performs intra prediction on the input image signal 30 for each image block, and outputs a difference signal from the intra predicted image to the switching unit 4.
The inter prediction unit 3 outputs the inter prediction result to the switching unit 4 with the input image signal 30 as the first input and the local decode signal 32 as the second input. Furthermore, the inter prediction unit 3 outputs information related to encoding, such as a motion vector, among the inter prediction results to the encoding unit 5 as a second output.

インター予測部３は、入力画像信号３０を第１の入力、ローカルデコード信号３２を第２の入力とし、動き推定を行う動き推定部１０と、動き推定部１０の出力を第１の入力、ローカルデコード信号３２を第２の入力とし、予測画像を出力する予測画像作成部１１と、入力画像信号３０を第１の入力、予測画像作成部１１の出力を第２の入力とする減算部１２とから構成されている。また、動き推定部１０の出力のうち、動きベクトルや符号化モードなどの符号化情報は、後述する可変長符号化部２２の入力にも与えられる。 The inter prediction unit 3 uses the input image signal 30 as a first input and the local decode signal 32 as a second input, performs motion estimation, and outputs the motion estimation unit 10 as a first input, A predicted image generating unit 11 that outputs the predicted image by using the decoded signal 32 as a second input; and a subtracting unit 12 that uses the input image signal 30 as a first input and the output of the predicted image generating unit 11 as a second input; It is composed of Of the output of the motion estimation unit 10, encoding information such as a motion vector and an encoding mode is also given to an input of a variable length encoding unit 22 described later.

動き推定部１０は、主に、フルペル予測部１３と、分割方法候補選択部１４と、サブペル予測部１５と、分割方法決定部１６とを備えている（動作については、後述）。
切換部４は、イントラ予測結果を第１の入力、インター予測結果を第２の入力とし、いずれかの入力を符号化部５に出力する。 The motion estimation unit 10 mainly includes a full-pel prediction unit 13, a division method candidate selection unit 14, a sub-pel prediction unit 15, and a division method determination unit 16 (the operation will be described later).
The switching unit 4 uses the intra prediction result as the first input and the inter prediction result as the second input, and outputs one of the inputs to the encoding unit 5.

符号化部５は、切換部４の出力を第１の入力とし、ＤＣＴ（ＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ）部２０、量子化部２１、可変長符号化部２２を通して符号化画像信号３１を出力する。
参照画像作成部６では、逆量子化部２３には量子化部２１の出力が入力され、逆量子化部２３の出力は、逆ＤＣＴ部２４を通して、加算部２５の第１の入力に与えられる。加算部２５は、予測画像作成部１１の出力を第２の入力とし、加算結果をメモリ２６に出力する。メモリ２６は、予測画像作成部１１の第２の入力と動き推定部１０の第２の入力にローカルデコード信号３２を出力する。 The encoding unit 5 uses the output of the switching unit 4 as a first input, and outputs an encoded image signal 31 through a DCT (Discrete Cosine Transform) unit 20, a quantization unit 21, and a variable length encoding unit 22.
In the reference image creation unit 6, the output of the quantization unit 21 is input to the inverse quantization unit 23, and the output of the inverse quantization unit 23 is given to the first input of the addition unit 25 through the inverse DCT unit 24. . The adding unit 25 uses the output of the predicted image creating unit 11 as a second input, and outputs the addition result to the memory 26. The memory 26 outputs the local decode signal 32 to the second input of the predicted image creation unit 11 and the second input of the motion estimation unit 10.

〈エンコーダ１の動作〉
次に、エンコーダ１の動作について説明する。まず、入力画像信号３０は、符号化処理の基本単位である画像ブロックを単位として入力されている。
イントラ符号化される画像ブロックは、イントラ予測部２において、同一ピクチャ内の他の画像ブロックの画素係数を用いてイントラ予測される。イントラ予測された画像ブロックは、ＤＣＴ部２０において離散コサイン変換（ＤＣＴ）を施され、量子化部２１において量子化され、可変長符号化部２２において可変長符号化される。 <Operation of encoder 1>
Next, the operation of the encoder 1 will be described. First, the input image signal 30 is input in units of image blocks that are basic units of encoding processing.
The image block to be intra-coded is intra-predicted by the intra prediction unit 2 using the pixel coefficients of other image blocks in the same picture. The intra-predicted image block is subjected to discrete cosine transform (DCT) in the DCT unit 20, quantized in the quantization unit 21, and variable-length coded in the variable-length coding unit 22.

一方、量子化部２１において量子化されたＤＣＴ係数は、逆量子化部２３において逆量子化され、逆ＤＣＴ部２４において逆ＤＣＴされ、ローカルデコードされ、ローカルデコード信号３２としてメモリ２６に記憶される。このメモリ２６に記憶されたローカルデコード信号３２は、インター予測部３において画像ブロックがインター符号化される際に使用される。 On the other hand, the DCT coefficient quantized by the quantizing unit 21 is inversely quantized by the inverse quantizing unit 23, is inversely DCTed by the inverse DCT unit 24, is locally decoded, and is stored in the memory 26 as the local decoded signal 32. . The local decoded signal 32 stored in the memory 26 is used when the image block is inter-coded in the inter prediction unit 3.

インター符号化される画像ブロックは、動き推定部１０において、動き推定される。ここで、動き推定部１０の詳しい動作については後述する。
予測画像作成部１１は、動き推定部１０の動き推定の結果と、メモリ２６に記憶されたローカルデコード信号３２とに基づいて、予測画像を作成する。減算部１２は、画像ブロックと作成された予測画像との差分から差分画像ブロックを求める。差分画像ブロックは、ＤＣＴ部２０において離散コサイン変換を施され、量子化部２１において量子化される。離散コサイン変換および量子化された差分画像ブロックは、動き推定の結果などとともに可変長符号化部２２において可変長符号化される。 The motion estimation unit 10 performs motion estimation on the image block to be inter-coded. Here, the detailed operation of the motion estimation unit 10 will be described later.
The predicted image creation unit 11 creates a predicted image based on the motion estimation result of the motion estimation unit 10 and the local decode signal 32 stored in the memory 26. The subtraction unit 12 obtains a difference image block from the difference between the image block and the created predicted image. The difference image block is subjected to discrete cosine transform in the DCT unit 20 and quantized in the quantization unit 21. The differential image block subjected to discrete cosine transform and quantization is subjected to variable length coding in the variable length coding unit 22 together with the result of motion estimation.

〈動き推定部１０の動作〉
動き推定部１０は、画像ブロックの符号化コストを最小とする画像ブロックの符号化モード（画像ブロックの分割方法、予測方向など）を決定するとともに、動きベクトルの導出を行う。 <Operation of Motion Estimation Unit 10>
The motion estimation unit 10 determines an image block coding mode (image block division method, prediction direction, etc.) that minimizes the coding cost of the image block, and derives a motion vector.

図２を用いて、動き推定部１０の特徴について説明する。動き推定部１０は、画像ブロックの全てのパーティションサイズおよび動き推定の際の全ての参照ピクチャに対して、整数画素精度の動き推定を行う（ステップＳ７０１〜Ｓ７０３）。さらに、整数精度の動き推定の結果に基づいて、符号化コストを小さくするパーティションサイズと参照ピクチャとの候補を選択し、選択された候補に対して非整数画素精度の動き推定を行う（ステップＳ７０４）。 The features of the motion estimation unit 10 will be described with reference to FIG. The motion estimator 10 performs motion estimation with integer pixel accuracy for all the partition sizes of the image block and all the reference pictures at the time of motion estimation (steps S701 to S703). Further, based on the result of motion estimation with integer precision, a candidate for a partition size and a reference picture for reducing the coding cost is selected, and motion estimation with non-integer pixel precision is performed on the selected candidate (step S704). ).

これにより、全てのパーティションサイズおよび全ての参照ピクチャに対して非整数画素精度の動き推定を行う必要なく、符号化の際のパーティションサイズおよび参照ピクチャを決定するための処理量を低減することが可能となる。また、選択された候補に対しては非整数画素精度の動き推定を行うため、適切な符号化効率を実現することが可能となる。 As a result, it is not necessary to perform motion estimation with non-integer pixel accuracy for all partition sizes and all reference pictures, and it is possible to reduce the processing amount for determining the partition size and reference picture at the time of encoding. It becomes. In addition, since the motion estimation with non-integer pixel accuracy is performed on the selected candidate, it is possible to realize appropriate encoding efficiency.

図３を用いて、動き推定部１０の動作についてさらに説明を加える。
図３は、画像ブロックについての符号化モード決定の処理フローを示すブロック図である。図３の画像ブロックについての符号化モード決定の処理フローは、フルペル予測部１３により実行されるフルペル予測ステップＳ４１と、分割方法候補選択部１４により実行される分割方法候補選択ステップＳ４２と、サブペル予測部１５により実行されるサブペル予測ステップＳ４３と、分割方法決定部１６により実行される分割方法決定ステップＳ４４とから構成されている。 The operation of the motion estimation unit 10 will be further described with reference to FIG.
FIG. 3 is a block diagram illustrating a processing flow for determining an encoding mode for an image block. The processing flow for determining the coding mode for the image block in FIG. 3 includes a full-pel prediction step S41 executed by the full-pel prediction unit 13, a division method candidate selection step S42 executed by the division method candidate selection unit 14, and a sub-pel prediction. It comprises a sub-pel prediction step S43 executed by the unit 15 and a division method determination step S44 executed by the division method determination unit 16.

フルペル予測ステップＳ４１は、小ブロックフルペル予測ステップＳ４５と、予測方向選択ステップＳ４６と、符号化コスト導出ステップＳ４７とを備えている。
小ブロックフルペル予測ステップＳ４５は、１６×１６の画像ブロックを４種類の分割方法候補により分割したＭ×Ｎ（（Ｍ，Ｎ）＝（１６，１６），（１６，８），（８，１６），（８，８））の小ブロックＳｂ１〜Ｓｂ９（図２５参照）のそれぞれに対して、整数画素精度の動き推定を行い、小ブロック毎の符号化コストおよび動きベクトルを導出する。具体的には、それぞれの小ブロックＳｂ１〜Ｓｂ９に対して、前方向予測ステップＳ４５１〜Ｓ４５４および後方向予測ステップＳ４５５〜Ｓ４５８が行われている。すなわち、前方向予測ステップＳ４５１〜Ｓ４５４および後方向予測ステップＳ４５５〜Ｓ４５８では、それぞれの分割方法候補により分割された小ブロックの個数に応じた回数の処理が行われている。図３では、この回数を処理ブロックからの矢印の本数で表している。 The full pel prediction step S41 includes a small block full pel prediction step S45, a prediction direction selection step S46, and an encoding cost derivation step S47.
In the small block full-pel prediction step S45, M × N ((M, N) = (16,16), (16,8), (8, 16), (8, 8)) for each of the small blocks Sb1 to Sb9 (see FIG. 25), motion estimation with integer pixel accuracy is performed to derive the coding cost and motion vector for each small block. Specifically, forward prediction steps S451 to S454 and backward prediction steps S455 to S458 are performed for each of the small blocks Sb1 to Sb9. That is, in the forward prediction steps S451 to S454 and the backward prediction steps S455 to S458, the number of processes corresponding to the number of small blocks divided by the respective division method candidates is performed. In FIG. 3, this number is represented by the number of arrows from the processing block.

予測方向選択ステップＳ４６は、フルペル予測ステップＳ４５によって導出された符号化コストに基づいて、複数の符号化モードから一部の符号化モードを選択する。予測方向選択ステップＳ４６は、具体的には、前方向予測ステップＳ４５１〜Ｓ４５４の符号化コストと後方向予測ステップＳ４５５〜Ｓ４５８の符号化コストとを比較して、小ブロック毎の符号化コストを小さくする予測方向（ピクチャ参照方向）を選択する。 The prediction direction selection step S46 selects a part of the encoding modes from the plurality of encoding modes based on the encoding cost derived by the full-pel prediction step S45. Specifically, the prediction direction selection step S46 compares the encoding cost of the forward prediction steps S451 to S454 with the encoding cost of the backward prediction steps S455 to S458, and reduces the encoding cost for each small block. The prediction direction (picture reference direction) to be selected is selected.

符号化コスト導出ステップＳ４７は、予測方向選択ステップＳ４６が選択した予測方向の符号化コストを分割方法候補毎に合計し、画像ブロック単位での符号化コストを導出する。ここでは、フルペル予測ステップＳ４５が各小ブロックごとに符号化コストが低いピクチャ参照方向を選択しているため、各分割方法候補ごとの符号化モードにおいて最も符号化コストが低い小ブロックの組み合わせが可能となる。 In the encoding cost deriving step S47, the encoding costs in the prediction direction selected in the prediction direction selecting step S46 are summed for each division method candidate, and the encoding cost in units of image blocks is derived. Here, since the full-pel prediction step S45 selects the picture reference direction with the lowest coding cost for each small block, the combination of the small blocks with the lowest coding cost is possible in the coding mode for each division method candidate. It becomes.

分割方法候補選択ステップＳ４２は、符号化コスト導出ステップＳ４７が導出した画像ブロック単位での符号化コストを比較し、符号化コストが小さい２種類の分割方法候補を選択する。
サブペル予測ステップＳ４３は、分割方法候補選択ステップＳ４２において選択された２種類の分割方法候補により分割された小ブロックのそれぞれについて、非整数画素精度の動き推定を行う。ここで、非整数画素精度の動き推定は、図２８を用いて説明したサブペル予測ステップＳ３０１と同様に行われる。すなわち、選択された２種類の分割方法候補により分割された小ブロックのそれぞれについて、小ブロックフルペル予測ステップＳ４５で導出された動きベクトルに基づいて、非整数画素精度の動き推定を行う。また、サブペル予測ステップＳ４３では、小ブロックのそれぞれについて、前方向予測ステップＳ４３１，Ｓ４３４と、後方向予測ステップＳ４３２，Ｓ４３５と、双方向予測ステップＳ４３３，Ｓ４３６とが行われる。この結果、それぞれの小ブロックについて、３種類の予測方向についての符号化コストが導出される。また、前方向予測ステップＳ４３１，Ｓ４３４と、後方向予測ステップＳ４３２，Ｓ４３５と、双方向予測ステップＳ４３３，Ｓ４３６は、選択された２種類の分割方法候補により分割された小ブロックの個数に応じた回数の処理が行われている。 In the division method candidate selection step S42, the coding costs in units of image blocks derived by the coding cost deriving step S47 are compared, and two types of division method candidates with low coding costs are selected.
The sub-pel prediction step S43 performs motion estimation with non-integer pixel accuracy for each of the small blocks divided by the two types of division method candidates selected in the division method candidate selection step S42. Here, the motion estimation with non-integer pixel accuracy is performed in the same manner as the sub-pel prediction step S301 described with reference to FIG. That is, motion estimation with non-integer pixel accuracy is performed for each of the small blocks divided by the two selected division method candidates based on the motion vector derived in the small block full-pel prediction step S45. In sub-pel prediction step S43, forward prediction steps S431 and S434, backward prediction steps S432 and S435, and bidirectional prediction steps S433 and S436 are performed for each of the small blocks. As a result, for each small block, encoding costs for three types of prediction directions are derived. The forward prediction steps S431 and S434, the backward prediction steps S432 and S435, and the bidirectional prediction steps S433 and S436 are performed in accordance with the number of small blocks divided by the two selected division method candidates. Is being processed.

分割方法決定ステップＳ４４は、分割方法候補選択ステップＳ４２において選択された２種類の分割方法候補により分割されたそれぞれの小ブロックについて最小となる符号化コストから、小ブロックごとの予測方向を決定するとともに、画像ブロック単位での符号化コストを導出する。さらに、導出された画像ブロック単位での符号化コストを２種類の分割方法候補について比較し、最小の符号化コストを有する分割方法候補を画像ブロックの分割方法として決定する。また、同時に小ブロックについての動きベクトルが得られる。 In the division method determination step S44, the prediction direction for each small block is determined from the encoding cost that is the minimum for each small block divided by the two types of division method candidates selected in the division method candidate selection step S42. Deriving the coding cost for each image block. Further, the derived encoding cost for each image block is compared for two types of division method candidates, and the division method candidate having the minimum encoding cost is determined as the image block division method. At the same time, a motion vector for a small block is obtained.

図４を用いて、フルペル予測ステップＳ４１と分割方法候補選択ステップＳ４２との処理について詳しい説明を加える。なお、上述の様に、フルペル予測ステップＳ４１は、小ブロックフルペル予測ステップＳ４５と、予測方向選択ステップＳ４６と、符号化コスト導出ステップＳ４７とを備えている。 With reference to FIG. 4, a detailed description will be given of the processing in the full-pel prediction step S41 and the division method candidate selection step S42. As described above, the full-pel prediction step S41 includes a small block full-pel prediction step S45, a prediction direction selection step S46, and an encoding cost derivation step S47.

小ブロックフルペル予測ステップＳ４５は、小ブロックＳｂ１〜Ｓｂ９の全てに対して、整数画素精度の前方向予測（図４では、ｆｗと記載）と後方向予測（図４では、ｂｗと記載）を行い、それぞれの参照方向に対する符号化コストを導出する。図４では、それぞれの符号化コストを例示している。例えば、小ブロックＳｂ２では、前方向予測の符号化コストが（２１）、後方向予測の符号化コストが（２２）である。 The small block full-pel prediction step S45 performs forward prediction (denoted as fw in FIG. 4) and backward prediction (denoted as bw in FIG. 4) with integer pixel accuracy for all of the small blocks Sb1 to Sb9. Then, the encoding cost for each reference direction is derived. FIG. 4 illustrates the respective encoding costs. For example, in the small block Sb2, the encoding cost for forward prediction is (21), and the encoding cost for backward prediction is (22).

予測方向選択ステップＳ４６は、小ブロック毎に前方向予測と後方向予測との符号化コストを比較して、符号化コストがより小さい予測方向を選択する。例えば、小ブロックＳｂ２では、前方向予測が選択される。
符号化コスト導出ステップＳ４７は、予測方向選択ステップＳ４６が選択した小ブロック毎の符号化コストから画像ブロック単位での符号化コストを導出する。例えば、１６×８の分割方法について、小ブロックＳｂ２では、前方向予測が選択され、小ブロックＳｂ３では、後方向予測が選択されているため、１６×１６の画像ブロック単位での符号化コストは、（４１）となる。 The prediction direction selection step S46 compares the encoding costs of the forward prediction and the backward prediction for each small block, and selects a prediction direction with a smaller encoding cost. For example, in the small block Sb2, forward prediction is selected.
In the coding cost deriving step S47, the coding cost for each image block is derived from the coding cost for each small block selected in the prediction direction selecting step S46. For example, for the 16 × 8 division method, the forward prediction is selected in the small block Sb2, and the backward prediction is selected in the small block Sb3. Therefore, the coding cost for each 16 × 16 image block is (41).

分割方法候補選択ステップＳ４２では、フルペル予測ステップＳ４１により導出された画像ブロック単位での符号化コストを分割方法候補毎に比較し、符号化コストが小さい２種類の分割方法候補を選択する。図４では、１６×１６の分割方法（符号化コスト（４０））および１６×８の分割方法（符号化コスト（４１））が分割方法候補として選択される。 In the division method candidate selection step S42, the coding costs in units of image blocks derived in the full-pel prediction step S41 are compared for each division method candidate, and two types of division method candidates with low coding costs are selected. In FIG. 4, a 16 × 16 division method (encoding cost (40)) and a 16 × 8 division method (encoding cost (41)) are selected as division method candidates.

〈エンコーダ１の効果〉
エンコーダ１では、フルペル予測ステップＳ４１により得られた符号化コストから分割方法候補選択ステップＳ４２が分割方法候補の絞り込みを行う。さらに、絞り込んだ分割候補の小ブロックに対して、サブペル予測ステップＳ４３がサブペル予測を行う。ここで、サブペル予測はフィルタを適用する必要があってフルペル予測に比して処理量が多いが、この装置では、符号化モードの決定に際して全ての小ブロックＳｂ１〜Ｓｂ９についてサブペル予測を行う必要が無い。このため、サブペル予測の回数を削減でき、符号化モード決定の処理量を低減することが可能となる。また、必要な小ブロックにはサブペル予測を行うため、適切な符号化効率の符号化モードを決定することが可能となる。 <Effect of encoder 1>
In the encoder 1, the division method candidate selection step S42 narrows down the division method candidates from the encoding cost obtained in the full-pel prediction step S41. Further, the sub-pel prediction step S43 performs sub-pel prediction for the narrowed-down candidate blocks. Here, the sub-pel prediction needs to apply a filter and has a larger processing amount than the full-pel prediction, but in this apparatus, it is necessary to perform sub-pel prediction for all the small blocks Sb1 to Sb9 when determining the encoding mode. No. For this reason, the number of sub-pel predictions can be reduced, and the processing amount for determining the coding mode can be reduced. In addition, since sub-pel prediction is performed for the necessary small blocks, it is possible to determine an encoding mode with appropriate encoding efficiency.

〈エンコーダ１の変形例〉
本発明はかかる上記実施形態に限定されるものではなく、本発明の範囲を逸脱することなく種々の変形又は修正が可能である。
（１）フルペル予測部１３の変形例
（１−１）
上記実施形態では、フルペル予測ステップＳ４１を実行するフルペル予測部１３は、それぞれの小ブロックＳｂ１〜Ｓｂ９に対して、前方向予測ステップＳ４５１〜Ｓ４５４および後方向予測ステップＳ４５５〜Ｓ４５８を実行すると説明した（以下、第１フルペル予測方法という）。この場合は、双方向予測を行わないため、処理量を削減でき、フルペル予測の処理時間を短縮できる。 <Modification of Encoder 1>
The present invention is not limited to the above-described embodiment, and various changes or modifications can be made without departing from the scope of the present invention.
(1) Modification of full-pel prediction unit 13 (1-1)
In the above-described embodiment, the full-pel prediction unit 13 that executes the full-pel prediction step S41 is described as executing the forward prediction steps S451 to S454 and the backward prediction steps S455 to S458 for each of the small blocks Sb1 to Sb9 ( Hereinafter, it is referred to as a first full-pel prediction method). In this case, since bidirectional prediction is not performed, the amount of processing can be reduced, and the processing time of full-pel prediction can be shortened.

ここで、フルペル予測ステップＳ４１は、さらに双方向予測を実行し符号化コストを導出するものであっても良い（以下、第２フルペル予測方法という）。この場合は、双方向予測を行うため、フルペル予測の精度を向上させることが可能となる。このため、より適切な符号化モードを選択することが可能となる。また、前方向予測ステップＳ４５１〜Ｓ４５４および後方向予測ステップＳ４５５〜Ｓ４５８により導出された符号化コストから双方向予測を実行した場合の符号化コストを推定するものであっても良い（以下、第３フルペル予測方法という）。この装置では、双方向予測の予測結果を推定するため、フルペル予測部１３では、双方向予測を行う必要はなく、処理量を低減することが可能となる。また、予測結果をフルペル予測部１３による符号化コストに反映させることにより、双方向予測を行った場合と同様の効果を簡易に得ることが可能となる。このため、符号化効率を簡易に向上させることが可能となる。 Here, the full-pel prediction step S41 may further perform bi-directional prediction to derive an encoding cost (hereinafter referred to as a second full-pel prediction method). In this case, since bi-directional prediction is performed, it is possible to improve the accuracy of full-pel prediction. For this reason, it becomes possible to select a more appropriate encoding mode. Moreover, the encoding cost when bi-directional prediction is executed may be estimated from the encoding costs derived by the forward prediction steps S451 to S454 and the backward prediction steps S455 to S458 (hereinafter referred to as “third”). Full-pel prediction method). In this apparatus, since the prediction result of bidirectional prediction is estimated, the full-pel prediction unit 13 does not need to perform bidirectional prediction, and the processing amount can be reduced. In addition, by reflecting the prediction result on the encoding cost by the full-pel prediction unit 13, it is possible to easily obtain the same effect as when bi-directional prediction is performed. For this reason, encoding efficiency can be easily improved.

図５を用いて、１６×１６の画像ブロックを２つに分割した８×１６の小ブロックＳｂ４および小ブロックＳｂ５（図２５参照）に対して実行される第１〜第３フルペル予測方法について説明する。
図５（ａ）は、第１フルペル予測方法について説明する処理フローである。第１フルペル予測方法では、小ブロックＳｂ４，Ｓｂ５に対する前方向予測ステップＳ４５３および後方向予測ステップＳ４５７とが行われ、小ブロックＳｂ４，Ｓｂ５についての前方向予測ステップＳ４５３による符号化コストＣ４ｆ，Ｃ５ｆと、小ブロックＳｂ４，Ｓｂ５についての後方向予測ステップＳ４５７による符号化コストＣ４ｂ，Ｃ５ｂとが導出される。導出された符号化コストＣ４ｆ，Ｃ５ｆ，Ｃ４ｂ，Ｃ５ｂは、小ブロックごとの予測方法選択ステップＳ４６である小ブロック予測方法選択ステップＳ４６３（図３参照）において、小ブロックごとに比較され、小さい符号化コストを有する予測方向が選択される。より具体的には、小ブロック予測方法選択ステップＳ４６３内の比較ステップＳ４６３ａにおいて、小ブロックＳｂ４についての符号化コストＣ４ｆとＣ４ｂとが比較され、比較ステップＳ４６３ｂにおいて、小ブロックＳｂ５についての符号化コストＣ５ｆとＣ５ｂとが比較され、それぞれの小ブロックについて、より小さい符号化コストを有する予測方向が選択される。 The first to third full-pel prediction methods executed for the 8 × 16 small block Sb4 and the small block Sb5 (see FIG. 25) obtained by dividing the 16 × 16 image block into two will be described with reference to FIG. To do.
FIG. 5A is a processing flow for explaining the first full-pel prediction method. In the first full-pel prediction method, the forward prediction step S453 and the backward prediction step S457 for the small blocks Sb4 and Sb5 are performed, and the encoding costs C4f and C5f by the forward prediction step S453 for the small blocks Sb4 and Sb5 are: The encoding costs C4b and C5b in the backward prediction step S457 for the small blocks Sb4 and Sb5 are derived. The derived encoding costs C4f, C5f, C4b, and C5b are compared for each small block in a small block prediction method selection step S463 (see FIG. 3), which is a prediction method selection step S46 for each small block, and are encoded with a small encoding. A prediction direction having a cost is selected. More specifically, the coding cost C4f and C4b for the small block Sb4 are compared in the comparison step S463a in the small block prediction method selection step S463, and the coding cost C5f for the small block Sb5 is compared in the comparison step S463b. And C5b are compared, and for each small block, the prediction direction with the lower coding cost is selected.

図５（ｂ）は、第２フルペル予測方法について説明する処理フローである。第１フルペル予測方法との違いは、双方向予測ステップＳ４５９が行われる点である。例えば、小ブロックＳｂ４について、前方向予測ステップＳ４５３および後方向予測ステップＳ４５７で検出された動きベクトルであるＭＶ４ｆとＭＶ４ｂとを利用した予測が行われる。具体的には、ＭＶ４ｆおよびＭＶ４ｂが示す参照ピクチャ上の参照領域を平均して予測画像とし、双方向予測ステップＳ４５９の符号化コストＣ４ｇが導出される。小ブロックＳｂ５についても同様に、ＭＶ５ｆとＭＶ５ｂとを利用して、符号化コストＣ５ｇが導出される。 FIG. 5B is a processing flow for explaining the second full-pel prediction method. The difference from the first full-pel prediction method is that a bidirectional prediction step S459 is performed. For example, for the small block Sb4, prediction using the motion vectors MV4f and MV4b detected in the forward prediction step S453 and the backward prediction step S457 is performed. Specifically, the reference area on the reference picture indicated by MV4f and MV4b is averaged to be a predicted image, and the encoding cost C4g of the bidirectional prediction step S459 is derived. Similarly, for the small block Sb5, the encoding cost C5g is derived using the MV 5f and the MV 5b.

導出された双方向予測ステップＳ４５９の符号化コストＣ４ｇ，Ｃ５ｇは、小ブロック予測方法選択ステップＳ４６３の変形例としての小ブロック予測方法選択ステップＳ４６５において、前方向予測ステップＳ４５３および後方向予測ステップＳ４５７の符号化コストＣ４ｆ，Ｃ５ｆ，Ｃ４ｂ，Ｃ５ｂと比較される。具体的には、比較ステップＳ４６５ａにおいて、小ブロックＳｂ４についての符号化コストＣ４ｆ，Ｃ４ｂ，Ｃ４ｇが比較され、比較ステップＳ４６５ｂにおいて、小ブロックＳｂ５についての符号化コストＣ５ｆ，Ｃ５ｂ，Ｃ５ｇが比較される。この結果、それぞれの小ブロックについての最小の符号化コストを有する予測方向が選択される。 The derived encoding costs C4g and C5g of the bidirectional prediction step S459 are the same as those of the forward prediction step S453 and the backward prediction step S457 in the small block prediction method selection step S465 as a modification of the small block prediction method selection step S463. It is compared with the coding costs C4f, C5f, C4b, C5b. Specifically, in the comparison step S465a, the encoding costs C4f, C4b, and C4g for the small block Sb4 are compared, and in the comparison step S465b, the encoding costs C5f, C5b, and C5g for the small block Sb5 are compared. As a result, the prediction direction with the lowest coding cost for each small block is selected.

第２フルペル予測方法では、小ブロックについてより正確な動き検出が可能となり、符号化効率の向上が期待できる。
図５（ｃ）は、第３フルペル予測方法について説明する処理フローである。第１フルペル予測方法との違いは、双方向予測を行った場合の符号化コストの符号化コスト推定ステップＳ４６８が行われる点である。 In the second full-pel prediction method, more accurate motion detection is possible for small blocks, and an improvement in coding efficiency can be expected.
FIG. 5C is a processing flow for explaining the third full-pel prediction method. The difference from the first full-pel prediction method is that the encoding cost estimation step S468 of the encoding cost when bi-directional prediction is performed is performed.

符号化コスト推定ステップＳ４６８は、前方向予測ステップＳ４５３および後方向予測ステップＳ４５７の符号化コストＣ４ｆ，Ｃ５ｆ，Ｃ４ｂ，Ｃ５ｂから、双方向予測を行った場合の符号化コストの推定値である推定符号化コストＣ４ｈおよびＣ５ｈを導出する。具体的には、小ブロックＳｂ４についての符号化コストＣ４ｆとＣ４ｂとが「近い値」のときに、推定符号化コストＣ４ｈは、符号化コストＣ４ｆとＣ４ｂとの小さい方よりも少しだけ小さい、例えば、小さい方の値の９割の値など、と推定される。 The coding cost estimation step S468 is an estimated code that is an estimated value of the coding cost when bi-directional prediction is performed from the coding costs C4f, C5f, C4b, and C5b of the forward prediction step S453 and the backward prediction step S457. Derivation costs C4h and C5h are derived. Specifically, when the encoding costs C4f and C4b for the small block Sb4 are “near values”, the estimated encoding cost C4h is slightly smaller than the smaller one of the encoding costs C4f and C4b, for example, It is estimated that 90% of the smaller value.

ここで、「近い値」とは、例えば、式ａｂｓ（［Ｃ４ｆ］−［Ｃ４ｂ］）＊Ｋ＜ａｂｓ（ａｂｓ（［Ｃ４ｆ］）＋ａｂｓ（［Ｃ４ｂ］）が真の時に、符号化コストＣ４ｆとＣ４ｂとが「近い値」と判定される。ここで、［Ｃ４ｆ］、［Ｃ４ｂ］は、符号化コストＣ４ｆ，Ｃ４ｂの値を示し、Ｋは、所定の定数である。 Here, the “near value” is, for example, the encoding cost C4f when the formula abs ([C4f] − [C4b]) * K <abs (abs ([C4f]) + abs ([C4b])) is true. C4b is determined to be “near value.” Here, [C4f] and [C4b] indicate values of the encoding costs C4f and C4b, and K is a predetermined constant.

さらに、推定符号化コストＣ４ｈ，Ｃ５ｈは、比較ステップＳ４６３ａ，Ｓ４６３ｂの変形例としての比較ステップＳ４６６ａ，Ｓ４６６ｂにおいて、符号化コストＣ４ｆ，Ｃ５ｆ，Ｃ４ｂ，Ｃ５ｂと比較される。具体的には、比較ステップＳ４６６ａにおいて、推定符号化コストＣ４ｈおよび符号化コストＣ４ｆ，Ｃ４ｂの比較が行われ、比較ステップＳ４６６ｂにおいて、推定符号化コストＣ５ｈおよび符号化コストＣ５ｆ，Ｃ５ｂの比較が行われる。この結果、それぞれの小ブロックについての最小の符号化コストを有する予測方向が選択される。 Further, the estimated encoding costs C4h and C5h are compared with the encoding costs C4f, C5f, C4b, and C5b in comparison steps S466a and S466b as modifications of the comparison steps S463a and S463b. Specifically, in the comparison step S466a, the estimated coding cost C4h and the coding costs C4f and C4b are compared, and in the comparison step S466b, the estimated coding cost C5h and the coding costs C5f and C5b are compared. . As a result, the prediction direction with the lowest coding cost for each small block is selected.

第３フルペル予測方法では、双方向予測を行う必要はなく、処理量を低減することが可能となる。また、双方向予測を行った場合と同様の効果を簡易に得ることが可能となる。このため、符号化効率を簡易に向上させることが可能となる。
（１−２）
上記実施形態において、小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６とは、逐次処理されるものであっても、並列処理されるものであってもよい。 In the third full-pel prediction method, it is not necessary to perform bi-directional prediction, and the processing amount can be reduced. In addition, it is possible to easily obtain the same effect as when bidirectional prediction is performed. For this reason, encoding efficiency can be easily improved.
(1-2)
In the above embodiment, the small block full-pel prediction step S45 and the prediction direction selection step S46 may be sequentially processed or may be processed in parallel.

図６を用いて、１６×１６の画像ブロックを１つに分割した１６×１６の小ブロックＳｂ１（図２５参照）に対する小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６との処理順序について説明する。
図６（ａ）は、小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６とを逐次処理する場合の処理フローを示している。詳しい説明は、図３を用いて上記実施形態において行ったので省略する。 The processing order of the small block full-pel prediction step S45 and the prediction direction selection step S46 for the 16 × 16 small block Sb1 (see FIG. 25) obtained by dividing the 16 × 16 image block into one will be described with reference to FIG. To do.
FIG. 6A shows a processing flow when the small block full-pel prediction step S45 and the prediction direction selection step S46 are sequentially processed. Detailed description has been made in the above embodiment with reference to FIG.

図６（ｂ）は、小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６とを並列処理する場合の処理フローを示している。ここでは、前方向予測ステップＳ４５１と後方向予測ステップＳ４５５とそれぞれの符号化コストの比較が並列実行される。具体的には、エンコーダ１のメモリ２６に前方向予測ステップＳ４５１および後方向予測ステップＳ４５５のために２枚の参照ピクチャを格納し、並列に動き推定および符号化コスト計算を実行する。最初の数回のコスト計算の最良値を比較し、符号化コストが大きい参照方向の動き推定を打ち切る。 FIG. 6B shows a processing flow when the small block full pel prediction step S45 and the prediction direction selection step S46 are processed in parallel. Here, the forward prediction step S451, the backward prediction step S455, and the respective encoding costs are compared in parallel. Specifically, two reference pictures are stored in the memory 26 of the encoder 1 for the forward prediction step S451 and the backward prediction step S455, and motion estimation and encoding cost calculation are executed in parallel. Compare the best values of the first few cost calculations and abort the motion estimation in the reference direction with the highest coding cost.

通常、動き推定では、有望な探索開始位置と、その周辺について符号化コスト計算を行い、その中で一番良い候補を選択する。この際、符号化コストの計算は、１０回から１０００回は行われる。本発明の場合、予測方向の選択に不必要な動き推定処理を途中で打ち切ることができ、フルペル予測の処理量を削減することが可能となる。 Usually, in motion estimation, encoding cost calculation is performed for a probable search start position and its surroundings, and the best candidate is selected. At this time, the encoding cost is calculated 10 to 1000 times. In the case of the present invention, the motion estimation process unnecessary for selection of the prediction direction can be stopped halfway, and the processing amount of full-pel prediction can be reduced.

ここで、メモリ２６の割り当て量を、１枚の参照ピクチャを格納する場合と同じにするため、画素情報の間引かれた２枚の参照ピクチャを用いて、動き推定することとしても良い。
また、小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６とは、それぞれの小ブロックごとに並列処理されるだけでなく、全ての小ブロックについて並列処理されるものであっても良い。 Here, in order to make the allocation amount of the memory 26 the same as when one reference picture is stored, motion estimation may be performed using two reference pictures obtained by thinning out pixel information.
Further, the small block full pel prediction step S45 and the prediction direction selection step S46 may be performed not only in parallel for each small block but also in parallel for all small blocks.

図７を用いて、１６×１６の画像ブロックを４種類の分割方法で分割した場合の全ての小ブロックＳｂ１〜Ｓｂ９に対する、小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６との処理順序について説明する。
図７では、全ての小ブロックＳｂ１〜Ｓｂ９に対する、小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６とが並列実行されている。また、図６（ｂ）を用いて説明したように、それぞれの小ブロックごとに、不必要な予測方向への動き推定の処理が打ち切られる。さらに、小ブロックごとの符号化コストの比較により、符号化コストが小さくならない小ブロックについての動き推定の処理が打ち切られる。 Referring to FIG. 7, the processing order of small block full-pel prediction step S45 and prediction direction selection step S46 for all small blocks Sb1 to Sb9 when a 16 × 16 image block is divided by four types of division methods is used. explain.
In FIG. 7, the small block full pel prediction step S45 and the prediction direction selection step S46 are executed in parallel for all the small blocks Sb1 to Sb9. Further, as described with reference to FIG. 6B, the motion estimation process in the unnecessary prediction direction is terminated for each small block. Furthermore, the motion estimation process for a small block whose encoding cost does not decrease is terminated by comparing the encoding costs for each small block.

すなわち、小ブロックごとに、不要な予測方向への動き推定処理を打ち切るだけでなく、分割方法の選択に不要な小ブロックへの動き推定処理も打ち切ることが可能となる。これにより、不必要な動き推定処理をさらに削減することが可能となり、フルペル予測の処理量をさらに削減することが可能となる。 That is, for each small block, not only the motion estimation process in the unnecessary prediction direction is aborted, but also the motion estimation process to the small block unnecessary for selection of the division method can be aborted. Thereby, unnecessary motion estimation processing can be further reduced, and the processing amount of full-pel prediction can be further reduced.

（１−３）
上記実施形態において、小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６とは、小ブロックごとに逐次処理されるものであってもよい。
図８を用いて、１６×１６の画像ブロックを２つに分割した１６×８の小ブロックＳｂ２，Ｓｂ３（図２５参照）に対する小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６との処理順序について説明する。 (1-3)
In the above embodiment, the small block full-pel prediction step S45 and the prediction direction selection step S46 may be sequentially processed for each small block.
The processing order of the small block full-pel prediction step S45 and the prediction direction selection step S46 for 16 × 8 small blocks Sb2 and Sb3 (see FIG. 25) obtained by dividing a 16 × 16 image block into two using FIG. Will be described.

小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６とが逐次処理される場合（図８（ａ）参照）、前方向予測ステップＳ４５２、後方向予測ステップＳ４５６、および小ブロックごとの予測方法選択ステップＳ４６である小ブロック予測方法選択ステップＳ４６３は、以下の順番で行われる。小ブロックＳｂ２についての前方向予測ステップＳ４５２’、小ブロックＳｂ２についての後方向予測ステップＳ４５６’、小ブロックＳｂ３についての前方向予測ステップＳ４５２”、小ブロックＳｂ３についての後方向予測ステップＳ４５６”、小ブロックＳｂ２についての予測方法選択ステップＳ４６である小ブロック予測方法選択ステップＳ４６２’、小ブロックＳｂ３についての小ブロック予測方法選択ステップＳ４６２”の順である。 When the small block full-pel prediction step S45 and the prediction direction selection step S46 are sequentially processed (see FIG. 8A), the forward prediction step S452, the backward prediction step S456, and the prediction method selection step for each small block The small block prediction method selection step S463, which is S46, is performed in the following order. Forward prediction step S452 ′ for small block Sb2, backward prediction step S456 ′ for small block Sb2, forward prediction step S452 ″ for small block Sb3, backward prediction step S456 ″ for small block Sb3, small block The order is the small block prediction method selection step S462 ′, which is the prediction method selection step S46 for Sb2, and the small block prediction method selection step S462 ″ for the small block Sb3.

一方、小ブロックフルペル予測ステップＳ４５と予測方向選択ステップＳ４６とが、小ブロックごとに逐次処理される場合には（図８（ｂ）参照）、まず、小ブロックＳｂ２についての処理が行われ、その後、小ブロックＳｂ３についての処理が行われる。すなわち、まず、小ブロックＳｂ２についての前方向予測ステップＳ４５２’、後方向予測ステップＳ４５６’、小ブロック予測方法選択ステップＳ４６２’が行われる。その後、小ブロックＳｂ３についての前方向予測ステップＳ４５２”、後方向予測ステップＳ４５６”、小ブロック予測方法選択ステップＳ４６２”が行われる。また、この際に、小ブロックごとの処理は、（１−２）で説明したように、並列処理されてもよい。例えば、小ブロック１についての前方向予測ステップＳ４５２”、後方向予測ステップＳ４５６”、小ブロック予測方法選択ステップＳ４６２”が並列実行されても良い。 On the other hand, when the small block full-pel prediction step S45 and the prediction direction selection step S46 are sequentially processed for each small block (see FIG. 8B), first, processing for the small block Sb2 is performed. Thereafter, the process for the small block Sb3 is performed. That is, first, a forward prediction step S452 ', a backward prediction step S456', and a small block prediction method selection step S462 'for the small block Sb2 are performed. Thereafter, the forward prediction step S452 ", backward prediction step S456", and small block prediction method selection step S462 "for the small block Sb3 are performed. At this time, the processing for each small block is (1-2 For example, the forward prediction step S452 ", the backward prediction step S456", and the small block prediction method selection step S462 "for the small block 1 may be executed in parallel. .

（１−４）
図９及び図１０を用いてフルペル予測部１３の変形例について説明する。図９は、画像ブロックについての符号化モード決定の処理フローを示すブロック図である。図９の画像ブロックについての符号化モード決定の処理フローは、フルペル予測部１３により実行されるフルペル予測ステップＳ４１’と、分割方法候補選択部１４により実行される分割方法候補選択ステップＳ４２’とを備えている。 (1-4)
A modification of the full-pel prediction unit 13 will be described with reference to FIGS. 9 and 10. FIG. 9 is a block diagram illustrating a processing flow for determining an encoding mode for an image block. The processing flow for determining the coding mode for the image block in FIG. 9 includes a full-pel prediction step S41 ′ executed by the full-pel prediction unit 13 and a division method candidate selection step S42 ′ executed by the division method candidate selection unit 14. I have.

フルペル予測ステップＳ４１’は、小ブロックフルペル予測ステップＳ４５と、符号化コスト換算ステップＳ６６とを備えている。
小ブロックフルペル予測ステップＳ４５は、１６×１６の画像ブロックを４種類の分割方法候補により分割したＭ×Ｎ（（Ｍ，Ｎ）＝（１６，１６），（１６，８），（８，１６），（８，８））の小ブロックＳｂ１〜Ｓｂ９（図２５参照）のそれぞれに対して、整数画素精度の動き推定を行い、小ブロック毎の符号化コストおよび動きベクトルを導出する。具体的には、それぞれの小ブロックＳｂ１〜Ｓｂ９に対して、前方向予測ステップＳ４５１〜ステップＳ４５４および後方向予測ステップＳ４５５〜ステップＳ４５８が行われている。すなわち、前方向予測ステップＳ４５１〜ステップＳ４５４および後方向予測ステップＳ４５５〜ステップＳ４５８では、それぞれの分割方法候補により分割された小ブロックの個数に応じた回数の処理が行われている。図９では、この回数を処理ブロックからの矢印の本数で表している。 The full-pel prediction step S41 ′ includes a small block full-pel prediction step S45 and an encoding cost conversion step S66.
In the small block full-pel prediction step S45, M × N ((M, N) = (16,16), (16,8), (8, 16), (8, 8)) for each of the small blocks Sb1 to Sb9 (see FIG. 25), motion estimation with integer pixel accuracy is performed to derive the coding cost and motion vector for each small block. Specifically, forward prediction steps S451 to S454 and backward prediction steps S455 to S458 are performed on each of the small blocks Sb1 to Sb9. That is, in forward prediction steps S451 to S454 and backward prediction steps S455 to S458, processing is performed a number of times according to the number of small blocks divided by the respective division method candidates. In FIG. 9, this number is represented by the number of arrows from the processing block.

符号化コスト換算ステップＳ６６は、前方向予測ステップＳ４５１〜ステップＳ４５４の符号化コストと後方向予測ステップＳ４５５〜ステップＳ４５８の符号化コストを、それぞれ個別に、画像ブロック単位に換算する。具体的には、画像ブロック単位に換算した換算値とは、小ブロックフルペル予測ステップＳ４５によって得られた各小ブロックの各予測符号の符号化コストに、当該パーティションの分割数を乗じた値である。 In the encoding cost conversion step S66, the encoding costs in the forward prediction steps S451 to S454 and the encoding costs in the backward prediction steps S455 to S458 are individually converted into image block units. Specifically, the converted value converted into image block units is a value obtained by multiplying the encoding cost of each prediction code of each small block obtained by the small block full-pel prediction step S45 by the number of divisions of the partition. is there.

分割方法候補選択ステップＳ４２’は、符号化コスト導出ステップＳ４７が導出した画像ブロック単位での符号化コストを比較し、符号化コストが小さい２種類の分割方法候補を選択する。
図１０を用いて、フルペル予測ステップＳ４１’と分割方法候補選択ステップＳ４２’との処理について詳しい説明を加える。なお、上述の様に、フルペル予測ステップＳ４１’は、小ブロックフルペル予測ステップＳ４５と、符号化コスト換算ステップＳ６６とを備えている。 In the division method candidate selection step S42 ′, the coding costs in units of image blocks derived by the coding cost deriving step S47 are compared, and two types of division method candidates with low coding costs are selected.
With reference to FIG. 10, a detailed description will be given of the processing of the full-pel prediction step S41 ′ and the division method candidate selection step S42 ′. As described above, the full-pel prediction step S41 ′ includes a small block full-pel prediction step S45 and an encoding cost conversion step S66.

小ブロックフルペル予測ステップＳ４５は、小ブロックＳｂ１〜Ｓｂ９の全てに対して、整数画素精度の前方向予測（図１０では、ｆｗと記載）と後方向予測（図１０では、ｂｗと記載）と双方向予測（図１０では、ｂｉｄ）を行い、それぞれの参照方向に対する符号化コストを導出する。図１０では、それぞれの符号化コストを例示している。例えば、小ブロックＳｂ１では、前方向予測の符号化コストが（４０）、後方向予測の符号化コストが（７０）である。 The small block full-pel prediction step S45 includes forward prediction (denoted as fw in FIG. 10) and backward prediction (denoted as bw in FIG. 10) with integer pixel accuracy for all of the small blocks Sb1 to Sb9. Bidirectional prediction (bid in FIG. 10) is performed, and the coding cost for each reference direction is derived. FIG. 10 illustrates each encoding cost. For example, in the small block Sb1, the encoding cost for forward prediction is (40), and the encoding cost for backward prediction is (70).

符号化コスト換算ステップＳ６６は、前方向予測ステップＳ４５１〜ステップＳ４５４の符号化コストと後方向予測ステップＳ４５５〜ステップＳ４５８の符号化コストとを、それぞれ個別に、画像ブロック単位の符号化コストに換算する。具体的には、Ｓｂ１のｆｗ、ｂｗ、ｂｉｄの符号化コストは１倍し、Ｓｂ２〜Ｓｂ５のｆｗ，ｂｗ，ｂｉｄの符号化コストは２倍し、Ｓｂ６〜Ｓｂ９のＦＷ、ｂｗ、ｂｉｄの符号化コストは４倍する。 In the encoding cost conversion step S66, the encoding costs in the forward prediction steps S451 to S454 and the encoding costs in the backward prediction steps S455 to S458 are individually converted into the encoding costs for each image block. . Specifically, the coding cost of fw, bw, and bid of Sb1 is multiplied by 1, the coding cost of fw, bw, and bid of Sb2 to Sb5 is doubled, and the codes of FW, bw, and bid of Sb6 to Sb9 The cost will be quadrupled.

上記実施形態では、図１１（ａ）に示すように、フルペル予測ステップＳ４１’を実行するフルペル予測部１３は、それぞれの小ブロックＳｂ１〜Ｓｂ９に対して、前方向予測ステップＳ４５１〜Ｓ４５４および後方向予測ステップＳ４５５〜Ｓ４５８のみを実行すると説明した（以下、第１フルペル予測方法という）。ここで、フルペル予測ステップＳ４１’は、さらに双方向予測を実行し符号化コストを導出するものであっても良い（以下、第２フルペル予測方法という）。また、前方向予測ステップＳ４５１〜Ｓ４５４および後方向予測ステップＳ４５５〜Ｓ４５８により導出された符号化コストから双方向予測を実行した場合の符号化コストを推定するものであっても良い（以下、第３フルペル予測方法という）。 In the said embodiment, as shown to Fig.11 (a), the full pel prediction part 13 which performs full pel prediction step S41 'performs forward direction prediction step S451-S454 and back direction with respect to each small block Sb1-Sb9. It has been described that only the prediction steps S455 to S458 are executed (hereinafter referred to as a first full-pel prediction method). Here, the full-pel prediction step S41 'may further perform bi-directional prediction to derive an encoding cost (hereinafter referred to as a second full-pel prediction method). Moreover, the encoding cost when bi-directional prediction is executed may be estimated from the encoding costs derived by the forward prediction steps S451 to S454 and the backward prediction steps S455 to S458 (hereinafter referred to as “third”). Full-pel prediction method).

分割方法候補選択ステップＳ４２’では、フルペル予測ステップＳ４１’により導出された画像ブロック単位での符号化コストを比較し、符号化コストが小さい２種類の分割方法候補を選択する。図１０では、１６×１６の分割方法のｆｗ（符号化コスト（４０））および１６×１６の分割方法のｂｗ（符号化コスト（７０））が分割方法候補として選択される。 In the division method candidate selection step S42 ', the encoding costs for each image block derived in the full-pel prediction step S41' are compared, and two types of division method candidates with low encoding costs are selected. In FIG. 10, fw (encoding cost (40)) of the 16 × 16 division method and bw (encoding cost (70)) of the 16 × 16 division method are selected as division method candidates.

〈エンコーダ１の効果〉
エンコーダ１では、分割方法候補選択ステップＳ４２’において、フルペル予測ステップＳ４１’が導出した画像ブロック単位での符号化コストを比較し、符号化コストが小さい２種類の分割方法候補を選択しているため、全ての小ブロックＳｂ１〜Ｓｂ９についてサブペル予測を行う必要が無い。このため、サブペル予測の回数を削減でき、処理量を低減することが可能となる。また、必要な小ブロックにはサブペル予測を行うため、符号化効率を維持することが可能となる。 <Effect of encoder 1>
In the encoder 1, in the division method candidate selection step S42 ′, the encoding costs for each image block derived by the full-pel prediction step S41 ′ are compared, and two types of division method candidates with low encoding costs are selected. It is not necessary to perform sub-pel prediction for all the small blocks Sb1 to Sb9. For this reason, the frequency | count of subpel prediction can be reduced and it becomes possible to reduce a processing amount. Further, since sub-pel prediction is performed for the necessary small blocks, it is possible to maintain the coding efficiency.

特に、この実施形態では、前記実施形態とは異なり、分割方法候補選択ステップＳ４２’までに各分割方法の予測方向が絞り込まれておらず、すなわち各分割方法の各予測方向ごとにそれぞれ符号化コストが分割方法候補選択ステップＳ４２’での比較の対象となっている。言い換えると、フルペル予測部１３が小ブロックのピクチャ参照方向ごとの符号化コストを画像ブロック単位に換算して符号化モードを導出するため、一つの小ブロックにおいて異なるピクチャ参照方向の符号化モードも分割方法候補選択ステップＳ４２での比較対象となる。そのため、図１０に示す実施形態の画像ブロック場合は、最小の符号化コストである１６×１６の分割方法のｆｗ（符号化コスト（４０））および１６×１６の分割方法のｂｗ（符号化コスト（７０））が２種類の分割方法候補として選択される。この実施形態の画像ブロックに対して前記実施形態の装置を適用した場合は、フルペル予測ステップＳ４１において１６×１６の分割方法についてはｂｗが放棄されるため、例えば第２候補として１６×８分割（ｓｂ２がｂｉｄで、ｓｂ３がｂｉｄであり、符号化コストが７７）が選択されてしまう。 In particular, in this embodiment, unlike the previous embodiment, the prediction direction of each division method has not been narrowed down until the division method candidate selection step S42 ′, that is, the encoding cost for each prediction direction of each division method. Is a comparison target in the division method candidate selection step S42 ′. In other words, since the full-pel prediction unit 13 derives the encoding mode by converting the encoding cost for each picture reference direction of the small block into the image block unit, the encoding mode of the different picture reference direction is also divided in one small block. It becomes a comparison object in method candidate selection step S42. Therefore, in the case of the image block of the embodiment shown in FIG. 10, the fw (coding cost (40)) of the 16 × 16 division method, which is the minimum coding cost, and the bw (coding cost) of the 16 × 16 division method. (70)) is selected as two types of division method candidates. When the apparatus of the above embodiment is applied to the image block of this embodiment, bw is abandoned for the 16 × 16 division method in the full-pel prediction step S41, and thus, for example, 16 × 8 division ( sb2 is bid, sb3 is bid, and the coding cost is 77).

なお、図１１（ｂ）に示すように、符号化コスト換算ステップＳ６６を小ブロックフルペル予測ステップＳ４５内で行ってもよい。例えば、符号化換算処理は２倍や４倍といった簡単な計算であるため、小ブロックフルペル予測ステップＳ４５内にマージしてもい。また、換算値は、小ブロックフルペル予測ステップＳ４５中に１探索位置ごとに算出してもよいし、小ブロックフルペル予測ステップＳ４５後に求めてもよい。 In addition, as shown in FIG.11 (b), you may perform encoding cost conversion step S66 within small block full pel prediction step S45. For example, since the encoding conversion process is a simple calculation such as 2 or 4 times, it may be merged into the small block full-pel prediction step S45. The converted value may be calculated for each search position during the small block full pel prediction step S45, or may be obtained after the small block full pel prediction step S45.

（２）分割方法候補選択部１４の変形例
分割方法候補選択ステップＳ４２が選択する分割方法候補は、２種類に限られない。４種類の分割方法候補のうち、１〜３種類の分割方法候補を選択するものであれば良い。
（３）サブペル予測部１５の変形例
（３−１）
上記実施形態では、サブペル予測ステップＳ４３は、分割方法候補選択ステップＳ４２において選択された２種類の分割方法候補により分割された小ブロックのそれぞれについて、前方向予測、後方向予測、および双方向予測の３種類の予測方向へのサブペル予測を行うと説明した。 (2) Modification of Division Method Candidate Selection Unit 14 The division method candidates selected by the division method candidate selection step S42 are not limited to two types. Any one of 1 to 3 types of division method candidates may be selected from the four types of division method candidates.
(3) Modification of sub-pel prediction unit 15 (3-1)
In the above embodiment, the sub-pel prediction step S43 performs forward prediction, backward prediction, and bidirectional prediction for each of the small blocks divided by the two types of division method candidates selected in the division method candidate selection step S42. It has been described that sub-pel prediction in three types of prediction directions is performed.

ここで、サブペル予測ステップＳ４３は、フルペル予測ステップＳ４１の動き推定の結果に基づいて、各分割候補において３種類の予測方向のうち実際に行うものを決定し、決定したものについてのみサブペル予測を行っても良い。これについて、図１２を用いて説明する。 Here, the sub-pel prediction step S43 determines what is actually performed among the three types of prediction directions in each division candidate based on the motion estimation result of the full-pel prediction step S41, and performs sub-pel prediction only for the determined one. May be. This will be described with reference to FIG.

まず、フルペル予測ステップＳ４１は、それぞれのパーティションに対する前方向および後方向の整数画素精度の動き推定を行う。分割方法候補選択ステップＳ４２は、フルペル予測ステップＳ４１の整数画素精度の動き推定に基づいて、分割方法候補を選択する。さらに、サブペル予測ステップＳ４３は、分割方法候補選択ステップＳ４２で選択された分割方法候補により分割された小ブロックについてのサブペル予測の予測方向を判断する。 First, the full-pel prediction step S41 performs motion estimation with forward and backward integer pixel accuracy for each partition. In the division method candidate selection step S42, a division method candidate is selected based on the motion estimation with integer pixel accuracy in the full-pel prediction step S41. Further, the sub-pel prediction step S43 determines the prediction direction of the sub-pel prediction for the small block divided by the division method candidate selected in the division method candidate selection step S42.

より具体的には、以下の３つのケースにより予測方向が判断される。
第１のケースは、前方向予測の符号化コストと後方向予測の符号化コストとがほぼ一致する場合である。この場合、前方向予測、後方向予測、および双方向予測の３種類の予測方向について、非整数画素精度の動き推定が実行される。また、この場合に、前方向予測および後方向予測の２種類の予測方向についてのみ、非整数画素精度の動き推定が実行されてもよい。 More specifically, the prediction direction is determined by the following three cases.
The first case is a case where the encoding cost for forward prediction and the encoding cost for backward prediction substantially match. In this case, motion estimation with non-integer pixel accuracy is performed for three types of prediction directions: forward prediction, backward prediction, and bidirectional prediction. Further, in this case, motion estimation with non-integer pixel accuracy may be executed only for two types of prediction directions: forward prediction and backward prediction.

第２のケースは、第１のケース以外で、前方向予測の符号化コストが後方向予測の符号化コストよりも小さい場合である。この場合、前方向予測による非整数画素精度の動き推定が実行され、後方向予測および双方向予測による非整数画素精度の動き推定は実行されない。 The second case is a case where the encoding cost for forward prediction is smaller than the encoding cost for backward prediction except for the first case. In this case, motion estimation with non-integer pixel accuracy by forward prediction is executed, and motion estimation with non-integer pixel accuracy by backward prediction and bidirectional prediction is not executed.

第３のケースは、第１のケース以外で、前方向予測の符号化コストが後方向予測の符号化コストよりも大きい場合である。この場合、後方向予測による非整数画素精度の動き推定が実行され、前方向予測および双方向予測による非整数画素精度の動き推定は実行されない。 The third case is a case where the encoding cost of the forward prediction is larger than the encoding cost of the backward prediction except for the first case. In this case, motion estimation with non-integer pixel accuracy by backward prediction is executed, and motion estimation with non-integer pixel accuracy by forward prediction and bidirectional prediction is not executed.

第２及び第３のケースのように前方向予測と後向予測の符号化コストが異なる場合は符号化コストが小さい方のみを選択するのは、一方の符号化コストが大きい場合は双方向予測で符号化コストが小さくなることが期待できないからである。
以上に述べたように、上記３つのケースの判断により、必要な参照方向を参照して非整数画素精度の動き推定を実行することが可能であるため、サブペル予測の処理量を削減でき、サブペル予測の処理時間を短縮可能となる。 When the encoding costs of forward prediction and backward prediction are different as in the second and third cases, only the one with the smaller encoding cost is selected. When one of the encoding costs is large, bi-directional prediction is selected. This is because the encoding cost cannot be expected to be small.
As described above, since it is possible to perform motion estimation with non-integer pixel accuracy by referring to the required reference direction by the determination of the above three cases, it is possible to reduce the processing amount of sub-pel prediction, The processing time for prediction can be shortened.

（３−２）
上記（３−１）の判断に加えて、サブペル予測ステップＳ４３は、分割方法候補選択ステップＳ４２において選択された分割方法候補のうちのさらに一部の分割方法候補に対してサブペル予測を行うものであってもよい。すなわち、この場合は、分割方法候補選択ステップＳ４２において選択された分割方法候補でもサブペル予測が行われないものが発生する。つまり、複数の符号化モードから選択された一部の符号化モードの全部についてサブペル予測を行う必要がなく、処理量を削減できる。また、処理量を一定に保つように一部の符号化モードのうちの少なくとも一部の符号化モードを選択することも可能となる。 (3-2)
In addition to the determination in (3-1) above, the sub-pel prediction step S43 performs sub-pel prediction on a part of the division method candidates selected in the division method candidate selection step S42. There may be. That is, in this case, some of the division method candidates selected in the division method candidate selection step S42 are not subjected to sub-pel prediction. That is, it is not necessary to perform sub-pel prediction for all of some of the encoding modes selected from the plurality of encoding modes, and the processing amount can be reduced. It is also possible to select at least some of the coding modes among some of the coding modes so as to keep the processing amount constant.

例えば、上記（３−１）で判断された予測方向に基づいて、サブペル予測の対象となる小ブロックごとの必要処理量が推定される。さらに、画像ブロック全体についての必要処理量の合計が、画像ブロックのサブペル予測に割り当てられた処理余裕量を超えないように、サブペル予測を行う小ブロックの候補を絞り込む。このため、サブペル予測ステップＳ４３は分割方法候補選択ステップＳ４２によって選択された符号化モード（具体的には、分割方法候補）の全てを選択しないこともあり得るが、その場合でも符号化コストが低い分割方法候補は選択されているため問題が少ない。 For example, based on the prediction direction determined in (3-1) above, the required processing amount for each small block to be subjected to sub-pel prediction is estimated. Further, narrow block candidates to be subjected to sub-pel prediction are narrowed down so that the total required processing amount for the entire image block does not exceed the processing margin allocated to the sub-pel prediction of the image block. For this reason, the sub-pel prediction step S43 may not select all of the coding modes (specifically, the division method candidates) selected in the division method candidate selection step S42, but even in that case, the coding cost is low. Since the division method candidates are selected, there are few problems.

より具体的には、図１３のサブペル予測の動作処理フローを用いて説明する。なお、説明の都合上、１６×１６の小ブロックＳｂ１の１予測方向あたりの必要処理量を［４］、１６×８，８×１６の小ブロックＳｂ２〜Ｓｂ５の必要処理量をそれぞれ［２］、８×８の小ブロックＳｂ６〜Ｓｂ９の必要処理量をそれぞれ［１］として説明を行う。小ブロックの１予測方向あたりのサブペル予測の必要処理量は、小ブロックの画素数に比例するためである。 More specifically, the operation processing flow for sub-pel prediction in FIG. 13 will be described. For convenience of explanation, the required processing amount per prediction direction of the 16 × 16 small block Sb1 is [4], and the required processing amounts of the 16 × 8 and 8 × 16 small blocks Sb2 to Sb5 are [2]. , 8 × 8 small blocks Sb6 to Sb9 will be described as [1]. This is because the required processing amount of sub-pel prediction per prediction direction of a small block is proportional to the number of pixels of the small block.

処理は、画像ブロック単位で行われる（ステップＳ３０〜ステップＳ３７）。まず、１６×１６の画像ブロックのサブペル予測に割り当てられた処理量が処理余裕量として設定される（ステップＳ３０）。次に、分割方法候補毎の処理が行われる（ステップＳ３１〜ステップＳ３７）。 Processing is performed in units of image blocks (steps S30 to S37). First, the processing amount allocated to the sub-pel prediction of the 16 × 16 image block is set as the processing margin amount (step S30). Next, processing for each candidate division method is performed (steps S31 to S37).

分割方法候補毎の処理は、分割方法候補選択ステップＳ４２において選択された分割方法候補のうち、フルペル予測による符号化コストが小さいものから順番に行われる。まず、（３−１）で説明した方法により、小ブロックごとのサブペル予測の予測方向が選択され、小ブロックごとのサブペル予測の必要処理量が推定される。さらに、推定された小ブロックごとの必要処理量は、分割方法候補単位で合計され、分割方法候補全体の必要処理量が算出される（ステップＳ３１）。 The processing for each division method candidate is performed in order from the division method candidate selected in the division method candidate selection step S42 in ascending order of encoding cost by full-pel prediction. First, the prediction direction of sub-pel prediction for each small block is selected by the method described in (3-1), and the necessary processing amount of sub-pel prediction for each small block is estimated. Further, the estimated necessary processing amount for each small block is summed up in units of division method candidates, and the necessary processing amount of the entire division method candidates is calculated (step S31).

例えば、１６×８の小ブロックＳｂ２に対して、１方向の予測方向が選択された場合には（例えば、（３−１）の第２のケースまたは第３のケース）、小ブロックＳｂ２の１予測方向あたりの必要処理量［２］に、予測方向から定まる定数［１］を乗じた値［２］が小ブロックＳｂ２の必要処理量として算出される。また、３方向の予測方向が選択された場合には（例えば、（３−１）の第１のケース）、小ブロックＳｂ２の１予測方向あたりの必要処理量［２］に、予測方向から定まる定数［２］を乗じた値［４］が小ブロックＳｂ２の必要処理量として算出される。ここで、３方向の予測方向が選択されている場合に予測方向から定まる定数を［２］としている理由は、双方向予測については、動き推定処理は行わず、前方向予測および後方向予測の結果を利用した予測を行うことができるからである（図５（ｂ）又は（ｃ）で説明した方法をサブペル予測に利用可能である。）。このようにして推定した小ブロック毎の必要処理量は、分割方法単位で合計され、分割方法候補の必要処理量が算出される。 For example, when one prediction direction is selected for the 16 × 8 small block Sb2 (for example, the second case or the third case (3-1)), 1 of the small block Sb2 is selected. A value [2] obtained by multiplying the required processing amount [2] per prediction direction by a constant [1] determined from the prediction direction is calculated as the required processing amount of the small block Sb2. When three prediction directions are selected (for example, the first case of (3-1)), the required processing amount per prediction direction [2] of the small block Sb2 is determined from the prediction direction. A value [4] multiplied by the constant [2] is calculated as the required processing amount of the small block Sb2. Here, the reason that the constant determined from the prediction direction when the three prediction directions are selected is [2]. For bidirectional prediction, the motion estimation process is not performed, and the forward prediction and the backward prediction are performed. This is because prediction using the result can be performed (the method described in FIG. 5B or 5C can be used for sub-pel prediction). The necessary processing amount for each small block estimated in this way is summed up in units of division methods, and the necessary processing amount of the division method candidate is calculated.

算出された必要処理量は、ステップＳ３０で設定された処理余裕量と比較され、必要処理量が処理余裕量より大きくない場合には、処理余裕があると判定される（ステップＳ３２）。
処理余裕があると判定された場合には、（３−１）で選択された予測方向に対する小ブロックごとのサブペル予測が行われる（ステップＳ３３）。さらに、処理余裕量と分割方法候補の必要処理量との差が処理余裕量として設定され、次の分割方法候補に対する処理が開始される。 The calculated required processing amount is compared with the processing margin amount set in step S30, and if the required processing amount is not larger than the processing margin amount, it is determined that there is a processing margin (step S32).
If it is determined that there is a processing margin, sub-pel prediction is performed for each small block in the prediction direction selected in (3-1) (step S33). Further, the difference between the processing margin amount and the required processing amount of the division method candidate is set as the processing margin amount, and processing for the next division method candidate is started.

処理余裕が無いと判定された場合には、フルペル予測に基づいて最小の符号化コストを示すと判断される１つの予測方向が小ブロック毎に選択され（ステップＳ５５）、小ブロックごとの必要処理量が分割方法候補単位で合計され、分割方法候補の必要処理量が算出される（ステップＳ３５）。例えば、１６×８の小ブロックＳｂ２，Ｓｂ３に対して、小ブロックＳｂ２，Ｓｂ３の１予測方向あたりの必要処理量［２］が合計され、１６×８の分割方法候補の必要処理量が［４］と算出される。算出された必要処理量は、ステップＳ３０で設定された処理余裕量と比較され、必要処理量が処理余裕量より小さい場合には、処理余裕があると判定される（ステップＳ３６）。 If it is determined that there is no processing margin, one prediction direction determined to indicate the minimum coding cost based on full-pel prediction is selected for each small block (step S55), and necessary processing for each small block is performed. The amounts are totaled in units of division method candidates, and the necessary processing amount of the division method candidates is calculated (step S35). For example, for the 16 × 8 small blocks Sb2 and Sb3, the necessary processing amount [2] per prediction direction of the small blocks Sb2 and Sb3 is totaled, and the necessary processing amount of the 16 × 8 division method candidate is [4 ] Is calculated. The calculated required processing amount is compared with the processing margin amount set in step S30. If the required processing amount is smaller than the processing margin amount, it is determined that there is a processing margin (step S36).

処理余裕があると判定された場合には、フルペル予測に基づいて最小の符号化コストを示すと判断される１つの予測方向に対するサブペル予測が行われる（ステップＳ３７）。さらに、処理余裕量とステップＳ３５で算出された分割方法候補の必要処理量との差が処理余裕量として設定され（ステップ３４）、次の分割方法候補に対する処理が開始される。 If it is determined that there is a processing margin, sub-pel prediction is performed for one prediction direction that is determined to indicate the minimum coding cost based on full-pel prediction (step S37). Further, the difference between the processing margin amount and the necessary processing amount of the division method candidate calculated in step S35 is set as the processing margin amount (step 34), and processing for the next division method candidate is started.

ステップＳ３６で処理余裕がないと判定された場合には、サブペル予測は行わず、次の画像ブロックの処理を開始する。
（３−２−１）
次に、図１４を用いて、第１の具体例を説明する。この具体例では、分割方法候補選択ステップ４２において、第１候補として１６×１６分割方法（符号化コスト（４０））が選択され、第２候補として１６×８分割方法（符号化コスト（４３））が選択されている。 If it is determined in step S36 that there is no processing margin, sub-pel prediction is not performed and processing of the next image block is started.
(3-2-1)
Next, a first specific example will be described with reference to FIG. In this specific example, in the division method candidate selection step 42, the 16 × 16 division method (encoding cost (40)) is selected as the first candidate, and the 16 × 8 division method (encoding cost (43)) is selected as the second candidate. ) Is selected.

図１４に示すように、処理は、画像ブロック単位で行われる（ステップＳ３０〜ステップＳ３７）。まず、１６×１６の画像ブロックのサブペル予測に割り当てられた処理量が処理余裕量［８］として設定される（ステップＳ３０）。次に、分割方法候補毎の処理が行われる（ステップＳ３１〜Ｓ３７）。 As shown in FIG. 14, the process is performed in units of image blocks (step S30 to step S37). First, the processing amount allocated to the sub-pel prediction of the 16 × 16 image block is set as the processing margin amount [8] (step S30). Next, processing is performed for each candidate division method (steps S31 to S37).

分割方法候補毎の処理は、分割方法候補選択ステップＳ４２において選択された分割方法候補のうち、フルペル予測による符号化コストが小さいものから順番に行われる。
最初は、１６×１６の分割方法（符号化コスト（４０））が対象になる。具体的には、１６×１６の分割方法において、まず、（３−１）で説明した方法により、小ブロックＳｂ１のサブペル予測の予測方向が選択される。この場合は、第２のケースであり、前方向予測ｆｗの符号化コストが後方向予測ｂｗの符号化コストよりも小さい場合である。そのため、前方向予測による非整数画素精度の動き推定が実行され、後方向予測および双方向予測による非整数画素精度の動き推定は実行されない。この結果、小ブロックＳｂ１のサブペル予測の必要処理量［４］が推定される。さらに、１６×１６の分割方法の必要処理量［４］が算出される（ステップＳ３１）。 The processing for each division method candidate is performed in order from the division method candidate selected in the division method candidate selection step S42 in ascending order of encoding cost by full-pel prediction.
Initially, the 16 × 16 division method (encoding cost (40)) is targeted. Specifically, in the 16 × 16 division method, first, the prediction direction of the sub-pel prediction of the small block Sb1 is selected by the method described in (3-1). This case is a second case where the coding cost of the forward prediction fw is smaller than the coding cost of the backward prediction bw. Therefore, motion estimation with non-integer pixel accuracy by forward prediction is executed, and motion estimation with non-integer pixel accuracy by backward prediction and bidirectional prediction is not executed. As a result, the necessary processing amount [4] for sub-pel prediction of the small block Sb1 is estimated. Further, the required processing amount [4] of the 16 × 16 division method is calculated (step S31).

算出された必要処理量［４］は、ステップＳ５０で設定された処理余裕量［８］と比較され、必要処理量［４］が処理余裕量［８］より大きくないため、処理余裕があると判定される（ステップＳ３２）。
この場合は、（３−１）で選択された予測方向（ｆｗ）に対する小ブロックＳｂ１のサブペル予測が行われる（ステップＳ３３）。さらに、処理余裕量［８］と分割方法候補の必要処理量［４］との差が処理余裕量［４］として設定される（ステップＳ３４）。 The calculated required processing amount [4] is compared with the processing margin amount [8] set in step S50. Since the required processing amount [4] is not larger than the processing margin amount [8], there is a processing margin. Determination is made (step S32).
In this case, sub-pel prediction of the small block Sb1 with respect to the prediction direction (fw) selected in (3-1) is performed (step S33). Further, the difference between the processing margin [8] and the necessary processing amount [4] of the division method candidate is set as the processing margin [4] (step S34).

次に、１６×８の分割方法（符号化コスト（４２））が対象になる。具体的には、１６×８の分割方法において、まず、（３−１）で説明した方法により、第２のケースとして小ブロックＳｂ２のサブペル予測の予測方向が選択され（ｆｗ）、小ブロックＳｂ２のサブペル予測の必要処理量［２］が推定される。また、第３のケースとして小ブロックＳｂ３のサブペル予測の予測方向が選択され（ｂｗ）、小ブロックＳｂ３のサブペル予測の必要処理量［２］が推定される。さらに、推定された小ブロックＳｂ２の必要処理量［２］と小ブロックＳｂ３の必要処理量［２］は合計され、１６×８の分割方法候補の必要処理量［４］が算出される（ステップＳ３１）。 Next, the 16 × 8 division method (encoding cost (42)) is targeted. Specifically, in the 16 × 8 division method, first, the prediction direction of the sub-pel prediction of the small block Sb2 is selected as the second case by the method described in (3-1) (fw), and the small block Sb2 The required processing amount [2] for sub-pel prediction is estimated. As a third case, the prediction direction of the sub-pel prediction of the small block Sb3 is selected (bw), and the necessary processing amount [2] of the sub-pel prediction of the small block Sb3 is estimated. Further, the estimated necessary processing amount [2] of the small block Sb2 and the necessary processing amount [2] of the small block Sb3 are summed to calculate the necessary processing amount [4] of the 16 × 8 division method candidate (step). S31).

算出された必要処理量［４］は、ステップＳ３４で設定された処理余裕量［４］と比較され、必要処理量［４］が処理余裕量［４］より大きくないため、処理余裕があると判定される（ステップＳ３２）。
この場合は、（３−１）で選択された予測方向（ｆｗ）に対する小ブロックＳｂ２のサブペル予測が行われ、さらに（３−１）で選択された予測方向（ｂｗ）に対する小ブロックＳｂ３のサブペル予測が行われる（ステップＳ３３）。 The calculated required processing amount [4] is compared with the processing margin amount [4] set in step S34, and the required processing amount [4] is not larger than the processing margin amount [4]. Determination is made (step S32).
In this case, sub-pel prediction of the small block Sb2 for the prediction direction (fw) selected in (3-1) is performed, and further, the sub-pel of the small block Sb3 for the prediction direction (bw) selected in (3-1). Prediction is performed (step S33).

さらに、処理余裕量［４］とステップ３５で算出された分割方法候補の必要処理量［４］との差が処理余裕量として設定されるが（ステップＳ３４）、その値が［０］になったので、次の分割方法候補に対する処理は行わない。
（３−２−２）
次に、図１５を用いて、第２の具体例を説明する。この具体例では、分割方法候補選択ステップ４２において、第１候補として１６×１６分割方法（符号化コスト（４０））が選択され、第２候補として１６×８分割方法（符号化コスト（４３））が選択されている。 Furthermore, the difference between the processing margin [4] and the necessary processing amount [4] of the division method candidate calculated in step 35 is set as the processing margin (step S34), and the value becomes [0]. Therefore, the process for the next division method candidate is not performed.
(3-2-2)
Next, a second specific example will be described with reference to FIG. In this specific example, in the division method candidate selection step 42, the 16 × 16 division method (encoding cost (40)) is selected as the first candidate, and the 16 × 8 division method (encoding cost (43)) is selected as the second candidate. ) Is selected.

図１５に示すように、処理は、画像ブロック単位で行われる（ステップＳ３０〜Ｓ３７）。まず、１６×１６の画像ブロックのサブペル予測に割り当てられた処理量が処理余裕量［８］として設定される（ステップＳ３０）。次に、分割方法候補毎の処理が行われる（ステップＳ３１〜ステップＳ３７）。 As shown in FIG. 15, the processing is performed in units of image blocks (steps S30 to S37). First, the processing amount allocated to the sub-pel prediction of the 16 × 16 image block is set as the processing margin amount [8] (step S30). Next, processing for each candidate division method is performed (steps S31 to S37).

分割方法候補毎の処理は、分割方法候補選択ステップＳ４２において選択された分割方法候補のうち、フルペル予測による符号化コストが小さいものから順番に行われる。
最初は、１６×１６の分割方法（符号化コスト（４０））が対象になる。具体的には、１６×１６の分割方法において、まず、（３−１）で説明した方法により、小ブロックＳｂ１のサブペル予測の予測方向が選択される。第１のケースであり、前方向予測ｆｗの符号化コストと後方向予測の符号化ｂｗコストとがほぼ一致している。そのため、前方向予測ｆｗおよび後方向予測ｂｗの２種類の予測方向について、非整数画素精度の動き推定が実行される。その結果、小ブロックＳｂ１の前方向予測ｆｗのサブペル予測の必要処理量［４］と、小ブロックＳｂ１の前方向予測ｂｗのサブペル予測の必要処理量［４］とが推定される。推定された小ブロックＳｂ１ごとの必要処理量［４］は、分割方法候補単位で合計され、分割方法候補の必要処理量［８］が算出される（ステップＳ３１）。 The processing for each division method candidate is performed in order from the division method candidate selected in the division method candidate selection step S42 in ascending order of encoding cost by full-pel prediction.
Initially, the 16 × 16 division method (encoding cost (40)) is targeted. Specifically, in the 16 × 16 division method, first, the prediction direction of the sub-pel prediction of the small block Sb1 is selected by the method described in (3-1). In the first case, the encoding cost of the forward prediction fw and the encoding bw cost of the backward prediction are substantially the same. Therefore, motion estimation with non-integer pixel accuracy is executed for two types of prediction directions, ie, forward prediction fw and backward prediction bw. As a result, the necessary processing amount [4] for sub-pel prediction of the forward prediction fw of the small block Sb1 and the necessary processing amount [4] for sub-pel prediction of the forward prediction bw of the small block Sb1 are estimated. The estimated necessary processing amount [4] for each small block Sb1 is summed up in units of division method candidates, and the necessary processing amount [8] of the division method candidates is calculated (step S31).

算出された必要処理量［８］は、ステップＳ３０で設定された処理余裕量［８］と比較され、必要処理量［８］が処理余裕量［８］より大きくないため、処理余裕があると判定される（ステップＳ３２）。
この場合は、（３−１）で選択された予測方向（ｆｗ）に対する小ブロックＳｂ１のサブペル予測が行われ、さらに予測方向（ｂｗ）に対する小ブロックＳｂ１のサブペル予測が行われる（ステップＳ３３）。 The calculated required processing amount [8] is compared with the processing margin amount [8] set in step S30, and the required processing amount [8] is not larger than the processing margin amount [8]. Determination is made (step S32).
In this case, sub-pel prediction of the small block Sb1 with respect to the prediction direction (fw) selected in (3-1) is performed, and further, sub-pel prediction of the small block Sb1 with respect to the prediction direction (bw) is performed (step S33).

さらに、処理余裕量［８］とステップ５５で算出された分割方法候補の必要処理量［８］との差が処理余裕量として設定されるが（ステップＳ３４）、その値が［０］になったので、次の分割方法候補に対する処理は行わない。
この具体例では、１６×８分割方法（符号化コスト（４２））は、分割方法候補選択ステップＳ４２において選択された分割方法候補であるにもかかわらず、サブペル予測が行われない。 Furthermore, the difference between the processing margin [8] and the necessary processing amount [8] of the division method candidate calculated in step 55 is set as the processing margin (step S34), and the value becomes [0]. Therefore, the process for the next division method candidate is not performed.
In this specific example, although the 16 × 8 division method (encoding cost (42)) is the division method candidate selected in the division method candidate selection step S42, sub-pel prediction is not performed.

（３−２）の効果
このサブペル予測部１５では、サブペル予測の処理量を制御することが可能となる。特に、処理量を最小にする制御を行うと、ソフトウェアエンコーダの場合には、処理時間の短縮化の効果があり、ハードウェアエンコーダの場合には、消費電力削減の効果がある。また、リアルタイムエンコーダのように、処理時間を一定にしなければならない場合には、余裕分の処理量をその他の候補に配分することで、圧縮性能を上げることも可能となる。 Effect of (3-2) The sub-pel prediction unit 15 can control the processing amount of the sub-pel prediction. In particular, when the control for minimizing the processing amount is performed, in the case of a software encoder, there is an effect of shortening the processing time, and in the case of a hardware encoder, there is an effect of reducing power consumption. In addition, when the processing time must be constant as in the real-time encoder, it is possible to improve the compression performance by allocating the surplus processing amount to other candidates.

（３−３）
上記実施形態では、以下のように説明した。すなわち、分割方法決定ステップＳ４４は、分割方法候補選択ステップＳ４２において選択された２種類の分割方法候補により分割されたそれぞれの小ブロックについて最小となる符号化コストから、小ブロックごとの予測方向を決定するとともに、画像ブロック単位での符号化コストを導出する。さらに、導出された画像ブロック単位での符号化コストを２種類の分割方法候補について比較し、最小の符号化コストを有する分割方法候補を画像ブロックの分割方法として決定する。 (3-3)
In the said embodiment, it demonstrated as follows. That is, the division method determination step S44 determines the prediction direction for each small block from the minimum coding cost for each small block divided by the two types of division method candidates selected in the division method candidate selection step S42. In addition, the encoding cost for each image block is derived. Further, the derived encoding cost for each image block is compared for two types of division method candidates, and the division method candidate having the minimum encoding cost is determined as the image block division method.

ここで、分割方法候補選択ステップＳ４２が選択する分割方法候補は、２種類に限られず、さらに多くの候補を選択してもよい。
また、分割方法決定ステップＳ４４は、画像ブロックの分割方法を１つに決定せず、さらに多くの方法を選択するものであってもよい。例えば、分割方法決定ステップＳ４４では、２つの符号化モードを選択し、さらに別のステップによって最終的な符号化モードを決定してもよい。 Here, the division method candidates selected in the division method candidate selection step S42 are not limited to two types, and more candidates may be selected.
Further, the division method determination step S44 may select more methods without determining one image block division method. For example, in the division method determination step S44, two encoding modes may be selected, and the final encoding mode may be determined in yet another step.

（４）その他の変形例
（４−１）
フルペル予測部１３では整数画素精度の動き推定が、サブペル予測部１５では非整数画素の動き推定が行われると記載した。ここで、動き推定の精度は、これらに限定されるものではない。 (4) Other modifications (4-1)
It has been described that the full-pel prediction unit 13 performs motion estimation with integer pixel accuracy, and the sub-pel prediction unit 15 performs motion estimation of non-integer pixels. Here, the accuracy of motion estimation is not limited to these.

例えば、フルペル予測部１３では、簡易な動き推定が、サブペル予測部１５では、複雑な動き推定が行われるものであってもよい。
より具体的には、複雑な動き推定とは、簡易な動き推定よりも複雑な動き推定のことであり、例えば、複雑な動き推定とは、整数画素精度の簡易な動き推定に対するより詳細な精度（例えば、１／２画素精度、１／４画素精度などといった非整数画素精度）での動き推定、非整数画素の簡易な動き推定に対するより詳細な精度での動き推定、縮小画像（画素情報の間引かれた画像）を参照する簡易な動き推定に対するより詳細な画像を参照する動き推定などである。 For example, the full-pel prediction unit 13 may perform simple motion estimation, and the sub-pel prediction unit 15 may perform complex motion estimation.
More specifically, complex motion estimation is more complex motion estimation than simple motion estimation. For example, complex motion estimation is more detailed accuracy than simple motion estimation with integer pixel accuracy. (E.g., non-integer pixel accuracy such as ½ pixel accuracy, ¼ pixel accuracy, etc.), motion estimation with more detailed accuracy for simple motion estimation of non-integer pixels, reduced image (pixel information For example, a motion estimation that refers to a more detailed image with respect to a simple motion estimation that refers to a thinned image).

さらに、簡易な動き推定は、縮小画像に対して２画素精度、１／２画素精度などの動き推定であってもよい。
これにより、複雑な動き推定の処理量を低減しつつ、複雑な動き推定による適切な動き推定の効果を享受することが可能となる。 Further, the simple motion estimation may be motion estimation such as 2-pixel accuracy and 1 / 2-pixel accuracy for a reduced image.
Accordingly, it is possible to enjoy the effect of appropriate motion estimation by complex motion estimation while reducing the amount of processing of complex motion estimation.

また、ここでは２段階の精度の動き推定により符号化モードの選択を行っているが、より多くの段階により選択を行ってもよい。
例えば、整数画素精度、１／２画素精度、１／４画素精度というように、３段階の精度の動き推定を用いて、符号化モードの選択を行ってもよい。 Here, the encoding mode is selected by motion estimation with two stages of accuracy, but the selection may be performed by more stages.
For example, the coding mode may be selected using motion estimation with three stages of accuracy, such as integer pixel accuracy, 1/2 pixel accuracy, and 1/4 pixel accuracy.

（４−２）
フルペル予測部１３あるいはサブペル予測部１５は、それぞれの動き推定の処理量がほぼ一定に保たれるように動き推定の方式を変化させるものであってもよい。
従来では、それぞれの動き推定処理に際して、それぞれ所定の処理時間が割り当てられており、この所定の処理時間内で、固定的に選択されたパーティションサイズ、参照ピクチャに対して動き推定処理が行われている。この場合、処理量が最も多くなると考えられる場合（ワーストケース）を想定して処理時間が割り当てられるため、処理対象がワーストケースで無い場合、処理時間に余裕が生まれ、効率的な動き推定処理が妨げられている。 (4-2)
The full-pel prediction unit 13 or the sub-pel prediction unit 15 may change the motion estimation method so that the processing amount of each motion estimation is kept substantially constant.
Conventionally, a predetermined processing time is assigned to each motion estimation process, and the motion estimation process is performed on a fixedly selected partition size and reference picture within the predetermined processing time. Yes. In this case, processing time is allocated assuming that the amount of processing is considered to be the largest (worst case). Therefore, when the processing target is not the worst case, there is room in processing time, and efficient motion estimation processing is performed. It is hindered.

そこで本発明では、処理に余裕がある場合には、動き推定の方式を変化させ、効率的な動き推定処理を行う。ここで、処理に余裕があるか否かは、例えば、画像ブロックにより構成される入力画像の画像属性に応じて、判断される。画像属性とは、例えば、画像のサイズや、画像の符号化方式（ピクチャタイプ〔Ｉピクチャ、Ｐピクチャ、Ｂピクチャ〕など）や、画像のフォーマット（走査方式〔プログレッシブ、インターレース〕、色差フォーマットなど）や、画像の動き量などである。 Therefore, in the present invention, when there is a margin in processing, the motion estimation method is changed to perform efficient motion estimation processing. Here, whether or not there is a margin for the processing is determined, for example, according to the image attribute of the input image configured by the image block. Image attributes include, for example, image size, image encoding method (picture type [I picture, P picture, B picture], etc.), image format (scanning method (progressive, interlaced), color difference format, etc.) And the amount of motion of the image.

（４−２−１）
例えば、画像ブロックにより構成される入力画像サイズと参照ピクチャ数とパーティションサイズ数との積がほぼ一定になるように、動き推定の方式（参照するピクチャの枚数・方向、動き推定を行うパーティションサイズのバリエーション、動きの探索範囲など）を変化させる。より具体的には、入力画像サイズが小さい場合には、参照ピクチャ数やパーティションサイズ数を大きくし、フルペル予測あるいはサブペル予測の処理をより精度良く行うことが可能となる。 (4-2-1)
For example, the motion estimation method (number of reference pictures, direction, partition size for motion estimation is set so that the product of the input image size composed of image blocks, the number of reference pictures, and the number of partition sizes is substantially constant. Variation, motion search range, etc.). More specifically, when the input image size is small, it is possible to increase the number of reference pictures and the number of partition sizes, and to perform full-pel prediction or sub-pel prediction processing with higher accuracy.

（４−２−２）
また、例えば、Ｂピクチャの参照ピクチャ数をＰピクチャよりも少なくし、ピクチャ単位で動き推定の処理量がほぼ一定に保たれるようにする。より具体的には、次のようなバリエーションが考えられる。＜１＞Ｐピクチャでは前方４枚を参照し、Ｂピクチャでは前方２枚・後方２枚を参照する。＜２＞Ｐピクチャでは前方３枚を参照し、Ｂピクチャでは前方２枚・後方１枚を参照する。＜３＞Ｐピクチャでは前方２枚を参照し、Ｂピクチャでは前方１枚・後方１枚を参照する。 (4-2-2)
Further, for example, the number of reference pictures of B picture is made smaller than that of P picture so that the processing amount of motion estimation is kept almost constant for each picture. More specifically, the following variations can be considered. <1> The front four pictures are referred to in the P picture, and the front two pictures and the rear two pictures are referred to in the B picture. <2> The front 3 frames are referred to in the P picture, and the front 2 frames and the rear 1 frame are referred to in the B picture. <3> The front picture is referred to in the P picture, and the front picture and the rear picture are referred to in the B picture.

（４−２−３）
また、例えば、Ｂピクチャのパーティションサイズ数をＰピクチャよりも少なくし、ピクチャ単位での動き推定の処理量がほぼ一定に保たれる用にする。より具体的には、次のようなバリエーションが考えられる。＜１＞Ｐピクチャでは前方１枚を参照し、１６ｘ１６，１６ｘ８，８ｘ１６，８ｘ８の４パーティションサイズで予測を行うとする。一方、Ｂピクチャでは、上記４サイズのいずれか二つを選び、それぞれが前方予測と後方予測を行うとする。＜２＞Ｐピクチャでは後方１枚を参照し、１６ｘ１６，１６ｘ８，８ｘ１６，８ｘ８の４パーティションサイズで予測を行うとする。一方、Ｂピクチャでは、上記４サイズのいずれか二つを選び、それぞれが前方予測と後方予測を行うとする。 (4-2-3)
Further, for example, the number of partition sizes of a B picture is made smaller than that of a P picture, so that the processing amount of motion estimation for each picture is kept almost constant. More specifically, the following variations can be considered. <1> It is assumed that prediction is performed with four partition sizes of 16 × 16, 16 × 8, 8 × 16, and 8 × 8 with reference to the front one in the P picture. On the other hand, in the B picture, any two of the above four sizes are selected, and each performs forward prediction and backward prediction. <2> It is assumed that prediction is performed with four partition sizes of 16 × 16, 16 × 8, 8 × 16, and 8 × 8 with reference to the rear one in the P picture. On the other hand, in the B picture, any two of the above four sizes are selected, and each performs forward prediction and backward prediction.

（４−２−４）
また、例えば、入力画像がインターレースの場合には参照ピクチャ数あるいはパーティションサイズ数をプログレッシブの場合よりもより減らす。これは、インターレースの場合には、トップフィールドとボトムフィールドの二フィールドを参照する必要があるからである。より具体的には、次のようなバリエーションが考えられる。＜１＞Ｐピクチャの場合、プログレッシブのＰピクチャでは、前方２フレームを参照し、インターレースのＰピクチャでは、前方２フィールド（時間的には１フレーム分）を参照する。＜２＞Ｐピクチャの場合、プログレッシブのＰピクチャでは、前方１フレームを参照し、パーティションサイズは４種類（１６ｘ１６から８ｘ８）の予測をし、インターレースのＰピクチャでは、前方２フィールド（時間的には１フレーム分）を参照し、パーティションサイズはそれぞれ２種類の予測をする。 (4-2-4)
Also, for example, when the input image is interlaced, the number of reference pictures or the number of partition sizes is reduced more than in the case of progressive. This is because in the case of interlace, it is necessary to refer to two fields, a top field and a bottom field. More specifically, the following variations can be considered. In the case of a <1> P picture, the front two frames are referred to in the progressive P picture, and the two front fields (one frame in time) are referred to in the interlaced P picture. <2> In the case of a P picture, the progressive P picture refers to the front one frame, and the partition size is predicted in four types (16 × 16 to 8 × 8). In the interlaced P picture, the front two fields (in terms of time) One frame) is referenced, and two types of partition sizes are predicted.

（４−２−５）
また、例えば、画像の動きに応じて、参照ピクチャ数あるいはパーティションサイズ数を変化させる。動きベクトルの探索では、探索の処理時間が動きベクトルの大きさの影響を受ける方式がある。このような方式を用いた場合、動きが小さければ、各パーティションサイズ・各参照ピクチャの処理時間が短い。このため、動きが小さい場合には、より多くのパーティションサイズ数・参照ピクチャ数を用いて動き推定を行う。一方、動きが大きい場合には、それぞれの動き推定の処理時間が長くなる。このため、動きが大きい場合には、参照ピクチャの枚数を減らす、あるいは、パーティションサイズ数を減らす。 (4-2-5)
Also, for example, the number of reference pictures or the number of partition sizes is changed according to the motion of the image. In motion vector search, there is a method in which search processing time is affected by the size of a motion vector. When such a method is used, if the motion is small, the processing time of each partition size and each reference picture is short. For this reason, when the motion is small, motion estimation is performed using a larger number of partition sizes and reference pictures. On the other hand, when the motion is large, the processing time for each motion estimation becomes long. For this reason, when the motion is large, the number of reference pictures is reduced, or the number of partition sizes is reduced.

［第２実施形態］
図１６及び図１７を用いて、本発明の第２実施形態としてのエンコーダについて説明する。
図１６は、本発明の第２実施形態としてのエンコーダ６０の構造を説明するブロック図である。エンコーダ６０は、例えば、入力画像信号３０をＭＰＥＧ−４符号化し、符号化画像信号３１として出力する画像符号化装置であり、パーソナルコンピュータ（ＰＣ）、携帯電話などにおいて備えられる。また、ＡＶＣにおいて導入された画像ブロックペア７３という単位で入力画像信号３０を符号化する装置である（図３０参照）。 [Second Embodiment]
An encoder as a second embodiment of the present invention will be described with reference to FIGS. 16 and 17.
FIG. 16 is a block diagram illustrating the structure of an encoder 60 as the second embodiment of the present invention. The encoder 60 is, for example, an image encoding device that performs MPEG-4 encoding on the input image signal 30 and outputs the encoded image signal 31 as an encoded image signal 31, and is provided in a personal computer (PC), a mobile phone, or the like. In addition, it is an apparatus that encodes the input image signal 30 in units of image block pairs 73 introduced in AVC (see FIG. 30).

〈エンコーダ６０の構成〉
図１６に示すエンコーダ６０は、入力画像信号３０のイントラ予測を行うイントラ予測部６１と、入力画像信号３０のインター予測を行うインター予測部６２と、符号化モード決定部６３と、イントラ予測およびインター予測の予測結果を切り換える切換部６４と、切換部６４の出力を符号化して符号化画像信号３１を出力する符号化部５と、入力画像信号３０のローカルデコード信号３２を作成する参照画像作成部６とを備えている。 <Configuration of Encoder 60>
The encoder 60 illustrated in FIG. 16 includes an intra prediction unit 61 that performs intra prediction of the input image signal 30, an inter prediction unit 62 that performs inter prediction of the input image signal 30, a coding mode determination unit 63, and intra prediction and inter prediction. A switching unit 64 that switches prediction results of prediction, an encoding unit 5 that encodes the output of the switching unit 64 and outputs an encoded image signal 31, and a reference image generation unit that generates a local decode signal 32 of the input image signal 30 6 is provided.

イントラ予測部６１は、図示しない制御部により制御され、符号化ピクチャ構造決定部６７が決定したピクチャ構造のブロック（フィールド構造ブロックまたはフレーム構造ブロック）についてイントラ予測を行う。その結果、イントラ予測部６１は、入力画像信号３０を画像ブロック毎にイントラ予測し、イントラ予測結果を切換部４に出力する。 The intra prediction unit 61 is controlled by a control unit (not shown) and performs intra prediction on a block having a picture structure (field structure block or frame structure block) determined by the encoded picture structure determination unit 67. As a result, the intra prediction unit 61 performs intra prediction on the input image signal 30 for each image block, and outputs the intra prediction result to the switching unit 4.

インター予測部６２は、入力画像信号３０を第１の入力とし、ローカルデコード信号３２を第２の入力として、インター予測結果を切換部４に出力する。さらに、インター予測部６２は、インター予測結果のうち、動きベクトルなど符号化にかかる情報を第２の出力として符号化部５に出力する。 The inter prediction unit 62 outputs the inter prediction result to the switching unit 4 with the input image signal 30 as the first input and the local decode signal 32 as the second input. Further, the inter prediction unit 62 outputs information related to encoding, such as a motion vector, among the inter prediction results to the encoding unit 5 as a second output.

インター予測部３は、入力画像信号３０を第１の入力、ローカルデコード信号３２を第２の入力とし、動き推定を行う動き推定部６５と、動き推定部６５の出力を第１の入力、ローカルデコード信号３２を第２の入力とし、予測画像を出力する予測画像作成部１１と、入力画像信号３０を第１の入力、予測画像作成部１１の出力を第２の入力とする減算部１２とから構成されている。動き推定部６５は、動き推定を行い、符号化コスト導出する。また、動き推定部６５の出力のうち、動きベクトルや符号化モードなどの符号化情報は、可変長符号化部２２の入力にも与えられる。 The inter prediction unit 3 uses the input image signal 30 as a first input and the local decode signal 32 as a second input, and performs a motion estimation unit 65 that performs motion estimation, and outputs the motion estimation unit 65 as a first input, A predicted image generating unit 11 that outputs the predicted image by using the decoded signal 32 as a second input; and a subtracting unit 12 that uses the input image signal 30 as a first input and the output of the predicted image generating unit 11 as a second input; It is composed of The motion estimation unit 65 performs motion estimation and derives an encoding cost. Of the output of the motion estimation unit 65, the encoded information such as the motion vector and the encoding mode is also given to the input of the variable length encoding unit 22.

切換部４は、イントラ予測結果を第１の入力、インター予測結果を第２の入力とし、符号化モード決定部６３からの切換信号に従って、いずれかの入力を符号化部５に出力する。
符号化部５及び参照画像作成部６の構造及び機能は前記実施形態と同様であるため、ここでは説明を省略する。 The switching unit 4 uses the intra prediction result as the first input and the inter prediction result as the second input, and outputs one of the inputs to the encoding unit 5 in accordance with the switching signal from the encoding mode determination unit 63.
Since the structures and functions of the encoding unit 5 and the reference image creation unit 6 are the same as those in the above embodiment, description thereof is omitted here.

符号化モード決定部６３は、符号化ピクチャ構造決定部６７と、イントラ／インター決定部６８とを有している。符号化ピクチャ構造決定部６７は、動き推定部６５からの符号化コスト情報を入力としている。符号化ピクチャ構造決定部６７は、トップ・ボトムについての符号化コストを、符号化ピクチャ構造毎に合計し、符号化ピクチャ構造を決定する。符号化ピクチャ構造決定部６７は、決定した符号化ピクチャ構造をイントラ／インター選択部６８に出力する。 The encoding mode determination unit 63 includes an encoded picture structure determination unit 67 and an intra / inter determination unit 68. The encoded picture structure determination unit 67 receives the encoding cost information from the motion estimation unit 65 as an input. The encoded picture structure determining unit 67 determines the encoded picture structure by summing the encoding costs for the top and the bottom for each encoded picture structure. The encoded picture structure determination unit 67 outputs the determined encoded picture structure to the intra / inter selection unit 68.

イントラ／インター選択部６８は、イントラ予測部６１からのイントラ予測の符号化コストと、インター予測部６２からのインター予測の符号化コストを入力としている。イントラ／インター選択部６８は、イントラ予測とインター予測の符号化コストを比較し、符号化モードを決定する。イントラ／インター選択部６８は、この結果を切換部６４に通知する。この結果、切換部６４が動作する。 The intra / inter selection unit 68 receives the encoding cost of intra prediction from the intra prediction unit 61 and the encoding cost of inter prediction from the inter prediction unit 62 as inputs. The intra / inter selection unit 68 compares the encoding costs of intra prediction and inter prediction, and determines the encoding mode. The intra / inter selection unit 68 notifies the switching unit 64 of the result. As a result, the switching unit 64 operates.

なお、制御部は、符号化モード決定部６３が備えていても良い。
図１７は、符号化モード決定（画像ブロックペアについての符号化ピクチャ構造決定と符号化予測方式決定）の処理フローを示すブロック図である。図１７の処理フローは、動き推定部６５によるインター予測ステップＳ５１と、符号化ピクチャ構造決定部６７による符号化ピクチャ構造決定ステップＳ５２と、イントラ予測部６１によるイントラ予測ステップＳ５３と、イントラ／インター選択部６８による符号化予測方式決定ステップＳ５４とを備えている。 Note that the control unit may be included in the encoding mode determination unit 63.
FIG. 17 is a block diagram illustrating a processing flow of determining an encoding mode (determining an encoded picture structure and determining an encoding prediction scheme for an image block pair). The processing flow of FIG. 17 includes an inter prediction step S51 by the motion estimation unit 65, an encoded picture structure determination step S52 by the encoded picture structure determination unit 67, an intra prediction step S53 by the intra prediction unit 61, and intra / inter selection. And an encoding prediction method determination step S54 by the unit 68.

インター予測ステップＳ５１は、画像ブロックペア７３のフィールド構造ブロックペア７５，７６およびフレーム構造ブロックペア７７，７８についての動き推定結果を導出する（図３０参照）。具体的には、インター予測ステップＳ５１は、フレーム構造トップＭＢ７７についての第１インター予測ステップＳ５１１とボトムＭＢ７８についての第２インター予測ステップＳ５１２とを備えている。第１インター予測ステップＳ５１１は、フレーム構造トップＭＢ７７に対してインター予測を行い、符号化コスト（cost top0）を導出する。第２インター予測ステップＳ５１２は、フレーム構造ブロックペアのボトムＭＢ７８に対してインター予測を行い、符号化コスト（cost bot0）を導出する。各符号化コストcost top0，cost bot0は符号化ピクチャ構造決定ステップＳ５２に送られる。さらに各符号化コストcost top0，cost bot0が合計され、フレーム構造ブロックペア７７，７８の符号化コストcost0が得られ、それが符号化ピクチャ構造決定ステップＳ５２に送られる。なお、この実施形態では、cost top0は１５００であり、cost bot0は１３００であり、cost0は２８００である。インター予測ステップＳ５１は、フィールド構造ブロックペア７５，７６のトップＭＢ７５についての第３インター予測ステップＳ５１３と、ボトムＭＢ７６についての第４インター予測ステップＳ５１４とをさらに備えている。第３インター予測ステップＳ５１３は、フィールド構造ブロックペア７５，７６のトップＭＢ７５に対してインター予測を行い、符号化コスト（cost top1）を導出する。第４インター予測ステップＳ５１４は、フィールド構造ブロックペア７５，７６のボトムＭＢ７６に対してインター予測を行い、符号化コスト（cost bot1）を導出する。各符号化コストcost top1，cost bot1は符号化ピクチャ構造決定ステップＳ５２に送られる。さらに各符号化コストcost top1，cost bot1が合計され、フィールド構造ブロックペア７５，７６の符号化コストcost1が得られ、それが符号化ピクチャ構造決定ステップＳ５２に送られる。なお、この実施形態では、cost top1は１４００であり、cost bot1は１３００であり、cost1は２７００である。 The inter prediction step S51 derives motion estimation results for the field structure block pairs 75 and 76 and the frame structure block pairs 77 and 78 of the image block pair 73 (see FIG. 30). Specifically, the inter prediction step S51 includes a first inter prediction step S511 for the frame structure top MB77 and a second inter prediction step S512 for the bottom MB78. In the first inter prediction step S511, inter prediction is performed on the frame structure top MB77, and an encoding cost (cost top0) is derived. In the second inter prediction step S512, inter prediction is performed on the bottom MB 78 of the frame structure block pair, and a coding cost (cost bot0) is derived. The respective coding costs cost top0 and cost bot0 are sent to the coded picture structure determination step S52. Further, the respective coding costs cost top0 and cost bot0 are summed to obtain the coding cost cost0 of the frame structure block pair 77 and 78, which is sent to the coded picture structure determination step S52. In this embodiment, cost top0 is 1500, cost bot0 is 1300, and cost0 is 2800. The inter prediction step S51 further includes a third inter prediction step S513 for the top MB 75 of the field structure block pair 75 and 76 and a fourth inter prediction step S514 for the bottom MB 76. In the third inter prediction step S513, inter prediction is performed on the top MB 75 of the field structure block pair 75 and 76, and a coding cost (cost top1) is derived. In the fourth inter prediction step S514, inter prediction is performed on the bottom MB 76 of the field structure block pair 75 and 76, and a coding cost (cost bot1) is derived. The respective coding costs cost top1 and cost bot1 are sent to the coded picture structure determination step S52. Further, the respective coding costs cost top1 and cost bot1 are summed to obtain the coding cost cost1 of the field structure block pair 75 and 76, which is sent to the coded picture structure determination step S52. In this embodiment, cost top1 is 1400, cost bot1 is 1300, and cost1 is 2700.

なお、第１〜第４インター予測ステップＳ５１１〜ステップＳ５１４は、それぞれが、１６×１６の分割方法、１６×８の分割方法、８×１６の分割方法、８×８の分割方法を含んだ動き推定動作全体を表している。つまり、第１〜第４インター予測ステップＳ５１１〜Ｓ５１４には、本発明の第１実施形態を適用できる。また、第１〜第４インター予測ステップＳ５１１〜ステップＳ５１４は、フルペル予測とサブペル予測の両方を行っても良いが、処理量削減のためフルペル予測だけを行っても良い。 The first to fourth inter prediction steps S511 to S514 are motions including a 16 × 16 division method, a 16 × 8 division method, an 8 × 16 division method, and an 8 × 8 division method, respectively. It represents the entire estimation operation. That is, the first embodiment of the present invention can be applied to the first to fourth inter prediction steps S511 to S514. Moreover, although 1st-4th inter prediction step S511-step S514 may perform both full pel prediction and sub pel prediction, you may perform only full pel prediction for processing amount reduction.

以上のように符号化ピクチャ構造の符号化コスト導出にはインター予測のみを行っているが、インター予測の判定の精度はイントラ予測より良いため、十分な精度が得られる。
符号化ピクチャ構造決定ステップＳ５２は、動き推定結果に基づいて、画像ブロックペア７３の符号化ピクチャ構造を決定する。具体的には、符号化ピクチャ構造決定ステップＳ５２は、インター予測ステップＳ５１からのフレーム構造ブロックペア７７，７８の符号化コストcost0と、フィールド構造ブロックペア７５，７６の符号化コストcost1とを比較し、フレーム／フィールド選択を行う。この実施形態ではフィールド構造ブロックペア７５，７６の符号化コストcost1(2700)がフレーム構造ブロックペア７７，７８の符号化コストcost0(2800)より小さいため、フィールドを選択する。この結果、フィールド構造ブロックペア７５，７６のトップＭＢ７５のインター符号化コストcost top1とボトムＭＢ７６のインター符号化コストcost bot1が、符号化予測方式決定ステップＳ５４に提供される。 As described above, only the inter prediction is performed for deriving the coding cost of the coded picture structure. However, since the accuracy of determination of inter prediction is better than that of intra prediction, sufficient accuracy can be obtained.
The encoded picture structure determination step S52 determines the encoded picture structure of the image block pair 73 based on the motion estimation result. Specifically, the coded picture structure determination step S52 compares the coding cost cost0 of the frame structure block pair 77 and 78 from the inter prediction step S51 with the coding cost cost1 of the field structure block pair 75 and 76. Frame / field selection. In this embodiment, since the coding cost cost1 (2700) of the field structure block pair 75 and 76 is smaller than the coding cost cost0 (2800) of the frame structure block pair 77 and 78, the field is selected. As a result, the inter coding cost cost top1 of the top MB 75 and the inter coding cost cost bot1 of the bottom MB 76 of the field structure block pair 75 and 76 are provided to the coding prediction method determination step S54.

イントラ予測ステップＳ５３は、決定された符号化ピクチャ構造を有するブロックペアについてのイントラ予測結果を導出する。具体的には、イントラ予測ステップＳ５３は、トップＭＢについての第１イントラ予測ステップＳ５３１と、ボトムＭＢについての第２イントラ予測ステップＳ５３２とを備えている。第１イントラ予測ステップＳ５３１は、選択された符号化ピクチャ構造ブロックペア（この場合はフィールド構造ブロックペア７５，７６）のトップＭＢ７５についてイントラ符号化コストcost top2を導出し、符号化予測方式決定ステップＳ５４に提供する。第２イントラ予測ステップＳ５３２は、選択された符号化ピクチャ構造ブロックペア（この場合はフィールド構造ブロックペア７５，７６）のボトムＭＢ７６についてイントラ符号化コストcost bot2を導出し、符号化予測方式決定ステップＳ５４に提供する。なお、この実施形態では、cost top2は１５００であり、cost bot2は１４００である。また、イントラ予測は、処理量を減らすため、画素を間引いて精度を落とした処理であっても良いし、さらにはイントラ４×４を省略しても良い。 In the intra prediction step S53, an intra prediction result for a block pair having the determined coded picture structure is derived. Specifically, the intra prediction step S53 includes a first intra prediction step S531 for the top MB and a second intra prediction step S532 for the bottom MB. The first intra prediction step S531 derives the intra coding cost cost top2 for the top MB 75 of the selected coded picture structure block pair (in this case, the field structure block pair 75, 76), and the coding prediction method determination step S54. To provide. In the second intra prediction step S532, an intra coding cost cost bot2 is derived for the bottom MB 76 of the selected coded picture structure block pair (in this case, the field structure block pair 75, 76), and a coding prediction method determination step S54 is performed. To provide. In this embodiment, cost top2 is 1500 and cost bot2 is 1400. Further, the intra prediction may be a process in which the accuracy is reduced by thinning out pixels in order to reduce the processing amount, and the intra 4 × 4 may be omitted.

符号化予測方式決定ステップＳ５４は、インター予測結果とイントラ予測結果とに基づいて、決定された符号化ピクチャ構造を有するブロックペアの各ブロックに対する符号化予測方式を決定する。具体的には、符号化予測方式決定ステップＳ５４は、トップＭＢについての第１符号化予測方式決定ステップＳ５４１と、ボトムＭＢについての第２符号化予測方式決定ステップＳ５４２とを備えている。第１符号化予測方式決定ステップＳ５４１は、符号化ピクチャ構造決定ステップＳ５２からのトップＭＢのインター符号化コスト（具体的には、フィールド構造ブロックペア７５，７６のトップＭＢ７５のインター符号化コストcost top1）と、第１イントラ予測ステップＳ５３１からのトップＭＢ７５のイントラ符号化コストcost top2を比較し、トップＭＢについてのイントラ／インター選択を行う。この場合は、インター符号化コストcost top1(1400)がイントラ符号化コストcost top2(1500)より小さいため、インターが選択される。第２符号化予測方式決定ステップＳ５４２は、符号化ピクチャ構造決定ステップＳ５２からのボトムＭＢのインター符号化コスト（具体的には、フィールド構造ブロックペア７５，７６のボトムＭＢ７６のインター符号化コストcost bot1）と、第２イントラ予測ステップＳ５３２からのボトムＭＢ７６のイントラ符号化コストcost bot2を比較し、ボトムＭＢ７６についてのイントラ／インター選択を行う。この場合は、この場合は、インター符号化コストcost bot1(1300)がイントラ符号化コストcost bot2(1400)より小さいため、インターが選択される。 The coding prediction scheme determination step S54 determines a coding prediction scheme for each block of the block pair having the determined coded picture structure based on the inter prediction result and the intra prediction result. Specifically, the encoding prediction method determination step S54 includes a first encoding prediction method determination step S541 for the top MB and a second encoding prediction method determination step S542 for the bottom MB. The first encoding prediction scheme determination step S541 is the top MB inter encoding cost from the encoded picture structure determination step S52 (specifically, the inter MB encoding cost cost top1 of the top MB 75 of the field structure block pair 75, 76). ) And the intra coding cost cost top2 of the top MB 75 from the first intra prediction step S531, and intra / inter selection for the top MB is performed. In this case, since inter coding cost cost top1 (1400) is smaller than intra coding cost cost top2 (1500), inter is selected. The second encoding prediction method determination step S542 is a bottom MB inter encoding cost from the encoded picture structure determination step S52 (specifically, an inter encoding cost cost bot1 of the bottom MB 76 of the field structure block pair 75, 76). ) And the intra coding cost cost bot2 of the bottom MB76 from the second intra prediction step S532, and performs intra / inter selection for the bottom MB76. In this case, inter is selected because the inter coding cost cost bot1 (1300) is smaller than the intra coding cost cost bot2 (1400) in this case.

なお、本実施形態ではトップＭＢとボトムＭＢで符号化予測方式（イントラ／インター）は同一になったが、異なることもある。ただし、トップＭＢとボトムＭＢで異なる符号化ピクチャ構造で符号化されることは無い。符号化ピクチャ構造決定ステップＳ５２で符号化ピクチャ構造が決定されているからである。 In this embodiment, the encoding prediction scheme (intra / inter) is the same for the top MB and the bottom MB, but may be different. However, the top MB and the bottom MB are not encoded with different encoded picture structures. This is because the encoded picture structure is determined in the encoded picture structure determining step S52.

この実施形態では、イントラ予測ステップＳ５３は符号化ピクチャ構造決定ステップＳ５２によって決定された符号化ピクチャ構造の画像ブロックペアについてのみイントラ予測を行うため、イントラ予測ステップＳ５３はフィールド構造ブロックおよびフレーム構造ブロックの全てについてイントラ予測を行う必要がない。このように処理負荷の高いイントラ予測の回数を減らすことができるため、画像ブロックペアの符号化予測方式を決定するための処理負荷を削減できる。 In this embodiment, since the intra prediction step S53 performs intra prediction only for the image block pair having the coded picture structure determined by the coded picture structure determining step S52, the intra prediction step S53 includes the field structure block and the frame structure block. There is no need to make intra predictions for everything. As described above, since the number of intra predictions with a high processing load can be reduced, the processing load for determining the coding prediction method of the image block pair can be reduced.

〈変形例〉
第２実施形態では、第１実施形態で記載した内容を適宜変形して適用可能である。ここでは、第２実施形態に特徴的な変形例について記載する。
（１）
上記実施形態では、第１〜第４インター予測ステップＳ５１１〜ステップＳ５１４は、フルペル予測とサブペル予測の両方を行っても良いと記載した。 <Modification>
In the second embodiment, the contents described in the first embodiment can be appropriately modified and applied. Here, a modified example characteristic of the second embodiment will be described.
(1)
In the said embodiment, 1st-4th inter prediction step S511-step S514 described that you may perform both full pel prediction and sub pel prediction.

ここで、フルペル予測とサブペル予測との両方を行う場合には、フルペル予測により絞り込まれたパーティションサイズ、参照ピクチャ、ピクチャ構造の組み合わせに対して、サブペル予測を行うものであっても良い。より具体的には、図１８に示すように、整数画素精度の動き推定により、適切なパーティションサイズ、参照ピクチャ、ピクチャ構造の組み合わせを選択し（ステップＳ７１０）、選択された組み合わせに対して非整数画素精度の動き推定により、さらに絞り込みを行う（ステップＳ７１１）。さらに、絞り込みの結果として得られるパーティションサイズ、参照ピクチャ、ピクチャ構造の組み合わせに対して、イントラ予測を行い（ステップＳ７１２）、イントラ予測・インター予測の選択を行う。 Here, when performing both full-pel prediction and sub-pel prediction, sub-pel prediction may be performed on a combination of partition size, reference picture, and picture structure narrowed down by full-pel prediction. More specifically, as shown in FIG. 18, an appropriate combination of partition size, reference picture, and picture structure is selected by motion estimation with integer pixel accuracy (step S710), and a non-integer is selected for the selected combination. Further narrowing is performed by motion estimation with pixel accuracy (step S711). Further, intra prediction is performed on the combination of partition size, reference picture, and picture structure obtained as a result of narrowing down (step S712), and intra prediction / inter prediction is selected.

これにより全パーティションサイズ、全参照ピクチャ、全ピクチャ構造について非整数画素精度の動き推定とイントラ予測とを行う必要が無く、処理量を削減することが可能となる。
（２）
上記実施形態では、画像ブロックペア７３という単位で入力画像信号３０を符号化する装置について説明した。ここで、符号化は、画像ブロックペア単位で行われなくてもよい。例えば、正方の画像ブロックを単位として符号化が行われてもよい。この場合、上記実施形態で説明した方法は、矩形のフィールド構造ブロックペアと矩形のフレーム構造ブロックペアとに対して適用される。 As a result, it is not necessary to perform motion estimation and intra prediction with non-integer pixel accuracy for all partition sizes, all reference pictures, and all picture structures, and the processing amount can be reduced.
(2)
In the above embodiment, the apparatus that encodes the input image signal 30 in units of the image block pair 73 has been described. Here, encoding may not be performed in units of image block pairs. For example, encoding may be performed in units of square image blocks. In this case, the method described in the above embodiment is applied to a rectangular field structure block pair and a rectangular frame structure block pair.

［第３実施形態］
図１９〜図２０を用いて、本発明の第３実施形態としてのエンコーダについて説明する。
〈エンコーダ１の構成〉
図１９に示すエンコーダ６０は、入力画像信号３０のイントラ予測を行うイントラ予測部９１と、入力画像信号３０のインター予測を行うインター予測部９２と、符号化モード決定部９３と、イントラ予測およびインター予測の予測結果を切り換える切換部９４と、切換部９４の出力を符号化して符号化画像信号３１を出力する符号化部５と、入力画像信号３０のローカルデコード信号３２を作成する参照画像作成部６とを備えている。 [Third Embodiment]
An encoder as a third embodiment of the present invention will be described with reference to FIGS.
<Configuration of encoder 1>
19 includes an intra prediction unit 91 that performs intra prediction of the input image signal 30, an inter prediction unit 92 that performs inter prediction of the input image signal 30, a coding mode determination unit 93, and intra prediction and inter prediction. A switching unit 94 that switches prediction prediction results, an encoding unit 5 that encodes the output of the switching unit 94 and outputs an encoded image signal 31, and a reference image generation unit that generates a local decode signal 32 of the input image signal 30 6 is provided.

イントラ予測部９１は、簡易なイントラ予測と複雑なイントラ予測が可能である。簡易なイントラ予測とは、例えば、圧縮画像に対するイントラ予測であり、複雑なイントラ予測とは、例えば、非圧縮画像に対するイントラ予測である。イントラ予測部９１は、符号化モード決定部９３内の制御部９９（後述）により制御され、簡易なイントラ予測を行い、符号化コスト導出する。その結果、イントラ予測部９１は、入力画像信号３０を画像ブロック毎にイントラ予測し、イントラ予測結果を切換部９４に出力する。 The intra prediction unit 91 can perform simple intra prediction and complex intra prediction. Simple intra prediction is, for example, intra prediction for a compressed image, and complex intra prediction is, for example, intra prediction for an uncompressed image. The intra prediction unit 91 is controlled by a control unit 99 (described later) in the encoding mode determination unit 93, performs simple intra prediction, and derives an encoding cost. As a result, the intra prediction unit 91 performs intra prediction on the input image signal 30 for each image block, and outputs the intra prediction result to the switching unit 94.

インター予測部９２は、入力画像信号３０を第１の入力とし、ローカルデコード信号３２を第２の入力として、インター予測結果を切換部９４に出力する。さらに、インター予測部９２は、インター予測結果のうち、動きベクトルなど符号化にかかる情報を第２の出力として符号化部５に出力する。 The inter prediction unit 92 outputs the inter prediction result to the switching unit 94 with the input image signal 30 as the first input and the local decode signal 32 as the second input. Further, the inter prediction unit 92 outputs information related to encoding, such as a motion vector, among the inter prediction results to the encoding unit 5 as a second output.

インター予測部９２は、入力画像信号３０を第１の入力、ローカルデコード信号３２を第２の入力とし、動き推定を行う動き推定部９５と、動き推定部９５の出力を第１の入力、ローカルデコード信号３２を第２の入力とし、予測画像を出力する予測画像作成部１１と、入力画像信号３０を第１の入力、予測画像作成部１１の出力を第２の入力とする減算部１２とから構成されている。動き推定部９５は、フルペル・インター予測もしくはサブペル・インター予測を行い、符号化コスト導出する。また、動き推定部９５の出力のうち、動きベクトルや符号化モードなどの符号化情報は、可変長符号化部２２の入力にも与えられる。 The inter prediction unit 92 uses the input image signal 30 as a first input and the local decoded signal 32 as a second input, and performs a motion estimation unit 95 that performs motion estimation, and outputs the motion estimation unit 95 as a first input, A predicted image generating unit 11 that outputs the predicted image by using the decoded signal 32 as a second input; and a subtracting unit 12 that uses the input image signal 30 as a first input and the output of the predicted image generating unit 11 as a second input; It is composed of The motion estimation unit 95 performs full-pel inter prediction or sub-pel inter prediction and derives an encoding cost. Also, of the output of the motion estimation unit 95, the encoding information such as the motion vector and the encoding mode is also given to the input of the variable length encoding unit 22.

切換部９４は、イントラ予測結果を第１の入力、インター予測結果を第２の入力とし、符号化モード決定部９３からの切換信号に従って、いずれかの入力を符号化部５に出力する。
符号化部５及び参照画像作成部６の構造及び機能は前記実施形態と同様であるため、ここでは説明を省略する。 The switching unit 94 uses the intra prediction result as the first input and the inter prediction result as the second input, and outputs one of the inputs to the encoding unit 5 in accordance with the switching signal from the encoding mode determination unit 93.
Since the structures and functions of the encoding unit 5 and the reference image creation unit 6 are the same as those in the above embodiment, description thereof is omitted here.

符号化モード決定部９３は、決定部９６と、制御部９９とを有している。決定部９６は、イントラ／インター選択部９７と、符号化ピクチャ構造決定部９８とを有している。決定部９６は、動き推定部９５からの符号化コストと、イントラ予測部９１からの符号化コストを入力としている。イントラ／インター選択部９７がイントラ／インターを決定する。符号化ピクチャ構造決定部９８がフィールド／フレームを決定する。制御部９９が、イントラ予測部９１または動き推定部９５を制御して、決定された符号化ピクチャ構造の画像ブロックペア７３に動き推定をさせる。つまり、制御部９９は、イントラ予測部９１に複雑なイントラ予測を行わさせ、または動き推定部９５にサブペル・インター予測を行わさせる。制御部９９は、さらに切換部９４を動作させ、イントラ予測結果あるいはインター予測結果を符号化させる。 The encoding mode determination unit 93 includes a determination unit 96 and a control unit 99. The determination unit 96 includes an intra / inter selection unit 97 and an encoded picture structure determination unit 98. The determination unit 96 receives the encoding cost from the motion estimation unit 95 and the encoding cost from the intra prediction unit 91 as inputs. The intra / inter selector 97 determines intra / inter. An encoded picture structure determination unit 98 determines a field / frame. The control unit 99 controls the intra prediction unit 91 or the motion estimation unit 95 to cause the image block pair 73 having the determined coded picture structure to perform motion estimation. That is, the control unit 99 causes the intra prediction unit 91 to perform complex intra prediction or causes the motion estimation unit 95 to perform sub-pel / inter prediction. The control unit 99 further operates the switching unit 94 to encode the intra prediction result or the inter prediction result.

なお、制御部は、エンコーダ９０のどこかに有ればよい。符号化モード決定部９３が備えていなくても良い。
図２０は、画像ブロックペア７３の符号化モード決定の処理動作のフローである。この処理動作は、イントラ予測部９１又は動き推定部９５によって実行される簡易動き推定ステップＳ６１と、イントラ／インター選択部９７によって実行されるイントラ／インター選択ステップＳ６２と、符号化ピクチャ構造決定部９８によって実行される画像ブロックペア７３の符号化ピクチャ構造決定ステップＳ６３とを備えている。なお、符号化ピクチャ構造決定ステップＳ６３の次には、イントラ予測部９１又は動き推定部９５によって実行される複雑動き推定ステップＳ６４とを備えている。 The control unit only needs to be somewhere in the encoder 90. The encoding mode determination unit 93 may not be provided.
FIG. 20 is a flowchart of the processing operation for determining the coding mode of the image block pair 73. This processing operation includes a simple motion estimation step S61 executed by the intra prediction unit 91 or the motion estimation unit 95, an intra / inter selection step S62 executed by the intra / inter selection unit 97, and an encoded picture structure determination unit 98. And a coded picture structure determination step S63 of the image block pair 73 executed by step S63. Note that, after the encoded picture structure determination step S63, a complex motion estimation step S64 executed by the intra prediction unit 91 or the motion estimation unit 95 is provided.

簡易動き推定ステップＳ６１は、フレーム／フィールド構造のトップＭＢ及びボトムＭＢに対して、フルペル・インター予測と簡易イントラ予測を行って、それらの符号化コストを導出する。簡易動き推定ステップＳ６１は、第１〜第８推定ステップＳ６１１〜Ｓ６１８を備えている。第１推定ステップＳ６１１はフレーム構造ブロックペア７７，７８のトップＭＢ７７に対してフルペル・インター予測を行い、第２推定ステップＳ６１２はフレーム構造ブロックペア７７，７８のトップＭＢ７７に対して簡易・イントラ予測を行う。第３推定ステップＳ６１３はフレーム構造ブロックペア７７，７８のボトムＭＢ７８に対してフルペル・インター予測を行い、第４推定ステップＳ６１４はフレーム構造ブロックペア７７，７８のボトムＭＢ７８に対して簡易イントラ予測を行う。第５推定ステップＳ６１５はフィールド構造ブロックペア７５，７６のトップＭＢ７５に対してフルペル・インター予測を行い、第６推定ステップＳ６１６はフィールド構造ブロックペア７５，７６のトップＭＢ７５に対して簡易イントラ予測を行う。第７推定ステップＳ６１７はフィールド構造ブロックペア７５，７６のボトムＭＢ７６に対してフルペル・インター予測を行い、第８推定ステップＳ８１８はフィールド構造ブロックペア７５，７６のボトムＭＢ７６に対して簡易イントラ予測を行う。このように簡易動き推定ステップＳ６１がインター予測とイントラ予測を用いてフレーム構造ブロックペア７７，７８及びフィールド構造ブロックペア７５，７６の符号化コストを導出するため、インター予測又はイントラ予測のいずれかで圧縮率が向上する画像ブロックペア７３（７１，７２）の場合でも、圧縮率が最良となるような符号化ピクチャ構造を決定できる。 The simple motion estimation step S61 performs full-pel inter prediction and simple intra prediction on the top MB and the bottom MB of the frame / field structure, and derives their coding costs. The simple motion estimation step S61 includes first to eighth estimation steps S611 to S618. The first estimation step S611 performs full-pel inter prediction for the top MB 77 of the frame structure block pair 77, 78, and the second estimation step S612 performs simple / intra prediction for the top MB 77 of the frame structure block pair 77, 78. Do. The third estimation step S613 performs full-pel inter prediction on the bottom MB78 of the frame structure block pair 77 and 78, and the fourth estimation step S614 performs simple intra prediction on the bottom MB78 of the frame structure block pair 77 and 78. . The fifth estimation step S615 performs full-pel inter prediction on the top MB 75 of the field structure block pair 75 and 76, and the sixth estimation step S616 performs simple intra prediction on the top MB 75 of the field structure block pair 75 and 76. . The seventh estimation step S617 performs full-pel inter prediction on the bottom MB 76 of the field structure block pair 75, 76, and the eighth estimation step S818 performs simple intra prediction on the bottom MB 76 of the field structure block pair 75, 76. . As described above, since the simple motion estimation step S61 derives the coding costs of the frame structure block pairs 77 and 78 and the field structure block pairs 75 and 76 using inter prediction and intra prediction, either inter prediction or intra prediction is used. Even in the case of the image block pair 73 (71, 72) in which the compression rate is improved, it is possible to determine a coded picture structure that provides the best compression rate.

イントラ／インター選択ステップＳ６２は、（フレーム、フィールド）＊（トップ、ボトム）の４種類それぞれで、イントラ、インターの符号化コストを比較し、小さい方を選択する。
イントラ／インター選択ステップＳ６２は、第１〜第４選択ステップＳ６２１〜ステップＳ６２４を備えている。第１選択ステップＳ６２１は、第１推定ステップＳ６１１及び第２推定ステップＳ６１２の符号化コストを比較して、フレーム構造トップＭＢ７７に対するイントラ／インターを選択する。この場合は、第２推定ステップＳ６１２の符号化コスト(1300)を選択する。第２選択ステップＳ６２２は、第３推定ステップＳ６１３及び第４推定ステップＳ６１４の符号化コストを比較して、フレーム構造ボトムＭＢ７８に対するイントラ／インターを選択する。この場合は、第４推定ステップＳ６１４の符号化コスト(1200)を選択する。イントラ／インターが選択されたフレーム構造トップＭＢ７７の符号化コスト(1300)とボトムＭＢ７８の符号化コスト(1200)は合計され、フレーム構造ブロックペア７７，７８の符号化コスト(2500)が得られる。第３選択ステップＳ６２３は、第５推定ステップＳ６１５及び第６推定ステップＳ６１６の符号化コストを比較して、フィールド構造トップＭＢ７５，７６に対するイントラ／インターを選択する。この場合は、第５推定ステップＳ６１５の符号化コスト(1400)を選択する。第４選択ステップＳ６２４は、第７推定ステップＳ６１７及び第８推定ステップＳ６１８の符号化コストを比較して、フィールド構造ボトムＭＢ７６に対するイントラ／インターを選択する。この場合は、第７推定ステップＳ６１７の符号化コスト(1300)を選択する。イントラ／インターが選択されたフィールド構造トップＭＢ７５の符号化コスト(1400)とボトムＭＢ７６の符号化コスト(1300)は合計され、フィールド構造ブロックペア７５，７６の符号化コスト(2700)が得られる。 The intra / inter selection step S62 compares the intra and inter coding costs for each of the four types (frame, field) * (top, bottom), and selects the smaller one.
The intra / inter selection step S62 includes first to fourth selection steps S621 to S624. The first selection step S621 compares the coding costs of the first estimation step S611 and the second estimation step S612, and selects intra / inter for the frame structure top MB77. In this case, the encoding cost (1300) of the second estimation step S612 is selected. The second selection step S622 compares the coding costs of the third estimation step S613 and the fourth estimation step S614, and selects intra / inter for the frame structure bottom MB78. In this case, the encoding cost (1200) of the fourth estimation step S614 is selected. The coding cost (1300) of the frame structure top MB 77 for which intra / inter is selected and the coding cost (1200) of the bottom MB 78 are summed, and the coding cost (2500) of the frame structure block pair 77 and 78 is obtained. The third selection step S623 compares the coding costs of the fifth estimation step S615 and the sixth estimation step S616, and selects intra / inter for the field structure top MBs 75 and 76. In this case, the encoding cost (1400) of the fifth estimation step S615 is selected. The fourth selection step S624 compares the coding costs of the seventh estimation step S617 and the eighth estimation step S618 and selects intra / inter for the field structure bottom MB76. In this case, the encoding cost (1300) of the seventh estimation step S617 is selected. The coding cost (1400) of the field structure top MB 75 for which intra / inter is selected and the coding cost (1300) of the bottom MB 76 are summed, and the coding cost (2700) of the field structure block pair 75 and 76 is obtained.

符号化ピクチャ構造決定ステップＳ６３は、フレーム構造ブロックペア７７，７８の符号化コストとフィールド構造ブロックペア７５，７６の符号化コストとを比較し、画像ブロックペア７３のフィールド／フレームを決定する。この場合は、フレーム構造ブロックペア７７，７８の符号化コスト(2500)はフィールド構造ブロックペア７５，７６の符号化コスト(2700)より小さいため、フレーム構造ブロックペア７７，７８が選択される。 The coded picture structure determination step S63 compares the coding cost of the frame structure block pair 77 and 78 with the coding cost of the field structure block pair 75 and 76 to determine the field / frame of the image block pair 73. In this case, since the coding cost (2500) of the frame structure block pair 77, 78 is smaller than the coding cost (2700) of the field structure block pair 75, 76, the frame structure block pair 77, 78 is selected.

複雑動き推定ステップＳ６４は、決定された符号化ピクチャ構造の画像ブロックペア７３のトップＭＢ及びボトムＭＢ７７，７８それぞれに対して、複雑な動き推定（サブペル／インター又は複雑イントラの一方）を行う。複雑動き推定ステップＳ６４は、第１〜第４推定ステップＳ６４１〜Ｓ６４４を備えている。第１推定ステップＳ６４１は、トップＭＢ７７に対するサブペル・インター予測を行う。第２推定ステップＳ６４２は、トップＭＢ７７に対する複雑イントラ予測を行う。なお、第１推定ステップＳ６４１と第２推定ステップＳ６４２はいずれか一方のみが実行される。第３推定ステップＳ６４３は、ボトムＭＢ７８に対するサブペル・インター予測を行う。第４推定ステップＳ６４４は、ボトムＭＢ７８に対する複雑イントラ予測を行う。なお、第３推定ステップＳ６４３と第４推定ステップＳ６４４はいずれか一方のみが実行される。 In the complex motion estimation step S64, complex motion estimation (one of sub-pel / inter or complex intra) is performed for each of the top MB and the bottom MB 77 and 78 of the image block pair 73 having the determined coded picture structure. The complex motion estimation step S64 includes first to fourth estimation steps S641 to S644. The first estimation step S641 performs sub-pel inter prediction for the top MB77. In the second estimation step S642, complex intra prediction for the top MB 77 is performed. Note that only one of the first estimation step S641 and the second estimation step S642 is executed. In the third estimation step S643, sub-pel inter prediction is performed on the bottom MB78. The fourth estimation step S644 performs complex intra prediction for the bottom MB78. Note that only one of the third estimation step S643 and the fourth estimation step S644 is executed.

以上に述べたように、簡易動き推定ステップＳ６１の簡易なインター予測・イントラ予測に基づいて、符号化ピクチャ構造決定ステップＳ６３によって、符号化モード（具体的には、符号化ピクチャ構造）を決定している。このため、符号化モードを決定するための処理量を軽減することが可能となる。 As described above, the encoding mode (specifically, the encoded picture structure) is determined by the encoded picture structure determination step S63 based on the simple inter prediction / intra prediction in the simple motion estimation step S61. ing. For this reason, it is possible to reduce the processing amount for determining the encoding mode.

また、符号化モードが決定された後に複雑動き推定ステップＳ６４が複雑な動き推定を行っている。このように、複雑な予測によって画像ブロックペア７３の符号化を行うため、圧縮効率が向上する。しかも、ここでは、決定された符号化ピクチャ構造の画像ブロックペア７３に対してのみ複雑な予測を行うため、従来より複雑な予測の回数を減らすことができる。この結果、符号化効率を維持しながらも処理量を減らすることができる。 Further, after the coding mode is determined, the complex motion estimation step S64 performs complex motion estimation. Thus, since the image block pair 73 is encoded by complicated prediction, the compression efficiency is improved. In addition, since complicated prediction is performed only for the image block pair 73 having the determined coded picture structure, the number of times of complicated prediction can be reduced as compared with the conventional case. As a result, the processing amount can be reduced while maintaining the encoding efficiency.

なお、トップＭＢとボトムＭＢで異なる符号化ピクチャ構造で符号化されることは無いが、異なる符号化予測方式（イントラ／インター）で符号化されることはある。
なお、第３実施形態では、第１〜第２実施形態で記載した内容を適宜変形して適用可能である。
［第４実施形態］
さらにここで、上記実施の形態で示した動画像符号化装置の応用例とそれを用いたシステムを説明する。 The top MB and the bottom MB are not encoded with different encoded picture structures, but may be encoded with different encoding prediction schemes (intra / inter).
In the third embodiment, the contents described in the first and second embodiments can be appropriately modified and applied.
[Fourth Embodiment]
Furthermore, application examples of the moving picture coding apparatus shown in the above embodiment and a system using the same will be described.

図２１は、コンテンツ配信サービスを実現するコンテンツ供給システムex１００の全体構成を示すブロック図である。通信サービスの提供エリアを所望の大きさに分割し、各セル内にそれぞれ固定無線局である基地局ex１０７〜ex１１０が設置されている。
このコンテンツ供給システムex１００は、例えば、インターネットex１０１にインターネットサービスプロバイダex１０２および電話網ex１０４、および基地局ｅｘ１０７〜ｅｘ１１０を介して、コンピュータex１１１、ＰＤＡ（personal digital assistant）ex１１２、カメラex１１３、携帯電話ex１１４、カメラ付きの携帯電話ｅｘ１１５などの各機器が接続される。 FIG. 21 is a block diagram showing an overall configuration of a content supply system ex100 that implements a content distribution service. The communication service providing area is divided into desired sizes, and base stations ex107 to ex110, which are fixed radio stations, are installed in each cell.
The content supply system ex100 includes, for example, a computer ex111, a PDA (personal digital assistant) ex112, a camera ex113, a mobile phone ex114, a camera via the Internet ex101, the Internet service provider ex102, the telephone network ex104, and the base stations ex107 to ex110. Each device such as the attached mobile phone ex115 is connected.

しかし、コンテンツ供給システムex１００は図２１のような組合せに限定されず、いずれかを組み合わせて接続するようにしてもよい。また、固定無線局である基地局ex１０７〜ex１１０を介さずに、各機器が電話網ex１０４に直接接続されてもよい。
カメラex１１３はデジタルビデオカメラ等の動画撮影が可能な機器である。また、携帯電話は、ＰＤＣ（Personal Digital Communications）方式、ＣＤＭＡ（Code Division Multiple Access）方式、Ｗ−ＣＤＭＡ（Wideband-Code Division Multiple Access）方式、若しくはＧＳＭ（Global System for Mobile Communications）方式の携帯電話機、またはＰＨＳ（Personal Handyphone System）等であり、いずれでも構わない。 However, the content supply system ex100 is not limited to the combination as shown in FIG. 21, and may be connected in any combination. Further, each device may be directly connected to the telephone network ex104 without going through the base stations ex107 to ex110 which are fixed wireless stations.
The camera ex113 is a device capable of shooting a moving image such as a digital video camera. The mobile phone is a PDC (Personal Digital Communications) system, a CDMA (Code Division Multiple Access) system, a W-CDMA (Wideband-Code Division Multiple Access) system, or a GSM (Global System for Mobile Communications) system mobile phone, Alternatively, PHS (Personal Handyphone System) or the like may be used.

また、ストリーミングサーバex１０３は、カメラex１１３から基地局ex１０９、電話網ex１０４を通じて接続されており、カメラex１１３を用いてユーザが送信する符号化処理されたデータに基づいたライブ配信等が可能になる。撮影したデータの符号化処理はカメラex１１３で行っても、データの送信処理をするサーバ等で行ってもよい。また、カメラ１１６で撮影した動画データはコンピュータex１１１を介してストリーミングサーバex１０３に送信されてもよい。カメラex１１６はデジタルカメラ等の静止画、動画が撮影可能な機器である。この場合、動画データの符号化はカメラex１１６で行ってもコンピュータex１１１で行ってもどちらでもよい。また、符号化処理はコンピュータex１１１やカメラex１１６が有するＬＳＩex１１７において処理することになる。なお、画像符号化・復号化用のソフトウェアをコンピュータex１１１等で読み取り可能な記録媒体である何らかの蓄積メディア（ＣＤ−ＲＯＭ、フレキシブルディスク、ハードディスクなど）に組み込んでもよい。さらに、カメラ付きの携帯電話ex１１５で動画データを送信してもよい。このときの動画データは携帯電話ex１１５が有するＬＳＩで符号化処理されたデータである。 In addition, the streaming server ex103 is connected from the camera ex113 through the base station ex109 and the telephone network ex104, and live distribution or the like based on the encoded data transmitted by the user using the camera ex113 becomes possible. The encoded processing of the captured data may be performed by the camera ex113 or may be performed by a server or the like that performs data transmission processing. Further, the moving image data shot by the camera 116 may be transmitted to the streaming server ex103 via the computer ex111. The camera ex116 is a device that can shoot still images and moving images, such as a digital camera. In this case, the encoding of the moving image data may be performed by the camera ex116 or the computer ex111. The encoding process is performed in the LSI ex117 included in the computer ex111 and the camera ex116. Note that image encoding / decoding software may be incorporated into any storage medium (CD-ROM, flexible disk, hard disk, etc.) that is a recording medium readable by the computer ex111 or the like. Furthermore, you may transmit moving image data with the mobile telephone ex115 with a camera. The moving image data at this time is data encoded by the LSI included in the mobile phone ex115.

このコンテンツ供給システムex１００では、ユーザがカメラex１１３、カメラex１１６等で撮影しているコンテンツ（例えば、音楽ライブを撮影した映像等）を上記実施の形態同様に符号化処理してストリーミングサーバex１０３に送信する一方で、ストリーミングサーバex１０３は要求のあったクライアントに対して上記コンテンツデータをストリーム配信する。クライアントとしては、上記符号化処理されたデータを復号化することが可能な、コンピュータex１１１、ＰＤＡex１１２、カメラex１１３、携帯電話ex１１４等がある。このようにすることでコンテンツ供給システムex１００は、符号化されたデータをクライアントにおいて受信して再生することができ、さらにクライアントにおいてリアルタイムで受信して復号化し、再生することにより、個人放送をも実現可能になるシステムである。 In this content supply system ex100, the content (for example, video shot of music live) captured by the user with the camera ex113, camera ex116, etc. is encoded and transmitted to the streaming server ex103 as in the above embodiment. On the other hand, the streaming server ex103 distributes the content data to the requested client. Examples of the client include a computer ex111, a PDA ex112, a camera ex113, a mobile phone ex114, and the like that can decode the encoded data. In this way, the content supply system ex100 can receive and reproduce the encoded data at the client, and also realize personal broadcasting by receiving, decoding, and reproducing in real time at the client. It is a system that becomes possible.

このシステムを構成する各機器の符号化には上記各実施の形態で示した動画像符号化装置を用いるようにすればよい。
その一例として携帯電話について説明する。
図２２は、上記実施の形態で説明した動画像符号化装置を用いた携帯電話ex１１５を示す図である。携帯電話ex１１５は、基地局ex１１０との間で電波を送受信するためのアンテナex２０１、ＣＣＤカメラ等の映像、静止画を撮ることが可能なカメラ部ex２０３、カメラ部ex２０３で撮影した映像、アンテナex２０１で受信した映像等が復号化されたデータを表示する液晶ディスプレイ等の表示部ex２０２、操作キーｅｘ２０４群から構成される本体部、音声出力をするためのスピーカ等の音声出力部ex２０８、音声入力をするためのマイク等の音声入力部ex２０５、撮影した動画もしくは静止画のデータ、受信したメールのデータ、動画のデータもしくは静止画のデータ等、符号化されたデータまたは復号化されたデータを保存するための記録メディアex２０７、携帯電話ex１１５に記録メディアex２０７を装着可能とするためのスロット部ex２０６を有している。記録メディアex２０７はＳＤカード等のプラスチックケース内に電気的に書換えや消去が可能な不揮発性メモリであるＥＥＰＲＯＭ（Electrically Erasable and Programmable Read Only Memory）の一種であるフラッシュメモリ素子を格納したものである。 The moving picture coding apparatus described in each of the above embodiments may be used for coding of each device constituting this system.
A mobile phone will be described as an example.
FIG. 22 is a diagram illustrating the mobile phone ex115 using the video encoding device described in the above embodiment. The cellular phone ex115 includes an antenna ex201 for transmitting and receiving radio waves to and from the base station ex110, a camera such as a CCD camera, a camera unit ex203 capable of taking a still image, a video shot by the camera unit ex203, and an antenna ex201. A display unit ex202 such as a liquid crystal display that displays data obtained by decoding received video and the like, a main body unit composed of a group of operation keys ex204, an audio output unit ex208 such as a speaker for audio output, and audio input To store encoded data or decoded data such as a voice input unit ex205 such as a microphone, captured video or still image data, received mail data, video data or still image data, etc. Recording medium ex207, and slot portion ex20 for enabling recording medium ex207 to be attached to mobile phone ex115 The has. The recording medium ex207 stores a flash memory element which is a kind of EEPROM (Electrically Erasable and Programmable Read Only Memory) which is a nonvolatile memory that can be electrically rewritten and erased in a plastic case such as an SD card.

さらに、携帯電話ex１１５について図２３を用いて説明する。携帯電話ex１１５は表示部ex２０２及び操作キーｅｘ２０４を備えた本体部の各部を統括的に制御するようになされた主制御部ex３１１に対して、電源回路部ex３１０、操作入力制御部ex３０４、画像符号化部ex３１２、カメラインターフェース部ex３０３、ＬＣＤ（Liquid Crystal Display）制御部ex３０２、画像復号化部ex３０９、多重分離部ex３０８、記録再生部ex３０７、変復調回路部ex３０６及び音声処理部ex３０５が同期バスex３１３を介して互いに接続されている。 Further, the cellular phone ex115 will be described with reference to FIG. The cellular phone ex115 controls the power supply circuit ex310, the operation input control unit ex304, and the image coding for the main control unit ex311 which is configured to control the respective units of the main body unit including the display unit ex202 and the operation key ex204. Unit ex312, camera interface unit ex303, LCD (Liquid Crystal Display) control unit ex302, image decoding unit ex309, demultiplexing unit ex308, recording / reproducing unit ex307, modulation / demodulation circuit unit ex306, and audio processing unit ex305 via a synchronization bus ex313 Are connected to each other.

電源回路部ex３１０は、ユーザの操作により終話及び電源キーがオン状態にされると、バッテリパックから各部に対して電力を供給することによりカメラ付ディジタル携帯電話ex１１５を動作可能な状態に起動する。
携帯電話ex１１５は、ＣＰＵ、ＲＯＭ及びＲＡＭ等でなる主制御部ex３１１の制御に基づいて、音声通話モード時に音声入力部ex２０５で集音した音声信号を音声処理部ex３０５によってディジタル音声データに変換し、これを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して送信する。また携帯電話機ex１１５は、音声通話モード時にアンテナex２０１で受信した受信信号を増幅して周波数変換処理及びアナログディジタル変換処理を施し、変復調回路部ex３０６でスペクトラム逆拡散処理し、音声処理部ex３０５によってアナログ音声信号に変換した後、これを音声出力部ｅｘ２０８を介して出力する。 When the end call and power key are turned on by a user operation, the power supply circuit ex310 activates the camera-equipped digital mobile phone ex115 by supplying power from the battery pack to each unit. .
The mobile phone ex115 converts the voice signal collected by the voice input unit ex205 in the voice call mode into digital voice data by the voice processing unit ex305 based on the control of the main control unit ex311 including a CPU, a ROM, a RAM, and the like. The modulation / demodulation circuit unit ex306 performs spread spectrum processing, and the transmission / reception circuit unit ex301 performs digital analog conversion processing and frequency conversion processing, and then transmits the result via the antenna ex201. In addition, the cellular phone ex115 amplifies the received signal received by the antenna ex201 in the voice call mode, performs frequency conversion processing and analog-digital conversion processing, performs spectrum despreading processing by the modulation / demodulation circuit unit ex306, and analog audio by the voice processing unit ex305. After conversion into a signal, this is output via the audio output unit ex208.

さらに、データ通信モード時に電子メールを送信する場合、本体部の操作キーｅｘ２０４の操作によって入力された電子メールのテキストデータは操作入力制御部ex３０４を介して主制御部ex３１１に送出される。主制御部ex３１１は、テキストデータを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して基地局ex１１０へ送信する。 Further, when an e-mail is transmitted in the data communication mode, text data of the e-mail input by operating the operation key ex204 of the main body is sent to the main control unit ex311 via the operation input control unit ex304. The main control unit ex311 performs spread spectrum processing on the text data in the modulation / demodulation circuit unit ex306, performs digital analog conversion processing and frequency conversion processing in the transmission / reception circuit unit ex301, and then transmits the text data to the base station ex110 via the antenna ex201.

データ通信モード時に画像データを送信する場合、カメラ部ex２０３で撮像された画像データをカメラインターフェース部ex３０３を介して画像符号化部ex３１２に供給する。また、画像データを送信しない場合には、カメラ部ex２０３で撮像した画像データをカメラインターフェース部ex３０３及びＬＣＤ制御部ex３０２を介して表示部ex２０２に直接表示することも可能である。 When transmitting image data in the data communication mode, the image data captured by the camera unit ex203 is supplied to the image encoding unit ex312 via the camera interface unit ex303. When image data is not transmitted, the image data captured by the camera unit ex203 can be directly displayed on the display unit ex202 via the camera interface unit ex303 and the LCD control unit ex302.

画像符号化部ex３１２は、本願発明で説明した画像符号化装置を備えた構成であり、カメラ部ex２０３から供給された画像データを上記実施の形態で示した画像符号化装置に用いた符号化方法によって圧縮符号化することにより符号化画像データに変換し、これを多重分離部ex３０８に送出する。また、このとき同時に携帯電話機ex１１５は、カメラ部ex２０３で撮像中に音声入力部ex２０５で集音した音声を音声処理部ex３０５を介してディジタルの音声データとして多重分離部ex３０８に送出する。 The image encoding unit ex312 has a configuration including the image encoding device described in the present invention, and an encoding method using the image data supplied from the camera unit ex203 in the image encoding device described in the above embodiment. The encoded image data is converted into encoded image data by compression encoding, and sent to the demultiplexing unit ex308. At the same time, the cellular phone ex115 sends the sound collected by the audio input unit ex205 during imaging by the camera unit ex203 to the demultiplexing unit ex308 as digital audio data via the audio processing unit ex305.

多重分離部ex３０８は、画像符号化部ex３１２から供給された符号化画像データと音声処理部ex３０５から供給された音声データとを所定の方式で多重化し、その結果得られる多重化データを変復調回路部ex３０６でスペクトラム拡散処理し、送受信回路部ex３０１でディジタルアナログ変換処理及び周波数変換処理を施した後にアンテナex２０１を介して送信する。 The demultiplexing unit ex308 multiplexes the encoded image data supplied from the image encoding unit ex312 and the audio data supplied from the audio processing unit ex305 by a predetermined method, and the multiplexed data obtained as a result is a modulation / demodulation circuit unit A spectrum spread process is performed in ex306, a digital analog conversion process and a frequency conversion process are performed in the transmission / reception circuit unit ex301, and then the signal is transmitted through the antenna ex201.

データ通信モード時にホームページ等にリンクされた動画像ファイルのデータを受信する場合、アンテナex２０１を介して基地局ex１１０から受信した受信信号を変復調回路部ex３０６でスペクトラム逆拡散処理し、その結果得られる多重化データを多重分離部ex３０８に送出する。
また、アンテナex２０１を介して受信された多重化データを復号化するには、多重分離部ex３０８は、多重化データを分離することにより画像データの符号化ビットストリームと音声データの符号化ビットストリームとに分け、同期バスex３１３を介して当該符号化画像データを画像復号化部ex３０９に供給すると共に当該音声データを音声処理部ex３０５に供給する。 When receiving data of a moving image file linked to a home page or the like in the data communication mode, the received signal received from the base station ex110 via the antenna ex201 is subjected to spectrum despreading processing by the modulation / demodulation circuit unit ex306, and the resulting multiplexing is obtained. Is sent to the demultiplexing unit ex308.
In addition, in order to decode the multiplexed data received via the antenna ex201, the demultiplexing unit ex308 separates the multiplexed data to generate an encoded bitstream of image data and an encoded bitstream of audio data. The encoded image data is supplied to the image decoding unit ex309 via the synchronization bus ex313, and the audio data is supplied to the audio processing unit ex305.

次に、画像復号化部ex３０９は、画像データの符号化ビットストリームを上記実施の形態で示した符号化方法に対応した復号化方法で復号することにより再生動画像データを生成し、これをＬＣＤ制御部ex３０２を介して表示部ex２０２に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まれる動画データが表示される。このとき同時に音声処理部ex３０５は、音声データをアナログ音声信号に変換した後、これを音声出力部ex２０８に供給し、これにより、例えばホームページにリンクされた動画像ファイルに含まる音声データが再生される。 Next, the image decoding unit ex309 generates reproduction moving image data by decoding the encoded bit stream of the image data with a decoding method corresponding to the encoding method described in the above embodiment, and generates the reproduced moving image data. This is supplied to the display unit ex202 via the control unit ex302, and thereby, for example, moving image data included in a moving image file linked to a home page is displayed. At the same time, the audio processing unit ex305 converts the audio data into an analog audio signal, and then supplies the analog audio signal to the audio output unit ex208. Thus, for example, the audio data included in the moving image file linked to the home page is reproduced. The

なお、上記システムの例に限られず、最近は衛星、地上波によるディジタル放送が話題となっており、図２４に示すようにディジタル放送用システムにも上記実施の形態の画像符号化装置を組み込むことができる。具体的には、放送局ex４０９では映像情報の符号化ビットストリームが電波を介して通信または放送衛星ex４１０に伝送される。これを受けた放送衛星ex４１０は、放送用の電波を発信し、この電波を衛星放送受信設備をもつ家庭のアンテナex４０６で受信し、テレビ（受信機）ex４０１またはセットトップボックス（ＳＴＢ）ex４０７などの装置により符号化ビットストリームを復号化してこれを再生する。また、記録媒体であるCDやDVD等の蓄積メディアex４０２に記録した符号化ビットストリームを読み取り、復号化する再生装置ex４０３にも上記実施の形態で示した画像復号化装置を実装することが可能である。この場合、再生された映像信号はモニタex４０４に表示される。また、ケーブルテレビ用のケーブルex４０５または衛星／地上波放送のアンテナex４０６に接続されたセットトップボックスex４０７内に画像復号化装置を実装し、これをテレビのモニタex４０８で再生する構成も考えられる。このときセットトップボックスではなく、テレビ内に画像復号化装置を組み込んでも良い。また、アンテナex４１１を有する車ex４１２で衛星ex４１０からまたは基地局ex１０７等から信号を受信し、車ex４１２が有するカーナビゲーションex４１３等の表示装置に動画を再生することも可能である。 It should be noted that the present invention is not limited to the above-described system, and recently, digital broadcasting using satellites and terrestrial waves has become a hot topic, and as shown in FIG. Can do. Specifically, in the broadcasting station ex409, the encoded bit stream of the video information is transmitted to the communication or broadcasting satellite ex410 via radio waves. Receiving this, the broadcasting satellite ex410 transmits a radio wave for broadcasting, and receives the radio wave with a home antenna ex406 having a satellite broadcasting receiving facility, such as a television (receiver) ex401 or a set top box (STB) ex407. The device decodes the encoded bit stream and reproduces it. In addition, the image decoding apparatus described in the above embodiment can also be mounted on a playback apparatus ex403 that reads and decodes an encoded bitstream recorded on a storage medium ex402 such as a CD or DVD as a recording medium. is there. In this case, the reproduced video signal is displayed on the monitor ex404. Further, a configuration in which an image decoding device is mounted in a set-top box ex407 connected to a cable ex405 for cable television or an antenna ex406 for satellite / terrestrial broadcasting, and this is reproduced on the monitor ex408 of the television is also conceivable. At this time, the image decoding apparatus may be incorporated in the television instead of the set top box. It is also possible to receive a signal from the satellite ex410 or the base station ex107 by the car ex412 having the antenna ex411 and reproduce a moving image on a display device such as the car navigation ex413 that the car ex412 has.

更に、画像信号を上記実施の形態で示した画像符号化装置で符号化し、記録媒体に記録することもできる。具体例としては、DVDディスクｅｘ４２１に画像信号を記録するDVDレコーダや、ハードディスクに記録するディスクレコーダなどのレコーダｅx４２０がある。更にSDカードｅｘ４２２に記録することもできる。レコーダｅｘ４２０が上記実施の形態で示した画像復号化装置を備えていれば、DVDディスクｅｘ４２１やSDカードｅｘ４２２に記録した画像信号を再生し、モニタｅｘ４０８で表示することができる。 Further, the image signal can be encoded by the image encoding device shown in the above embodiment and recorded on a recording medium. As a specific example, there is a recorder ex420 such as a DVD recorder that records an image signal on a DVD disk ex421 or a disk recorder that records on a hard disk. Further, it can be recorded on the SD card ex422. If the recorder ex420 includes the image decoding device described in the above embodiment, the image signal recorded on the DVD disc ex421 or the SD card ex422 can be reproduced and displayed on the monitor ex408.

なお、カーナビゲーションex４１３の構成は例えば図２３に示す構成のうち、カメラ部ex２０３とカメラインターフェース部ex３０３、画像符号化部ｅｘ３１２を除いた構成が考えられ、同様なことがコンピュータex１１１やテレビ（受信機）ex４０１等でも考えられる。 For example, the configuration of the car navigation ex413 may be a configuration excluding the camera unit ex203, the camera interface unit ex303, and the image encoding unit ex312 in the configuration illustrated in FIG. 23. The same applies to the computer ex111 and the television (receiver). ) Ex401 can also be considered.

また、上記携帯電話ex１１４等の端末は、符号化器・復号化器を両方持つ送受信型の端末の他に、符号化器のみの送信端末、復号化器のみの受信端末の３通りの実装形式が考えられる。
このように、上記実施の形態で示した動画像符号化装置を上述したいずれの機器・システムに用いることは可能であり、そうすることで、上記実施の形態で説明した効果を得ることができる。 In addition to the transmission / reception type terminal having both the encoder and the decoder, the terminal such as the mobile phone ex114 has three mounting formats: a transmitting terminal having only an encoder and a receiving terminal having only a decoder. Can be considered.
As described above, the moving picture coding apparatus described in the above embodiment can be used in any of the above-described devices and systems, and by doing so, the effects described in the above embodiment can be obtained. .

［全実施形態共通の変形例］
（１）
前記実施形態では、１６×１６のマクロブロックを各分割候補によって分割したマクロブロックパーティションを小ブロックとして動き推定の単位として扱ってきた。この場合、図２５に示すように、８×８の分割方法で得られた小ブロックをさらに８×８、８×４、４×８，４×４のサブマクロブロックパーティションに分割することができ、このサブマクロブロックパーティションを本発明の小ブロックとして本発明を適用できる。 [Modifications common to all embodiments]
(1)
In the above embodiment, a macroblock partition obtained by dividing a 16 × 16 macroblock by each division candidate has been treated as a unit of motion estimation as a small block. In this case, as shown in FIG. 25, the small block obtained by the 8 × 8 division method can be further divided into 8 × 8, 8 × 4, 4 × 8, and 4 × 4 sub-macroblock partitions. Therefore, the present invention can be applied by using this sub-macroblock partition as a small block of the present invention.

（２）
ブロック図（例えば、図１、図１６、図１９、図２３など）の各機能ブロックは典型的には集積回路であるＬＳＩとして実現される。これらは個別に１チップ化されても良いし、一部又は全てを含むように１チップ化されても良い。 (2)
Each functional block in the block diagrams (for example, FIG. 1, FIG. 16, FIG. 19, FIG. 23, etc.) is typically implemented as an LSI that is an integrated circuit. These may be individually made into one chip, or may be made into one chip so as to include a part or all of them.

より具体的には、図１の動き推定部１０は、１チップ化されてもよい。さらに、図１のメモリ２６以外の機能ブロックが１チップ化されていてもよい。また、図１６のインター予測部３と符号化モード決定部６３とイントラ予測部６１とが１チップ化されてもよい。さらに図１６のメモリ２６以外の機能ブロックが１チップ化されていてもよい。また、図１９のインター予測部９２と符号化モード決定部９３とが１チップ化されていてもよい。さらに、図１９のメモリ以外の機能ブロックが１チップ化されていてもよい。 More specifically, the motion estimation unit 10 in FIG. 1 may be integrated into one chip. Furthermore, functional blocks other than the memory 26 in FIG. 1 may be integrated into one chip. Further, the inter prediction unit 3, the coding mode determination unit 63, and the intra prediction unit 61 in FIG. 16 may be integrated into one chip. Furthermore, functional blocks other than the memory 26 in FIG. 16 may be integrated into one chip. Further, the inter prediction unit 92 and the encoding mode determination unit 93 in FIG. 19 may be integrated into one chip. Furthermore, functional blocks other than the memory of FIG. 19 may be integrated into one chip.

なおここでは、ＬＳＩとしたが、集積度の違いにより、ＩＣ、システムＬＳＩ、スーパーＬＳＩ、ウルトラＬＳＩと呼称されることもある。
また、集積回路化の手法はＬＳＩに限るものではなく、専用回路又は汎用プロセサで実現してもよい。ＬＳＩ製造後に、プログラムすることが可能なＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）や、ＬＳＩ内部の回路セルの接続や設定を再構成可能なリコンフィギュラブル・プロセッサーを利用しても良い。 Note that the name used here is LSI, but it may also be called IC, system LSI, super LSI, or ultra LSI depending on the degree of integration.
Further, the method of circuit integration is not limited to LSI, and implementation with a dedicated circuit or a general-purpose processor is also possible. An FPGA (Field Programmable Gate Array) that can be programmed after manufacturing the LSI or a reconfigurable processor that can reconfigure the connection and setting of circuit cells inside the LSI may be used.

さらには、半導体技術の進歩又は派生する別技術によりＬＳＩに置き換わる集積回路化の技術が登場すれば、当然、その技術を用いて機能ブロックの集積化を行ってもよい。バイオ技術の適応等が可能性としてありえる。 Further, if integrated circuit technology comes out to replace LSI's as a result of the advancement of semiconductor technology or a derivative other technology, it is naturally also possible to carry out function block integration using this technology. Biotechnology can be applied.

本発明に係る符号化モード決定装置、画像符号化装置、符号化モード決定方法、および符号化モード決定プログラムにより、より少ない処理量で適切な符号化モードの選択が可能となり、上記分野において有用である。 The encoding mode determination device, the image encoding device, the encoding mode determination method, and the encoding mode determination program according to the present invention enable selection of an appropriate encoding mode with a smaller amount of processing, which is useful in the above field. is there.

本発明の一実施形態に係る画像符号化装置の構成図。The block diagram of the image coding apparatus which concerns on one Embodiment of this invention. 本発明に係る動き推定部の処理フローチャート。The processing flowchart of the motion estimation part which concerns on this invention. 本発明に係る動き推定部の処理フローを示す図。The figure which shows the processing flow of the motion estimation part which concerns on this invention. 本発明に係る分割候補選択部による分割候補の選択方法を示す図。The figure which shows the selection method of the division candidate by the division candidate selection part which concerns on this invention. フルペル予測部の処理フローを示す図。The figure which shows the processing flow of a full pel prediction part. フルペル予測部の処理フローを示す図。The figure which shows the processing flow of a full pel prediction part. フルペル予測部の処理フローの変形例を示す図。The figure which shows the modification of the processing flow of a full pel prediction part. フルペル予測部の処理フローの変形例を示す図。The figure which shows the modification of the processing flow of a full pel prediction part. フルペル予測部及び分割候補選択部の処理フローの変形例を示す図。The figure which shows the modification of the processing flow of a full pel prediction part and a division | segmentation candidate selection part. 符号化コスト換算部と分割候補選択部による符号化コスト換算及び分割候補の選択方法を示す図。The figure which shows the encoding cost conversion by the encoding cost conversion part and a division | segmentation candidate selection part, and the selection method of a division | segmentation candidate. フルペル予測部と符号化コスト換算部の処理フローの変形例を示す図。The figure which shows the modification of the processing flow of a full pel prediction part and an encoding cost conversion part. 本発明の第１実施形態にかかる処理フローチャート。The processing flowchart concerning a 1st embodiment of the present invention. 本発明の第１実施形態に係るサブペル予測部の処理フローチャート。The processing flowchart of the sub pel prediction part which concerns on 1st Embodiment of this invention. サブペル予測の一処理量配分例を示す図。The figure which shows the example of 1 processing amount distribution of subpel prediction. サブペル予測の一処理量配分例を示す図。The figure which shows the example of 1 processing amount distribution of subpel prediction. 本発明の第２実施形態に係る画像符号化装置の構成図。The block diagram of the image coding apparatus which concerns on 2nd Embodiment of this invention. イントラ予測部、動き推定部、符号化モード決定部の処理フローを示す図。The figure which shows the processing flow of an intra estimation part, a motion estimation part, and an encoding mode determination part. 本発明の第２実施形態にかかる処理フローチャート。The process flowchart concerning 2nd Embodiment of this invention. 本発明の第３実施形態に係る画像符号化装置の構成図。The block diagram of the image coding apparatus which concerns on 3rd Embodiment of this invention. イントラ予測部、動き推定部、符号化モード決定部の処理フローを示す図。The figure which shows the processing flow of an intra estimation part, a motion estimation part, and an encoding mode determination part. コンテンツ供給システムの全体構成を示すブロック図。The block diagram which shows the whole structure of a content supply system. 動画像符号化方法、動画像復号化方法を用いた携帯電話の例。The example of the mobile phone using the moving image encoding method and the moving image decoding method. 携帯電話のブロック図。A block diagram of a mobile phone. ディジタル放送用システムの例。An example of a system for digital broadcasting. 従来のマクロブロックの分割方法候補を示す図。The figure which shows the division | segmentation method candidate of the conventional macroblock. 従来のマクロブロックの分割方法候補による符号化ピクチャと参照ピクチャとの関係を示す図。The figure which shows the relationship between the coding picture by the division | segmentation method candidate of the conventional macroblock, and a reference picture. 従来のマクロブロックの予測方向を示す図。The figure which shows the prediction direction of the conventional macroblock. 従来の動き推定の処理フローを示す図。The figure which shows the processing flow of the conventional motion estimation. 従来の動き推定の処理フローを示す図。The figure which shows the processing flow of the conventional motion estimation. ＭＰＥＧ−４ＡＶＣにおける画像ブロックペアの概念を説明するための図。The figure for demonstrating the concept of the image block pair in MPEG-4AVC. 従来の符号化ピクチャ構造決定及び符号化予測方式決定の処理フローを示す図。The figure which shows the processing flow of the conventional encoding picture structure determination and encoding prediction system determination. 従来技術ではないが、ＭＰＥＧ−４ＡＶＣに従来の技術を適用したと想定した場合の符号化ピクチャ構造決定及び符号化予測方式決定の処理フローを示す図。The figure which shows the process flow of the encoding picture structure determination and encoding prediction system determination at the time of assuming that the prior art is applied to MPEG-4AVC although it is not a prior art.

Explanation of symbols

１エンコーダ
２イントラ予測部
３インター予測部
４切換部
１０動き推定部
１３フルペル予測部
１４分割方法候補選択部
１５サブペル予測部
１６分割方法決定部
６０エンコーダ
６１イントラ予測部
６２インター予測部
６３符号化モード決定部
６４切換部
６５動き推定部
６７符号化ピクチャ構造決定部
６８イントラ／インター選択部
９１イントラ予測部
９２インター予測部
９３符号化モード決定部
９４切換部
９５動き推定部
９６決定部
９７イントラ／インター選択部
９８符号化ピクチャ構造決定部
９９制御部
1 Encoder 2 Intra Prediction Unit 3 Inter Prediction Unit 4 Switching Unit 10 Motion Estimation Unit 13 Full Pel Prediction Unit 14 Division Method Candidate Selection Unit 15 Subpel Prediction Unit 16 Division Method Determination Unit 60 Encoder 61 Intra Prediction Unit 62 Inter Prediction Unit 63 Coding Mode Determining unit 64 Switching unit 65 Motion estimation unit 67 Coding picture structure determination unit 68 Intra / inter selection unit 91 Intra prediction unit 92 Inter prediction unit 93 Coding mode determination unit 94 Switching unit 95 Motion estimation unit 96 Determination unit 97 Intra / inter Selection unit 98 Encoded picture structure determination unit 99 Control unit

Claims

An encoding mode determining apparatus that determines at least one encoding mode of an image block from a plurality of candidates,
A simple motion estimation unit that derives a first encoding cost that is an encoding cost of each encoding mode based on simple motion estimation for a small block that is a partition of an image block obtained by each encoding mode;
An encoding mode selection unit that selects a part of the encoding modes from the plurality of encoding modes based on the first encoding cost derived by the simple motion estimation unit;
Complex motion estimation for deriving a second coding cost that is a coding cost of each coding mode based on a complex motion estimation for a small block obtained by at least a part of the coding modes. And
An encoding mode determination unit that determines an encoding mode of the image block based on a second encoding cost derived by the complex motion estimation unit;
Equipped with a,
The complex motion estimator is
Based on the simple motion estimation in the simple motion estimation unit, determine a picture reference direction in the complex motion estimation,
As a result of simple motion estimation for the small block in the simple motion estimation unit, if the first encoding cost for forward prediction and the first encoding cost for backward prediction are the same, both are selected, and if different, the code is Selecting only the one with the smaller encoding cost and calculating the second encoding cost;
Encoding mode determination device.

When deriving the first coding cost, the simple motion estimation unit performs simple motion estimation in a plurality of picture reference directions for each small block obtained by each coding mode, and reduces the first coding cost. Next, for each small block, a picture reference direction with the low first coding cost is selected, and the first coding cost of all small blocks related to the selected picture reference direction is then determined for each division method candidate. To derive the first encoding cost of the encoding mode for each candidate division method,
The encoding mode determination device according to claim 1.

When deriving the first coding cost, the simple motion estimation unit performs simple motion estimation in a plurality of picture reference directions for each small block obtained by each coding mode, and reduces the first coding cost. calculated, then converted to the first cost of coding each picture reference direction of the small blocks into image blocks, the first coding cost of coding mode for each picture reference direction of each candidate dividing To derive,
The encoding mode determination device according to claim 1.

The simple motion estimation in the reference direction of the plurality of pictures of the simple motion estimation unit includes only forward prediction referring to a temporally forward picture and backward prediction referring to a temporally backward picture. ,
The encoding mode determination device according to claim 2 or 3.

The simple motion estimation in the plurality of picture reference directions of the simple motion estimator includes forward prediction referring to a temporally forward picture, backward prediction referring to a temporally backward picture, temporal Bi-directional prediction with reference to bi-directional pictures,
The encoding mode determination device according to claim 2 or 3.

The simple motion estimation in the plurality of picture reference directions of the simple motion estimation unit includes forward prediction that refers to a temporally forward picture and backward prediction that refers to a temporally backward picture. ,
The simple motion estimator derives a coding cost when bi-directional prediction referring to a bi-directional picture is performed based on the forward prediction and the backward prediction.
The encoding mode determination device according to claim 2 or 3.

The complex motion estimation unit further selects at least a part of the coding mode from the part of the coding modes based on simple motion estimation for the small block in the simple motion estimation unit;
The encoding mode determination device according to any one of claims 1 to 6 .

The complex motion estimation unit selects each coding mode from the first coding cost in ascending order and within a range in which the sum of the processing amounts of the coding modes does not exceed the allowable value of the image block. Make an estimate,
The encoding mode determination device according to any one of claims 1 to 6.

The complex motion estimator selects each coding mode in ascending order of the first coding cost, and while the sum of the processing amount of the selected coding mode does not exceed the processing margin amount , When the first coding cost is repeated, the second coding cost is derived, and when the sum of the processing amounts of the selected coding mode exceeds the processing margin amount, the selection of the coding mode is performed. Discontinue, do not perform the second encoding cost derivation process after the sum of the processing amount of the selected encoding mode exceeds the processing margin amount,
The encoding mode determination device according to claim 1 .

The processing amount is calculated to be proportional to the number of pixels of the small block.
The encoding mode determination apparatus according to claim 8 or 9.

The processing amount is calculated to be proportional to the number of directions referring to a picture.
The encoding mode determination apparatus according to claim 8 or 9.

  The throughput is
  When the simple motion estimation of the plurality of picture reference directions of the simple motion estimation unit is executed by bidirectional prediction that refers to temporally bidirectional pictures, the number of directions referring to pictures is not counted,
  When simple motion estimation in the plurality of picture reference directions of the simple motion estimation unit is performed by other than bidirectional prediction that refers to temporally bidirectional pictures, the number of directions referring to pictures is counted.
  Calculated to be proportional to the number of reference directions obtained by
  The encoding mode determination device according to claim 11.

The simple motion estimation unit, so that the processing amount of the motion estimation process is maintained substantially constant, the motion estimation scheme definitive in the simple motion estimation, is changed according to the image attribute,
Coding mode determining apparatus according to any one of claims 1 to 1 2.

The complex motion estimation unit changes a motion estimation method in the complex motion estimation according to an image attribute so that a processing amount of the motion estimation process is kept substantially constant.
The encoding mode determination apparatus according to any one of claims 1 to 12.

The simple motion estimation unit and the complex motion estimation unit are configured so that the sum of the motion estimation processing amount by the simple motion estimation unit and the processing amount of the motion estimation processing by the complex motion estimation unit is kept substantially constant. , Respectively, change the motion estimation method according to the image attributes,
The encoding mode determination apparatus according to any one of claims 1 to 12.

The image attribute is at least one of image size, encoding method (I picture, P picture, B picture), format (interlace format, progressive format), color difference format, and image motion amount. ,
The encoding mode determination device according to any one of claims 13 to 15.

The simple motion estimator and / or the complex motion estimator changes the motion estimation method so that a product of an input image size composed of image blocks, a reference picture number, and a partition size number is constant. ,
The encoding mode determination device according to claim 16.

The simple motion estimation unit and / or the complex motion estimation unit reduces the number of reference pictures of a B picture to be less than that of a P picture so that the processing amount of motion estimation is kept constant for each picture.
The encoding mode determination device according to claim 16.

  The simple motion estimator and / or the complex motion estimator is
  (1) P picture refers to the front 4 pictures, B picture refers to the front 2 pictures and the back 2 pictures,
  (2) The P picture refers to the front 3 pictures, and the B picture refers to the front 2 pictures and the rear 1 picture.
  (3) P picture refers to the front two pictures, B picture refers to the front one and the rear one
  By any one of the above, the processing amount of motion estimation is kept constant for each picture.
The encoding mode determination device according to claim 16.

The simple motion estimator and / or the complex motion estimator is configured such that the number of partition sizes of a B picture is smaller than that of a P picture, and the processing amount of motion estimation in units of pictures is kept constant.
The encoding mode determination device according to claim 16.

  The simple motion estimator and / or the complex motion estimator is
  (1) With reference to the front one in the P picture, prediction is performed with four partition sizes of 16 pixels × 16 pixels, 16 pixels × 8 pixels, 8 pixels × 16 pixels, and 8 pixels × 8 pixels. Select any two partition sizes from the four partition sizes, and perform forward prediction and backward prediction for the two selected partition sizes, respectively.
  (2) In the P picture, the back one is referenced, and prediction is performed with four partition sizes of 16 pixels × 16 pixels, 16 pixels × 8 pixels, 8 pixels × 16 pixels, and 8 pixels × 8 pixels. Select any two partition sizes from the four partition sizes, and perform forward prediction and backward prediction for the two selected partition sizes, respectively.
  To ensure that the amount of motion estimation processing per picture is kept constant,
The encoding mode determination device according to claim 16.

The simple motion estimator and / or the complex motion estimator reduces the number of reference pictures or the number of partition sizes when the input image is an interlaced image than when the input image is a progressive image.
The encoding mode determination device according to claim 16.

  The simple motion estimator and / or the complex motion estimator is
  (1) In the case of a P picture, a progressive P picture refers to the front two frames, and an interlaced P picture refers to the front two fields.
  (2) In the case of a P picture, the progressive P picture refers to the front one frame, and has four partition sizes of 16 pixels × 16 pixels, 16 pixels × 8 pixels, 8 pixels × 16 pixels, and 8 pixels × 8 pixels. In the interlaced P picture, reference is made to the two front fields, and the four partition sizes of 16 pixels × 16 pixels, 16 pixels × 8 pixels, 8 pixels × 16 pixels, and 8 pixels × 8 pixels are respectively predicted. Select any two partition sizes from and perform prediction for each of the two selected partition sizes.
The encoding mode determination device according to claim 16.

The simple motion estimation unit and / or the complex motion estimation unit changes the number of reference pictures or the number of partition sizes according to the motion of an image.
The encoding mode determination device according to claim 16.

The simple motion estimation is motion estimation with integer pixel accuracy;
The complex motion estimation is motion estimation with non-integer pixel accuracy;
The encoding mode determination device according to any one of claims 1 to 25 .

The encoding mode determination device according to any one of claims 1 to 26 is included.
Integrated circuit.

The encoding mode determination device according to any one of claims 1 to 26 ,
An encoding device for encoding the image block based on the encoding mode of the image block determined by the encoding mode determination device;
An image encoding device comprising:

An image encoding device according to claim 28 ,
Integrated circuit.

An encoding mode determination method for determining an encoding mode of an image block as at least one of a plurality of candidates,
A simple motion estimation step for deriving a first encoding cost that is an encoding cost of each encoding mode based on simple motion estimation for a small block that is a partition of an image block obtained by each encoding mode;
An encoding mode selection step of selecting a part of the encoding modes from the plurality of encoding modes based on the first encoding cost derived by the simple motion estimation step;
Complex motion estimation for deriving a second coding cost that is a coding cost of each coding mode based on a complex motion estimation for a small block obtained by at least a part of the coding modes. Steps,
A coding mode determining step for determining a coding mode of the image block based on the second coding cost derived by the complex motion estimation step;
Including
In the complex motion estimation step,
Based on the simple motion estimation in the simple motion estimation step, determine a picture reference direction in the complex motion estimation,
As a result of simple motion estimation for the small block in the simple motion estimation step, if the first encoding cost for forward prediction and the first encoding cost for backward prediction are the same, both are selected. Deriving the second encoding cost by selecting only the one with the smaller encoding cost,
Coding mode determination method.

An encoding mode determination program for determining at least one encoding mode of an image block from a plurality of candidates by a computer,
The encoding mode determining program is stored in a computer.
A simple motion estimation step for deriving a first encoding cost that is an encoding cost of each encoding mode based on simple motion estimation for a small block that is a partition of an image block obtained by each encoding mode;
An encoding mode selection step of selecting a part of the encoding modes from the plurality of encoding modes based on the first encoding cost derived by the simple motion estimation step;
Complex motion estimation for deriving a second coding cost that is a coding cost of each coding mode based on a complex motion estimation for a small block obtained by at least a part of the coding modes. Steps,
A coding mode determining step for determining a coding mode of the image block based on the second coding cost derived by the complex motion estimation step;
Including
In the complex motion estimation step,
Based on the simple motion estimation in the simple motion estimation step, determine a picture reference direction in the complex motion estimation,
As a result of simple motion estimation for the small block in the simple motion estimation step, if the first encoding cost for forward prediction and the first encoding cost for backward prediction are the same, both are selected. A coding mode determination method for selecting only the one with the smaller coding cost and deriving the second coding cost is performed.
Encoding mode determination program.