JP4448115B2

JP4448115B2 - Encoded video playback device

Info

Publication number: JP4448115B2
Application number: JP2006238986A
Authority: JP
Inventors: 康之中島; 清乃氏原
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2006-09-04
Filing date: 2006-09-04
Publication date: 2010-04-07
Anticipated expiration: 2016-03-04
Also published as: JP2007014017A

Description

The present invention relates to an encoded moving image reproducing apparatus, and more particularly to one that can reduce the frame memory size required when reproducing moving images encoded by MPEG2 or the like.

A known method for reducing the frame memory size when decoding encoded moving image data is the scalable decoder (FIG. 2) described by Iwahashi et al. in "Scalable decoder without low-frequency drift" (IEICE Technical Report DSP94-108). A scalable decoder is originally a device that decodes using only part of the encoded data, but in the configuration of FIG. 2 the frame memory size is reduced by using an inverse DCT (IDCT: Inverse Discrete Cosine Transform) of reduced order.

Moving image data encoded by MPEG2 or the like is variable-length decoded by the variable length decoder 1, and the resulting quantized DCT coefficients are inverse-quantized by the inverse quantizer 2 to obtain 8 × 8 DCT coefficients. When only the 4 × 4 low-order coefficients of the 8 × 8 DCT coefficients are used and a two-dimensional inverse DCT is performed with a 4 × 4 IDCT, 4 × 4 pixel data is obtained, which is 1/2 the size of the original image both vertically and horizontally. When the block being decoded is a motion-compensated block, the motion compensation block from the motion compensation unit 8 is added to restore the image. Because the restored image is reduced to 1/2 of the original both vertically and horizontally, the enlarger 11 enlarges the picture, which is then sequentially stored in a display memory (not shown) in the subsequent stage and displayed.

Further, the restored image data is stored in the frame memory 6. Here too, because the restored image is 1/2 the original size both vertically and horizontally, the required frame memory size is 1/4 of the original. Since motion compensation is also performed in the reduced space, the motion vector converter 12 scales the motion vector to 1/2 vertically and horizontally, and the corresponding block is extracted from the frame memory 6. If the two-dimensional IDCT uses a 4 × 4 IDCT in the horizontal direction only and an 8 × 8 IDCT in the vertical direction, the image size and the frame memory size are 1/2 of the original.

Iwahashi et al., "Scalable decoder without low-frequency drift," IEICE Technical Report DSP94-108

In the conventional method described above, the frame memory size can be reduced by changing the IDCT size, but the mismatch between the predicted images on the encoding side and the decoding side produces drift noise, which degrades image quality. In addition, because the resolution of the restored image falls to 1/2 or 1/4 of the original, the quality of the decoded image is degraded.

An object of the present invention is to solve these problems and to provide an encoded moving image reproducing apparatus that, when restoring encoded moving image data, suppresses drift noise and reduces only the frame memory size while lowering the decoded resolution as little as possible.

The present invention is characterized by comprising: means for inputting encoded moving image data that has been frequency-transform encoded with a block size of N × N pixels (N is a positive integer); means for decoding the encoded moving image data, which inverse-transforms only the low-frequency K × M portion (K and M are positive integers smaller than N) of the input N × N frequency transform coefficients, using a K × K DCT basis in the horizontal direction and an M × M DCT basis in the vertical direction, to obtain K × M pixel block data; compression means for recompressing the decoded moving image data by predictive encoding; storage means for storing the data output from the compression means; decompression means for decompressing the data output from the storage means; and motion compensation prediction means for performing motion compensated prediction on the moving image data output from the decompression means, wherein the motion compensated prediction uses motion vectors scaled by K/N in the horizontal direction and M/N in the vertical direction.

According to the present invention, the pixel data input to the frame memory is compressed with no degradation or only slight degradation, and the frame memory size is further reduced by combining this compression with an IDCT of reduced block size. Compared with conventional frame memory reduction methods, drift noise and loss of resolution can therefore be suppressed.

As one example, decoding performance was evaluated under the condition that the frame memory size is halved in the horizontal direction, using the ISO test sequence "Flower Garden" compressed with MPEG2 at 2 Mbit/s to 6 Mbit/s. With the second embodiment, a gain of 1.5 dB to 4.5 dB was obtained over the conventional method. Subjectively as well, loss of resolution and drift noise were confirmed to be greatly improved.

The present invention is described in detail below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a first embodiment of the encoded moving image reproducing apparatus of the present invention. Moving image data encoded by MPEG2 or the like is variable-length decoded by the variable length decoder 1, and the resulting quantized DCT coefficients are inverse-quantized by the inverse quantizer 2 to obtain N × N DCT coefficients. For MPEG, N = 8. The inverse DCT converter 3 then performs inverse DCT processing to obtain N × N difference pixel data. The adder 4 adds the difference pixel data to the image data read from the frame memory 6 and outputs reproduced image data. When the block being decoded is a motion-compensated block, the motion compensation unit 8 reads the block needed for motion compensation from the frame memory 6, and the image is restored. The output data is sequentially stored in a display memory (not shown) in the subsequent stage and displayed.

The pixel data of the restored image is then compressed again by the compressor 5 and stored in the frame memory 6. When a block needed for motion compensation is extracted, the decompressor 7 decodes all pixels in the motion compensation block from the compressed pixel data, and the motion compensation unit 8 performs motion compensation. As described above, the present invention is characterized in that the frame memory size is reduced by compressing the pixel data input to the frame memory.

FIG. 3 is a block diagram showing the configuration of the compressor 5. In the compressor 5, the input pixel data is fed to the subtracter 20, where the predicted value output from the predictor 24 is subtracted to obtain the prediction error Δ, which is then quantized by the quantizer 21. The quantized data is also decoded by a local decoder consisting of the inverse quantizer 25, the adder 23, and the predictor 24. Letting X be the input pixel data and PX the predicted pixel data,
Δ = X - PX

PX can be obtained in several ways. FIG. 5 is an explanatory diagram showing various methods of obtaining the predicted value PX. In FIG. 5, the horizontal axis represents the horizontal direction of the pixels, and prediction in the horizontal direction is used as the example. Hatched pixels are pixels whose values are encoded as they are, without prediction; white circles are pixels that are predicted and compression-encoded. Motion compensation requires random access, that is, reading a block from an arbitrary position in the frame memory. Storing the pixel value itself at a fixed period makes random access possible even when differential encoding is used.

FIG. 5(a) shows a method that predicts each pixel from its left neighbor. For example, letting PX1 be the predicted pixel for X1,
PX1 = X0, Δ1 = X1 - X0
and Δ1 is encoded. X2 is predicted from the encoded and locally reconstructed X1 (denoted RX1). Letting PX2 be the predicted pixel for X2,
PX2 = RX1, Δ2 = X2 - RX1
and Δ2 is encoded. However, every PN-th pixel is encoded without prediction. PN is an integer of 2 or more; when PN is infinite, only the first pixel is encoded without prediction.
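The left-neighbor scheme of FIG. 5(a), together with the matching decompression step, can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented implementation: the function names, the rounding quantizer with step `q`, the clamping to the 8-bit range, and the default period `pn` are all chosen for the example.

```python
def quantize(delta, q):
    """Round delta to the nearest multiple of q and return the signed level."""
    level = (2 * abs(delta) + q) // (2 * q)
    return -level if delta < 0 else level


def clamp(value):
    """Keep a reconstructed pixel inside the 8-bit range (an assumption)."""
    return max(0, min(255, value))


def compress_row(pixels, q=4, pn=4):
    """Store every pn-th pixel raw; store the others as quantized differences."""
    codes = []
    rx = 0  # locally reconstructed previous pixel (RX in the text)
    for i, x in enumerate(pixels):
        if i % pn == 0:
            codes.append(("raw", x))
            rx = x
        else:
            level = quantize(x - rx, q)  # delta = X - PX, with PX = RX of the left pixel
            codes.append(("diff", level))
            rx = clamp(rx + level * q)   # same reconstruction the decompressor performs
    return codes


def decompress_row(codes, q=4):
    """Rebuild the row: X = PX + delta, restarting at every raw pixel."""
    pixels = []
    rx = 0
    for kind, value in codes:
        rx = value if kind == "raw" else clamp(rx + value * q)
        pixels.append(rx)
    return pixels
```

Because the compressor predicts from the reconstructed value rather than the original one, the compressor and decompressor stay in step and the quantization error does not accumulate along the row.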

FIG. 5(b) shows an example in which prediction uses only the pixels that are themselves encoded without prediction. In the figure, X1, X2, and X3 are all predicted from X0. For example, letting PX3 be the predicted pixel for X3,
PX3 = X0, Δ3 = X3 - X0
and Δ3 is encoded. However, every PN-th pixel is encoded without prediction.

FIG. 5(c) shows one example of a method that predicts a pixel from several decoded pixels. In the figure, X1, for example, is predicted using X0 and X4, which neighbor it in the horizontal direction. In this case the predicted pixel PX1 is
PX1 = αX0 + βX4
where α and β are weighting coefficients. They can be set to α = β = 1/2 to give a simple average, or a weighted average based on the distance between pixels can be used, giving α = 3/4 and β = 1/4.
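The distance-based weights for FIG. 5(c) can be worked out with a short sketch. The helper names `weights` and `predict_between` and the parameter `span` (the spacing between the two non-predicted pixels, 4 in the X0/X4 example) are hypothetical and introduced only for illustration.

```python
from fractions import Fraction


def weights(k, span=4):
    """Return (alpha, beta) for the pixel k positions to the right of X0.

    The nearer reference pixel receives the larger weight.
    """
    beta = Fraction(k, span)
    return 1 - beta, beta


def predict_between(x0, x4, k, span=4):
    """PX = alpha * X0 + beta * X4 for the pixel at offset k."""
    alpha, beta = weights(k, span)
    return alpha * x0 + beta * x4
```

For X1 (k = 1) this reproduces α = 3/4 and β = 1/4, and for the midpoint (k = 2) it reduces to the simple average.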

The prediction methods above can be applied in the vertical direction as well as the horizontal direction, and prediction in the diagonal direction is also possible. FIG. 6 is an explanatory diagram showing an example in which the prediction of FIG. 5(c) is extended to two dimensions. Here X00, X02, X20, and X22 are pixels encoded without prediction. X01 is predicted from the pixels X00 and X02 on the same horizontal line. X10 can be predicted from X00 and X20, which lie in the vertical direction, and X11 from X00 and X22, which lie in the diagonal direction.

FIG. 7 is a graph showing examples of the characteristics of the quantizer 21. The horizontal axis is the input value, the vertical axis is the output value, and Q is the quantization step size. Characteristic (a), shown by the dotted line, is that of a linear quantizer. For example, letting X be the input value and Y the quantization representative value that is output, Y = Q for (Q/2) ≤ X < (3Q/2). Characteristic (b), shown by the solid line, is that of a linear quantizer with a dead zone as used in MPEG and the like, which outputs 0 while the input value lies within ±Q. Besides linear quantizers such as these, a nonlinear quantizer can also be used as the quantizer 21. The quantization method can also be changed from pixel to pixel. For example, for the pixels that are encoded without prediction (shown hatched in FIG. 5), it is possible to perform no quantization at all and output the input value directly.
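The two characteristics of FIG. 7 can be sketched as follows. The linear quantizer follows the stated rule Y = Q for Q/2 ≤ X < 3Q/2. For the dead-zone quantizer the text only states that inputs within ±Q map to 0, so truncation toward zero outside that zone is an assumption made here, as are the function names and the symmetric handling of negative inputs.

```python
def linear_quantize(x, q):
    """Characteristic (a): nearest multiple of q, with ties rounded away from zero."""
    level = (2 * abs(x) + q) // (2 * q)
    return -level * q if x < 0 else level * q


def deadzone_quantize(x, q):
    """Characteristic (b): inputs with |x| < q give 0; larger inputs are truncated toward zero."""
    level = abs(x) // q
    return -level * q if x < 0 else level * q
```

With Q = 4, an input of 3 is mapped to 4 by the linear quantizer but to 0 by the dead-zone quantizer, which is why the dead zone suppresses small prediction errors.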

For the encoder 22, a fixed length encoder or a variable length encoder such as that used in MPEG can be used. With a fixed length encoder, if for example there are 64 quantization representative values, each datum can be expressed in 6 bits. The amount of encoded information can also be reduced by using two's complement representation. For example, when pixel data is expressed in 8 bits, the prediction error Δ ranges from -255 to +255, and the quantization step Q is 4, the quantizer of FIG. 7(a) yields 127 quantization representative values: -252, -248, ..., 0, ..., 248, 252. Representing 127 values requires 7 bits.

However, for any given pixel X the prediction error Δ always falls within a dynamic range of 256 values, so the quantization representative values can be expressed with 64 values, that is, in 6 bits. For example, when the predicted value P is 0, X ranges from 0 to 255, so Δ = X - P ranges from 0 to 255. When P = 255, Δ ranges from -255 to 0. Because Δ always falls within a dynamic range of 256 values in this way, the quantization representative values can be expressed with 64 values when the quantization step Q is 4.
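The text does not spell out how a 6-bit code is mapped back to a signed prediction error. One possible realization, shown here purely as an assumption, stores the quantization level modulo 64 and lets the decoder use the predicted value P to resolve the wrap-around. The sketch truncates Δ toward zero so that the reconstructed pixel always lies between P and X and therefore stays within 0 to 255; the names `encode_delta` and `decode_pixel` are hypothetical.

```python
Q = 4              # quantization step
LEVELS = 256 // Q  # 64 codes, so 6 bits per predicted pixel


def encode_delta(x, p):
    """Quantize delta = x - p (truncating toward zero) and wrap the level into 0..63."""
    delta = x - p
    level = abs(delta) // Q
    if delta < 0:
        level = -level
    return level % LEVELS


def decode_pixel(code, p):
    """Undo the wrap-around: exactly one of p + Q*code and p + Q*code - 256 is a valid pixel."""
    value = p + Q * code
    if value > 255:
        value -= 256
    return value
```

A positive level n and the negative level n - 64 share one code, but for a given P only one of them leads to a pixel in the 8-bit range, so the decoder can always tell them apart.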

FIG. 4 is a block diagram showing the configuration of the decompressor 7. For the compressed pixel data stored in the frame memory, the decoder 30 first performs the inverse of the processing of the encoder 22 in FIG. 3, obtaining quantized data from the fixed length or variable length codes. The quantized data is inverse-quantized by the inverse quantizer 31 to restore the prediction error Δ. For example, with the quantization characteristic of FIG. 7(a), the quantization representative values are 0, Q, 2Q, ..., so when the quantization step Q is 4 and the quantized data is 0, 1, ..., 63, Δ is 0, 4, 8, 12, ..., 252.

The prediction error Δ is then added to the predicted value P by the adder 32 to decode the pixel X; that is, X = P + Δ. The predictor 33 can be exactly the same device as the predictor 24 of FIG. 3. For the inverse quantization, the inverse of the linear or nonlinear quantization of FIG. 7 can be used.

Next, a second embodiment is described. FIG. 8 is a block diagram showing the configuration of the second embodiment of the decoding apparatus of the present invention. Moving image data encoded by MPEG2 or the like is variable-length decoded by the variable length decoder 1, and the resulting quantized DCT coefficients are inverse-quantized by the inverse quantizer 2 to obtain N × N DCT coefficients. The K × M inverse DCT converter 40 takes only the low-frequency K × M portion (K, M ≤ N) of the N × N DCT coefficients and applies a K × K inverse DCT in the horizontal direction and an M × M inverse DCT in the vertical direction to obtain K × M pixel data. When the block being decoded is a motion-compensated block, the motion compensation block from the motion compensation unit 8 is added by the adder 4 to restore the image.

Further, the pixel data of the restored image is compressed again by the compressor 5 and stored in the frame memory 6. When a block needed for motion compensation is extracted, the decompressor 7 decodes all pixels in the motion compensation block from the compressed pixel data, and the motion compensation unit 8 performs motion compensation. For the motion vector, however, the motion vector converter 12 scales the motion vector MV obtained from the variable length decoder 1 by K/N horizontally and M/N vertically, and motion compensation is performed using the resulting motion vector MV'. The restored image is converted to the required image size by the enlarger 41. For example, to match the encoded image size, it is enlarged by N/K horizontally and N/M vertically.

As described above, the present invention is characterized in that the frame memory size is reduced by using both the reduced-size IDCT 40 and compression of the pixel data input to the frame memory.

The K × M IDCT is now described with reference to FIG. 10(a), which shows the N × N DCT coefficients. The K × M IDCT performs IDCT processing using the K × M DCT coefficients in the region that spans, starting from the lowest frequency, K coefficients in the horizontal direction and M lines in the vertical direction. The IDCT processing uses a K × K IDCT in the horizontal direction and an M × M IDCT in the vertical direction, yielding K × M pixel block data.
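A separable K × M IDCT of this kind can be sketched in plain Python as below. The orthonormal DCT-II convention, the `block[v][u]` layout (vertical frequency v, horizontal frequency u), and the amplitude factor sqrt(K·M)/N that keeps the mean brightness unchanged are assumptions of this sketch; the text does not specify normalization.

```python
import math


def idct_1d(coeffs):
    """Orthonormal inverse DCT-II of a short coefficient list."""
    size = len(coeffs)
    out = []
    for n in range(size):
        total = coeffs[0] / math.sqrt(size)
        for k in range(1, size):
            angle = math.pi * (2 * n + 1) * k / (2 * size)
            total += math.sqrt(2 / size) * coeffs[k] * math.cos(angle)
        out.append(total)
    return out


def reduced_idct(block, k, m):
    """Decode only the low-frequency K x M corner of an N x N coefficient block.

    Returns m rows of k pixels each.
    """
    n = len(block)
    scale = math.sqrt(k * m) / n                # assumed DC-preserving scaling
    low = [row[:k] for row in block[:m]]        # keep K horizontal x M vertical coefficients
    rows = [idct_1d(row) for row in low]        # K-point IDCT along each row (horizontal)
    cols = [idct_1d([rows[v][x] for v in range(m)]) for x in range(k)]  # M-point IDCT (vertical)
    return [[scale * cols[x][y] for x in range(k)] for y in range(m)]
```

For example, with N = 8, K = 4, and M = 8, a flat block of value 100 (DC coefficient 800 under this convention) decodes to an 8-row by 4-column block of value 100, half the original width.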

The motion vector converter 12 multiplies the motion vector MV by K/N horizontally and M/N vertically and outputs the result as MV'. Letting MVx and MVy be the horizontal and vertical components of MV, and MVx' and MVy' those of MV',
MVx' = MVx × K / N
MVy' = MVy × M / N
The calculation can be carried out with fractional precision as well as integer precision, and fractional-precision motion compensation such as that used in MPEG can be applied.
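The scaling can be written directly from the two formulas. In this sketch the function name `scale_motion_vector` is hypothetical, and `Fraction` is used only to keep the fractional part exact; a real decoder would round to whatever sub-pixel precision its motion compensation supports.

```python
from fractions import Fraction


def scale_motion_vector(mv, k, m, n=8):
    """Return MV' = (MVx * K / N, MVy * M / N) as exact fractions."""
    mvx, mvy = mv
    return Fraction(mvx * k, n), Fraction(mvy * m, n)
```

With N = 8, K = 4, and M = 8 (horizontal halving only), the vector (6, -3) becomes (3, -3); with K = M = 4, the vector (5, 2) becomes (5/2, 1), which shows why fractional precision is useful.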

The enlarger 41 converts the restored image to the required image size. For example, to match the encoded image size, the image is enlarged by N/K horizontally and N/M vertically. This can be done by linear interpolation. For example, to convert 3 pixels into 4 pixels, let the original consecutive pixel values be R0, R1, R2 and the pixel values to be found be Q0, Q1, Q2, Q3. Then Q0, Q1, Q2, Q3 can be obtained from the distance ratios of the pixel positions of R0, R1, R2 as follows.
Q0 = R0, Q1 = 1/3 R0 + 2/3 R1
Q2 = 2/3 R1 + 1/3 R2, Q3 = R2
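The 3-to-4 interpolation above generalizes to any pair of lengths when the first and last samples are kept aligned. The following sketch assumes that alignment, requires at least two input and two output samples, and uses the hypothetical name `enlarge_row`.

```python
from fractions import Fraction


def enlarge_row(r, out_len):
    """Linearly interpolate the samples in r onto out_len evenly spaced positions."""
    step = Fraction(len(r) - 1, out_len - 1)  # spacing of output samples in input units
    out = []
    for j in range(out_len):
        pos = j * step
        i = min(int(pos), len(r) - 2)         # index of the left neighbor
        frac = pos - i                        # distance from the left neighbor
        out.append((1 - frac) * r[i] + frac * r[i + 1])
    return out
```

Calling `enlarge_row([R0, R1, R2], 4)` places the outputs at positions 0, 2/3, 4/3, and 2, which gives exactly the weights 1/3, 2/3 and 2/3, 1/3 listed above.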

FIG. 9 is a block diagram showing the configuration of a third embodiment of the reproducing apparatus of the present invention. Moving image data encoded by MPEG2 or the like is variable-length decoded by the variable length decoder 1, and the resulting quantized DCT coefficients are inverse-quantized by the inverse quantizer 2 to obtain N × N DCT coefficients. The zero-pad IDCT (0 Pad IDCT) 50 fills with 0 every coefficient of the N × N DCT coefficients output from the inverse quantizer 2 other than the low-frequency K × M portion, then performs an N × N inverse DCT to obtain N × N pixel data. FIG. 10(b) is an explanatory diagram of the zero-pad IDCT: all coefficients other than the K × M coefficients on the low-frequency side are set to 0, and an N × N IDCT is performed to obtain N × N pixel data. When the block being decoded is a motion-compensated block, the motion compensation block from the motion compensation unit 8 is added by the adder 4 to restore the image.
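The zero-padding step itself is a simple mask over the coefficient block. The sketch below shows only that step, assuming a `block[v][u]` layout with vertical frequency v and horizontal frequency u; the padded block would then be passed to an ordinary N × N IDCT. The function name `zero_pad` is hypothetical.

```python
def zero_pad(block, k, m):
    """Keep the low-frequency K (horizontal) x M (vertical) corner and zero the rest."""
    n = len(block)
    return [
        [block[v][u] if (u < k and v < m) else 0 for u in range(n)]
        for v in range(n)
    ]
```

Unlike the K × M IDCT of the second embodiment, the output here keeps the full N × N block size, so no motion vector scaling or enlargement is needed.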

Further, the pixel data of the restored image is compressed again by the compressor 5 and stored in the frame memory 6. When a block needed for motion compensation is extracted, the decompressor 7 decodes all pixels in the motion compensation block from the compressed pixel data, and the motion compensation unit 8 performs motion compensation. As described above, this embodiment is characterized in that the frame memory size is reduced by performing the IDCT with a certain region filled with 0 and compressing the pixel data input to the frame memory. With this configuration, reducing the high-frequency components of the difference image data output from the inverse DCT converter makes that data a better match for the precision of the image data stored in the frame memory 6.

FIG. 11 is a block diagram showing the configuration of a fourth embodiment of the reproducing apparatus of the present invention. Moving image data encoded by MPEG2 or the like is variable-length decoded by the variable length decoder 1, and the resulting quantized DCT coefficients are inverse-quantized by the inverse quantizer 2 to obtain N × N DCT coefficients. The IDCT 3 performs inverse DCT processing to obtain N × N pixel data. When the block being decoded is a motion-compensated block, the motion compensation block from the motion compensation unit 8 is added to restore the image.

Further, the number of pixel data in the restored image is reduced by the downsampler 60, and the result is stored in the frame memory 6. When a block needed for motion compensation is extracted, the upsampler 61 restores all pixels in the block from the reduced pixel data, and the motion compensation unit 8 performs motion compensation.

The downsampler 60 reduces the number of pixel data at a ratio of 1:SN by, for example, sampling only every SN-th pixel (SN > 0, SN an integer) and discarding the other pixels. The number of pixel data recorded in the frame memory thereby becomes 1/SN. The upsampler 61 restores the discarded pixel data by, for example, linear interpolation. For example, if X0 and X3 are the pixel data recorded in the frame memory 6, X1 and X2 can be obtained from the distance ratios to X0 and X3 as follows.
X1 = 2/3 X0 + 1/3 X3, X2 = 1/3 X0 + 2/3 X3
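A one-dimensional sketch of the downsampler and upsampler follows. The function names are hypothetical, and repeating the last stored sample for any trailing pixels that have no right-hand neighbor is an assumption of this example, since the text does not say how row ends are handled.

```python
from fractions import Fraction


def downsample(row, sn):
    """Keep every sn-th pixel and discard the rest."""
    return row[::sn]


def upsample(kept, sn, length):
    """Rebuild `length` pixels by linear interpolation between stored samples."""
    out = []
    for i in range(length):
        left = i // sn
        right = min(left + 1, len(kept) - 1)  # repeat the last sample at the row end
        frac = Fraction(i % sn, sn)           # distance from the left stored sample
        out.append((1 - frac) * kept[left] + frac * kept[right])
    return out
```

With SN = 3, the row X0..X3 is stored as just X0 and X3, and `upsample` returns X1 = 2/3 X0 + 1/3 X3 and X2 = 1/3 X0 + 2/3 X3, matching the formulas above.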

FIGS. 12 and 13 are block diagrams showing the configuration of the main parts of a fifth embodiment of the reproducing apparatus of the present invention. The fifth embodiment adopts the configurations shown in FIGS. 12 and 13 for the compressor 5 and the decompressor 7, respectively, of the first embodiment shown in FIG. 1. FIG. 12 is a block diagram showing the configuration of the compressor 5 in the fifth embodiment. The S × T Hadamard transformer 70 applies a Hadamard transform to the pixel data in units of S × T blocks (S and T are positive integers), converting it to frequency-domain data. The output data is quantized by the quantizer 21 and encoded by the encoder 22. The quantizer 21 and the encoder 22 can use the same schemes as in the first embodiment, and the quantizer 21 can also vary the quantization step Q according to the position and magnitude of each coefficient in the block.

FIG. 13 is a block diagram showing the configuration of the decompressor 7 in the fifth embodiment. For the compressed image data stored in the frame memory 6, the decoder 30 first restores the quantized data, and the inverse quantizer 31 restores that quantized data to frequency-domain data for each S × T block. The frequency-domain data is then restored to image data by the S × T inverse Hadamard transformer 71. The decoder 30 and the inverse quantizer 31 can use the same schemes as in the first embodiment. Although the fifth embodiment uses the Hadamard transform, another orthogonal transform such as the DCT can be used instead.
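The transform pair at the core of the fifth embodiment can be sketched as follows, with the quantizer and encoder left out. The natural (Sylvester) ordering, the unnormalized forward transform, and the restriction of S and T to powers of two are assumptions of this sketch; the function names are hypothetical.

```python
def hadamard_1d(values):
    """Unnormalized fast Walsh-Hadamard transform; the length must be a power of two."""
    v = list(values)
    h = 1
    while h < len(v):
        for start in range(0, len(v), 2 * h):
            for j in range(start, start + h):
                a, b = v[j], v[j + h]
                v[j], v[j + h] = a + b, a - b  # butterfly: sum and difference
        h *= 2
    return v


def hadamard_2d(block):
    """Transform each row, then each column, of a T-row by S-column block."""
    rows = [hadamard_1d(r) for r in block]
    cols = [hadamard_1d(c) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]


def inverse_hadamard_2d(coeffs):
    """The transform is its own inverse up to a factor of S * T."""
    t, s = len(coeffs), len(coeffs[0])
    return [[x // (s * t) for x in row] for row in hadamard_2d(coeffs)]
```

Without quantization the round trip is exact, since the integer division by S × T always divides evenly; once coefficients are quantized, the division would need proper rounding instead.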

Embodiments have been disclosed above, but modifications such as the following are also conceivable. The transform coding is not limited to the DCT; the invention can be applied to any transform coding, such as the wavelet transform.

The embodiments disclose examples that use a quantizer, but a configuration without a quantizer is also possible, in which case lossless (perfectly recoverable) encoding can be performed.

Each functional block of the present invention can be implemented entirely in software on a computer such as a personal computer or workstation, or by a dedicated LSI or the like.

FIG. 1 is a block diagram showing the configuration of the first embodiment of the reproducing apparatus of the present invention.
FIG. 2 is a block diagram showing the configuration of a conventional reproducing apparatus.
FIG. 3 is a block diagram showing the configuration of the compressor 5.
FIG. 4 is a block diagram showing the configuration of the decompressor 7.
FIG. 5 is an explanatory diagram showing various methods of obtaining the predicted value PX.
FIG. 6 is an explanatory diagram showing an example in which the prediction of FIG. 5(c) is extended to two dimensions.
FIG. 7 is a graph showing examples of the characteristics of the quantizer 21.
FIG. 8 is a block diagram showing the configuration of the second embodiment of the reproducing apparatus of the present invention.
FIG. 9 is a block diagram showing the configuration of the third embodiment of the reproducing apparatus of the present invention.
FIG. 10 is an explanatory diagram showing examples of the IDCT schemes of the second and third embodiments.
FIG. 11 is a block diagram showing the configuration of the fourth embodiment of the reproducing apparatus.
FIG. 12 is a block diagram showing the configuration of the compressor 5 of the fifth embodiment.
FIG. 13 is a block diagram showing the configuration of the decompressor 7 of the fifth embodiment.

Explanation of symbols

1…variable length decoder, 2…inverse quantizer, 3…inverse DCT converter, 4…adder, 5…compressor, 6…frame memory, 7…decompressor, 8…motion compensation unit, 10…4 × 4 inverse DCT converter, 11, 41…enlarger, 12…motion vector converter, 20…subtracter, 21…quantizer, 22…encoder, 23, 32…adder, 24, 33…predictor, 30…decoder, 31…inverse quantizer, 40…K × M inverse DCT converter, 50…zero-pad inverse DCT converter, 60…downsampler, 61…upsampler, 70…Hadamard transformer, 71…inverse Hadamard transformer

Claims

1. An encoded moving image reproduction apparatus comprising:
means for inputting encoded moving image data that has been frequency-transform encoded with a block size of N × N pixels (N is a positive integer);
means for decoding the encoded moving image data, which inverse-transforms only the low-frequency K × M portion (K and M are positive integers smaller than N) of the input N × N frequency transform coefficients, using a K × K DCT basis in the horizontal direction and an M × M DCT basis in the vertical direction, to obtain K × M pixel block data;
compression means for recompressing the decoded moving image data by predictive encoding;
storage means for storing the data output from the compression means;
decompression means for decompressing the data output from the storage means; and
motion compensation prediction means for performing motion compensated prediction on the moving image data output from the decompression means,
wherein the motion compensated prediction uses motion vectors scaled by K/N in the horizontal direction and M/N in the vertical direction.

2. The encoded moving image reproduction apparatus according to claim 1,
wherein the compression means leaves periodically sampled image data uncompressed and compresses the remaining image data by predictive encoding that uses the uncompressed image data.