JP4293151B2

JP4293151B2 - Video encoding method and video recording apparatus

Info

Publication number: JP4293151B2
Application number: JP2005096756A
Authority: JP
Inventors: 敏行大河内
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2005-03-30
Filing date: 2005-03-30
Publication date: 2009-07-08
Anticipated expiration: 2025-03-30
Also published as: JP2006279638A

Description

本発明は映像符号化方法及び映像記録装置に係り、特に動画像に関する標準テレビジョン信号を、フレーム間圧縮を用いて間欠動画像信号を生成するのに好適な映像符号化方法及びその間欠動画像信号を記録する映像記録装置に関する。 The present invention relates to a video encoding method and a video recording apparatus, and more particularly to a video encoding method suitable for generating an intermittent video signal by using inter-frame compression for a standard television signal related to a video and its intermittent video. The present invention relates to a video recording apparatus for recording a signal.

スーパーマーケット、コンビニエンスストア、パチンコ店などでは、店内の万引きなどの犯罪防止、証拠保全のため、カメラによる監視システムが使われている。以前はこの監視システムに、長時間記録監視用ＶＴＲであるタイムラプスＶＴＲが使用されていたが、磁気テープを記録媒体として用いているために、記録再生ヘッドの目詰まり、テープ巻き込みなどの不具合が発生することがあるため、近年ではハードディスク又は光ディスクに画像信号を記録するビデオディスクレコーダ（以下、ＶＤＲと称す）が監視システムに採用されることが多くなってきている。 In supermarkets, convenience stores, pachinko parlors, etc., camera surveillance systems are used to prevent shoplifting crimes and preserve evidence. In the past, this monitoring system used a time-lapse VTR, which is a VTR for long-term recording monitoring. However, since magnetic tape is used as a recording medium, problems such as clogging of the recording / reproducing head and tape entrainment occur. For this reason, in recent years, video disk recorders (hereinafter referred to as VDR) that record image signals on a hard disk or an optical disk are increasingly used in surveillance systems.

このＶＤＲにおいても、以前のタイムラプスＶＴＲと同様に、記録時間を延ばすため、動画像を間欠に記録する方法が採られている。また、動画像を効率良く間欠的に記録するために、記録する動画像の画像信号に対して画像圧縮が行われている。このようなＶＤＲにおいて、入力動画像信号のｎフレーム（ｎは２以上の自然数）毎に、先頭の１フレームをイントラフレームに符号化すると共に、その先頭の１フレーム以降の（ｎ−１）フレームにはイントラフレームとの相関関係を用いた予測フレームであるＰピクチャに符号化し、符号化されたフレームを所定の割合で間引いて、１個のイントラフレームとｍ個（ｍはｎ−１以下の自然数）の予測フレームとし、１個のイントラフレームとｍ個の予測フレームからなる符号列を出力する映像符号化方法が本出願人により先に提案されている（例えば、特許文献１参照）。 Also in this VDR, as in the previous time-lapse VTR, a method of intermittently recording a moving image is employed in order to extend the recording time. Further, in order to efficiently record a moving image intermittently, image compression is performed on the image signal of the moving image to be recorded. In such a VDR, for every n frames (n is a natural number of 2 or more) of the input video signal, the first frame is encoded into an intra frame, and (n-1) frames after the first frame are encoded. Is encoded into a P picture, which is a prediction frame using a correlation with an intra frame, and the encoded frames are thinned out at a predetermined rate to obtain one intra frame and m (m is equal to or less than n−1). A video encoding method for outputting a code sequence including one intra frame and m prediction frames as a (natural number) prediction frame has been previously proposed by the present applicant (see, for example, Patent Document 1).

この従来の映像符号化方法によれば、Ｐピクチャを間引くことによって、ＪＰＥＧ（Joint Photographic Experts Group）方式によるフレーム内圧縮された画像データや、ＭＰＥＧ２方式のイントラフレームであるＩピクチャのみのフレーム内圧縮画像データを記録する場合よりも、低フレームレートで高画質・高圧縮の動画像信号を記録できる。 According to this conventional video coding method, P-pictures are thinned out, and image data compressed in a frame by JPEG (Joint Photographic Experts Group) method or in-frame compression of only an I picture that is an MPEG2 method intra frame. Compared to recording image data, it is possible to record moving image signals with high image quality and high compression at a low frame rate.

特開２００３−３３３５２６号公報JP 2003-333526 A

しかしながら、特許文献１記載の従来の映像符号化方法では、動画や静止画などの画像の種類により圧縮率や画質が変化する。図８は従来のフレームとデータ量の関係の模式図を示す。図８（ａ）は静止画を圧縮した例であり、Ｐピクチャのデータ量は小さい。一般にＭＰＥＧ方式の圧縮ではＧＯＰ（Group Of Picture）単位でデータ量がほぼ一定になるような符号量制御を行うことが多い。 However, in the conventional video encoding method described in Patent Document 1, the compression rate and the image quality change depending on the type of image such as a moving image or a still image. FIG. 8 is a schematic diagram showing the relationship between the conventional frame and the data amount. FIG. 8A shows an example in which a still image is compressed, and the data amount of the P picture is small. In general, in MPEG compression, code amount control is often performed so that the amount of data becomes almost constant in GOP (Group Of Picture) units.

図８（ａ）によれば、Ｐピクチャのデータ量が小さいためＧＯＰ全体のデータ量が小さい。そこで、ＧＯＰ単位でデータ量を一定になるような符号量制御を適用すると、図８（ｂ）に示すように全体のデータ量が増え、Ｉピクチャのデータ量は大きくなる。この画像のＰピクチャを間引くと、図８（ｃ）に示すようになり、間引くことによるデータ量の削減効果は少なく、静止画は必要以上に高画質である。 According to FIG. 8A, since the data amount of the P picture is small, the data amount of the entire GOP is small. Therefore, when code amount control is performed so that the data amount is constant in GOP units, the entire data amount increases as shown in FIG. 8B, and the data amount of the I picture increases. If the P picture of this image is thinned out, the result is as shown in FIG. 8C, and the effect of reducing the data amount by thinning out is small, and the still image has higher image quality than necessary.

次に、図８（ｄ）は動画を圧縮した例であり、Ｐピクチャのデータ量が大きいため、ＧＯＰ全体のデータ量が大きい。この画像にＧＯＰ単位でデータ量を一定になるような符号量制御を行うと、図８（ｅ）に示すように全体としてデータ量が減り、Ｉピクチャのデータ量は小さくなる。この画像のＰピクチャを間引くと、図８（ｆ）に示すようになり、データ量は削減されるもののＩピクチャのデータ量が小さいため動画の画質が劣化する。 Next, FIG. 8D shows an example in which a moving image is compressed. Since the data amount of the P picture is large, the data amount of the entire GOP is large. When code amount control is performed on this image so that the data amount becomes constant in GOP units, the data amount as a whole decreases as shown in FIG. 8E, and the data amount of the I picture decreases. When the P picture of this image is thinned out, the result is as shown in FIG. 8F, and although the data amount is reduced, the data amount of the I picture is small, so the image quality of the moving image is degraded.

監視の用途においては、異常がない通常の場合は被写体の動きがなく、異常が発生して画像を見たいのは被写体の動きがある場合が多く、特に動画を高画質で記録することが求められている。そのため、画質と圧縮率が変化し、特に動画の画質が劣化する従来の映像符号化方法では、上記の監視の用途における要求に十分応えられていないという課題がある。 For monitoring purposes, there is no subject movement in normal cases where there is no abnormality, and there are many cases where there is movement of the subject when an abnormality occurs and you want to see an image. It has been. Therefore, the conventional video coding method in which the image quality and the compression rate change, and particularly the moving image quality deteriorates, has a problem that it does not sufficiently meet the above-described demands for monitoring applications.

本発明は以上の点に鑑みなされたもので、入力映像信号をフレーム間圧縮を用いて間欠動画像信号を生成するに際し、静止画と動画での画質のばらつきが少なく、静止画に対しては符号データ量が少なく、動画に対しては高画質となる、監視の用途に特に好適な映像符号化方法及びその間欠動画像信号を記録する映像記録装置を提供することを目的とする。 The present invention has been made in view of the above points. When an intermittent video signal is generated using inter-frame compression of an input video signal, there is little variation in image quality between a still image and a moving image. An object of the present invention is to provide a video encoding method that is particularly suitable for monitoring applications and has a small amount of code data and high image quality for moving images, and a video recording apparatus that records the intermittent moving image signal.

上記の目的を達成するため、本発明の映像符号化方法は、入力映像信号をフレーム内圧縮符号化したイントラフレーム及びイントラフレームとの相関関係を用いて予測符号化した予測フレームのいずれかの符号化を選択的に行う映像符号化方法であって、入力映像信号の連続したｎフレーム（ｎは２以上の自然数）を符号化グループ単位とし、その符号化グループ単位毎に、先頭の１フレームを、データ量を所望の目標値に近付ける符号量制御を行いながら、イントラフレームに符号化する第１のステップと、符号化グループ単位毎に、先頭の１フレームを除いた残りの（ｎ−１）フレームを、符号量制御を行うことなく予測フレームに符号化する第２のステップと、第２のステップで符号化された予測フレームを、予め設定した割合で間引く第３のステップと、第１のステップで得られたイントラフレームと、第３のステップで間引かれた残りの予測フレームとからなる符号列を所定のフォーマットで出力する第４のステップとを含むことを特徴とする。 In order to achieve the above object, the video encoding method of the present invention includes an intra frame obtained by intra-frame compression encoding an input video signal, and a code of any prediction frame obtained by predictive encoding using a correlation with the intra frame. A video encoding method for selectively performing encoding, wherein n consecutive frames of an input video signal (n is a natural number of 2 or more) are used as encoding group units, and the first frame is determined for each encoding group unit. The first step of encoding into an intra frame while performing the code amount control to bring the data amount close to a desired target value, and the remaining (n−1) excluding the first frame for each encoding group unit A second step of encoding a frame into a prediction frame without performing code amount control and a prediction frame encoded in the second step are thinned out at a preset ratio. A third step, and a fourth step of outputting a code string composed of the intra frame obtained in the first step and the remaining prediction frames thinned out in the third step in a predetermined format. It is characterized by that.

この発明では、イントラフレームと、（ｎ−１）個の予測フレームから間引いて得た予測フレームとからなる符号列を出力するに際し、イントラフレームのデータ量を目標値に近付ける符号化を行うことにより、従来のようにイントラフレームのデータ量に配慮せずに符号化する場合に比べ、少ないデータ量で画質のよい間欠動画像信号を生成することができる。 In the present invention, when outputting a code string composed of an intra frame and a prediction frame obtained by thinning out (n-1) prediction frames, encoding is performed so that the data amount of the intra frame approaches the target value. As compared with the conventional case where encoding is performed without considering the data amount of the intra frame, it is possible to generate an intermittent moving image signal having a good image quality with a small data amount.

また、上記の目的を達成するため、本発明の映像記録装置は、入力映像信号をフレーム内圧縮符号化したイントラフレーム及びイントラフレームとの相関関係を用いて予測符号化した予測フレームのいずれかの符号化を選択的に行って得られた符号列を記録媒体に記録する映像記録装置であって、入力映像信号の連続したｎフレーム（ｎは２以上の自然数）を符号化グループ単位とし、その符号化グループ単位毎に、先頭の１フレームを、データ量を所望の目標値に近付ける符号量制御を行いながら、イントラフレームに符号化する第１の符号化手段と、符号化グループ単位毎に、先頭の１フレームを除いた残りの（ｎ−１）フレームを、符号量制御を行うことなく予測フレームに符号化する第２の符号化手段と、第２の符号化手段で符号化された予測フレームを、予め設定した割合で間引くフレーム間引き手段と、第１の符号化手段で得られたイントラフレームと、フレーム間引き手段で間引かれた残りの予測フレームとからなる符号列を所定のフォーマットで記録媒体に記録する記録手段とを有することを特徴とする。 In order to achieve the above object, the video recording apparatus according to the present invention includes an intra frame obtained by intra-frame compression coding an input video signal and a prediction frame obtained by predictive coding using a correlation with the intra frame. A video recording apparatus for recording a code string obtained by selectively performing encoding on a recording medium, wherein n consecutive frames of an input video signal (n is a natural number of 2 or more) are encoded group units, For each encoding group unit, the first encoding unit that encodes the first frame into an intra frame while performing code amount control to bring the data amount close to a desired target value, and for each encoding group unit, The remaining (n-1) frames excluding the first frame are encoded by a second encoding unit that encodes the prediction frame without performing the code amount control, and the second encoding unit A code sequence comprising a frame decimation unit that decimates the predicted frame obtained at a preset ratio, an intra frame obtained by the first encoding unit, and a remaining prediction frame decimation by the frame decimation unit And recording means for recording on a recording medium in the format described above.

この発明では、イントラフレームと、（ｎ−１）個の予測フレームから間引いて得た予測フレームとからなる符号列を記録媒体に記録するに際し、イントラフレームのデータ量を目標値に近付ける符号化を行うことにより、従来のようにイントラフレームのデータ量に配慮せずに符号化した符号列を記録する場合に比べ、同じ容量の記録媒体に対して高画質で長時間の符号列の記録ができる。 In the present invention, when a code string composed of an intra frame and a prediction frame obtained by thinning out (n-1) prediction frames is recorded on a recording medium, encoding is performed so that the data amount of the intra frame approaches a target value. By doing so, it is possible to record a code string for a long time with high image quality on a recording medium of the same capacity as compared to the case of recording a code string encoded without considering the amount of data of an intra frame as in the past. .

本発明によれば、イントラフレームのデータ量を目標値に近付ける符号化を行い、符号化グループ単位の符号量制御は行わず、また、予測フレームは符号量制御を行わず、動きの大きい画像は多くのデータ量として、予測フレームの一部を間引いて出力又は記録するものとしたので、動画のイントラフレームも静止画のイントラフレームと同じ符号量となるので画質が劣化せず、また動きの大きい画像の予測フレームは多くのデータ量となるので、動きに関しても十分な画質の映像信号を出力又は記録することができ、特に監視の用途に適用して好適である。 According to the present invention, encoding is performed so that the data amount of an intra frame approaches a target value, code amount control for each encoding group is not performed, and code amount control is not performed for a prediction frame, As a large amount of data, a part of the prediction frame is thinned and output or recorded, so the intraframe of the moving image has the same code amount as the intraframe of the still image, so the image quality does not deteriorate and the movement is large. Since a predicted frame of an image has a large amount of data, a video signal with sufficient image quality can be output or recorded with respect to motion, and is particularly suitable for application to monitoring.

次に、本発明の一実施の形態について図面と共に説明する。図１は本発明になる映像符号化方法の一実施の形態が適用される画像圧縮部のブロック図を示す。同図において、画像圧縮部１内の画像フォーマット変換部１０１は、供給された原画像データを三原色信号のＲＧＢフォーマットや、サンプリング比が４：２：０以外の色差フォーマットから輝度信号Ｙと２種類の色差信号Ｃｂ及びＣｒからなる、４：２：０のＹＣｂＣｒフォーマットの画像データに変換し、このうちＣｂ及びＣｒのみを例えば０ｘ８０（１６進数）だけプラス方向にレベルシフトして出力する。 Next, an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram of an image compression unit to which an embodiment of a video encoding method according to the present invention is applied. In the figure, the image format conversion unit 101 in the image compression unit 1 converts the supplied original image data into the luminance signal Y from the RGB format of the three primary color signals and the color difference format other than the sampling ratio of 4: 2: 0. The image data is converted into 4: 2: 0 YCbCr format image data consisting of the color difference signals Cb and Cr, and only Cb and Cr are level-shifted in the positive direction by 0x80 (hexadecimal number), for example, and output.

２次元ＤＣＴ処理部１０２は、画像フォーマット変換部１０１から供給された、例えば４：２：０のＹＣｂＣｒフォーマットの、Ｙ／Ｃｂ／Ｃｒの各画像データそれぞれを別々にブロック単位で２次元ＤＣＴ（離散コサイン変換）処理して各ＤＣＴ係数を出力する。バッファ部１０３は、２次元ＤＣＴ処理部１０２から供給されたＹ／Ｃｂ／Ｃｒの各画像データの各ＤＣＴ係数について、少なくとも１画面分のＤＣＴ係数を一時記憶する。 The two-dimensional DCT processing unit 102 separately supplies each Y / Cb / Cr image data of, for example, 4: 2: 0 YCbCr format supplied from the image format conversion unit 101 in units of blocks. (Cosine transform) processing and output each DCT coefficient. The buffer 103 temporarily stores at least one screen of DCT coefficients for each DCT coefficient of Y / Cb / Cr image data supplied from the two-dimensional DCT processing unit 102.

量子化処理部１０４は、バッファ部１０３から読み出した各ＤＣＴ係数をそれぞれ量子化して圧縮データをエントロピ符号化部１０５へ出力する。エントロピ符号化部１０５は、量子化処理部１０４から供給されたＹ，Ｃｂ，及びＣｒの各画像データの各圧縮データをそれぞれエントロピ符号化して、更に圧縮された符号データを出力バッファ部１０６へ出力する。出力バッファ部１０６は、エントロピ符号化部１０５から供給された各符号データを一時記憶する。 The quantization processing unit 104 quantizes each DCT coefficient read from the buffer unit 103 and outputs the compressed data to the entropy coding unit 105. The entropy encoding unit 105 entropy-encodes each compressed data of the Y, Cb, and Cr image data supplied from the quantization processing unit 104, and outputs the further compressed code data to the output buffer unit 106. To do. The output buffer unit 106 temporarily stores each code data supplied from the entropy encoding unit 105.

更に、符号量決定部１０７は、最終的な目標符号量を設定すると共に、エントロピ符号化部１０５から供給された各符号データの符号量に基づき、輝度信号データ（Ｙデータ）についての目標符号量を決定する。量子化テーブル算出部１０８は、符号量決定部１０７で設定及び決定された目標符号量と、後述する量子化テーブル設定部１０９に設定された量子化テーブルとに基づき、輝度信号データのＤＣＴ係数の量子化処理で使用する量子化テーブルを生成して、量子化処理部１０４に設定する。 Furthermore, the code amount determination unit 107 sets a final target code amount, and based on the code amount of each code data supplied from the entropy encoding unit 105, the target code amount for the luminance signal data (Y data). To decide. The quantization table calculation unit 108, based on the target code amount set and determined by the code amount determination unit 107 and the quantization table set in the quantization table setting unit 109 described later, the DCT coefficient of the luminance signal data. A quantization table used in the quantization process is generated and set in the quantization processing unit 104.

量子化テーブル記憶部１１０は、少なくともＹ／Ｃｂ／Ｃｒの各画像データそれぞれのＤＣＴ係数を最初に量子化処理する際に使用するデフォルトの量子化テーブルを記憶している。読み出し制御部１１１は、量子化テーブル設定部１０９の制御に基づいて、バッファ部１０３に記憶されたＹ／Ｃｂ／Ｃｒの各画像データの各ＤＣＴ係数の中から、量子化処理部１０４で量子化処理するために使用するＤＣＴ係数を選択して読み出す制御を実行する。 The quantization table storage unit 110 stores a default quantization table that is used when the DCT coefficients of at least Y / Cb / Cr image data are first quantized. Based on the control of the quantization table setting unit 109, the read control unit 111 performs quantization in the DCT coefficient of each image data of Y / Cb / Cr stored in the buffer unit 103 by the quantization processing unit 104. Control is performed to select and read out DCT coefficients to be used for processing.

次に、本発明になる映像符号化装置の一実施の形態について説明する。図２は本発明になる映像符号化装置の一実施の形態のブロック図を示す。同図において、ＶＤＲ２０は、本発明になる映像符号化装置の一実施の形態としての間欠動画像の記録機能だけでなく、再生機能をも備えた記録再生装置（ビデオディスクレコーダ）で、Ａ／Ｄ変換部２１、図１に示した構成の画像圧縮部２２、音声圧縮部２３、間引き処理部２４、ディスク装置２５、画像伸張部２６、音声伸張部２７、Ｄ／Ａ変換部２８、ＣＰＵ（中央処理装置）２Ａ、操作部２Ｂ及び表示部２Ｃを有しており、これらの各部２１〜２８、２Ａ、２Ｂ及び２Ｃは双方向のバス２９を介してＣＰＵ２Ａが統括的に制御する構成とされている。 Next, an embodiment of a video encoding apparatus according to the present invention will be described. FIG. 2 shows a block diagram of an embodiment of a video encoding apparatus according to the present invention. In the figure, a VDR 20 is a recording / reproducing apparatus (video disc recorder) having not only an intermittent moving picture recording function as an embodiment of the video encoding apparatus according to the present invention but also a reproducing function. 1, the image compression unit 22, the audio compression unit 23, the thinning processing unit 24, the disk device 25, the image expansion unit 26, the audio expansion unit 27, the D / A conversion unit 28, the CPU ( A central processing unit) 2A, an operation unit 2B, and a display unit 2C. These units 21 to 28, 2A, 2B, and 2C are configured to be centrally controlled by the CPU 2A via a bidirectional bus 29. ing.

画像圧縮部２２及び音声圧縮部２３は、標準テレビジョン信号を公知のＭＰＥＧ２方式に準拠した圧縮符号化を行う汎用のＭＰＥＧ圧縮用集積回路（ＩＣ）により構成されている。また、操作部２Ｂは、使用者により操作され、記録モードか再生モードかの指定、記録するフレーム数などを入力する。ＣＰＵ２Ａはこの操作部２Ｂからの入力指示に従って、上記の各部のうち必要なブロックを制御する。また、表示部２Ｃは、ＶＤＲ２０の状況を表示する。 The image compression unit 22 and the audio compression unit 23 are configured by a general-purpose MPEG compression integrated circuit (IC) that performs compression encoding on a standard television signal in accordance with a known MPEG2 system. The operation unit 2B is operated by a user and inputs designation of a recording mode or a reproduction mode, the number of frames to be recorded, and the like. The CPU 2A controls necessary blocks among the above-described units in accordance with an input instruction from the operation unit 2B. The display unit 2C displays the status of the VDR 20.

次に、図２の実施の形態の動作について、図３のタイムチャートを併せ参照して説明する。まず、記録時の動作について説明する。図示しないカメラにより監視対象等の被写体を撮像して得られた、例えばＮＴＳＣ方式の標準テレビジョン信号は、ＶＤＲ２０内のＡ／Ｄ変換部２１に供給され、画像信号は必要なフレームだけディジタル化された後、画像圧縮部２２に供給され、音声信号はディジタル化された後、音声圧縮部２３に供給される。 Next, the operation of the embodiment of FIG. 2 will be described with reference to the time chart of FIG. First, the operation during recording will be described. For example, an NTSC standard television signal obtained by imaging a subject such as a monitoring target with a camera (not shown) is supplied to the A / D converter 21 in the VDR 20, and the image signal is digitized only in necessary frames. After that, the audio signal is supplied to the image compression unit 22, and the audio signal is digitized and then supplied to the audio compression unit 23.

図３（ａ）はＶＤＲ２０に入力される標準テレビジョン信号を１フレーム単位で模式的に示したものである。ここでは、Ｖ１、Ｖ２、Ｖ３、Ｖ４の４フレーム分のテレビジョン信号（画像信号及び音声信号）を示している。また、後述する間引き処理部２４による間引く度合いを１／３（３フレーム中２フレーム間引く）とする。 FIG. 3 (a) schematically shows a standard television signal input to the VDR 20 in units of one frame. Here, television signals (image signals and audio signals) for four frames of V1, V2, V3, and V4 are shown. In addition, the degree of thinning by the thinning processing unit 24 described later is set to 1/3 (2 frames are thinned out of 3 frames).

上記の画像圧縮部２２は、Ａ／Ｄ変換部２１から入力されたディジタル化された画像信号に対して、図１のブロック図の構成によりＭＰＥＧ２−ＰＳ（Program Stream）方式に準拠した圧縮符号化を行うが、この実施の形態では、動画像に関する入力標準テレビジョン信号の所定の連続したｎフレーム（ｎは２以上の自然数）毎に先頭の１フレームを１つの空間的相関関係を用いて符号化されたＩピクチャとし、残りの（ｎ−１）フレームはＩピクチャとの相関関係を用いて符号化して得られた予測フレーム、すなわち、Ｐピクチャとする。図３の例では、上記のｎは６である。従って、上記の画像圧縮部２２は、図３（ａ）に示す第１のフレームＶ１はＩピクチャに符号化する。Ｉピクチャの符号化は、所望の目標値のデータ量に近付くよう符号量制御を行う。 The image compression unit 22 compresses and encodes the digitized image signal input from the A / D conversion unit 21 in accordance with the MPEG2-PS (Program Stream) system with the configuration of the block diagram of FIG. However, in this embodiment, for each predetermined consecutive n frames (n is a natural number of 2 or more) of the input standard television signal related to a moving image, the top one frame is encoded using one spatial correlation. The remaining (n-1) frames are predicted frames obtained by encoding using the correlation with the I picture, that is, P pictures. In the example of FIG. 3, the above n is 6. Therefore, the image compression unit 22 encodes the first frame V1 shown in FIG. 3A into an I picture. In encoding an I picture, the code amount is controlled so as to approach the data amount of a desired target value.

この画像圧縮部２２によるＩピクチャの符号化の動作について、図１のブロック図及び図４、図５のフローチャート等を併せ参照して詳細に説明するに、まず、画像圧縮部２２に供給されたＲＧＢフォーマットの画像データは、図１の画像フォーマット変換部１０１においてＲＧＢフォーマットからＹＣｂＣｒフォーマットに変換される。このフォーマット変換は、周知のマトリクス変換式によって容易に変換可能なものであり、このマトリクス変換についての詳細の説明は省略する。 The operation of encoding the I picture by the image compression unit 22 will be described in detail with reference to the block diagram of FIG. 1 and the flowcharts of FIGS. 4 and 5. First, the image compression unit 22 is supplied to the image compression unit 22. The RGB format image data is converted from the RGB format to the YCbCr format by the image format conversion unit 101 in FIG. This format conversion can be easily converted by a known matrix conversion formula, and detailed description of the matrix conversion is omitted.

次に、上記の変換後のＹＣｂＣｒデータについて、Ｙ（輝度）データについては画素の間引きをせず、Ｃｂ及びＣｒ（いずれも色差）データについては水平及び垂直の各方向に１画素おきの間引きをして、サンプリング比４：２：０のＹＣｂＣｒデータを生成する。続いて、生成されたＣｂ及びＣｒデータは、図１に示す２次元ＤＣＴ処理部１０２での２次元ＤＣＴ処理をし易くするために、プラス方向に０ｘ８０だけレベルシフトされて出力される（以上、図４のステップＳ２０１）。 Next, with respect to the converted YCbCr data, pixels are not thinned out for Y (luminance) data, and every other pixel is thinned out in the horizontal and vertical directions for Cb and Cr (both color difference) data. Thus, YCbCr data with a sampling ratio of 4: 2: 0 is generated. Subsequently, the generated Cb and Cr data are level-shifted by 0x80 in the plus direction and output in order to facilitate the two-dimensional DCT processing in the two-dimensional DCT processing unit 102 shown in FIG. Step S201 in FIG.

このサンプリング比４：２：０のＹＣｂＣｒデータの生成においては、図６に示すように、Ｃｂ及びＣｒデータに各１つのブロックデータの生成に対して、Ｙデータは４つのブロックデータＹ０，Ｙ１，Ｙ２，Ｙ３が生成される。よって、画像フォーマット変換部１０１から出力されるＹ／Ｃｂ／Ｃｒの各ブロックデータを、Ｙ０，Ｙ１，Ｙ２，及びＹ３，並びにＣｂ及びＣｒとして以下説明する。 In the generation of the YCbCr data with the sampling ratio 4: 2: 0, as shown in FIG. 6, the Y data has four block data Y0, Y1, Y2 and Y3 are generated. Therefore, the Y / Cb / Cr block data output from the image format conversion unit 101 will be described below as Y0, Y1, Y2, and Y3, and Cb and Cr.

次に、２次元ＤＣＴ処理部１０２は、画像フォーマット変換部１０１からそれぞれ出力されたＣｂ及びＣｒ、並びにＹ０〜Ｙ３について、各１画面分のデータについてブロック単位で２次元ＤＣＴ処理を行い、それぞれのＤＣＴ係数を図１に示すバッファ部１０３に記憶させる（図４のステップＳ２０２）。このとき、バッファ部１０３に記憶される各ＤＣＴ係数は、読み出し制御部１１１によって読み出し可能な状態で記憶される。 Next, the two-dimensional DCT processing unit 102 performs two-dimensional DCT processing on a block-by-block basis for each screen data for Cb and Cr output from the image format conversion unit 101 and Y0 to Y3, respectively. The DCT coefficient is stored in the buffer unit 103 shown in FIG. 1 (step S202 in FIG. 4). At this time, each DCT coefficient stored in the buffer unit 103 is stored in a state in which it can be read by the read control unit 111.

次に、図１の量子化テーブル設定部１０９は、量子化テーブル記憶部１１０に予め記憶させてあるＣｂデータ用のデフォルト量子化テーブルを読み出して量子化処理部１０４に設定する（図４のステップＳ２０３）。続いて、量子化テーブル設定部１０９は、バッファ部１０３からＣｂデータのＤＣＴ係数を連続的に読み出すよう読み出し制御部１１１を制御する。これにより、読み出し制御部１１１は、バッファ部１０３をアドレス制御することよって、バッファ部１０３からＣｂデータのＤＣＴ係数を読み出して量子化処理部１０４に供給し、ここでＣｂデータ用のデフォルト量子化テーブルに基づいて量子化させる（図４のステップＳ２０４）。 Next, the quantization table setting unit 109 in FIG. 1 reads the default quantization table for Cb data stored in advance in the quantization table storage unit 110 and sets it in the quantization processing unit 104 (step in FIG. 4). S203). Subsequently, the quantization table setting unit 109 controls the read control unit 111 to continuously read out the DCT coefficients of the Cb data from the buffer unit 103. As a result, the read control unit 111 controls the address of the buffer unit 103 to read out the DCT coefficient of the Cb data from the buffer unit 103 and supply the DCT coefficient to the quantization processing unit 104. Here, the default quantization table for Cb data is used. Is quantized based on (step S204 in FIG. 4).

次に、量子化処理部１０４から出力されたＣｂの量子化データは、図１のエントロピ符号化部１０５でハフマン符号化されて更にデータ圧縮される（図４のステップＳ２０５）。エントロピ符号化部１０５でハフマン符号化されたＣｂの符号データは、必要に応じてバイトスタッフされて出力バッファ部１０６に一時記憶されると共に、このＣｂの符号データのデータサイズＳＩＺＥｂｃｂが符号量決定部１０７に記憶される（図４のステップＳ２０６，Ｓ２０７）。 Next, the Cb quantized data output from the quantization processing unit 104 is Huffman encoded by the entropy encoding unit 105 in FIG. 1 and further compressed (step S205 in FIG. 4). The Cb code data Huffman-encoded by the entropy encoding unit 105 is byte-stuffed as necessary and temporarily stored in the output buffer unit 106, and the data size SIZEbcb of the Cb code data is the code amount determination unit. 107 (steps S206 and S207 in FIG. 4).

次に、図１の量子化テーブル設定部１０９は、量子化テーブル記憶部１１０に予め記憶させてあるＣｒデータ用のデフォルト量子化テーブルを読み出して量子化処理部１０４に設定する（図４のステップＳ２０８）。これにおいて、Ｃｒデータ用のデフォルト量子化テーブルは、前述したＣｂデータ用のデフォルト量子化テーブルと同一であってもよい。このＣｒデータに関する以降の処理（図４のステップＳ２０９〜Ｓ２１２）は前述したＣｂデータに関する処理（図４のステップＳ２０４〜Ｓ２０７）と同様であるため、説明を省略する。但し、符号量決定部１０７に記憶されるＣｒの符号データのデータサイズをＳＩＺＥｂｃｒとする。 Next, the quantization table setting unit 109 in FIG. 1 reads the default quantization table for Cr data stored in advance in the quantization table storage unit 110 and sets it in the quantization processing unit 104 (step in FIG. 4). S208). In this case, the default quantization table for Cr data may be the same as the default quantization table for Cb data described above. Since the subsequent processing relating to the Cr data (steps S209 to S212 in FIG. 4) is the same as the processing relating to the Cb data (steps S204 to S207 in FIG. 4), description thereof is omitted. However, the data size of the Cr code data stored in the code amount determination unit 107 is SIZEbcr.

以上により、１画面分の色差データ（Ｃｂ及びＣｒ）の圧縮後の符号データが出力バッファ部１０６に一時記憶されると共に、各符号量ＳＩＺＥｂｃｂ及びＳＩＺＥｂｃｒと全体的な１画面分の目標符号量ＴＡＲＧＥＴとが符号量決定部１０７に設定記憶されることとなる。 As described above, the code data after compression of the color difference data (Cb and Cr) for one screen is temporarily stored in the output buffer unit 106, and each code amount SIZEbcb and SIZEbcr and the overall target code amount TARGET for one screen. Are set and stored in the code amount determination unit 107.

次に、図１の画像圧縮部１（図２の２２）は、上記の図４のステップＳ２１２の処理に続いて、Ｙデータのデータ圧縮処理を図５のフローチャートに従って行う。まず、図１の符号量決定部１０７は、全体的な１画面分の目標符号量ＴＡＲＧＥＴと、Ｃｂ及びＣｒの１画面分の符号量ＳＩＺＥｂｃｂ及びＳＩＺＥｂｃｒとからＹデータの目標符号量ＴＡＲＧＥＴｙを次の計算式によって求めて記憶する（図５のステップＳ２１３）。 Next, the image compression unit 1 (22 in FIG. 2) in FIG. 1 performs the data compression process of Y data according to the flowchart in FIG. 5 following the process in step S212 in FIG. First, the code amount determination unit 107 in FIG. 1 calculates the target code amount TARGETy of Y data from the overall target code amount TARGET for one screen and the code amounts SIZEbbc and SIZEbcr for one screen of Cb and Cr as follows. It is obtained by a calculation formula and stored (step S213 in FIG. 5).

ＴＡＲＧＥＴｙ＝ＴＡＲＧＥＴ−（ＳＩＺＥｂｃｂ＋ＳＩＺＥｂｃｒ）
次に、Ｙデータについて図６に示したＹ０，Ｙ１，Ｙ２，及びＹ３に組み分けされる各輝度データについて、Ｙ０，Ｙ１，Ｙ２，そしてＹ３の順番でＤＣＴ係数の量子化及びエントロピ符号化の各処理を実行する。まず、量子化テーブル設定部１０９は、量子化テーブル記憶部１１０に予め記憶してあるＹデータ用のデフォルト量子化テーブル（Ｑ０ｙ）を読み出して量子化処理部１０４に設定する（図５のステップＳ２１４）。 TARGETy = TARGET- (SIZEbbcb + SIZEbcr)
Next, for the Y data, the luminance data divided into Y0, Y1, Y2, and Y3 shown in FIG. 6 are subjected to DCT coefficient quantization and entropy coding in the order of Y0, Y1, Y2, and Y3. Execute each process. First, the quantization table setting unit 109 reads the default quantization table (Q0y) for Y data stored in advance in the quantization table storage unit 110 and sets it in the quantization processing unit 104 (step S214 in FIG. 5). ).

続いて、量子化テーブル設定部１０９は、バッファ部１０３からＹ０データのＤＣＴ係数を連続的に読み出すよう読み出し制御部１１１を制御し、これにより、読み出し制御部１１１はバッファ部１０３をアドレス制御することによって、バッファ部１０３からＹ０データのＤＣＴ係数を読み出して量子化処理部１０４に供給させ、ここで量子化させる（図５のステップＳ２１５）。続いて、量子化されたＹ０データのＤＣＴ係数は、エントロピ符号化部１０５によりハフマン符号化され、得られた符号データのデータサイズＳＩＺＥ０ｙが符号量決定部１０７に供給される（図５のステップＳ２１６）。 Subsequently, the quantization table setting unit 109 controls the read control unit 111 so as to continuously read the DCT coefficients of the Y0 data from the buffer unit 103, whereby the read control unit 111 controls the address of the buffer unit 103. Thus, the DCT coefficient of the Y0 data is read from the buffer unit 103 and supplied to the quantization processing unit 104, where it is quantized (step S215 in FIG. 5). Subsequently, the DCT coefficient of the quantized Y0 data is Huffman encoded by the entropy encoding unit 105, and the data size SIZE0y of the obtained code data is supplied to the code amount determining unit 107 (step S216 in FIG. 5). ).

次に、符号量決定部１０７は、Ｙ０の符号データのデータサイズＳＩＺＥ０ｙと、前記Ｙデータの目標符号量ＴＡＲＧＥＴｙとを比較し、例えばＳＩＺＥ０ｙがＴＡＲＧＥＴｙの±５％以内にあると判定した場合は、これ以降のＹ１，Ｙ２，及びＹ３の処理を止めるよう制御する（図５のステップＳ２１７Ｙｅｓ）。一方、符号量決定部１０７がＳＩＺＥ０ｙがＴＡＲＧＥＴｙの±５％以内にないと判定した場合（ステップＳ２１７Ｎｏ）は、その判定結果が量子化テーブル算出部１０８に供給され、量子化テーブル算出部１０８により次の計算式によってＹ１用の量子化テーブルＱ１ｙが計算される（図５のステップＳ２１８）。 Next, the code amount determination unit 107 compares the data size SIZE0y of the Y0 code data with the target code amount TARGETy of the Y data, and when it is determined that, for example, SIZE0y is within ± 5% of TARGETy, Control is performed so as to stop the subsequent processing of Y1, Y2, and Y3 (step S217 Yes in FIG. 5). On the other hand, when the code amount determination unit 107 determines that SIZE0y is not within ± 5% of TARGETy (No in step S217), the determination result is supplied to the quantization table calculation unit 108, and the quantization table calculation unit 108 The quantization table Q1y for Y1 is calculated by the following calculation formula (step S218 in FIG. 5).

Ｑ１ｙ＝（（１／４）×ＴＡＲＧＥＴｙ／ＳＩＺＥ０ｙ）×Ｑ０
次に、量子化テーブル設定部１０９は、量子化テーブル算出部１０８で算出した上記のＹ１用の量子化テーブルＱ１ｙを、量子化処理部１０４に設定する（図５のステップＳ２１９）。続いて、前記ステップＳ２１５〜Ｓ２１７と同様に、Ｙ１データのＤＣＴ係数の量子化処理部１０４による量子化処理（図５のステップＳ２２０）、エントロピ符号化部１０５によるＹ１の量子化データの符号化処理（図５のステップＳ２２１）、符号量決定部１０７による、Ｙ１の符号データのデータサイズＳＩＺＥ１ｙが前記Ｙデータの目標符号量ＴＡＲＧＥＴｙの±５％以内にあるか否かの判定処理（図５のステップＳ２２２）が順次に行われる。 Q1y = ((1/4) × TARGETy / SIZE0y) × Q0
Next, the quantization table setting unit 109 sets the Y1 quantization table Q1y calculated by the quantization table calculation unit 108 in the quantization processing unit 104 (step S219 in FIG. 5). Subsequently, as in steps S215 to S217, the quantization processing unit 104 performs the DCT coefficient quantization processing on the Y1 data (step S220 in FIG. 5), and the entropy encoding unit 105 encodes the Y1 quantization data. (Step S221 in FIG. 5), the code amount determination unit 107 determines whether the data size SIZE1y of the Y1 code data is within ± 5% of the target code amount TARGETy of the Y data (Step in FIG. 5) S222) is performed sequentially.

そして、ＳＩＺＥ１ｙが目標符号量ＴＡＲＧＥＴｙの±５％以内にあるときは以降の処理を中止し、±５％より大きいときは、量子化テーブル算出部１０８は、次の計算式によってＹ２用の量子化テーブルＱ２ｙを計算する（図５のステップＳ２２３）。 When SIZE1y is within ± 5% of the target code amount TARGETy, the subsequent processing is stopped. When SIZE1y is larger than ± 5%, the quantization table calculation unit 108 performs the quantization for Y2 according to the following equation: The table Q2y is calculated (step S223 in FIG. 5).

Ｑ２ｙ＝（（１／４）×ＴＡＲＧＥＴｙ／ＳＩＺＥ１ｙ）×Ｑ１ｙ
次に、量子化テーブル設定部１０９は、量子化テーブル算出部１０８で算出した上記のＹ２用の量子化テーブルＱ２ｙを、量子化処理部１０４に設定する（図５のステップＳ２２４）。続いて、前記ステップＳ２１５〜Ｓ２１７と同様に、Ｙ２データのＤＣＴ係数の量子化処理部１０４による量子化処理（図５のステップＳ２２５）、エントロピ符号化部１０５によるＹ２の量子化データの符号化処理（図５のステップＳ２２６）、符号量決定部１０７による、Ｙ２の符号データのデータサイズＳＩＺＥ２ｙが前記Ｙデータの目標符号量ＴＡＲＧＥＴｙの±５％以内にあるか否かの判定処理（図５のステップＳ２２７）が順次に行われる。 Q2y = ((1/4) × TARGETy / SIZE1y) × Q1y
Next, the quantization table setting unit 109 sets the Y2 quantization table Q2y calculated by the quantization table calculation unit 108 in the quantization processing unit 104 (step S224 in FIG. 5). Subsequently, similarly to steps S215 to S217, the quantization process by the quantization processing unit 104 of the DCT coefficient of the Y2 data (step S225 in FIG. 5), and the encoding process of the Y2 quantized data by the entropy coding unit 105 (Step S226 in FIG. 5), the code amount determination unit 107 determines whether the data size SIZE2y of the Y2 code data is within ± 5% of the target code amount TARGETy of the Y data (Step in FIG. 5) S227) is performed sequentially.

そして、ＳＩＺＥ２ｙが目標符号量ＴＡＲＧＥＴｙの±５％以内にあるときは以降の処理を中止し、±５％より大きいときは、量子化テーブル算出部１０８は、次の計算式によってＹ３用の量子化テーブルＱ３ｙを計算する（図５のステップＳ２２８）。 When SIZE2y is within ± 5% of the target code amount TARGETy, the subsequent processing is stopped. When SIZE2y is greater than ± 5%, the quantization table calculation unit 108 performs quantization for Y3 according to the following equation: The table Q3y is calculated (step S228 in FIG. 5).

Ｑ３ｙ＝（（１／４）×ＴＡＲＧＥＴｙ／ＳＩＺＥ２ｙ）×Ｑ２ｙ
次に、量子化テーブル設定部１０９は、量子化テーブル算出部１０８で算出した上記のＹ３用の量子化テーブルＱ３ｙを、量子化処理部１０４に設定する（図５のステップＳ２２９）。続いて、前記ステップＳ２１５、Ｓ２１６と同様に、Ｙ３データのＤＣＴ係数の量子化処理部１０４による量子化処理（図５のステップＳ２３０）、エントロピ符号化部１０５によるＹ３の量子化データの符号化処理（図５のステップＳ２３１）が順次に行われる。 Q3y = ((1/4) × TARGETy / SIZE2y) × Q2y
Next, the quantization table setting unit 109 sets the Y3 quantization table Q3y calculated by the quantization table calculation unit 108 in the quantization processing unit 104 (step S229 in FIG. 5). Subsequently, similarly to steps S215 and S216, the quantization processing unit 104 performs the DCT coefficient quantization processing on the Y3 data (step S230 in FIG. 5), and the entropy encoding unit 105 encodes the Y3 quantization data. (Step S231 in FIG. 5) is sequentially performed.

そして、量子化テーブル算出部１０８は、符号量決定部１０７により決定されたＹ３の符号データのデータサイズＳＩＺＥ３ｙを用いて、次の計算式によって最終的なＹデータ全体用の量子化テーブルＱｙを計算する（図５のステップＳ２３２）。 Then, the quantization table calculation unit 108 uses the data size SIZE3y of the Y3 code data determined by the code amount determination unit 107 to calculate the final quantization table Qy for the entire Y data by the following calculation formula: (Step S232 in FIG. 5).

Ｑｙ＝（（１／４）×ＴＡＲＧＥＴｙ／ＳＩＺＥ３ｙ）×Ｑ３ｙ
続いて、この最終的なＹデータ全体用の量子化テーブルＱｙは、量子化テーブル設定部１０９により量子化処理部１０４に設定された後（図５のステップＳ２３３）、１画面分のＹデータのＤＣＴ係数のすべてについて量子化処理部１０４による量子化処理（図５のステップＳ２３４）と、エントロピ符号化部１０５によるＹデータの量子化データの符号化処理（図５のステップＳ２３５）とが順次に行われる。 Qy = ((1/4) × TARGETy / SIZE3y) × Q3y
Subsequently, the final quantization table Qy for the entire Y data is set in the quantization processing unit 104 by the quantization table setting unit 109 (step S233 in FIG. 5). For all DCT coefficients, quantization processing by the quantization processing unit 104 (step S234 in FIG. 5) and quantization processing of quantized data of Y data by the entropy coding unit 105 (step S235 in FIG. 5) are sequentially performed. Done.

なお、ＳＩＺＥ０ｙ，ＳＩＺＥ１ｙ，又はＳＩＺＥ２ｙがＴＡＲＧＥＴｙの±５％以内にあると判定されて前述したステップＳ２１７，Ｓ２２２，又はＳ２２７からステップＳ２３４に分岐移行された場合は、その時点での量子化テーブルＱ１ｙ，Ｑ２ｙ，又はＱ３ｙをＱｙとして用いる。 If it is determined that SIZE0y, SIZE1y, or SIZE2y is within ± 5% of TARGETy and branching is transferred from step S217, S222, or S227 to step S234, the quantization table Q1y, Q2y or Q3y is used as Qy.

上記のステップＳ２３５の処理に続いて、エントロピ符号化部１０５でハフマン符号化された最終的なＹの符号データは、必要に応じてバイトスタッフされて出力バッファ部１０６に供給されて、既に一時記憶されているＣｂ及びＣｒの符号データと共に圧縮データとして出力される（図５のステップＳ２３６）。このようにして、Ｉピクチャの処理を行うことで、最終的に目標符号量に最も近似した符号量となる符号データの出力が可能となる。 Subsequent to the processing in step S235 described above, the final Y code data Huffman-encoded by the entropy encoding unit 105 is byte-stuffed as necessary and supplied to the output buffer unit 106, and is already temporarily stored. The compressed Cb and Cr code data are output as compressed data (step S236 in FIG. 5). In this way, by processing the I picture, it is possible to output code data having a code amount that is the closest to the target code amount.

次に、図２の画像圧縮部２２は、図３（ａ）に示す第２のフレーム以降のフレームＶ２、Ｖ３、Ｖ４、…、Ｖ（ｎ−１）は、上記のようにして符号化して得たＩピクチャとの相関関係を用いて符号化する予測フレーム、すなわちＰピクチャに符号化する。Ｐピクチャの符号化においては、符号量制御を行わず、常に同じ処理を行って公知の方法で符号化する。これにより動きが激しい動画ではＰピクチャのサイズが大きくなる。 Next, the image compression unit 22 in FIG. 2 encodes the frames V2, V3, V4,..., V (n−1) after the second frame shown in FIG. The prediction frame to be encoded using the correlation with the obtained I picture, that is, the P picture is encoded. In coding a P picture, code amount control is not performed, and the same processing is always performed and coding is performed by a known method. As a result, the size of the P picture is increased in a moving image with intense motion.

静止画を符号化して得たデータ量とフレームの関係の一例を図７（ａ）に、動画を符号化して得たデータ量とフレームの関係の一例を図７（ｃ）に示す。入力標準テレビジョン信号のｎフレーム毎に上記の符号化が行われる（図７の例ではｎ＝６）。このようにして、符号化を行うので、データストリーム全体としてはＣＢＲ（コンスタントビットレート）ではなくＶＢＲ（バリアブルビットレート）として符号化されることとなる。 FIG. 7A shows an example of the relationship between the data amount obtained by encoding a still image and the frame, and FIG. 7C shows an example of the relationship between the data amount obtained by encoding a moving image and the frame. The above encoding is performed every n frames of the input standard television signal (n = 6 in the example of FIG. 7). Since encoding is performed in this way, the entire data stream is encoded not as CBR (constant bit rate) but as VBR (variable bit rate).

各ピクチャは、それぞれ複数のビデオパケット（ビデオＰＥＳ（Packetized Elementary Stream））から構成されている。また、図２の音声圧縮部２３は、Ａ／Ｄ変換部２１から入力されたディジタル化された音声信号に対して、圧縮符号化してオーディオパックを出力する。このオーディオパックは、画像圧縮部２２から出力されるビデオパケットと共にデータストリームに入れられる。 Each picture is composed of a plurality of video packets (video PES (Packetized Elementary Stream)). 2 compresses and encodes the digitized audio signal input from the A / D converter 21 and outputs an audio pack. This audio pack is put into the data stream together with the video packet output from the image compression unit 22.

オーディオパックとビデオパケットには、それぞれ提示時刻を表すＰＴＳ（Presentation Time Stamp）データが含まれている。これらをまとめると、画像圧縮部２２と音声圧縮部２３の各出力信号は、図３（ｂ）に示すようになる。図３（ｂ）において、Ｖ．ＰＥＳはビデオＰＥＳ、Ａ．ＰＥＳはオーディオＰＥＳを模式的に示す。ただし、図３（ｂ）において、オーディオＰＥＳのみが音声圧縮部２３の出力パケットであり、それ以外が画像圧縮部２２の出力パケットを示す。また、ｐａ１〜ｐａ４は、固定ビットレートの場合全体のビットレートを一定にするためのパディングパケットである。 Each audio pack and video packet includes PTS (Presentation Time Stamp) data representing the presentation time. In summary, the output signals of the image compression unit 22 and the audio compression unit 23 are as shown in FIG. In FIG. PES is a video PES, A. PES schematically represents an audio PES. However, in FIG. 3B, only the audio PES is an output packet of the audio compression unit 23, and the other is an output packet of the image compression unit 22. Further, pa1 to pa4 are padding packets for making the entire bit rate constant in the case of a fixed bit rate.

図２において、画像圧縮部２２から出力された圧縮符号化された画像データ（ＰＳ）は、間引き処理部２４に供給され、ここで先頭の１フレームのＩピクチャを含む連続した３フレーム毎に、先頭のフレームを除く２フレームのビデオパケット（Ｖ．ＰＥＳ）と全フレームのパディングパケットとを間引かれた後、ディスク装置２５に供給される。一方、これと同時に、音声圧縮部２３から出力された圧縮符号化された音声データ（オーディオＰＥＳ）は、間引かれること無くディスク装置２５に供給される。 In FIG. 2, the compression-encoded image data (PS) output from the image compression unit 22 is supplied to the thinning processing unit 24, where every three consecutive frames including the first I frame I picture. Two frames of video packets (V.PES) excluding the first frame and padding packets of all frames are thinned out, and then supplied to the disk device 25. On the other hand, at the same time, the compression-coded audio data (audio PES) output from the audio compression unit 23 is supplied to the disk device 25 without being thinned out.

これにより、ディスク装置２５には図３（ｃ）に模式的に示すように、先頭の１フレームのＩピクチャと、３フレーム当たり１フレームのＰピクチャのビデオＰＥＳと、すべてのオーディオＰＥＳとからなる離散的なデータが入力され、これらがハードディスクあるいは光ディスクに記録される。すなわち、ディスク装置２５は、ＩピクチャとＰピクチャとが時系列的に合成された間欠動画像のフレーム間圧縮が行われたパケットをハードディスクあるいは光ディスクに記録する。ハードディスクあるいは光ディスクに記録される、フレームとデータ量の関係を示す、静止画を間引いたデータの例を図７（ｂ）に、動画を間引いた例を図７（ｄ）に示す。 Thereby, as schematically shown in FIG. 3 (c), the disk device 25 is composed of an I picture of the first frame, a video PES of P picture of one frame per three frames, and all audio PESs. Discrete data is input and recorded on a hard disk or an optical disk. That is, the disk device 25 records, on a hard disk or an optical disk, a packet in which inter-frame compression of an intermittent moving image in which an I picture and a P picture are synthesized in time series is performed. FIG. 7B shows an example of data obtained by thinning out a still image and shows an example of thinning out a moving image, which shows the relationship between a frame and a data amount recorded on a hard disk or an optical disk.

次に、図２の再生系の動作について説明する。ＣＰＵ２Ａはディスク装置２５を制御し、ハードディスクあるいは光ディスクに記録されているデータを再生させ、得られた再生データのうちＩピクチャ又はＰピクチャのビデオＰＥＳは画像伸張部２６に供給し、オーディオＰＥＳは音声伸張部２７に供給する。 Next, the operation of the reproduction system of FIG. 2 will be described. The CPU 2A controls the disk device 25 to reproduce the data recorded on the hard disk or the optical disk. Among the obtained reproduction data, the video PES of I picture or P picture is supplied to the image expansion unit 26, and the audio PES is audio. Supply to the extension unit 27.

それぞれの伸張部２６、２７は内部の基準同期信号ＳＴＣ（System Time Clock）が入力再生データ中のＰＴＳに達した時、そのアクセスユニットを出力する。これにより、画像伸張部２６は、例えば図３（ｃ）のＩピクチャ、Ｐピクチャをデコードして、図３（ｄ）に示す先頭の１フレームＶ１とそれ以降のＶ４を復号し、これらをＤ／Ａ変換部２８に供給する。 Each of the expansion units 26 and 27 outputs the access unit when an internal reference synchronization signal STC (System Time Clock) reaches the PTS in the input reproduction data. Thereby, the image decompression unit 26 decodes, for example, the I picture and the P picture of FIG. 3C, decodes the first frame V1 and the subsequent V4 shown in FIG. / A converter 28 is supplied.

Ｄ／Ａ変換部２８は、入力された復号フレームＶ１及びＶ４をアナログ画像信号に変換すると共に、間引かれている第２、第３フレームＶ２、Ｖ３を第１フレームＶ１のアナログ画像信号で補間し、第５、第６フレームを第４フレームＶ４のアナログ画像信号で補間する等の補間処理を行い、図３（ｅ）に示すように、信号としては毎秒３０フレームのＮＴＳＣ方式標準テレビジョン信号で、中身の画像としては、毎秒１０駒の間欠動画像の信号を出力する。 The D / A converter 28 converts the input decoded frames V1 and V4 into analog image signals, and interpolates the thinned second and third frames V2 and V3 with the analog image signals of the first frame V1. Then, interpolation processing such as interpolating the fifth and sixth frames with the analog image signal of the fourth frame V4 is performed. As shown in FIG. 3E, the signal is an NTSC standard television signal of 30 frames per second. Thus, as a content image, an intermittent moving image signal of 10 frames per second is output.

一方、オーディオＰＥＳは間引かれていないので、音声伸張部２７で伸張されて得られたオーディオデータは、Ｄ／Ａ変換部２８により、図３（ｅ）に示すように途切れのないアナログ音声信号に変換されて出力される。 On the other hand, since the audio PES has not been thinned out, the audio data obtained by being decompressed by the speech decompression unit 27 is converted into an uninterrupted analog speech signal by the D / A conversion unit 28 as shown in FIG. Is converted to output.

ここで、ディスク装置２５が再生するハードディスクあるいは光ディスクに記録されているビデオＰＥＳは、静止画を間引いたデータの場合は例えば図７（ｂ）に、動画を間引いたデータの場合は例えば図７（ｄ）に示される。図７（ｂ）、（ｄ）から分かるように、Ｉピクチャのサイズが固定なので、静止画と動画での画質のばらつきが少なくなる。また、動きが激しい動画ではＰピクチャのサイズが大きいので、画質に応じて圧縮率は変化するが、動きが激しいほど高ビットレートになる。 Here, the video PES recorded on the hard disk or the optical disk reproduced by the disk device 25 is, for example, FIG. 7B in the case of data obtained by thinning out still images, and in the case of data in which moving pictures are thinned out, for example, FIG. d). As can be seen from FIGS. 7B and 7D, since the size of the I picture is fixed, variations in image quality between still images and moving images are reduced. In addition, since the size of a P picture is large in a moving image with a large amount of motion, the compression rate changes according to the image quality, but the bit rate increases as the motion increases.

映像を用いて監視を行う場合、通常の状態は静止画が多く、異常があった状態は画面上で動きがあることが多い。そのため、本実施の形態を監視用のＶＤＲに適用すると、通常の状態ではデータ量が少なくなり、異常があった状態では高画質となるので、監視の用途に特に好適である。 When monitoring is performed using video, there are many still images in the normal state, and there are many movements on the screen in the abnormal state. For this reason, when this embodiment is applied to a monitoring VDR, the amount of data decreases in a normal state, and the image quality becomes high in an abnormal state, which is particularly suitable for monitoring purposes.

このように、本実施の形態によれば、画像圧縮部１でＩピクチャを目標のデータ量になるよう符号量制御を行って符号化し、ＧＯＰ単位の符号量制御は行わず、またＰピクチャは符号量制御を行わず、動きの大きい画像は多くのデータ量として、Ｐピクチャの一部を間引いて記録するものとしたので、動画のＩピクチャも静止画のＩピクチャと同じ符号量となるので画質が劣化せず、また、動きの大きい画像のＰピクチャは多くのデータ量となるので、動きに関しても十分な画質で映像を記録媒体に記録できる。 As described above, according to the present embodiment, the image compression unit 1 encodes the I picture by performing the code amount control so that the target data amount is obtained, and does not perform the GOP unit code amount control. Since the code amount control is not performed and a large motion image is recorded as a large amount of data, a part of the P picture is thinned out, so that the moving picture I picture also has the same code quantity as the still picture I picture. Since the image quality does not deteriorate and the P picture of an image with a large amount of motion has a large amount of data, the image can be recorded on the recording medium with a sufficient image quality regarding the motion.

なお、本発明は上記の実施の形態に限定されるものではなく、例えば、画像圧縮部２２、音声圧縮部２３、画像伸張部２６及び音声伸張部２７を、エンコード・デコード機能を併せ持つ一つのＭＰＥＧコーデック用ＩＣを用いることができる。また、記録手段として実施の形態ではハードディスク又は光ディスクに符号列を記録し再生するディスク装置２５を用いているが、磁気テープや磁気ディスクに符号列を記録し再生する磁気記録再生装置、半導体メモリに符号列を記録し再生するメモリ記録再生装置にも本発明を適用することができることは勿論である。 Note that the present invention is not limited to the above-described embodiment. For example, the image compression unit 22, the audio compression unit 23, the image expansion unit 26, and the audio expansion unit 27 are combined into one MPEG having an encoding / decoding function. A codec IC can be used. In the embodiment, the recording device uses a disk device 25 that records and reproduces a code string on a hard disk or an optical disk. However, a magnetic recording / reproducing device that records and reproduces a code string on a magnetic tape or a magnetic disk, and a semiconductor memory. Of course, the present invention can also be applied to a memory recording / reproducing apparatus that records and reproduces a code string.

なお、磁気テープを用いた場合は、前述したヘッドの目詰まりの問題はあるが、本発明によれば従来と同じ容量の磁気テープや磁気ディスクに対して従来よりも大容量の符号列の記録再生、すなわち長時間の記録再生が可能という特長は有する。また、本発明は、図１の画像圧縮部１（図２の２２）の動作をコンピュータにより実現するコンピュータプログラムも包含するものである。 In the case of using a magnetic tape, there is a problem of the clogging of the head described above. However, according to the present invention, a code string having a larger capacity than that of the conventional recording can be recorded on a magnetic tape or magnetic disk having the same capacity as the conventional one. There is a feature that reproduction, that is, recording and reproduction for a long time is possible. The present invention also includes a computer program for realizing the operation of the image compression unit 1 (22 in FIG. 2) in FIG. 1 by a computer.

本発明方法の一実施の形態が適用される画像圧縮部のブロック図である。It is a block diagram of an image compression part to which an embodiment of the method of the present invention is applied. 本発明の映像記録装置の一実施の形態のブロック図である。It is a block diagram of one embodiment of a video recording device of the present invention. 図１及び図２の動作説明用タイムチャートである。3 is a time chart for explaining operations of FIGS. 1 and 2. 図１の動作説明用フローチャート（その１）である。FIG. 3 is a flowchart (part 1) for explaining the operation of FIG. 1; FIG. 図１の動作説明用フローチャート（その２）である。FIG. 3 is a flowchart (part 2) for explaining the operation of FIG. 1; 本発明装置における記録画像の一例の模式図である。It is a schematic diagram of an example of the recorded image in this invention apparatus. 本発明の一実施の形態におけるフレームとデータ量の関係の模式図である。It is a schematic diagram of the relationship between the flame | frame and data amount in one embodiment of this invention. 従来の一例のフレームとデータ量の関係の模式図である。It is a schematic diagram of the relationship between a conventional example frame and data amount.

Explanation of symbols

１、２２画像圧縮部
２０ＶＤＲ
２１Ａ／Ｄ変換部
２３音声圧縮部
２４間引き処理部
２５ディスク装置
２６画像伸張部
２７音声伸張部
２８Ｄ／Ａ変換部
２Ａ中央処理装置（ＣＰＵ）
２Ｂ操作部
２Ｃ表示部
１０１画像フォーマット変換部
１０２２次元ＤＣＴ処理部
１０３バッファ部
１０４量子化処理部
１０５エントロピ符号化部
１０６出力バッファ部
１０７符号量決定部
１０８量子化テーブル算出部
１０９量子化テーブル設定部
１１０量子化テーブル記憶部
１１１読み出し制御部

1, 22 Image compression unit 20 VDR
21 A / D conversion unit 23 Audio compression unit 24 Thinning-out processing unit 25 Disk device 26 Image expansion unit 27 Audio expansion unit 28 D / A conversion unit 2A Central processing unit (CPU)
2B Operation unit 2C Display unit 101 Image format conversion unit 102 Two-dimensional DCT processing unit 103 Buffer unit 104 Quantization processing unit 105 Entropy encoding unit 106 Output buffer unit 107 Code amount determination unit 108 Quantization table calculation unit 109 Quantization table setting Unit 110 Quantization table storage unit 111 Read control unit

Claims

A video encoding method that selectively encodes one of an intra frame obtained by intra-frame compression encoding an input video signal and a prediction frame that is predictively encoded using a correlation with the intra frame,
Code amount control in which n frames (n is a natural number greater than or equal to 2) of the input video signal are set as a coding group unit, and the data amount of the first frame is brought close to a desired target value for each coding group unit. A first step of encoding into the intra frame while performing
A second step of encoding the remaining (n−1) frames excluding the first frame for each encoding group unit into the prediction frame without performing code amount control;
A third step of thinning out the prediction frame encoded in the second step at a preset rate;
And a fourth step of outputting a code string composed of the intra frame obtained in the first step and the remaining prediction frame thinned out in the third step in a predetermined format. A characteristic video encoding method.

A code string obtained by selectively performing encoding of an intra frame obtained by intra-frame compression encoding of an input video signal and a prediction frame obtained by predictive encoding using a correlation with the intra frame is used as a recording medium. A video recording device for recording,
Code amount control in which n frames (n is a natural number greater than or equal to 2) of the input video signal are set as a coding group unit, and the data amount of the first frame is brought close to a desired target value for each coding group unit. First encoding means for encoding into the intra frame while performing
Second encoding means for encoding the remaining (n−1) frames excluding the first frame for each encoding group unit into the prediction frame without performing code amount control;
Frame thinning means for thinning out the prediction frame encoded by the second encoding means at a preset rate;
Recording means for recording a code string composed of the intra frame obtained by the first encoding means and the remaining prediction frames thinned by the frame thinning means on a recording medium in a predetermined format. A video recording apparatus characterized by that.