JP4891938B2

JP4891938B2 - Compression encoding preprocessing apparatus and compression encoding preprocessing program

Info

Publication number: JP4891938B2
Application number: JP2008048286A
Authority: JP
Inventors: 康孝松尾; 英輔中須; 清一合志
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2008-02-28
Filing date: 2008-02-28
Publication date: 2012-03-07
Anticipated expiration: 2028-02-28
Also published as: JP2009206953A

Description

本発明は、圧縮符号化する前の画像信号を処理する圧縮符号化前置処理装置及び圧縮符号化前置処理プログラムに関する。 The present invention relates to a compression encoding preprocessing apparatus and a compression encoding preprocessing program for processing an image signal before compression encoding.

従来、観視者の視点の動きに着目して画像信号を処理する装置としては、観視者の眼球運動を検出する手段を備え、画面に表示された画像における視点位置近傍の領域に係る画像信号に割り当てる符号量を、他の領域に係る画像信号に割り当てる符号量よりも多くすることによって、復号時に利用者の感じる画質劣化を抑えるものが知られている（例えば、特許文献１及び２参照）。
特開平７−４４１１０号公報特許第３９２４７９４号公報 Conventionally, as an apparatus for processing an image signal by paying attention to the movement of a viewer's viewpoint, an image relating to a region in the vicinity of the viewpoint position in an image displayed on a screen is provided with means for detecting the eye movement of the viewer It is known that the amount of code assigned to a signal is made larger than the amount of code assigned to an image signal related to another region, thereby suppressing image quality degradation felt by the user during decoding (see, for example, Patent Documents 1 and 2). ).
JP 7-44110 A Japanese Patent No. 3924794

しかしながら、従来のものは、頭部を固定した状態で観視者に動画像を観視させるものであり、観視者が頭部を動かした際の注視点の位置を自動測定する処理が困難な実現性の低いものであった。また、従来のものでは、対象画像における観視者の視覚特性に依存した汎用性の低い圧縮符号が得られるおそれがあった。 However, the conventional one makes the viewer view the moving image with the head fixed, and it is difficult to automatically measure the position of the gazing point when the viewer moves the head. It was a very low possibility. Further, in the conventional one, there is a possibility that a compression code with low versatility depending on the visual characteristics of the viewer in the target image may be obtained.

本発明は、従来の課題を解決するためになされたものであり、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる圧縮符号化前置処理装置及び圧縮符号化前置処理プログラムを提供することを目的とする。 The present invention has been made in order to solve the conventional problems, and a compression coding pre-processing apparatus capable of efficiently obtaining a highly versatile high compression code that does not depend on the visual characteristics of a viewer, and An object of the present invention is to provide a compression coding pre-processing program.

本発明の圧縮符号化前置処理装置は、圧縮符号化する前の画像信号を処理して圧縮符号化処理装置に出力する圧縮符号化前置処理装置であって、入力した画像信号に係る画像から動きのある動領域を予め定めた方式で検出する動領域検出手段と、検出した前記動領域から動オブジェクトを抽出し、抽出した前記動オブジェクトのオプティカルフロー情報を取得するオプティカルフロー情報取得手段と、前記画像が表示される画面の外側から前記画面内に出現する出現オブジェクト又は前記画面上に表示された画像に隠れた状態から出現する前景出現オブジェクトを前記オプティカルフロー情報に基づいて検出し、検出した前記出現オブジェクト又は前記前景出現オブジェクトの位置を示すオブジェクト領域指示信号を出力する出現オブジェクト検出部と、前記画面の大きさ及び前記画面を観視する観視者から前記画面までの距離を示す観視距離に対して跳躍性眼球運動を発生させる移動速度閾値以上の動き速度を有するオブジェクトを前記オプティカルフロー情報に基づいて検出したとき、検出した前記オブジェクトの位置を示す跳躍性眼球運動領域信号を出力する跳躍性眼球運動領域検出部と、前記オブジェクト領域指示信号又は前記跳躍性眼球運動領域信号が出力されている期間において前記画像信号の動領域の空間周波数帯域を制限して前記圧縮符号化処理装置に出力する信号帯域制限手段とを備えた構成を有している。 The compression coding pre-processing apparatus of the present invention is a compression coding pre-processing apparatus that processes an image signal before compression coding and outputs the processed image signal to the compression coding processing apparatus, and an image related to the input image signal A moving area detecting means for detecting a moving area with movement by a predetermined method; an optical flow information acquiring means for extracting a moving object from the detected moving area and acquiring optical flow information of the extracted moving object; Detecting an appearance object appearing in the screen from the outside of the screen on which the image is displayed or a foreground appearance object appearing in a state hidden in the image displayed on the screen based on the optical flow information, and detecting An appearing object that outputs an object area instruction signal indicating the position of the appearing object or the foreground appearing object An object having a moving speed equal to or higher than a moving speed threshold for generating a jumping eye movement with respect to a viewing distance indicating a size of the screen and a distance from a viewer viewing the screen to the screen; , A jumping eye movement region detection unit that outputs a jumping eye movement region signal indicating the detected position of the object, and the object region instruction signal or the jumping eye movement region when detecting based on the optical flow information And a signal band limiting unit that limits a spatial frequency band of the moving region of the image signal and outputs the signal to the compression coding processing apparatus during a period in which the signal is output .

この構成により、本発明の圧縮符号化前置処理装置は、オプティカルフロー情報及び跳躍性眼球運動による注視点移動を考慮して、跳躍性眼球運動により動オブジェクトに対する観視者の視覚が低下する期間において動オブジェクトを含む動領域の信号帯域を制限するので、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる。 With this configuration, the compression coding pre-processing device of the present invention takes into account the movement of the gazing point due to the optical flow information and the jumping eye movement, and the period during which the viewer's visual perception of the moving object is reduced by the jumping eye movement. Since the signal band of the moving area including the moving object is limited, a highly versatile high-compression code that does not depend on the visual characteristics of the viewer can be efficiently obtained.

さらに、本発明の圧縮符号化前置処理装置は、前記オブジェクト領域指示信号及び前記跳躍性眼球運動領域信号を入力する画像信号指示手段をさらに備え、該画像信号指示手段は、前記圧縮符号化処理装置が出力する画像信号として、フレーム内符号化により符号化されるフレーム内符号化画像信号、再生時に先に再生されるフレーム画像信号を用いて符号化する順方向予測符号化により符号化されるフレーム間順方向予測符号化画像信号、又は再生時に先に再生されるフレーム画像信号と後に再生されるフレーム画像信号とを用いて符号化する双方向予測符号化により符号化される双方向予測符号化画像信号のいずれにするかを前記圧縮符号化処理装置に指示する構成を有している。 Furthermore, the compression coding pre-processing apparatus of the present invention further includes an image signal instruction means for inputting the object area instruction signal and the jumping eye movement area signal, and the image signal instruction means includes the compression encoding process. As an image signal output by the apparatus, an intra-frame encoded image signal that is encoded by intra-frame encoding, and a forward predictive encoding that is encoded using a frame image signal that is reproduced first during reproduction are encoded. Inter-frame forward prediction encoded image signal, or bi-directional prediction code encoded by bi-directional predictive encoding encoded using a frame image signal reproduced earlier and a frame image signal reproduced later The compressed encoding processing apparatus is instructed to select one of the encoded image signals .

この構成により、本発明の圧縮符号化前置処理装置は、オプティカルフロー情報及び跳躍性眼球運動による注視点移動を考慮して、フレーム内符号化画像信号、フレーム間順方向予測符号化画像信号及び双方向予測符号化画像信号のいずれかを圧縮符号化時の画像信号として圧縮符号化処理装置に指示するので、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる。 With this configuration, the compression coding pre-processing device of the present invention takes into account the gaze point movement due to the optical flow information and the jumping eye movement, the intra-frame coded image signal, the inter-frame forward prediction coded image signal, and Since either one of the bidirectional predictive encoded image signals is instructed to the compression encoding processing apparatus as an image signal at the time of compression encoding, a highly versatile high compression code that does not depend on the visual characteristics of the viewer can be efficiently obtained. be able to.

本発明の圧縮符号化前置処理プログラムは、圧縮符号化する前の画像信号を処理して圧縮符号化処理装置に出力する圧縮符号化前置処理装置としてコンピュータを機能させる圧縮符号化前置処理プログラムであって、前記コンピュータを、入力した画像信号に係る画像から動きのある動領域を予め定めた方式で検出する動領域検出手段と、検出した前記動領域から動オブジェクトを抽出し、抽出した前記動オブジェクトのオプティカルフロー情報を取得するオプティカルフロー情報取得手段と、前記画像が表示される画面の外側から前記画面内に出現する出現オブジェクト又は前記画面上に表示された画像に隠れた状態から出現する前景出現オブジェクトを前記オプティカルフロー情報に基づいて検出し、検出した前記出現オブジェクト又は前記前景出現オブジェクトの位置を示すオブジェクト領域指示信号を出力する出現オブジェクト検出手段と、前記画面の大きさ及び前記画面を観視する観視者から前記画面までの距離を示す観視距離に対して跳躍性眼球運動を発生させる移動速度閾値以上の動き速度を有するオブジェクトを前記オプティカルフロー情報に基づいて検出したとき、検出した前記オブジェクトの位置を示す跳躍性眼球運動領域信号を出力する跳躍性眼球運動領域検出手段と、前記オブジェクト領域指示信号又は前記跳躍性眼球運動領域信号が出力されている期間において前記画像信号の動領域の空間周波数帯域を制限して前記圧縮符号化処理装置に出力する信号帯域制限手段として機能させる構成を有している。 The compression coding pre-processing program of the present invention is a compression coding pre-processing for causing a computer to function as a compression coding pre-processing device that processes an image signal before compression coding and outputs the processed image signal to the compression coding processing device. A program, wherein the computer extracts a moving object from the moving area detected by moving area detecting means for detecting a moving area having movement from an image related to the input image signal by a predetermined method. Optical flow information acquisition means for acquiring optical flow information of the moving object, and an appearance object appearing in the screen from the outside of the screen on which the image is displayed or appearing from a state hidden in the image displayed on the screen A foreground appearance object to be detected based on the optical flow information, and the detected appearance object or Appearing object detection means for outputting an object area instruction signal indicating the position of the foreground appearing object, and the viewing distance indicating the size of the screen and the distance from the viewer viewing the screen to the screen A jumping eye movement that outputs a jumping eye movement region signal indicating a position of the detected object when an object having a movement speed equal to or higher than a movement speed threshold value that generates the jumping eye movement is detected based on the optical flow information. A signal band that limits the spatial frequency band of the moving area of the image signal and outputs the signal to the compression coding processing apparatus during a period in which the object area instruction signal or the jumping eye movement area signal is output; It has the structure which functions as a limiting means.

この構成により、本発明の圧縮符号化前置処理プログラムは、オプティカルフロー情報及び跳躍性眼球運動による注視点移動を考慮して、跳躍性眼球運動により動オブジェクトに対する観視者の視覚が低下する期間において動オブジェクトを含む動領域の信号帯域を制限するので、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる。 With this configuration, the compression coding pre-processing program of the present invention takes into account the movement of the gazing point due to the optical flow information and the jumping eye movement, and the period during which the viewer's visual perception of the moving object is reduced by the jump eye movement Since the signal band of the moving area including the moving object is limited, a highly versatile high-compression code that does not depend on the visual characteristics of the viewer can be efficiently obtained.

さらに、本発明の圧縮符号化前置処理プログラムは、前記圧縮符号化前置処理装置は、前記オブジェクト領域指示信号及び前記跳躍性眼球運動領域信号を入力する画像信号指示手段をさらに備え、前記圧縮符号化前置処理プログラムは、前記コンピュータを、さらに、前記圧縮符号化処理装置が出力する画像信号として、フレーム内符号化により符号化されるフレーム内符号化画像信号、再生時に先に再生されるフレーム画像信号を用いて符号化する順方向予測符号化により符号化されるフレーム間順方向予測符号化画像信号、又は再生時に先に再生されるフレーム画像信号と後に再生されるフレーム画像信号とを用いて符号化する双方向予測符号化により符号化される双方向予測符号化画像信号のいずれにするかを前記圧縮符号化処理装置に指示する画像信号指示手段として機能させる構成を有している。 Furthermore, in the compression coding pre-processing program of the present invention, the compression coding pre- processing apparatus further comprises an image signal instruction means for inputting the object area instruction signal and the jumping eye movement area signal, and the compression encoding pre-processing program, the computer, further as an image signal, wherein the compression coding processing apparatus outputs intraframe coded image signal to be encoded by intra-frame coding, is reproduced first at the time of reproduction forward predictive coding inter-frame forward predictive coded picture signal to be encoded by the reduction, or a frame image signal reproduced after the frame image signal reproduced first when the reproduction of coded using frame image signal that preparative said compression encoding processor whether to any of the bidirectional predictive coded picture signal to be encoded by the bidirectional predictive encoding for encoding using It has a structure to function as an image signal instructing means for instructing.

この構成により、本発明の圧縮符号化前置処理プログラムは、オプティカルフロー情報及び跳躍性眼球運動による注視点移動を考慮して、フレーム内符号化画像信号、フレーム間順方向予測符号化画像信号及び双方向予測符号化画像信号のいずれかを圧縮符号化時の画像信号として圧縮符号化処理装置に指示するので、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる。 With this configuration, the compression coding pre-processing program of the present invention takes into account optical flow information and gaze point movement due to jumping eye movements, intra-frame coded image signals, inter-frame forward prediction coded image signals, and Since either one of the bidirectional predictive encoded image signals is instructed to the compression encoding processing apparatus as an image signal at the time of compression encoding, a highly versatile high compression code that does not depend on the visual characteristics of the viewer can be efficiently obtained. be able to.

本発明は、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができるという効果を有する圧縮符号化前置処理装置及び圧縮符号化前置処理プログラムを提供することができるものである。 The present invention provides a compression coding pre-processing apparatus and a compression coding pre-processing program having an effect of being able to efficiently obtain a highly versatile high-compression code that does not depend on the visual characteristics of a viewer. Is something that can be done.

本発明の実施の形態について説明する前に、人間の視覚特性について説明する。 Before describing embodiments of the present invention, human visual characteristics will be described.

（１）中心視及び周辺視の説明
視覚には、中心視と周辺視とがある。中心視では、視力が高く、形や空間的な構造の詳細に関する感度が高いので、視標のテクスチャ、形態、色に関する情報が主に処理される。周辺視では、中心視よりも視力が低いが、空間座標の知覚や変動知覚の感度が高いので、視標の位置や運動に関する情報が処理される。 (1) Description of central vision and peripheral vision Visual vision includes central vision and peripheral vision. Since central vision has high visual acuity and high sensitivity regarding details of shape and spatial structure, information on texture, form, and color of the visual target is mainly processed. In peripheral vision, the visual acuity is lower than in central vision, but since the sensitivity of spatial coordinate perception and fluctuation perception is high, information on the position and motion of the target is processed.

（２）滑動性追跡眼球運動及び跳躍性眼球運動の説明
動画像内の動物体の動きは、眼球運動による追随視により捉えられる。眼球運動による追随視としては、主に、滑動性追跡眼球運動（Smooth Pursuit）と、跳躍性眼球運動（Saccade）とが知られている。滑動性追跡眼球運動は、動視標に追随して眼球を動かす運動であり、視角が数度／秒以下の遅い視標動き速度に対して機能する。跳躍性眼球運動は、目標の位置に素早く眼球を動かす跳躍運動であり、滑動性追跡眼球運動で追随視できない速い視標動きに対して機能する。 (2) Explanation of slidable pursuit eye movement and jumping eye movement The movement of the moving object in the moving image can be captured by following the eye movement. As the follow-up by eye movement, mainly, a smooth pursuit eye movement (Smooth Pursuit) and a jumping eye movement (Saccade) are known. The sliding pursuit eye movement is a movement that moves the eyeball following the moving target, and functions for a slow target movement speed with a visual angle of several degrees / second or less. The jumping eye movement is a jumping movement that quickly moves the eyeball to a target position, and functions for a quick target movement that cannot be followed by the sliding pursuit eye movement.

（３）跳躍性眼球運動による追随視の説明
網膜の中心窩で捉えた視標が高速に動き、眼球が視標を中心窩で捉えられなくなると跳躍性眼球運動を生じる。以下、図１を用いて説明する。図１は、動き視標に対する跳躍性眼球運動時の眼球の応答を示す概念図である。初期状態として、観視者は、図１に示した時刻ｔ１において網膜の中心窩で視標を捉えているものとする。時刻ｔ１から時間が進むに従って視標が移動することにより視標が中心窩から外れ、視力の低下が生じる。そして、視標は中心窩から十分に外れた位置まで移動する。 (3) Explanation of follow-up by jumping eye movement When the target captured at the fovea of the retina moves at high speed, and the eyeball can no longer capture the target at the fovea, jumping eye movement occurs. Hereinafter, a description will be given with reference to FIG. FIG. 1 is a conceptual diagram showing a response of an eyeball during a jumping eye movement with respect to a motion target. As an initial state, it is assumed that the viewer is capturing the visual target at the fovea of the retina at time t1 shown in FIG. As the time progresses from time t1, the target moves and the target moves off the fovea, resulting in a reduction in visual acuity. The target moves to a position sufficiently away from the fovea.

このとき、跳躍性眼球運動を生じる時刻ｔ３は、眼球が視標を中心窩で捉えられなくなった時刻（十分に速い動きでは、≒ｔ１）から約１５０ｍｓｅｃ〜２００ｍｓｅｃ後となる。また、跳躍性眼球運動が起こる５０ｍｓｅｃ前から跳躍性眼球運動が起こる瞬間（時刻ｔ３）までは、視覚信号は抑制されており、動き指標は見えなくなる。この視標が見えなくなる時刻をｔ２とする。続いて、時刻ｔ３からｔ４までの約５０ｍｓｅｃ〜１００ｍｓｅｃの期間は、眼球が物体の動きに応答する立ち上がり期間であり、観視者の視力はかなり低下している。したがって、図１に示した時刻ｔ２からｔ４までの期間は、跳躍性眼球運動によりほとんど動き視標が見えない期間であると考えられる。 At this time, the time t3 when the jumping eye movement occurs is about 150 msec to 200 msec after the time when the eyeball can no longer catch the target in the fovea (approx. T1 for sufficiently fast movement). Further, from 50 msec before the jumping eye movement occurs until the moment (time t3) when the jumping eye movement occurs, the visual signal is suppressed and the motion index becomes invisible. The time when the target is not visible is t2. Subsequently, a period of about 50 msec to 100 msec from time t3 to t4 is a rising period in which the eyeball responds to the movement of the object, and the visual acuity of the viewer is considerably reduced. Therefore, it is considered that the period from time t2 to t4 shown in FIG. 1 is a period in which the moving target is hardly visible due to the jumping eye movement.

（４）注視の説明
図２は、高速動き視標に対する跳躍性眼球運動時の眼球運動を示す概念図である。図中の実線は時間変位に対する視標位置を示し、図中の破線は時間変位に対する視線位置を示す。図２に示すように、高速動き視標を追随視する際には、跳躍性眼球運動により眼球を跳躍させながら視標の追随を行う。そして視標位置と視線位置とが一瞬一致した際、観視者は、動き視標を高い視力で捉えることができる。 (4) Explanation of gaze FIG. 2 is a conceptual diagram showing eye movement during jumping eye movement with respect to a high-speed motion target. A solid line in the figure indicates a visual target position with respect to time displacement, and a broken line in the figure indicates a visual line position with respect to time displacement. As shown in FIG. 2, when following a high-speed motion target, the target is tracked while jumping the eyeball by jumping eye movement. When the target position and the line-of-sight position coincide for a moment, the viewer can capture the moving target with high visual acuity.

動画像フレーム内で視標が移動する場合、眼球運動によって動画像を注視する運動は、下記のような運動の繰り返しによるものと考えられる。 When the target moves within the moving image frame, the movement of gazing at the moving image by eye movement is considered to be due to repetition of the following movements.

（Ａ）初期状態において注視点を中心視で捉えている場合、（ａ）視標が高速で動くときは、目が視標を中心窩で捉えられなくなると、跳躍性眼球運動を生じて動き視標を中心視で捉えようとし、（ｂ）視標が低速で動くときは、滑動性追跡眼球運動により、動き視標を中心視で捉えようとする。また、（Ｂ）初期状態において注視点を周辺視で捉えた場合、周辺視で注視すべき対象に気づき、跳躍性眼球運動で中心視野を対象に合わせてから、中心視で対象の詳細を把握する。 (A) When the gazing point is captured in the initial state in the initial state, (a) When the target moves at high speed, if the eye cannot capture the target in the fovea, a jumping eye movement occurs and moves Attempts to capture the visual target with central vision. (B) When the visual target moves at low speed, it tries to capture the motion visual target with central vision by sliding pursuit eye movement. (B) In the initial state, when the gazing point is captured with peripheral vision, the object to be gazed with peripheral vision is noticed, and the details of the target are grasped with central vision after adjusting the central visual field with jumping eye movements. To do.

以上より、圧縮符号化における跳躍性眼球運動を生じるような動き部位について、跳躍性眼球運動を起こす約５０ｍｓｅｃから跳躍性眼球運動後約１００ｍｓｅｃの期間は、大きな帯域制限や精度・確度の低い動き補償を施しても、劣化（原画像に対してぼけやブロック歪等）が目立ちにくいといえる。逆にそれ以後の期間は、観視者は跳躍性眼球運動しながら速い動きを追随視し、眼球運動速度と動画像内動き速度が一致した任意箇所で劣化を認識する可能性があるため、動き部位だからといって安易に撮像・表示蓄積ボケ等により高周波成分が失われた分以上の帯域制限を施したり、精度や確度の低い動き補償を行ったりしないのが好ましいことが分かる。 As described above, with respect to a movement part that causes jumping eye movement in compression coding, a large band limitation or motion compensation with low accuracy / accuracy is required for a period from about 50 msec at which jumping eye movement occurs to about 100 msec after jumping eye movement. Even if it is applied, it can be said that deterioration (blurring, block distortion, etc.) is not conspicuous. On the other hand, in the subsequent period, the viewer may follow the fast movement while jumping eye movements, and may recognize deterioration at any point where the eye movement speed matches the movement speed in the moving image. It can be seen that it is preferable not to easily limit the bandwidth more than the loss of high frequency components due to imaging / display accumulation blur or to perform motion compensation with low accuracy and accuracy simply because it is a moving part.

次に、本発明に係る圧縮符号化前置処理装置の一実施の形態における構成について説明する。本実施の形態における圧縮符号化前置処理装置は、予め定めた大きさの画面に動画像を表示するための動画像信号を出力する画像信号出力装置と、所定の画像圧縮符号化方式で動画像信号を圧縮符号化する圧縮符号化処理装置との間に設けられるものである。 Next, the configuration of an embodiment of the compression coding pre-processing apparatus according to the present invention will be described. The compression coding pre-processing device in the present embodiment includes an image signal output device that outputs a moving image signal for displaying a moving image on a screen of a predetermined size, and a moving image using a predetermined image compression coding method. It is provided between a compression encoding processing apparatus that compresses and encodes an image signal.

なお、以下の説明において、圧縮符号化処理装置は、ＭＰＥＧ（Moving Picture Coding Experts Group）−２方式に基づいて動画像信号を圧縮符号化するものとするが、これに限定されるものではなく、ＭＰＥＧ−４やＨ．２６４等の動画像圧縮規格を用いるものにも適用できる。また、時刻ｔにおける動画像フレームを動画像フレームＦ（ｔ）、動画像フレームＦ（ｔ）の画素位置（ｉ，ｊ）における画素値をＦ_ｉ,ｊ（ｔ）と記す。 In the following description, the compression coding processing device compresses and encodes a moving image signal based on the MPEG (Moving Picture Coding Experts Group) -2 method, but is not limited thereto. MPEG-4 and H.264 It can also be applied to those using a moving image compression standard such as H.264. Also, a moving image frame at time t is referred to as a moving image frame F (t), and a pixel value at a pixel position (i, j) of the moving image frame F (t) is referred to as F _{i, j} (t).

図３に示すように、本実施の形態における圧縮符号化前置処理装置１０は、画像信号を記憶するフレームメモリ１１と、クロック信号を発生するクロック信号発生器１２と、シーンチェンジを検出するシーンチェンジ検出部１３と、動画像が含まれる動領域を検出して動領域画像信号２０を出力する動領域検出部１４とを備えている。 As shown in FIG. 3, the compression coding pre-processing apparatus 10 in the present embodiment includes a frame memory 11 that stores an image signal, a clock signal generator 12 that generates a clock signal, and a scene that detects a scene change. A change detection unit 13 and a moving region detection unit 14 that detects a moving region including a moving image and outputs a moving region image signal 20 are provided.

また、圧縮符号化前置処理装置１０は、動領域画像信号２０に基づいて動ベクトルを検出する動ベクトル検出部１５と、画面上に出現する出現オブジェクトを検出する出現オブジェクト検出部１６と、跳躍性眼球運動領域を検出する跳躍性眼球運動領域検出部１７と、信号帯域を制限する信号帯域制限部１８と、ピクチャを切り替えるピクチャ切替部１９とを備えている。なお、圧縮符号化前置処理装置１０は、例えばコンピュータで構成され、所定のプログラムに基づいて動作するものである。 Further, the compression coding preprocessing apparatus 10 includes a motion vector detection unit 15 that detects a motion vector based on the motion region image signal 20, an appearance object detection unit 16 that detects an appearance object that appears on the screen, and a jump. A jumping eye movement region detecting unit 17 that detects a sexual eye movement region, a signal band limiting unit 18 that limits a signal band, and a picture switching unit 19 that switches pictures. Note that the compression coding pre-processing device 10 is configured by a computer, for example, and operates based on a predetermined program.

フレームメモリ１１は、フレームメモリ１１ａ〜１１ｄを含み、クロック信号発生器１２が発生するクロック信号に基づいて、図示しない画像信号出力装置から入力した動画像信号をフレーム毎に記憶するようになっている。なお、フレームメモリ１１ｃに記憶された画像信号を、時刻ｔにおける符号化対象フレームの画像信号として以下説明する。 The frame memory 11 includes frame memories 11a to 11d, and stores a moving image signal input from an image signal output device (not shown) for each frame based on a clock signal generated by the clock signal generator 12. . The image signal stored in the frame memory 11c will be described below as the image signal of the encoding target frame at time t.

クロック信号発生器１２は、予め定めた周波数のクロック信号（ＣＬＫ信号と図示）を発生するようになっており、このクロック信号に基づいてフレームメモリ１１に記憶された画像信号がシフトされる。 The clock signal generator 12 generates a clock signal (shown as a CLK signal) having a predetermined frequency, and the image signal stored in the frame memory 11 is shifted based on this clock signal.

シーンチェンジ検出部１３は、時刻ｔにおける符号化対象フレームの前後フレームの差分情報に基づき、シーンチェンジ（場面変化）を検出するようになっている。具体的には、シーンチェンジ検出部１３は、フレームメモリ１１ｃに記憶された画像信号と、フレームメモリ１１ｂに記憶された画像信号とを比較して両者間の差分情報を取得し、両者間に差がある場合はシーンチェンジがあったものと判定して、フレームメモリ１１に記憶した画像信号をシフトするためのクロック制御信号（ＣＬＫ制御信号と図示）をクロック信号発生器１２に出力するようになっている。 The scene change detection unit 13 detects a scene change (scene change) based on difference information between frames before and after the encoding target frame at time t. Specifically, the scene change detection unit 13 compares the image signal stored in the frame memory 11c with the image signal stored in the frame memory 11b to obtain difference information between the two, and the difference between the two is obtained. If there is, it is determined that there has been a scene change, and a clock control signal (CLK control signal and illustrated) for shifting the image signal stored in the frame memory 11 is output to the clock signal generator 12. ing.

動領域検出部１４は、シーンチェンジがない場合、シーンチェンジ検出部１３から画像信号を入力し、入力した画像信号において動きのある領域を示す動領域を検出するようになっている。具体的には、動領域検出部１４は、例えば、フレームＦの時間軸上の位置をｔとすると、その前後ｔ±ｑフレームを含む入力画像フレーム列Ｆ（ｔ）；−ｑ≦ｔ≦ｑに対して、時間軸方向に１階１次元離散ウェーブレット分解を行い、その高周波成分を得ることで原画像フレーム列から動領域画像フレーム列の画像信号を抜き出すようになっている。 When there is no scene change, the moving area detecting unit 14 receives an image signal from the scene change detecting unit 13 and detects a moving area indicating a moving area in the input image signal. Specifically, for example, when the position of the frame F on the time axis is t, the moving region detection unit 14 includes an input image frame sequence F (t) including t ± q frames before and after that; −q ≦ t ≦ q On the other hand, the first-order one-dimensional discrete wavelet decomposition is performed in the time axis direction, and the high-frequency component is obtained to extract the image signal of the moving area image frame sequence from the original image frame sequence.

また、動領域検出部１４は、抜き出した動領域画像フレーム列の画像信号を示す動領域信号２０を動ベクトル検出部１５に出力するようになっている。図１においては、シーンチェンジがない場合の、フレームメモリ１１ａ〜１１ｄにそれぞれ記憶された動領域画像信号２０を動領域画像信号２０ａ〜２０ｄとして示している。なお、動領域検出部１４は、本発明に係る動領域検出手段を構成する。 In addition, the moving area detecting unit 14 outputs a moving area signal 20 indicating an image signal of the extracted moving area image frame sequence to the moving vector detecting unit 15. In FIG. 1, the moving area image signals 20 stored in the frame memories 11a to 11d when there is no scene change are shown as moving area image signals 20a to 20d, respectively. The moving area detection unit 14 constitutes a moving area detection unit according to the present invention.

動ベクトル検出部１５は、動ベクトル検出部１５ａ〜１５ｃを備え、動領域画像信号２０から動きのあるオブジェクト（動オブジェクト）の信号を抽出し、オブジェクトの動きを示す動ベクトルを検出するようになっている。具体的には、例えば、動ベクトル検出部１５ａは、動領域画像信号２０ａと動領域画像信号２０ｂとに対して空間方向にｎ階２次元離散ウェーブレット分解を行い、各ｎ階における低周波成分と、水平・垂直・対角方向の高周波成分とを求め、動領域画像信号２０ａと動領域画像信号２０ｂとの間におけるオブジェクトの動ベクトルの検出を階層的に行うものである。 The motion vector detection unit 15 includes motion vector detection units 15a to 15c, extracts a signal of a moving object (moving object) from the motion area image signal 20, and detects a motion vector indicating the motion of the object. ing. Specifically, for example, the motion vector detection unit 15a performs n-order two-dimensional discrete wavelet decomposition in the spatial direction on the motion area image signal 20a and the motion area image signal 20b, The horizontal, vertical and diagonal high-frequency components are obtained, and the motion vector of the object between the motion region image signal 20a and the motion region image signal 20b is hierarchically detected.

また、動ベクトル検出部１５は、検出したフレーム間の動ベクトルを用いてオブジェクトの抽出及び追跡を行い、画面上におけるオブジェクトの各点の動き場情報であるオプティカルフロー情報を含む信号を出現オブジェクト検出部１６及び跳躍性眼球運動領域検出部１７に出力するようになっている。なお、動ベクトル検出部１５は、本発明に係る動オブジェクト抽出手段及びオプティカルフロー情報取得手段を構成する。 The motion vector detection unit 15 extracts and tracks an object using the detected motion vector between frames, and detects a signal including optical flow information which is motion field information of each point of the object on the screen. It outputs to the part 16 and the jumping eye movement area | region detection part 17. FIG. The motion vector detection unit 15 constitutes a moving object extraction unit and an optical flow information acquisition unit according to the present invention.

出現オブジェクト検出部１６は、オプティカルフロー情報に基づき、出現オブジェクトの位置を検出し、出現オブジェクトの位置を示す信号をＯＢＪ（OBJect）領域指示信号として信号帯域制限部１８及びピクチャ切替部１９に出力するようになっている。ここで、出現オブジェクトとは、画面外から画面内に出現（フレームイン）するオブジェクト（出現オブジェクト）や、画面上に表示された障害物の画像に隠れた（オクルージョン）状態から現れたオブジェクト（前景出現オブジェクト）を総称したものをいう。なお、出現オブジェクト検出部１６は、本発明に係る動オブジェクト抽出手段を構成する。 The appearance object detection unit 16 detects the position of the appearance object based on the optical flow information, and outputs a signal indicating the position of the appearance object to the signal band limiting unit 18 and the picture switching unit 19 as an OBJ (OBJect) area instruction signal. It is like that. Here, an appearance object is an object (appearance object) that appears (frame-in) from the outside of the screen in the screen, or an object (foreground) that appears from an obstruction (occlusion) image displayed on the screen (foreground). This is a generic term for "appearing objects". In addition, the appearance object detection part 16 comprises the moving object extraction means which concerns on this invention.

跳躍性眼球運動領域検出部１７は、オプティカルフロー情報に基づき、跳躍性眼球運動を生じさせるような動き速度を持ったオブジェクトの動き位置を検出し、該当するオブジェクトの位置を示す信号をＳ（Saccade）領域指示信号として信号帯域制限部１８及びピクチャ切替部１９に出力するようになっている。ここで、跳躍性眼球運動領域検出部１７は、オプティカルフロー情報で示されるオブジェクトの移動速度と、画像を表示する画面の大きさ及び標準観視距離に基づいて予め定めた移動速度閾値とを比較し、オブジェクトの移動速度が移動速度閾値よりも大きいとき、当該オブジェクトが跳躍性眼球運動を行っていると判断するようになっている。なお、標準観視距離とは、視力１．０の観視者により画素構造が知覚されない視角１分に対して１画素となる視距離をいう。 Based on the optical flow information, the jumping eye movement region detection unit 17 detects the movement position of an object having a movement speed that causes jumping eye movement, and outputs a signal indicating the position of the corresponding object S (Saccade ) The signal is output to the signal band limiting unit 18 and the picture switching unit 19 as an area instruction signal. Here, the jumping eye movement region detecting unit 17 compares the moving speed of the object indicated by the optical flow information with a moving speed threshold value determined in advance based on the size of the screen on which the image is displayed and the standard viewing distance. When the moving speed of the object is larger than the moving speed threshold, it is determined that the object is performing a jumping eye movement. The standard viewing distance refers to a viewing distance of 1 pixel per viewing angle of 1 minute at which a pixel structure is not perceived by a viewer with a visual acuity of 1.0.

以下、画像を表示する画面の大きさ及び標準観視距離と、跳躍性眼球運動との関係について図４を用いて説明する。 Hereinafter, the relationship between the size of the screen for displaying an image, the standard viewing distance, and the jumping eye movement will be described with reference to FIG.

人間の視覚特性についての説明において述べたように、滑動性追跡眼球運動は視角が数度／秒以下の遅い視標動き速度に対して機能するものであるので、以下の説明では１０度／秒以上の動きで跳躍性眼球運動が発生するものとする。 As described in the description of human visual characteristics, the sliding pursuit eye movement functions for a slow target movement speed with a visual angle of several degrees / second or less, and in the following description, it is 10 degrees / second. It is assumed that jumping eye movements occur with the above movement.

図４は、ＳＨＶ（Super Hi-Vision）、ＨＤＴＶ（Hi-Vision TV）、ＮＴＳＣ（National Television Standards Committee）の各映像方式において、各画面を構成する画素数と、標準観視距離と、標準観視距離での画角とを示したものである。 FIG. 4 shows the number of pixels constituting each screen, the standard viewing distance, and the standard view in each video system of SHV (Super Hi-Vision), HDTV (Hi-Vision TV), and NTSC (National Television Standards Committee). It shows the angle of view at the viewing distance.

各映像方式において、撮影画角が同じになるようカメラで被写体を撮影し、その動画像を標準観視距離で観視する場合、跳躍性眼球運動が発生するフレーム間の動き速度は以下のようになる。
（１）ＳＨＶの場合：１００［度］／１０［度／秒］＝１０［秒］
（２）ＨＤＴＶの場合：３０［度］／１０［度／秒］＝３［秒］
（３）ＮＴＳＣの場合：１０［度］／１０［度／秒］＝１［秒］ In each video format, when shooting a subject with the camera so that the shooting angle of view is the same, and viewing the moving image at the standard viewing distance, the movement speed between frames in which jumping eye movement occurs is as follows: become.
(1) In the case of SHV: 100 [degrees] / 10 [degrees / second] = 10 [seconds]
(2) HDTV: 30 [degrees] / 10 [degrees / second] = 3 [seconds]
(3) For NTSC: 10 [degrees] / 10 [degrees / second] = 1 [seconds]

すなわち、各映像方式において上記の秒数以下で画面を横断する速い動きにおいて、跳躍性眼球運動が発生するといえる。カメラで同じ画角で被写体を撮影した場合、視距離の関係上、ＳＨＶではＨＤＴＶの約３．３倍、ＮＴＳＣの約１０倍の動き速度となり、より跳躍性眼球運動が発生しやすいことが分かる。 That is, it can be said that jumping eye movements occur in a fast movement across the screen in the above-described number of seconds in each video system. When shooting a subject with the same angle of view with a camera, due to the viewing distance, the SHV has a movement speed that is about 3.3 times that of HDTV and about 10 times that of NTSC, and it can be seen that jumping eye movements are more likely to occur. .

以下、ＨＤＴＶで現在標準的に用いられている、社団法人テレビジョン学会が作成した「ハイビジョン・システム評価用標準動画像（以下、ＩＴＥ標準動画像）」と、同法人が発行する「ハイビジョン・システム評価用標準動画像統計量測定解説書（以下、解説書）」を参考にする。 The following is a standard video for high-definition system evaluation (hereinafter referred to as ITE standard video) created by the Institute of Television Engineers, which is currently used as a standard in HDTV, and a high-definition system issued by the same corporation. Refer to “Standard moving image for evaluation, Statistics measurement manual (hereinafter referred to as manual)”.

上記解説書では、ＩＴＥ標準動画像の６０シーケンスの各動画像シーケンスのフレーム間動きベクトル値が示されている。同書によると、テレビジョン画像では一般に最も速い動きが、画面を１秒間で横切るような動きであり、平均的な動きは画面を数秒間で横切るような動きと言われている。例えば、ＨＤＴＶにおいて、最も速い動きとしては１フィールド間で１９２０［画素］／（６０×１）［フィールド（１秒）］＝３２［画素／フィールド］の動き速度である。また、ＨＤＴＶでの平均的な動きとしては画面を５秒で横切るとすると、１９２０［画素］／（６０×５）［フィールド］＝６．４［画素／フィールド］である。 In the above description, the inter-frame motion vector value of each moving image sequence of 60 sequences of ITE standard moving images is shown. According to the book, it is generally said that the fastest movement in a television image is a movement that crosses the screen in one second, and the average movement is a movement that crosses the screen in several seconds. For example, in HDTV, the fastest motion is a motion speed of 1920 [pixels] / (60 × 1) [field (1 second)] = 32 [pixels / field] between one field. As an average motion in HDTV, if the screen is crossed in 5 seconds, 1920 [pixel] / (60 × 5) [field] = 6.4 [pixel / field].

前述のように跳躍性眼球運動が発生する動き速度を１０［度／秒］とすると、ＨＤＴＶにおいて画面を横切る時間は３秒なので、そのときの動き速度は、１９２０［画素］／（６０×３）［フィールド（３秒）］＝１０．７［画素／フィールド］である。 As described above, when the movement speed at which the jumping eye movement is generated is 10 [degrees / second], the time for crossing the screen in HDTV is 3 seconds, and the movement speed at that time is 1920 [pixels] / (60 × 3 ) [Field (3 seconds)] = 10.7 [pixel / field].

一方、ＳＨＶカメラで同じ画角で被写体を撮影したとすると、画面を数秒間（例えば５秒間）で横切るような平均的な動きでは、１フレーム間で７６８０［画素］／（６０×５）［フレーム（５秒）］＝２５．６［画素／フレーム］となる。また、跳躍性眼球運動が発生する動き速度を１０［度／秒］とすると、ＳＨＶにおいて画面を横切る時間は１０秒なので、そのときの動き速度は、７６８０［画素］／（６０×１０）［フレーム（１０秒）］＝１２．８［画素／フレーム］となる。 On the other hand, if the subject is photographed with the SHV camera at the same angle of view, 7680 [pixels] / (60 × 5) [ Frame (5 seconds)] = 25.6 [pixel / frame]. If the movement speed at which the jumping eye movement is generated is 10 [degrees / second], the time to cross the screen in SHV is 10 seconds, and the movement speed at that time is 7680 [pixels] / (60 × 10) [ Frame (10 seconds)] = 12.8 [pixel / frame].

前述のように、映像方式によって画面の大きさが異なるので、予め定めた画面の大きさ及び観視距離に対して、跳躍性眼球運動による注視点移動が起こると考えられるような動き速度を移動速度閾値として定めておく。例えば、ＳＨＶにおける移動速度閾値として前述の１２．８［画素／フレーム］を用いる。跳躍性眼球運動領域検出部１７は、この移動速度閾値以上の動き速度を有するオブジェクトが検出された場合、当該オブジェクトが跳躍性眼球運動を行っていると判断し、当該オブジェクトに関するＳ領域指示信号を出力するようになっている。ここで、Ｓ領域指示信号が示すＳ領域は、オブジェクトが表示される領域としてもよいが、オブジェクトが表示される領域に所定数の画素数の広がりを加えたオブジェクト周辺の領域を含むものとしてもよい。 As mentioned above, the screen size differs depending on the video system, so the moving speed is changed so that the gaze point movement due to the jumping eye movement occurs with respect to the predetermined screen size and viewing distance. It is determined as a speed threshold. For example, the above-described 12.8 [pixel / frame] is used as the moving speed threshold in SHV. When an object having a movement speed equal to or greater than the movement speed threshold is detected, the jumping eye movement area detection unit 17 determines that the object is performing jumping eye movement, and outputs an S area instruction signal related to the object. It is designed to output. Here, the S area indicated by the S area instruction signal may be an area where the object is displayed, or may include an area around the object obtained by adding a predetermined number of pixels to the area where the object is displayed. Good.

図３に戻り、構成の説明を続ける。信号帯域制限部１８は、ＯＢＪ領域指示信号及びＳ領域指示信号に基づいて、画像信号の空間周波数帯域を制限し、当該画像信号から予め定めた周波数以上の高域成分を除去するようになっている。また、信号帯域制限部１８は、制限した空間周波数帯域の信号成分を含む画像信号を圧縮符号化処理装置に出力するようになっている。なお、信号帯域制限部１８は、本発明に係る信号帯域制限手段を構成する。 Returning to FIG. 3, the description of the configuration is continued. The signal band limiting unit 18 limits the spatial frequency band of the image signal based on the OBJ region instruction signal and the S region instruction signal, and removes a high frequency component having a predetermined frequency or more from the image signal. Yes. The signal band limiting unit 18 outputs an image signal including a limited spatial frequency band signal component to the compression coding processing apparatus. The signal band limiting unit 18 constitutes a signal band limiting unit according to the present invention.

具体的には、信号帯域制限部１８は、以下に示すように動作するものである。
まず、信号帯域制限部１８は、跳躍性眼球運動領域検出部１７からＳ領域指示信号が出力されている期間、すなわち、観視者の視力が跳躍性眼球運動により低下する期間において、Ｓ領域内における画像信号の空間周波数帯域を制限するようになっている。なお、跳躍性眼球運動が単調なパターンで繰り返される場合は、観視者が跳躍性眼球運動しながら速い動きを追随視できる可能性があるので、信号帯域制限部１８は、例えば初回の空間周波数帯域のみを行うよう構成するのが好ましい。 Specifically, the signal band limiting unit 18 operates as described below.
First, the signal band limiting unit 18 is configured to perform the intra-S region in a period in which the S region instruction signal is output from the jumping eye movement region detection unit 17, that is, in a period in which the visual acuity of the viewer is reduced by the jumping eye movement. The spatial frequency band of the image signal is limited. If the jumping eye movement is repeated in a monotonous pattern, the signal band limiting unit 18 may, for example, perform the first spatial frequency because the viewer may be able to follow a fast movement while jumping eye movement. It is preferable to configure to perform only the band.

また、信号帯域制限部１８は、出現オブジェクト検出部１６からＯＢＪ領域指示信号が出力されている期間であって、画面外から画面内にフレームインするオブジェクト領域を周辺視で観視すると想定される時から跳躍性眼球運動により当該オブジェクト領域に中心視野を移動すると想定される時までの期間において、フレームインするオブジェクト領域近傍における画像信号の空間周波数帯域を制限するようになっている。 Further, it is assumed that the signal band limiting unit 18 is a period in which the OBJ region instruction signal is output from the appearance object detection unit 16 and the object region that is framed in from the outside of the screen to the inside of the screen is viewed with peripheral vision. The spatial frequency band of the image signal in the vicinity of the object area to be framed in is limited from the time until the time when the central visual field is assumed to move to the object area by jumping eye movement.

同様に、信号帯域制限部１８は、出現オブジェクト検出部１６からＯＢＪ領域指示信号が出力されている期間であって、背景から前景に出現するオブジェクト領域を周辺視で観視すると想定される時から跳躍性眼球運動により当該オブジェクト領域に中心視野を移動すると想定される時までの期間において、当該オブジェクト領域近傍における画像信号の空間周波数帯域を制限するようになっている。 Similarly, the signal band limiting unit 18 is a period in which the OBJ region instruction signal is output from the appearance object detection unit 16, and from the time when it is assumed that the object region appearing in the foreground from the background is viewed with peripheral vision. The spatial frequency band of the image signal in the vicinity of the object region is limited in a period until the time when it is assumed that the central visual field is moved to the object region by the jumping eye movement.

さらに、信号帯域制限部１８は、ＯＢＪ領域指示信号及びＳ領域指示信号を入力しない場合は、画像信号の空間周波数帯域を制限せず、フレームメモリ１１から入力した画像信号をそのまま圧縮符号化処理装置に出力するようになっている。 Further, when the OBJ area instruction signal and the S area instruction signal are not input, the signal band limiting unit 18 does not limit the spatial frequency band of the image signal, and directly compresses the image signal input from the frame memory 11. To output.

ピクチャ切替部１９は、ＯＢＪ領域指示信号及びＳ領域指示信号に基づき、圧縮符号化処理装置が使用するピクチャタイプを切り替えるためのピクチャ切替制御信号を圧縮符号化処理装置に出力するようになっている。すなわち、本実施の形態では、圧縮符号化処理装置がＭＰＥＧ−２方式を用いるものとしているので、ピクチャ切替部１９は、現在符号化を行いたいフレームの画像信号をＩピクチャ（フレーム内符号化画像信号）、Ｐピクチャ（フレーム間順方向予測符号化画像信号）、Ｂピクチャ（双方向予測符号化画像信号）のいずれにするかを指定する信号を出力するようになっている。 The picture switching unit 19 outputs a picture switching control signal for switching the picture type used by the compression coding processing apparatus to the compression coding processing apparatus based on the OBJ area instruction signal and the S area instruction signal. . That is, in this embodiment, since the compression encoding processing apparatus uses the MPEG-2 system, the picture switching unit 19 converts an image signal of a frame to be encoded into an I picture (intra-frame encoded image). Signal), P picture (interframe forward predictive encoded image signal), and B picture (bidirectional predictive encoded image signal) are output.

具体的には、ピクチャ切替部１９は、跳躍性眼球運動による注視点移動が起こると考えられるような場合、例えば大きなオブジェクトが画面上を動く場合や、画面全体でパンニングが実行される場合は、跳躍性眼球運動が生じ視覚情報が抑制・視力低下していると考えられる期間、Ｂピクチャ等の圧縮効率の高い圧縮符号化を用いるよう指示するピクチャ切替制御信号を圧縮符号化処理装置に出力するようになっている。 Specifically, the picture switching unit 19 is used when the gazing point movement due to the jumping eye movement is considered to occur, for example, when a large object moves on the screen or when panning is executed on the entire screen. A picture switching control signal for instructing to use compression coding with high compression efficiency, such as a B picture, is output to the compression coding processing apparatus during a period in which jumping eye movements occur and visual information is considered to be suppressed / decreased in visual acuity It is like that.

一方、ピクチャ切替部１９は、滑動性追跡眼球運動に近い動きや、高速動きでも跳躍性眼球運動を繰り返すことで中心視により動物体を追随視する可能性がある長時間動きの場合は、フレーム間相関が高く動ベクトルの検出確度が高いＰピクチャや、Ｉピクチャを用いるよう指示するピクチャ切替制御信号を圧縮符号化処理装置に出力するようになっている。なお、ピクチャ切替部１９は、本発明に係る画像信号指示手段を構成する。 On the other hand, the picture switching unit 19 moves the frame in the case of a long-time movement that may follow the moving object by the central vision by repeating a jumping eye movement even at a high speed movement or a movement close to the sliding pursuit eye movement. A picture switching control signal for instructing to use a P picture or an I picture with high inter-correlation and high motion vector detection accuracy is output to the compression coding processing apparatus. Note that the picture switching unit 19 constitutes an image signal instruction unit according to the present invention.

次に、本実施の形態における圧縮符号化前置処理装置１０の動作について図３を用いて説明する。 Next, the operation of the compression coding pre-processing apparatus 10 in the present embodiment will be described with reference to FIG.

図示しない画像信号出力装置から入力した画像信号は、クロック信号に基づいて、フレーム毎にフレームメモリ１１ａ〜１１ｄに記憶される。 An image signal input from an image signal output device (not shown) is stored in the frame memories 11a to 11d for each frame based on the clock signal.

シーンチェンジ検出部１３は、時刻ｔにおける符号化対象フレームの画像信号であるフレームメモリ１１ｃに記憶された画像信号と、フレームメモリ１１ｂに記憶された画像信号とを比較して両者間の差分情報を取得する。このとき、シーンチェンジ検出部１３は、両者間に差がある場合はシーンチェンジがあったものと判定し、クロック制御信号をクロック信号発生器１２に出力する。その結果、フレームメモリ１１に記憶された画像信号がシフトされる。一方、シーンチェンジ検出部１３は、両者間に差がない場合はシーンチェンジがないものと判定し、画像信号を動領域検出部１４に出力する。 The scene change detection unit 13 compares the image signal stored in the frame memory 11c, which is the image signal of the encoding target frame at time t, with the image signal stored in the frame memory 11b, and obtains difference information between them. get. At this time, if there is a difference between the two, the scene change detection unit 13 determines that a scene change has occurred, and outputs a clock control signal to the clock signal generator 12. As a result, the image signal stored in the frame memory 11 is shifted. On the other hand, if there is no difference between the two, the scene change detection unit 13 determines that there is no scene change and outputs an image signal to the moving region detection unit 14.

動領域検出部１４は、シーンチェンジ検出部１３から画像信号を入力し、入力した画像信号における動領域を検出する。この動領域検出部１４による動領域の検出について図５を用いて具体的に説明する。なお、図５に示したＦ（ｔ＋３）を動領域の検出対象として説明する。 The moving area detection unit 14 receives an image signal from the scene change detection unit 13 and detects a moving area in the input image signal. The detection of the moving area by the moving area detector 14 will be specifically described with reference to FIG. Note that F (t + 3) shown in FIG. 5 will be described as a moving area detection target.

動領域検出部１４は、例えばDaubechiesの２次のウェーブレット（サポート長＝４）を用いて、図５（ａ）及び（ｂ）の模式図に示すようにＦ（ｔ）、Ｆ（ｔ＋１）、Ｆ（ｔ＋２）、Ｆ（ｔ＋３）からなる動画像フレーム列の時間軸方向の時系列データに対して１階１次元離散ウェーブレット分解を行い、その高周波成分であるＭＡ（ｔ）を算出する。すなわち、ＭＡ（ｔ）は、サポート長４フレーム分の時系列変化の成分を含むものである。 The moving region detection unit 14 uses, for example, Daubechies secondary wavelets (support length = 4), as shown in the schematic diagrams of FIGS. 5A and 5B, F (t), F (t + 1), First-order one-dimensional discrete wavelet decomposition is performed on time-series data in the time axis direction of a moving image frame sequence composed of F (t + 2) and F (t + 3), and MA (t) that is a high-frequency component is calculated. That is, MA (t) includes a time-series change component for a support length of 4 frames.

同様に、Ｆ（ｔ＋２）、Ｆ（ｔ＋３）、Ｆ（ｔ＋４）、Ｆ（ｔ＋５）からなる時間軸方向の時系列データからＭＡ（ｔ＋２）を算出する。 Similarly, MA (t + 2) is calculated from time series data in the time axis direction including F (t + 2), F (t + 3), F (t + 4), and F (t + 5).

そして、ＭＡ（ｔ）とＭＡ（ｔ＋２）とのＡＮＤ計算を行うことによって、ＭＡ（ｅｖｅｎ）を算出する。 Then, MA (even) is calculated by performing AND calculation of MA (t) and MA (t + 2).

また、Ｆ（ｔ＋１）、Ｆ（ｔ＋２）、Ｆ（ｔ＋３）、Ｆ（ｔ＋４）からなる時間軸方向の時系列データから、同様にＭＡ（ｔ＋１）を算出する。 Further, MA (t + 1) is similarly calculated from time-series data in the time axis direction including F (t + 1), F (t + 2), F (t + 3), and F (t + 4).

次に、Ｆ（ｔ＋３）、Ｆ（ｔ＋４）、Ｆ（ｔ＋５）、Ｆ（ｔ＋６）からなる時間軸方向の時系列データから、同様にＭＡ（ｔ＋３）を算出する。 Next, MA (t + 3) is similarly calculated from time-series data in the time axis direction including F (t + 3), F (t + 4), F (t + 5), and F (t + 6).

そして、ＭＡ（ｔ＋１）とＭＡ（ｔ＋３）とのＡＮＤ計算を行うことによって、ＭＡ（ｏｄｄ）を算出する。 Then, MA (odd) is calculated by performing AND calculation of MA (t + 1) and MA (t + 3).

次に、ＭＡ（ｅｖｅｎ）とＭＡ（ｏｄｄ）とのＡＮＤ計算を行って２値化処理を行い、Ｍａ（ｔ）を算出し、Ｆ（ｔ＋３）とＭａ（ｔ）とのＡＮＤ計算を行うことによって、１フレーム分の動領域画像フレーム列Ｍ（ｔ）を算出する。 Next, an AND calculation of MA (even) and MA (odd) is performed, binarization processing is performed, Ma (t) is calculated, and AND calculation of F (t + 3) and Ma (t) is performed. Thus, a moving area image frame sequence M (t) for one frame is calculated.

続いて、動ベクトル検出部１５は、図６に示すように、動領域画像フレーム列Ｍ（ｔ）の各フレームに対して空間方向にｎ階２次元離散ウェーブレット分解を行い、各ｎ階における低周波成分Ｍ＿ＬＬ^ｎ（ｔ）と、水平・垂直・対角方向の高周波成分Ｍ＿ＬＨ^ｎ（ｔ）・Ｍ＿ＨＬ^ｎ（ｔ）・Ｍ＿ＨＨ^ｎ（ｔ）とを得る。 Subsequently, as shown in FIG. 6, the motion vector detection unit 15 performs n-order two-dimensional discrete wavelet decomposition in the spatial direction on each frame of the motion region image frame sequence M (t), A frequency component M_LL ⁿ (t) and horizontal, vertical, and diagonal high frequency components M_LH ⁿ (t), M_HL ⁿ (t), and M_HH ⁿ (t) are obtained.

また、動ベクトル検出部１５は、動領域画像フレーム列Ｍ（ｔ）の時刻ｔにおけるフレームＭ（ｔ）を基準フレームとし、時刻ｔ＋１におけるフレームＭ（ｔ＋１）を参照フレームとして、フレームＭ（ｔ）とＭ（ｔ＋１）との間で動ベクトルの検出を階層的に行う。ここで、動ベクトル検出部１５による動ベクトルの検出処理について図７を用いて詳細に説明する。図７は、動ベクトル検出部１５が実行する動ベクトルの検出処理を示すフローチャートである。なお、対象とする動画像フレームＦ（ｔ）が０≦ｔ≦Ｔの時間範囲にあるものとする。 In addition, the motion vector detection unit 15 uses the frame M (t) at time t in the motion area image frame sequence M (t) as a reference frame, and sets the frame M (t + 1) at time t + 1 as a reference frame, to frame M (t). And M (t + 1) are detected in a hierarchical manner. Here, the motion vector detection processing by the motion vector detection unit 15 will be described in detail with reference to FIG. FIG. 7 is a flowchart showing a motion vector detection process executed by the motion vector detection unit 15. It is assumed that the target moving image frame F (t) is in the time range of 0 ≦ t ≦ T.

まず、時刻ｔを"０"に初期化し（ステップＳ１０）、ウェーブレット分解の階層を示すインデックスｎを"１"に初期化する（ステップＳ１１）。 First, the time t is initialized to “0” (step S10), and the index n indicating the wavelet decomposition hierarchy is initialized to “1” (step S11).

次に、低周波成分Ｍ＿ＬＬ^ｎ（ｔ）と低周波成分Ｍ＿ＬＬ^ｎ（ｔ＋１）との間でブロックマッチング法を用いて、低周波成分Ｍ＿ＬＬ^ｎ（ｔ）の画素位置（ｉ，ｊ）に対応する動ベクトルＶ^ｎ _ｉ，ｊ（ｔ）の検出を行う（ステップＳ１２）。 Next, a block matching method is used between the low frequency component M_LL ⁿ (t) and the low frequency component M_LL ⁿ (t + 1) to correspond to the pixel position (i, j) of the low frequency component M_LL ⁿ (t). The motion vector V ⁿ _{i, j} (t) is detected (step S12).

次に、低周波成分Ｍ＿ＬＬ^ｎ（ｔ）、水平高周波成分Ｍ＿ＬＨ^ｎ（ｔ）、垂直高周波成分Ｍ＿ＨＬ^ｎ（ｔ）及び対角高周波成分Ｍ＿ＨＨ^ｎ（ｔ）の画素位置（ｉ，ｊ）における画素値Ｍ＿ＬＬ^ｎ _ｉ，ｊ（ｔ）、Ｍ＿ＬＨ^ｎ _ｉ，ｊ（ｔ）、Ｍ＿ＨＬ^ｎ _ｉ，ｊ（ｔ）、Ｍ＿ＨＨ^ｎ _ｉ，ｊ（ｔ）でＶ^ｎ _ｉ，ｊ（ｔ）ずれた画素位置における画素値を置換し、動き補正低周波成分Ｍ'＿ＬＬ^ｎ（ｔ）、動き補正水平高周波成分Ｍ'＿ＬＨ^ｎ（ｔ）、動き補正垂直高周波成分Ｍ'＿ＨＬ^ｎ（ｔ）及び動き補正対角高周波成分Ｍ'＿ＨＨ^ｎ（ｔ）を算出する（ステップＳ１３）。 Next, the pixel value at the pixel position (i, j) of the low frequency component M_LL ⁿ (t), the horizontal high frequency component M_LH ⁿ (t), the vertical high frequency component M_HL ⁿ (t), and the diagonal high frequency component M_HH ⁿ (t). Pixels at pixel positions shifted by V ⁿ _{i, j} (t) by M_LL ⁿ _{i, j} (t), M_LH ⁿ _{i, j} (t), M_HL ⁿ _{i, j} (t), and M_HH ⁿ _{i, j} (t) Replace values, motion correction low frequency component M′_LL ⁿ (t), motion correction horizontal high frequency component M′_LH ⁿ (t), motion correction vertical high frequency component M′_HL ⁿ (t) and motion correction diagonal high frequency component M′_HH ⁿ (t) is calculated (step S13).

次に、動き補正低周波成分Ｍ'＿ＬＬ^ｎ（ｔ）、動き補正水平高周波成分Ｍ'＿ＬＨ^ｎ（ｔ）、動き補正垂直高周波成分Ｍ'＿ＨＬ^ｎ（ｔ）及び動き補正対角高周波成分Ｍ'＿ＨＨ^ｎ（ｔ）を１階再構成して、動き補正低周波成分Ｍ'＿ＬＬ^ｎ−１（ｔ）を算出する（ステップＳ１４）。 Next, the motion correction low frequency component M′_LL ⁿ (t), the motion correction horizontal high frequency component M′_LH ⁿ (t), the motion correction vertical high frequency component M′_HL ⁿ (t), and the motion correction diagonal high frequency component M ′ _HH ⁿ (t) is reconstructed to the first floor, and the motion-corrected low-frequency component M′_LL ⁿ⁻¹ (t) is calculated (step S14).

次に、低周波成分Ｍ＿ＬＬ^ｎ−１（ｔ）を動き補正低周波成分Ｍ'＿ＬＬ^ｎ−１（ｔ）で置換する（ステップＳ１５）。 Next, the low frequency component M_LL ⁿ⁻¹ (t) is replaced with the motion corrected low frequency component M′_LL ⁿ⁻¹ (t) (step S15).

次に、ウェーブレット分解の階層を示すインデックスｎが最大値Ｎに到達したか否かを判定し（ステップＳ１６）、否定判定した場合はインデックスｎをインクリメントして（ステップＳ１７）、ステップＳ１２の処理に戻る。 Next, it is determined whether or not the index n indicating the wavelet decomposition hierarchy has reached the maximum value N (step S16). If a negative determination is made, the index n is incremented (step S17), and the process of step S12 is performed. Return.

ステップＳ１６において、インデックスｎが最大値Ｎに到達したと判定した場合は、［数１］より、元の動領域画像Ｍ（ｔ）に対する動ベクトルＶ^０ _ｉ，ｊ（ｔ）を算出する（ステップＳ１８）。 If it is determined in step S16 that the index n has reached the maximum value N, the motion vector V ⁰ _{i, j} (t) for the original motion area image M (t) is calculated from [Equation 1] (step 1). S18).

次に、時刻ｔが時刻Ｔに到達したか否かを判定し（ステップＳ１９）、否定判定したときは時刻ｔをインクリメントして（ステップＳ２０）、ステップＳ１１の処理に戻る。 Next, it is determined whether or not the time t has reached the time T (step S19). If a negative determination is made, the time t is incremented (step S20), and the process returns to step S11.

ステップＳ１９において、時刻ｔが時刻Ｔに到達したと判定した場合は、動ベクトルの検出処理を終了する。 If it is determined in step S19 that time t has reached time T, the motion vector detection process ends.

引き続き、動ベクトル検出部１５は、Ｆ（ｔ_０）上の任意画素位置（ｋ，ｍ）において、フレーム間の動ベクトルＶ^０ _ｉ，ｊ（ｔ）を用い、Ｆ（ｔ）；−ｑ≦ｔ≦ｑにおける動きを追跡する。この時系列のオプティカルフロー（画面上の各点の動き場）情報から、動ベクトル検出部１５は、オブジェクトの抽出・追跡を行う。 Subsequently, the motion vector detection unit 15 uses the motion vector V ⁰ _{i, j} (t) between frames at an arbitrary pixel position (k, m) on F (t ₀ ), and F (t); −q ≦ Track the motion at t ≦ q. From this time-series optical flow (motion field of each point on the screen) information, the motion vector detection unit 15 extracts and tracks an object.

具体的には、動ベクトル検出部１５は、動ベクトルＶ^０ _ｉ，ｊ（ｔ'）（１≦ｉ≦Ｉ、１≦ｊ≦Ｊ、ｔ_０−ｑ≦ｔ'）を用いて、動画像フレームＦ（ｔ）の画素位置（ｉ，ｊ）の動きを時刻ｔ_０からｔ_０−ｑに渡って時間軸マイナス方向に追跡する。さらに、動ベクトルＶ^０ _ｉ，ｊ（ｔ'）（１≦ｉ≦Ｉ、１≦ｊ≦Ｊ、ｔ'≦ｔ_０＋ｑ）を用いて、動画像フレームＦ（ｔ）の画素位置（ｉ，ｊ）の動きを時刻ｔ_０からｔ_０＋ｑに渡って時間軸プラス方向に追跡する。 Specifically, the motion vector detection unit 15 uses the motion vector V ⁰ _{i, j} (t ′) (1 ≦ i ≦ I, 1 ≦ j ≦ J, t ₀ −q ≦ t ′) to generate a moving image. pixel position of the frame F (t) (i, j ) to track the motion from time _{t 0} to the time axis negative direction over _{t 0-q} of. Further, using the motion vector V ⁰ _{i, j} (t ′) (1 ≦ i ≦ I, 1 ≦ j ≦ J, t ′ ≦ t _{0 + q} ), the pixel position (i, j) of the moving image frame F (t) is used. ) to track the motion from time t ₀ to the time axis plus direction over t _{0 + q} of.

次に、動画像フレームＦ（ｔ）の画素位置（ｉ，ｊ）の追跡結果に基づいて、動ベクトルＶ^０ _ｉ，ｊ（ｔ'）（ｔ_０−ｑ≦ｔ'≦ｔ_０＋ｑ）の絶対値が最小となる時刻ｔ_ｓと、追跡位置（ｋ，ｍ）とを決定する。また、連続するフレーム間における同一オブジェクトの動き方向はほぼ連続していると考えられるので、動ベクトル検出部１５は、動ベクトルＶ^０ _ｉ，ｊ（ｔ'）（ｔ_０−ｑ≦ｔ'≦０）において動ベクトル値及びオブジェクト位置を追跡し、オブジェクトがフレームインする時刻を出現オブジェクト時刻ｔ＿ｉｎとして決定する。また、動ベクトル検出部１５は、動ベクトルＶ^０ _ｉ，ｊ（ｔ）とＶ^０ _ｉ，ｊ（ｔ−１）との内積を計算し、計算で求めた内積値と予め定めた閾値ｔｈとを比較する。この閾値ｔｈは、動ベクトルの動きの連続性を判定するためのものであり、内積値が閾値ｔｈより大きければ動ベクトルの動きの連続性が損なわれていることを示す。内積値が閾値ｔｈより大きい場合、オブジェクトが背景から前景に現れる等の動きがあったと推測されるため、動ベクトル検出部１５は、該当時刻を出現オブジェクト時刻ｔ＿ａｐｐｅａｒとして決定する。 Next, based on the tracking result of the pixel position (i, j) of the moving image frame F (t), the absolute _{value of} the moving vector V ⁰ _{i, j} (t ′) (t _0−q ≦ t ′ ≦ t _{0 + q} ) and time t _s the value is minimized, it determines the tracking position (k, m). In addition, since the motion direction of the same object between successive frames is considered to be substantially continuous, the motion vector detection unit 15 performs motion vector V ⁰ _{i, j} (t ′) (t _0−q ≦ t ′ ≦ 0), the motion vector value and the object position are tracked, and the time at which the object enters the frame is determined as the appearance object time t_in. The motion vector detection unit 15 calculates an inner product of the motion vectors V ⁰ _{i, j} (t) and V ⁰ _{i, j} (t−1), and calculates the inner product value obtained by the calculation and a predetermined threshold th. Compare This threshold value th is for determining the continuity of motion vector motion. If the inner product value is larger than the threshold value th, it indicates that the motion vector motion continuity is impaired. If the inner product value is larger than the threshold th, it is presumed that there has been a motion such as an object appearing in the foreground from the background, so the motion vector detection unit 15 determines the corresponding time as the appearance object time t_appear.

図３に戻り、動作の説明を続ける。出現オブジェクト検出部１６は、動ベクトル検出部１５が出現オブジェクト時刻ｔ＿ｉｎやｔ＿ａｐｐｅａｒを決定した場合、出現オブジェクトが検出されたと判断し、出現オブジェクトの位置を示すＯＢＪ領域指示信号を信号帯域制限部１８及びピクチャ切替部１９に出力する。 Returning to FIG. 3, the description of the operation will be continued. When the motion vector detection unit 15 determines the appearance object time t_in or t_appear, the appearance object detection unit 16 determines that the appearance object has been detected, and sends the OBJ region instruction signal indicating the position of the appearance object to the signal band limiting unit 18 and The image is output to the picture switching unit 19.

並行して、跳躍性眼球運動領域検出部１７は、動ベクトル検出部１５が出力するオプティカルフロー情報及び移動速度閾値に基づき、跳躍性眼球運動を生じさせるような動き速度を持ったオブジェクトの動き位置を検出し、該当するオブジェクトを検出した場合はそのオブジェクトの位置を示すＳ領域指示信号を信号帯域制限部１８及びピクチャ切替部１９に出力する。 In parallel, the jumping eye movement region detection unit 17 is based on the optical flow information output from the motion vector detection unit 15 and the movement speed threshold, and the movement position of the object having a movement speed that causes jumping eye movement. When a corresponding object is detected, an S area instruction signal indicating the position of the object is output to the signal band limiting unit 18 and the picture switching unit 19.

次に、信号帯域制限部１８は、出現オブジェクト検出部１６からのＯＢＪ領域指示信号、又は跳躍性眼球運動領域検出部１７からのＳ領域指示信号のいずれかを入力した場合は、検出されたオブジェクトの表示位置付近の画像の空間周波数帯域を制限し、所定の高周波成分を除去した画像信号を圧縮符号化処理装置に出力する。一方、信号帯域制限部１８は、ＯＢＪ領域指示信号及びＳ領域指示信号を入力しない場合は、画像信号の空間周波数帯域を制限せず、フレームメモリ１１から入力した画像信号をそのまま圧縮符号化処理装置に出力する。 Next, when either the OBJ region instruction signal from the appearance object detection unit 16 or the S region instruction signal from the jumping eye movement region detection unit 17 is input, the signal band limiting unit 18 detects the detected object. The spatial frequency band of the image near the display position is limited, and an image signal from which a predetermined high-frequency component has been removed is output to the compression coding processing apparatus. On the other hand, when the OBJ region instruction signal and the S region instruction signal are not input, the signal band limiting unit 18 does not limit the spatial frequency band of the image signal, and directly compresses the image signal input from the frame memory 11. Output to.

並行して、ピクチャ切替部１９は、出現オブジェクト検出部１６からのＯＢＪ領域指示信号、又は跳躍性眼球運動領域検出部１７からのＳ領域指示信号のいずれかを入力した場合は、圧縮符号化処理装置の圧縮符号化において、現在符号化を行いたいフレームの画像信号をＩピクチャ、Ｐピクチャ、Ｂピクチャのいずれにするかを指定するためのピクチャ切替制御信号を出力する。一方、ピクチャ切替部１９は、ＯＢＪ領域指示信号及びＳ領域指示信号を入力しない場合は、ピクチャ切替制御信号を出力しない。 In parallel, when the picture switching unit 19 receives either the OBJ region instruction signal from the appearance object detection unit 16 or the S region instruction signal from the jumping eye movement region detection unit 17, the compression coding process is performed. In the compression encoding of the apparatus, a picture switching control signal for designating whether an image signal of a frame to be encoded at present is an I picture, a P picture, or a B picture is output. On the other hand, the picture switching unit 19 does not output the picture switching control signal when the OBJ area instruction signal and the S area instruction signal are not input.

なお、動作説明において述べた各処理をプログラミングし、当該プログラムによってコンピュータを動作させることにより、当該プログラムは、コンピュータを圧縮符号化前置処理装置１０として機能させることができる。この場合、コンピュータは、図３に示した構成を有することとなる。 Note that by programming each process described in the explanation of the operation and causing the computer to operate according to the program, the program can cause the computer to function as the compression encoding pre-processing device 10. In this case, the computer has the configuration shown in FIG.

以上のように、本実施の形態における圧縮符号化前置処理装置１０によれば、オプティカルフロー情報及び跳躍性眼球運動による注視点移動を考慮して、跳躍性眼球運動により動オブジェクトに対する観視者の視覚が低下する期間において動オブジェクトを含む動領域の信号帯域を制限する構成としたので、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる。 As described above, according to the compression coding pre-processing apparatus 10 according to the present embodiment, the viewer of the moving object by the jumping eye movement is considered in consideration of the optical flow information and the gaze point movement by the jumping eye movement. Since the signal band of the moving area including the moving object is limited during the period when the visual perception of the image decreases, a highly versatile high-compression code that does not depend on the visual characteristics of the viewer can be efficiently obtained.

また、本実施の形態における圧縮符号化前置処理装置１０によれば、出現オブジェクトのオプティカルフロー情報及び跳躍性眼球運動による注視点移動を考慮して、観視者が出現オブジェクトの領域を周辺視で観視すると想定される時から跳躍性眼球運動により出現オブジェクトの領域に中心視野を移動すると想定される時までの期間において出現オブジェクトを含む動領域の信号帯域を制限する構成としたので、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる。 In addition, according to the compression coding pre-processing device 10 in the present embodiment, the viewer views the region of the appearing object in the peripheral view in consideration of the optical flow information of the appearing object and gaze point movement due to the jumping eye movement. The signal band of the moving region including the appearing object is limited during the period from the time when it is assumed to be viewed in the period until it is assumed that the central visual field is moved to the area of the appearing object by the jumping eye movement. A highly versatile high-compression code that does not depend on the visual characteristics of the viewer can be obtained efficiently.

また、本実施の形態における圧縮符号化前置処理装置１０によれば、前景出現オブジェクトのオプティカルフロー情報及び跳躍性眼球運動による注視点移動を考慮して、観視者が前景出現オブジェクトの領域を周辺視で観視すると想定される時から跳躍性眼球運動により前景出現オブジェクトの領域に中心視野を移動すると想定される時までの期間において前景出現オブジェクトを含む動領域の信号帯域を制限する構成としたので、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる。 In addition, according to the compression coding pre-processing apparatus 10 in the present embodiment, the viewer determines the area of the foreground appearance object in consideration of the optical flow information of the foreground appearance object and gaze point movement due to the jumping eye movement. A configuration that limits the signal band of the moving region including the foreground appearance object during a period from when it is assumed to be viewed in peripheral vision to when it is assumed that the central visual field is moved to the region of the foreground appearance object by jumping eye movement Therefore, a highly versatile high-compression code that does not depend on the visual characteristics of the viewer can be efficiently obtained.

また、本実施の形態における圧縮符号化前置処理装置１０によれば、オプティカルフロー情報及び跳躍性眼球運動による注視点移動を考慮して、フレーム内符号化画像信号、フレーム間順方向予測符号化画像信号及び双方向予測符号化画像信号のいずれかを圧縮符号化時の画像信号として圧縮符号化処理装置に指示する構成としたので、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができる。 In addition, according to the compression coding pre-processing device 10 in the present embodiment, intra-frame coded image signals and inter-frame forward prediction coding are performed in consideration of optical flow information and gaze point movement due to jumping eye movements. Since either the image signal or the bi-directional predictive encoded image signal is instructed to the compression encoding processing apparatus as an image signal at the time of compression encoding, high versatility and high compression that does not depend on the visual characteristics of the viewer The code can be obtained efficiently.

特に、圧縮符号化前置処理装置１０は、広視野映像システムにおいて大きな効果を得ることができる。具体的には、画面内の動きオブジェクトが動く時間は一般的に平均１秒程度であって、跳躍性眼球運動により画面上の動領域の帯域制限を行う期間は約１５０〜２５０ｍｓｅｃ程度であり、画面内の動領域（一般にフレーム内の約６割〜８割）の殆どはＳＨＶでは跳躍性眼球運動で追うこととなるので、広視野・超高精細映像であるＳＨＶでは本発明の効果が非常に高いと言える。 In particular, the compression coding pre-processing device 10 can obtain a great effect in a wide-field video system. Specifically, the time for the moving object in the screen to move is generally about 1 second on average, and the period for limiting the bandwidth of the moving area on the screen by jumping eye movement is about 150 to 250 msec, Since most of the moving area in the screen (generally about 60% to 80% in the frame) is followed by jumping eye movement in SHV, the effect of the present invention is very effective in SHV which is a wide field of view / ultra high definition video. It can be said that it is very expensive.

以上のように、本発明に係る圧縮符号化前置処理装置及び圧縮符号化前置処理プログラムは、観視者の視覚特性に依存しない汎用性の高い高圧縮符号を効率的に得ることができるという効果を有し、圧縮符号化する前の画像信号を処理する圧縮符号化前置処理装置及び圧縮符号化前置処理プログラム等として有用である。 As described above, the compression coding pre-processing apparatus and the compression coding pre-processing program according to the present invention can efficiently obtain a highly-compressed high-compression code that does not depend on the visual characteristics of the viewer. It is useful as a compression encoding pre-processing apparatus and a compression encoding pre-processing program for processing an image signal before compression encoding.

動き視標に対する跳躍性眼球運動時の眼球の応答を示す概念図Conceptual diagram showing the response of the eyeball during jumping eye movements to a motion target 跳躍性眼球運動時の高速動き視標に対する眼球運動を示す概念図Conceptual diagram showing eye movements for high-speed movement targets during jumping eye movements 本発明に係る圧縮符号化前置処理装置の一実施の形態における構成を示すブロック図The block diagram which shows the structure in one Embodiment of the compression coding pre-processing apparatus which concerns on this invention ＳＨＶ、ＨＤＴＶ及びＮＴＳＣの各映像方式で画面を構成する画素数と、標準観視距離と、標準観視距離での画角とを示す図The figure which shows the number of pixels which comprise a screen by each video system of SHV, HDTV, and NTSC, a standard viewing distance, and an angle of view in a standard viewing distance 本発明に係る圧縮符号化前置処理装置の一実施の形態において、動領域検出部による動領域の検出についての説明図Explanatory drawing about the detection of the moving region by a moving region detection part in one Embodiment of the compression coding pre-processing apparatus which concerns on this invention 本発明に係る圧縮符号化前置処理装置の一実施の形態において、動ベクトル検出部が動領域画像フレーム列Ｍ（ｔ）に対して行うｎ階２次元離散ウェーブレット分解の模式図FIG. 3 is a schematic diagram of n-th order two-dimensional discrete wavelet decomposition performed by the motion vector detection unit on the motion region image frame sequence M (t) in the embodiment of the compression coding pre-processing apparatus according to the present invention. 本発明に係る圧縮符号化前置処理装置の一実施の形態において、動ベクトル検出部が実行する動ベクトルの検出処理を示すフローチャートThe flowchart which shows the detection process of the motion vector which a motion vector detection part performs in one Embodiment of the compression coding pre-processing apparatus which concerns on this invention

Explanation of symbols

１０圧縮符号化前置処理装置
１１（１１ａ〜１１ｄ）フレームメモリ
１２クロック信号発生器
１３シーンチェンジ検出部
１４動領域検出部（動領域検出手段）
１５（１５ａ〜１５ｃ）動ベクトル検出部（動オブジェクト抽出手段、オプティカルフロー情報取得手段）
１６出現オブジェクト検出部（動オブジェクト抽出手段）
１７跳躍性眼球運動領域検出部
１８信号帯域制限部（信号帯域制限手段）
１９ピクチャ切替部（画像信号指示手段）
２０（２０ａ〜２０ｄ）動領域画像信号 DESCRIPTION OF SYMBOLS 10 Compression encoding pre-processing apparatus 11 (11a-11d) Frame memory 12 Clock signal generator 13 Scene change detection part 14 Motion area detection part (motion area detection means)
15 (15a-15c) Motion vector detection unit (moving object extraction means, optical flow information acquisition means)
16 Appearance object detection part (moving object extraction means)
17 Jumping Eye Movement Region Detection Unit 18 Signal Band Limiting Unit (Signal Band Limiting Unit)
19 Picture switching unit (image signal instruction means)
20 (20a-20d) Moving region image signal

Claims

A compression coding pre-processing device that processes an image signal before compression coding and outputs the processed image signal to a compression coding processing device,
A moving region detecting means for detecting a moving region having movement from an image related to the input image signal by a predetermined method;
An optical flow information acquisition means for extracting a moving object from the detected moving area and acquiring optical flow information of the extracted moving object;
An appearance object that appears in the screen from the outside of the screen on which the image is displayed or a foreground appearance object that appears from a state hidden in the image displayed on the screen is detected based on the optical flow information, and detected. An appearance object detection unit that outputs an object area instruction signal indicating a position of the appearance object or the foreground appearance object;
An object having a movement speed equal to or higher than a movement speed threshold for generating a jumping eye movement with respect to a viewing distance indicating a size of the screen and a distance from a viewer who views the screen to the screen. A jumping eye movement region detection unit that outputs a jumping eye movement region signal indicating the position of the detected object when detected based on information;
Signal band limiting means for limiting the spatial frequency band of the moving region of the image signal and outputting the same to the compression coding processing device during a period in which the object region instruction signal or the jumping eye movement region signal is output. A compression coding pre-processing apparatus characterized by the above.

The compression coding pre-processing apparatus further includes an image signal instruction means for inputting the object area instruction signal and the jumping eye movement area signal,
The image signal instructing means encodes an image signal output from the compression encoding processing device using an intra-frame encoded image signal encoded by intra-frame encoding and a frame image signal previously reproduced during reproduction. Bi-directional encoding using an inter-frame forward predictive encoded image signal encoded by forward predictive encoding or a frame image signal reproduced earlier during reproduction and a frame image signal reproduced later 2. The compression coding pre-processing device according to claim 1, wherein the compression coding processing device is instructed to select one of bidirectional prediction coded image signals to be coded by predictive coding .

A compression encoding preprocessing program that causes a computer to function as a compression encoding preprocessing device that processes an image signal before compression encoding and outputs the processed image signal to a compression encoding processing device,
The computer,
A moving region detecting means for detecting a moving region having movement from an image related to the input image signal by a predetermined method;
An optical flow information acquisition means for extracting a moving object from the detected moving area and acquiring optical flow information of the extracted moving object;
An appearance object that appears in the screen from the outside of the screen on which the image is displayed or a foreground appearance object that appears from a state hidden in the image displayed on the screen is detected based on the optical flow information, and detected. Appearance object detection means for outputting an object area instruction signal indicating the position of the appearance object or the foreground appearance object;
An object having a movement speed equal to or higher than a movement speed threshold for generating a jumping eye movement with respect to a viewing distance indicating a size of the screen and a distance from a viewer who views the screen to the screen. A jumping eye movement region detecting means for outputting a jumping eye movement region signal indicating the position of the detected object when detected based on information;
In the period in which the object area instruction signal or the jumping eye movement area signal is output, the spatial frequency band of the moving area of the image signal is limited to function as signal band limiting means for outputting to the compression coding processing apparatus. A compression encoding preprocessing program characterized by the above.

The compression coding pre-processing apparatus further includes an image signal instruction means for inputting the object area instruction signal and the jumping eye movement area signal,
The compression encoding pre-processing program stores the computer,
Further, as an image signal output from the compression coding processing apparatus, forward prediction is performed by using an intra-frame encoded image signal encoded by intra-frame encoding, and a frame image signal previously reproduced during reproduction. Coded by bi-directional predictive coding, which is coded using an inter-frame forward-predicted encoded image signal encoded by encoding, or a frame image signal reproduced earlier during reproduction and a frame image signal reproduced later 4. The compression coding preprocessing program according to claim 3, wherein the compression coding preprocessing program is made to function as an image signal instruction means for instructing the compression coding processing device which of the two-way predictive coded image signals to be converted to. .