JP6299103B2

JP6299103B2 - Object recognition device, object recognition program used for the object recognition device, and moving body control system

Info

Publication number: JP6299103B2
Application number: JP2013156606A
Authority: JP
Inventors: 関　海克; 海克関
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2013-07-29
Filing date: 2013-07-29
Publication date: 2018-03-28
Anticipated expiration: 2033-07-29
Also published as: JP2015026310A

Description

The present invention relates to an object recognition device, an object recognition program used for the object recognition device, and a moving body control system.

In recent years, object recognition devices have been developed that recognize moving objects ahead of the host vehicle, such as traveling vehicles and pedestrians, detect dangerous situations early (for example, another vehicle cutting in between the host vehicle and the preceding vehicle, or a pedestrian darting out), and warn the driver in order to prevent danger.

For object recognition devices, a technique is known that recognizes pedestrians and two-wheeled vehicles using color and spatial frequency as feature quantities (see, for example, Patent Document 1). However, when the background becomes complex, more objects in the background have the same color and the same spatial frequency as the subject, so the recognition rate for the target moving object falls and the false recognition rate rises.

Accordingly, a technique has been proposed that calculates parallax from images obtained by cameras to obtain a parallax image (also called a distance image), and uses the parallax image together with a luminance image to improve the efficiency and accuracy of identifying a moving object as the target (see Patent Document 2). A parallax image is an image whose pixel values are parallax values.

However, even with the technique disclosed in Patent Document 2, recognition accuracy falls and the false recognition rate rises when the background becomes complex.
The present invention has been made in view of the above circumstances, and its object is to provide an object recognition device capable of improving object recognition accuracy and reducing object misrecognition, an object recognition program used for the object recognition device, and a moving body control system.

The object recognition device of the present invention comprises: a stereo image input unit to which stereo image signals obtained by a stereo camera are sequentially input; a luminance image construction unit that sequentially stores, frame by frame, at least one of the stereo images output from the stereo image input unit and constructs luminance image frames; a parallax image construction unit that calculates parallax using the stereo images output in synchronization with the output from the stereo image input unit to the luminance image construction unit and constructs parallax image frames; and an object recognition unit that recognizes an object using the luminance image frames and the parallax image frames. The object recognition unit includes an object feature amount calculation unit that calculates a luminance feature amount from the luminance image frame and a parallax feature amount from the parallax values of the pixels used to calculate that luminance feature amount, and an object determination processing unit that determines the object by calculating an evaluation value from learning data learned in advance for the luminance feature amount and the parallax feature amount, together with the luminance feature amount and the parallax feature amount.
The object feature amount calculation unit calculates a parallax mean value and a parallax standard deviation as the parallax feature amount. The stored data includes a weighting coefficient for weighting the luminance feature amount and weighting coefficients for weighting the parallax feature amount when obtaining the evaluation value.
The arithmetic expression for obtaining the evaluation value is:
[expression not reproduced in this text]
Here, F(x) is the evaluation function for obtaining the evaluation value, h_t(x) is a luminance feature amount, α_t is the coefficient for weighting the luminance feature amount h_t(x), α_d is the coefficient for weighting the parallax mean value, and α_σd is the coefficient for weighting the parallax standard deviation.
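The expression itself is not reproduced in this text, so its exact form cannot be confirmed here. As a minimal sketch, assuming the evaluation value is a plain weighted sum of the luminance feature amounts h_t(x) plus the weighted parallax mean and the weighted parallax standard deviation, the coefficients defined above could be combined as follows. The function name, argument names, and the linear form are all assumptions made for illustration.

```python
def evaluation_value(h, alpha_t, d_mean, d_std, alpha_d, alpha_sigma_d):
    """Hypothetical evaluation value F(x), ASSUMED to be a weighted sum.

    h             : luminance feature amounts h_t(x), one per t
    alpha_t       : weighting coefficients for each h_t(x)
    d_mean, d_std : parallax mean value and parallax standard deviation
    alpha_d       : weighting coefficient for the parallax mean value
    alpha_sigma_d : weighting coefficient for the parallax standard deviation
    """
    if len(h) != len(alpha_t):
        raise ValueError("h and alpha_t must have the same length")
    # Weighted luminance term: sum over t of alpha_t * h_t(x).
    luminance_term = sum(a * f for a, f in zip(alpha_t, h))
    # Weighted parallax terms (assumed to be added linearly).
    return luminance_term + alpha_d * d_mean + alpha_sigma_d * d_std
```

In this sketch a larger value would indicate a stronger match to the learned object; how the value is compared against a threshold is not covered here.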

According to the present invention, the object is determined from a luminance feature amount for recognizing the object, a parallax feature amount obtained from the parallax values of the pixels used to calculate that luminance feature amount, and learning data learned in advance for the luminance feature amount and the parallax feature amount. This makes it possible to improve object recognition accuracy and reduce object misrecognition.

FIG. 1 is a schematic view showing a vehicle as a moving body on which the object recognition device according to the present invention is mounted.
FIG. 2 is a block diagram showing the hardware configuration of the object recognition device shown in FIG. 1.
FIG. 3 is a block diagram for explaining the functions of the object recognition device shown in FIG. 2.
FIG. 4 is a schematic diagram for explaining the relationship between the structure of the stereo camera and parallax.
FIG. 5 is an explanatory diagram of the left and right stereo images acquired by the stereo camera and the parallax image: (a) shows an example of the image from the left photographing lens system, (b) shows an example of the image from the right photographing lens system, and (c) shows an example of the parallax image obtained from these stereo images.
FIG. 6 is a diagram schematically showing a rectangular block cut out from the luminance image.
FIG. 7 is an explanatory diagram schematically showing the division patterns used to obtain the luminance feature amount of a rectangular block and an example of obtaining the luminance feature amount with those patterns.
FIG. 8 shows an example of a learning object used to learn the learning data for the luminance feature amount and the parallax feature amount: (a) shows the luminance image of the learning object, and (b) shows the parallax image of the learning object.
FIG. 9 is a diagram for explaining the concept of the hierarchical structure of the object classifier.
FIG. 10 is a diagram for explaining a recognized object.
FIG. 11 is a flowchart for explaining the operation of the embodiment of the object recognition processing device according to the present invention.

Hereinafter, the object recognition processing device according to the present invention will be described with reference to the drawings.
FIG. 1 is a schematic view showing the appearance of a vehicle as a moving body equipped with the object recognition processing device according to the present invention.

In FIG. 1, reference numeral 1 denotes the host vehicle as a moving body, 2 denotes the object recognition device, and S denotes an object (moving object) to be tracked. The object recognition device 2 is provided in the body of the host vehicle.

FIG. 2 is a block diagram showing the hardware configuration of the stereo camera and the object recognition device 2.
The object recognition device 2 consists of circuits that process the signals output from the in-vehicle stereo camera shown in FIG. 2. In FIG. 2, 1a and 1b denote the left and right photographing lens systems, and 2a and 2b denote CMOS (Complementary Metal Oxide Semiconductor) sensors serving as the left and right imaging elements.
The object recognition device 2 may be included in the stereo camera.

Subject light is focused onto the CMOS sensors 2a and 2b by the left and right photographing lens systems 1a and 1b, respectively.
The CMOS sensors 2a and 2b convert the optical images formed on their imaging surfaces into electrical signals and output them as analog image data to the left and right CDS (Correlated Double Sampling) circuits 3a and 3b.

The CDS circuits 3a and 3b remove noise components from the analog image data output from the CMOS sensors 2a and 2b and output the result to the A/D converters 4a and 4b. The A/D converters 4a and 4b convert the analog image data into digital image data and output it to the image processing circuits 5a and 5b.

The timing of the CMOS sensors 2a and 2b, the CDS circuits 3a and 3b, and the A/D converters 4a and 4b is controlled by a timing signal generator 6 that generates timing signals. The timing signal generator 6 is controlled by a CPU (Central Processing Unit) 7. The CPU 7 also controls the image processing circuits 5a and 5b, the image compression/decompression circuit described later, and the memory card.

The image processing circuits 5a and 5b use an SDRAM (Synchronous DRAM) 8, which temporarily stores image data, to perform various kinds of image processing such as Y, Cr, Cb conversion, white balance processing, contrast correction, edge enhancement, and color conversion.

White balance processing adjusts the color density of the image information, and contrast correction adjusts its contrast. Edge enhancement adjusts the sharpness of the image information, and color conversion adjusts its hue.

The image information that has undergone signal processing and image processing is recorded on the memory card 10 via the compression/decompression circuit 9. The compression/decompression circuit 9 compresses the digital image data output from the image processing circuits 5a and 5b and outputs it to the memory card 10, and also decompresses image information read from the memory card 10 and outputs it to the image processing circuits 5a and 5b.

The CPU 7 performs various arithmetic processes in accordance with a computer-readable object recognition program. It incorporates a ROM (Read Only Memory) 11, a read-only memory in which the program and other data are stored, and a RAM (Random Access Memory) 12, a readable and writable memory that provides a work area used during processing and various data storage areas; these are interconnected by a bus line. Reference numeral 13 denotes the operation unit of the object recognition device 2. Here, the object recognition device 2 executes its functions in response to operation of the operation unit 13.

An outline of the processing functions performed by the CPU 7 of the object recognition device 2 will be described with reference to FIG. 3. Here, the CMOS sensors 2a and 2b, the CDS circuits 3a and 3b, and the A/D converters 4a and 4b of the stereo camera function as a stereo image output unit that sequentially outputs stereo image signals.

The image processing circuits 5a and 5b function as a stereo image input unit 20, which receives the stereo images output from the stereo image output unit and sequentially stores them frame by frame, and as a luminance image construction unit 21, which constructs a luminance image frame for each image frame based on at least one of the stereo images.

The CPU 7 loads the program stored in the ROM 11 and functions as a parallax image construction unit 22, which calculates parallax using the stereo images output in synchronization with the output from the stereo image input unit 20 to the luminance image construction unit 21 and constructs parallax image frames.

The CPU 7 also functions as an object recognition unit 23, which recognizes an object using the luminance image frames from the luminance image construction unit 21 and the parallax image frames from the parallax image construction unit 22.

Here, the object recognition unit 23 consists of three parts: an object feature amount calculation unit 23a, which calculates a luminance feature amount from the luminance values of the pixels in a rectangular block taken from the luminance image frame and a parallax feature amount from the pixels used to obtain that luminance feature amount; an object recognition dictionary memory unit (recording medium) 23c, which stores weighting coefficients for the feature amounts obtained in advance by learning in order to determine the object; and an object determination processing unit 23b, which determines the object from the weighting coefficients stored in the object recognition dictionary memory unit 23c and the luminance and parallax feature amounts obtained by the object feature amount calculation unit 23a. The object recognition dictionary memory unit 23c is, for example, the ROM 11.

FIG. 4 is a diagram schematically showing the relationship between the object S and the images formed by the stereo camera.
As shown in FIG. 4, the imaging target point O of the object S is imaged onto the imaging surfaces of the CMOS sensors 2a and 2b by the photographing lens systems 1a and 1b.

Let f be the focal length of the photographing lens systems 1a and 1b, D the distance between their optical axes (the baseline length), Z the distance from the stereo camera to the object (moving object) S, and Δ1 and Δ2 the shift amounts from the respective imaging centers. The parallax Δ is then Δ = Δ1 + Δ2. The shift amounts Δ1 and Δ2 are determined by the positions at which the imaging target point O is imaged on the CMOS sensors 2a and 2b. In this embodiment, the symbol Δ is also used to denote the parallax value.

The distance Z to the object (moving object) S can be obtained from the equation Z = D × (f / Δ).
The parallax image construction unit 22 calculates a parallax value for each pixel using the parallax Δ, and the object distance calculation unit 25a can calculate the distance to the imaging target point O of the object S using the expression Z = D × (f / Δ).
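The relation Z = D × (f / Δ) can be illustrated with a short sketch. The function name and the numeric values in the usage note are assumptions chosen for illustration; the units must be consistent (for example, f and Δ both in pixels, so that Z comes out in the same unit as D).

```python
def distance_from_parallax(baseline, focal_length, parallax):
    """Distance Z to the imaging target point, Z = D * (f / parallax).

    baseline     : distance D between the two optical axes
    focal_length : focal length f, in the same unit as the parallax
    parallax     : parallax value (delta1 + delta2), must be positive
    """
    if parallax <= 0:
        # Zero parallax corresponds to a point at infinity; reject it here.
        raise ValueError("parallax must be positive")
    return baseline * (focal_length / parallax)
```

With hypothetical values D = 0.12 m and f = 1000 pixels, a parallax of 12 pixels gives a distance of roughly 10 m, and halving the parallax doubles the distance.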

In FIG. 4, the symbol s' denotes the image of the object (moving object) S formed on the imaging surfaces of the CMOS sensors 2a and 2b. Here, the optical axes of the photographing lens systems 1a and 1b are assumed to be parallel in the horizontal direction (the lateral, or left-right, direction), and the pixel lines of the left and right images are assumed not to be shifted vertically relative to the horizontal optical axes. The optical axes of the photographing lens systems 1a and 1b may instead be parallel in the vertical direction.

FIG. 5 shows the stereo images and the parallax image of the object S formed on the imaging surfaces of the CMOS sensors 2a and 2b: (a) shows the left image G (G1) and (b) shows the right image G (G2). FIG. 5(c) is a schematic diagram of the parallax image G3 obtained from the stereo images. Because of parallax, the left image G1 and the right image G2 are formed at different positions on the imaging surfaces. The images G1 and G2 are temporarily stored, for example, in a memory area of the RAM 12.

Luminance image frames and parallax image frames are generated from the images G1 and G2. The luminance images G1 and G2 and the parallax image G3 captured at the same time, that is, the luminance image frames and the parallax image frames, form a sequence of consecutive, mutually synchronized image frames. Coordinates on a luminance image frame correspond one-to-one to coordinates on the parallax image frame.

The parallax image is obtained, for example, by the method described below.
First, the parallax observed where the horizontal lines at heights y1 and y2 shown in FIG. 5 cross the object S' and the road surface region R0 will be described.
Suppose the parallax values of the pixels on the horizontal line at height y1 are, for example, "..., 5, 4, ..., 12, 12, ..., 12, ..., 4, ..., 1, 1, ...". Here, the parallax values "12, 12, ..." correspond, for example, to the object S', and the parallax values "1, 1, ..." correspond to the road surface region R0.

That is, the parallax value "12" of the pixels on the horizontal line at height y1 corresponds, for example, to a preceding vehicle, as the object S, located about 10 m from the host vehicle 1 shown in FIG. 1, and the parallax value "1" corresponds to the road surface farther away than that preceding vehicle.

Suppose also that the parallax values of the pixels on the horizontal line at height y2 are, for example, "..., 25, 25, ..., 24, ..., 24, 25, ...". These parallax values correspond to the road surface region R0 that is closer to the host vehicle 1 than the preceding vehicle is.
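The example values can be tied back to Z = D × (f / Δ) with a small worked sketch. It assumes the statement that parallax 12 corresponds to about 10 m fixes the product D × f at 120 (in parallax-unit metres); that constant, the function name, and the sample lists (the listed values with the elided runs dropped) are illustrative assumptions, and the resulting distances are approximate.

```python
# ASSUMPTION: "parallax 12 at about 10 m" implies D * f = 12 * 10 = 120.
D_TIMES_F = 12 * 10.0

# Sample parallax values taken from the two example lines (elided runs dropped).
LINE_Y1 = [5, 4, 12, 12, 12, 4, 1, 1]
LINE_Y2 = [25, 25, 24, 24, 25]


def line_distances(parallax_values):
    """Convert each parallax value on a line to an approximate distance in metres."""
    return [D_TIMES_F / p for p in parallax_values]
```

Under this assumption, parallax 1 on line y1 maps to about 120 m (distant road surface), while parallax 24 to 25 on line y2 maps to about 5 m (road surface nearer than the preceding vehicle), which agrees with the ordering described in the text.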

To create the parallax image, the left and right images G1 and G2 are divided into blocks Ir, as shown in FIGS. 5(a) and 5(b). The blocks Ir are regarded as matched where the difference between blocks is smallest, and the parallax value is obtained from that match. The optimum size of the block Ir is determined and adjusted experimentally before being set.

As an example, the 1280 × 960 pixel images G1 and G2 are divided into blocks Ir of 5 × 5 size, and block matching between a block Ir of the left image G1 and a block Ir of the right image G2 is described below.

On the horizontal line at height y1, suppose the block Ir of the left image G1 is at position (x1, y1) relative to the coordinate origin (0, 0). In this example, the block Ir of the left image G1 lies on the boundary between the forward scenery on the left and the white portion on the left side of the image S1' of the object S. Let the sum of the luminance values of this block Ir of the left image G1 be, for example, Ir(x1, y1).

Meanwhile, on the horizontal line at height y1, the block Ir of the right image G2 is moved, for example, from position (0, y1) to position (x1 + Δ, y1). Let the sum of the luminance values of the block Ir at position (0, y1) be Ir(0, y1), and the sum of the luminance values of the block Ir at position (x1 + Δ, y1) be Ir(x1 + Δ, y1).

The block Ir of the left image G1 lies on the boundary of the object image S1', so the left half of the area enclosed by this block is dark and the right half is bright. In contrast, the area enclosed by the block Ir at position (0, y1) of the right image G2 is dark throughout. Because the difference between the luminance sum Ir(x1, y1) of the block in the left image G1 and the luminance sum Ir(0, y1) of the block in the right image G2 is large, the blocks are not judged to match.

The block Ir of the right image G2 is moved from position (0, y1) toward position (x1 + Δ, y1), and at each position the difference between the luminance sum Ir(x1, y1) of the block in the left image G1 and the luminance sum Ir(x, y1) of the block in the right image G2 is obtained in turn.

When the block Ir of the right image G2 reaches position (x1 + Δ, y1), the difference between the luminance sum Ir(x1, y1) of the block in the left image G1 and the luminance sum Ir(x1 + Δ, y1) of the block in the right image G2 becomes smallest. The blocks are judged to match when this difference is smallest, and the parallax Δ is obtained. Because the block Ir lies on the boundary of the image S1' of the object S, this parallax Δ is, for example, "12".
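The matching along a single horizontal line can be sketched as follows. This is a minimal illustration, assuming images are lists of rows of luminance values and that a block is addressed by its top-left corner; the function names are hypothetical. Unlike the description, which stops at (x1 + Δ, y1), the sketch simply scans the whole line and keeps the first position with the smallest difference.

```python
def block_sum(image, x, y, size):
    """Sum of luminance values in the size x size block with top-left corner (x, y)."""
    return sum(image[r][c] for r in range(y, y + size) for c in range(x, x + size))


def match_on_line(left, right, x1, y1, size):
    """Return (parallax, difference) for the left-image block at (x1, y1).

    The right-image block slides along the same line starting at x = 0. The
    position whose luminance sum differs least from the left block's sum is
    taken as the match, and the parallax is that position minus x1.
    """
    target = block_sum(left, x1, y1, size)
    best_x, best_diff = None, None
    for x in range(0, len(right[0]) - size + 1):
        diff = abs(target - block_sum(right, x, y1, size))
        if best_diff is None or diff < best_diff:  # keep the first minimum
            best_x, best_diff = x, diff
    if best_x is None:
        raise ValueError("block does not fit in the right image")
    return best_x - x1, best_diff
```

Comparing only block sums can produce ties when different blocks happen to have the same total luminance, which is why the block size is something to tune experimentally.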

The block Ir is then moved step by step from the horizontal line at height y1 toward the horizontal line at height y2. On each horizontal line at height y, the block Ir is moved in the x direction and block matching is performed. In this way, the parallax Δ is obtained successively along the boundary of the image S1' of the object S.

Next, suppose for example that the block Ir of the left image G1 is at position (x1, y2) on the horizontal line at height y2, that is, on the boundary of a white line.

Meanwhile, on the horizontal line at height y2, the block Ir of the right image G2 is moved, for example, from position (0, y2) to position (x1 + Δ, y2). Let the sum of the luminance values of the block Ir at position (0, y2) be Ir(0, y2), and the sum of the luminance values of the block Ir at position (x1 + Δ, y2) be Ir(x1 + Δ, y2).

The block Ir of the left image G1 lies on the boundary of the white line, so the left half of the area enclosed by this block is dark and the right half is bright. In contrast, the area enclosed by the block Ir at position (0, y2) of the right image G2 is dark throughout. The difference between the luminance sum Ir(x1, y2) of the block in the left image G1 and the luminance sum Ir(0, y2) of the block in the right image G2 is therefore large, and the blocks are not judged to match.

The block Ir of the right image G2 is moved from position (0, y2) toward position (x1 + Δ, y2), and at each position (x, y2) the difference between the luminance sum Ir(x1, y2) of the block in the left image G1 and the luminance sum Ir(x, y2) of the block in the right image G2 is obtained in turn.

When the block Ir of the right image G2 reaches position (x1 + Δ, y2), the difference between the luminance sum Ir(x1, y2) of the block in the left image G1 and the luminance sum Ir(x1 + Δ, y2) of the block in the right image G2 becomes smallest. The blocks are judged to match when this difference is smallest, and the parallax Δ is obtained. Because the block Ir lies on the boundary of the white line, this parallax Δ is, for example, "24" or "25". Matching points are also occasionally obtained on parts of the horizontal line at height y2 other than the white line.

The image obtained in this way is the parallax image G3, which is shown schematically in FIG. 5(c). In the parallax image G3, larger parallax values are represented by higher luminance values and smaller parallax values by lower luminance values. Where image contrast is high, many blocks Ir match, so boundary portions contain many bright points.
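The brightness encoding of the parallax image can be sketched as a simple scaling. The 8-bit output range, the use of None for pixels where no match was found, and the function name are assumptions made for illustration only.

```python
def parallax_to_gray(parallax_rows, max_parallax):
    """Map parallax values to 0..255 so that larger parallax appears brighter.

    Pixels without a match (None) are rendered as 0 (black); this handling
    is an assumption, not something stated in the description.
    """
    if max_parallax <= 0:
        raise ValueError("max_parallax must be positive")
    gray = []
    for row in parallax_rows:
        gray.append([
            0 if p is None else min(255, round(255 * p / max_parallax))
            for p in row
        ])
    return gray
```

With a maximum parallax of 25, the nearby road surface (24 to 25) would appear close to white, a vehicle at parallax 12 mid-gray, and the distant road surface (parallax 1) nearly black.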

The object recognition unit 23 determines an object S, such as a vehicle or a pedestrian, using the luminance image frames obtained by the luminance image construction unit 21.
An object recognition dictionary is created in advance by a machine learning method using learning data for the object S. A separate object recognition dictionary must be created for each type of object S.

For example, recognizing vehicles requires a dictionary that recognizes vehicles as objects, and recognizing pedestrians as the object S requires a dictionary that recognizes pedestrians. An object recognition dictionary is therefore created for each target object S.

To recognize the object S, a rectangular block BR1 is first set within the luminance image G (G1 or G2), as shown in FIG. 6. The coordinates (Xs, Ys) of the upper left corner and the coordinates (Xe, Ye) of the lower right corner of the rectangular block BR1 are determined by the position of the block within the luminance image G and by the size of the block. Rectangular blocks are selected in order from large sizes to small sizes.

In this embodiment the rectangular block BR1 is normalized, so the processing time needed to generate a block is the same from the large rectangular block BR1 down to the small rectangular block BR2. The luminance image G is expected to contain few candidates for large rectangular blocks BR1 and many for small rectangular blocks BR2, because few moving objects are likely to be immediately in front of the host vehicle. In the following, the rectangular block BR2 is assumed to correspond to the object S' shown in FIGS. 5(a) and 5(b).

Large rectangular blocks BR1 are few in the luminance image. Searching in order from the large rectangular block BR1 down to the small rectangular block BR2 therefore detects a moving object to be tracked sooner. In addition, when the image of a large object S is detected first, the output is perceived as faster.

For example, as shown in FIG. 6, a large rectangular block BR1' is scanned across the luminance image G (G1, G2) in the directions of arrows Arx and Ary to search the image for a recognition target.
Next, a slightly smaller rectangular block BR1 is scanned across the image in the same way to search for a recognition target.
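The coarse-to-fine scan described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `scan_windows`, the `step_ratio` stride, the inclusive lower-right corner convention, and the example sizes are all assumptions introduced here. The source states only that blocks are scanned in the Arx and Ary directions, from the largest size to the smallest.

```python
from typing import Iterable, Iterator, Tuple

Window = Tuple[int, int, int, int]  # (Xs, Ys, Xe, Ye), corners inclusive (assumed convention)


def scan_windows(img_w: int, img_h: int,
                 sizes: Iterable[Tuple[int, int]],
                 step_ratio: float = 0.25) -> Iterator[Window]:
    """Yield candidate blocks from the largest size to the smallest.

    Each size is scanned left to right (Arx) and then top to bottom (Ary).
    step_ratio is a hypothetical stride expressed as a fraction of the block size.
    """
    for w, h in sorted(sizes, key=lambda s: s[0] * s[1], reverse=True):
        step_x = max(1, int(w * step_ratio))
        step_y = max(1, int(h * step_ratio))
        for ys in range(0, img_h - h + 1, step_y):
            for xs in range(0, img_w - w + 1, step_x):
                yield (xs, ys, xs + w - 1, ys + h - 1)


# Toy 8x8 image with two block sizes; the 8x8 block is visited before the 4x4 blocks.
windows = list(scan_windows(8, 8, [(4, 4), (8, 8)], step_ratio=0.5))
```

Because large blocks produce few positions, visiting them first costs little and lets a nearby (large) object be reported early, which is the motivation the description gives.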

The object feature amount calculation unit 23a calculates a luminance feature amount and a parallax feature amount. For example, it calculates the luminance feature amount of the black and white rectangular areas within the rectangular block BR2.

FIGS. 7(a) to 7(d) schematically show four characteristic division patterns used to obtain the luminance feature amount for the rectangular block BR2 corresponding to the object S'. These division patterns are intended for obtaining luminance feature amounts of a vehicle.

FIG. 7(a) shows the rectangular block BR2 divided in the horizontal direction into a white rectangular area BR3 and a black rectangular area BR4. FIG. 7(b) shows the rectangular block BR2 divided in the vertical direction into a white rectangular area BR3 and a black rectangular area BR4. FIG. 7(c) shows the rectangular block BR2 divided in the horizontal direction so that a black rectangular area BR4 lies between two white rectangular areas BR3 spaced apart horizontally. FIG. 7(d) shows the rectangular block BR2 divided so that white rectangular areas BR3 and black rectangular areas BR4 lie diagonally opposite one another.

The white rectangular area BR3 and the black rectangular area BR4 shown in FIGS. 7(a) to 7(d) are each superimposed on the rectangular block BR2, and the luminance values of the pixels within each overlapped rectangular area are summed.

The difference between the sum of the luminance values of the pixels of the rectangular block BR2 overlapped by the white rectangular area BR3 and the sum for the pixels overlapped by the black rectangular area BR4 is then obtained and taken as the luminance feature amount h_t(x) of the rectangular block BR2.

An example of calculating the luminance feature amount h_t(x) for the rectangular block BR2 is described below.
Suppose the rectangular block BR2 corresponds to the object S' shown in FIG. 5(a). The object S' is substantially rectangular; its lower vehicle-body portion appears white, and its upper portion, which contains a window, appears black. Applying, for example, the division pattern of FIG. 7(b) to this block BR2 gives the following.

For example, as shown schematically in FIG. 7(e), suppose the pixels in the upper portion of the rectangular block BR2 have luminance Br = 0 (black) and the pixels in the lower half have luminance Wa = 255 (white). The sum of the luminance values of the rectangular block BR2 overlapped by the black rectangular area BR4 is then ΣBr, and the sum overlapped by the white rectangular area BR3 is ΣWa. ΣBr is the product of the number of pixels with luminance Br and the value Br, so here ΣBr = 0. Likewise, ΣWa is the product of the number of pixels with luminance Wa and the value Wa = 255. The feature amount h_t(x) = ΣWa - ΣBr is therefore much larger than 0.

Applying the division pattern of FIG. 7(a) to the object S' shown in FIG. 5(a) instead gives the following. As shown in FIG. 7(f), the sum of the luminance values of the rectangular block BR2 overlapped by the white rectangular area BR3 in the left half is, for example, Σ(Wa + Br). Similarly, the sum overlapped by the black rectangular area BR4 in the right half is Σ(Br + Wa). Because the rectangular block BR2 in FIG. 7(f) has left-right symmetric luminance values, the feature amount is h_t(x) = 0.
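The two worked cases above (a large value for the FIG. 7(b) pattern, zero for the FIG. 7(a) pattern) can be reproduced with a small sketch. The helper names `region_sum` and `haar_feature`, the 4x4 block, and the half-open rectangle convention `(x0, y0, x1, y1)` are assumptions made for illustration; the placement of white below and black above for the FIG. 7(b) pattern is inferred from the worked example rather than stated explicitly.

```python
def region_sum(img, x0, y0, x1, y1):
    """Sum luminance over columns x0..x1-1 and rows y0..y1-1."""
    return sum(sum(row[x0:x1]) for row in img[y0:y1])


def haar_feature(block, white_rects, black_rects):
    """Luminance feature: (sum under white areas) minus (sum under black areas)."""
    white = sum(region_sum(block, *r) for r in white_rects)
    black = sum(region_sum(block, *r) for r in black_rects)
    return white - black


# Toy block like FIG. 7(e): upper half black (Br = 0), lower half white (Wa = 255).
block = [[0] * 4, [0] * 4, [255] * 4, [255] * 4]

# FIG. 7(b)-style pattern (assumed layout): black area on top, white area below.
h_b = haar_feature(block, white_rects=[(0, 2, 4, 4)], black_rects=[(0, 0, 4, 2)])

# FIG. 7(a)-style pattern: white area on the left half, black area on the right half.
h_a = haar_feature(block, white_rects=[(0, 0, 2, 4)], black_rects=[(2, 0, 4, 4)])
```

Here `h_b` is 8 x 255 = 2040, far above zero, while `h_a` is 0 because both halves contain the same mix of black and white pixels.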

In this way, the feature amount h_t(x) is obtained using the sizes and positions of the white rectangular area BR3 and the black rectangular area BR4 and the division patterns shown in FIGS. 7(a) to 7(d). A luminance evaluation value f(x) for recognizing the object is then obtained from the luminance feature amounts h_t(x).

Using the calculated luminance feature amounts h_t(x), a feature-weighted luminance evaluation value f(x) is calculated as shown in Equation (1). That is, the luminance feature amounts h_t(x) within the block are calculated and combined, taking the weighting coefficients α_t into account, into the luminance evaluation value f(x), which serves as an evaluation function.

The evaluation function comprises the luminance feature amounts h_t(x) and the weighting coefficients α_t. The weighting coefficient α_t for each luminance feature amount h_t(x) is obtained in advance by learning: learning data is collected for the object to be recognized and used for training, which yields the weighting coefficient α_t for each luminance feature amount h_t(x).
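The body of Equation (1) does not appear in this text. Assuming the plain weighted sum that the description implies (each luminance feature amount h_t(x) scaled by its learned coefficient α_t, with t indexing the features and T their total number, as the description defines those symbols), it would read as below. This is a reconstruction under that assumption, not a quotation of the original formula.

```latex
f(x) = \sum_{t=1}^{T} \alpha_t \, h_t(x) \tag{1}
```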

The object feature amount calculation unit 23a also obtains parallax feature amounts. To do so, it calculates a parallax average value d and a parallax standard deviation value σd.
The parallax average value d is obtained, for example, as follows. As shown in FIG. 7(e), let di be the parallax value of each pixel within the area enclosed by the white rectangular area BR3 and the area enclosed by the black rectangular area BR4, and let N be the total number of pixels in the white rectangular area BR3 and the black rectangular area BR4. The parallax average value d is then
d = Σdi / N.

The parallax standard deviation value σd is calculated by the usual statistical formula.

The parallax average value d and the parallax standard deviation value σd are the parallax feature amounts of the object.
The object determination processing unit 23b determines an object from the weighting coefficients stored in the object recognition dictionary memory unit 23c and the luminance feature amounts and parallax feature amounts obtained by the object feature amount calculation unit 23a.
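The parallax feature amounts d and σd described above can be sketched as follows. The function name `parallax_features` and the sample values are hypothetical. Since the original formula for σd is not reproduced in this text, the population form (dividing by N rather than N - 1) is an assumption, chosen to match the N used for the average.

```python
import math


def parallax_features(disparities):
    """Return (d, sigma_d) for the parallax values di of the pixels under the pattern.

    d       = sum(di) / N
    sigma_d = sqrt(sum((di - d)^2) / N)   # population form, assumed
    """
    n = len(disparities)
    if n == 0:
        raise ValueError("at least one parallax value is required")
    d = sum(disparities) / n
    sigma_d = math.sqrt(sum((di - d) ** 2 for di in disparities) / n)
    return d, sigma_d


# Toy parallax values for the N = 8 pixels covered by the white and black areas.
d, sigma_d = parallax_features([2, 4, 4, 4, 5, 5, 7, 9])
```

A compact object at a single distance gives a small σd, whereas a block straddling foreground and background gives a large one, which is presumably why the spread is used alongside the mean.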

To determine the object, the object determination processing unit 23b obtains an evaluation value from the evaluation function F(x) shown below, using the luminance feature amounts h_t(x), the parallax feature amounts d and σd, and the weighting coefficients.

Here, α_d and α_σd are the weighting coefficients of the parallax feature amounts.
The object recognition dictionary memory unit (recording medium) 23c stores the weighting coefficients α_t, α_d, and α_σd for the luminance feature amounts h_t(x) and the parallax feature amounts d and σd, obtained in advance by learning in order to determine the object.
The object recognition dictionary memory unit (recording medium) 23c also stores, for each rectangular block shown in FIGS. 7(a) to 7(d), the coordinates of the upper-left corner of the block, the width and height of the block (the size of the rectangle), the coordinates of the upper-left corner of the white area (or black area), the sizes of the white and black areas, the threshold for the evaluation value f_t(x), the feature weighting coefficients α_t, and so on.
The symbol t is the index of a luminance feature amount, and the symbol T is the total number of luminance feature amounts.
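The body of the evaluation function F(x) is likewise missing from this text. The surrounding definitions name one weighting coefficient per luminance feature amount (α_t), one for the parallax average value (α_d), and one for the parallax standard deviation (α_σd). Assuming a linear combination of those weighted terms, the function would take the form below. Both the linear form and the equation number, which follows the later reference to "Equation (4)", are reconstructions and should be read as an assumption rather than the original expression.

```latex
F(x) = \sum_{t=1}^{T} \alpha_t \, h_t(x) + \alpha_d \, d + \alpha_{\sigma d} \, \sigma_d \tag{4}
```

Under this reading, the first term is the luminance evaluation value f(x), and the two remaining terms add the contribution of the parallax feature amounts.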

FIG. 8 shows an example of a learning object: (a) shows a luminance image G4 of the learning object, and (b) shows a parallax image G5 of the learning object.
The luminance image G4 of the learning object is divided into regions using coarse and fine division patterns, the weighting coefficients α_t for the luminance feature amounts h_t(x) of the object in the luminance image G4 are obtained, and they are stored in the object recognition dictionary memory unit 23c.
The coordinates (xi, yi) of the pixels used to calculate the luminance feature amount h_t(x) are stored, for example, with the upper-left corner as the origin (0, 0), and the luminance feature amount h_t(x) is calculated from these pixel coordinates (xi, yi) and the division pattern.

Similarly, weighting coefficients for the parallax average value d and the parallax standard deviation value σd of the pixels corresponding to the patterns of the luminance image G4 are obtained from the parallax image G5 and stored in the object recognition dictionary memory unit 23c.
The weighting coefficients for the parallax feature amounts are likewise stored in the object recognition dictionary memory unit 23c, for example as a table such as Table 1, for each object to be recognized.

For example, the object recognition dictionary memory unit 23c stores, as the parallax feature amounts and weighting coefficients of a given object, a table such as: parallax average value d1 = 100 (m), weighting coefficient α_d1 = 0.1, parallax standard deviation value σd1 = 1/10, weighting coefficient α_σd1 = 0.01; parallax average value d2 = 80 (m), weighting coefficient α_d2 = 0.2, parallax standard deviation value σd2 = 1/20, weighting coefficient α_σd2 = 0.02; ...; parallax average value di = 1 (m); and so on.

Object recognition is performed using an object classifier with the layered structure shown in FIG. 9. Each layer has the evaluation function F(x) of Equation (4). If the value of F(x) is smaller than a preset threshold, the rectangular block BR2 is judged not to be an object and its evaluation is stopped.
The evaluation value F(x) is calculated at each layer (11 through n1, where n is a positive integer). A rectangular block BR1 that has not been judged a non-object by the last layer n1 is judged to be an object candidate area.

For example, the object feature amount calculation unit 23a first calculates, at layer 11, the luminance evaluation value f(x) using the luminance feature amounts h_t(x) and weighting coefficients α_t needed for the evaluation value F(x). It then obtains the parallax average value di and the standard deviation σdi from the parallax values of the pixels used to obtain the luminance feature amounts h_t(x).

The evaluation function F(x) is then formed using the weighting coefficients.
For example, at layer 11, to judge roughly whether vehicle-like features are present, the luminance evaluation value f(x) is calculated using the patterns shown in FIGS. 7(a) to 7(d). The parallax average value di and the standard deviation σdi are then obtained from the parallax values of the pixels used to obtain the luminance feature amounts h_t(x). The evaluation function F(x) is formed using the weighting coefficients, and the resulting evaluation value is compared with the threshold of layer 11.

With the division pattern of FIG. 7(a), the luminance evaluation value for the object S' shown in FIG. 5(a) is f(x) = 0. The evaluation function F(x) is then formed using the parallax feature amounts, but the resulting evaluation value is smaller than the threshold. At layer 11, therefore, the object S' is judged to be a non-object block under the division pattern of FIG. 7(a), and that processing is stopped.

Processing then moves on to the division pattern of FIG. 7(b). In this case, the luminance evaluation value f(x) for the object S' shown in FIG. 5(a) is larger than the threshold. The evaluation function F(x) is then formed using the parallax feature amounts, and if F(x) is larger than the threshold, processing proceeds to layer 21.

Layer 21 provides finer division patterns, and more of them than layer 11. Using these division patterns, an evaluation value F(x) is obtained for the object S' shown in FIG. 5(a). If this evaluation value F(x) exceeds the threshold, processing proceeds to the next layer 31. This processing is repeated, and a rectangular block BR2 that has not been judged a non-object by the last layer n1 is judged to be an object. FIG. 10 shows an example of an object determined in this way.
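The layer-by-layer rejection described above can be sketched as a simple cascade. Everything named here is hypothetical: the functions `stage_score` and `run_cascade`, the dictionary keys, the thresholds, and the numeric values are illustrative only. The score uses an assumed linear combination of weighted luminance and parallax feature amounts, because the original expression for F(x) is not reproduced in this text.

```python
def stage_score(stage, feats):
    """Assumed linear F(x): weighted luminance features plus weighted parallax features."""
    luminance = sum(a * h for a, h in zip(stage["alpha_t"], feats["h"]))
    return luminance + stage["alpha_d"] * feats["d"] + stage["alpha_sigma"] * feats["sigma_d"]


def run_cascade(stages, feats_per_stage):
    """Return (is_object, layers_evaluated); stop at the first layer below its threshold."""
    for i, (stage, feats) in enumerate(zip(stages, feats_per_stage), start=1):
        if stage_score(stage, feats) < stage["threshold"]:
            return False, i  # judged a non-object block; later layers are skipped
    return True, len(stages)


# Two toy layers: a coarse one with a single pattern, a finer one with two patterns.
stages = [
    {"alpha_t": [1.0], "alpha_d": 0.1, "alpha_sigma": 0.01, "threshold": 100.0},
    {"alpha_t": [0.5, 0.5], "alpha_d": 0.1, "alpha_sigma": 0.01, "threshold": 50.0},
]

# Features of a vehicle-like block and of a flat background block (illustrative values).
vehicle = [
    {"h": [2040.0], "d": 10.0, "sigma_d": 1.0},
    {"h": [100.0, 60.0], "d": 10.0, "sigma_d": 1.0},
]
background = [
    {"h": [0.0], "d": 10.0, "sigma_d": 1.0},
    {"h": [100.0, 60.0], "d": 10.0, "sigma_d": 1.0},
]

vehicle_result = run_cascade(stages, vehicle)
background_result = run_cascade(stages, background)
```

The early exit is the point of the layered design: most blocks in a road scene are background, so rejecting them at the cheap first layer keeps the finer, more numerous patterns for the few promising blocks.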

Next, the operation of the object recognition device 2 is outlined with reference to the flowchart shown in FIG. 11.
First, the CPU 7 executes a stereo image input step in which the stereo image signals obtained by the stereo camera are sequentially input (S.1).

Next, the CPU 7 executes a luminance image construction step that constructs luminance image frames by sequentially storing and holding, frame by frame, at least one of the stereo images obtained in the stereo image input step as a luminance image (S.2).

Next, the CPU 7 executes a parallax image construction step that constructs parallax image frames by calculating parallax from the stereo images output in the stereo image input step in synchronization with the output of the luminance image frames (S.3).

Next, the CPU 7 executes an object recognition step (S.4) using the luminance image frame obtained in the luminance image construction step (S.2) and the parallax image frame obtained in the parallax image construction step (S.3). The object recognition step (S.4) consists of an object feature amount calculation step (S.41) that calculates the luminance feature amount from the luminance image frame and calculates the parallax feature amount from the parallax values of the pixels used to obtain the luminance feature amount, an input step (S.41) that inputs the learned weighting coefficients from the object recognition dictionary memory unit 23c, and an object determination process (S.42) that determines the object from the luminance feature amount, the parallax feature amount, and the weighting coefficients.
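The order of steps S.1 to S.42 can be summarized in a short sketch. The function `recognize_frame` and its callable parameters are hypothetical placeholders for the units described in the text; the toy lambdas exist only so the example runs and do not perform real image processing.

```python
def recognize_frame(stereo_pair, build_luminance, build_parallax,
                    compute_features, load_weights, decide):
    """Run one frame through steps S.1 to S.42 in the order given in the text."""
    left, right = stereo_pair                          # S.1: stereo image input
    luminance = build_luminance(left, right)           # S.2: luminance image frame
    parallax = build_parallax(left, right)             # S.3: parallax image frame
    features = compute_features(luminance, parallax)   # S.41: luminance and parallax features
    weights = load_weights()                           # S.41: learned weights from the dictionary
    return decide(features, weights)                   # S.42: object determination


# Toy stand-ins that record the call order (list.append returns None, so `or` yields the value).
calls = []
result = recognize_frame(
    ("L", "R"),
    build_luminance=lambda left, right: calls.append("S.2") or left,
    build_parallax=lambda left, right: calls.append("S.3") or (left, right),
    compute_features=lambda lum, par: calls.append("S.41 features") or {"lum": lum, "par": par},
    load_weights=lambda: calls.append("S.41 weights") or {"alpha": 1.0},
    decide=lambda f, w: calls.append("S.42") or (f["lum"] == "L" and w["alpha"] > 0),
)
```

Passing the stages in as callables keeps the sketch independent of any particular camera or matching library, which the source does not specify.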

When an object has been determined by the object determination process (S.42), the CPU 7 outputs the object recognition result (S.5) for use in subsequent processing, for example object tracking.
The object recognition device 2 is used, for example, as part of a moving body control system that controls equipment mounted on a vehicle serving as the moving body, such as a steering device or an alarm device, in order to avoid a collision with the recognized object.

20 … Stereo image input unit
21 … Luminance image construction unit
22 … Parallax image construction unit
23 … Object recognition unit
23a … Object feature amount calculation unit
23b … Object determination processing unit

JP 2008-146549 A
JP 2012-226689 A

Claims

An object recognition device comprising: a stereo image input unit to which stereo image signals obtained by a stereo camera are sequentially input; a luminance image construction unit that constructs luminance image frames by sequentially storing and holding, frame by frame, at least one of the stereo images output from the stereo image input unit as a luminance image; a parallax image construction unit that constructs parallax image frames by calculating parallax using the stereo images output in synchronization with the output from the stereo image input unit to the luminance image construction unit; and an object recognition unit that recognizes an object using the luminance image frame and the parallax image frame, wherein the object recognition unit includes an object feature amount calculation unit that calculates a luminance feature amount from the luminance image frame and calculates a parallax feature amount from the parallax values of the pixels used to calculate the luminance feature amount, and an object determination processing unit that determines the object by calculating an evaluation value for determining the object from stored data relating to the luminance feature amount and the parallax feature amount, the luminance feature amount, and the parallax feature amount,
the object feature amount calculation unit calculates a parallax average value and a parallax standard deviation as the parallax feature amount, and the stored data are a weighting coefficient for weighting the luminance feature amount to obtain the evaluation value and a weighting coefficient for weighting the parallax feature amount to obtain the evaluation value, and
the arithmetic expression for obtaining the evaluation value is an evaluation function F(x),
where F(x) is the evaluation function for obtaining the evaluation value, h_t(x) is a luminance feature amount, α_t is a coefficient that weights the luminance feature amount h_t(x), α_d is a coefficient that weights the parallax average value, and α_σd is a coefficient that weights the parallax standard deviation.

A moving body control system that controls equipment mounted on a moving body using the object recognition device according to claim 1.