JP7616973B2

JP7616973B2 - Image processing device and image processing method

Info

Publication number: JP7616973B2
Application number: JP2021148558A
Authority: JP
Inventors: 茂松尾; 健遠藤; 雅士高田
Original assignee: Hitachi Astemo Ltd
Current assignee: Astemo Ltd
Priority date: 2021-09-13
Filing date: 2021-09-13
Publication date: 2025-01-17
Anticipated expiration: 2041-09-13
Also published as: WO2023037575A1; JP2023041286A; DE112022003490T5

Description

本発明は、複数のカメラを用いて車外の障害物を認識する画像処理装置、および、画像処理方法に関する。 The present invention relates to an image processing device and an image processing method that uses multiple cameras to recognize obstacles outside the vehicle.

車両の走行安全性を向上させるために、車載の前方監視センサで車両前方の障害物を検知し、車両がその障害物に衝突する可能性がある場合は、ドライバへ警報したり、自動ブレーキをかけたりするシステムがある。 To improve vehicle driving safety, there are systems that use on-board forward monitoring sensors to detect obstacles in front of the vehicle, and if there is a risk of the vehicle colliding with the obstacle, they will warn the driver and automatically apply the brakes.

そのようなシステムで使用される前方監視センサとしては、ミリ波レーダーやレーザレーダの他、カメラがある。カメラの種類としては、単眼カメラと、複数のカメラを使用したステレオカメラがある。ステレオカメラは、所定の間隔の２つのカメラで撮影された重複領域の視差を利用して、撮影された物体までの距離を計測することができる。このため、前方の物体との衝突危険度を的確に把握することができる。 Sensors used in such systems for monitoring the forward movement include millimeter-wave radar, laser radar, and cameras. Camera types include monocular cameras and stereo cameras that use multiple cameras. Stereo cameras can measure the distance to an object photographed by using the parallax of the overlapping area photographed by two cameras spaced at a specified distance. This makes it possible to accurately grasp the risk of collision with an object ahead.

ステレオカメラは、２つのカメラで撮影された画像の視差を求めて、その視差を距離に変換する。計測距離が遠方になるにつれ、視差が小さくなるという特徴がある。そして、視差の演算方式としてブロックマッチングによって左右画像の対応付けをチェックする方式が知られている。 A stereo camera calculates the disparity between images captured by two cameras and converts the disparity into distance. A characteristic of stereo cameras is that the disparity becomes smaller as the measurement distance becomes farther. A known method of calculating disparity is to check the correspondence between the left and right images using block matching.

しかし、ブロックマッチング方式の視差演算方式には、テクスチャが少ない画像では有効な視差が減少し視差の精度が低下するという課題があるため、最近では、ブロックマッチング方式を代替する方式として、ニューラルネットワークによって視差を演算する方式が開発されている。 However, the block matching method of disparity calculation has the problem that the effective disparity is reduced in images with little texture, resulting in a decrease in disparity accuracy. Recently, therefore, a method of calculating disparity using a neural network has been developed as an alternative to the block matching method.

例えば、特許文献１の要約書では、課題として「距離画像中の誤マッチング領域における視差値を補正可能な技術を提供する。」と記載されており、解決手段として「車両１０に搭載される距離画像生成装置１１０は、ステレオカメラ１２２で撮影された左右の撮像画像を基準画像及び対比画像として用いて、基準画像に存在する物体までの距離を表す距離画像を生成する距離画像生成部１１１と、ニューラルネットワークを用いて、距離画像と比較するための対照画像を生成する対照画像生成部１１２と、基準画像において画像の特徴量が予め定められた閾値以下となる領域に対応する距離画像における補正領域を検出する補正領域検出部１１３と、補正領域の各画素の持つ距離情報を、対照画像における補正領域の対応部分の情報に応じて補正する補正部１１４と、を備える。」と記載されている。すなわち、特許文献１では、画像の一部（走行路面など特徴量が少ない部分）にニューラルネットワーク方式の視差演算を適用する方法が提案されている。 For example, the abstract of Patent Document 1 states that the problem is to "provide a technology capable of correcting disparity values in mismatched regions in a distance image," and states that the solution is to "provide a distance image generating device 110 mounted on a vehicle 10, which uses left and right captured images taken by a stereo camera 122 as a reference image and a comparison image to generate a distance image showing the distance to an object present in the reference image, a comparison image generating unit 112 which generates a comparison image to be compared with the distance image using a neural network, a correction area detecting unit 113 which detects a correction area in the distance image corresponding to an area in the reference image where the image feature amount is equal to or less than a predetermined threshold, and a correction unit 114 which corrects the distance information of each pixel in the correction area according to information on the corresponding part of the correction area in the comparison image." That is, Patent Document 1 proposes a method of applying a parallax calculation using a neural network to a part of an image (a part with few features such as the road surface).

特開２０２０－１４３９５７号公報JP 2020-143957 A

特許文献１のニューラルネットワーク方式は、ブロックマッチング方式の課題を改善し、テクスチャの少ない画像からも高精度の有効視差を演算できる反面、演算量が膨大になるという課題がある。 The neural network method of Patent Document 1 overcomes the problems with the block matching method and can calculate effective parallax with high accuracy even from images with little texture, but it has the problem of requiring a huge amount of calculations.

本発明は、複数のカメラを使用した画像処理装置において、ニューラルネットワークによる視差を演算量を減らして演算し、その視差をベースにしてブロックマッチングによって画像全体の有効視差を増やし高精度な視差画像を入力画像全域で生成することを目的とする。 The present invention aims to use an image processing device that uses multiple cameras to calculate parallax using a neural network with a reduced amount of calculation, and to use that parallax as a base to increase the effective parallax of the entire image by block matching, thereby generating a highly accurate parallax image over the entire input image.

上記課題を解決するため本発明は、２つの入力画像を縮小して２つの縮小画像を生成する画像縮小部と、ニューラルネットワーク処理によって前記２つの縮小画像の視差を求め、縮小視差画像を生成する第１視差画像生成部と、一方の前記入力画像の一部の領域がマッチングする領域を他方の前記入力画像の中から求めることで、前記２つの入力画像の視差を求め視差画像を生成する第２視差画像生成部を有し、前記第２視差画像生成部は、前記縮小視差画像の各画素が対応する前記入力画像の各領域に対して、当該縮小視差画像の各画素の視差値をもとに、前記マッチングする領域を求めるときの探索範囲を設定する。 To solve the above problem, the present invention includes an image reduction unit that reduces two input images to generate two reduced images, a first parallax image generation unit that calculates the parallax between the two reduced images by neural network processing and generates a reduced parallax image, and a second parallax image generation unit that calculates the parallax between the two input images and generates a parallax image by calculating an area in one of the input images that matches a partial area in the other input image, and the second parallax image generation unit sets a search range when calculating the matching area for each area of the input images to which each pixel of the reduced parallax image corresponds, based on the parallax value of each pixel of the reduced parallax image.

本発明によれば、縮小画像からニューラルネットワークによって有効視差を演算し、前記有効視差と対応する位置の縮小前の画像の視差をその有効視差と同値または近似値となるようにブロックマッチング方式で演算するため、ニューラルネットワークの演算量を削減しつつ、縮小前の画像の有効な視差を増やすことができる。 According to the present invention, the effective parallax is calculated from the reduced image by a neural network, and the parallax of the pre-reduced image at a position corresponding to the effective parallax is calculated by a block matching method so that it is the same as or close to the effective parallax, so that the amount of calculations by the neural network can be reduced while increasing the effective parallax of the pre-reduced image.

実施例１の画像処理装置の機能ブロック図Functional block diagram of an image processing apparatus according to a first embodiment 実施例１の画像処理装置の画像処理のフローチャート1 is a flowchart of image processing in the image processing apparatus according to the first embodiment. ブロックマッチング方式の視差演算方法を説明する図A diagram explaining a disparity calculation method using the block matching method. 実施例１でのブロックマッチングの探索範囲の決定方法の一例を説明する図FIG. 1 is a diagram for explaining an example of a method for determining a search range for block matching in the first embodiment; 実施例１でのブロックマッチングの探索範囲の決定方法の他例を説明する図FIG. 11 is a diagram for explaining another example of a method for determining a search range for block matching in the first embodiment; 特徴量が大きくなる画像の一例An example of an image with large feature values 実施例２の画像処理装置の機能ブロック図Functional block diagram of an image processing apparatus according to a second embodiment 実施例２の画像処理装置の路面高さ推定処理のフローチャート11 is a flowchart of a road surface height estimation process of the image processing device according to the second embodiment. 実施例３の画像処理装置の機能ブロック図Functional block diagram of an image processing apparatus according to a third embodiment 実施例４の第１視差画像生成処理のフローチャートFlowchart of first parallax image generating process according to the fourth embodiment 実施例５の第２視差画像生成処理のフローチャートFlowchart of second parallax image generating process according to the fifth embodiment

以下、図面等を用いて、本発明の実施形態について説明する。なお、以下の説明は本発明の内容の具体例を示すものであり、本発明がこれらの説明に限定されるものではなく、本明細書に開示される技術的思想の範囲内において当業者による様々な変更および修正が可能である。また、本発明を説明するための全図において、同一の機能を有するものは、同一の符号を付け、その繰り返しの説明は省略する場合がある。 The following describes an embodiment of the present invention with reference to the drawings. Note that the following description shows specific examples of the contents of the present invention, and the present invention is not limited to these descriptions. Various changes and modifications are possible by those skilled in the art within the scope of the technical ideas disclosed in this specification. In addition, in all drawings used to explain the present invention, parts having the same functions are given the same reference numerals, and repeated explanations may be omitted.

図１は、本発明の実施例１に係る画像処理装置１００の機能ブロック図である。ここに示すように、画像処理装置１００は、右カメラ１Ｒ、左カメラ１Ｌ、画像縮小部２、第１視差画像生成部３、探索範囲決定部４、第２視差画像生成部５、認識処理部６を備えている。なお、右カメラ１Ｒと左カメラ１Ｌは、必ずしもステレオカメラの左右カメラである必要は無く、左右に配置した一対の単眼カメラの夫々を右カメラ１Ｒや左カメラ１Ｌとして利用しても良い。また、画像縮小部２から認識処理部６は、例えば、ＣＰＵ等の演算装置、半導体メモリ等の記憶装置、および、通信装置などのハードウェアを備えたコンピュータにおいて、演算装置が記憶装置内の所定プログラムを実行して実現した機能部であるが、以下では、このようなコンピュータ分野の周知技術を適宜省略しながら、各部の詳細を説明する。 1 is a functional block diagram of an image processing device 100 according to a first embodiment of the present invention. As shown in the figure, the image processing device 100 includes a right camera 1R, a left camera 1L, an image reduction unit 2, a first parallax image generation unit 3, a search range determination unit 4, a second parallax image generation unit 5, and a recognition processing unit 6. The right camera 1R and the left camera 1L do not necessarily need to be the left and right cameras of a stereo camera, and a pair of monocular cameras arranged on the left and right may be used as the right camera 1R and the left camera 1L, respectively. In addition, the image reduction unit 2 to the recognition processing unit 6 are functional units realized by a calculation unit such as a CPU, a storage device such as a semiconductor memory, and a computer equipped with hardware such as a communication device, by the calculation unit executing a predetermined program in the storage device. In the following, the details of each unit will be described while appropriately omitting such well-known technologies in the computer field.

図１に示すように、右カメラ１Ｒと左カメラ１Ｌで撮影された画像はそれぞれ右画像Ｐ_Ｒ、左画像Ｐ_Ｌとして画像縮小部２に転送される。画像縮小部２は入力された左右画像を縮小し、右縮小画像Ｓ_Ｒと左縮小画像Ｓ_Ｌを生成する。画像を縮小する理由は第１視差画像生成部３での演算量を削減するためであり、システムで許容される第１視差画像生成部３の処理時間に応じて縮小率は決定される。第１視差画像生成部３は、右縮小画像Ｓ_Ｒと左縮小画像Ｓ_Ｌを入力としてニューラルネットワーク処理により第１視差画像Ｄ１（縮小視差画像）と、特徴マップＭを生成する。 As shown in Fig. 1, images captured by the right camera 1R and the left camera 1L are transferred to the image reducing unit 2 as a right image P _R and a left image P _L , respectively. The image reducing unit 2 reduces the input left and right images to generate a right reduced image S _R and a left reduced image S _L. The reason for reducing the images is to reduce the amount of calculation in the first parallax image generating unit 3, and the reduction ratio is determined according to the processing time allowed by the system for the first parallax image generating unit 3. The first parallax image generating unit 3 receives the right reduced image S _R and the left reduced image S _L as input, and generates a first parallax image D1 (reduced parallax image) and a feature map M by neural network processing.

探索範囲決定部４は、第１視差画像Ｄ１と特徴マップＭを使用して、第２視差画像生成部５の探索範囲を決定する。第２視差画像生成部５は、ブロックマッチング方式によって右画像Ｐ_Ｒと左画像Ｐ_Ｌの２つの画像から第２視差画像Ｄ２を生成するものであり、ブロックマッチングの探索範囲は探索範囲決定部４に従う。第２視差画像Ｄ２は、縮小される前の画像から視差が生成されるため、第１視差画像Ｄ１よりも高密度な視差画像である。認識処理部６は、第２視差画像Ｄ２と右画像Ｐ_Ｒを使用して前方車両、歩行者、障害物などを認識処理する。そして、認識処理部６の認識結果が、車両の駆動系、制動系、操舵系等を制御するＥＣＵ（図示せず）に入力されることで、必要に応じて、ドライバへ警報したり、自動ブレーキをかけたりすることができる。 The search range determination unit 4 uses the first parallax image D1 and the feature map M to determine the search range of the second parallax image generation unit 5. The second parallax image generation unit 5 generates the second parallax image D2 from two images, the right image P _R and the left image P _L, by a block matching method, and the search range of the block matching follows the search range determination unit 4. The second parallax image D2 is a parallax image with a higher density than the first parallax image D1 because the parallax is generated from the image before being reduced. The recognition processing unit 6 uses the second parallax image D2 and the right image P _R to recognize a vehicle, a pedestrian, an obstacle, and the like ahead. Then, the recognition result of the recognition processing unit 6 is input to an ECU (not shown) that controls the drive system, braking system, steering system, and the like of the vehicle, so that an alarm can be issued to the driver or automatic braking can be applied as necessary.

図２は、本実施例の画像処理装置１００の画像処理の概要を説明するフローチャートである。 Figure 2 is a flowchart outlining the image processing performed by the image processing device 100 of this embodiment.

まず、ステップＳ１０では、画像縮小部２は、左右カメラから入力された右画像Ｐ_Ｒ、左画像Ｐ_Ｌを縮小し、右縮小画像Ｓ_Ｒと左縮小画像Ｓ_Ｌを生成する。 First, in step S10, the image reducing unit 2 reduces the right image P _R and the left image P _L input from the left and right cameras to generate a right reduced image S _R and a left reduced image S _L.

次に、ステップＳ１１では、第１視差画像生成部３が、右縮小画像Ｓ_Ｒと左縮小画像Ｓ_Ｌを入力としてニューラルネットワーク処理により第１視差画像Ｄ１と、特徴マップＭを生成する。 Next, in step S11, the first parallax image generating unit 3 generates a first parallax image D1 and a feature map M by neural network processing using the right reduced image S _{4 R} and the left reduced image S 4 _L as input.

ステップＳ１２では、探索範囲決定部４が、第１視差画像Ｄ１を使用して、または、第１視差画像Ｄ１と特徴マップＭを使用して、第２視差画像生成部５の探索範囲を決定する。なお、第１視差画像Ｄ１を使用する方法の詳細については図４で説明し、第１視差画像Ｄ１と特徴マップＭを使用する方法の詳細については図５で説明する。 In step S12, the search range determination unit 4 determines the search range of the second parallax image generation unit 5 using the first parallax image D1 or using the first parallax image D1 and the feature map M. Details of the method of using the first parallax image D1 are described in FIG. 4, and details of the method of using the first parallax image D1 and the feature map M are described in FIG. 5.

ステップＳ１３では、第２視差画像生成部５が、探索範囲決定部４が決定した探索範囲によるブロックマッチング方式によって、右画像Ｐ_Ｒと左画像Ｐ_Ｌの２つの画像から第２視差画像Ｄ２を生成する。なお、ブロックマッチング方式の詳細については図３で説明する。 In step S13, the second parallax image generating unit 5 generates a second parallax image D2 from the two images, the right image P _R and the left image P _L, by a block matching method using the search range determined by the search range determining unit 4. Details of the block matching method will be described with reference to FIG.

ステップＳ１４では、認識処理部６が、第２視差画像Ｄ２と右画像Ｐ_Ｒを使用して前方車両、歩行者、障害物などを認識処理する。 In step S14, the recognition processing unit 6 uses the second parallax image D2 and the right image _PR to recognize a forward vehicle, a pedestrian, an obstacle, and the like.

図３は、一般的なブロックマッチング方式による視差演算処理を例示したものである。この例では、右カメラ１Ｒが撮影した右画像Ｐ_Ｒを基準画像とし、例えば１６画素×１６画素（サイズはこの例に限るものではない）のような基準ブロック画像Ｐ_Ｂを定義する。そして、左カメラ１Ｌが撮影した左画像Ｐ_Ｌの中で、基準ブロック画像Ｐ_Ｂと同じ縦位置（Ｙ座標）と横位置（Ｘ座標）を基準として、所定の探索幅ｒ（例えば２７２画素）の参照画像Ｐ_ｒｅｆを選択する。その後、基準ブロック画像Ｐ_Ｂと参照画像Ｐ_ｒｅｆの差分を計算する。この差分計算はＳＡＤと呼ばれ、次の式１により計算を行う。 FIG. 3 illustrates a parallax calculation process by a general block matching method. In this example, the right image P _R captured by the right camera 1R is used as a reference image, and a reference block image P _B , such as 16 pixels x 16 pixels (the size is not limited to this example), is defined. Then, in the left image P _L captured by the left camera 1L, a reference image P _ref with a predetermined search width r (for example, 272 pixels) is selected based on the same vertical position (Y coordinate) and horizontal position (X coordinate) as the reference block image P _B. Then, the difference between the reference block image P _B and the reference image P _ref is calculated. This difference calculation is called SAD, and is performed according to the following formula 1.

なお、式１において、Ｉは参照画像Ｐ_ｒｅｆ中の画像ブロック（例：１６×１６画素）、Ｔは基準ブロック画像Ｐ_Ｂ中の画像ブロックであり、ｉ、ｊは画像ブロック内の座標である。１つの視差を演算するために、参照画像Ｐ_ｒｅｆの参照位置を左端から１画素ずつずらしながら探索幅ｒの全ての画像と比較するため、基準ブロック画像Ｐ_Ｂの幅が１６画素であり、探索幅ｒが２７２画素であれば、２５６回のブロック画像の比較演算を行い、最もＳＡＤ値が小さくなる位置を探索する。 In addition, in formula 1, I is an image block (e.g., 16×16 pixels) in the reference image _Pref , T is an image block in the base block image P _B , and i, j are coordinates in the image block. In order to calculate one parallax, the reference position of the reference image _Pref is shifted one pixel at a time from the left end and compared with all images in the search width r. Therefore, if the base block image P _B has a width of 16 pixels and the search width r is 272 pixels, 256 comparison calculations of the block images are performed to search for the position with the smallest SAD value.

図３の、車両の前方窓から見た実際の風景には、路面を走行する前方車両Ｖがある。右カメラ１Ｒで撮影した右画像Ｐ_Ｒと、左カメラ１Ｌで撮影した左画像Ｐ_Ｌがある時、前方車両Ｖの一部Ｖ１は、右画像Ｐ_Ｒでは基準ブロック画像Ｐ_Ｂの位置に撮影され、左画像Ｐ_Ｌでは参照ブロック画像Ｐ_Ｂ’の位置に撮影される。この結果、基準ブロック画像Ｐ_Ｂと参照ブロック画像Ｐ_Ｂ’のＳＡＤ値は視差ｄの位置で最小になる。この視差ｄは、前方車両Ｖが画像処理装置１００に近い場合は大きい値となり、遠いものは小さい値となる。このように求めた視差を、画像全体で求める。この視差ｄを用いて、三角測量の原理で画像処理装置１００までの距離を測定することが出来る。視差ｄから距離Ｚは次の式２で求められる。 In the actual scene seen from the front window of the vehicle in FIG. 3, there is a vehicle V ahead running on the road. When there is a right image P _R captured by the right camera 1R and a left image P _L captured by the left camera 1L, a part V1 of the vehicle ahead V is captured at the position of the base block image P _B in the right image P _R , and at the position of the reference block image P _B ' in the left image P _L. As a result, the SAD value of the base block image P _B and the reference block image P _B ' is minimized at the position of the parallax d. This parallax d is a large value when the vehicle ahead V is close to the image processing device 100, and a small value when it is far away. The parallax calculated in this way is calculated for the entire image. Using this parallax d, the distance to the image processing device 100 can be measured according to the principle of triangulation. From the parallax d, the distance Z can be calculated by the following formula 2.

但し、式２において、ｆは左右カメラの焦点距離、Ｂは右カメラ１Ｒと左カメラ１Ｌの距離（基線長）である。 In Equation 2, f is the focal length of the left and right cameras, and B is the distance (baseline length) between the right camera 1R and the left camera 1L.

＜探索範囲決定部４と第２視差画像生成部５の詳細動作の一例＞
図４は、探索範囲決定部４と第２視差画像生成部５の詳細動作の一例を示したものであり、図２のステップＳ１２にて、第１視差画像Ｄ１に基づいてブロックマッチングの探索範囲を決定するフローに相当する。 <Example of detailed operations of the search range determination unit 4 and the second parallax image generation unit 5>
FIG. 4 shows an example of detailed operations of the search range determination unit 4 and the second parallax image generation unit 5, and corresponds to the flow of determining the search range for block matching based on the first parallax image D1 in step S12 in FIG. 2.

図４に示すように、右画像Ｐ_Ｒと左画像Ｐ_Ｌには、横方向の左から右に向けてＸ，縦方向の上から下に向けてＹとして座標が付けられている。右画像Ｐ_Ｒと左画像Ｐ_Ｌは画像縮小部２により縮小され右縮小画像Ｓ_Ｒと左縮小画像Ｓ_Ｌとなる。ここでは例として縮小の倍率を１／４としている。次に第１視差画像生成部３が、右縮小画像Ｓ_Ｒと左縮小画像Ｓ_Ｌを入力としてニューラルネットワーク処理により第１視差画像Ｄ１を生成する。この例では画像の縮小の倍率が１／４であるため、第１視差画像Ｄ１の視差値Ａの画素は右画像Ｐ_ＲのＸ座標＝０とＹ座標＝０、Ｘ座標＝０とＹ座標＝１、Ｘ座標＝１とＹ座標＝０、Ｘ座標＝１とＹ座標＝１の４画素の位置に対応する画素である。 As shown in FIG. 4, the right image P _R and the left image P _L are assigned coordinates X from left to right in the horizontal direction and Y from top to bottom in the vertical direction. The right image P _R and the left image P _L are reduced by the image reduction unit 2 to become a right reduced image S _R and a left reduced image S _L. Here, the reduction ratio is set to 1/4 as an example. Next, the first parallax image generation unit 3 generates a first parallax image D1 by neural network processing using the right reduced image S _R and the left reduced image S _L as inputs. In this example, since the reduction ratio of the image is 1/4, the pixel of the parallax value A of the first parallax image D1 corresponds to the positions of four pixels of the right image P _R, namely, X coordinate = 0 and Y coordinate = 0, X coordinate = 0 and Y coordinate = 1, X coordinate = 1 and Y coordinate = 0, and X coordinate = 1 and Y coordinate = 1.

探索範囲決定部４は、第２視差画像生成部５のブロックマッチングの範囲を第１視差画像Ｄ１によって決定する。例えば、第２視差画像生成部５が右画像Ｐ_ＲのＸ座標＝０とＹ座標＝０、Ｘ座標＝０とＹ座標＝１、Ｘ座標＝１とＹ座標＝０、Ｘ座標＝１とＹ座標＝１の４画素の視差を生成するときは、それらに対応する第１視差画像Ｄ１のＸ座標＝０とＹ座標＝０の視差値であるＡを基準にして探索範囲を決定する。具体的には探索範囲の開始位置ｓは視差値Ａから減算器４１で固定値を減算した値、探索範囲の終了位置ｅは視差値Ａに加算器４２で固定値を加算した値とする。固定値は例えば１０とする。これにより視差値Ａの位置に対応する第２視差画像Ｄ２の４画素（Ａ１，Ａ２，Ａ３，Ａ４）は、ブロックマッチングの探索範囲が有効視差値付近に限定されるため、有効視差値である視差値Ａと近似または同値の視差値となり、有効視差値となる可能性が高い。また、１画素の視差を演算するブロックマッチングの回数は２０回で済む。一般的なブロックマッチング方式（図３）では探索範囲が固定であり例えば２５６回のＳＡＤ演算となるが、本方式ではブロックマッチングの演算量が１／８以下となる。 The search range determination unit 4 determines the range of block matching of the second parallax image generation unit 5 by the first parallax image D1. For example, when the second parallax image generation unit 5 generates parallax of four _pixels of X coordinate=0 and Y coordinate=0, X coordinate=0 and Y coordinate=1, X coordinate=1 and Y coordinate=0, and X coordinate=1 and Y coordinate=1 in the right image PR, the search range is determined based on the parallax value A of X coordinate=0 and Y coordinate=0 in the first parallax image D1 corresponding to the four pixels. Specifically, the start position s of the search range is a value obtained by subtracting a fixed value from the parallax value A by a subtracter 41, and the end position e of the search range is a value obtained by adding a fixed value to the parallax value A by an adder 42. The fixed value is, for example, 10. As a result, the four pixels (A1, A2, A3, A4) in the second parallax image D2 corresponding to the position of the parallax value A have a parallax value that is close to or equal to the parallax value A, which is the effective parallax value, and are therefore highly likely to be effective parallax values, since the search range of the block matching is limited to the vicinity of the effective parallax value. Also, the number of block matching operations required to calculate the parallax of one pixel is only 20. In a general block matching method (FIG. 3), the search range is fixed and, for example, 256 SAD operations are required, but in this method, the amount of calculations for block matching is 1/8 or less.

＜探索範囲決定部４と第２視差画像生成部５の詳細動作の他例＞
図５は、探索範囲決定部４と第２視差画像生成部５の詳細動作の他例を示したものであり、図２のステップＳ１２にて、第１視差画像Ｄ１と特徴マップＭに基づいてブロックマッチングの探索範囲を決定するフローに相当する。以下、図４との相違点を説明する。 <Another Example of Detailed Operation of Search Range Determination Unit 4 and Second Parallax Image Generation Unit 5>
Fig. 5 shows another example of detailed operations of the search range determination unit 4 and the second parallax image generation unit 5, and corresponds to the flow of determining the search range for block matching based on the first parallax image D1 and the feature map M in step S12 in Fig. 2. Differences from Fig. 4 will be described below.

第１視差画像生成部３は、ニューラルネットワーク方式の演算により視差を演算する。ニューラルネットワークでは畳み込み演算の処理が行われ、その結果として特徴マップＭが生成される。画像の中で画素値の変化が大きい部分は特徴量も大きくなる傾向にある。例えば、図６に示すように、画像領域Ｒ１は路面だけの画像のため特徴量が少なく、この領域の距離はほぼ同じであるため視差値も近似値となる。一方、画像領域Ｒ２は路面と前方車両Ｖの一部と交通標識が含まれており、複数の物体が重なっており特徴量が大きくなる。この画像領域Ｒ２の前方車両Ｖ、路面、交通標識はそれぞれ距離が異なるため視差も異なるため、この画像領域Ｒ２内の視差を近似値にならないようにする必要がある。 The first parallax image generator 3 calculates the parallax by neural network calculation. The neural network performs convolution calculation processing, and as a result, a feature map M is generated. Parts of an image where there is a large change in pixel value tend to have large feature values. For example, as shown in FIG. 6, image region R1 is an image of only the road surface, so there are few feature values, and the distance in this region is almost the same, so the parallax value is also an approximate value. On the other hand, image region R2 includes the road surface, part of the forward vehicle V, and traffic signs, and multiple objects overlap, resulting in a large feature value. The forward vehicle V, road surface, and traffic signs in this image region R2 are all at different distances, so the parallax also differs, so it is necessary to prevent the parallax in this image region R2 from becoming an approximate value.

図４では第１視差画像Ｄ１の画素値Ａに対応する第２視差画像Ｄ２の４画素（画素値Ａ１、Ａ２、Ａ３、Ａ４）は、いずれも視差値Ａの近似値である。このため、この４画素の中で遠方と近傍の物体の境界がある場合は、視差値Ａとは近似ではない視差である可能性がある。 In FIG. 4, the four pixels (pixel values A1, A2, A3, and A4) in the second parallax image D2 that correspond to pixel value A in the first parallax image D1 are all approximations of the parallax value A. Therefore, if there is a boundary between a distant object and a nearby object among these four pixels, there is a possibility that the parallax is not an approximation of the parallax value A.

これに対し、図５では、第１視差画像Ｄ１に対する減算器４１、加算器４２と同様の機能を担う、特徴マップＭ用の減算器４３、加算器４４を設けることで、特徴マップＭが示す特徴量が大きい部分においては探索範囲をより広くして、正しい視差を演算できるようにした。具体的には、第２視差画像生成部５が第１視差画像Ｄ１のＡの画素に対応する部分の処理を行っている場合は、特徴マップＭのｉ画素の値を探索範囲の開始位置ｓから減算し、終了位置ｅに加算する。第１視差画像Ｄ１のＢの画素に対応する部分を処理する場合は、特徴マップＭの画素ｖを使用する。このようにすることで、特徴量が大きい部分は探索範囲を拡大することが可能となり、正しい視差が演算可能となる。 5, a subtractor 43 and an adder 44 for the feature map M, which have the same functions as the subtractor 41 and the adder 42 for the first parallax image D1, are provided, so that the search range is made wider in the part where the feature amount indicated by the feature map M is large , and the correct parallax can be calculated. Specifically, when the second parallax image generating unit 5 processes the part corresponding to pixel A of the first parallax image D1, the value of pixel i of the feature map M is subtracted from the start position s of the search range and added to the end position e. When the part corresponding to pixel B of the first parallax image D1 is processed, pixel v of the feature map M is used. In this way, the search range can be expanded in the part where the feature amount is large , and the correct parallax can be calculated.

以上で説明した本実施例の画像処理装置によれば、縮小画像からニューラルネットワークによって有効視差を演算し、前記有効視差と対応する位置の縮小前の画像の視差をその有効視差と同値または近似値となるようにブロックマッチング方式で演算するため、ニューラルネットワークの演算量を削減しつつ、縮小前の画像の有効な視差を増やすことができる。 According to the image processing device of this embodiment described above, the effective parallax is calculated from the reduced image by a neural network, and the parallax of the image before reduction at a position corresponding to the effective parallax is calculated by a block matching method so that it is the same as or close to the effective parallax, so that the effective parallax of the image before reduction can be increased while reducing the amount of calculations of the neural network.

次に、図７と図８を用いて、実施例２の画像処理装置を説明する。なお、実施例１との共通点については重複説明を省略する。 Next, an image processing device according to a second embodiment will be described with reference to Figures 7 and 8. Note that a duplicate description of the points common to the first embodiment will be omitted.

図７は、実施例２に係る画像処理装置１００の機能ブロック図であり、実施例１との相違は、認識処理部６での路面認識時に、第１視差画像Ｄ１を使用する点である。認識処理部６では、前方車両Ｖや障害物を検知する場合、それらが路面上に存在するのかを判定する必要がある。例えば、第１視差画像生成と第２視差画像生成をハードウェアで処理し、認識処理をソフトウエアで処理する場合、路面の認識処理（ソフトウェア処理）を第２視差画像Ｄ２の生成処理（ハードウェア処理）と並列に処理が可能となり、システム全体の処理の高速化を図ることができる。 Figure 7 is a functional block diagram of an image processing device 100 according to a second embodiment. The difference from the first embodiment is that the first parallax image D1 is used when the recognition processing unit 6 recognizes the road surface. When the recognition processing unit 6 detects a forward vehicle V or an obstacle, it is necessary to determine whether they are present on the road surface. For example, when the first parallax image generation and the second parallax image generation are processed by hardware and the recognition processing is processed by software, the road surface recognition processing (software processing) can be processed in parallel with the generation processing (hardware processing) of the second parallax image D2, and the processing speed of the entire system can be increased.

図８は、認識処理部６で路面の高さの認識時に第１視差画像Ｄ１を使用するときのフローチャートを示したものである。まず、第１視差画像Ｄ１を生成したあと（ステップＳ２０）、認識処理部６は第１視差画像Ｄ１で路面の高さの認識処理を行い（ステップＳ２１）、それと並行して第２視差画像生成部５で第２視差画像Ｄ２を生成する（ステップＳ２２）。その後、認識処理部６で第２視差画像Ｄ２を用いて路上の前方車両Ｖや障害物の検知を行う（ステップＳ２３）。この場合、認識処理部６がすべての処理を第２視差画像Ｄ２で処理する場合に比べて、システム全体の処理の高速化を図ることができる。 Figure 8 shows a flowchart when the first parallax image D1 is used by the recognition processing unit 6 to recognize the height of the road surface. First, after generating the first parallax image D1 (step S20), the recognition processing unit 6 performs a process of recognizing the height of the road surface using the first parallax image D1 (step S21), and in parallel with this, the second parallax image generation unit 5 generates the second parallax image D2 (step S22). After that, the recognition processing unit 6 uses the second parallax image D2 to detect a vehicle V ahead on the road and obstacles (step S23). In this case, the processing speed of the entire system can be increased compared to when the recognition processing unit 6 performs all processing using the second parallax image D2.

次に、図９を用いて、実施例３の画像処理装置を説明する。なお、上記実施例との共通点については重複説明を省略する。 Next, an image processing device according to a third embodiment will be described with reference to FIG. 9. Note that a duplicate description of points common to the above embodiments will be omitted.

図９は、実施例３に係る画像処理装置１００の機能ブロック図であり、実施例１との相違は、視差精度低下エリア検知部７を追加した点である。この視差精度低下エリア検知部７は、第１視差画像生成部３で正しい視差が生成されない恐れがある画像エリアを判別し、第１視差画像生成部３はそのエリアに相当する、第１視差画像Ｄ１と特徴マップＭの一部を無効設定する。例えば雨天時に右画像Ｐ_Ｒと左画像Ｐ_Ｌの一部にワイパーが映り込んでいる場合などは、ワイパー部分の画像からは正確な視差が演算できない。このような部分は無効視差を出力するように第１視差画像生成部３に通知する。視差精度低下エリア検知部７の検知方法としては、画像の一部の輝度値が他の部分に比べて極端に暗くなっている状態を判定するなどの方式がある。 FIG. 9 is a functional block diagram of an image processing device 100 according to a third embodiment. The difference from the first embodiment is that a parallax accuracy reduction area detection unit 7 is added. The parallax accuracy reduction area detection unit 7 determines an image area where the first parallax image generation unit 3 may not generate a correct parallax, and the first parallax image generation unit 3 invalidates a part of the first parallax image D1 and the feature map M corresponding to the determined area. For example, when a wiper is reflected in a part of the right image P _R and the left image P _L during rainy weather, an accurate parallax cannot be calculated from the image of the wiper part. The first parallax image generation unit 3 is notified to output an invalid parallax for such a part. The parallax accuracy reduction area detection unit 7 may detect an image area where the brightness value of a part of the image is extremely dark compared to other parts.

次に、図１０を用いて、実施例４の画像処理装置を説明する。なお、上記実施例との共通点については重複説明を省略する。 Next, an image processing device according to a fourth embodiment will be described with reference to FIG. 10. Note that a duplicate description of points common to the above embodiments will be omitted.

図１０は、画像の処理位置に応じて画像縮小部２の縮小の倍率を変更する方式のフローチャートを示したものである。例えば図３のような実際の風景が撮影された画像とした場合、路面の画像の下側部分は路面の画像の上側部分に比べてカメラから近距離になるので道路幅は広くなり物体は大きく撮影される。従って、画像の下側では縮小の倍率を小さくしても問題なく視差を演算できると考えられる。 Figure 10 shows a flowchart of a method for changing the reduction ratio of the image reduction unit 2 depending on the processing position of the image. For example, in the case of an image of an actual landscape such as that shown in Figure 3, the lower part of the image of the road surface is closer to the camera than the upper part of the image of the road surface, so the road width is wider and objects are photographed larger. Therefore, it is considered that parallax can be calculated without any problem even if the reduction ratio is smaller in the lower part of the image.

図１０のフローチャートでは、撮影画像を上下に２分割して、その２分割したＹ座標の値に応じて縮小率を変えて画像を縮小し、第１視差画像生成部３の処理を行う。具体的には、左右画像のＹ座標を確認し（ステップＳ４０）、Ｙ座標が所定値以上であれば（近傍を撮影している可能性の高い画像下側では）縮小率を１／８に設定する（ステップＳ４１）。一方、Ｙ座標が所定値より小さければ（遠方を撮影している可能性の高い画像上側では）縮小率を１／４に設定する（ステップＳ４２）。そして、画像縮小部２は、ステップＳ４１，４２で設定した縮小率で縮小画像を生成し（ステップＳ４３）、第１視差画像生成部３は上下で縮小率の異なる左右の縮小画像を用いて第１視差画像Ｄ１や特徴マップＭを生成する。このようにすることにより、画像の縮小の倍率を小さくした分だけ、演算量を削減できる。 In the flowchart of FIG. 10, the captured image is divided into two parts, top and bottom, and the image is reduced by changing the reduction ratio according to the value of the Y coordinate of the divided image, and the first parallax image generating unit 3 performs processing. Specifically, the Y coordinates of the left and right images are checked (step S40), and if the Y coordinate is equal to or greater than a predetermined value (at the lower part of the image where it is highly likely that a nearby object is being photographed) the reduction ratio is set to 1/8 (step S41). On the other hand, if the Y coordinate is smaller than the predetermined value (at the upper part of the image where it is highly likely that a distant object is being photographed) the reduction ratio is set to 1/4 (step S42). Then, the image reducing unit 2 generates a reduced image at the reduction ratio set in steps S41 and 42 (step S43), and the first parallax image generating unit 3 generates the first parallax image D1 and the feature map M using the left and right reduced images with different reduction ratios for the top and bottom. In this way, the amount of calculation can be reduced by the amount of reduction in the reduction ratio of the image.

次に、図１１を用いて、実施例５の画像処理装置を説明する。なお、上記実施例との共通点については重複説明を省略する。 Next, an image processing device according to a fifth embodiment will be described with reference to FIG. 11. Note that a duplicate description of points common to the above embodiments will be omitted.

図１１は、画像の処理位置に応じて探索範囲を決定する固定値を変更するフローチャートである。上述したように、路面画像の下側は近傍が撮影され、路面画像の上側は遠方部が撮影される。式２に示すように視差は距離に応じて反比例するため、近距離部では視差値の１の差は距離の数十ｃｍの差程度であるが遠方では視差値の１の差は距離の数ｍの差となる。従って、第２視差画像Ｄ２の精度を第１視差画像Ｄ１に近づけるためには、探索範囲も距離に応じて変更する方が望ましい。そこで路面の画像を処理するときは、Ｙ座標が所定のＹ座標よりも小さいときは固定値を５に変更して探索範囲を決定し、第２視差画像生成部５で第２視差画像Ｄ２を生成する。具体的には、左右画像のＹ座標を確認し（ステップＳ５０）、Ｙ座標が所定値以上であれば（近傍を撮影している可能性の高い画像下側では）固定値を１０に設定する（ステップＳ５１）。一方、Ｙ座標が所定値より小さければ（遠方を撮影している可能性の高い画像上側では）固定値を５に設定する（ステップＳ５２）。そして、探索範囲決定部４は、ステップＳ５１，５２で設定した固定値で探索範囲を決定し（ステップＳ５３）、第２視差画像生成部５は上下で幅の異なる探索領域を用いて第２視差画像Ｄ２を生成する。このようにすることにより、探索範囲の幅を小さくした分だけ、演算量を削減できる。 Figure 11 is a flowchart for changing the fixed value that determines the search range according to the processing position of the image. As described above, the lower side of the road surface image captures the vicinity, and the upper side of the road surface image captures the distant area. As shown in Equation 2, the parallax is inversely proportional to the distance, so a difference of 1 in the parallax value in the near area is a difference of several tens of centimeters in distance, but a difference of 1 in the parallax value in the distant area is a difference of several meters in distance. Therefore, in order to bring the accuracy of the second parallax image D2 closer to that of the first parallax image D1, it is desirable to change the search range according to the distance. Therefore, when processing the image of the road surface, if the Y coordinate is smaller than a predetermined Y coordinate, the fixed value is changed to 5 to determine the search range, and the second parallax image generation unit 5 generates the second parallax image D2. Specifically, the Y coordinates of the left and right images are checked (step S50), and if the Y coordinate is equal to or greater than a predetermined value (at the lower side of the image where it is highly likely that the vicinity is being captured), the fixed value is set to 10 (step S51). On the other hand, if the Y coordinate is smaller than the predetermined value (at the top of the image where it is highly likely that a distant object is being photographed), a fixed value of 5 is set (step S52). Then, the search range determination unit 4 determines the search range using the fixed values set in steps S51 and S52 (step S53), and the second parallax image generation unit 5 generates the second parallax image D2 using a search region with different widths above and below. By doing this, the amount of calculations can be reduced by the amount that the width of the search range is reduced.

なお、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施例は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施例の構成の一部を他の実施例の構成に置き換えることが可能であり、また、ある実施例の構成に他の実施例の構成を加えることも可能である。また、各実施例の構成の一部について、他の構成の追加・削除・置換をすることが可能である。 The present invention is not limited to the above-described embodiments, but includes various modified examples. For example, the above-described embodiments have been described in detail to clearly explain the present invention, and are not necessarily limited to those having all of the configurations described. It is also possible to replace part of the configuration of one embodiment with the configuration of another embodiment, and it is also possible to add the configuration of another embodiment to the configuration of one embodiment. It is also possible to add, delete, or replace part of the configuration of each embodiment with other configurations.

１００：画像処理装置
１Ｌ：左カメラ
１Ｒ：右カメラ
２：画像縮小部
３：第１視差画像生成部
４：探索範囲決定部
４１、４３：減算器
４２、４４：加算器
５：第２視差画像生成部
５１：ＳＡＤ演算器
６：認識処理部
７：視差精度低下エリア検知部
Ｐ_Ｌ：左画像
Ｐ_Ｒ：右画像
Ｐ_Ｂ：基準参照ブロック画像
Ｐ_Ｂ’：参照ブロック画像
Ｓ_Ｒ：右縮小画像
Ｓ_Ｌ：左縮小画像
Ｄ１：第１視差画像
Ｄ２：第２視差画像
Ｍ：特徴マップ 100: Image processing device 1L: Left camera 1R: Right camera 2: Image reduction unit 3: First parallax image generation unit 4: Search range determination unit 41, 43: Subtractor 42, 44: Adder 5: Second parallax image generation unit 51: SAD calculator 6: Recognition processing unit 7: Parallax accuracy reduction area detection unit P _L : Left image P _R : Right image P _B : Base reference block image P _B ': Reference block image S _R : Right reduced image S _L : Left reduced image D1: First parallax image D2: Second parallax image M: Feature map

Claims

an image reducing unit that reduces two input images to generate two reduced images;
a first parallax image generating unit that obtains a parallax between the two reduced images by neural network processing and generates a reduced parallax image;
a second parallax image generating unit that obtains a region in one of the input images that matches a partial region of the other input image, thereby obtaining a parallax of the two input images and generating a parallax image;
the second parallax image generating unit sets a search range for determining the matching area based on a parallax value of each pixel of the reduced parallax image for each area of the input image to which each pixel of the reduced parallax image corresponds, and
The image processing device according to claim 1, further comprising: a step of: expanding the search range for pixels having a large feature amount in the feature map output by the first parallax image generating unit .

an image reducing unit that reduces two input images to generate two reduced images;
a first parallax image generating unit that obtains a parallax between the two reduced images by neural network processing and generates a reduced parallax image;
a second parallax image generating unit that obtains a region in one of the input images that matches a partial region of the other input image, thereby obtaining a parallax of the two input images and generating a parallax image;
In the image processing device, the second parallax image generating unit sets a search range for determining the matching area based on a parallax value of each pixel of the reduced parallax image for each area of the input image to which each pixel of the reduced parallax image corresponds ,
13. An image processing device comprising: an input image of a road surface divided into upper and lower parts for processing, the input image being reduced at different reduction rates from the upper image and the lower image.

3. The image processing device according to claim 1,
The image processing device according to claim 1, wherein the second parallax image generating unit generates a parallax having the same value as or an approximate value to the reduced parallax image at a corresponding position.

3. The image processing device according to claim 1,
The image processing device, wherein the second parallax image generating unit performs matching processing within the search range.

3. The image processing device according to claim 1,
2. An image processing device comprising: a step of: performing a process of recognizing a height of a road surface using the reduced parallax image; and recognizing an object on the road surface based on a result of the process performed by the second parallax image generating unit.

3. The image processing device according to claim 1,
an image processing device having a disparity accuracy reduction area detection unit that determines from the reduced image that there is an area that cannot be calculated as part of the disparity calculated by the first disparity image generation unit, and indicates that the disparity of the image area detected by the disparity accuracy reduction area detection unit is invalid.

3. The image processing device according to claim 1,
13. An image processing device comprising: a first parallax image generating unit configured to generate a second parallax image for generating a second parallax image on a first image side of a road surface;

a first step of reducing two input images to generate two reduced images;
a second step of calculating a parallax between the two reduced images by neural network processing and generating a reduced parallax image;
a third step of obtaining a region in one of the input images that matches a partial region of the other of the input images, thereby obtaining a disparity between the two input images and generating a disparity image;
the third step is a method for processing an image, comprising the steps of: setting a search range for determining the matching region based on a disparity value of each pixel of the reduced disparity image for each region of the input image to which each pixel of the reduced disparity image corresponds ; and widening the search range for pixels having a large feature amount in a feature map .

a first step of reducing two input images to generate two reduced images;
a second step of calculating a parallax between the two reduced images by neural network processing and generating a reduced parallax image;
a third step of obtaining a region in one of the input images that matches a partial region of the other of the input images, thereby obtaining a disparity between the two input images and generating a disparity image;
In the third step, for each region of the input image to which each pixel of the reduced parallax image corresponds, a search range is set based on the parallax value of each pixel of the reduced parallax image when determining the matching region, and when the road surface portion of the input image is divided into upper and lower parts and processed, the reduction ratio of the upper image and the reduction ratio of the lower image are different.