JP7493545B2

JP7493545B2 - Position estimation device, position estimation method, and program

Info

Publication number: JP7493545B2
Application number: JP2022053028A
Authority: JP
Inventors: 敦俊長谷部
Original assignee: KYB Corp
Current assignee: KYB Corp
Priority date: 2022-03-29
Filing date: 2022-03-29
Publication date: 2024-05-31
Anticipated expiration: 2042-03-29
Also published as: JP2023146044A; WO2023189544A1

Description

本開示は、位置推定装置、位置推定方法、及びプログラムに関する。 This disclosure relates to a position estimation device, a position estimation method, and a program.

道路脇には、雑草や樹木を含む様々な植生が存在する。これらの植生が成長し、葉や茎の数が増えたり、背丈が高くなったり、幹や茎が太くなると、道路上を通行する車両の進行の妨げとなる恐れがある。そのため、道路管理の業務においては、目視で植生の道路面へのはみ出しの有無、及びその位置関係を把握したうえで、道路面にはみ出した植生の伐採範囲を決定していた。また例えば特許文献１には、近赤外カメラでの植生の撮像を、期間をおいて複数回行い、それらの複数の撮像画像から植生の変化率を算出して、伐採する植生を選択する旨が記載されている。 There are various vegetation, including weeds and trees, on the side of the road. When this vegetation grows and the number of leaves and stems increases, it grows taller, and the trunks and stems become thicker, it may hinder the progress of vehicles traveling on the road. For this reason, in road management work, the presence or absence of vegetation protruding onto the road surface and its positional relationship are visually determined, and then the extent of cutting down the vegetation protruding onto the road surface is determined. For example, Patent Document 1 describes how vegetation is photographed multiple times with a near-infrared camera over a period of time, and the rate of change of the vegetation is calculated from the multiple captured images to select the vegetation to be cut down.

特開２０２１－１０８５８２号公報JP 2021-108582 A

しかしながら、伐採する植生の位置関係を、容易に検出することが求められている。 However, there is a need to easily detect the relative positions of vegetation to be cut down.

上述した課題を解決し、目的を達成するために、本開示に係る位置推定装置は、撮像された画像データから、対象物を含む対象領域を検出する領域検出部と、前記撮像された画像に写る範囲の深度情報を検出する距離検出部と、前記深度情報に基づき、前記対象領域までの距離の情報を取得する領域距離取得部と、を含み、前記領域検出部は、第１対象物を含む第１対象領域と第２対象物を含む第２対象領域とを検出し、前記第１対象領域と前記第２対象領域とが画像の同一垂線上にあり、かつ、前記深度情報において前記第１対象領域の深度と前記第２対象領域の深度との差が所定の範囲内に含まれるかを判定する判定部を更に含む。 In order to solve the above-mentioned problems and achieve the object, the position estimation device according to the present disclosure includes an area detection unit that detects an object area including an object from captured image data, a distance detection unit that detects depth information of the range captured in the captured image, and an area distance acquisition unit that acquires information on the distance to the object area based on the depth information, and the area detection unit detects a first object area including a first object and a second object area including a second object, and further includes a determination unit that determines whether the first object area and the second object area are on the same perpendicular line in the image and whether the difference in the depth information between the depth of the first object area and the depth of the second object area is within a predetermined range.

上述した課題を解決し、目的を達成するために、本開示に係る位置推定方法は、撮像された画像データから、対象物を含む対象領域を検出するステップと、前記撮像された画像に写る範囲の深度情報を検出するステップと、前記深度情報に基づき、前記対象領域までの距離の情報を取得するステップと、を含み、前記対象領域を検出するステップでは、第１対象物を含む第１対象領域と第２対象物を含む第２対象領域とを検出し、前記第１対象領域と前記第２対象領域とが画像の同一垂線上にあり、かつ、前記深度情報において前記第１対象領域の深度と前記第２対象領域の深度との差が所定の範囲内に含まれるかを判定するステップを更に含む。 In order to solve the above-mentioned problems and achieve the objective, the position estimation method according to the present disclosure includes the steps of detecting a target area including a target from captured image data, detecting depth information of the range captured in the captured image, and acquiring information on the distance to the target area based on the depth information, and the step of detecting the target area further includes the steps of detecting a first target area including a first target and a second target area including a second target, determining whether the first target area and the second target area are on the same perpendicular line in the image and whether the difference in the depth information between the depth of the first target area and the depth of the second target area is within a predetermined range.

上述した課題を解決し、目的を達成するために、本開示に係るプログラムは、撮像された画像データから、対象物を含む対象領域を検出するステップと、前記撮像された画像に写る範囲の深度情報を検出するステップと、前記深度情報に基づき、前記対象領域までの距離の情報を取得するステップと、をコンピュータに実行させ、前記対象領域を検出するステップでは、第１対象物を含む第１対象領域と第２対象物を含む第２対象領域とを検出し、前記第１対象領域と前記第２対象領域とが画像の同一垂線上にあり、かつ、前記深度情報において前記第１対象領域の深度と前記第２対象領域の深度との差が所定の範囲内に含まれるかを判定するステップを更に含む。 In order to solve the above-mentioned problems and achieve the object, the program of the present disclosure causes a computer to execute the steps of detecting a target area including an object from captured image data, detecting depth information of the range captured in the captured image, and acquiring information on the distance to the target area based on the depth information, and the step of detecting the target area further includes the steps of detecting a first target area including a first object and a second target area including a second object, determining whether the first target area and the second target area are on the same perpendicular line of the image and whether the difference in the depth information between the depth of the first target area and the depth of the second target area is within a predetermined range.

本開示によれば、撮像した画像に写った対象物までの位置関係を、容易に検出できる。 According to this disclosure, it is possible to easily detect the positional relationship to an object captured in a captured image.

図１は、本実施形態に係る位置推定システムの模式的なブロック図である。FIG. 1 is a schematic block diagram of a position estimation system according to the present embodiment. 図２は、車両の模式図である。FIG. 2 is a schematic diagram of a vehicle. 図３は、位置推定装置の模式的なブロック図である。FIG. 3 is a schematic block diagram of a position estimation device. 図４は、撮像された画像の一例を示す図である。FIG. 4 is a diagram showing an example of a captured image. 図５は、領域検出部が対象領域の検出を行った結果の一例を示す図である。FIG. 5 is a diagram showing an example of a result of detection of a target region by the region detection unit. 図６は、距離検出部が深度情報の検出を行った結果の一例を示す図である。FIG. 6 is a diagram showing an example of a result of detection of depth information by the distance detection unit. 図７は、領域距離取得部が対象物までの距離の情報を取得した結果の一例を示す図である。FIG. 7 is a diagram showing an example of a result of the area distance acquisition unit acquiring information on the distance to the target object. 図８は、位置推定装置の処理フローを説明するフローチャートである。FIG. 8 is a flowchart illustrating a process flow of the position estimation device.

以下に、本開示の実施形態を図面に基づいて詳細に説明する。なお、以下に説明する実施形態により本開示が限定されるものではない。 Embodiments of the present disclosure are described in detail below with reference to the drawings. Note that the present disclosure is not limited to the embodiments described below.

（位置推定システム）
図１は、本実施形態に係る位置推定システムの模式的なブロック図である。図１に示すように、本実施形態に係る位置推定システム１は、車両４と、測定データ取得装置６と、位置推定装置１０とを含む。位置推定システム１は、位置推定装置１０によって、撮像された画像に写っている対象物を含む領域までの距離を算出する。以下、対象物を含む領域を、適宜、対象領域と記載する。 (Location Estimation System)
Fig. 1 is a schematic block diagram of a position estimation system according to this embodiment. As shown in Fig. 1, the position estimation system 1 according to this embodiment includes a vehicle 4, a measurement data acquisition device 6, and a position estimation device 10. The position estimation system 1 calculates a distance to an area including an object captured in a captured image by the position estimation device 10. Hereinafter, the area including the object will be referred to as the target area as appropriate.

位置推定システム１においては、車両４が、道路を走行しながら周囲を撮像し、撮像した画像データを測定データ取得装置６に送信する。測定データ取得装置６は、例えば道路を管理する主体に管理される装置（コンピュータ）である。測定データ取得装置６は、車両４から送信された画像データを、位置推定装置１０に送信する。このように、位置推定装置１０は、測定データ取得装置６を介して画像データを取得して、その画像データに写る対象物までの距離を算出する。ただしそれに限られず、例えば、位置推定システム１は、測定データ取得装置６が設けられておらず、位置推定装置１０が、車両４から画像データを取得してもよい。また、位置推定装置１０は、車両４によって撮像された画像データを取得することに限られず、撮像された任意の画像データを取得して、その画像データに写る対象物までの距離を算出してよい。 In the position estimation system 1, the vehicle 4 captures images of the surroundings while traveling on a road, and transmits the captured image data to the measurement data acquisition device 6. The measurement data acquisition device 6 is, for example, a device (computer) managed by an entity that manages the road. The measurement data acquisition device 6 transmits the image data transmitted from the vehicle 4 to the position estimation device 10. In this way, the position estimation device 10 acquires image data via the measurement data acquisition device 6 and calculates the distance to an object depicted in the image data. However, this is not limited to this, and for example, the position estimation system 1 may not be provided with the measurement data acquisition device 6, and the position estimation device 10 may acquire image data from the vehicle 4. Furthermore, the position estimation device 10 is not limited to acquiring image data captured by the vehicle 4, and may acquire any image data captured and calculate the distance to an object depicted in the image data.

（車両）
図２は、車両の模式図である。図２に示すように、車両４は、カメラ４Ａと、測定装置４Ｂとを備える。カメラ４Ａは、車両４の周囲を撮像するカメラである。より詳しくは、本実施形態においては、カメラ４Ａは、車両４が移動している道路を含む領域を撮像する。カメラ４Ａは、光学素子と撮像素子とを含むカメラによって実現されてよい。光学素子は、例えばレンズ、ミラー、プリズム、フィルタなどの光学系を構成する素子である。撮像素子は、光学素子を通して入射した光を電気信号である画像信号に変換する素子である。撮像素子は、例えば、ＣＣＤ（ＣｈａｒｇｅＣｏｕｐｌｅｄＤｅｖｉｃｅ）センサやＣＭＯＳ（ＣｏｍｐｌｅｍｅｎｔａｒｙＭｅｔａｌＯｘｉｄｅＳｅｍｉｃｏｎｄｕｃｔｏｒ）センサなどにより実現されてよい。 (vehicle)
FIG. 2 is a schematic diagram of a vehicle. As shown in FIG. 2, the vehicle 4 includes a camera 4A and a measuring device 4B. The camera 4A is a camera that captures an image of the surroundings of the vehicle 4. More specifically, in this embodiment, the camera 4A captures an image of an area including a road on which the vehicle 4 is moving. The camera 4A may be realized by a camera including an optical element and an image sensor. The optical element is an element that constitutes an optical system, such as a lens, a mirror, a prism, or a filter. The image sensor is an element that converts light incident through the optical element into an image signal that is an electrical signal. The image sensor may be realized by, for example, a CCD (Charge Coupled Device) sensor or a CMOS (Complementary Metal Oxide Semiconductor) sensor.

また、車両４は、車両４の位置情報を取得する位置センサを有していてもよい。車両４の位置情報とは、車両４及び道路の位置を規定可能な座標系である所定座標系における、車両４の位置を示す情報である。ここでの所定座標系は、車両４や道路上の位置を規定可能な任意の座標系であってよく、例えば本実施形態では地球座標系であってよい。この場合、位置センサは、ＧＮＳＳ（ＧｌｏｂａｌＮａｖｉｖａｔｉｏｎＳａｔｅｌｉｔｅＳｙｓｔｅｍ）用のモジュールであってよい。 The vehicle 4 may also have a position sensor that acquires position information of the vehicle 4. The position information of the vehicle 4 is information indicating the position of the vehicle 4 in a specified coordinate system that is a coordinate system that can define the positions of the vehicle 4 and the road. The specified coordinate system here may be any coordinate system that can define the position of the vehicle 4 and the road, and may be the Earth coordinate system in this embodiment, for example. In this case, the position sensor may be a module for GNSS (Global Navigation Satellite System).

測定装置４Ｂは、カメラ４Ａを制御して画像を撮像させて、カメラ４Ａが撮像した画像データを記録する装置である。すなわち、測定装置４Ｂは、画像データを記録するデータロガーとして機能する。測定装置４Ｂは、コンピュータであるとも言え、制御部４Ｂ１と、記憶部４Ｂ２と、通信部４Ｂ３とを含む。制御部４Ｂ１は、演算装置であり、例えばＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）などの演算回路を含む。記憶部４Ｂ２は、制御部４Ｂ１の演算内容やプログラム、画像データなどの各種情報を記憶するメモリであり、例えば、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）と、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）のような主記憶装置と、フラッシュメモリやＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）などの不揮発性の記憶装置のうち、少なくとも１つ含む。なお、記憶部４Ｂ２が保存する制御部４Ｂ１用のプログラムは、測定装置４Ｂが読み取り可能な記録媒体に記憶されていてもよい。通信部４Ｂ３は、外部の装置と通信を行う通信モジュールであり、例えば、ＮＩＣ（ＮｅｔｗｏｒｋＩｎｔｅｒｆａｃｅＣａｒｄ）、無線ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）カード等によって実現される。 The measuring device 4B is a device that controls the camera 4A to capture an image and records the image data captured by the camera 4A. In other words, the measuring device 4B functions as a data logger that records image data. The measuring device 4B can also be said to be a computer, and includes a control unit 4B1, a memory unit 4B2, and a communication unit 4B3. The control unit 4B1 is a calculation device and includes a calculation circuit such as a CPU (Central Processing Unit). The memory unit 4B2 is a memory that stores various information such as the calculation contents, programs, and image data of the control unit 4B1, and includes at least one of a main memory device such as a RAM (Random Access Memory) and a ROM (Read Only Memory), and a non-volatile memory device such as a flash memory or a HDD (Hard Disk Drive). The program for the control unit 4B1 stored in the memory unit 4B2 may be stored in a recording medium that can be read by the measurement device 4B. The communication unit 4B3 is a communication module that communicates with external devices, and is realized by, for example, a NIC (Network Interface Card) or a wireless LAN (Local Area Network) card.

制御部４Ｂ１は、記憶部４Ｂ２に記憶されたプログラムを読み出して、カメラ４Ａの制御を実行する。制御部４Ｂ１は、車両４が道路を走行中に、カメラ４Ａに道路を含む領域を撮像させて、撮像された画像データを取得する。 The control unit 4B1 reads out the program stored in the memory unit 4B2 and executes control of the camera 4A. While the vehicle 4 is traveling on a road, the control unit 4B1 causes the camera 4A to capture an image of an area including the road, and acquires the captured image data.

（位置推定装置）
図３は、位置推定装置の模式的なブロック図である。図３に示すように、位置推定装置１０は、例えばコンピュータであり、記憶部１２と、制御部１３と、通信部１４と、表示部１５とを含む。記憶部１２は、制御部１３の演算内容やプログラムを記憶するメモリであり、例えば、ＲＡＭと、ＲＯＭのような主記憶装置と、フラッシュメモリやＨＤＤなどの不揮発性の記憶装置のうち、少なくとも１つを含む。なお、記憶部１２が保存する制御部１３用のプログラムは、位置推定装置１０が読み取り可能な記録媒体に記憶されていてもよい。通信部１４は、外部の装置と通信を行う通信モジュールであり、例えば、ＮＩＣ、無線ＬＡＮカード等によって実現される。表示部１５は、画像を表示するディスプレイである。表示部１５は、後述の出力部２５によって制御されて、出力部２５は、画像データを表示部１５に出力することで、表示部１５に画像データに対応する画像を表示させる。 (Location Estimation Device)
FIG. 3 is a schematic block diagram of the position estimation device. As shown in FIG. 3, the position estimation device 10 is, for example, a computer, and includes a storage unit 12, a control unit 13, a communication unit 14, and a display unit 15. The storage unit 12 is a memory that stores the calculation contents and programs of the control unit 13, and includes at least one of a RAM, a main storage device such as a ROM, and a non-volatile storage device such as a flash memory or a HDD. The program for the control unit 13 stored in the storage unit 12 may be stored in a recording medium that can be read by the position estimation device 10. The communication unit 14 is a communication module that communicates with an external device, and is realized by, for example, a NIC, a wireless LAN card, or the like. The display unit 15 is a display that displays an image. The display unit 15 is controlled by an output unit 25 described later, and the output unit 25 outputs image data to the display unit 15, thereby causing the display unit 15 to display an image corresponding to the image data.

制御部１３は、演算装置であり、例えばＣＰＵなどの演算回路を含む。制御部１３は、領域検出部２１と、距離検出部２２と、領域距離取得部２３と、判定部２４と、出力部２５とを含む。制御部１３は、記憶部１２からプログラム（ソフトウェア）を読み出して実行することで、領域検出部２１と距離検出部２２と領域距離取得部２３と判定部２４と出力部２５とを実現して、それらの処理を実行する。なお、制御部１３は、１つのＣＰＵによってこれらの処理を実行してもよいし、複数のＣＰＵを備えて、それらの複数のＣＰＵで、処理を実行してもよい。また、領域検出部２１と距離検出部２２と領域距離取得部２３と判定部２４と出力部２５との少なくとも一部を、ハードウェアで実現してもよい。 The control unit 13 is a calculation device and includes a calculation circuit such as a CPU. The control unit 13 includes an area detection unit 21, a distance detection unit 22, an area distance acquisition unit 23, a determination unit 24, and an output unit 25. The control unit 13 realizes the area detection unit 21, the distance detection unit 22, the area distance acquisition unit 23, the determination unit 24, and the output unit 25 by reading and executing a program (software) from the storage unit 12, and executes these processes. The control unit 13 may execute these processes using one CPU, or may be provided with multiple CPUs and execute the processes using the multiple CPUs. Also, at least a part of the area detection unit 21, the distance detection unit 22, the area distance acquisition unit 23, the determination unit 24, and the output unit 25 may be realized by hardware.

本実施形態では、位置推定装置１０は、画像データと対象物を含む領域との対応関係を機械学習した第１ＡＩ（ＡｒｔｉｆｉｃｉａｌＩｎｔｅｌｌｉｇｅｎｃｅ）モデルと、画像データと画像の位置毎の深度との対応関係を機械学習した第２ＡＩモデルとを用いて、画像データに写る対象物までの距離を検出する。以降においては、第１ＡＩモデル及び第２ＡＩモデルの学習方法を説明し、その後に、学習済みのこれらのＡＩモデルを用いた距離の検出方法を説明する。以降においては、位置推定装置１０が、これらのＡＩモデルを機械学習させることを例にするが、それに限られず、位置推定装置１０は、他の装置によって機械学習されたこれらのＡＩモデルを取得してもよい。 In this embodiment, the position estimation device 10 detects the distance to an object shown in image data using a first AI (Artificial Intelligence) model that has been machine-learned to learn the correspondence between image data and an area containing the object, and a second AI model that has been machine-learned to learn the correspondence between image data and the depth for each position on the image. In the following, a learning method for the first AI model and the second AI model will be described, and then a distance detection method using these trained AI models will be described. In the following, an example will be given in which the position estimation device 10 trains these AI models by machine learning, but this is not limited thereto, and the position estimation device 10 may acquire these AI models that have been machine-learned by other devices.

（第１ＡＩモデルの学習処理）
位置推定装置１０は、画像データと、対象領域を示す情報とを教師データとして、第１ＡＩモデルに機械学習させる。対象領域を示す情報とは、画像における対象領域の位置（座標）を示す情報である。位置推定装置１０は、画像データを入力値とし、対象領域を示す情報を出力値としたデータセットを、教師データとして、第１ＡＩモデルに入力する。位置推定装置１０は、画像データと対象領域を示す情報とからなるデータセットを複数準備して、複数のデータセットのそれぞれを第１ＡＩモデルに入力することが好ましい。これにより、第１ＡＩモデルは、画像データと対象領域との対応関係を機械学習して、画像データが入力されたら、対象領域を示す情報が出力されるモデル（プログラム）となる。なお、本実施形態では、対象領域を複数種類設定する。従って、第１ＡＩモデルは、画像データが入力されたら、複数種類の対象領域のうちで、その画像に含まれている対象領域の種類（対象領域の種類を示すラベル）と、画像におけるその対象領域の位置を示す情報とを、対象領域を示す情報として出力する。第１ＡＩモデルの機械学習に用いる画像データは、任意のものを用いてよいが、例えば、カメラ４Ａに予め撮像させておいた道路を含む領域の画像データを用いてよい。また、対象領域を指定する情報は、例えばユーザにより指定されるなど、適宜設定されてよい。 (Learning process of first AI model)
The position estimation device 10 trains the first AI model by machine learning using image data and information indicating the target area as teacher data. The information indicating the target area is information indicating the position (coordinates) of the target area in the image. The position estimation device 10 inputs a data set in which the image data is used as an input value and the information indicating the target area is used as an output value to the first AI model as teacher data. It is preferable that the position estimation device 10 prepares a plurality of data sets consisting of image data and information indicating the target area, and inputs each of the plurality of data sets to the first AI model. As a result, the first AI model trains the correspondence between the image data and the target area by machine learning, and becomes a model (program) that outputs information indicating the target area when the image data is input. In this embodiment, multiple types of target areas are set. Therefore, when image data is input, the first AI model outputs, as information indicating the target area, the type of the target area included in the image (a label indicating the type of the target area) and information indicating the position of the target area in the image, among the multiple types of target areas. Any image data may be used for the machine learning of the first AI model, and for example, image data of an area including a road captured in advance by the camera 4A may be used. Information specifying the target area may be appropriately set, for example, by a user.

このように、第１ＡＩモデルは、いわゆる教師ありのＡＩモデルであるが、それに限られず教師なしのＡＩモデルであってもよい。 Thus, the first AI model is a so-called supervised AI model, but is not limited to this and may also be an unsupervised AI model.

第１ＡＩモデルは、入力されたデータに基づき、そのデータのラベルを判定できる任意のＡＩモデルであってよく、例えば、ＣＮＮ（ＣｏｎｖｅｎｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ：畳み込みニューラルネットワーク）モデルであってよい。また、第１ＡＩモデルは、画像データの画素（ピクセル）を、どの物体クラスに属するかにより分類する手法であるセマンティック・セグメンテーション（ＳｅｍａｎｔｉｃＳｅｇｍｅｎｔａｔｉｏｎ）により実現されてもよいし、画像データの画素を、どの物体クラスに属するか、及びどの事例に属するかにより分類する手法であるインスタント・セグメンテーション（ＩｎｓｔａｎｔＳｅｇｍｅｎｔａｔｉｏｎ）により実現されてもよいし、セマンティック・セグメンテーションとインスタント・セグメンテーションを組み合わせた手法であり、画像データの全ての画素（ピクセル）にラベルが付与され、かつ数えられる物体に対しては、そのそれぞれを個別で認識するパノプティック・セグメンテーション（ＰａｎｏｐｔｉｃＳｅｇｍｅｎｔａｔｉｏｎ）により実現されてよい。 The first AI model may be any AI model that can determine the label of the data based on the input data, for example, a CNN (Conventional Neural Network) model. The first AI model may be realized by semantic segmentation, which is a method of classifying pixels of image data according to which object class they belong to, instant segmentation, which is a method of classifying pixels of image data according to which object class and which example they belong to, or panoptic segmentation, which is a combination of semantic segmentation and instant segmentation, in which all pixels of image data are assigned a label and, for countable objects, each of them is recognized individually.

（第２ＡＩモデルの学習処理）
位置推定装置１０は、画像データと深度情報とを教師データとして、第２ＡＩモデルに機械学習させる。深度情報とは、検出元から対象までの距離である。位置推定装置１０は、画像データを入力値とし、画像データにおける画素（画像上の位置）毎の深度情報を出力値としたデータセットを、第２ＡＩモデルに入力する。位置推定装置１０は、画像データと画素毎の深度情報とからなるデータセットを複数準備して、複数のデータセットのそれぞれを第２ＡＩモデルに入力することが好ましい。これにより、第２ＡＩモデルは、画像データと画素毎の深度情報との対応関係を機械学習して、画像データが入力されたら、画素毎の深度情報が出力されるモデル（プログラム）となる。 (Learning process of second AI model)
The position estimation device 10 trains the second AI model by machine learning using image data and depth information as teacher data. Depth information is the distance from the detection source to the target. The position estimation device 10 inputs a data set in which image data is used as an input value and depth information for each pixel (position on the image) in the image data is used as an output value to the second AI model. It is preferable that the position estimation device 10 prepares a plurality of data sets consisting of image data and depth information for each pixel, and inputs each of the plurality of data sets to the second AI model. As a result, the second AI model trains the correspondence between image data and depth information for each pixel by machine learning, and becomes a model (program) that outputs depth information for each pixel when image data is input.

なお、第２ＡＩモデルの機械学習に用いる画像データは、任意のものを用いてよいが、例えば、車両４に予め撮像させておいた道路を含む領域の画像データを用いてよい。また、深度情報は、予め検出されたものを用いる。この場合例えば、車両４に、深度検出センサを設けておき、カメラ４Ａに画像を撮像させつつ、深度検出センサに、カメラ４Ａに撮像された画像に写る範囲の深度情報を検出させる。深度検出センサが検出した深度情報は、検出元である車両４（深度検出センサ）から、対象までの距離ともいえる。深度検出センサは、例えば、ＬｉＤＡＲ（ＬｉｇｈｔＤｅｔｅｃｔｉｏｎＡｎｄＲａｎｇｉｎｇ）を用いて実現されてよい。ＬｉＤＡＲは、レーザ光を対象に照射し、レーザ光が対象に当たって反射する反射光が検出されるまでの時間を計測することで、対象までの距離や対象が位置する方向を計測する。また、深度検出センサは、ステレオカメラであってもよい。ステレオカメラは、２つのカメラによって対象の画像を撮像し、三角測量法を用いて対象までの距離を計測する。また、第２ＡＩモデルも、任意のＡＩモデルであってよい。 The image data used for the machine learning of the second AI model may be any image data, but for example, image data of an area including a road that has been captured in advance by the vehicle 4 may be used. Also, the depth information is detected in advance. In this case, for example, a depth detection sensor is provided in the vehicle 4, and while the camera 4A captures an image, the depth detection sensor detects depth information of the range captured in the image captured by the camera 4A. The depth information detected by the depth detection sensor can also be said to be the distance from the vehicle 4 (depth detection sensor) that is the detection source to the target. The depth detection sensor may be realized, for example, using LiDAR (Light Detection and Ranging). LiDAR irradiates a laser light to the target and measures the time until the reflected light reflected by the laser light hitting the target is detected, thereby measuring the distance to the target and the direction in which the target is located. Also, the depth detection sensor may be a stereo camera. The stereo camera captures images of the target using two cameras and measures the distance to the target using triangulation. The second AI model can also be any AI model.

以降において、位置推定装置１０による、対象物までの距離の検出方法について説明する。 The following describes how the position estimation device 10 detects the distance to an object.

（対象領域の検出）
図４は、撮像された画像の一例を示す図であり、図５は、領域検出部が対象領域の検出を行った結果の一例を示す図である。領域検出部２１は、撮像された画像データから、対象領域を検出する。具体的には、領域検出部２１は、測定データ取得装置６から、車両４のカメラ４Ａによって撮像された画像データを取得して、取得した画像データから、対象領域を検出する。本実施形態においては、領域検出部２１は、カメラ４Ａによって撮像された画像データを、機械学習済みの第１ＡＩモデルに入力することで、カメラ４Ａによって撮像された画像に含まれる対象領域を検出する。第１ＡＩモデルにおいては、画像データが入力データとして入力されて演算が実行されて、その画像データに含まれる対象領域の情報が、出力データとして出力される。より詳しくは、第１ＡＩモデルからは、入力された画像データに含まれている対象領域の種類と、画像におけるその対象領域の位置とが、対象領域の情報として出力される。領域検出部２１は、第１ＡＩモデルから出力された対象領域の情報を取得することで、対象領域を検出する。 (Detection of target area)
FIG. 4 is a diagram showing an example of a captured image, and FIG. 5 is a diagram showing an example of a result of the area detection unit detecting the target area. The area detection unit 21 detects the target area from the captured image data. Specifically, the area detection unit 21 acquires image data captured by the camera 4A of the vehicle 4 from the measurement data acquisition device 6, and detects the target area from the acquired image data. In this embodiment, the area detection unit 21 detects the target area included in the image captured by the camera 4A by inputting the image data captured by the camera 4A to a first AI model that has been machine-learned. In the first AI model, the image data is input as input data, a calculation is performed, and information on the target area included in the image data is output as output data. More specifically, the first AI model outputs the type of the target area included in the input image data and the position of the target area in the image as information on the target area. The area detection unit 21 detects the target area by acquiring information on the target area output from the first AI model.

言い換えれば、領域検出部２１は、画像データから、その画像データに含まれる対象領域の境界に沿って対象物を検出することで、画像から対象物（対象領域）を検出する。また、領域検出部２１は、対象物を種類毎に検出することで、検出された対象物の種類を分類する。また、領域検出部２１は、カメラ４Ａで撮像された画像データから、検出された対象領域の画像データを抽出することで、検出された対象領域のみが写る抽出画像を生成してもよい。例えばこの場合、対象領域以外の領域の階調値をゼロとしつつ対象領域の階調値をゼロより大きくする２値化処理を行ったり、対象領域以外の領域の階調値をゼロとしつつ対象領域の種類毎に階調値を異ならせる処理を行ったりすることで、抽出画像を生成してよい。 In other words, the area detection unit 21 detects the object (target area) from the image by detecting the object along the boundary of the target area included in the image data. The area detection unit 21 also classifies the type of the detected object by detecting each type of object. The area detection unit 21 may also generate an extracted image that shows only the detected target area by extracting image data of the detected target area from the image data captured by the camera 4A. For example, in this case, the extracted image may be generated by performing a binarization process that sets the gradation value of the target area to greater than zero while setting the gradation value of the area other than the target area to zero, or by performing a process that sets the gradation value of the area other than the target area to zero while varying the gradation value for each type of target area.

図４に示すように、道路Ａ１及び植生Ａ２を含む画像ＰＡの画像データが取得された場合を例にする。この場合、例えば第１ＡＩモデルは、対象物の種類が道路である第１対象領域の特徴量と、対象物の種類が植生である第２対象領域の特徴量とを機械学習済みであってよい。そしてこの場合には、領域検出部２１は、画像ＰＡの画像データから、道路を含む第１対象領域の情報と植生を含む第２対象領域の情報とを検出して、図４に示すように、道路Ａ１に対応する第１対象領域ＣＬ１ａと、植生Ａ２に対応する第２対象領域ＣＬ２ａとのみが含まれる抽出画像ＰＢ（抽出画像データ）を生成する。なお、図４では、対象領域の種類として、植生と道路とが検出されているが、対象領域の種類はそれに限られず任意であってよい。また、検出（分類）される対象領域の種類の数も２つに限られず任意であってよく、１つであってもよいし３つ以上であってもよい。 As shown in FIG. 4, an example is taken in which image data of an image PA including a road A1 and vegetation A2 is acquired. In this case, for example, the first AI model may have already machine-learned the feature amount of a first target area whose type of object is a road and the feature amount of a second target area whose type of object is vegetation. In this case, the area detection unit 21 detects information on the first target area including the road and information on the second target area including vegetation from the image data of the image PA, and generates an extracted image PB (extracted image data) that includes only the first target area CL1a corresponding to the road A1 and the second target area CL2a corresponding to the vegetation A2, as shown in FIG. 4. Note that in FIG. 4, vegetation and roads are detected as the types of target areas, but the types of target areas are not limited to these and may be any. In addition, the number of types of target areas detected (classified) is not limited to two and may be any, and may be one or three or more.

領域検出部２１は、第１ＡＩモデルを用いて対象領域を検出することに限られず、任意の方法で画像データから対象領域を検出してよい。 The area detection unit 21 is not limited to detecting the target area using the first AI model, but may detect the target area from the image data using any method.

（深度情報の検出）
図６は、距離検出部が深度情報の検出を行った結果の一例を示す図である。距離検出部２２は、カメラ４Ａによって撮像された画像（領域検出部２１が対象物検出を行った画像）に写る範囲の深度情報を検出する。具体的には、距離検出部２２は、カメラ４Ａによって撮像された画像の位置（画素）毎に、深度情報を検出する。本実施形態においては、距離検出部２２は、カメラ４Ａによって撮像された画像データ（領域検出部２１が対象物検出を行った画像データ）を用いて、深度情報を検出する。さらに言えば、距離検出部２２は、カメラ４Ａによって撮像された画像データを、機械学習済みの第２ＡＩモデルに入力することで、画像データの位置毎の深度情報を検出する。第２ＡＩモデルにおいては、画像データが入力データとして入力されて演算が実行されて、その画像データの位置毎の深度情報が、出力データとして出力される。領域検出部２１は、第２ＡＩモデルから出力された深度情報を取得することで、深度情報を検出する。 (Depth information detection)
FIG. 6 is a diagram showing an example of the result of the distance detection unit detecting depth information. The distance detection unit 22 detects depth information of the range captured in the image captured by the camera 4A (the image in which the area detection unit 21 performed the object detection). Specifically, the distance detection unit 22 detects depth information for each position (pixel) of the image captured by the camera 4A. In this embodiment, the distance detection unit 22 detects depth information using image data captured by the camera 4A (the image data in which the area detection unit 21 performed the object detection). More specifically, the distance detection unit 22 detects depth information for each position of the image data by inputting the image data captured by the camera 4A into a second AI model that has been machine-learned. In the second AI model, the image data is input as input data, a calculation is performed, and the depth information for each position of the image data is output as output data. The area detection unit 21 detects depth information by acquiring the depth information output from the second AI model.

距離検出部２２は、検出した位置毎の深度情報をマトリクス状にマッピングした距離画像を生成してもよい。この場合例えば、距離検出部２２は、深度の大きさ毎に階調値を異ならせることで、距離画像を生成する。図６の例では、出力された深度情報が、画像データの位置ごとに、深度が遠い方から近い方に順にハッチングが薄くなるように（階調値が高くなるように）設定された距離画像ＰＣが示されている。つまり、図６の中心部分の濃いハッチングの箇所は、車両４からの距離が遠いことを示しており、図６の下端の薄いハッチングの箇所は、車両４からの距離が近いことを示している。このように、距離検出部２２は、画像データの位置ごとに深度情報を推定する。 The distance detection unit 22 may generate a distance image in which the depth information for each detected position is mapped in a matrix. In this case, for example, the distance detection unit 22 generates the distance image by varying the gradation value for each depth magnitude. The example of FIG. 6 shows a distance image PC in which the output depth information is set for each position of the image data so that the hatching becomes lighter (the gradation value becomes higher) from the farthest depth to the closer depth. In other words, the darkly hatched areas in the center of FIG. 6 indicate that the distance from the vehicle 4 is far, and the lightly hatched areas at the bottom of FIG. 6 indicate that the distance from the vehicle 4 is close. In this way, the distance detection unit 22 estimates the depth information for each position of the image data.

なお、距離検出部２２は、第２ＡＩモデルにより、画像データに基づいて深度情報を検出することに限られず、任意の方法で深度情報を検出してよい。例えば、車両４に深度検出センサを設けておき、カメラ４Ａに画像を撮像させつつ、深度検出センサに、カメラ４Ａに撮像された画像に写る範囲の深度情報を検出させてよい。距離検出部２２は、深度検出センサが検出した深度情報を取得してもよい。 The distance detection unit 22 is not limited to detecting depth information based on image data using the second AI model, and may detect depth information in any manner. For example, a depth detection sensor may be provided in the vehicle 4, and while the camera 4A is capturing an image, the depth detection sensor may detect depth information of the range captured in the image captured by the camera 4A. The distance detection unit 22 may acquire the depth information detected by the depth detection sensor.

（対象物の距離の算出）
図７は、領域距離取得部が対象物までの距離の情報を取得した結果の一例を示す図である。領域距離取得部２３は、深度情報に基づき、対象領域（対象物）までの距離の情報を取得する。対象領域までの距離の情報とは、撮像元（車両４）から対象物までの距離を示す。具体的には、領域距離取得部２３は、距離検出部２２により検出された画像データの位置毎の深度情報のうちから、対象領域の位置における深度情報を抽出することで、対象領域（対象物）までの距離の情報を取得する。すなわち、領域距離取得部２３は、画像データの位置毎の深度情報のうちから、領域検出部２１により検出された対象領域の位置と同じ位置における深度情報を、対象領域の距離の情報として取得する。領域距離取得部２３は、対象領域が複数検出されている場合には、対象領域毎に、対象領域までの距離の情報を取得する。 (Calculating the distance to the object)
FIG. 7 is a diagram showing an example of the result of the area distance acquisition unit acquiring information on the distance to the target object. The area distance acquisition unit 23 acquires information on the distance to the target area (target object) based on the depth information. The information on the distance to the target area indicates the distance from the image capture source (vehicle 4) to the target object. Specifically, the area distance acquisition unit 23 acquires information on the distance to the target area (target object) by extracting depth information at the position of the target area from the depth information for each position of the image data detected by the distance detection unit 22. That is, the area distance acquisition unit 23 acquires depth information at the same position as the position of the target area detected by the area detection unit 21 from the depth information for each position of the image data as information on the distance to the target area. When multiple target areas are detected, the area distance acquisition unit 23 acquires information on the distance to the target area for each target area.

領域距離取得部２３は、対象領域以外の領域が表示されず、かつ、対象領域がその対象領域までの距離を示すような表示態様となる、対象画像データを生成してもよい。例えば、領域距離取得部２３は、検出された対象領域の位置における階調値を、距離検出部２２によって設定された距離画像で設定された階調値とし、対象領域以外における階調値を一定（例えばゼロ）とするように、対象画像データを生成してよい。図７の例では、第１対象領域ＣＬ１ａと第２対象領域ＣＬ２ａとが深度毎の階調値を有するような、対象画像Ｐが生成されている。 The area distance acquisition unit 23 may generate target image data in a display mode in which areas other than the target area are not displayed and the target area indicates the distance to the target area. For example, the area distance acquisition unit 23 may generate target image data such that the gradation value at the position of the detected target area is the gradation value set in the distance image set by the distance detection unit 22, and the gradation value outside the target area is constant (for example, zero). In the example of FIG. 7, a target image P is generated in which the first target area CL1a and the second target area CL2a have gradation values for each depth.

このように、本実施形態に係る位置推定装置１０は、画像データから検出された対象領域と、その画像に写る範囲の深度情報とに基づいて、対象領域までの距離の情報を取得する。そのため、撮像した画像に写った、注目している対象物までの距離を、容易にかつ精度よく検出することが可能となる。さらに言えば、本実施形態では、画像データから深度情報を検出するため、深度を検出するためのセンサを別途用いることなく、画像データのみから、注目している対象物の距離を、容易に算出できる。また、センサを別途用いることがないため、装置の小型化、コストダウンになる。また、単眼カメラによる深度情報から距離を算出できるので、ＬｉＤＡＲやステレオカメラを用いた場合と比較すると、汎用性が向上する。 In this way, the position estimation device 10 according to this embodiment acquires information on the distance to the target area based on the target area detected from the image data and the depth information of the range captured in the image. Therefore, it is possible to easily and accurately detect the distance to the target of interest captured in the captured image. Furthermore, in this embodiment, since the depth information is detected from the image data, the distance to the target of interest can be easily calculated from the image data alone without using a separate sensor for detecting depth. In addition, since a separate sensor is not used, the device can be made smaller and less expensive. Furthermore, since the distance can be calculated from the depth information obtained by the monocular camera, versatility is improved compared to the case of using LiDAR or a stereo camera.

（対象領域同士の位置関係の判定）
また、位置推定装置１０は、判定部２４により、対象領域同士の位置関係の判定を行ってよい。以降においては、第１対象領域と第２対象領域との位置関係の判定を例にして説明する。 (Determination of Positional Relationship Between Target Areas)
Furthermore, the position estimation device 10 may determine the positional relationship between the target areas by the determination unit 24. In the following, the determination of the positional relationship between the first target area and the second target area will be described as an example.

判定部２４は、第１対象領域上の位置と前記第２対象領域上の位置とが画像の同一垂線上にあり、かつ、第１対象領域上の位置の深度と第２対象領域上の位置の深度との差が、所定の範囲内に含まれるかを判定する。例えば、判定部２４は、第１対象領域の各位置のうちで、第２対象領域上の位置に対して同一垂線上にある位置を抽出する。ここでの同一垂線上にあるとは、位置同士が、画像の座標系において、鉛直方向に沿って並んでいることを指す。ただし、厳密に鉛直方向から見て重なっていることに限られず、位置同士が鉛直方向に並びつつ、位置同士の水平方向のずれ量が所定値内である場合でも、同一垂線上にあると判断してもよい。 The determination unit 24 determines whether a position in the first target area and a position in the second target area are on the same perpendicular line of the image, and the difference in depth between the position in the first target area and the position in the second target area is within a predetermined range. For example, the determination unit 24 extracts positions in the first target area that are on the same perpendicular line to the positions in the second target area. On the same perpendicular line here refers to positions being lined up along the vertical direction in the coordinate system of the image. However, this is not limited to positions strictly overlapping when viewed vertically, and positions may be determined to be on the same perpendicular line even if they are lined up vertically and the horizontal deviation between the positions is within a predetermined value.

判定部２４は、抽出した第１対象領域上の位置における深度と、その位置と同一垂線上にある第２対象領域上の位置における深度との差が、所定の範囲内にあるかを判定する。判定部２４は、深度の差が所定の範囲内にある場合には、その位置同士が、実世界上で、鉛直方向に沿って並んでいると判断する。一方、判定部２４は、深度の差が所定の範囲内にない場合には、その位置同士が、実世界上で、鉛直方向に沿って並んでいると判断する。判定部２４は、第１対象領域の位置毎に、同様の処理を行う。図７の例では、第１対象領域ＣＬ１ａ上の位置Ａ、Ｂ、Ｃのそれぞれについて、同一垂線上に第２対象領域上の位置があるか、及び、深度の差が所定の範囲内にあるかを判断している例を示している。 The determination unit 24 determines whether the difference between the depth at the extracted position in the first target area and the depth at the position in the second target area on the same perpendicular line as that position is within a predetermined range. If the difference in depth is within the predetermined range, the determination unit 24 determines that the positions are lined up vertically in the real world. On the other hand, if the difference in depth is not within the predetermined range, the determination unit 24 determines that the positions are lined up vertically in the real world. The determination unit 24 performs the same process for each position in the first target area. The example in FIG. 7 shows an example in which it is determined whether there is a position in the second target area on the same perpendicular line for each of positions A, B, and C in the first target area CL1a, and whether the difference in depth is within a predetermined range.

なお、以上の説明では、同一垂線上にある第１対象領域上の位置と第２対象領域上の位置とを抽出してから、深度の差が所定の範囲内にあるかを判断したが、処理の順番を逆にしてもよい。すなわち、判定部２４は、深度の差が所定の範囲内にある、第１対象領域上の位置と第２対象領域上の位置とのペアを抽出して、それらの位置同士が、同一垂線上にあるかを判断してよい。この場合、判定部２４は、深度の差が所定の範囲内にある位置同士が同一垂線上にある場合には、その位置同士が、実世界上で、鉛直方向に沿って並んでいると判断する。一方、判定部２４は、深度の差が所定の範囲内にある位置同士が同一垂線上にない場合には、その位置同士が、実世界上で、鉛直方向に沿って並んでいないと判断する。 In the above description, positions in the first and second target areas that are on the same perpendicular line are extracted, and then it is determined whether the depth difference is within a predetermined range. However, the order of processing may be reversed. That is, the determination unit 24 may extract pairs of positions in the first and second target areas whose depth difference is within a predetermined range, and determine whether these positions are on the same perpendicular line. In this case, if positions whose depth difference is within the predetermined range are on the same perpendicular line, the determination unit 24 determines that the positions are lined up vertically in the real world. On the other hand, if positions whose depth difference is within the predetermined range are not on the same perpendicular line, the determination unit 24 determines that the positions are not lined up vertically in the real world.

出力部２５は、判定部２４による判定結果を出力する。例えば、出力部２５は、同一垂線上にあるとされた位置をその領域に含む第１対象領域と第２対象領域とを示す位置関係情報を、出力してもよい。位置関係情報は、例えば、同一垂線上にあるとされた位置をその領域に含む第１対象領域と第２対象領域とを示す画像データであってもよいし、同一垂線上にあるとされた位置をその領域に含む第１対象領域と第２対象領域との位置（地球座標）を示すデータであってもよい。このように、第１対象領域と第２対象領域とを示す位置関係情報を出力することで、第１対象領域と第２対象領域との位置関係をユーザに認識させることができる。例えば、第１対象領域と第２対象領域との位置関係により、道路の鉛直方向上方に植生があることが認識できるため、例えば道路の上にあり伐採対象となる植生を適切に検出できる。 The output unit 25 outputs the determination result by the determination unit 24. For example, the output unit 25 may output positional relationship information indicating the first target area and the second target area that include the positions that are determined to be on the same perpendicular line. The positional relationship information may be, for example, image data indicating the first target area and the second target area that include the positions that are determined to be on the same perpendicular line, or data indicating the positions (earth coordinates) of the first target area and the second target area that include the positions that are determined to be on the same perpendicular line. In this way, by outputting the positional relationship information indicating the first target area and the second target area, the user can be made to recognize the positional relationship between the first target area and the second target area. For example, the positional relationship between the first target area and the second target area makes it possible to recognize that there is vegetation vertically above the road, and therefore, for example, vegetation that is on the road and is to be cut down can be appropriately detected.

また、判定部２４は、同一垂線上にあるとされた第１対象領域上の位置から、第２対象領域上の位置までの距離を算出してもよい。この場合例えば、判定部２４は、カメラ４Ａの撮像条件（例えば画角など）から算出された、画像の座標系における画素同士の距離と、実際の距離との対応関係を取得する。そして、判定部２４は、画像の座標系における、第１対象領域上の位置から第２対象領域上の位置までの距離と、この対応関係とに基づいて、第１対象領域上の位置から、第２対象領域上の位置までの距離を算出する。判定部は、この距離の情報も位置関係情報として出力してよい。これにより、例えば道路の上にある樹木と道路までの距離が把握できるため、例えば伐採対象となる樹木をより適切に選定できる。すなわち、道路からの距離が所定値以下となる樹木を伐採対象としている場合には、この距離の情報から、伐採対象をより容易に選定できる。 The determination unit 24 may also calculate the distance from a position on the first target area, which is assumed to be on the same perpendicular line, to a position on the second target area. In this case, for example, the determination unit 24 acquires the correspondence between the distance between pixels in the coordinate system of the image, calculated from the imaging conditions of the camera 4A (e.g., the angle of view, etc.), and the actual distance. Then, the determination unit 24 calculates the distance from a position on the first target area to a position on the second target area based on the distance from a position on the first target area to a position on the second target area in the coordinate system of the image and this correspondence. The determination unit may also output this distance information as positional relationship information. This allows, for example, the distance between a tree on a road and the road to be grasped, so that, for example, a tree to be cut down can be more appropriately selected. In other words, when trees whose distance from the road is equal to or less than a predetermined value are to be cut down, the tree to be cut down can be more easily selected from this distance information.

なお、以上の例では、樹木と道路との位置関係の判定を例にしていたが、位置関係の判定対象は、それに限られず任意であってよい。 In the above example, the positional relationship between a tree and a road was determined, but the subject of the positional relationship determination is not limited to this and may be anything.

（処理フロー）
次に、以上説明した位置推定装置１０の処理フローを説明する。図８は、位置推定装置の処理フローを説明するフローチャートである。図８に示すように、位置推定装置１０は、カメラ４Ａが撮像した画像データを取得し（ステップＳ１０）、領域検出部２１により、取得した画像データから、対象領域を検出し（ステップＳ１２）、距離検出部２２により、取得した画像データから、深度情報を検出する（ステップＳ１４）。位置推定装置１０は、領域距離取得部２３により、検出された対象領域の情報と深度情報とから、対象領域までの距離の情報を取得し（ステップＳ１６）、対象領域までの距離の情報に基づき、第１対象領域と第２対象領域との位置関係を判定する（ステップＳ１８）。 (Processing flow)
Next, the process flow of the position estimation device 10 described above will be described. FIG. 8 is a flowchart for explaining the process flow of the position estimation device. As shown in FIG. 8, the position estimation device 10 acquires image data captured by the camera 4A (step S10), detects a target area from the acquired image data by the area detection unit 21 (step S12), and detects depth information from the acquired image data by the distance detection unit 22 (step S14). The position estimation device 10 acquires information on the distance to the target area from the information on the detected target area and the depth information by the area distance acquisition unit 23 (step S16), and determines the positional relationship between the first target area and the second target area based on the information on the distance to the target area (step S18).

以上説明したように、本実施形態に係る位置推定装置１０は、撮像された画像データから、対象物を含む対象領域を検出する領域検出部２１と、撮像された画像に写る範囲の深度情報を検出する距離検出部２２と、深度情報に基づき、前記対象領域までの距離の情報を取得する領域距離取得部２３と、を含む。そのため、本実施形態によると、撮像した画像に写った、注目している対象物までの距離を、容易にかつ精度よく検出することが可能となる。 As described above, the position estimation device 10 according to this embodiment includes an area detection unit 21 that detects a target area including a target object from captured image data, a distance detection unit 22 that detects depth information of the range captured in the captured image, and an area distance acquisition unit 23 that acquires information on the distance to the target area based on the depth information. Therefore, according to this embodiment, it is possible to easily and accurately detect the distance to a target object of interest captured in a captured image.

領域検出部２１は、第１対象物を含む第１対象領域と第２対象物を含む第２対象領域とを検出する。また、位置推定装置１０は、第１対象領域と第２対象領域とが画像の同一垂線上にあり、かつ、深度情報において第１対象領域の深度と第２対象領域の深度との差が所定の範囲内に含まれるかを判定する判定部２４を更に含む。本実施形態によると、第１対象領域と第２対象領域との位置関係を、容易にかつ精度よく検出することが可能となる。さらに言えば、第１対象領域と第２対象領域とが鉛直方向に並んでいるかを、容易にかつ精度よく検出することが可能となる。 The area detection unit 21 detects a first object area including a first object and a second object area including a second object. The position estimation device 10 further includes a determination unit 24 that determines whether the first object area and the second object area are on the same perpendicular line of the image and whether the difference in depth information between the depth of the first object area and the depth of the second object area is within a predetermined range. According to this embodiment, it is possible to easily and accurately detect the positional relationship between the first object area and the second object area. Furthermore, it is possible to easily and accurately detect whether the first object area and the second object area are aligned vertically.

第１対象物は道路であり、第２対象物は植生であってよい。そのため、本実施形態によると、道路の上に樹木などの植生が位置しているかを、容易にかつ精度よく検出することが可能となる。 The first object may be a road, and the second object may be vegetation. Therefore, according to this embodiment, it is possible to easily and accurately detect whether vegetation such as trees is located on the road.

領域検出部２１は、画像データと対象物を含む領域との対応関係を機械学習した第１ＡＩモデルに、画像データを入力することで、対象領域を検出する。距離検出部２２は、画像データと画像の位置毎の深度との対応関係を機械学習した第２ＡＩモデルに、画像データを入力することで、深度情報を検出する。このように、本実施形態によると、ＡＩモデルを用いて、画像データから、対象領域と深度情報を検出する。従って、深度を検出するためのセンサを別途用いることなく、画像データから、注目している対象物の距離を、容易に検出できる。 The area detection unit 21 detects the target area by inputting the image data into a first AI model that has learned the correspondence between image data and areas containing the target object through machine learning. The distance detection unit 22 detects depth information by inputting the image data into a second AI model that has learned the correspondence between image data and the depth for each position in the image through machine learning. In this way, according to this embodiment, the target area and depth information are detected from the image data using an AI model. Therefore, the distance to the target object of interest can be easily detected from the image data without using a separate sensor for detecting depth.

領域距離取得部２３は、対象領域以外の領域が表示されず、かつ、対象領域が、その対象領域までの距離を示すような表示態様となるように、対象画像データを生成する。本実施形態によると、対象画像データを生成することで、対象物までの距離をユーザに適切に認識させることができる。 The area distance acquisition unit 23 generates the target image data so that areas other than the target area are not displayed and the target area is displayed in a manner that indicates the distance to the target area. According to this embodiment, by generating the target image data, it is possible to allow the user to properly recognize the distance to the target object.

以上、本発明の実施形態を説明したが、この実施形態の内容により実施形態が限定されるものではない。また、前述した構成要素には、当業者が容易に想定できるもの、実質的に同一のもの、いわゆる均等の範囲のものが含まれる。さらに、前述した構成要素は適宜組み合わせることが可能である。さらに、前述した実施形態の要旨を逸脱しない範囲で構成要素の種々の省略、置換又は変更を行うことができる。 Although the embodiment of the present invention has been described above, the embodiment is not limited to the contents of this embodiment. The above-mentioned components include those that a person skilled in the art can easily imagine, those that are substantially the same, and those that are within the so-called equivalent range. Furthermore, the above-mentioned components can be combined as appropriate. Furthermore, various omissions, substitutions, or modifications of the components can be made without departing from the spirit of the above-mentioned embodiment.

４車両
４Ａカメラ
１０位置推定装置
２１領域検出部
２２距離検出部
２３領域距離取得部
２４判定部
２５出力部 4 Vehicle 4A Camera 10 Position estimation device 21 Area detection unit 22 Distance detection unit 23 Area distance acquisition unit 24 Determination unit 25 Output unit

Claims

a region detection unit that detects a target region including a target object from the captured image data;
a distance detection unit that detects depth information of a range captured in the captured image;
a region distance acquisition unit that acquires information on a distance to the target region based on the depth information;
Including,
The area detection unit detects a first object area including a first object and a second object area including a second object,
a determination unit that determines whether the first target region and the second target region are on the same perpendicular line of an image and whether a difference in depth between the first target region and the second target region in the depth information is within a predetermined range;
Location estimation device.

The area detection unit detects the object area by inputting the image data into a first AI model that has machine-learned a correspondence relationship between image data and an area including an object.
The position estimation device according to claim 1 .

The distance detection unit detects the depth information by inputting the image data to a second AI model that has machine-learned a correspondence between image data and a depth for each position in the image.
The position estimation device according to claim 1 or 2.

detecting a target area including a target object from the captured image data;
Detecting depth information of a range captured in the captured image;
obtaining information on a distance to the target region based on the depth information;
Including,
In the step of detecting the object region, a first object region including a first object and a second object region including a second object are detected,
The method further includes a step of determining whether the first target region and the second target region are on the same perpendicular line of an image, and whether a difference in depth between the first target region and the second target region in the depth information is within a predetermined range.
Location estimation methods.

detecting a target area including a target object from the captured image data;
Detecting depth information of a range captured in the captured image;
obtaining information on a distance to the target region based on the depth information;
Run the following on your computer:
In the step of detecting the object region, a first object region including a first object and a second object region including a second object are detected,
The method further includes a step of determining whether the first target region and the second target region are on the same perpendicular line of an image, and whether a difference in depth between the first target region and the second target region in the depth information is within a predetermined range.
program.