JP7708017B2

JP7708017B2 - Image Processing Device

Info

Publication number: JP7708017B2
Application number: JP2022106560A
Authority: JP
Inventors: 裕太椿; 智裕松田
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2022-06-30
Filing date: 2022-06-30
Publication date: 2025-07-15
Anticipated expiration: 2042-06-30
Also published as: JP2024006025A

Description

本発明は、複数の画像を合成する画像処理装置に関する。 The present invention relates to an image processing device that synthesizes multiple images.

物流効率改善のため、トラック等の荷台に積載された荷物の積載率を自動で算出することが求められている。積載率はデータベースから予測ができるが、積載時の隙間等により誤差が生じる。また、荷物は多数存在するため、ＲＦＩＤによる管理も困難である。そのため、撮像した画像から積載率を自動で算出することが望ましいが、撮像者と荷台の距離が近いとき、荷台全体を撮像できないため、分割して撮像した画像を結合・合成する必要がある。 To improve logistics efficiency, there is a need to automatically calculate the loading rate of luggage loaded on the bed of a truck or other vehicle. The loading rate can be predicted from a database, but errors occur due to gaps during loading. In addition, since there are many luggage, it is difficult to manage them using RFID. Therefore, it is desirable to automatically calculate the loading rate from captured images, but when the distance between the photographer and the bed is short, it is not possible to capture an image of the entire bed, so it is necessary to combine and synthesize the captured images.

上記の画像合成技術に関し、特許文献１には、合成する画像間のオーバーラップ位置（対応点）を検出し、一方の画像のエッジが所定値以上の強いポイントについて、他方の画像と最も一致する点を見つけ出し、合成する技術が開示されている。 Regarding the above image synthesis technology, Patent Document 1 discloses a technology that detects overlap positions (corresponding points) between the images to be synthesized, finds the points in one image where the edge is stronger than a predetermined value and most similar to the other image, and synthesizes them.

特開平９－９１４０７号公報Japanese Patent Application Publication No. 9-91407

特許文献１の技術を、例えばトラックの荷台上に積載された荷物の積載率を算出するために用いようとするとき、荷台に同じ種類や同程度の寸法の荷物が複数積載されている場合、画像の類似点が複数存在し、画像間の一致点を正確に検出することができず、画像合成ができない。 When trying to use the technology of Patent Document 1 to calculate the loading rate of luggage loaded on the bed of a truck, for example, if multiple luggage of the same type or similar dimensions are loaded on the bed, there will be multiple similarities between the images, making it impossible to accurately detect matching points between the images and making it impossible to synthesize the images.

本発明は、上記課題を解決するためになされたものであり、複数の荷物を撮像して得られた複数の画像間に同じ種類や同程度の寸法の荷物が複数存在する場合であっても、正確に画像合成することが可能な画像処理装置を提供することを目的とする。 The present invention has been made to solve the above problems, and aims to provide an image processing device that can accurately synthesize images even when multiple images obtained by capturing images of multiple packages contain multiple packages of the same type or similar dimensions.

上記課題を解決するために、本発明の画像処理装置は、以下の構成を有する。また、それぞれの構成によって発揮される効果についても併せて説明する。 To solve the above problems, the image processing device of the present invention has the following configuration. The effects achieved by each configuration will also be described.

本発明に係る画像処理装置は、複数の荷物を撮像して得られた複数の画像を合成する画像処理装置であって、複数の画像を取得する画像取得部と、複数の画像から、複数の荷物それぞれの種類および位置座標を検出する荷物検出部と、荷物の位置座標から、荷物の寸法を算出する荷物寸法算出部と、検出された荷物の種類及び寸法に基づいて、複数の画像間に共通する荷物が含まれているか否か判定する荷物判定部と、荷物判定部が複数の画像間に共通する荷物が含まれていると判定した場合に、該共通する荷物の部分を重ね合わせることで該複数の画像を合成する画像合成部と、を備える画像処理装置である。 The image processing device according to the present invention is an image processing device that synthesizes multiple images obtained by capturing images of multiple pieces of luggage, and includes an image acquisition unit that acquires multiple images, a luggage detection unit that detects the type and position coordinates of each of the multiple pieces of luggage from the multiple images, a luggage dimension calculation unit that calculates the dimensions of the luggage from the position coordinates of the luggage, a luggage determination unit that determines whether or not the multiple images contain luggage that is common to both images based on the detected luggage type and dimensions, and an image synthesis unit that synthesizes the multiple images by superimposing parts of the luggage that are common to both images when the luggage determination unit determines that the multiple images contain luggage that is common to both images.

本発明によれば、従来技術のように画像間のオーバーラップ位置の検出を利用するのではなく、荷物の種類及び位置座標に基づいて、画像の重なり具合を認識することによって画像間に共通の荷物があるか否か判定するため、複数の画像間に同じ種類や同程度の寸法の荷物が複数存在する場合であっても、正確に画像合成することが可能になる。 In accordance with the present invention, instead of detecting overlapping positions between images as in conventional technology, the present invention determines whether or not there is a common piece of luggage between images by recognizing the degree of overlap between images based on the type of luggage and its position coordinates, making it possible to accurately synthesize images even when multiple images contain multiple pieces of luggage of the same type or similar dimensions.

より好ましい画像処理装置としては、荷物判定部は、複数の画像間に共通する荷物が含まれていると判定した場合に、さらに該共通する荷物に接する他の荷物が該複数の画像間で共通しているか否か判定する。 In a more preferred image processing device, when the luggage determination unit determines that a common piece of luggage is included among the multiple images, it further determines whether other luggage adjacent to the common piece of luggage is common among the multiple images.

この構成によれば、画像間に共通する荷物が複数あることを確認することが可能になり、画像間の共通性をより正確に算出することが可能になる。 This configuration makes it possible to confirm that there are multiple pieces of luggage that are common between the images, and allows for a more accurate calculation of the commonality between the images.

より好ましい画像処理装置としては、荷物検出部は、荷物が縦方向に複数積層された荷物列毎に種類及び位置座標を検出し、荷物判定部は、荷物列毎に、複数の画像間に共通する荷物であるか否か判定し、複数の画像間に共通する荷物が含まれていると判定した場合に、さらに該共通する荷物の上下のいずれかに接する他の荷物が該複数の画像間で共通しているか否か判定する。 In a more preferred image processing device, the luggage detection unit detects the type and position coordinates for each row of luggage in which multiple pieces of luggage are stacked vertically, and the luggage determination unit determines for each row of luggage whether the luggage is common to the multiple images, and if it determines that a piece of luggage is common to the multiple images, it further determines whether other luggage adjacent to the top or bottom of the common luggage is common to the multiple images.

トラック等の自動車両の荷台に積載された荷物は、縦方向に複数積層された荷物列が荷台の幅方向に沿って載置されている場合が多く、上記構成を採用することによって、一定の規則に従って荷物判定を行うことになるため、画像処理装置のユーザに対して簡便性を提供することが可能になる。 Luggage loaded onto the bed of a truck or other motor vehicle is often stacked vertically in rows aligned across the width of the bed, and by adopting the above configuration, luggage determination is performed according to certain rules, providing convenience to users of the image processing device.

より好ましい画像処理装置としては、荷物判定部は、複数の画像間において、種類が一致し、かつ寸法の差分が所定の閾値以内にある荷物を共通の荷物として判定する。 In a more preferred image processing device, the luggage determination unit determines that luggage whose type matches between multiple images and whose dimensional differences are within a predetermined threshold are the same luggage.

この構成によれば、複数の画像間を得る際に、画像間において撮像装置と撮像対象との距離や角度が多少異なるような場合であっても、その差を補正し、正確に画像合成することが可能になる。 With this configuration, when obtaining multiple images, even if the distance or angle between the imaging device and the subject differs slightly between the images, it is possible to correct the difference and accurately synthesize the images.

より好ましい画像処理装置としては、荷物検出部は、複数の荷物を撮像した画像を教師データとして、該画像に含まれる荷物の抽出方法とその荷物の種類の特定方法を機械学習したものであり、荷物検出部は、画像取得部で取得した複数の画像から、複数の荷物それぞれの種類および位置座標を検出する。 In a more preferred image processing device, the luggage detection unit uses images of multiple pieces of luggage as training data and machine-learns a method for extracting the luggage contained in the images and a method for identifying the type of luggage, and the luggage detection unit detects the type and position coordinates of each of the multiple pieces of luggage from the multiple images acquired by the image acquisition unit.

この構成によれば、予め多数の荷物を例えばニューラルネットワークを用いて学習させたＡＩモデルを利活用して荷物を検出することが可能になり、検出精度の大幅な向上が期待できる。 This configuration makes it possible to detect luggage by utilizing an AI model that has been trained on a large number of packages in advance using, for example, a neural network, and is expected to significantly improve detection accuracy.

より好ましい画像処理装置としては、複数の荷物は、自動車両の荷台に積載された複数の荷物であり、複数の画像は、荷台を自動車両の前後方向に分割した複数の画像である。 In a more preferred image processing device, the multiple pieces of luggage are multiple pieces of luggage loaded on the bed of a motor vehicle, and the multiple images are multiple images of the bed divided in the front-rear direction of the motor vehicle.

物流を担うトラックの荷台には、同じ種類や同程度の寸法の荷物が多数積載される場合が多く、従って、本発明はこのような構成の対象に対して好適に採用することができる。 Trucks used in logistics are often loaded with a large number of items of the same type and similar dimensions on their loading platforms, and therefore the present invention can be suitably applied to such configurations.

本発明によれば、複数の荷物を撮像して得られた複数の画像間に同じ種類や同程度の寸法の荷物が複数存在する場合であっても、正確に画像合成することが可能になる。 According to the present invention, accurate image synthesis is possible even when there are multiple pieces of luggage of the same type or similar dimensions among the multiple images obtained by capturing multiple pieces of luggage.

本発明の一実施例に係る画像処理装置のハードウェア構成を示すブロック図である。1 is a block diagram showing a hardware configuration of an image processing apparatus according to an embodiment of the present invention; 画像処理装置の機能構成を示すブロック図である。FIG. 2 is a block diagram showing a functional configuration of the image processing device. 画像処理装置が得た複数の画像に対して、荷物毎に種類及び位置座標を検出する処理を示す図である。FIG. 13 is a diagram showing a process of detecting the type and position coordinates of each piece of luggage from a plurality of images obtained by an image processing device. 検出した荷物情報を、位置座標に基づいてソートした状態を示す図表である。11 is a table showing detected package information sorted based on location coordinates. 複数の画像間に共通する荷物が含まれているか否か判定する処理、及び画像合成を行う処理を示す図である。13A and 13B are diagrams illustrating a process of determining whether or not a common package is included in a plurality of images, and a process of performing image synthesis. 画像処理装置が実行する処理を示すフローチャートである。4 is a flowchart showing a process executed by the image processing device.

以下、本発明の実施例について図面を用いて説明する。なお、本実施例においては、複数の荷物が積載されたトラック（自動車両）の荷台部分を撮像対象とし、撮像の結果得られる複数の画像は、トラックの荷台をトラックの前後方向に分割した画像である例を説明する。 The following describes an embodiment of the present invention with reference to the drawings. In this embodiment, the bed of a truck (motor vehicle) carrying multiple packages is the subject of imaging, and the multiple images obtained as a result of imaging are images of the bed of the truck divided in the front-rear direction of the truck.

図１は、本発明の一実施例に係る画像処理装置１のハードウェア構成を示すブロック図である。画像処理装置１は、データの読み出しが可能な記憶装置であるＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）１１、データの書き込みと読み出しの両方が可能であるＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）１２、データを基に車両制御に必要なパラメータの演算を行うＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）１３、及びデータの送受信を行うインターフェースである通信モジュール１４を有している。 Figure 1 is a block diagram showing the hardware configuration of an image processing device 1 according to one embodiment of the present invention. The image processing device 1 has a ROM (Read Only Memory) 11, which is a storage device capable of reading data, a RAM (Random Access Memory) 12, which is capable of both writing and reading data, a CPU (Central Processing Unit) 13, which calculates parameters required for vehicle control based on the data, and a communication module 14, which is an interface for sending and receiving data.

画像処理装置１は、通信路２を介して撮像装置３及び外部サーバ４と接続されている。通信路２は、Ｗｉ－Ｆｉ（登録商標）等の無線ネットワークから構成され得る。画像処理装置１は、撮像装置３からは、撮像対象を撮像して得られた複数の画像を受信する。また、外部サーバ４との間においては、演算結果や、後述する機械学習用の教師データを含む各種情報の送受信を行う。外部サーバ４は、ストレージを備えたコンピュータとしてコントロールセンターに設置されていてもよいし、クラウド上に仮想マシンとして実装されていてもよい。 The image processing device 1 is connected to an imaging device 3 and an external server 4 via a communication path 2. The communication path 2 may be configured from a wireless network such as Wi-Fi (registered trademark). The image processing device 1 receives a plurality of images obtained by capturing an image of an object from the imaging device 3. In addition, the image processing device 1 transmits and receives various information including calculation results and training data for machine learning, which will be described later, between the image processing device 1 and the external server 4. The external server 4 may be installed in a control center as a computer with storage, or may be implemented as a virtual machine on the cloud.

画像処理装置１は通信路２を介したデータを受信する。すると、受信に応じて、受信データがＲＡＭ１２に展開される。そして、書き込まれたＲＡＭ１２のデータとＲＯＭ１１の読み出しデータを基に、ＣＰＵ１３が演算を行う。また、画像処理装置１には予めプログラムが組み込まれており、ＣＰＵ１３が当該プログラムを実行することによって以下に説明する処理を実行することが可能になる。以下では、このようなコンピュータ分野の周知技術を適宜省略しながら、各部の詳細を説明する。 The image processing device 1 receives data via the communication path 2. In response to the reception, the received data is expanded in the RAM 12. The CPU 13 then performs calculations based on the data written in the RAM 12 and the data read from the ROM 11. A program is also pre-installed in the image processing device 1, and the CPU 13 executes the program to perform the processing described below. Below, the details of each part will be described, omitting such well-known techniques in the computer field as appropriate.

次に、画像処理装置１の機能構成について図２の機能ブロック図を用いて説明する。図２に示すように、画像処理装置１は、画像取得部１５、荷物検出部１６、荷物寸法算出部１７、荷物判定部１８、画像合成部１９、及び出力部２０を備える。また、画像処理装置１の記憶領域には荷物情報データベース（ＤＢ）２１が格納されている。 Next, the functional configuration of the image processing device 1 will be described with reference to the functional block diagram of FIG. 2. As shown in FIG. 2, the image processing device 1 includes an image acquisition unit 15, a baggage detection unit 16, a baggage size calculation unit 17, a baggage determination unit 18, an image synthesis unit 19, and an output unit 20. In addition, a baggage information database (DB) 21 is stored in the memory area of the image processing device 1.

図２に示す機能ブロック図は例示であり、機能の単位および名称はこれに限らない。たとえば、画像取得部１５と出力部２０をまとめてデータ送受信部として扱ってもよい。 The functional block diagram shown in FIG. 2 is an example, and the functional units and names are not limited to this. For example, the image acquisition unit 15 and the output unit 20 may be collectively treated as a data transmission/reception unit.

画像取得部１５は、撮像装置３が撮像した複数の画像を、通信路２を介して取得する。荷物検出部１６は、画像取得部１５が取得した複数の画像を画像毎に解析し、画像内に含まれる荷物の種類及び位置座標を検出する。荷物寸法算出部１７は、荷物検出部１６が検出した荷物それぞれの位置座標から、各荷物の寸法を算出する。荷物判定部１８は、荷物検出部１６が検出した各荷物の種類及び荷物寸法算出部１７が算出した各荷物の寸法に基づいて、複数の画像間に共通する荷物が含まれているか否か判定する。画像合成部１９は、荷物判定部１８によって、共通の荷物が含まれていると判定された複数の画像を合成する。出力部２０は、合成された画像データや、画像に含まれていた荷物情報、各機能部によって演算された結果等を外部サーバ４に出力する。 The image acquisition unit 15 acquires multiple images captured by the imaging device 3 via the communication path 2. The luggage detection unit 16 analyzes the multiple images acquired by the image acquisition unit 15 for each image and detects the type and position coordinates of the luggage contained in the image. The luggage dimension calculation unit 17 calculates the dimensions of each luggage from the position coordinates of each luggage detected by the luggage detection unit 16. The luggage determination unit 18 determines whether or not a common luggage is included among the multiple images based on the type of each luggage detected by the luggage detection unit 16 and the dimensions of each luggage calculated by the luggage dimension calculation unit 17. The image synthesis unit 19 synthesizes multiple images that are determined by the luggage determination unit 18 to include a common luggage. The output unit 20 outputs the synthesized image data, luggage information included in the images, results calculated by each functional unit, etc. to the external server 4.

荷物情報ＤＢ２１には、複数の荷物を撮像した画像を教師データとして、該画像に含まれる荷物の寸法とその荷物の種類の検出結果を予め機械学習して得られた荷物に関する各種データが格納されている。また、本実施例においては、後述するように荷物検出部１６は、ＡＩモデル（学習モデル）を構成している。ＡＩモデルは、撮像対象になり得る荷物を撮像した画像データを教師データとして、予め、画像に含まれる荷物の抽出方法（荷物の認識及び荷物の画像上における座標の検出処理）と、抽出した荷物の種類の特定処理を機械学習したモデルである。具体的には、該荷物毎の寸法または形状などの特徴量が抽出されるように、教師データを用いて、荷物の特徴量を機械学習する。これにより、画像に含まれている荷物を高精度で検出することが可能になっている。 The luggage information DB 21 stores various data related to luggage obtained by machine learning in advance using images of multiple luggage as training data to detect the dimensions of the luggage contained in the images and the type of luggage. In this embodiment, the luggage detection unit 16 configures an AI model (learning model) as described below. The AI model is a model that uses image data of luggage that can be the subject of an image as training data to machine learn a method for extracting luggage contained in an image (recognizing the luggage and detecting the coordinates of the luggage on the image) and a process for identifying the type of the extracted luggage. Specifically, the training data is used to machine learn the features of the luggage so that features such as the dimensions or shape of each luggage can be extracted. This makes it possible to detect luggage contained in an image with high accuracy.

次に、画像処理装置が得た複数の画像に対して、荷物毎に種類及び位置座標を検出する処理について、図３を用いて説明する。なお、本実施例においては、２枚の画像間に共通する荷物が含まれており、その２枚の画像を合成して１枚の画像を生成する処理について説明するが、合成する画像の枚数及び生成される画像の枚数について本実施例に限られないことは言うまでもない。 Next, the process of detecting the type and location coordinates of each piece of luggage from multiple images obtained by the image processing device will be described with reference to FIG. 3. Note that in this embodiment, luggage is common to two images, and the process of synthesizing the two images to generate a single image will be described, but it goes without saying that the number of images to be synthesized and the number of images to be generated are not limited to this embodiment.

画像取得部１５は、図３（ａ）に示す第１画像及び図３（ｂ）に示す第２画像を撮像装置３から取得する。上述の通り本実施例において取得する画像は複数の荷物が積載されたトラックの荷台の一部であり、図３（ａ）に示す第１画像は、荷物１００ａが縦方向に複数積層されてなる荷物列（Ｌ_１、Ｌ_２、Ｌ_３・・・）が荷台の先頭から後方に沿って積載された状態を撮像して得られた画像であり、図３（ｂ）に示す第２画像は、任意の範囲の荷物列（Ｌ_ｋ、Ｌ_ｋ＋１、Ｌ_ｋ＋２・・・）が荷台の途中から後方に沿って積載された状態を撮像して得られた画像である。従って、図３（ｂ）に示す第２画像は図３（ａ）に示す第１画像よりも後方の荷台部分を撮像したものであり、ゆえに、第１画像と第２画像とが合成される場合には第１画像の右端を含む部分と第２画像の左端を含む部分とが共通する部分として重ね合わせられて合成されることになる。 The image acquisition unit 15 acquires the first image shown in Fig. 3(a) and the second image shown in Fig. 3(b) from the imaging device 3. As described above, the image acquired in this embodiment is a part of the bed of a truck on which a plurality of luggage items are loaded, and the first image shown in Fig. 3(a) is an image obtained by imaging a state in which a row of luggage items (L ₁ , L ₂ , L ₃ . . . ) consisting of a plurality of luggage items 100a stacked vertically is loaded from the front of the bed to the rear, and the second image shown in Fig. 3(b) is an image obtained by imaging a state in which a row of luggage items (L _k , L _k+1 , L _k+2 . . . ) of an arbitrary range is loaded from the middle of the bed to the rear. Therefore, the second image shown in Fig. 3(b) is an image obtained by imaging a part of the bed that is further rearward than the first image shown in Fig. 3(a), and therefore, when the first image and the second image are combined, a part including the right end of the first image and a part including the left end of the second image are superimposed as a common part and combined.

ここで、本実施例における荷物１００ａは、フォークリフトのフォークを差し入れて運搬するためのスキッド部１０２ａと、スキッド部１０２ａに載置される荷部１０１ａとから構成されるものと定義される。しかしながら、トラックの荷台に積載され得る荷物に限らず、荷物として定義され得る対象は限りなく存在することは言うまでもなく、また、荷部１０１ａとスキッド部１０２ａとを区別してそれぞれを荷物として定義してもよい。このように、本発明の適用範囲としての荷物の定義は任意に決定できる。 In this embodiment, the cargo 100a is defined as consisting of a skid portion 102a for inserting the forks of a forklift to transport it, and a load portion 101a that is placed on the skid portion 102a. However, it goes without saying that there are an infinite number of objects that can be defined as cargo, not limited to cargo that can be loaded onto the bed of a truck, and the load portion 101a and the skid portion 102a may be distinguished and each defined as cargo. In this way, the definition of cargo within the scope of application of this invention can be determined arbitrarily.

画像取得部１５は、これらの取得した画像データを荷物検出部１６へ送信する。荷物検出部１６は、上述の通り荷物検出のためのＡＩモデルを構成するものであり、画像取得部１５から受信した画像データを入力として、画像に含まれる荷物の数、種類、位置座標等を出力する。具体的には、学習されたＡＩモデルを用いて、画像に含まれる物体の特徴をピクセル単位で抽出し、色彩、明るさ、歪み、ノイズ、範囲の区別などを自動処理して何が写っているのかを可視化して分類する「画像分類処理」、自動で被写体を認識し、画像中の物体の存在範囲を検出する「画像検出処理」、及び画像領域内に何があるのかを識別するために被写体の境界線を明確に区分けする「画像セグメンテーション処理」を各画像に対して実行する。 The image acquisition unit 15 transmits the acquired image data to the luggage detection unit 16. As described above, the luggage detection unit 16 constitutes an AI model for luggage detection, and receives image data from the image acquisition unit 15 as input, and outputs the number, type, and position coordinates of luggage contained in the image. Specifically, the learned AI model is used to extract the characteristics of objects contained in the image on a pixel-by-pixel basis, and automatically processes color, brightness, distortion, noise, range distinctions, etc. to visualize and classify what is in the image, perform an "image classification process" that automatically recognizes the subject and detects the range in which the object exists in the image, an "image detection process" that automatically recognizes the subject and detects the range in which the object exists in the image, and an "image segmentation process" that clearly distinguishes the boundary line of the subject to identify what is within the image area.

例えば図３（ａ）のスキッド部１０２ａを例にとると、荷物検出部１６に、スキッド部１０２ａの「縦方向と横方向の寸法比率」、「フォーク差し込み用孔の位置及び全体に対する面積比率」、「構成材料の明度、色彩、表面の凹凸、透明度等」等の特徴量を荷物の画像の教師データとして、画像に含まれる荷物の抽出処理を予め学習させたＡＩモデル（学習モデル）を構築しておく。構築したＡＩモデルに、荷物の検出対象となる画像を入力すると、ＡＩモデルは、画像の中から、荷物の特徴量を抽出し、この抽出した特徴量に基づいて、符号１０２ａが付された物体がスキッド部であると検出することができる。これにより、荷物検出部１６（ＡＩモデル）は、画像に含まれるスキッド部の検出により、画像内における荷物１００ａの位置座標を検出し、荷物１００ａの種類を特定することが可能になる。スキッド部１０２ａ上に載置された荷部１０１ａについても同様であり、当然これら以外の物体についても予め教師データを与えて学習させておくことで高精度に検出できる。 For example, taking the skid section 102a in FIG. 3(a) as an example, an AI model (learning model) is constructed in the luggage detection unit 16, which is trained in the extraction process of luggage contained in an image using features such as the "ratio of vertical and horizontal dimensions," "position of holes for fork insertion and area ratio to the whole," "brightness, color, surface unevenness, transparency, etc. of the constituent materials" as training data of the luggage image. When an image to be detected for luggage is input to the constructed AI model, the AI model extracts the features of the luggage from the image, and based on the extracted features, it can detect that the object marked with the symbol 102a is a skid section. This allows the luggage detection unit 16 (AI model) to detect the position coordinates of the luggage 100a in the image by detecting the skid section included in the image, and to identify the type of luggage 100a. The same applies to the load section 101a placed on the skid section 102a, and of course objects other than these can also be detected with high accuracy by providing training data in advance and training the model.

荷物検出部１６は、上記のような画像解析の結果各荷物の種類を検出する。そして、図３（ｂ）に示すように各荷物に対して、荷物毎に区別した荷物セグメンテーションを付与する（荷物１００ａに対して荷物セグメンテーション２００ａ、荷物１００ｂに対して荷物セグメンテーション２００ｂ）。また、上記の一連の処理中、画像をピクセル解析し、荷物セグメンテーション毎の位置座標も算出しておく。位置座標の原点は例えば各画像の最も左上の点である。 The luggage detection unit 16 detects the type of each piece of luggage as a result of the image analysis described above. Then, as shown in FIG. 3(b), a luggage segmentation that distinguishes each piece of luggage is assigned to each piece of luggage (luggage segmentation 200a for luggage 100a, luggage segmentation 200b for luggage 100b). During the above series of processes, the image is subjected to pixel analysis, and the position coordinates for each piece of luggage segmentation are also calculated. The origin of the position coordinates is, for example, the top left point of each image.

以上の処理によって、荷物検出部１６は各荷物の種類及び位置座標を検出し、検出結果を荷物寸法算出部１７へと送信する。荷物寸法算出部１７は受信した検出結果に基づいて次のような処理を実行する。 By the above process, the luggage detection unit 16 detects the type and location coordinates of each piece of luggage and transmits the detection results to the luggage dimension calculation unit 17. The luggage dimension calculation unit 17 performs the following process based on the received detection results.

図４（ａ）は、荷物検出部１６が検出した荷物情報の一覧であり、検出した荷物の荷物ｉｄ、荷物の種類（ｌａｂｅｌ）、検出の確度（ａｃｃｕｒａｃｙ）、及び四辺の座標（ｌｅｆｔ、ｔｏｐ、ｒｉｇｈｔ、ｂｏｔｔｏｍ）を含む。図４（ａ）は、荷物検出部１６が例えば各荷物を認識した際に付与するｉｄに沿ってデータが羅列されているものであり、このままデータを扱うことは困難である。従って、荷物寸法算出部１７は、まずデータを扱いやすくするために図４（ａ）のデータを整理する。 Figure 4(a) is a list of luggage information detected by the luggage detection unit 16, including the luggage ID of the detected luggage, the type of luggage (label), the detection accuracy (accuracy), and the coordinates of the four sides (left, top, right, bottom). In Figure 4(a), the data is listed according to the ID that the luggage detection unit 16 assigns when it recognizes each piece of luggage, for example, and it is difficult to handle the data as is. Therefore, the luggage dimension calculation unit 17 first organizes the data in Figure 4(a) to make it easier to handle.

具体的には、例えば各荷物の左辺の座標をソートする。すると、図４（ｂ）に示すように、図表上から、左辺の座標が共通する荷物が表示されることになる。この例からは、ｉｄ：２１、１９、３及び１が付与された荷物は撮像された画像中左端の荷物列であり、ｉｄ：１のｓｋｉｄが荷台上の底面上に直接載置され、その上にｉｄ：１９のｐｏｒｉが載置され、その上にｉｄ：３のｓｋｉｄが載置され、その上にｉｄ：２１のｐｏｒｉが載置されていることがわかる（座標原点が画像左上である点に注意）。 Specifically, for example, the coordinates of the left side of each piece of luggage are sorted. Then, as shown in FIG. 4(b), luggage with the same coordinates on the left side are displayed from the top of the chart. From this example, it can be seen that the luggage with id:21, 19, 3, and 1 are in the row of luggage at the left end of the captured image, and that the skid with id:1 is placed directly on the bottom surface of the loading platform, on top of which is the poli with id:19, on top of which is the skid with id:3, on top of which is the poli with id:21 (note that the coordinate origin is the top left of the image).

このようにして、荷物寸法算出部１７は荷物情報をソートして荷物列毎のデータとして扱えるように整理する。そして、荷物列毎に、位置座標の差分を求めることによって荷物毎の寸法（高さ及び幅）を算出する。このようにして算出された荷物の寸法情報は、図４（ｂ）の荷物情報にさらに付加され、荷物判定部１８へと送信される。 In this way, the luggage dimension calculation unit 17 sorts the luggage information and organizes it so that it can be handled as data for each luggage row. Then, for each luggage row, the dimensions (height and width) of each luggage are calculated by finding the difference in position coordinates. The luggage dimension information calculated in this way is further added to the luggage information in Figure 4 (b) and transmitted to the luggage determination unit 18.

なお、図４（ｂ）の表の、左辺座標が「９１１」である一連のデータを参照すると、検出されている６個の荷物のうち５個の右辺座標が「１０２４」であり、残りの１個も「１０２１」である。また、この座標差から求められる荷物幅も「１１３」、及び「１１０」と他の荷物に比べて極端に小さい。このように、算出された荷物幅が他の荷物の幅と比較して極端に小さいような場合には、その荷物列はその画像から見切れているものとして、荷物判定部１８による判定対象から除外することもできる。この場合の除外基準については、検出された荷物の横幅の平均値に対して例えば５０％以下とする等予め設定してもよいし、状況に応じてリアルタイムに変更してもよい。 In addition, referring to the series of data with the left side coordinate "911" in the table of FIG. 4(b), the right side coordinate of five of the six detected pieces of luggage is "1024", and the remaining one is also "1021". The luggage width calculated from this coordinate difference is also extremely small compared to the other pieces of luggage, at "113" and "110". In this way, when the calculated luggage width is extremely small compared to the width of the other pieces of luggage, the luggage row can be excluded from the judgment by the luggage judgment unit 18 as it is cut off from the image. The exclusion criteria in this case may be preset, for example, to 50% or less of the average width of the detected luggage, or may be changed in real time depending on the situation.

次に、荷物寸法算出部１７から荷物情報を受信した荷物判定部１８が、複数の画像間に共通する荷物が含まれるか否か判定する処理、及び荷物判定部１８が複数の画像間に共通する荷物が含まれると判定した場合に、画像合成部１９が画像合成する処理について図５を用いて説明する。 Next, the process in which the luggage determination unit 18, which has received luggage information from the luggage dimension calculation unit 17, determines whether or not a common piece of luggage is included among the multiple images, and the process in which the image synthesis unit 19 synthesizes the images when the luggage determination unit 18 determines that a common piece of luggage is included among the multiple images, will be described with reference to FIG. 5.

まず荷物判定部１８は、荷物寸法算出部１７から受信した荷物情報から、図５（ａ）に示す第１画像が図５（ｂ）に示す第２画像よりも荷台左方の部分を撮像して得られた画像であることを認識し、第１画像を左側に、第２画像を右側に配置する。また、荷物寸法算出部１７による処理の結果、第１画像内の判定対象荷物列はＬ_１、Ｌ_２、Ｌ_３であると設定する。 First, the luggage determination unit 18 recognizes from the luggage information received from the luggage dimension calculation unit 17 that the first image shown in Fig. 5(a) is an image obtained by capturing an image of a portion of the platform to the left of the second image shown in Fig. 5(b), and arranges the first image on the left and the second image on the right. In addition, as a result of processing by the luggage dimension calculation unit 17, the luggage sequence to be determined in the first image is set to _L1 , _L2 , and _L3 .

ここで、本実施例における荷物判定部１８は、各荷物列に含まれる全ての荷物を荷物列セグメンテーションとして認識するように設定されている（荷物列Ｌ_３に含まれる荷物を荷物列セグメンテーション３００ａとして認識する）。そして、荷物判定部１８は、荷物列セグメンテーション毎に、画像間に共通する荷物が含まれているか否か判定する処理を実行する。 Here, the baggage determination unit 18 in this embodiment is set to recognize all the bags included in each baggage row as a baggage row segmentation (the baggage included in baggage row _L3 is recognized as baggage row segmentation 300a). Then, the baggage determination unit 18 executes a process of determining whether or not there is a baggage common to the images for each baggage row segmentation.

まず荷物判定部１８は、第１画像の、判定対象である荷物列のうち最も右側の荷物列セグメンテーション３００ａに含まれる荷物１１０ａ～１４０ａの種類及び寸法を荷物情報から抽出する。そして、まずこの荷物列セグメンテーション３００ａの最上段に載置されている荷物１１０ａが第２画像に含まれているか否かの判定を開始する。以下では、荷物１１０ａの種類及び寸法を基準荷物情報と呼ぶ。 First, the luggage determination unit 18 extracts from the luggage information the type and dimensions of the luggage 110a-140a included in the rightmost luggage row segmentation 300a of the luggage row to be determined in the first image. Then, it starts to determine whether the luggage 110a placed on the topmost level of this luggage row segmentation 300a is included in the second image. Hereinafter, the type and dimensions of the luggage 110a are referred to as reference luggage information.

上記の基準荷物情報に基づき、第２画像の、判定対象である荷物列のうち最も左側の荷物列セグメンテーションのさらに最も上段に載置されている荷物１００ｂの種類及び寸法を検出する。検出した荷物の種類または寸法のいずれか一方でも基準荷物情報と一致しない場合には、判定した荷物１００ｂより一段下に載置されている荷物の種類及び寸法を検出し、同様の判定を行う。荷物列セグメンテーションの一番下段に載置されている荷物まで判定した場合には、その荷物列セグメンテーションより１列右側の荷物列セグメンテーションの最上段に載置された荷物に移動して、判定を続ける。なお、画像間における、荷物の寸法の比較においては、各画像を撮像した際の撮像装置と撮像対象領域との間の距離や光軸角の相違等によるずれを補正するために各画像をキャリブレーションしてもよい。また、寸法が一致すると判定できる一定の閾値を定めても良い。 Based on the above reference luggage information, the type and dimensions of the luggage 100b placed on the topmost level of the leftmost luggage row segmentation in the luggage row to be judged in the second image are detected. If either the type or dimensions of the detected luggage do not match the reference luggage information, the type and dimensions of the luggage placed one level below the judged luggage 100b are detected and a similar judgment is made. If the luggage placed on the bottom level of the luggage row segmentation has been judged, the judgment is continued by moving to the luggage placed on the topmost level of the luggage row segmentation one row to the right of the luggage row segmentation. Note that when comparing the luggage dimensions between images, each image may be calibrated to correct deviations due to differences in the distance and optical axis angle between the imaging device and the imaging target area when each image is captured. A certain threshold value may also be set at which it can be determined that the dimensions match.

本実施例においては、上記の処理を繰り返した結果、荷物判定部１８は、第２画像の、判定対象である荷物列のうち２番目に左側の荷物列セグメンテーション３００ｂの最上段の荷物１１０ｂの種類及び寸法が、基準荷物情報と一致したと判定したとする。すなわち、第２画像中最も左側の荷物列が、第１画像中の荷物列Ｌ_２と同一であると判定したとする。 In this embodiment, as a result of repeating the above process, the luggage determination unit 18 determines that the type and dimensions of the uppermost luggage 110b in the second leftmost luggage row segmentation 300b in the luggage row to be determined in the second image match the reference luggage information. In other words, it is determined that the leftmost luggage row in the second image is the same as the luggage row _L2 in the first image.

すると荷物判定部１８は再び第１画像を参照し、今度は荷物列セグメンテーション３００ａの最上段から２番目に載置されている荷物１２０ａの荷物情報を抽出し、これと、第２画像の、荷物列セグメンテーション３００ｂの最上段から２番目に載置されている荷物１２０ｂの種類及び寸法と一致しているか判定する。 The luggage determination unit 18 then refers to the first image again, this time extracting luggage information for luggage 120a that is placed second from the top of luggage row segmentation 300a, and determining whether this matches the type and dimensions of luggage 120b that is placed second from the top of luggage row segmentation 300b in the second image.

以上の処理によって、荷物判定部１８は、第１画像及び第２画像間において、上下に隣接する２つの荷物（第１画像の荷物１１０ａ及び１２０ａと第２画像の荷物１１０ｂ及び１２０）が共通する荷物列セグメンテーション（第１画像の３００ａ及び第２画像の３００ｂ）に存在すると判定し、判定結果を画像合成部１９へと送信する。 Through the above processing, the luggage determination unit 18 determines that two vertically adjacent luggage (luggages 110a and 120a in the first image and luggage 110b and 120 in the second image) exist in a common luggage row segmentation (300a in the first image and 300b in the second image) between the first image and the second image, and transmits the determination result to the image synthesis unit 19.

画像合成部１９は、荷物判定部１８から判定結果を受け取ると、第１画像の荷物列セグメンテーション３００ａと第２画像の荷物列セグメンテーション３００ｂとが重なるように２つの画像を重ね合わせ、図５（ｃ）に示す合成画像を生成する。 When the image synthesis unit 19 receives the judgment result from the luggage judgment unit 18, it superimposes the two images so that the luggage row segmentation 300a of the first image and the luggage row segmentation 300b of the second image overlap, generating a composite image as shown in FIG. 5(c).

なお、上記においては荷物列セグメンテーションに含まれる、上下に隣接する２つの荷物が一致する場合に、その荷物列セグメンテーションが一致すると判定したが、比較する荷物の数はこれに限られず、上記の判定処理を、第１画像中の荷物１３０ａ、１４０ａ、及び第２画像中の荷物１３０ｂ、１４０ｂに対しても同様に実行してもよい。比較する荷物の数が増えるほど、画像判定の精度が向上する。 In the above, when two adjacent pieces of luggage included in the luggage row segmentation match, the luggage row segmentation is determined to match. However, the number of pieces of luggage to be compared is not limited to this, and the above determination process may be performed similarly for the pieces of luggage 130a, 140a in the first image and the pieces of luggage 130b, 140b in the second image. The more pieces of luggage to be compared, the more accurate the image determination becomes.

なお、上記の説明においては、説明を簡略化するために第１画像が荷台の先頭を含む部分であり、第２画像が第１画像よりも荷台後方であることを前提としたが、荷物判定部１８は、必ずしも受信した画像データ同士の位置関係を把握できるわけではなく、従って、上述した第１画像及び第２画像とは逆の位置関係の場合もあり得る。従って、左側に配置した第１画像のある荷物列セグメンテーションに含まれる全ての荷物に対して判定処理を行った結果、第２画像に一致する荷物が含まれていると判定されなかった場合には、第２画像が第１画像より荷台前方の部分を撮像した画像である可能性も考慮し、第１画像において、１列左の荷物列セグメンテーションに含まれる荷物を判定対象として新たに設定し、判定処理を続けていく。 In the above explanation, in order to simplify the explanation, it is assumed that the first image is a part including the front of the loading platform, and the second image is further back on the loading platform than the first image. However, the luggage determination unit 18 cannot necessarily grasp the positional relationship between the received image data, and therefore the positional relationship may be the opposite of that between the first image and the second image described above. Therefore, if the result of performing the determination process on all luggage included in the luggage row segmentation with the first image located on the left is that it is not determined that the luggage included in the second image matches the luggage included in the luggage row segmentation located on the left side, the possibility that the second image is an image of a part of the loading platform further forward than the first image is taken into consideration, and the luggage included in the luggage row segmentation one row to the left in the first image is newly set as the determination target, and the determination process continues.

最後に、出力部２０が判定結果を荷物情報ＤＢ２１及び／または外部サーバ４に出力し、教師データの更新等を行う。 Finally, the output unit 20 outputs the judgment result to the luggage information DB 21 and/or the external server 4, and updates the teacher data, etc.

上述した一連の判定処理について図６のフローチャートを用いて説明する。画像取得部１５が撮像装置３から複数の画像を取得した時点で処理がスタートする。 The series of determination processes described above will be explained using the flowchart in FIG. 6. The process starts when the image acquisition unit 15 acquires multiple images from the imaging device 3.

ステップＳ６０１において荷物検出部１６は、画像取得部１５から取得した複数の画像のそれぞれに対して、上述したようなＡＩモデルを用いて画像が含む荷物の種類及び位置座標を検出する。 In step S601, the luggage detection unit 16 detects the type and location coordinates of luggage contained in each of the multiple images acquired from the image acquisition unit 15 using the AI model described above.

ステップＳ６０２において、荷物検出部１６は、検出した荷物の種類毎の荷物の横幅の平均を算出する。 In step S602, the luggage detection unit 16 calculates the average width of each type of luggage detected.

ステップＳ６０３において、荷物寸法算出部１７は、図４を用いて説明した通り荷物情報をソートし、荷物列毎の情報に整理する。この際に、ステップＳ６０２にて算出した荷物の種類毎の横幅の平均を用いて、荷物判定部１８による判定対象から除外する荷物列を選定してもよい。 In step S603, the luggage dimension calculation unit 17 sorts the luggage information as described with reference to FIG. 4 and organizes the information for each luggage row. At this time, the luggage row to be excluded from the judgment target by the luggage judgment unit 18 may be selected using the average width for each luggage type calculated in step S602.

ステップＳ６０４において、荷物判定部１８は、図５を用いて説明した通り、第１画像（取得した複数の画像のうち最も荷台前方部分を撮像した画像）内の右端の荷物列セグメンテーション内の荷物を抽出し、荷物情報を算出する。 In step S604, the luggage determination unit 18 extracts luggage in the luggage row segmentation at the right end in the first image (the image capturing the most forward part of the loading platform among the multiple images acquired) and calculates luggage information, as described with reference to FIG. 5.

ステップＳ６０５－Ｓ６０６において、荷物判定部１８は、第２画像（第１画像よりも荷台後方部分を撮像した画像）内の、判定対象の荷物の荷物情報を検出し、荷物寸法算出部１７から受け取った荷物情報に基づいて、ステップＳ６０４にて抽出された荷物列セグメンテーション中の荷物と共通する荷物が第２画像内に存在するか否か判定する。 In steps S605-S606, the luggage determination unit 18 detects luggage information of the luggage to be determined in the second image (an image captured from the rear of the loading platform relative to the first image), and determines, based on the luggage information received from the luggage dimension calculation unit 17, whether or not there is luggage in the second image that is common to the luggage in the luggage row segmentation extracted in step S604.

ステップＳ６０６において、第１画像及び第２画像間に共通する荷物が存在しないと判定された場合には、第１画像中の、一列左の荷物列セグメンテーション内の荷物情報の算出を行い、同様に第２画像内に共通する荷物が存在するか否かの判定を続ける。 If it is determined in step S606 that there is no luggage in common between the first image and the second image, the luggage information in the luggage row segmentation one row to the left in the first image is calculated, and similarly, the determination of whether there is any luggage in common in the second image is continued.

ステップＳ６０６において、第１画像及び第２画像間に共通する荷物が存在すると判定された場合には、ステップＳ６０７へと移行する。このステップＳ６０７では荷物判定部１８はさらに、第１画像及び第２画像のそれぞれに対して、ステップＳ６０６において共通すると判定された荷物の一段下の荷物が共通しているか判定する。 If it is determined in step S606 that there is a piece of luggage in common between the first image and the second image, the process proceeds to step S607. In step S607, the luggage determination unit 18 further determines whether there is a piece of luggage in common between the first image and the second image that is one level below the piece of luggage determined to be in common in step S606.

ステップＳ６０７で、一段下の荷物が画像間で一致すると判定されれば、ステップＳ６０８において、その結果を受け取った画像合成部１９によって画像が合成される。一段下の荷物が画像間で一致しないと判定されれば、ステップＳ６０６において共通すると判定されていた荷物は偶然両画像内に存在していたと判定され、荷物判定部１８による処理はステップＳ６０９へと移行する。その後は既述の通りである。 If it is determined in step S607 that the images of the luggage one level below match, then in step S608 the images are combined by the image combination unit 19 which receives the result. If it is determined that the images of the luggage one level below do not match, then it is determined that the luggage determined to be common in step S606 happened to exist in both images, and the processing by the luggage determination unit 18 proceeds to step S609. The process thereafter is as described above.

画像合成部１９による画像合成が行われると、一連の処理結果が出力部２０によって荷物情報ＤＢ２１及び／または外部サーバ４に出力され、処理を終了する。 Once image synthesis is performed by the image synthesis unit 19, the results of the process are output by the output unit 20 to the luggage information DB 21 and/or the external server 4, and the process ends.

以上説明したように、本発明によれば、画像間に共通する荷物が存在するか否かという判定を、荷物の種類及び位置座標に基づく寸法に基づいて実行しているため、類似する荷物が多数存在するような場合であっても判定の精度を向上させることが可能になる。しかも、画像間で共通する荷物が存在すると判定された場合に、その荷物に接する荷物についても共通性があるか判定することにより、判定精度をより向上させることが可能になる。さらに、荷物の種類及び寸法の検出に際して、荷物検出部を、予め複数の荷物を撮像して得られた画像を教師データとして、荷物の種類及び寸法を機械学習させて生成したＡＩモデルとすることにより、荷物検出の精度を非常に向上させることが可能になる。 As described above, according to the present invention, the determination of whether or not there is luggage common between images is performed based on the type of luggage and dimensions based on the position coordinates, making it possible to improve the accuracy of the determination even when there are many similar luggage. Moreover, when it is determined that there is luggage common between images, it is possible to further improve the accuracy of the determination by determining whether there is commonality in luggage adjacent to that luggage. Furthermore, when detecting the type and dimensions of luggage, the luggage detection unit can be an AI model generated by machine learning the type and dimensions of luggage using images obtained by previously capturing multiple luggage as training data, making it possible to greatly improve the accuracy of luggage detection.

以上、本発明の実施形態について詳述したが、本発明は、上記の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の精神を逸脱しない範囲で、種々の変更を行うことができる。本発明は、或る実施形態の構成を他の実施形態の構成に追加したり、或る実施形態の構成を他の実施形態と置換したり、或る実施形態の構成の一部を削除したりすることができる。 Although the embodiments of the present invention have been described in detail above, the present invention is not limited to the above embodiments, and various modifications can be made without departing from the spirit of the present invention described in the claims. The present invention allows the configuration of one embodiment to be added to the configuration of another embodiment, the configuration of one embodiment to be replaced with another embodiment, or part of the configuration of one embodiment to be deleted.

１画像処理装置、１５画像取得部、１６荷物検出部、１７荷物寸法算出部、１８荷物判定部、１９画像合成部 1 Image processing device, 15 Image acquisition unit, 16 Baggage detection unit, 17 Baggage size calculation unit, 18 Baggage determination unit, 19 Image synthesis unit

Claims

An image processing device that captures images of a plurality of packages and synthesizes the images,
an image acquisition unit for acquiring the plurality of images;
a baggage detection unit that detects the type and location coordinates of each of the plurality of bags from the plurality of images;
a luggage dimension calculation unit that calculates the dimensions of the luggage from the position coordinates of the luggage;
a baggage determination unit that determines whether or not a common baggage is included among the plurality of images based on the detected baggage type and size;
an image synthesis unit that synthesizes the plurality of images by superimposing a portion of the common package when the package determination unit determines that the plurality of images contain a common package;
Equipped with
When it is determined that a common package is included among the plurality of images, the package determination unit further determines whether or not another package adjacent to the common package is common among the plurality of images.
13. An image processing device comprising:

2. The image processing device according to claim 1 ,
the luggage detection unit detects the type and position coordinates for each row of luggage in which the luggage is stacked in a vertical direction;
the luggage determination unit determines, for each of the luggage rows, whether or not the luggage is a common luggage among the plurality of images, and when it is determined that a common luggage is included among the plurality of images, it further determines whether or not another luggage adjacent to the top or bottom of the common luggage is common among the plurality of images.
13. An image processing device comprising:

2. The image processing device according to claim 1,
the baggage determination unit determines, among the plurality of images, baggage whose type matches and whose dimensional difference is within a predetermined threshold value as a common baggage;
13. An image processing device comprising:

2. The image processing device according to claim 1,
the luggage detection unit uses images of the plurality of luggage as training data and performs machine learning to learn a method for extracting luggage included in the images and a method for identifying the type of the luggage;
The baggage detection unit detects the type and position coordinates of each of the plurality of bags of luggage from the plurality of images acquired by the image acquisition unit.
13. An image processing device comprising:

2. The image processing device according to claim 1,
The plurality of packages are a plurality of packages loaded on a loading platform of a motor vehicle,
2. An image processing device according to claim 1, wherein the plurality of images are images obtained by dividing the loading platform in the front-rear direction of the motor vehicle.