JP7852001B2

JP7852001B2 - Imaging and analysis device, program, and imaging and analysis method

Info

Publication number: JP7852001B2
Application number: JP2024151399A
Authority: JP
Inventors: 祐介谷田部; 弘典小味; 康則平川; 泰裕村井; 歩中村; 洋登永吉; 博章三沢; 義明坪倉; 暢幸棚谷; 淳一木村
Original assignee: Hitachi Industry and Control Solutions Co Ltd
Current assignee: Hitachi Industry and Control Solutions Co Ltd
Priority date: 2024-09-03
Filing date: 2024-09-03
Publication date: 2026-04-27
Anticipated expiration: 2044-09-03
Also published as: JP2026046686A

Description

本発明は、作業現場を撮像して作業現場の状況把握を支援する撮像解析装置、プログラムおよび撮像解析方法に関する。 This invention relates to an imaging and analysis device, program, and imaging and analysis method that support understanding the conditions of a work site by imaging the work site.

建設工事に用いられる鋼矢板やＨ形鋼などの重資材は、大型トラックに複数混在して積載されて輸送される。このような重資材を検収する際には、作業員がトラックの荷台に乗って目視で数や破損状態を確認する。このため工数や安全面が課題となっている。また大規模な建設現場での進捗確認では、現場移動のコストが大きく、映像の遠隔監視により効率化が求められている。 Heavy materials used in construction, such as steel sheet piles and H-beams, are transported in large trucks, often in mixed loads. During inspection, workers must visually check the quantity and condition of these materials by standing on the truck bed. This presents challenges in terms of labor costs and safety. Furthermore, monitoring progress at large construction sites incurs significant costs due to site travel, highlighting the need for efficiency improvements through remote video monitoring.

これらの課題解決の手法として、破損などの不具合がない資材や建設計画どおりに建設が進んだ建設物の三次元モデルと、現物の資材や建設中の建設物を示す三次元モデルとの比較がある。比較して差分が把握できれば、効率的な検収や進捗把握ができるようになる。 One method for addressing these challenges is to compare three-dimensional models of materials without defects or damage, and of buildings constructed according to the construction plan, with three-dimensional models representing the actual materials and buildings under construction. By comparing and identifying the differences, more efficient inspection and progress tracking become possible.

他の課題解決の手法として、トラック荷の積載状況や建設現場を示すテクスチャが付与された三次元メッシュ（三次元モデル、デジタルツイン）がある。このような三次元モデルの再現精度が高ければ、自由視点での目視確認や、計数や寸法測定が可能となる。 Another approach to solving these problems involves creating a three-dimensional mesh (three-dimensional model, digital twin) with textures that represent the loading status of truck cargo or the construction site. If the accuracy of such a three-dimensional model is high, it becomes possible to perform visual inspection from any viewpoint, as well as counting and dimensional measurement.

三次元モデル同士の比較技術として、特許文献１に記載の物量算出装置がある。この物量算出装置は、所定の空間を示す第１三次元モデルと、当該所定の空間を示し、第１三次元モデルとは異なる第２三次元モデルと、を取得する取得部を備える。また物量算出装置は、第１三次元モデルおよび第２三次元モデルそれぞれが有する属性情報に基づいて、第１三次元モデルと第２三次元モデルとの位置合わせを行う位置合わせ部を備える。さらに物量算出装置は、第１三次元モデルと第２三次元モデルとの差分の量を算出し、当該差分が有する属性情報と当該差分の量を示す差分情報とを出力する算出部を備える。 As a technique for comparing three-dimensional models, there is a material quantity calculation device described in Patent Document 1. This material quantity calculation device includes an acquisition unit that acquires a first three-dimensional model representing a predetermined space and a second three-dimensional model representing the same predetermined space but different from the first three-dimensional model. The material quantity calculation device also includes an alignment unit that aligns the first and second three-dimensional models based on the attribute information of each model. Furthermore, the material quantity calculation device includes a calculation unit that calculates the difference between the first and second three-dimensional models and outputs attribute information related to that difference and difference information indicating the amount of that difference.

国際公開第２０２０／１７９４３８号International Publication No. 2020/179438

特許文献１に記載の物量算出装置は、属性情報を認識して２つの三次元モデルの位置を合わせるので、処理コストが大きいという問題がある。作業現場にある物体を三次元モデル（デジタルツイン）として再現し、基準となる三次元モデルとの比較、または再現精度の向上により、物品の検収や建設の進捗確認などを含めた現場状況を把握する支援が求められている。
本発明は、このような背景に鑑みてなされたものであり、三次元モデルを用いた現場状況把握の支援を可能とする撮像解析装置、プログラムおよび撮像解析方法を提供することを課題とする。 The material quantity calculation device described in Patent Document 1 has the problem of high processing costs because it recognizes attribute information and aligns the positions of two three-dimensional models. There is a need for support in understanding on-site conditions, including inspection of goods and confirmation of construction progress, by reproducing objects at the work site as three-dimensional models (digital twins) and comparing them with a reference three-dimensional model, or by improving the accuracy of the reproduction.
This invention was made in view of the above background, and aims to provide an imaging analysis device, program, and imaging analysis method that enable support for understanding on-site conditions using a three-dimensional model.

上記した課題を解決するため、本発明に係る撮像解析装置は、撮像情報を基に撮像対象物体の三次元モデルである生成三次元モデルを生成する三次元モデル生成部と、前記生成三次元モデルとの比較対象となる、物体の三次元モデルであり、かつ、前記撮像対象物体との比較対象となる物体の三次元モデルである参照三次元モデルおよび前記生成三次元モデルを比較して、差分を算出する比較部と、前記撮像対象物体において前記差分が生じている部分である追加撮像箇所を撮像するのに適した位置・姿勢を出力する追加撮像支援部と、を備える。 To solve the above-mentioned problems, the imaging analysis apparatus according to the present invention comprises: a three-dimensional model generation unit that generates a generated three-dimensional model, which is a three-dimensional model of the object to be imaged, based on imaging information; a comparison unit that compares the generated three-dimensional model with a reference three-dimensional model, which is a three-dimensional model of an object that is also a three-dimensional model of an object to be compared with the object to be imaged, and calculates the difference; and an additional imaging support unit that outputs a position and orientation suitable for imaging the additional imaging location, which is the part of the object to be imaged where the difference occurs.

本発明によれば、三次元モデルを用いた現場状況把握の支援を可能とする撮像解析装置、プログラムおよび撮像解析方法を提供することができる。上記した以外の課題、構成および効果は、以下の実施形態の説明により明らかにされる。 According to the present invention, it is possible to provide an imaging analysis device, program, and imaging analysis method that enable support for understanding on-site conditions using a three-dimensional model. Other problems, configurations, and effects will be clarified by the following description of embodiments.

第１実施形態に係る撮像解析装置の機能ブロック図である。This is a functional block diagram of the imaging and analysis device according to the first embodiment. 第１実施形態に係る撮像解析結果画面の画面構成図である。This is a screen configuration diagram of the imaging analysis results screen according to the first embodiment. 第１実施形態に係る追加撮像の位置・姿勢を示すポインタが表示される撮像解析結果画面の画面構成図である。This is a screen configuration diagram of the imaging analysis results screen, which displays a pointer indicating the position and orientation of additional imaging according to the first embodiment. 第１実施形態に係る追加撮像後の撮像解析結果画面の画面構成図である。This is a screen configuration diagram of the imaging analysis results screen after additional imaging according to the first embodiment. 第１実施形態に係る撮像解析処理のフローチャートである。This is a flowchart of the imaging analysis process according to the first embodiment. 第２実施形態に係る映像解析装置を示す概略の構成図である。This is a schematic diagram showing the video analysis device according to the second embodiment. 橋脚の映像データ例である。This is an example of video data of a bridge pier. 第２実施形態に係る計測可視化処理のフローチャートである。This is a flowchart of the measurement visualization process according to the second embodiment. 第２実施形態に係る映像解析装置のデータフロー図である。This is a data flow diagram of the video analysis device according to the second embodiment. 第２実施形態に係る映像解析装置のハードウェアの構成図である。This is a hardware configuration diagram of the video analysis device according to the second embodiment. 第３実施形態に係る映像解析装置を示す概略の構成図である。This is a schematic diagram showing the video analysis device according to the third embodiment. 第３実施形態に係る映像解析装置のデータフロー図である。This is a data flow diagram of the video analysis device according to the third embodiment. 第４実施形態に係る映像解析装置を示す概略の構成図である。This is a schematic diagram showing the video analysis device according to the fourth embodiment. 第４実施形態に係る計測可視化処理のフローチャートである。This is a flowchart of the measurement visualization process according to the fourth embodiment. 上記した実施形態に係る撮像解析装置や映像解析装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。This is a hardware configuration diagram showing an example of a computer that implements the functions of the imaging and video analysis device and the video analysis device according to the above embodiment.

以下に本発明を実施するための形態（実施形態）における撮像解析装置について説明する。撮像解析装置は、複数のＬｉＤＡＲ（Light Detection And Ranging）およびカメラが作業現場を撮像した三次元点群情報および画像（映像）を基に三次元モデルを生成する。この三次元モデルは、例えばトラックに積載された鋼矢板やＨ形鋼などの重資材のテクスチャ（画像）付き三次元メッシュである。次に撮像解析装置は、生成された三次元モデル（生成三次元モデル）と基準となる三次元モデル（参照三次元モデル）とを比較して、差分を出力する。差分は、例えば鋼矢板の長尺方向における長さの差分であり、基準となる鋼矢板から切断された部分の長さである。鋼矢板やＨ形鋼などの重資材の検収においては、破損状況の把握の他に、重資材の長さと数量を求めることが検収に含まれる。 The following describes an imaging analysis device in an embodiment for carrying out the present invention. The imaging analysis device generates a three-dimensional model based on three-dimensional point cloud information and images (video) captured by multiple LiDAR (Light Detection and Ranging) and cameras at the work site. This three-dimensional model is, for example, a textured three-dimensional mesh of heavy materials such as steel sheet piles and H-beams loaded on a truck. Next, the imaging analysis device compares the generated three-dimensional model (generated three-dimensional model) with a reference three-dimensional model (reference three-dimensional model) and outputs the difference. The difference is, for example, the difference in length in the longitudinal direction of the steel sheet pile, and is the length of the portion cut from the reference steel sheet pile. In the inspection of heavy materials such as steel sheet piles and H-beams, in addition to understanding the extent of damage, determining the length and quantity of the heavy materials is included in the inspection.

従来までは、作業員がトラックの荷台に乗って重資材の長さを計測して検収していた。撮像解析装置を用いることで、作業員の検収作業の効率が向上するだけではなく、作業員の安全性も向上することができるようになる。 Traditionally, workers would stand on the back of a truck to measure the length of heavy materials and then inspect them. By using an imaging and analysis device, not only can the efficiency of the workers' inspection process be improved, but the safety of the workers can also be enhanced.

また撮像解析装置は、建設の進捗状況の把握・監視にも利用できる。基準となる三次元モデルとしての、建設計画どおりに建設が進んだ建築物の三次元モデル（参照三次元モデル）と、生成三次元モデルとを比較することで、管理者は建設計画との差分／進捗状況を把握できるようになる。 Furthermore, the imaging and analysis system can also be used to understand and monitor the progress of construction. By comparing the generated 3D model with a 3D model of a building constructed according to the construction plan (reference 3D model), managers can understand the differences and progress compared to the construction plan.

このように撮像解析装置は、作業現場の状況確認を支援することができる。以下では、鋼矢板やＨ形鋼などの重資材の検収を例に、撮像解析装置を説明する。 Thus, imaging and analysis devices can support the confirmation of conditions at the work site. Below, we will explain imaging and analysis devices using the inspection of heavy materials such as steel sheet piles and H-beams as an example.

≪撮像解析装置の構成≫
図１は、第１実施形態に係る撮像解析装置１００の機能ブロック図である。撮像解析装置１００には、複数のＬｉＤＡＲ４１１およびカメラ４１２が接続される。組となるＬｉＤＡＲ４１１およびカメラ４１２は、近くに設置されているものとする。ＬｉＤＡＲ４１１は、撮像した三次元点群情報を撮像解析装置１００に送信する。またカメラ４１２は、撮像した画像（映像）を撮像解析装置１００に送信する。以下では、ＬｉＤＡＲ４１１およびカメラ４１２を合わせて撮像装置とも記す。また三次元点群情報および画像を合わせて撮像情報とも記す。ＬｉＤＡＲ４１１およびカメラ４１２は、一体となった撮像装置であってもよい。また、ＬｉＤＡＲ４１１およびカメラ４１２は、それぞれ撮像している位置や画角が異なるため、第２実施形態以降に記載する両者の位置合わせ手法などにより対応関係を計算してもよい。 <<Configuration of the imaging and analysis system>>
Figure 1 is a functional block diagram of the imaging analysis device 100 according to the first embodiment. Multiple LiDARs 411 and cameras 412 are connected to the imaging analysis device 100. Pairs of LiDARs 411 and cameras 412 are assumed to be installed close together. The LiDAR 411 transmits captured three-dimensional point cloud information to the imaging analysis device 100. The camera 412 transmits captured images (video) to the imaging analysis device 100. Hereinafter, the LiDARs 411 and cameras 412 will be collectively referred to as the imaging device. The three-dimensional point cloud information and images will also be collectively referred to as imaging information. The LiDARs 411 and cameras 412 may be an integrated imaging device. Furthermore, since the LiDARs 411 and cameras 412 have different imaging positions and fields of view, their correspondence may be calculated using the alignment method for both described in the second embodiment and subsequent embodiments.

撮像解析装置１００はコンピュータであり、制御部１１０、記憶部１２０、および入出力部１８０を備える。入出力部１８０には、ディスプレイやキーボード、マウスなどのユーザインターフェイス機器が接続される。入出力部１８０が通信デバイスを備え、ＬｉＤＡＲ４１１およびカメラ４１２とのデータ送受信が可能である。また入出力部１８０にメディアドライブが接続され、記録媒体を用いたデータのやり取りが可能であってもよい。 The imaging and analysis device 100 is a computer comprising a control unit 110, a storage unit 120, and an input/output unit 180. User interface devices such as a display, keyboard, and mouse are connected to the input/output unit 180. The input/output unit 180 is equipped with a communication device, enabling data transmission and reception with the LiDAR 411 and camera 412. A media drive may also be connected to the input/output unit 180, enabling data exchange using a recording medium.

≪撮像解析装置：記憶部≫
記憶部１２０は、ＲＯＭ（Read Only Memory）やＲＡＭ（Random Access Memory）、ＳＳＤ（Solid State Drive）などの記憶機器を含んで構成される。記憶部１２０には、撮像情報データベース１３０、撮像装置情報データベース１４０、物体情報データベース１５０、参照三次元モデルデータベース１６０、認識モデル１２１、生成三次元モデル１２２、およびプログラム１２８が記憶される。なお、記憶部１２０の各種記憶内容は、クラウドサーバなどの外部の記憶装置に記憶されているものを必要に応じて読み込まれるようにしたものであってもよい。 ≪Imaging and Analysis Device: Memory Unit≫
The memory unit 120 is composed of memory devices such as ROM (Read Only Memory), RAM (Random Access Memory), and SSD (Solid State Drive). The memory unit 120 stores an imaging information database 130, an imaging device information database 140, an object information database 150, a reference three-dimensional model database 160, a recognition model 121, a generated three-dimensional model 122, and a program 128. The various contents of the memory unit 120 may be stored in an external storage device such as a cloud server and read as needed.

≪記憶部：撮像情報データベース・撮像装置情報データベース≫
撮像情報データベース１３０には、撮像装置が撮像した三次元点群情報および画像が、撮像装置の識別情報や撮像時刻と関連付けられて記憶される。
撮像装置情報データベース１４０には、撮像装置であるＬｉＤＡＲ４１１やカメラ４１２に係る識別情報や種別、仕様、設置位置、設置姿勢などの情報が格納される。 ≪Storage Unit: Imaging Information Database / Imaging Device Information Database≫
The imaging information database 130 stores three-dimensional point cloud information and images captured by the imaging device, associated with the imaging device's identification information and the time of acquisition.
The imaging device information database 140 stores information such as identification information, type, specifications, installation location, and installation orientation related to the imaging device, LiDAR 411, and camera 412.

≪記憶部：物体情報データベース・参照三次元モデルデータベース≫
物体情報データベース１５０には、撮像対象となる物体（撮像対象物体）の情報が記憶される。第１実施形態では撮像対象物体は重資材であり、基準となる重資材の名称やタイプ、サイズ／大きさ、撮像画像、三次元モデル（参照三次元モデル）の識別情報などの情報が関連付けられて、物体情報データベース１５０に記憶される。なお基準となるとは、物体（重資材）が、切断前や破損・変形が生じる前の新品と同じ状態にあり、検収対象の重資材と比較される基準となるということである。 ≪Memory Unit: Object Information Database / Reference 3D Model Database≫
The object information database 150 stores information about the object to be imaged (the object to be imaged). In the first embodiment, the object to be imaged is heavy material, and information such as the name, type, size, imaged image, and identification information of the three-dimensional model (reference three-dimensional model) of the reference heavy material is associated with it and stored in the object information database 150. "Reference" means that the object (heavy material) is in the same condition as a new product before cutting or before any damage or deformation occurs, and serves as a standard for comparison with the heavy material subject to inspection.

参照三次元モデルデータベース１６０には、識別情報と関連付けられて、基準となる重資材の三次元モデルである参照三次元モデルが記憶される。この三次元モデルは三次元点群情報を基に生成されたものとは限らず、例えば設計情報としての三次元モデル（三次元ＣＡＤモデル）であってもよい。なお撮像解析装置１００を建設の進捗確認に用いる場合に参照三次元モデルデータベース１６０には、ＢＩＭ（Building Information Modeling）やＣＩＭ（Construction Information Modeling）などの設計三次元データが格納されてもよい。 The reference three-dimensional model database 160 stores reference three-dimensional models, which are three-dimensional models of standard heavy materials, associated with identification information. These three-dimensional models are not necessarily generated based on three-dimensional point cloud information; for example, they may be three-dimensional models (three-dimensional CAD models) used as design information. Furthermore, when the imaging analysis device 100 is used for construction progress verification, the reference three-dimensional model database 160 may also store design three-dimensional data such as BIM (Building Information Modeling) or CIM (Construction Information Modeling).

≪記憶部：認識モデル≫
認識モデル１２１は、画像を基に撮像対象物体のタイプを認識／識別して検出する機械学習モデルである。画像は、カメラ４１２が撮像した画像に限らず、撮像対象物体を示す三次元モデルのテクスチャである画像であってもよい。 ≪Memory Unit: Cognitive Model≫
The recognition model 121 is a machine learning model that recognizes/identifies and detects the type of object being imaged based on the image. The image is not limited to an image captured by the camera 412, but may also be an image that is a texture of a three-dimensional model representing the object being imaged.

タイプとしては、鋼矢板やＨ形鋼などがあるが、他の重資材やさらに細かなタイプであってもよい。また認識モデル１２１は、物体（重資材）の断面（長尺方向の垂直面）が写る画像を基に、個々の物体を検出することができる。 Examples of types include steel sheet piles and H-beams, but other heavy materials or even more specific types may also be used. Furthermore, the recognition model 121 can detect individual objects based on images showing the cross-section (vertical plane in the long direction) of the objects (heavy materials).

≪記憶部：生成三次元モデル・プログラム≫
生成三次元モデル１２２は、撮像情報を基に生成された、撮像対象物体（重資材）のテクスチャ付きの三次元メッシュである。生成三次元モデル１２２の生成手法については、後記する。
プログラム１２８は、後記する制御部１１０に備わる機能部が実行する処理の記述を含む。 ≪Memory Unit: Generating Three-Dimensional Model Program≫
The generated three-dimensional model 122 is a textured three-dimensional mesh of the imaged object (heavy material) generated based on the imaging information. The method for generating the generated three-dimensional model 122 will be described later.
Program 128 includes a description of the processing to be performed by the functional unit of the control unit 110, which will be described later.

≪撮像解析装置：制御部≫
制御部１１０は、ＣＰＵ（Central Processing Unit）を含んで構成され、撮像取得部１１１、三次元モデル生成部１１２、比較部１１３、表示制御部１１４、および追加撮像支援部１１５が備わる。制御部１１０は、ＧＰＵ（Graphics Processing Unit）、ＮＰＵ（Neural (network) Processing Unit）、ＦＰＧＡ（Field Programmable Gate Array）、ＡＳＩＣ（Application Specific Integrated Circuit）などを含んで構成されてもよい。 <<Imaging and Analysis System: Control Unit>>
The control unit 110 includes a CPU (Central Processing Unit) and comprises an image acquisition unit 111, a three-dimensional model generation unit 112, a comparison unit 113, a display control unit 114, and an additional image acquisition support unit 115. The control unit 110 may also include a GPU (Graphics Processing Unit), an NPU (Neural (network) Processing Unit), an FPGA (Field Programmable Gate Array), an ASIC (Application Specific Integrated Circuit), etc.

≪制御部：撮像取得部≫
撮像取得部１１１は、ＬｉＤＡＲ４１１やカメラ４１２が撮像して送信した三次元点群情報や画像を撮像情報データベース１３０に格納する。撮像取得部１１１は、撮像装置の識別情報や撮像時刻も合わせて撮像情報データベース１３０に格納する。 ≪Control Unit: Image Acquisition Unit≫
The imaging acquisition unit 111 stores the three-dimensional point cloud information and images captured and transmitted by the LiDAR 411 and camera 412 in the imaging information database 130. The imaging acquisition unit 111 also stores the identification information of the imaging device and the time of acquisition in the imaging information database 130.

≪制御部：三次元モデル生成部≫
三次元モデル生成部１１２は、撮像情報データベース１３０にある三次元点群情報や画像を基に、テクスチャ付き三次元メッシュである三次元モデルを生成する。この三次元モデルは、現場にある物体（重資材）を示す生成三次元モデルである。詳しく説明すると三次元モデル生成部１１２は、ＬｉＤＡＲ４１１の位置・姿勢を参照し、複数の三次元点群情報を基に三次元メッシュを生成する。三次元メッシュの生成手法として、Alpha Shape法やPoisson法などがある。三次元メッシュは、撮像対象物体（の表面）を示すポリゴンや位置を含んで構成されるものとする。 ≪Control Unit: Three-Dimensional Model Generation Unit≫
The three-dimensional model generation unit 112 generates a three-dimensional model, which is a textured three-dimensional mesh, based on three-dimensional point cloud information and images in the imaging information database 130. This three-dimensional model is a generated three-dimensional model that represents an object (heavy material) at the site. More specifically, the three-dimensional model generation unit 112 refers to the position and orientation of the LiDAR 411 and generates a three-dimensional mesh based on multiple three-dimensional point cloud information. Three-dimensional mesh generation methods include the Alpha Shape method and the Poisson method. The three-dimensional mesh is composed of polygons and positions that represent the surface of the object to be imaged.

複数の三次元点群情報をつなぎ合わせる手法として、例えばＩＣＰ（Iterative Closest Point）があるが、三次元モデル生成部１１２はＩＣＰとは異なる手法を用いて三次元メッシュを生成してもよい。例えば三次元モデル生成部１１２は、個々の三次元点群情報に含まれる特徴量を抽出し、対応する特徴量を参照することで三次元点群情報をつなぎ合わせて三次元メッシュを生成してもよい。特徴量としては、例えばエッジや尖った部分などがある。 One method for combining multiple three-dimensional point cloud data is ICP (Iterative Closest Point), but the three-dimensional model generation unit 112 may generate a three-dimensional mesh using a method different from ICP. For example, the three-dimensional model generation unit 112 may extract feature quantities contained in each three-dimensional point cloud data and generate a three-dimensional mesh by combining the three-dimensional point cloud data by referencing the corresponding feature quantities. Examples of feature quantities include edges and sharp points.

次に三次元モデル生成部１１２は、三次元メッシュ、画像、カメラ４１２の位置・姿勢を参照して、ＵＶマップとテクスチャを生成する。テクスチャは、三次元メッシュに貼られる画像である。ＵＶマップは三次元メッシュを構成するポリゴン（の画素）とテクスチャ（の画素）の対応を示す。この際、撮像時の画像を基にＵＶマップの対応関係を算出してもよい。この場合出力となるテクスチャ画像は、撮像時の画像から変換のない画像である。 Next, the 3D model generation unit 112 generates a UV map and texture by referring to the 3D mesh, image, and the position and orientation of the camera 412. The texture is the image applied to the 3D mesh. The UV map shows the correspondence between the polygons (pixels) that make up the 3D mesh and the texture (pixels). In this case, the correspondence of the UV map may be calculated based on the image captured during imaging. In this case, the output texture image is an image that has not been converted from the image captured during imaging.

三次元モデル生成部１１２は、ポリゴンに相当する撮像対象物体の表面を撮像したカメラ４１２が１つである場合には、当該カメラ４１２が撮像した画像に基づいて、テクスチャを算出する。詳しく説明すると三次元モデル生成部１１２は、ポリゴンの画素に対応するテクスチャの画素を、当該カメラ４１２が撮像した画像（画素）に基づいて算出する。 The three-dimensional model generation unit 112 calculates the texture based on the image captured by the camera 412 when there is only one camera 412 that captures the surface of the object to be imaged, which corresponds to a polygon. More specifically, the three-dimensional model generation unit 112 calculates the pixels of the texture corresponding to the pixels of the polygon based on the image (pixels) captured by the camera 412.

以下では、ポリゴンに相当する撮像対象物体の表面を撮像したカメラ４１２が複数である場合を説明する。三次元モデル生成部１１２は、ポリゴンの画素（当該画素に対応するテクスチャの画素）を、当該画素に対応するカメラ４１２が撮像した画像、および当該画素の位置（ポリゴンの位置）と当該カメラ４１２との距離に応じて算出する。例えば三次元モデル生成部１１２は、最短距離のカメラ４１２の画像とする。三次元モデル生成部１１２は画素を、複数の画像に含まれる画素の平均値としてもよいし、距離が近いほど重く加重した平均値（加重平均値）としてもよい。 The following describes the case where multiple cameras 412 capture images of the surface of an object corresponding to a polygon. The three-dimensional model generation unit 112 calculates the polygon's pixels (the texture pixels corresponding to those pixels) based on the image captured by the corresponding camera 412, and the distance between the pixel's position (the polygon's position) and the camera 412. For example, the three-dimensional model generation unit 112 uses the image from the camera 412 at the shortest distance. The three-dimensional model generation unit 112 may use the average value of pixels included in multiple images, or it may use a weighted average value where closer distances are given greater weight.

三次元モデル生成部１１２は、ポリゴンとカメラ４１２との距離ではなく、撮像角度に応じてテクスチャを算出してもよい。三次元モデル生成部１１２は画素を、ポリゴンとカメラ４１２を結ぶ線とポリゴンの垂線がなす角が最小である、より正面から撮像したカメラ４１２の画像とする。三次元モデル生成部１１２は画素を、ポリゴンとカメラ４１２を結ぶ線とポリゴンの垂線がなす角が近いほど重く加重した平均値としてもよい。 The 3D model generation unit 112 may calculate the texture based on the imaging angle rather than the distance between the polygon and the camera 412. The 3D model generation unit 112 uses the image captured by the camera 412 from the most frontal angle, where the angle between the line connecting the polygon and the camera 412 and the perpendicular line of the polygon is minimized, as the image. The 3D model generation unit 112 may also use an average value for the pixels, weighting more heavily on the angle between the line connecting the polygon and the camera 412 and the perpendicular line of the polygon.

テクスチャは、カメラ４１２が撮像した画像、および、距離または角度に応じて算出された画像を含む。別の形態としてテクスチャは、ＵＶマップで示される、三次元メッシュ（ポリゴン）に対応する画像を含んでもよい。 The texture includes images captured by camera 412 and images calculated according to distance or angle. Alternatively, the texture may include images corresponding to a three-dimensional mesh (polygon) represented by a UV map.

以上に説明したように撮像解析装置１００は、撮像情報を基に撮像対象物体の三次元モデルである生成三次元モデル１２２を生成する三次元モデル生成部１１２を備える。
撮像情報は、撮像対象物体の三次元点群情報および画像である。
生成三次元モデル１２２は、テクスチャが付与された三次元メッシュであるテクスチャ付き三次元メッシュである。 As described above, the imaging analysis device 100 includes a three-dimensional model generation unit 112 that generates a generated three-dimensional model 122, which is a three-dimensional model of the object to be imaged, based on the imaging information.
The imaging information consists of three-dimensional point cloud information and images of the object being imaged.
The generated three-dimensional model 122 is a textured three-dimensional mesh, which is a three-dimensional mesh to which a texture has been applied.

撮像情報は、複数の撮像装置が撮像した、撮像対象物体の複数の三次元点群情報および画像である。
三次元モデル生成部１１２は、複数の撮像装置の位置および姿勢（撮像装置情報データベース１４０参照）を参照して、三次元メッシュとテクスチャとを生成するとともに、当該三次元メッシュと当該テクスチャとの対応関係であるＵＶマップを生成する。 The imaging information consists of multiple three-dimensional point cloud data and images of the target object, captured by multiple imaging devices.
The three-dimensional model generation unit 112 generates a three-dimensional mesh and texture by referring to the positions and orientations of multiple imaging devices (see imaging device information database 140), and also generates a UV map which represents the correspondence between the three-dimensional mesh and the texture.

三次元モデル生成部１１２は、テクスチャに含まれる画素を、複数の撮像装置が撮像した画像を基に、当該画素に対応する撮像対象物体にある点と当該撮像装置との距離に応じて算出する。
三次元モデル生成部１１２は、画素を、距離が最小の前記撮像装置が撮像した画像を基に算出する。
三次元モデル生成部１１２は、画素を、距離に応じた重みを付けた撮像装置が撮像した画像を基に算出する。
三次元モデル生成部１１２は、テクスチャに含まれる画素を、当該画素に対応する撮像対象物体にある点に対応する、複数の撮像装置が撮像した画像の平均とする。 The three-dimensional model generation unit 112 calculates the pixels included in the texture based on images captured by multiple imaging devices, according to the distance between the point on the object to be imaged corresponding to the pixel and the imaging device.
The three-dimensional model generation unit 112 calculates pixels based on the image captured by the imaging device with the smallest distance.
The three-dimensional model generation unit 112 calculates pixels based on images captured by an imaging device, which are weighted according to distance.
The three-dimensional model generation unit 112 takes the pixels included in the texture and uses the average of images captured by multiple imaging devices corresponding to points on the object to be imaged that correspond to those pixels.

テクスチャは、撮像装置が撮像した画像、および、三次元モデル生成部１１２が算出した画素を含んで構成される画像を含む。
テクスチャは、ＵＶマップに対応した画像を含む。 The texture includes an image captured by the imaging device and an image composed of pixels calculated by the three-dimensional model generation unit 112.
Textures include images that correspond to UV maps.

≪制御部：比較部≫
比較部１１３は、三次元モデル生成部１１２が生成した三次元モデルである生成三次元モデル１２２と、参照三次元モデルデータベース１６０にある三次元モデルである参照三次元モデルとを比較して、差分を算出する。詳しく説明すると比較部１１３は、最初に生成三次元モデル１２２のテクスチャまたはカメラ４１２の画像を基に、認識モデル１２１を用いて撮像対象物体のタイプを検出する。比較部１１３は、参照三次元モデルおよび生成三次元モデル１２２の大きさや形状を比較して、撮像対象物体のタイプを検出してもよい。この際、撮像対象物体（例えば鋼矢板）の長尺方向の長さは、切断されて短くなっている可能性や、破損や変形の可能性を考慮して比較する。 ≪Control Unit: Comparison Unit≫
The comparison unit 113 compares the generated three-dimensional model 122, which is a three-dimensional model generated by the three-dimensional model generation unit 112, with the reference three-dimensional model, which is a three-dimensional model in the reference three-dimensional model database 160, and calculates the difference. More specifically, the comparison unit 113 first detects the type of object to be imaged using the recognition model 121 based on the texture of the generated three-dimensional model 122 or the image from the camera 412. The comparison unit 113 may also detect the type of object to be imaged by comparing the size and shape of the reference three-dimensional model and the generated three-dimensional model 122. In this case, the length in the longitudinal direction of the object to be imaged (for example, a steel sheet pile) is compared taking into account the possibility that it may have been cut and shortened, or that it may have been damaged or deformed.

なお、参照三次元モデルは、参照三次元モデルデータベース１６０に格納されている三次元モデル以外のものでもよい。また、映像から資材の端面や側面を検出して、その情報を基に生成三次元モデルにより大きさや形状を計測してもよい。また、複数の種類の資材が混在されている場合には、映像や三次元情報を基にある程度の塊（形状の類似や距離の類似性）を検知して、その単位ごとに上記の比較を行ってもよい。この塊の検出は、映像認識や三次元情報を基に距離の類似性を、閾値などを用いて判定してもよい。 Furthermore, the reference 3D model may be one other than those stored in the reference 3D model database 160. Alternatively, the end faces and sides of materials may be detected from the video, and their size and shape may be measured using the generated 3D model based on that information. In cases where multiple types of materials are mixed, certain clusters (similar in shape or distance) may be detected based on the video and 3D information, and the above comparisons may be performed for each cluster. This cluster detection may be determined by using a threshold or similar method based on distance similarity using video recognition or 3D information.

次に比較部１１３は、撮像対象物体の断面（長尺方向の垂直面）が写るカメラ４１２の画像または生成三次元モデル１２２のテクスチャを基に、認識モデル１２１を用いて個々の撮像対象物体を検出して個数をカウントする。比較部１１３は、生成三次元モデル１２２の体積および参照三次元モデルの体積を基に撮像対象物体の個数をカウントしてもよい。また比較部１１３は、個数カウントの結果、生成三次元モデル１２２において、それぞれの対象物体が占める部分を取得してもよい。 Next, the comparison unit 113 uses the recognition model 121 to detect and count individual target objects based on the image from the camera 412 showing the cross-section (vertical plane in the longitudinal direction) of the target object, or the texture of the generated three-dimensional model 122. The comparison unit 113 may also count the number of target objects based on the volume of the generated three-dimensional model 122 and the volume of the reference three-dimensional model. Furthermore, the comparison unit 113 may obtain the portion of the generated three-dimensional model 122 occupied by each target object based on the count result.

また比較部１１３は、生成三次元モデル１２２を基に、撮像対象物体のサイズ、特に長尺方向の長さを算出する。さらに比較部１１３は、生成三次元モデル１２２と参照三次元モデルの形状を比較して、差分としての破損や変形を検出する。比較部１１３は、物体情報データベース１５０を参照して検出した撮像対象物体のタイプに対応する参照三次元モデルの識別情報を取得する。続いて比較部１１３は、当該識別情報に対応する参照三次元モデルを参照三次元モデルデータベース１６０から取得し、生成三次元モデル１２２と比較して大きさ（長さ）の差分を検出してもよい。これらの処理は、生成三次元モデル１２２のそれぞれの対象物体に対応した部分に対して行うのが望ましい。 The comparison unit 113 also calculates the size of the object to be imaged, particularly its length in the longitudinal direction, based on the generated three-dimensional model 122. Furthermore, the comparison unit 113 compares the shape of the generated three-dimensional model 122 with that of a reference three-dimensional model to detect any damage or deformation as a difference. The comparison unit 113 obtains identification information for the reference three-dimensional model corresponding to the detected type of object to be imaged by referring to the object information database 150. Subsequently, the comparison unit 113 may obtain the reference three-dimensional model corresponding to this identification information from the reference three-dimensional model database 160 and compare it with the generated three-dimensional model 122 to detect any difference in size (length). It is desirable to perform these processes on the parts of the generated three-dimensional model 122 corresponding to each target object.

以上に説明したように撮像解析装置１００は、生成三次元モデル１２２との比較対象となる、物体の三次元モデルである参照三次元モデルおよび生成三次元モデル１２２を比較して、差分を算出する比較部１１３を備える。
比較部１１３は、撮像対象物体の個数をカウントする。
比較部１１３は、画像またはテクスチャを基に撮像対象物体のタイプを検出し、検出結果のタイプに対応する参照三次元モデルおよび生成三次元モデル１２２を比較して、大きさおよび形状の少なくとも一方の差分を算出する。 As described above, the imaging analysis device 100 includes a comparison unit 113 that compares the generated three-dimensional model 122 with a reference three-dimensional model, which is a three-dimensional model of an object, and the generated three-dimensional model 122, and calculates the difference.
The comparison unit 113 counts the number of objects to be imaged.
The comparison unit 113 detects the type of object to be imaged based on the image or texture, compares the reference three-dimensional model and the generated three-dimensional model 122 corresponding to the detected type, and calculates the difference in at least one of the size and shape.

≪制御部：表示制御部≫
表示制御部１１４は、後記する撮像解析結果画面５１０（後記する図２参照）を入出力部１８０に接続されたディスプレイに出力する。図２は、第１実施形態に係る撮像解析結果画面５１０の画面構成図である。タイプが検出された生成三次元モデル１２２が表示される領域５１１には、検出されたタイプや個数、サイズ（長さ）、破損／変形状況を含むコメント５１２が付与される。タイプが検出されなかった生成三次元モデル１２２が表示される領域５１３には、未検出のコメント５１４が付与される。なお濃いハッチングの領域は、撮像装置の死角となっている領域である。また、複数の資材や複数種資材の混在時には検知資材や検知できなかった資材単位（形状の類似や三次元で計測した距離が近い塊単位）について図２に示すように並べて表示してもよい。 ≪Control Unit: Display Control Unit≫
The display control unit 114 outputs the imaging analysis result screen 510 (see Figure 2 below) to a display connected to the input/output unit 180. Figure 2 is a screen configuration diagram of the imaging analysis result screen 510 according to the first embodiment. In the area 511 where the generated three-dimensional model 122 for which a type has been detected is displayed, comments 512 including the detected type, number, size (length), and damage/deformation status are added. In the area 513 where the generated three-dimensional model 122 for which a type has not been detected is displayed, comments 514 indicating that it was not detected are added. Areas with dark hatching are areas that are blind spots of the imaging device. Furthermore, when multiple materials or multiple types of materials are mixed, the detected materials and material units that could not be detected (units with similar shapes or close distances measured in three dimensions) may be displayed side by side as shown in Figure 2.

なお未検出の領域５１３は、タイプが未検出の領域とは限らない。サイズ（長さ）が変わることが想定されない撮像対象物体について、比較部１１３が生成三次元モデル１２２と参照三次元モデルとでサイズに差（差分）があると判定した場合、表示制御部１１４は差がある部分を未検出の領域として表示する。 Note that the undetected region 513 is not necessarily a region of type undetected. For imaging target objects where the size (length) is not expected to change, if the comparison unit 113 determines that there is a size difference between the generated three-dimensional model 122 and the reference three-dimensional model, the display control unit 114 displays the differing portion as an undetected region.

コメント５１２，５１４の「修正」ボタンを押すことでコメント内容の編集画面（不図示）が表示され、撮像解析装置１００の利用者である作業員はコメント５１２，５１４を修正することができる。「コメント追加」ボタン５１５を押すことで作業員は、領域や当該領域に対するコメントを追加することができる。「追加撮像」ボタン５１６を押すことで作業員は、未検出である領域５１３を撮像するのに適した位置・姿勢を取得することができる。 By pressing the "Edit" button for comments 512 and 514, an editing screen for the comment content (not shown) is displayed, allowing the operator, who is a user of the imaging analysis device 100, to modify comments 512 and 514. By pressing the "Add Comment" button 515, the operator can add a region or a comment about that region. By pressing the "Add Image" button 516, the operator can acquire a suitable position and orientation for imaging the undetected region 513.

図３は、第１実施形態に係る追加撮像の位置・姿勢を示すポインタ５２６が表示される撮像解析結果画面５２０の画面構成図である。作業員は、ポインタ５２６で示される位置・姿勢でＬｉＤＡＲやカメラを用いて撮像し、撮像情報を撮像解析装置１００に送る。撮像解析装置１００は、生成三次元モデル１２２を再度生成し、タイプの検出や個数のカウントなどを再実行して、撮像解析結果画面５３０（後記する図４参照）を再表示する。 Figure 3 is a screen configuration diagram of the imaging analysis results screen 520, which displays a pointer 526 indicating the position and orientation of additional imaging according to the first embodiment. The operator takes images using LiDAR or a camera at the position and orientation indicated by the pointer 526 and sends the imaging information to the imaging analysis device 100. The imaging analysis device 100 regenerates the generated three-dimensional model 122, re-executes type detection and counting, and redisplays the imaging analysis results screen 530 (see Figure 4 below).

図４は、第１実施形態に係る追加撮像後の撮像解析結果画面５３０の画面構成図である。未検出の領域５１３（図２参照）がなくなり、鋼矢板の領域５３１が正しく検出されて表示されている。コメント５３２には、正しい長さが表示されている。このように追加撮像を行うことで撮像解析装置１００は、未検出部分を削減して、より精度の高い情報を表示することができる。 Figure 4 is a screen configuration diagram of the imaging analysis result screen 530 after additional imaging according to the first embodiment. The undetected region 513 (see Figure 2) is gone, and the steel sheet pile region 531 is correctly detected and displayed. The correct length is displayed in comment 532. By performing additional imaging in this way, the imaging analysis device 100 can reduce the undetected portion and display more accurate information.

≪制御部：追加撮像支援部≫
図１に戻って制御部１１０の説明を続ける。追加撮像支援部１１５は、未検出である領域５１３（図２参照）を撮像するのに適した位置・姿勢（図３記載のポインタ５２６参照）を算出する。追加撮像支援部１１５は、未検出の領域５１３である追加撮像箇所を正面から撮像できる位置・姿勢を算出して、ポインタ５２６として表示する。 ≪Control Unit: Additional Imaging Support Unit≫
Returning to Figure 1, let's continue the explanation of the control unit 110. The additional imaging support unit 115 calculates a suitable position and orientation (see pointer 526 in Figure 3) for imaging the undetected region 513 (see Figure 2). The additional imaging support unit 115 calculates a position and orientation that allows the additional imaging location, which is the undetected region 513, to be imaged from the front, and displays it as pointer 526.

なお追加撮像支援部１１５は、位置・姿勢を撮像装置に送信して、撮像者である作業員を誘導するようにしてもよい。撮像装置としては、ＬｉＤＡＲ、カメラ、衛星測位システムのアンテナ、角速度センサなどが備わる装置、例えばＬｉＤＡＲ付きのスマートフォンが望ましい。このような撮像装置が、自身の位置・姿勢を測定して、受信した位置・姿勢で撮像できるように作業員を誘導するようにしてもよい。 The additional imaging support unit 115 may also transmit its position and orientation to the imaging device to guide the worker who is performing the imaging. The imaging device should preferably be a device equipped with LiDAR, a camera, a satellite positioning system antenna, an angular velocity sensor, etc., such as a smartphone with LiDAR. Such an imaging device may also measure its own position and orientation and guide the worker so that imaging can be performed at the received position and orientation.

以上に説明したように撮像解析装置１００は、撮像対象物体において差分が生じている部分である追加撮像箇所を撮像するのに適した位置・姿勢を出力する追加撮像支援部１１５を備える。 As described above, the imaging analysis device 100 includes an additional imaging support unit 115 that outputs a suitable position and orientation for imaging the additional imaging area, which is the part of the object to be imaged where a difference has occurred.

≪撮像解析処理≫
図５は、第１実施形態に係る撮像解析処理のフローチャートである。撮像解析処理の開始時点で、撮像情報データベース１３０には撮像情報が格納済みであるとする。
ステップＳ１１において三次元モデル生成部１１２は、複数の三次元点群情報を基に、撮像対象物体を示す三次元メッシュを生成する。 <<Image Analysis Processing>>
Figure 5 is a flowchart of the imaging analysis process according to the first embodiment. At the start of the imaging analysis process, it is assumed that imaging information has already been stored in the imaging information database 130.
In step S11, the three-dimensional model generation unit 112 generates a three-dimensional mesh representing the object to be imaged based on multiple three-dimensional point cloud information.

ステップＳ１２において三次元モデル生成部１１２は、三次元メッシュ、画像、カメラ４１２の位置・姿勢を参照して、ＵＶマップとテクスチャを生成する。三次元メッシュ、ＵＶマップおよびテクスチャが生成されたことで三次元モデル（生成三次元モデル）が生成されたことになる。
ステップＳ１３において比較部１１３は、認識モデル１２１を用いて撮像対象物体（生成三次元モデル）のタイプを検出する。 In step S12, the 3D model generation unit 112 generates a UV map and texture by referring to the 3D mesh, image, and the position and orientation of the camera 412. Once the 3D mesh, UV map, and texture are generated, the 3D model (generated 3D model) is created.
In step S13, the comparison unit 113 uses the recognition model 121 to detect the type of the object to be imaged (generated three-dimensional model).

ステップＳ１４において比較部１１３は、認識モデル１２１を用いて個々の撮像対象物体を検出して、撮像対象物体の個数をカウントする。
ステップＳ１５において比較部１１３は、生成三次元モデル１２２を基に、撮像対象物体のサイズを算出する。 In step S14, the comparison unit 113 uses the recognition model 121 to detect individual objects to be imaged and counts the number of objects to be imaged.
In step S15, the comparison unit 113 calculates the size of the object to be imaged based on the generated three-dimensional model 122.

ステップＳ１６において比較部１１３は、生成三次元モデル１２２および参照三次元モデルの形状を比較して、破損や変形を検出する。
ステップＳ１７において表示制御部１１４は、撮像解析結果画面５１０（図２参照）をディスプレイに出力する。 In step S16, the comparison unit 113 compares the shapes of the generated three-dimensional model 122 and the reference three-dimensional model to detect damage or deformation.
In step S17, the display control unit 114 outputs the imaging analysis result screen 510 (see Figure 2) to the display.

ステップＳ１８において表示制御部１１４は、利用者である作業者の操作を受け付ける。操作が終了の指示であれば（ステップＳ１８→終了）撮像解析処理を終了する。操作が「修正」ボタンの押下であれば（ステップＳ１８→修正）表示制御部１１４は処理をステップＳ１９に進める。操作が「コメント追加」ボタン５１５の押下であれば（ステップＳ１８→コメント追加）表示制御部１１４は処理をステップＳ２０に進める。操作が「追加撮像」ボタン５１６の押下であれば（ステップＳ１８→追加撮像）表示制御部１１４は処理をステップＳ２１に進める。 In step S18, the display control unit 114 receives an operation from the user (operator). If the operation is an instruction to finish (step S18 → finish), the imaging analysis process is terminated. If the operation is the "correct" button pressed (step S18 → correct), the display control unit 114 proceeds to step S19. If the operation is the "add comment" button 515 pressed (step S18 → add comment), the display control unit 114 proceeds to step S20. If the operation is the "add image" button 516 pressed (step S18 → additional image), the display control unit 114 proceeds to step S21.

ステップＳ１９において表示制御部１１４は、作業者の指示に従ってコメント５１２，５１４を修正してステップＳ１７に戻る。
ステップＳ２０において表示制御部１１４は、作業者の指示に従って領域や当該領域に対するコメントを追加してステップＳ１７に戻る。 In step S19, the display control unit 114 modifies comments 512 and 514 according to the operator's instructions and returns to step S17.
In step S20, the display control unit 114 adds an area and a comment for that area according to the operator's instructions and returns to step S17.

ステップＳ２１において追加撮像支援部１１５は、未検出の領域５１３を撮像するのに適した位置・姿勢を算出し、ポインタ５２６（図３参照）として表示してステップＳ１１に戻る。ステップＳ１１，Ｓ１２において三次元モデル生成部１１２は、追加撮像した撮像情報を含めて新たに生成三次元モデルを生成する。 In step S21, the additional imaging support unit 115 calculates a suitable position and orientation for imaging the undetected region 513, displays it as a pointer 526 (see Figure 3), and returns to step S11. In steps S11 and S12, the three-dimensional model generation unit 112 generates a new three-dimensional model including the additionally acquired imaging information.

以上に説明したように三次元モデル生成部１１２は、追加撮像箇所を撮像した追加撮像情報と撮像情報とを基に、新たに生成三次元モデル１２２を生成する（ステップＳ２１，Ｓ１１，Ｓ１２参照）。 As explained above, the three-dimensional model generation unit 112 generates a new three-dimensional model 122 based on the additional imaging information obtained from capturing additional imaging locations and the imaging information (see steps S21, S11, and S12).

≪撮像解析装置の特徴≫
撮像解析装置１００は、撮像対象物体を写す複数の三次元点群情報および画像を基に三次元モデルを生成し、撮像対象物体のタイプやサイズ、個数などを算出する。このような撮像解析装置１００によれば、鋼矢板などの重資材を含む撮像対象物体のタイプの識別や個数カウント、破損状況の確認などを遠隔地から実行でき、例えば検収作業が効率化できる。または現場での作業が削減できるため、作業員の安全性が向上する。 Features of the imaging and analysis device
The imaging analysis device 100 generates a three-dimensional model based on multiple three-dimensional point cloud data and images of the object to be imaged, and calculates the type, size, and number of the object to be imaged. With such an imaging analysis device 100, it is possible to remotely identify the type of object to be imaged, count the number of objects, and check the damage status of heavy materials such as steel sheet piles, which can streamline inspection work, for example. Alternatively, it can reduce on-site work, thereby improving worker safety.

撮像解析装置１００は、建設作業の進捗状況確認にも利用できる。建築途中の建築物の三次元モデルを参照三次元モデルとして、生成三次元モデル１２２と比較することで、作業管理者は建築計画との差分を把握して進捗状況を確認することができる。 The imaging and analysis device 100 can also be used to check the progress of construction work. By comparing the three-dimensional model of the building under construction as a reference three-dimensional model with the generated three-dimensional model 122, the work manager can understand the differences from the construction plan and check the progress.

撮像解析装置１００は、撮像情報を基に生成した三次元モデルを用いたサイズの測定の他に、基準となる三次元モデル（参照三次元モデル）と比較して、サイズ／大きさや形状の差分を算出している。さらに撮像解析装置１００は、この差分の部分を未検出の領域とする。追加撮像が指示されると撮像解析装置１００は、当該領域を追加撮像するのに適した撮像位置・姿勢を算出して、追加撮像を支援する。 The imaging analysis device 100, in addition to measuring the size using a three-dimensional model generated based on the imaging information, calculates the difference in size/size and shape by comparing it with a reference three-dimensional model. Furthermore, the imaging analysis device 100 designates this difference as an undetected region. When additional imaging is instructed, the imaging analysis device 100 calculates the appropriate imaging position and orientation for additional imaging of the region, thereby supporting the additional imaging.

≪変形例：三次元点群情報≫
上記した第１実施形態において三次元点群情報は、ＬｉＤＡＲ４１１から取得したものである。これに限らず三次元点群情報は、画像情報を基に人工知能やステレオビジョンなどの技術を用いて生成／取得されたものであってもよい。 ≪Variation: Three-dimensional point cloud information≫
In the first embodiment described above, the three-dimensional point cloud information is obtained from LiDAR 411. However, the three-dimensional point cloud information is not limited to this and may be generated/obtained using technologies such as artificial intelligence or stereo vision based on image information.

≪変形例：三次元メッシュ生成≫
三次元メッシュの生成手法としてAlpha Shape法やPoisson法などを説明したが、他の手法を用いてもよい。ここでＬｉＤＡＲ４１１と同一の視点になるように深度カメラを設置したと仮定する。三次元点群から計算される深度画像をＬｉＤＡＲ４１１の設置数だけ生成し、ＴＳＤＦ（Truncated Signed Distance Function) などを用いて深度画像を統合することにより、三次元メッシュを生成してもよい。 <<Variation Example: 3D Mesh Generation>>
While methods such as the Alpha Shape method and the Poisson method have been described as three-dimensional mesh generation techniques, other methods may also be used. Here, we assume that depth cameras are installed so that they have the same viewpoint as the LiDAR411. Alternatively, depth images can be generated from the three-dimensional point cloud for each LiDAR411 installed, and the three-dimensional mesh can be generated by integrating these depth images using a TSDF (Truncated Signed Distance Function) or the like.

≪変形例：三次元情報の統合≫
複数の三次元点群情報をつなぎ合わせる手法としてＩＣＰを説明したが、他の手法を用いてもよい。複数のＬｉＤＡＲ４１１が取得した三次元点群情報からを基に、それぞれ三次元メッシュを生成して、各三次元メッシュを統合することによって１つの三次元メッシュを生成してもよい。各ＬｉＤＡＲ４１１で取得された三次元点群情報を統合することによって直接１つの三次元メッシュを生成してもよい。 <<Modification: Integration of three-dimensional information>>
While ICP was described as a method for combining multiple 3D point cloud data, other methods may also be used. Alternatively, a 3D mesh may be generated from the 3D point cloud data acquired by multiple LiDAR411s, and then a single 3D mesh may be generated by integrating these meshes. Alternatively, a single 3D mesh may be generated directly by integrating the 3D point cloud data acquired by each LiDAR411.

さらに他の手法として撮像解析装置１００は、各ＬｉＤＡＲ４１１が取得した三次元点群情報を基に、それぞれテクスチャ付き三次元メッシュを生成する。撮像解析装置１００は、テクスチャ付き三次元モデルの閲覧時に生成された複数のテクスチャ付き三次元メッシュを同時に描画することにより、１つの三次元モデルであるかのように表示してもよい。 Furthermore, as another method, the imaging analysis device 100 generates textured three-dimensional meshes based on the three-dimensional point cloud information acquired by each LiDAR 411. The imaging analysis device 100 may also display multiple textured three-dimensional meshes simultaneously when viewing the textured three-dimensional model, making them appear as a single three-dimensional model.

≪変形例：追加撮像≫
上記した実施形態において追加撮像支援部１１５は、追加撮像に適した撮像位置・姿勢を算出してポインタ５２６（図３参照）として表示している。追加撮像支援部１１５は、追加撮像をロボットに指示してもよい。追加撮像支援部１１５は、撮像装置を備えた自走式ロボットまたはドローンに、算出した撮像位置・姿勢を送信し、追加撮像を指示してもよい。ロボットは、追加撮像した撮像情報を撮像解析装置１００に送信する。 <<Variation: Additional imaging>>
In the embodiment described above, the additional imaging support unit 115 calculates an imaging position and orientation suitable for additional imaging and displays it as a pointer 526 (see Figure 3). The additional imaging support unit 115 may also instruct the robot to perform additional imaging. The additional imaging support unit 115 may transmit the calculated imaging position and orientation to a self-propelled robot or drone equipped with an imaging device and instruct it to perform additional imaging. The robot transmits the additionally captured imaging information to the imaging analysis device 100.

≪変形例：三次元モデル生成装置≫
撮像解析装置は、比較部１１３を備えない形態の三次元モデル生成装置であってもよい。このような撮像解析装置においては、さまざまな方向から撮像した三次元点群情報および画像を基に三次元モデル生成部１１２が、生成三次元モデル１２２を生成する。三次元モデル生成部１１２は、複数の画像を基にテクスチャを生成しており、撮像対象物体の色の再現性が高い。作業者は、この生成三次元モデル１２２をさまざまな視点から見ることで、破損を含めた撮像対象物体の状態を把握することができるようになる。 <<Variation Example: Three-Dimensional Model Generator>>
The imaging analysis device may be a three-dimensional model generation device that does not include a comparison unit 113. In such an imaging analysis device, the three-dimensional model generation unit 112 generates a generated three-dimensional model 122 based on three-dimensional point cloud information and images captured from various directions. The three-dimensional model generation unit 112 generates textures based on multiple images, and has high color reproduction accuracy of the object being imaged. By viewing this generated three-dimensional model 122 from various viewpoints, the operator can grasp the condition of the object being imaged, including any damage.

≪第２実施形態≫
第２実施形態に係る映像解析装置は、監視カメラの映像解析のみでリアルタイムに水位を計測するものである。これにより、水位を計測するためのメータの設置が不要であり、水位の計測計器や設備のメンテナンスが不要となり、かつオートキャリブレーションにより水位計測の経年誤差などを打ち消すことが可能である。さらにリアルタイムで水位を計測できると共に、コスト面と美観の面で優位である。 ≪Second Embodiment≫
The video analysis device according to the second embodiment measures the water level in real time solely by analyzing video from a surveillance camera. This eliminates the need to install meters for measuring the water level, eliminates the need for maintenance of water level measuring instruments and equipment, and allows for the cancellation of age-related errors in water level measurement through auto-calibration. Furthermore, it offers advantages in terms of cost and aesthetics, as well as the ability to measure the water level in real time.

図６は、第２実施形態に係る映像解析装置６１０を示す概略の構成図である。
映像解析装置６１０は、映像特徴量変換部６１２と、映像特徴量保持部６１３と、三次元データ特徴量変換部６１５と、三次元データ特徴量保持部６１６と、空間位置合わせ部６１７と、計測部６１８と、表示制御部６００とを含んで構成される。 Figure 6 is a schematic diagram showing the video analysis device 610 according to the second embodiment.
The video analysis device 610 is comprised of a video feature conversion unit 612, a video feature retention unit 613, a three-dimensional data feature conversion unit 615, a three-dimensional data feature retention unit 616, a spatial alignment unit 617, a measurement unit 618, and a display control unit 600.

三次元データ６１４は、ある対象物を、予めＬｉＤＡＲなどの三次元計測装置で計測したものである。映像データ６１１は、ＬｉＤＡＲなどで撮影した対象物と同一の対象物の映像データであり、二次元カメラで撮影したものである。映像データ６１１は、例えば複数の映像フレームが時系列に配列された動画ストリームであるが、静止画であってもよい。 The three-dimensional data 614 is obtained by pre-measuring an object using a three-dimensional measuring device such as LiDAR. The video data 611 is video data of the same object as the one captured by the LiDAR, and was captured using a two-dimensional camera. The video data 611 is, for example, a video stream with multiple video frames arranged chronologically, but it may also be a still image.

三次元データ特徴量変換部６１５は、入力された三次元データ６１４の複数の視点での透視投影により、二次元射影データを仮定して特徴量を変換した三次元特徴量を算出する。つまり、三次元データ特徴量変換部６１５は、対象物を三次元計測装置で計測した三次元データに対して複数の視点での透視投影により二次元を仮定して特徴量を抽出する。三次元データ特徴量変換部６１５は、対象物を計測した三次元データに含まれる法線から検出される面の境界情報を前記特徴量として抽出する。ここで対象物とは、例えば橋脚であり、橋脚の三次元データに含まれる法線から検出される面の境界情報を特徴量として抽出し、水位計測に活用する。なお、構造物の三次元データに含まれる法線から検出される面の境界情報は、構造物の形状のエッジ情報である。 The three-dimensional data feature transformation unit 615 calculates three-dimensional features by performing perspective projection of the input three-dimensional data 614 from multiple viewpoints, assuming two-dimensional projection data. In other words, the three-dimensional data feature transformation unit 615 extracts features from three-dimensional data obtained by measuring an object with a three-dimensional measuring device, assuming two dimensions through perspective projection from multiple viewpoints. The three-dimensional data feature transformation unit 615 extracts boundary information of surfaces detected from normals included in the three-dimensional data of the measured object as the aforementioned features. Here, the object is, for example, a bridge pier, and boundary information of surfaces detected from normals included in the three-dimensional data of the bridge pier is extracted as features and used for water level measurement. Note that the boundary information of surfaces detected from normals included in the three-dimensional data of a structure is edge information of the structure's shape.

三次元データ特徴量変換部６１５が算出した三次元データ特徴量は、三次元データ特徴量保持部６１６に保持される。三次元データ６１４は、最初に１度だけ計測されたものでよく、対象物を連続して計測する必要はない。また、既に三次元情報が存在すれば、そちらを利用してもよい。 The three-dimensional data features calculated by the three-dimensional data feature conversion unit 615 are stored in the three-dimensional data feature storage unit 616. The three-dimensional data 614 only needs to be measured once initially; it is not necessary to continuously measure the object. Furthermore, if three-dimensional information already exists, that information may be used.

映像特徴量変換部６１２は、入力された映像データ６１１の特徴量を変換した映像特徴量を算出する。つまり映像特徴量変換部６１２は、対象物をカメラで撮影した映像データの特徴量を抽出する。映像特徴量変換部６１２は、対象物を撮影した映像データ６１１のエッジ情報を特徴量として抽出する。ここで対象物とは例えば橋脚である。水位計測において、映像特徴量変換部６１２は、橋脚を撮影した映像中のエッジ情報を特徴量として抽出する。また、映像特徴量変換部６１２は、対象物を撮影した映像データ６１１からの特徴量の抽出にあたり、三次元データ６１４を用いるとよい。これにより、映像特徴量変換部６１２は、橋脚の特徴量を好適に抽出可能である。映像特徴量変換部６１２が算出した映像特徴量は、映像特徴量保持部６１３に保持される。
また映像特徴量変換部６１２は、時系列の異なる複数の映像フレームを用いて、特徴量を抽出するとよい。これにより、映像特徴量変換部６１２は、個々のフレームのノイズによらず特徴量を抽出可能である。さらに映像特徴量変換部６１２は、三次元計測装置の位置情報とカメラの位置情報を元に特徴量を抽出するとよい。これにより、映像特徴量変換部６１２は、三次元計測装置で好適に測定できる面がカメラからどのように撮影されるかを考慮しつつ、好適に特徴量を抽出できる。 The video feature conversion unit 612 calculates video features by converting the features of the input video data 611. In other words, the video feature conversion unit 612 extracts the features of video data in which an object is captured by a camera. The video feature conversion unit 612 extracts edge information from the video data 611 in which the object is captured as features. Here, the object is, for example, a bridge pier. In water level measurement, the video feature conversion unit 612 extracts edge information from the video in which the bridge pier is captured as features. Furthermore, the video feature conversion unit 612 may use three-dimensional data 614 when extracting features from the video data 611 in which the object is captured. This allows the video feature conversion unit 612 to suitably extract the features of the bridge pier. The video features calculated by the video feature conversion unit 612 are stored in the video feature storage unit 613.
Furthermore, the video feature conversion unit 612 may extract features using multiple video frames from different time series. This allows the video feature conversion unit 612 to extract features regardless of the noise in individual frames. Additionally, the video feature conversion unit 612 may extract features based on the position information of the three-dimensional measuring device and the position information of the camera. This allows the video feature conversion unit 612 to extract features appropriately while considering how surfaces that can be suitably measured by the three-dimensional measuring device are captured by the camera.

空間位置合わせ部６１７は、映像特徴量保持部６１３と三次元データ特徴量保持部６１６のデータを比較して、映像データ６１１に最も近い二次元射影データを決定する。つまり、空間位置合わせ部６１７は、三次元データ特徴量保持部６１６に保存された前記三次元データの特徴量と前記映像特徴量保持部に保存された前記映像データの特徴量とを比較して、前記映像データに最も近い画角となる二次元の射影データを決定する。
この比較に際しては、映像特徴量保持部６１３のデータは構造物のエッジだけではなく、影などにより映像上に発生したエッジも検出されている可能性があるため、三次元データ特徴量保持部６１６の特徴量の部分のみで比較を行うとよい。
空間位置合わせ部６１７は、三次元データ６１４を元に、所定の高さ以上での比較を行うとよい。所定の高さとは、例えば水面である。
また、この比較については、初めは大まかな視点や画角での二次元射影データを比較し、大体の視点や画角を絞り込んだのち、細かな視点や画角の変更で階層的に最も近い画角を算出するのが良い。 The spatial alignment unit 617 compares the data in the video feature storage unit 613 and the data in the three-dimensional data feature storage unit 616 to determine the two-dimensional projection data that is closest to the video data 611. In other words, the spatial alignment unit 617 compares the feature quantities of the three-dimensional data stored in the three-dimensional data feature storage unit 616 with the feature quantities of the video data stored in the video feature storage unit to determine the two-dimensional projection data that has the closest field of view to the video data.
For this comparison, since the data in the video feature retention unit 613 may detect not only the edges of structures but also edges that appear on the image due to shadows, etc., it is best to compare only the feature portion of the three-dimensional data feature retention unit 616.
The spatial alignment unit 617 should perform a comparison at a predetermined height or higher based on the three-dimensional data 614. The predetermined height is, for example, the water surface.
Furthermore, for this comparison, it is best to first compare two-dimensional projection data at a general viewpoint or field of view to narrow down the possible viewpoints and fields of view, and then calculate the closest field of view hierarchically by making finer changes to the viewpoint and field of view.

計測部６１８は、空間位置合わせ部６１７で決定した二次元射影データでの特徴量と、映像特徴量保持部が保存した映像データの特徴量とを比較することで、映像中の対象物のサイズを計測する。
表示制御部６００は、位置合わせ後の映像データおよび三次元データ、注意と危険の水位を三次元データに重畳して表示する。これにより、現在の水位が注意すべきものであること、または危険であることをユーザに示すことができる。 The measurement unit 618 measures the size of an object in the video by comparing the feature quantities of the two-dimensional projection data determined by the spatial alignment unit 617 with the feature quantities of the video data stored by the video feature quantity storage unit.
The display control unit 600 displays the aligned video data and three-dimensional data, and superimposes the water levels indicating caution and danger onto the three-dimensional data. This allows the user to see whether the current water level is something to be cautious about or whether it is dangerous.

具体的な計測方法として、計測部６１８は、映像データでの喫水線の特徴量と、三次元データ特徴量保持部６１６に格納された三次元データの特徴量とを比較することで、映像中の喫水線情報を三次元データの高さ情報に変換する。 As a specific measurement method, the measurement unit 618 compares the waterline feature quantities in the video data with the feature quantities of the three-dimensional data stored in the three-dimensional data feature quantity storage unit 616, thereby converting the waterline information in the video into height information in the three-dimensional data.

また、計測部６１８は、ある時の映像データの特徴量を基準として、異なる時刻の映像データの特徴量を比較して、その差異により水位を計測する。これは、例えば増水で河川の水位が増えた場合には、架橋などの構造物は水によって映像に映らなくなるオクルージョン領域が発生し、映像の特徴量として検出されなくなるためである。これを水位の高さ毎の特徴量のヒストグラムとして表現した場合には、基準としていた特徴量と、水位が増えた場合の特徴量ではその水位の増加部分でヒストグラムに大きな違いが発生することになる。計測部６１８は、この差異が発生する部分から水位の増加を計測して、基準値に対して高さを加味することで水位に変換する。この際、計測部６１８は、水位の決定にあたり、ヒストグラムの基準との比較を水位の高い方から行い、ある閾値より多く値が変化した箇所などを選択したり、周辺の高さと大きく傾向が異なるところを選択してもよい。これにより、監視カメラが撮影した映像データに基づいて、河川の水位情報をリアルタイムで算出できる。
また、計測部６１８は、水位を計測するため短い時間単位での計測を実施するが、その際、空間位置合わせ処理を毎回実施しなくてもよい。 Furthermore, the measurement unit 618 uses the feature quantities of the video data at a given time as a reference and compares the feature quantities of the video data at different times to measure the water level based on the difference. This is because, for example, when the water level of a river rises due to flooding, occlusion regions occur where structures such as bridges are obscured by water and are no longer detected as feature quantities in the video. If this is represented as a histogram of feature quantities for each water level, a large difference will occur in the histogram between the reference feature quantity and the feature quantity when the water level increases. The measurement unit 618 measures the increase in water level from the part where this difference occurs and converts it into a water level by adding the height to the reference value. In this case, when determining the water level, the measurement unit 618 may compare the histogram with the reference starting from the highest water level and select a point where the value has changed more than a certain threshold, or a point where the trend is significantly different from the surrounding heights. This makes it possible to calculate river water level information in real time based on video data captured by the surveillance camera.
Furthermore, the measuring unit 618 performs measurements in short time units to measure the water level, but it is not necessary to perform spatial alignment processing each time.

図７は、橋脚の映像データ例である。
橋脚は、河川に据え付けられており、構造物の形状や喫水面が橋脚の表面に現れている。この形状や喫水面の三次元データの高さ情報を算出することや、基準の映像データの特徴量との比較により、河川の水位情報を得ることができる。この構造物の形状情報を得るために、三次元データから面の表を示すベクトルである法線の状況により異なる面同士の境界線（エッジ）情報を検出し、こちらを特徴量とすることが好ましい。 Figure 7 shows an example of video data of a bridge pier.
Bridge piers are installed in rivers, and the shape of the structure and the waterline are visible on the surface of the piers. By calculating the height information of the three-dimensional data of this shape and waterline, and comparing it with the features of reference video data, river water level information can be obtained. To obtain the shape information of this structure, it is preferable to detect boundary line (edge) information between different surfaces from the three-dimensional data based on the status of the normal vectors, which are vectors that represent the surface of the surface, and use this as a feature.

図８は、第２実施形態に係る計測可視化処理のフローチャートである。
最初、三次元データ特徴量変換部６１５は、高精度の三次元データ６１４を取得する（ステップＳ４０）。そして、三次元データ特徴量変換部６１５は、この三次元データ６１４から、三次元データに含まれる法線から検出される面の境界情報を抽出する（ステップＳ４１）。
次に空間位置合わせ部６１７は、画像の特徴量を、三次元データに含まれる面の境界情報と対応付ける（ステップＳ４２）。計測部６１８は、対応付けた画像の特徴量のうち、二次元射影データで現れている特徴点が映像データで隠されているオクルージョンの差異を計測して高さ別マッチ量ヒストグラムを作成し（ステップＳ４３）、基準時とのヒストグラムと比較することで水位を特定すると（ステップＳ４４）、図８の処理を終了する。 Figure 8 is a flowchart of the measurement visualization process according to the second embodiment.
First, the three-dimensional data feature conversion unit 615 acquires high-precision three-dimensional data 614 (step S40). Then, the three-dimensional data feature conversion unit 615 extracts boundary information of surfaces detected from the normals contained in the three-dimensional data 614 (step S41).
Next, the spatial alignment unit 617 associates the image features with the boundary information of the surfaces included in the three-dimensional data (step S42). The measurement unit 618 measures the difference in occlusion among the associated image features, where feature points appearing in the two-dimensional projection data are hidden by the video data, and creates a height-specific match amount histogram (step S43). By comparing this with the histogram at the reference time, the water level is determined (step S44), and the process shown in Figure 8 is completed.

図９は、第２実施形態に係る映像解析装置６１０のデータフロー図である。
データの収集時には、ＬｉＤＡＲ７１０が対象物の三次元計測を実施して、点群７２１を生成する。そして、映像解析装置６１０は、メッシュ７２２を生成してＲａｙｃａｓｔ部７３０に引き渡す。Ｒａｙｃａｓｔ部７３０は、例えばゲームエンジンＵｎｉｔｙ（登録商標）のＲａｙｃａｓｔに該当し、ある特定のオブジェクトから透明な光線を出し、光線がぶつかった別のオブジェクトの座標を取得する処理を実行する。 Figure 9 is a data flow diagram of the video analysis device 610 according to the second embodiment.
During data collection, the LiDAR 710 performs three-dimensional measurement of the object and generates a point cloud 721. The video analysis device 610 then generates a mesh 722 and hands it over to the Raycast unit 730. The Raycast unit 730 corresponds to, for example, the Raycast of the game engine Unity (registered trademark), and performs a process that emits a transparent ray from a specific object and obtains the coordinates of another object that the ray hits.

カメラ設置時にＲａｙｃａｓｔ部７３０は、映像を撮影するカメラの位置と方向を示すカメラパラメータ７３１に基づき、メッシュ７２２で示されるオブジェクトから、カメラで対象物を撮影したときの映像データに最も近い二次元射影データを決定する。この二次元射影データは、仮想法線画像特徴データベース７４０に格納される。仮想法線画像特徴データベース７４０はさらに、ＬｉＤＡＲ／カメラ変換行列７４１とＲａｙｃａｓｔ解像度データ７４２を格納する。 When the camera is installed, the Raycast unit 730 determines the two-dimensional projection data closest to the image data when the camera captures the target object, based on camera parameters 731 indicating the position and direction of the camera that captures the image, from the objects represented by the mesh 722. This two-dimensional projection data is stored in the virtual normal image feature database 740. The virtual normal image feature database 740 further stores the LiDAR/camera transformation matrix 741 and Raycast resolution data 742.

運用時に監視カメラ７５０は、対象物を撮影してＲＧＢ画像７５１を生成する。映像解析装置６１０は、ＲＧＢ画像７５１から特徴データ７５２を生成し、特徴マッチング部７６０と特徴比較部７７０に引き渡す。
特徴マッチング部７６０は、仮想法線画像特徴データベース７４０に格納された二次元射影データの特徴点と、特徴データ７５２とを比較する。そして、特徴マッチング部７６０は、最一致の法線画像特徴７６１を出力する。特徴比較部７７０は、特徴データ７５２と、最一致の法線画像特徴７６１と、基準時の特徴量とを比較して、高さ別マッチ量ヒストグラム７７１を生成する。水位推定部７８０は、この高さ別マッチ量ヒストグラム７７１を判定することにより、現在の水位を推定する。 During operation, the surveillance camera 750 captures the target object and generates an RGB image 751. The video analysis device 610 generates feature data 752 from the RGB image 751 and passes it to the feature matching unit 760 and the feature comparison unit 770.
The feature matching unit 760 compares the feature points of the two-dimensional projection data stored in the virtual normal image feature database 740 with the feature data 752. The feature matching unit 760 then outputs the best-matching normal image feature 761. The feature comparison unit 770 compares the feature data 752, the best-matching normal image feature 761, and the feature quantities at the reference time to generate a height-specific match amount histogram 771. The water level estimation unit 780 estimates the current water level by determining this height-specific match amount histogram 771.

図１０は、第２実施形態に係る映像解析装置６１０の具体的なハードウェアの構成図である。
映像解析装置６１０は、例えばコンピュータであり、特徴処理プラグイン８３０と水位計測プラグイン８４０が組み込まれている。映像解析装置６１０はさらに、撮影情報データベース８８１と、三次元データ８８２と、画像特徴データベース８８３と、特徴マッチング処理結果８８４と、解析結果８８５とを格納している。
映像解析装置６１０には、監視カメラ８１０およびビデオ管理装置８２０と、ＬｉＤＡＲなどの三次元計測装置８５０と、初期データ生成装置８６０が接続されている。 Figure 10 is a diagram showing the specific hardware configuration of the video analysis device 610 according to the second embodiment.
The video analysis device 610 is, for example, a computer and incorporates a feature processing plugin 830 and a water level measurement plugin 840. The video analysis device 610 also stores an image capture information database 881, three-dimensional data 882, an image feature database 883, feature matching processing results 884, and analysis results 885.
The video analysis device 610 is connected to a surveillance camera 810, a video management device 820, a three-dimensional measurement device 850 such as LiDAR, and an initial data generation device 860.

監視カメラ８１０は、映像を撮影してビデオ管理装置８２０に引き渡す。ビデオ管理装置８２０は、監視カメラ８１０が撮影した映像を蓄積して、特徴処理プラグイン８３０と水位計測プラグイン８４０に画像データを出力する。 The surveillance camera 810 captures video footage and hands it over to the video management device 820. The video management device 820 stores the video footage captured by the surveillance camera 810 and outputs image data to the feature processing plugin 830 and the water level measurement plugin 840.

三次元計測装置８５０は、例えばＬｉＤＡＲであり、対象物を三次元計測して、撮影場所ＩＤを撮影情報データベース８８１に書き込み、メッシュデータと撮影場所ＩＤを三次元データ８８２に書き込む。
初期データ生成装置８６０は、仮想法線画像生成部８６１と特徴量抽出部８６２を含んで構成される。仮想法線画像生成部８６１は、撮影場所ＩＤとメッシュデータから法線画像を生成する。特徴量抽出部８６２は、法線画像から特徴画像と深度画像を生成する。初期データ生成装置８６０は、法線画像と特徴画像と深度画像を、画像特徴データベース８８３に格納する。 The three-dimensional measuring device 850 is, for example, a LiDAR, which measures the object in three dimensions, writes the shooting location ID to the shooting information database 881, and writes the mesh data and shooting location ID to the three-dimensional data 882.
The initial data generation device 860 includes a virtual normal image generation unit 861 and a feature extraction unit 862. The virtual normal image generation unit 861 generates a normal image from the shooting location ID and mesh data. The feature extraction unit 862 generates a feature image and a depth image from the normal image. The initial data generation device 860 stores the normal image, feature image, and depth image in the image feature database 883.

特徴処理プラグイン８３０は、特徴抽出部８３１と特徴マッチング部８３２を備えている。特徴抽出部８３１は、画像データから特徴を抽出して、特徴マッチング部８３２に出力する。特徴マッチング部８３２には、画像データの特徴が入力され、撮影情報データベース８８１から撮影場所ＩＤおよびカメラパラメータが入力される。 The feature processing plugin 830 comprises a feature extraction unit 831 and a feature matching unit 832. The feature extraction unit 831 extracts features from image data and outputs them to the feature matching unit 832. The feature matching unit 832 receives the features of the image data as input, along with the shooting location ID and camera parameters from the shooting information database 881.

特徴マッチング部８３２は、画像特徴データベース８８３に撮影場所ＩＤを引き渡して、撮影場所ＩＤに対応した特徴画像を読み取る。特徴マッチング部８３２は、画像データの特徴と特徴画像とをマッチングして、法線画像ＩＤを選択する。特徴マッチング部８３２は、法線画像ＩＤと撮影場所ＩＤを、特徴マッチング処理結果８８４に格納する。 The feature matching unit 832 passes the shooting location ID to the image feature database 883 and reads the feature image corresponding to the shooting location ID. The feature matching unit 832 matches the features of the image data with the feature image to select the normal image ID. The feature matching unit 832 stores the normal image ID and the shooting location ID in the feature matching processing result 884.

水位計測プラグイン８４０は、特徴抽出部８４１と水位計測部８４２を備えている。特徴抽出部８４１には、ビデオ管理装置８２０から画像データが入力される。特徴抽出部８４１は、法線画像ＩＤと画像データから特徴点を抽出して、水位計測部８４２に出力する。水位計測部８４２は、特徴点と、法線画像ＩＤと特徴画像と深度画像とから、水位を計測する。水位計測部８４２は、撮影場所ＩＤと撮影時間と画像データと水位とを、解析結果８８５に格納する。
解析結果表示装置８７０は、解析結果８８５から水位などを取得して表示するものである。これにより監視カメラ８１０の映像から水位を計測することが可能である。なお、本実施形態では、ＬｉＤＡＲにより事前に三次元データを取得する例であったが、既に取得されている三次元情報を活用する手法でもよい。 The water level measurement plug-in 840 includes a feature extraction unit 841 and a water level measurement unit 842. Image data is input to the feature extraction unit 841 from the video management device 820. The feature extraction unit 841 extracts feature points from the normal image ID and image data and outputs them to the water level measurement unit 842. The water level measurement unit 842 measures the water level from the feature points, the normal image ID, the feature image, and the depth image. The water level measurement unit 842 stores the shooting location ID, shooting time, image data, and water level in the analysis results 885.
The analysis result display device 870 acquires and displays the water level and other information from the analysis results 885. This makes it possible to measure the water level from the video feed of the surveillance camera 810. In this embodiment, three-dimensional data was acquired in advance using LiDAR, but methods that utilize already acquired three-dimensional information may also be used.

≪第３実施形態≫
第３実施形態に係る映像解析装置は、工事現場などを変換の歪なく、自由視点で俯瞰映像化するものである。これにより、工事の進捗などを容易に把握可能である。 ≪Third Embodiment≫
The video analysis device according to the third embodiment creates an overhead view of a construction site or similar location from a free viewpoint without distortion. This makes it easy to grasp the progress of construction work.

図１１は、第３実施形態に係る映像解析装置６１０Ａを示す概略の構成図である。
映像解析装置６１０Ａは、映像特徴量変換部６１２と、映像特徴量保持部６１３と、三次元データ特徴量変換部６１５と、三次元データ特徴量保持部６１６と、空間位置合わせ部６１７と、三次元データ生成部６１９とを含んで構成される。 Figure 11 is a schematic diagram showing the video analysis device 610A according to the third embodiment.
The video analysis device 610A is composed of a video feature conversion unit 612, a video feature retention unit 613, a three-dimensional data feature conversion unit 615, a three-dimensional data feature retention unit 616, a spatial alignment unit 617, and a three-dimensional data generation unit 619.

三次元データ６１４は、例えば工事現場などの対象物を、その都度ＬｉＤＡＲなどの三次元計測装置で計測したものである。映像データ６１１は、ＬｉＤＡＲなどで撮影した工事現場などの対象物と同一の対象物の映像であり、二次元カメラで撮影したものである。 The three-dimensional data 614 is obtained by measuring objects, such as those at a construction site, using a three-dimensional measuring device such as LiDAR. The video data 611 is video footage of the same objects as those captured by LiDAR, but this footage was taken with a two-dimensional camera.

三次元データ特徴量変換部６１５は、入力された三次元データ６１４の複数の視点での透視投影により、二次元射影データを仮定して特徴量を変換した三次元データ特徴量を算出する。三次元データ特徴量変換部６１５が算出した三次元データ特徴量は、三次元データ特徴量保持部６１６に保持される。三次元データ６１４は、工事が進捗する都度に取得される。 The three-dimensional data feature transformation unit 615 calculates three-dimensional data features by performing perspective projections of the input three-dimensional data 614 from multiple viewpoints, assuming two-dimensional projected data. The three-dimensional data features calculated by the three-dimensional data feature transformation unit 615 are stored in the three-dimensional data feature storage unit 616. The three-dimensional data 614 is acquired each time the construction progresses.

映像特徴量変換部６１２は、入力された映像データ６１１の特徴量を変換した映像特徴量を算出する。映像特徴量変換部６１２が算出した映像特徴量は、映像特徴量保持部６１３に保持される。 The video feature conversion unit 612 calculates video features by converting the features of the input video data 611. The video features calculated by the video feature conversion unit 612 are stored in the video feature storage unit 613.

空間位置合わせ部６１７は、映像特徴量保持部６１３と三次元データ特徴量保持部６１６のデータを比較して、映像データ６１１に最も近い二次元射影データを決定する。
三次元データ生成部６１９は、空間位置合わせ部６１７で決定した二次元射影データに基づいて、映像データのテクスチャを、三次元データにマッピングすることで、テクスチャマッピングされた三次元データを生成する。つまり、三次元データ生成部６１９は、空間位置合わせ部６１７が決定した二次元の射影データに対応する三次元データから生成したメッシュ情報に対して、映像データを紐づけている。この三次元データを、例えばゲームエンジンＵｎｉｔｙ（登録商標）などでレンダリングすることで、自由視点で対象物を俯瞰することができる。 The spatial alignment unit 617 compares the data from the video feature storage unit 613 and the three-dimensional data feature storage unit 616 to determine the two-dimensional projection data that is closest to the video data 611.
The three-dimensional data generation unit 619 generates texture-mapped three-dimensional data by mapping the texture of the video data onto the three-dimensional data based on the two-dimensional projection data determined by the spatial alignment unit 617. In other words, the three-dimensional data generation unit 619 associates video data with mesh information generated from the three-dimensional data corresponding to the two-dimensional projection data determined by the spatial alignment unit 617. By rendering this three-dimensional data with, for example, a game engine such as Unity (registered trademark), it becomes possible to view the object from a free viewpoint.

図１２は、変形例の映像解析装置６２０のデータフロー図である。
映像解析装置６２０は、テクスチャ抽出部６２２とメッシュデータ化部６２５と、視点変換レンダリング部６２６を備え、複数のカメラ６２１ａ，６２１ｂと、長距離ＬｉＤＡＲ６２３とポータブルＬｉＤＡＲ６２４とが接続されている。 Figure 12 is a data flow diagram of a modified image analysis device 620.
The video analysis device 620 includes a texture extraction unit 622, a mesh data conversion unit 625, and a viewpoint transformation rendering unit 626, and is connected to multiple cameras 621a, 621b, a long-range LiDAR 623, and a portable LiDAR 624.

複数のカメラ６２１ａ，６２１ｂは、それぞれ異なる視点に設置されており、複数の視点から対象物の映像を撮影する。長距離ＬｉＤＡＲ６２３は、対象物との距離が長い地点から、この対象物の三次元データを測定する。ポータブルＬｉＤＡＲ６２４は、対象物と近接した地点から、この対象物の三次元データを測定する。 Multiple cameras 621a and 621b are positioned at different viewpoints, capturing images of the object from multiple perspectives. The long-range LiDAR 623 measures the object's three-dimensional data from a point at a long distance. The portable LiDAR 624 measures the object's three-dimensional data from a point in close proximity.

メッシュデータ化部６２５は、対象物の三次元データを統合してメッシュデータ化する。メッシュデータ化部６２５が生成したメッシュデータは、テクスチャ抽出部６２２に出力される。 The mesh data generation unit 625 integrates the three-dimensional data of the object and converts it into mesh data. The mesh data generated by the mesh data generation unit 625 is output to the texture extraction unit 622.

テクスチャ抽出部６２２は、複数のカメラ６２１ａ，６２１ｂから映像の入力を受け付ける。そしてテクスチャ抽出部６２２は、三次元のメッシュデータを、カメラ６２１ａ，６２１ｂの各視点から見たときの二次元射影データに変換し、各二次元射影データと各映像とを対応付けて、メッシュデータに貼り付けるためのテクスチャを映像から抽出する。これにより、テクスチャ抽出部６２２は、テクスチャを貼り付けたメッシュデータを作成する。 The texture extraction unit 622 receives video input from multiple cameras 621a and 621b. The texture extraction unit 622 then converts the three-dimensional mesh data into two-dimensional projection data as viewed from each viewpoint of cameras 621a and 621b. It then associates each two-dimensional projection data with each video image and extracts textures from the video to be applied to the mesh data. In this way, the texture extraction unit 622 creates mesh data with the applied textures.

視点変換レンダリング部６２６は、例えばゲームエンジンＵｎｉｔｙ（登録商標）であり、テクスチャを貼り付けたメッシュデータを任意の視点でレンダリングする。これにより、所望の視点で対象物、例えば工事現場を観察可能である。
なお、本実施形態では、ＬｉＤＡＲにより三次元情報を逐次撮影しているが、既に取得されている三次元情報を活用する手法でもよい。既に取得されている三次元情報は、例えばＢＩＭ（Building Information Modeling）や、ＣＩＭ（Construction Information Modeling）等である。 The viewpoint transformation rendering unit 626 is, for example, the game engine Unity®, and renders textured mesh data from an arbitrary viewpoint. This makes it possible to observe an object, such as a construction site, from a desired viewpoint.
In this embodiment, three-dimensional information is captured sequentially using LiDAR, but methods that utilize already acquired three-dimensional information may also be used. Examples of already acquired three-dimensional information include BIM (Building Information Modeling) and CIM (Construction Information Modeling).

≪第４実施形態≫
第４実施形態に係る映像解析装置は、ＬｉＤＡＲで取得した三次元データを元に、スマートフォンなどの携帯端末で撮影するだけでその場で重仮設資材の長さや体積を計測するものである。映像解析装置は、携帯端末で撮影した映像から資材束の状況を把握し、横断面の長さ解析情報とのマッチング処理を実施する。これにより、例えば２０ｍの長さの資材を２０ｍ離れたところから測定して、１０ｃｍ刻みで資材の長さをリアルタイムで分類可能である。 ≪Fourth Embodiment≫
The video analysis device according to the fourth embodiment measures the length and volume of heavy temporary construction materials on the spot simply by taking a picture with a mobile device such as a smartphone, based on three-dimensional data acquired by LiDAR. The video analysis device grasps the condition of the material bundle from the video captured by the mobile device and performs matching processing with the length analysis information of the cross section. As a result, for example, it is possible to measure a material with a length of 20m from a distance of 20m and classify the length of the material in 10cm increments in real time.

第４実施形態に係る映像解析装置は、トラックの上の積載物を、一度に長さと体積の計測を実現する。これにより、従来一本ずつ手作業で荷台の上に登って実施した作業を安全かつ効率的に実施可能である。 The video analysis device according to the fourth embodiment enables the measurement of both length and volume of cargo on a truck in a single step. This allows for safe and efficient execution of tasks that were previously performed manually, one item at a time, by climbing onto the truck bed.

図１３は、第４実施形態に係る映像解析装置６１０Ｂを示す概略の構成図である。
映像解析装置６１０Ｂは、映像特徴量変換部６１２と、映像特徴量保持部６１３と、三次元データ特徴量変換部６１５と、三次元データ特徴量保持部６１６と、映像認識部６０１と、計測部６０２とを含んで構成される。 Figure 13 is a schematic diagram showing the video analysis device 610B according to the fourth embodiment.
The video analysis device 610B is comprised of a video feature conversion unit 612, a video feature retention unit 613, a three-dimensional data feature conversion unit 615, a three-dimensional data feature retention unit 616, a video recognition unit 601, and a measurement unit 602.

三次元データ６１４は、ある対象物を、予めＬｉＤＡＲなどの三次元計測装置で計測したものである。映像データ６１１は、ＬｉＤＡＲなどで撮影した対象物と同一の対象物の映像であり、二次元カメラで撮影したものである。 The three-dimensional data 614 is obtained by pre-measuring an object using a three-dimensional measuring device such as LiDAR. The video data 611 is an image of the same object as the one captured by the LiDAR, but it was captured using a two-dimensional camera.

三次元データ特徴量変換部６１５は、入力された三次元データ６１４の複数の視点での透視投影により、二次元射影データを仮定して特徴量を変換した三次元データ特徴量を算出する。三次元データ特徴量変換部６１５が算出した三次元データ特徴量は、三次元データ特徴量保持部６１６に保持される。三次元データ６１４は、最初に１度だけ計測されたものでよく、対象物を連続して計測する必要はない。 The three-dimensional data feature transformation unit 615 calculates three-dimensional data features by performing perspective projections of the input three-dimensional data 614 from multiple viewpoints, assuming two-dimensional projected data. The three-dimensional data features calculated by the three-dimensional data feature transformation unit 615 are stored in the three-dimensional data feature storage unit 616. The three-dimensional data 614 only needs to be measured once initially; it is not necessary to continuously measure the object.

映像認識部６０１は、映像データ６１１から対象物を画像認識し、対象資材の束か否かを認識し、対象物が重なっているときには、その個体の枚数または個数を認識する。そして、映像認識部６０１は、映像特徴量保持部６１３の特徴量データと三次元データ特徴量保持部６１６の特徴量データを比較して、映像データ６１１に最も近い二次元射影データを決定し、認識した対象物に対応する三次元データを認識する。 The image recognition unit 601 recognizes objects from the image data 611, determines whether it is a bundle of materials, and, if objects are overlapping, recognizes the number of individual pieces or items. Then, the image recognition unit 601 compares the feature data from the image feature storage unit 613 with the feature data from the three-dimensional data feature storage unit 616 to determine the two-dimensional projection data closest to the image data 611, and recognizes the three-dimensional data corresponding to the recognized object.

計測部６０２は、映像認識部６０１が認識した対象物の三次元データの範囲により、対象物のサイズを計測する。 The measurement unit 602 measures the size of the object based on the range of the three-dimensional data of the object recognized by the image recognition unit 601.

具体的にいうと、このサイズ計測とは、認識した対象物（例えば束の塊）の三次元データを基に縦・横・長さを計測し、そこから体積を算出したり、束塊に含まれる資材の個数や形状を映像より検知したりしてそれぞれ個別の長さの計測を実施する。 Specifically, this size measurement involves measuring the length, width, and height based on the three-dimensional data of the recognized object (for example, a bundle of materials), calculating the volume from that data, and detecting the number and shape of individual materials contained within the bundle from the video, thereby measuring the length of each individual material.

図１４は、第４実施形態に係る計測可視化処理のフローチャートである。
ユーザは最初、資材の側面からＬｉＤＡＲで三次元データを計測する（ステップＳ５０）。次にユーザは、資材の断面や概要をカメラで撮影する（ステップＳ５１）。計測した三次元データと撮影した映像データは、映像解析装置６１０Ｂに入力される。
映像解析装置６１０Ｂの映像認識部６０１は、映像データ６１１から資材束を画像認識する（ステップＳ５２）。 Figure 14 is a flowchart of the measurement visualization process according to the fourth embodiment.
The user first measures three-dimensional data from the side of the material using LiDAR (step S50). Next, the user takes pictures of the cross-section and outline of the material with a camera (step S51). The measured three-dimensional data and the captured video data are input to the video analysis device 610B.
The video recognition unit 601 of the video analysis device 610B recognizes the bundle of materials from the video data 611 (step S52).

映像解析装置６１０Ｂの映像認識部６０１は、映像データ６１１と三次元データ６１４のマッチング処理を実施して、資材束の状況を把握する（ステップＳ５３）。これにより、三次元データ６１４のうち資材束の部分を認識可能である。
さらに映像解析装置６１０Ｂの映像認識部６０１は、断面形状から対象の資材の本数を解析する（ステップＳ５４）。その後、映像解析装置６１０Ｂの計測部６０２は、資材の長手方向の長さを解析すると（ステップＳ５５）、図１４の処理は終了する。 The video recognition unit 601 of the video analysis device 610B performs a matching process between the video data 611 and the three-dimensional data 614 to understand the status of the material bundle (step S53). This makes it possible to recognize the portion of the three-dimensional data 614 that represents the material bundle.
Furthermore, the video recognition unit 601 of the video analysis device 610B analyzes the number of target materials from their cross-sectional shape (step S54). After that, the measurement unit 602 of the video analysis device 610B analyzes the length of the materials in the longitudinal direction (step S55), and the process in Figure 14 is completed.

≪その他変形例≫
以上、本発明のいくつかの実施形態や変形例について説明したが、これらの実施形態は、例示に過ぎず、本発明の技術的範囲を限定するものではない。本発明はその他のさまざまな実施形態を取ることが可能であり、さらに、本発明の要旨を逸脱しない範囲で、省略や置換など種々の変更を行うことができる。これら実施形態やその変形は、本明細書などに記載された発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 <<Other variations>>
Although several embodiments and modifications of the present invention have been described above, these embodiments are merely illustrative and do not limit the technical scope of the present invention. The present invention can take on a variety of other embodiments, and furthermore, various modifications such as omissions and substitutions can be made without departing from the spirit of the invention. These embodiments and their variations are included in the scope and spirit of the invention as described herein and elsewhere, as well as in the scope of the invention and its equivalents as described in the claims.

≪ハードウェア構成≫
上記した実施形態に係る撮像解析装置１００や映像解析装置６１０，６１０Ａ，６１０Ｂ，６２０は、例えば図１５に示すような構成のコンピュータ９００によって実現される。図１５は、上記した実施形態に係る撮像解析装置１００や映像解析装置６１０，６１０Ａ，６１０Ｂ，６２０の機能を実現するコンピュータ９００の一例を示すハードウェア構成図である。コンピュータ９００は、ＣＰＵ９０１、ＲＯＭ９０２、ＲＡＭ９０３、ＳＳＤ９０４、入出力インターフェイス９０５（図１５では入出力Ｉ／Ｆ（Interface）と記載）、通信インターフェイス９０６（図１５では通信Ｉ／Ｆと記載）およびメディアインターフェイス９０７（図１５ではメディアＩ／Ｆと記載）を備える。コンピュータ９００は、ＳＳＤ９０４の替わりにＨＤＤ（Hard Disc Drive）を備えてもよいし、ＳＳＤ９０４に加えて、さらにＨＤＤを備えてもよい。 Hardware Configuration
The imaging analysis device 100 and video analysis devices 610, 610A, 610B, and 620 according to the above-described embodiment are implemented by a computer 900 having a configuration such as that shown in Figure 15. Figure 15 is a hardware configuration diagram showing an example of a computer 900 that implements the functions of the imaging analysis device 100 and video analysis devices 610, 610A, 610B, and 620 according to the above-described embodiment. The computer 900 includes a CPU 901, ROM 902, RAM 903, SSD 904, input/output interface 905 (labeled as input/output I/F (Interface) in Figure 15), communication interface 906 (labeled as communication I/F in Figure 15), and media interface 907 (labeled as media I/F in Figure 15). The computer 900 may be equipped with an HDD (Hard Disc Drive) instead of the SSD 904, or it may be equipped with an HDD in addition to the SSD 904.

ＣＰＵ９０１は、ＲＯＭ９０２またはＳＳＤ９０４に記憶されたプログラムに基づいて作動し、図１の制御部１１０による制御を行う。ＲＯＭ９０２は、コンピュータ９００の起動時にＣＰＵ９０１により実行されるブートプログラムや、コンピュータ９００のハードウェアに係るプログラムなどを記憶する。 The CPU 901 operates based on programs stored in the ROM 902 or SSD 904, and is controlled by the control unit 110 shown in Figure 1. The ROM 902 stores boot programs executed by the CPU 901 when the computer 900 starts up, as well as programs related to the computer 900's hardware.

ＣＰＵ９０１は、入出力インターフェイス９０５を介して、マウスやキーボードなどの入力装置９１０、およびディスプレイやプリンタなどの出力装置９１１を制御する。ＣＰＵ９０１は、入出力インターフェイス９０５を介して、入力装置９１０からデータを取得するとともに、生成したデータを出力装置９１１へ出力する。 The CPU 901 controls input devices 910, such as a mouse and keyboard, and output devices 911, such as a display and printer, via the input/output interface 905. The CPU 901 acquires data from the input devices 910 and outputs the generated data to the output devices 911 via the input/output interface 905.

ＳＳＤ９０４は、ＣＰＵ９０１により実行されるプログラムおよび当該プログラムによって使用されるデータなどを記憶する。通信インターフェイス９０６は、通信網を介して不図示の他の装置（例えば、ＬｉＤＡＲ４１１，７１０やカメラ４１２，６１２ａ、６１２ｂ）からデータを受信してＣＰＵ９０１へ出力し、また、ＣＰＵ９０１が生成したデータを、通信網を介して他の装置へ送信する。 The SSD 904 stores programs executed by the CPU 901 and data used by those programs. The communication interface 906 receives data from other devices (e.g., LiDAR 411, 710 and cameras 412, 612a, 612b) via the communication network and outputs it to the CPU 901. It also transmits data generated by the CPU 901 to other devices via the communication network.

メディアインターフェイス９０７は、記録媒体９１２に格納されたプログラムまたはデータを読み取り、ＲＡＭ９０３を介してＣＰＵ９０１へ出力する。ＣＰＵ９０１は、プログラムを、メディアインターフェイス９０７を介して記録媒体９１２からＲＡＭ９０３上にロードし、ロードしたプログラムを実行する。記録媒体９１２は、ＤＶＤ（Digital Versatile Disk）などの光学記録媒体、ＭＯ（Magneto Optical disk）などの光磁気記録媒体、磁気記録媒体、導体メモリテープ媒体または半導体メモリなどである。 The media interface 907 reads programs or data stored in the recording medium 912 and outputs them to the CPU 901 via the RAM 903. The CPU 901 loads the program from the recording medium 912 onto the RAM 903 via the media interface 907 and executes the loaded program. The recording medium 912 can be an optical recording medium such as a DVD (Digital Versatile Disk), a magneto-optical recording medium such as an MO (Magneto Optical Disk), a magnetic recording medium, a conductive memory tape medium, or a semiconductor memory.

例えば、コンピュータ９００が上記した実施形態に係る撮像解析装置１００や映像解析装置６１０，６１０Ａ，６１０Ｂ，６２０として機能する場合、コンピュータ９００のＣＰＵ９０１は、ＲＡＭ９０３上にロードされたプログラム１２８（図１参照）を実行することにより、撮像解析装置１００や映像解析装置６１０，６１０Ａ，６１０Ｂ，６２０の機能を実現する。ＣＰＵ９０１は、プログラムを記録媒体９１２から読み取って実行する。この他、ＣＰＵ９０１は、他の装置から通信網を介してプログラムを読み込んでもよいし、記録媒体９１２からＳＳＤ９０４にプログラム１２８をインストールして実行してもよい。 For example, when the computer 900 functions as the imaging analysis device 100 or video analysis devices 610, 610A, 610B, and 620 according to the above-described embodiment, the CPU 901 of the computer 900 realizes the functions of the imaging analysis device 100 or video analysis devices 610, 610A, 610B, and 620 by executing the program 128 (see Figure 1) loaded onto the RAM 903. The CPU 901 reads and executes the program from the recording medium 912. Alternatively, the CPU 901 may read the program from another device via a communication network, or it may install and execute the program 128 from the recording medium 912 onto the SSD 904.

１００撮像解析装置（映像解析装置）
１１１撮像取得部
１１２三次元モデル生成部
１１３比較部
１１４表示制御部
１１５追加撮像支援部
１２１認識モデル
１２２生成三次元モデル
１２８プログラム
１３０撮像情報データベース
１４０撮像装置情報データベース
１５０物体情報データベース
１６０参照三次元モデルデータベース（参照三次元モデル）
５１０，５２０，５３０撮像解析結果画面
６１０，６１０Ａ，６１０Ｂ映像解析装置（撮像解析装置）
６１１映像データ
６１２映像特徴量変換部
６１３映像特徴量保持部
６１４三次元データ
６１５三次元データ特徴量変換部
６１６三次元データ特徴量保持部
６１７空間位置合わせ部
６１８計測部
６２０映像解析装置
６２２テクスチャ抽出部
６２５メッシュデータ化部
６２６視点変換レンダリング部
６２１ａカメラ
６２１ｂカメラ
６２３長距離ＬｉＤＡＲ
６２４ポータブルＬｉＤＡＲ
７１０ＬｉＤＡＲ
６１９三次元データ生成部
７２１点群
７２２メッシュ
７３０Ｒａｙｃａｓｔ部
７３１カメラパラメータ
７４０仮想法線画像特徴データベース
７４１ＬｉＤＡＲ／カメラ変換行列
７４２Ｒａｙｃａｓｔ解像度データ
７５０監視カメラ
７５１ＲＧＢ画像
７５２特徴データ
７６０特徴マッチング部
７７０特徴比較部
７６１法線画像特徴
７７１高さ別マッチ量ヒストグラム
７８０水位推定部
８３０特徴処理プラグイン
８４０水位計測プラグイン
８８１撮影情報データベース
８８２三次元データ
８８３画像特徴データベース
８１０監視カメラ
８２０ビデオ管理装置
８５０三次元計測装置
８６０初期データ生成装置
８６１仮想法線画像生成部
８６２特徴量抽出部
８３１特徴抽出部
８３２特徴マッチング部
８４１特徴抽出部
８４２水位計測部
８８５解析結果
８７０解析結果表示装置 100 Imaging and analysis device (video analysis device)
111 Image acquisition unit 112 Three-dimensional model generation unit 113 Comparison unit 114 Display control unit 115 Additional imaging support unit 121 Recognition model 122 Generated three-dimensional model 128 Program 130 Imaging information database 140 Imaging device information database 150 Object information database 160 Reference three-dimensional model database (reference three-dimensional model)
510, 520, 530 Image analysis result screen 610, 610A, 610B Image analysis device (image analysis device)
611 Video data 612 Video feature conversion unit 613 Video feature retention unit 614 Three-dimensional data 615 Three-dimensional data feature conversion unit 616 Three-dimensional data feature retention unit 617 Spatial alignment unit 618 Measurement unit 620 Video analysis device 622 Texture extraction unit 625 Mesh data conversion unit 626 Viewpoint conversion rendering unit 621a Camera 621b Camera 623 Long-range LiDAR
624 Portable LiDAR
710 LiDAR
619 Three-dimensional data generation unit 721 Point cloud 722 Mesh 730 Raycast unit 731 Camera parameters 740 Virtual normal image feature database 741 LiDAR/camera transformation matrix 742 Raycast resolution data 750 Surveillance camera 751 RGB image 752 Feature data 760 Feature matching unit 770 Feature comparison unit 761 Normal image features 771 Height-specific match amount histogram 780 Water level estimation unit 830 Feature processing plugin 840 Water level measurement plugin 881 Shooting information database 882 Three-dimensional data 883 Image feature database 810 Surveillance camera 820 Video management device 850 Three-dimensional measurement device 860 Initial data generation device 861 Virtual normal image generation unit 862 Feature quantity extraction unit 831 Feature extraction unit 832 Feature matching unit 841 Feature extraction unit 842 Water level measurement unit 885 Analysis result 870 Analysis result display device

Claims

A three-dimensional model generation unit generates a three-dimensional model of the object being imaged based on imaging information,
A comparison unit compares a reference three-dimensional model, which is a three-dimensional model of an object to be compared with the generated three-dimensional model and is also a three-dimensional model of an object to be compared with the object to be imaged, with the generated three-dimensional model and calculates the difference.
An imaging analysis device comprising: an additional imaging support unit that outputs a position and orientation suitable for imaging the additional imaging location, which is the part of the object to be imaged in which the difference occurs.

The three-dimensional model generation unit,
The imaging analysis apparatus according to claim 1, which generates a new three-dimensional model based on the additional imaging information obtained by imaging the additional imaging location and the imaging information.

The aforementioned imaging information is,
Multiple three-dimensional point cloud information and images of the object to be imaged, captured by multiple imaging devices,
The generated three-dimensional model is,
A textured 3D mesh is a 3D mesh with a texture applied to it.
The three-dimensional model generation unit,
Referencing the positions and orientations of the multiple imaging devices,
The imaging and analysis apparatus according to claim 1, which generates the three-dimensional mesh and the texture, and generates a UV map that represents the correspondence between the three-dimensional mesh and the texture.

The three-dimensional model generation unit,
The pixels included in the aforementioned texture
The imaging analysis device according to claim 3, which calculates based on images captured by a plurality of imaging devices, according to the distance between a point on the object to be imaged corresponding to a pixel and the imaging device.

The three-dimensional model generation unit,
The imaging analysis apparatus according to claim 4, wherein the pixels are calculated based on an image captured by the imaging device with the minimum distance.

The three-dimensional model generation unit,
The imaging analysis apparatus according to claim 4, wherein the pixels are calculated based on an image captured by the imaging device, weighted according to the distance.

The three-dimensional model generation unit,
The pixels included in the aforementioned texture
The imaging analysis device according to claim 3, which takes the average of images captured by a plurality of imaging devices corresponding to a point in the object to be imaged that corresponds to the pixel in question.

The aforementioned texture is
The imaging analysis apparatus according to claim 4 or 7, comprising an image captured by the imaging device and an image composed of pixels calculated by the three-dimensional model generation unit.

The aforementioned texture is
The imaging and analysis apparatus according to claim 3, which includes an image corresponding to the UV map.

The aforementioned imaging information is,
The three-dimensional point cloud information and image of the object to be imaged are as follows:
The generated three-dimensional model is,
A textured 3D mesh is a 3D mesh with a texture applied to it.
The comparison unit is,
The type of the object to be imaged is detected based on the image or texture.
The imaging analysis apparatus according to claim 1, which compares the reference three-dimensional model and the generated three-dimensional model corresponding to the type of detection result and calculates the difference in at least one of size and shape.

The comparison unit is,
The imaging analysis apparatus according to claim 10, which counts the number of objects to be imaged.

Computers,
A three-dimensional model generation unit generates a three-dimensional model of the object being imaged based on imaging information,
A comparison unit compares a reference three-dimensional model, which is a three-dimensional model of an object to be compared with the generated three-dimensional model and is also a three-dimensional model of an object to be compared with the object to be imaged, with the generated three-dimensional model and calculates the difference.
A program for functioning as an imaging analysis device, comprising: an additional imaging support unit that outputs a suitable position and orientation for imaging the additional imaging location, which is the part of the object to be imaged where the difference occurs.

The imaging and analysis device,
The steps include generating a three-dimensional model, which is a three-dimensional model of the object being imaged, based on the imaging information, and
A step of comparing the generated three-dimensional model with a reference three-dimensional model, which is a three-dimensional model of an object that is to be compared with the object to be imaged, and the generated three-dimensional model, and calculating the difference.
An imaging analysis method comprising the steps of: outputting a suitable position and orientation for imaging the additional imaging location, which is the part of the object to be imaged where the difference occurs.