JP7770656B2

JP7770656B2 - Method and system for estimating CCTV camera attitude and object coordinates based on precise road map

Info

Publication number: JP7770656B2
Application number: JP2023207155A
Authority: JP
Inventors: グチェ、イン; ファンキム、ソン; ヒョンパク、チョル; チャンキム、ギ; ジュンキム、ドク
Original assignee: Individual
Current assignee: Individual
Priority date: 2022-12-12
Filing date: 2023-12-07
Publication date: 2025-11-17
Anticipated expiration: 2043-12-07
Also published as: JP2024084136A; KR102605696B1

Description

本発明は精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定方法およびシステムに関する。 The present invention relates to a method and system for estimating CCTV camera pose and object coordinates based on a precise road map.

現在高速道路脇には交通状況をモニタリングするために数千個のＣＣＴＶが設置されており、都心部の場合にも多くの交差点でＣＣＴＶを設置して交通モニタリング用として活用しており、また、最近ではＡＩ技術の発展により、車両、人などに対する認識およびこれを利用した停止車両、歩行者、逆走車などに対する認識が可能な状況に発展している。 Currently, thousands of CCTVs have been installed along expressways to monitor traffic conditions, and even in urban areas, CCTVs are installed at many intersections and used for traffic monitoring. In addition, recent advances in AI technology have made it possible to recognize vehicles and people, and to use this information to recognize stopped vehicles, pedestrians, and vehicles driving in the wrong direction.

しかし、このように危険状況が自動で認識された場合にも、オペレータによる肉眼判断を通じて位置を概略的に推定するか、緯度、経度、高度のグローバル座標系で位置を推定できない場合が大部分であり、辛うじて推定するとしても誤差が大きく現れるなどの問題で情報共有に困難があるのが現在の状況である。 However, even when dangerous situations are automatically recognized, the operator must make a rough estimate of the location through visual judgment, or in most cases the location cannot be estimated using a global coordinate system of latitude, longitude, and altitude. Even when an estimate is made, there are problems such as large errors, making it difficult to share information.

特に、現在まではほぼ殆どのＣＣＴＶでオペレータによる肉眼モニタリングを通じて危険状況を認知し、同様に肉眼判断によって概略的な位置を推定した後に情報を共有することによって、情報生成に所要する時間が過多であり、正確度が顕著に落ちるためマッピングを通じての共有が難しい限界がある。 In particular, until now, most CCTVs have required operators to recognize dangerous situations through visual monitoring, and then share information after roughly estimating the location through visual judgment. This takes an excessive amount of time to generate information, and the accuracy is significantly reduced, making it difficult to share information through mapping.

韓国公開特許公報第１０－２０２１－００５０９９７号Korean Patent Publication No. 10-2021-0050997

本発明では肉眼による概略的な位置推定、グローバル座標推定の困難、推定するとしても正確度が顕著に低いという点などの問題点を解決するために、ＣＣＴＶカメラに撮影された映像内の客体や区域に対する自動化された位置座標推定技術とこれを利用した自動マッピングに関する技術を提供しようとする。 In order to solve problems such as the difficulty of rough position estimation using the naked eye, the difficulty of estimating global coordinates, and the significantly low accuracy of estimation even if it is done, the present invention aims to provide an automated position coordinate estimation technology for objects and areas in images captured by CCTV cameras and a technology related to automatic mapping using this technology.

また、固定式カメラの他にも、随時回転するＰＴＺカメラに対しても適用が可能な技術を開発して適用範囲を最大化する。 In addition to fixed cameras, we will also develop technology that can be applied to rotating PTZ cameras, maximizing the range of applications.

特に、最近主要道路を基準として、政府で構築中の精密道路地図（ＨＤＭａｐ）を活用して容易にＣＣＴＶカメラのカメラ姿勢を自動で推定し、推定されたカメラ姿勢を利用した座標推定およびマッピング技術を提案し、特に座標推定結果に対する誤差を確認してその結果を利用して位置誤差とカメラ姿勢を自動で補正する方法を提供する。 In particular, we propose a method to automatically estimate the camera orientation of CCTV cameras using the precision road map (HD Map) that the government is currently constructing based on major roads, and then use the estimated camera orientation to propose coordinate estimation and mapping technology. In particular, we provide a method to check errors in the coordinate estimation results and use the results to automatically correct position errors and camera orientation.

したがって、精密道路地図の構築が全国的に完了すれば、本発明を適用して、ＡＩによって認識された客体や危険区域などに対して自動で正確な位置を推定し、自動でマッピングして危険情報を共有することができる。 Therefore, once the construction of precise road maps is completed nationwide, this invention can be applied to automatically estimate the precise location of objects and dangerous areas recognized by AI, automatically map them, and share danger information.

前記課題を解決するための本発明の一実施例に係る精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定方法は、（ａ）ＣＣＴＶカメラの撮影映像での路面客体と精密道路地図をマッチングさせてカメラ姿勢を推定する段階；（ｂ）前記撮影映像内でマッピング対象客体を選択する段階；および（ｃ）前記推定されたカメラ姿勢に基づいて前記撮影映像内で選択されたマッピング対象客体の座標を推定する段階を含むものの、前記カメラ姿勢は前記ＣＣＴＶカメラの位置（Ｘ、Ｙ、Ｚ）、パン（ｐａｎ）、チルト（ｔｉｌｔ）に対する情報を含むことを特徴とする。 To solve the above problem, one embodiment of the present invention provides a method for estimating CCTV camera pose and object coordinates based on a precise road map, including: (a) matching road objects in an image captured by a CCTV camera with a precise road map to estimate the camera pose; (b) selecting an object to be mapped within the captured image; and (c) estimating the coordinates of the object to be mapped selected within the captured image based on the estimated camera pose, wherein the camera pose includes information on the position (X, Y, Z), pan, and tilt of the CCTV camera.

前記（ａ）段階は、前記撮影映像で前記路面客体の特徴点のピクセル座標を抽出する段階；前記路面客体を精密道路地図とマッチングさせて前記路面客体の特徴点の空間座標を抽出する段階；および前記抽出された路面客体の特徴点のピクセル座標と空間座標の関係からＰｎＰ（Ｐｅｒｓｐｅｃｔｉｖｅ－ｎ－Ｐｏｉｎｔ）を利用して前記カメラ姿勢を推定する段階を含むことができる。 Step (a) may include extracting pixel coordinates of feature points of the road surface object from the captured image; matching the road surface object with a precise road map to extract spatial coordinates of the feature points of the road surface object; and estimating the camera pose using Perspective-n-Point (PnP) from the relationship between the pixel coordinates and spatial coordinates of the extracted feature points of the road surface object.

前記ＣＣＴＶカメラがＰＴＺカメラである場合、前記（ａ）段階は、前記推定されたカメラ姿勢のパン、チルト値と前記ＰＴＺカメラから提供されたパン、チルト値の差から前記ＰＴＺカメラから提供されたパンおよびチルト値を真北と垂直方向を基準とした絶対値に変換する段階；および前記ＰＴＺカメラから提供されたズーム値の逆数だけ前記撮影映像を縮小してズーム値が「０」である撮影映像に変換する段階をさらに含むことができる。 If the CCTV camera is a PTZ camera, step (a) may further include converting the pan and tilt values provided by the PTZ camera into absolute values based on true north and the vertical direction based on the difference between the pan and tilt values of the estimated camera attitude and the pan and tilt values provided by the PTZ camera; and reducing the captured image by the reciprocal of the zoom value provided by the PTZ camera to convert it into a captured image with a zoom value of "0".

前記（ａ）段階は、（ａ１）前記推定されたカメラ姿勢を利用して前記精密道路地図を前記撮影映像に投影するか、前記撮影映像の客体を前記推定されたカメラ姿勢を利用して前記精密道路地図に逆投影する段階；（ａ２）互いに対応する前記撮影映像の客体および前記精密道路地図の客体の各特徴点のピクセル誤差を算出する段階；および（ａ３）前記算出されたピクセル誤差が臨界値以上である場合、前記推定されたカメラ姿勢を補正して前記算出されたピクセル誤差を臨界値未満に減らす段階をさらに含むことができる。 Step (a) may further include: (a1) projecting the precise road map onto the captured image using the estimated camera pose, or back-projecting an object in the captured image onto the precise road map using the estimated camera pose; (a2) calculating pixel errors of corresponding feature points of the object in the captured image and the object in the precise road map; and (a3) correcting the estimated camera pose to reduce the calculated pixel error to less than the critical value if the calculated pixel error is greater than or equal to a critical value.

前記（ｂ）段階で、前記マッピング対象客体は、前記撮影映像内で、ＡＩ（ａｒｔｉｆｉｃｉａｌｉｎｔｅｌｌｉｇｅｎｃｅ）に基づいて予め設定された規則に沿って自動で認識されて抽出された区域や客体、または使用者によって前記撮影映像内で手動で選択された区域や客体であり得る。 In step (b), the mapping target object may be an area or object automatically recognized and extracted from the captured image according to pre-set rules based on AI (artificial intelligence), or an area or object manually selected from the captured image by a user.

前記（ｂ）段階は、前記マッピング対象客体の各特徴点を前記推定されたカメラ姿勢により前記精密道路地図に逆投影する段階；および逆投影された前記マッピング対象客体の各特徴点の空間座標を前記精密道路地図から算出する段階を含むことができる。 Step (b) may include back-projecting each feature point of the mapping target object onto the precise road map using the estimated camera pose; and calculating the spatial coordinates of each back-projected feature point of the mapping target object from the precise road map.

したがって、本発明の一実施例に係るＣＣＴＶカメラ姿勢および客体座標推定方法およびシステムは、測量や基準点設定などの現場での作業なしに精密道路地図を利用する簡単な方法を通じてＣＣＴＶカメラ姿勢（Ｐｏｓｅ）を推定できる方法を提示することによって座標推定やマッピングを自動化することができ、また、既存の方法と比較して高い正確度と時間および費用削減などの効果を創り出すことができる。 Therefore, the CCTV camera pose and object coordinate estimation method and system according to one embodiment of the present invention provides a method for estimating the CCTV camera pose through a simple method using a precision road map without on-site work such as surveying or setting control points, thereby automating coordinate estimation and mapping, and achieving benefits such as higher accuracy and time and cost savings compared to existing methods.

また、固定式ＣＣＴＶだけでなくＰＴＺカメラを利用する場合にも適用が可能であり、座標推定およびマッピング過程を自動化することができ、精密道路地図上で座標を推定することによって簡単で正確度を向上させることができ、ＡＩを利用して車線などの路面客体の認識が可能である場合だけでなくこのような機能がない場合にも適用が可能である。 It can also be applied to fixed CCTVs and PTZ cameras, automating the coordinate estimation and mapping process and estimating coordinates on a precision road map, simplifying and improving accuracy. It can be applied not only when AI is available to recognize road objects such as lanes, but also when such functionality is not available.

また、座標推定結果やカメラ姿勢に対する誤差を、すべて推定過程で確認して自動で補正することによって正確度の高い推定が可能であり、路面客体の認識が可能である場合、全過程を自動化して便利性を増大することができる。 In addition, errors in coordinate estimation results and camera attitude are checked and automatically corrected during the estimation process, enabling highly accurate estimation. If road surface objects can be recognized, the entire process can be automated, increasing convenience.

本発明の一実施例に係る精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定方法およびシステムのネットワーク構成図である。1 is a network configuration diagram of a method and system for estimating CCTV camera pose and object coordinates based on a precise road map according to an embodiment of the present invention; 図１に図示された情報処理サーバーの細部構成図である。FIG. 2 is a detailed configuration diagram of the information processing server shown in FIG. 1; 本発明の一実施例に係る精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定方法を説明したフローチャートである。1 is a flowchart illustrating a method for estimating a CCTV camera pose and object coordinates based on a precise road map according to an embodiment of the present invention. 図３に図示されたカメラ姿勢推定段階の細部フローチャートである。4 is a detailed flowchart of the camera pose estimation step shown in FIG. 3. 撮影映像に精密道路地図を投影した後、微細調整要領を説明するための例示図である。10 is an exemplary diagram illustrating a fine adjustment procedure after projecting a precise road map onto a captured image. FIG. 図３に図示された座標推定段階乃至メッセージ生成配布段階を説明したフローチャートである。4 is a flowchart illustrating steps from a coordinate estimation step to a message generation and distribution step shown in FIG. 3; 危険区域に隣接した車線指定を通じての姿勢補正およびマッピング誤差補正の概念を説明するための例示図である。10 is an exemplary diagram illustrating the concept of posture correction and mapping error correction through lane designation adjacent to a dangerous area; FIG.

本発明に関する説明は構造的乃至機能的説明のための実施例に過ぎないので、本発明の権利範囲は本文に説明された実施例によって制限されるものと解釈されない。すなわち、実施例は多様な変更が可能であり多様な形態を有することができるため、本発明の権利範囲は技術的思想を具現できる均等物を含むものである。また、本発明で提示された目的または効果は、特定の実施例がこれをすべて含まなければならないとかそのような効果のみを含むべきであるという意味ではないので、本発明の権利範囲はこれによって制限されるものと理解されない。 The description of the present invention is merely an example for the purpose of structural or functional explanation, and therefore the scope of the present invention should not be construed as being limited by the example described herein. In other words, since the example may be modified in various ways and have various forms, the scope of the present invention includes equivalents that embody the technical concept. Furthermore, the objectives or effects presented in the present invention do not mean that a particular example must include all of these or only these effects, and therefore the scope of the present invention should not be construed as being limited thereby.

一方、本発明で叙述される用語の意味は次のように理解される。「第１」、「第２」などの用語は一つの構成要素を他の構成要素から区別するためのもので、これらの用語によって権利範囲が限定されてはならない。例えば、第１構成要素は第２構成要素と命名され得、同様に第２構成要素も第１構成要素と命名され得る。或る構成要素が他の構成要素に「連結されて」いると言及された時には、その他の構成要素に直接的に連結されてもよいが、中間に他の構成要素が存在してもよいと理解される。反面、或る構成要素が他の構成要素に「直接連結されて」いると言及された時には、中間に他の構成要素が存在しないものと理解される。一方、構成要素の間の関係を説明する他の表現、すなわち「～間に」と「すぐに～間に」または「～に隣り合う」と「～に直接隣り合う」なども同様に解釈される。単数の表現は文脈上明白に異なって意味しない限り複数の表現を含むものと理解されるべきであり、「含む」または「有する」などの用語は説示された特徴、数字、段階、動作、構成要素、部分品またはこれらを組み合わせたものが存在することを指定しようとするものであって、一つまたはそれ以上の他の特徴や数字、段階、動作、構成要素、部分品またはこれらを組み合わせたものなどの存在または付加の可能性を予め排除しないものと理解される。 Meanwhile, the meanings of the terms used in this invention shall be understood as follows: Terms such as "first" and "second" are used to distinguish one component from another, and shall not limit the scope of rights. For example, a first component may be designated as a second component, and similarly, a second component may be designated as a first component. When a component is referred to as being "connected" to another component, it is understood that it may be directly connected to the other component, but that there may be other components between them. Conversely, when a component is referred to as being "directly connected" to another component, it is understood that there are no other components between them. Meanwhile, other expressions describing the relationship between components, such as "between" and "immediately between," or "adjacent to" and "directly adjacent to," shall be interpreted similarly. Singular expressions should be understood to include plural expressions unless the context clearly indicates otherwise, and terms such as "include" or "have" are intended to specify the presence of stated features, numbers, steps, operations, components, parts, or combinations thereof, and are understood not to preclude the possibility of the presence or addition of one or more other features, numbers, steps, operations, components, parts, or combinations thereof.

以下、添付された図面に基づいて本発明の一実施例に係る精密道路地図基盤の道路脇のＣＣＴＶ姿勢推定、ＡＩ認識客体座標推定および危険区域マッピング自動化システムおよび方法をより詳細に説明することにする。 Hereinafter, a system and method for estimating roadside CCTV posture based on a precise road map, estimating AI-recognized object coordinates, and automating dangerous area mapping based on a precise road map, according to one embodiment of the present invention, will be described in more detail with reference to the accompanying drawings.

図１は、本発明の一実施例に係る精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定システムのネットワーク構成図である。図２は、図１に図示された情報処理サーバーの細部構成図である。 Figure 1 is a network configuration diagram of a system for estimating CCTV camera pose and object coordinates based on a precise road map according to one embodiment of the present invention. Figure 2 is a detailed configuration diagram of the information processing server shown in Figure 1.

まず、図１に図示された通り、本発明の一実施例に係る精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定システム１００はＣＣＴＶカメラ１０および情報処理サーバー２０を含み、追加にオペレータ端末３０をさらに含むことができる。 First, as shown in FIG. 1, a precise road map-based CCTV camera pose and object coordinate estimation system 100 according to one embodiment of the present invention includes a CCTV camera 10 and an information processing server 20, and may further include an operator terminal 30.

各構成はネットワークで通信すれば、前記ネットワークは複数の端末およびサーバーのようなそれぞれのノード相互間に情報交換が可能な連結構造を意味するもので、このようなネットワークの一例にはＲＦ、３ＧＰＰ（登録商標）（３ｒｄＧｅｎｅｒａｔｉｏｎＰａｒｔｎｅｒｓｈｉｐＰｒｏｊｅｃｔ）ネットワーク、ＬＴＥ（ＬｏｎｇＴｅｒｍＥｖｏｌｕｔｉｏｎ）ネットワーク、５ＧＰＰ（５ｔｈＧｅｎｅｒａｔｉｏｎＰａｒｔｎｅｒｓｈｉｐＰｒｏｊｅｃｔ）ネットワーク、ＷＩＭＡＸ（ＷｏｒｌｄＩｎｔｅｒｏｐｅｒａｂｉｌｉｔｙｆｏｒＭｉｃｒｏｗａｖｅＡｃｃｅｓｓ）ネットワーク、インターネット（Ｉｎｔｅｒｎｅｔ）、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）、ＷｉｒｅｌｅｓｓＬＡＮ（ＷｉｒｅｌｅｓｓＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）、ＰＡＮ（ＰｅｒｓｏｎａｌＡｒｅａＮｅｔｗｏｒｋ）、ブルートゥース（登録商標）（Ｂｌｕｅｔｏｏｔｈ）ネットワーク、ＮＦＣネットワーク、衛星放送ネットワーク、アナログ放送ネットワーク、ＤＭＢ（ＤｉｇｉｔａｌＭｕｌｔｉｍｅｄｉａＢｒｏａｄｃａｓｔｉｎｇ）ネットワークなどが含まれるがこれに限定されはしない。 When each component communicates over a network, the network refers to a connection structure that allows information exchange between nodes such as multiple terminals and servers. Examples of such networks include RF, 3GPP (registered trademark) (3rd Generation Partnership Project) network, LTE (Long Term Evolution) network, 5GPP (5th Generation Partnership Project) network, WIMAX (World Interoperability for Microwave Access) network, Internet, LAN (Local Area Network), WirelessLAN (Wireless Local Area Network), and WAN (Wide Area Network). Network), PAN (Personal Area Network), Bluetooth (registered trademark) network, NFC network, satellite broadcasting network, analog broadcasting network, DMB (Digital Multimedia Broadcasting) network, etc., but are not limited to these.

下記において、少なくとも一つのという用語は単数および複数を含む用語と定義され、少なくとも一つのという用語が存在しなくても各構成要素が単数または複数で存在することができ、単数または複数を意味することができることは自明であると言える。また各構成要素が単数または複数で備えられるのは実施例により変更可能であると言える。 In the following, the term "at least one" is defined as a term that includes both singular and plural, and it is clear that even if the term "at least one" is not present, each component can exist in singular or plural, and can mean singular or plural. Furthermore, it can be said that whether each component is provided in singular or plural can vary depending on the embodiment.

前記ＣＣＴＶカメラ１０はＰＴＺ（ｐａｎｔｉｌｔｚｏｏｍ）カメラまたは固定式カメラであり得、道路脇に設置されたカメラであり得る。ここで、ＣＣＴＶカメラがＰＴＺカメラである場合、カメラの位置（Ｘ、Ｙ、Ｚ）、Ｐａｎ、Ｔｉｌｔ、Ｚｏｏｍに対する情報を提供する装置であり得る。 The CCTV camera 10 may be a PTZ (pan tilt zoom) camera or a fixed camera installed on the side of the road. Here, if the CCTV camera is a PTZ camera, it may be a device that provides information on the camera's position (X, Y, Z), pan, tilt, and zoom.

次に、情報処理サーバー２０はＣＣＴＶカメラ１０が撮影した撮影映像と精密道路地図（ＨｉｇｈＤｅｆｉｎｉｔｉｏｎＭａｐ、ＨＤＭａｐ）を利用してＣＣＴＶカメラのカメラ姿勢（ｐｏｓｅ：Ｘ、Ｙ、Ｚ、Ｐａｎ、Ｔｉｌｔ、Ｚｏｏｍ）を推定する。ＣＣＴＶカメラが固定式である場合、臨界値以上の誤差が発見される前まで推定されたカメラ姿勢が利用され得、ＣＣＴＶカメラがＰＴＺカメラである場合、パン、チルト、ズームが変更されるたびにカメラ姿勢が推定され得る。 Next, the information processing server 20 estimates the camera pose (X, Y, Z, Pan, Tilt, Zoom) of the CCTV camera using the image captured by the CCTV camera 10 and a high definition road map (High Definition Map, HD Map). If the CCTV camera is fixed, the estimated camera pose may be used until an error exceeding a critical value is detected. If the CCTV camera is a PTZ camera, the camera pose may be estimated each time the pan, tilt, or zoom is changed.

また、情報処理サーバー２０はＣＣＴＶカメラが撮影した撮像映像内でＡＩ（ａｒｔｉｆｉｃｉａｌｉｎｔｅｌｌｉｇｅｎｃｅ）に基づいて予め設定された規則に沿って自動で認識されて抽出された客体や区域、または使用者によって撮影映像内で手動で選択された区域や客体をマッピング対象客体として設定する。 In addition, the information processing server 20 sets as mapping target objects objects or areas automatically recognized and extracted from the captured image captured by the CCTV camera according to pre-set rules based on AI (artificial intelligence), or areas or objects manually selected by the user from the captured image.

マッピング対象客体はＡＩ（ＡｒｔｉｆｉｃｉａｌＩｎｔｅｌｌｉｇｅｎｃｅ）基盤で規則に符合する移動客体や危険区域がマッピング対象客体として認識される場合もあれば、オペレータが肉眼で認識した客体や危険区域がマッピング対象客体として選択される場合もある。 Mapping targets may be moving objects or dangerous areas that conform to rules based on AI (Artificial Intelligence), or they may be objects or dangerous areas that an operator recognizes with the naked eye and are selected as mapping targets.

情報処理サーバー２０はＡＩで自動認識されるか、オペレータによって選択された移動客体や危険区域などのマッピング対象客体に対して精密道路地図を基盤としてマッピング対象客体の特徴点の座標を自動で推定するものの、その過程で精密道路地図を利用して予め推定した位置誤差およびカメラ姿勢を補正した後、危険の種類、走行不可領域に対する面単位情報などが含まれたメッセージを生成し、Ｖ２Ｘ、移動通信などを利用して配布する機能を遂行する構成であり得る。 The information processing server 20 may be configured to automatically estimate the coordinates of characteristic points of mapping targets, such as moving objects and dangerous areas, that are automatically recognized by AI or selected by an operator, based on a precise road map. In the process, the information processing server 20 may correct pre-estimated position errors and camera attitudes using the precise road map, and then generate a message containing surface-level information on the type of danger and prohibited areas, and distribute the message using V2X, mobile communications, etc.

次に、オペレータ端末３０は精密道路地図内に危険区域を設定した設定情報を情報処理サーバー２０に提供する構成であり得る。 Next, the operator terminal 30 may be configured to provide the information processing server 20 with setting information that sets dangerous areas within the precise road map.

より具体的には、前記情報処理サーバー２０はカメラ姿勢推定部２１、マッピング対象客体設定部２２、座標推定部２３、補正部２４およびメッセージ生成配布部２５のうち少なくとも一つを含むことができる。 More specifically, the information processing server 20 may include at least one of a camera pose estimation unit 21, a mapping target object setting unit 22, a coordinate estimation unit 23, a correction unit 24, and a message generation and distribution unit 25.

カメラ姿勢推定部２１は、ＣＣＴＶカメラの撮影映像での路面客体とそれに対応する精密道路地図での客体をマッチングさせてカメラ姿勢を推定する役割を遂行する。ここで、カメラ姿勢はＣＣＴＶカメラの位置（Ｘ、Ｙ、Ｚ）、パン（ｐａｎ）、チルト（ｔｉｌｔ）に対する情報を含み、追加にズーム（Ｚｏｏｍ）に対する情報をさらに含むことができる。 The camera posture estimation unit 21 performs the role of estimating the camera posture by matching road surface objects in the image captured by the CCTV camera with the corresponding objects in the precise road map. Here, the camera posture includes information on the CCTV camera's position (X, Y, Z), pan, and tilt, and may further include information on zoom.

カメラ姿勢推定部２１は、撮影映像で路面客体の特徴点のピクセル座標を抽出し、該当路面客体を精密道路地図とマッチングさせて該当路面客体の各特徴点の空間座標を抽出し、抽出された路面客体各特徴点のピクセル座標と空間座標の関係からＰｎＰ（Ｐｅｒｓｐｅｃｔｉｖｅ－ｎ－Ｐｏｉｎｔ）を利用してカメラ姿勢を推定することができる。 The camera posture estimation unit 21 extracts pixel coordinates of feature points of road surface objects in the captured image, matches the road surface objects with a precise road map to extract spatial coordinates of each feature point of the road surface objects, and estimates the camera posture using PnP (Perspective-n-Point) based on the relationship between the pixel coordinates and spatial coordinates of each extracted feature point of the road surface objects.

ＰｎＰ（Ｐｅｒｓｐｅｃｔｉｖｅ－ｎ－Ｐｏｉｎｔ）とは、３個以上の３Ｄ点が２Ｄ平面に投影される点の位置関係からカメラ姿勢を推定する方式で、本発明で路面客体で３個以上の特徴点を抽出し、該当特徴点のピクセル座標と空間座標の関係からＰｎＰ（Ｐｅｒｓｐｅｃｔｉｖｅ－ｎ－Ｐｏｉｎｔ）を利用してカメラ姿勢を推定することができる。 PnP (Perspective-n-Point) is a method of estimating camera posture based on the relative positions of three or more 3D points projected onto a 2D plane. In this invention, three or more feature points are extracted from road surface objects, and the camera posture can be estimated using PnP (Perspective-n-Point) based on the relationship between the pixel coordinates of the feature points and the spatial coordinates.

一方、本発明でいう特徴点とは、映像内に存在する客体をそれぞれ区分できる特徴を示すことができる点を意味し、角点や色相または明度で周辺と区分される点であり得る。 Meanwhile, in the present invention, a feature point refers to a point that can exhibit characteristics that can distinguish objects present in an image, and can be a corner point or a point that can be distinguished from its surroundings by hue or brightness.

ＣＣＴＶカメラが固定である場合、カメラの位置（Ｘ、Ｙ、Ｚ）、パン（ｐａｎ）、チルト（ｔｉｌｔ）、ズーム（Ｚｏｏｍ）が変更されないので、初期に推定されたカメラ姿勢を引き続き利用することができる。もちろん固定カメラである場合にも、多様な理由でカメラが揺れたり初期推定されたカメラ姿勢に誤差があり得るので、後述した補正部２４が多様な過程を通じてカメラ姿勢に対する補正を遂行することになる。 If the CCTV camera is fixed, the camera position (X, Y, Z), pan, tilt, and zoom do not change, so the initially estimated camera posture can continue to be used. Of course, even in the case of a fixed camera, the camera may shake for various reasons or there may be errors in the initially estimated camera posture, so the correction unit 24, described below, performs corrections to the camera posture through various processes.

ＣＣＴＶカメラがＰＴＺカメラである場合、パン、チルトおよびズームが遂行されない限りその姿勢が維持され得るのであるため、初期推定されたカメラ姿勢が引き続き利用され得る．パン、チルト、ズームのうち少なくとも一つの動作が遂行されれば該当動作値が獲得され得るので、カメラ姿勢推定部２１は初期カメラ姿勢にカメラから提供されるＰＴＺ（ＰａｎＴｉｌｔＺｏｏｍ）値を考慮して変更されたカメラ姿勢を獲得することができる。カメラの設置位置は固定されているので、補正が必要でない限りカメラの位置（Ｘ、Ｙ、Ｚ）は引き続き維持される。 If the CCTV camera is a PTZ camera, the initial estimated camera posture can continue to be used because the posture can be maintained unless pan, tilt, or zoom is performed. If at least one of pan, tilt, and zoom operations is performed, the corresponding operation value can be obtained, so the camera posture estimation unit 21 can obtain a modified camera posture by considering the PTZ (PanTilt Zoom) value provided by the camera in addition to the initial camera posture. Since the camera installation position is fixed, the camera position (X, Y, Z) continues to be maintained unless correction is required.

一方、ＣＣＴＶカメラがＰＴＺカメラである場合、カメラ姿勢推定部２１は、推定されたカメラ姿勢のパン、チルト値とＰＴＺカメラから提供されたパン、チルト値の差から、ＰＴＺカメラから提供されたパンおよびチルト値を真北と垂直方向を基準とした絶対値に変換する過程を通じて今後ＰＴ値を容易に獲得することができる。またＣＣＴＶカメラがＰＴＺカメラである場合、カメラ姿勢推定部２１は、ＰＴＺカメラから提供されたズーム値の逆数だけ前記撮影映像を縮小してズーム値が「０」である撮影映像に変換する過程を遂行できる。 On the other hand, if the CCTV camera is a PTZ camera, the camera attitude estimation unit 21 can easily obtain the PT value from the difference between the pan and tilt values of the estimated camera attitude and the pan and tilt values provided by the PTZ camera by converting the pan and tilt values provided by the PTZ camera into absolute values based on true north and the vertical direction. Also, if the CCTV camera is a PTZ camera, the camera attitude estimation unit 21 can perform a process of reducing the captured image by the reciprocal of the zoom value provided by the PTZ camera to convert it into a captured image with a zoom value of "0".

一方、補正部２４はカメラ姿勢に対する情報および／またはマッピング対象客体の座標に対する情報を補正することができる。 Meanwhile, the correction unit 24 can correct information regarding the camera pose and/or information regarding the coordinates of the object to be mapped.

補正部２４は精密道路地図を推定されたカメラ姿勢を利用して撮影映像に投影し、撮影映像の客体およびそれに対応する精密道路地図の客体の各特徴点のピクセル誤差を算出し、算出されたピクセル誤差が臨界値以上である場合、カメラ姿勢を補正し前記の過程を繰り返すことによってピクセル誤差を臨界値未満に軽減することができる。 The correction unit 24 projects a precise road map onto the captured image using the estimated camera pose, calculates the pixel error of each feature point of the object in the captured image and the corresponding object in the precise road map, and if the calculated pixel error is above a critical value, corrects the camera pose and repeats the above process to reduce the pixel error to below the critical value.

もちろん、補正部２４は前記過程を逆にする方法で姿勢を補正することもできるが、撮影映像の客体を推定されたカメラ姿勢を利用して精密道路地図に逆投影し、逆投影された撮影映像の客体およびそれに対応する精密道路地図の客体の各特徴点の位置誤差（座標誤差）を算出し、その算出された位置誤差が臨界値以上である場合、カメラ姿勢を補正し前記の過程を繰り返すことによってピクセル誤差を臨界値未満に軽減することができる。 Of course, the correction unit 24 can correct the pose by reversing the above process, but it can also back-project the objects in the captured image onto a precise road map using the estimated camera pose, calculate the position error (coordinate error) of each feature point of the back-projected objects in the captured image and the corresponding objects in the precise road map, and if the calculated position error is greater than or equal to a critical value, correct the camera pose and repeat the above process to reduce the pixel error to less than the critical value.

補正部２４はピクセル誤差および位置誤差（座標誤差）のうち少なくとも一つを算出し、カメラ姿勢を微細補正して該当誤差を臨界値未満に軽減する過程を繰り返すことによってカメラ姿勢と位置誤差を最小化することができる。 The correction unit 24 can minimize camera attitude and position errors by calculating at least one of pixel error and position error (coordinate error) and repeating the process of finely correcting the camera attitude to reduce the corresponding error below a critical value.

本発明全体において補正部２４の補正過程は、ｉ）精密道路地図を推定されたカメラ姿勢を利用して撮影映像に投影した後、精密道路地図の客体とそれに対応する映像客体間のピクセル誤差を利用して補正する方式、およびｉｉ）撮影映像を推定されたカメラ姿勢を利用して精密道路地図に逆投影した後、精密道路地図の客体とそれに対応する映像客体間の座標誤差を利用して補正する方式のうちいずれか一つの方式が利用され得る。前記ｉ）とｉｉ）の補正する過程は逆の関係であり、その補正する方式と結果が同一であるので、二つのうちいずれか一つの方式を使うか二つの方式をすべて使ってもよい。 In the present invention as a whole, the correction process of the correction unit 24 may be one of the following: i) a method of projecting a precise road map onto the captured image using the estimated camera attitude, and then correcting using pixel errors between objects on the precise road map and corresponding image objects; or ii) a method of back-projecting the captured image onto the precise road map using the estimated camera attitude, and then correcting using coordinate errors between objects on the precise road map and corresponding image objects. The correction processes i) and ii) are inversely related, and the correction methods and results are the same, so either one of the two methods or both of them may be used.

マッピング対象客体設定部２２は撮影映像内でＡＩ（ａｒｔｉｆｉｃｉａｌｉｎｔｅｌｌｉｇｅｎｃｅ）に基づいて予め設定された規則に沿って自動で認識されて抽出された客体や区域、または使用者（オペレータ）により撮影映像内で手動で選択された客体や区域をマッピング対象客体として設定する。 The mapping target object setting unit 22 sets, as mapping target objects, objects or areas that are automatically recognized and extracted from the captured image based on AI (artificial intelligence) in accordance with pre-set rules, or objects or areas that are manually selected from the captured image by a user (operator).

マッピング対象客体設定部２２は車線などの路面客体以外に、ディープラーニングなどのＡＩ基盤で認識した車両や人などの客体の座標を精密道路地図を基盤として自動で推定するか、またはＡＩＤ（ＡｕｔｏｍａｔｉｃＩｎｃｉｄｅｎｔＤｅｔｅｃｔｉｏｎ）を利用して自動で認識した危険要素または該当危険要素が占める道路上の区域を自動でマッピング対象客体としてマッピングする構成であり得る。 The mapping target object setting unit 22 may be configured to automatically estimate the coordinates of objects such as vehicles and people recognized based on AI such as deep learning, in addition to road surface objects such as lanes, based on a precise road map, or to automatically map automatically recognized hazardous elements or areas on the road occupied by the hazardous elements using AID (Automatic Incident Detection) as mapping target objects.

車両、人などの認識客体またはＡＩＤで認識した危険区域をマッピングする動作は次の通りである。 The operation of mapping recognized objects such as vehicles and people, or dangerous areas recognized by AID, is as follows:

車線などの路面客体の特徴点（特徴点３つ以上）のピクセル座標を抽出し、路面客体以外のマッピング対象客体（ＡＩＤ基盤危険区域を含む）を認識し、以後、マッピング対象客体と路面客体の特徴点を精密道路地図に逆投影するかまたは逆投影された路面客体とそれに対応する精密道路地図の同一客体間の位置誤差を算出し、前記精密道路地図を利用してマッピング対象の特徴点に対する空間座標を算出し、マッピング対象の特徴点に対する空間座標を補正する。以後、マッピング対象の特徴点をイメージに再投影した後、最初に設定された点を基準として再投影された特徴点別ピクセル誤差を算出し、前記ピクセル誤差が臨界値より小さければ、精密道路地図上にマッピング対象を表記（位置、属性）した後、メッセージを自動生成および表出する過程で動作される。 The system extracts pixel coordinates of feature points (three or more feature points) of road surface objects such as lanes and recognizes mapping targets other than road surface objects (including AID-based hazardous areas). Then, it back-projects the mapping target objects and feature points of the road surface objects onto a precise road map, or calculates the position error between the back-projected road surface objects and the corresponding objects on the precise road map. It then calculates the spatial coordinates of the mapping target feature points using the precise road map and corrects the spatial coordinates of the mapping target feature points. Then, it re-projects the mapping target feature points onto the image, and calculates the pixel error for each re-projected feature point based on the initially set point. If the pixel error is smaller than a critical value, it displays the mapping target (position, attributes) on the precise road map, and automatically generates and displays a message.

ここで、ピクセル誤差が臨界値以上であれば、特徴点別位置誤差を利用して姿勢（Ｐａｎ、Ｔｉｌｔ）を補正することができる。 Here, if the pixel error is above a critical value, the pose (pan, tilt) can be corrected using the position error for each feature point.

また、マッピング対象客体設定部２２はオペレータが設定または指定した客体や危険区域をマッピング対象客体として設定することができる。このために、マッピング対象客体設定部２２は車線や矢印などの精密道路地図上の路面客体を認識し、認識された路面客体の特徴点（特徴点３つ以上）のピクセル座標を抽出する。 In addition, the mapping target object setting unit 22 can set objects or dangerous areas set or designated by the operator as mapping target objects. To this end, the mapping target object setting unit 22 recognizes road surface objects on the precision road map, such as lanes and arrows, and extracts pixel coordinates of feature points (three or more feature points) of the recognized road surface objects.

オペレータから撮影映像内危険区域（特徴点３つ以上）の設定および属性が入力されると、後述する座標推定部２３は危険区域および路面客体の特徴点を精密道路地図に逆投影し、精密道路地図を利用して危険区域の特徴点の空間座標を算出することができる。 When the operator inputs the settings and attributes of a danger zone (three or more feature points) in the captured image, the coordinate estimation unit 23, described below, back-projects the danger zone and feature points of road surface objects onto a precise road map and can calculate the spatial coordinates of the feature points of the danger zone using the precise road map.

一方、補正部２４は逆投影路面客体と精密道路地図の同一客体間の位置誤差を算出し、以後、位置誤差を反映した危険区域の特徴点の空間座標を補正し、危険区域の特徴点を撮影映像に再投影した後、最小設定された点を基準として再投影された特徴点別ピクセル誤差を算出する。算出が完了し、前記ピクセル誤差が臨界値より小さければ、メッセージ生成配布部２５は精密道路地図上に危険区域（位置、属性）を表記し、メッセージを自動生成および表出する構成であり得る。 Meanwhile, the correction unit 24 calculates the position error between the back-projected road surface object and the same object on the precise road map, then corrects the spatial coordinates of the feature points in the dangerous area to reflect the position error, re-projects the feature points in the dangerous area onto the captured image, and calculates the pixel error for each re-projected feature point based on the minimum set point. Once the calculation is complete, if the pixel error is smaller than a critical value, the message generation and distribution unit 25 may be configured to mark the dangerous area (location, attributes) on the precise road map and automatically generate and display a message.

ここで、補正部２４はピクセル誤差が臨界値以上であれば、特徴点別位置誤差を利用して姿勢（Ｐａｎ、Ｔｉｌｔ）を補正することができる。 Here, if the pixel error is greater than or equal to a critical value, the correction unit 24 can correct the posture (pan, tilt) using the position error for each feature point.

座標推定部２３はマッピング対象客体（ＡＩで認識された客体、ＡＩＤで認識された危険区域、オペレータによって選択された客体や危険区域）に対して精密道路地図を基盤として特徴点の座標を自動で推定することができる。 The coordinate estimation unit 23 can automatically estimate the coordinates of feature points for mapping target objects (objects recognized by AI, danger areas recognized by AID, objects or danger areas selected by the operator) based on a precise road map.

補正部２４は、カメラ姿勢推定過程やマッピング対象客体の座標推定過程で精密道路地図を利用して予め推定したピクセル誤差、位置誤差およびカメラ姿勢のうち少なくとも一つを補正する構成であり得る。 The correction unit 24 may be configured to correct at least one of pixel error, position error, and camera attitude that are pre-estimated using a precise road map during the camera attitude estimation process or the coordinate estimation process of the mapping target object.

前記補正部２４は車両、人、表示などのＡＩで認識された客体、ＡＩＤで認識された危険区域、またはオペレータによって設定された危険区域を推定したり決定する過程でカメラ姿勢と位置誤差を補正する構成であり得る。 The correction unit 24 may be configured to correct camera attitude and position errors in the process of estimating or determining objects recognized by AI, such as vehicles, people, and signs, danger areas recognized by AID, or danger areas set by an operator.

補正部２４は精密道路地図を撮影映像に投影した後、撮影映像内の車線などの路面客体と投影された精密道路地図上の路面客体間の位置誤差を基準として事前に設定された基準によって自動で補正の有無を決定する。この時、認識された路面客体を精密道路地図上に逆投影したり精密道路地図の同一客体を撮影映像に投影した後、位置比較（座標比較）を通じて自動でマッピング対象客体や危険区域の座標を補正し姿勢も補正する過程である。 The correction unit 24 projects the precise road map onto the captured image and then automatically determines whether to perform correction based on a preset criterion, which is based on the positional error between road surface objects such as lanes in the captured image and the road surface objects on the projected precise road map. At this time, the recognized road surface objects are back-projected onto the precise road map, or the same objects on the precise road map are projected onto the captured image, and then the coordinates of the objects to be mapped and dangerous areas are automatically corrected through position comparison (coordinate comparison), and the posture is also corrected.

また、固定式の場合にも、時間の経過につれて変位による回転誤差が発生する可能性が高いので、マッピング時ごとに補正を通じて正確度を維持することができ、初期カメラ姿勢推定が完了した状態で位置（Ｘ、Ｙ、Ｚ）は変化の可能性がほとんどないので誤差がないと仮定し、誤差の種類をＰａｎとＴｉｌｔに区分することができる。 Even in the case of a fixed system, there is a high possibility that rotation errors will occur due to displacement over time, so accuracy can be maintained through correction at each mapping. Once the initial camera posture estimation is complete, the position (X, Y, Z) is unlikely to change, so it is assumed that there is no error, and the error types can be classified as Pan and Tilt.

左右の誤差はＰａｎ、上下の誤差はＴｉｌｔを微細調整して補正可能であり、誤差の大きさを算出して調整単位を決定した後に反対方向の誤差が発生するまで繰り返し、逆方向に最初の単位の１０分の１に微細調整した後、再び反対方向の誤差が発生するまで繰り返すことができる。各段階で臨界値（Ｔｈｒｅｓｈｏｌｄ）との比較を遂行して満足すれば終了処理する過程でなされ得る。 Left-right errors can be corrected by fine-tuning Pan, and up-down errors can be corrected by fine-tuning Tilt. After calculating the magnitude of the error and determining the adjustment unit, this can be repeated until an error in the opposite direction occurs, and then fine-tuning the opposite direction to 1/10 of the original unit, and then repeating this process until an error in the opposite direction occurs again. At each stage, a comparison with a threshold is performed, and if satisfied, the process can be terminated.

一方、カメラから提供されたＰＴＺ値のみを利用してオペレータが指定したマッピング対象客体や危険区域を精密道路地図上に逆投影した後、精密道路地図上で座標を算出する時にＰＴ値に誤差がある場合、座標推定結果に誤差が大きく発生する可能性がある。 However, if there is an error in the PT value when calculating coordinates on the precise road map after back-projecting the mapping object or danger zone designated by the operator using only the PTZ value provided by the camera, there is a possibility that the coordinate estimation results will have a large error.

このような誤差は、精密道路地図上で危険区域の座標を推定した後に周辺の路面客体と共に撮影映像に投影し、撮影映像に投影された精密道路地図の路面客体の位置を既存の撮影映像で対応する路面客体の位置とオペレータが肉眼で比較して誤差の大きさを確認することができる。オペレータが肉眼で確認した結果、誤差が大きく発生したと判断されれば、投影された路面客体と同じ撮影映像の路面客体の始終点のマーキングを通じて指定すれば、これを利用して自動でカメラ姿勢を補正することができ、これを利用して再び座標推定過程を遂行して該当誤差をさらに補正する。 These errors can be checked by estimating the coordinates of the dangerous area on a precise road map, projecting it onto the captured image along with surrounding road objects, and then visually comparing the position of the road object on the precise road map projected onto the captured image with the position of the corresponding road object in the existing captured image. If the operator determines that a significant error has occurred after visually inspecting it, they can mark the start and end points of the road object in the captured image that match the projected road object. This can then be used to automatically correct the camera posture, and the coordinate estimation process can be performed again to further correct the error.

固定式の場合にも、時間の経過につれて変位による回転誤差が発生する可能性が高いので、マッピング時ごとの補正を通じて正確度を維持することができる。 Even with fixed systems, there is a high possibility of rotational errors occurring due to displacement over time, so accuracy can be maintained through corrections made each time mapping is performed.

すなわち、初期カメラ姿勢推定が完了した状態で位置（Ｘ、Ｙ、Ｚ）は変化の可能性が殆どないので誤差がないと仮定し、誤差の種類をＰａｎとＴｉｌｔに区分することができる。 In other words, once initial camera pose estimation is complete, the position (X, Y, Z) is unlikely to change, so we can assume there is no error, and classify the types of error into Pan and Tilt.

左右の誤差はＰａｎ、上下の誤差はＴｉｌｔを微細調整して補正可能であり、補正手続きは誤差の大きさの算出後に調整単位の決定→反対方向の誤差が発生するまで繰り返し→逆方向に最初の単位の１０分の１に微細調整→再び反対方向の誤差が発生するまで繰り返す。各段階で臨界値（Ｔｈｒｅｓｈｏｌｄ）との比較を遂行して、満足すれば終了処理する過程でなされる。 Left-right errors can be corrected by fine-tuning Pan, and up-down errors by fine-tuning Tilt. The correction procedure involves calculating the magnitude of the error, then determining the adjustment unit → repeating until an error in the opposite direction occurs → fine-tuning in the opposite direction to 1/10 of the original unit → repeating until an error in the opposite direction occurs again. At each stage, a comparison with a threshold is made, and if satisfied, the process is terminated.

メッセージ生成および配布部２６は危険の種類、走行不可領域に対する面単位情報などが含まれたメッセージを生成し、Ｖ２Ｘ、移動通信などを利用して配布する構成であり得る。 The message generation and distribution unit 26 may be configured to generate messages containing information such as the type of danger and surface-level information on prohibited areas, and distribute these messages using V2X, mobile communications, etc.

図３は本発明の一実施例に係る精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定方法を説明したフローチャートであり、図４は図３に図示されたカメラ姿勢推定段階の細部フローチャートであり、図５は撮影映像に精密道路地図を投影した後、微細調整要領を説明するための例示図であり、図６は図３に図示された座標推定段階乃至メッセージ生成配布段階を説明したフローチャートを示す。図７は、危険区域に隣接した車線指定を通じての姿勢補正およびマッピング誤差補正の概念を説明するための例示図である。 Figure 3 is a flowchart illustrating a method for estimating CCTV camera pose and object coordinates based on a precise road map according to one embodiment of the present invention, Figure 4 is a detailed flowchart of the camera pose estimation step shown in Figure 3, Figure 5 is an example diagram illustrating a fine adjustment procedure after projecting a precise road map onto a captured image, and Figure 6 is a flowchart illustrating the coordinate estimation step through the message generation and distribution step shown in Figure 3. Figure 7 is an example diagram illustrating the concept of pose correction and mapping error correction through lane designation adjacent to a dangerous area.

まず、図３に図示された通り、本発明の一実施例に係る精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定方法（Ｓ７００）はカメラ姿勢推定段階（Ｓ７１０）、マッピング対象客体設定段階（Ｓ７２０）、座標推定段階（Ｓ７３０）、誤差補正段階（Ｓ７４０）およびメッセージ生成および配布段階（Ｓ７５０）のうち少なくとも一つを含む。 First, as shown in FIG. 3, a method for estimating CCTV camera pose and object coordinates based on a precise road map according to one embodiment of the present invention (S700) includes at least one of a camera pose estimation step (S710), a mapping target object setting step (S720), a coordinate estimation step (S730), an error correction step (S740), and a message generation and distribution step (S750).

カメラ姿勢推定段階（Ｓ７１０）はＣＣＴＶカメラのカメラ姿勢（Ｘ、Ｙ、Ｚ、Ｐａｎ、Ｔｉｌｔ）を推定し、ＰＴＺカメラの位置（Ｘ、Ｙ、Ｚ）確定および前記ＰＴＺカメラで提供するＰａｎ／Ｔｉｌｔ値を絶対値（真北方向、水平基準）に変換するための関係式（差分）を決定する段階であり得る。 The camera attitude estimation step (S710) may be a step of estimating the camera attitude (X, Y, Z, Pan, Tilt) of the CCTV camera, determining the position (X, Y, Z) of the PTZ camera, and determining a relational equation (difference) for converting the Pan/Tilt values provided by the PTZ camera into absolute values (true north direction, horizontal reference).

本願で言及するカメラ姿勢推定は、測量や基準点設定および表記などの現場での作業なしに精密道路地図を利用する簡単な過程を通じてＣＣＴＶカメラ姿勢を推定しようとする技術であって、後述する座標推定やマッピングを自動化できる土台を提供し、また、既存の方法と比較して高い正確度と時間および費用削減などの効果を提供する技術であり得る。したがって、ＰＴＺカメラである場合、装置では最初のセッティング位置を基準とした相対値を提供しているので、すぐにカメラ姿勢（Ｐｏｓｅ）に変換が不可能な点を考慮して初期カメラ姿勢（Ｐｏｓｅ）推定時に提供される値と真の値の差分を算出して反映することによって、以後提供されるＰＴＺ値を変換してカメラ姿勢（Ｐｏｓｅ）に活用する技術であり得る。 The camera pose estimation referred to in this application is a technology that attempts to estimate the CCTV camera pose through a simple process using a precision road map without on-site work such as surveying, setting control points, and marking. It provides a foundation for automating coordinate estimation and mapping, which will be described later, and can be a technology that offers benefits such as high accuracy and time and cost savings compared to existing methods. Therefore, in the case of a PTZ camera, since the device provides relative values based on the initial setting position, it is not possible to immediately convert them to the camera pose (Pose). Taking this into consideration, this technology can convert the PTZ values provided subsequently and use them as the camera pose (Pose) by calculating and reflecting the difference between the value provided when estimating the initial camera pose (Pose) and the true value.

次に、マッピング対象客体設定段階（Ｓ７２０）は、ＡＩ基盤のマッピング対象客体認識、ＡＩＤ（ＡｕｔｏｍａｔｉｃＩｎｃｉｄｅｎｔＤｅｔｅｃｔｉｏｎ）基盤の客体や危険区域認識およびオペレータが肉眼で危険状況であると認識した客体や区域を危険区域に指定する段階のうち少なくとも一つ以上を含むことができる。 Next, the mapping target object setting step (S720) may include at least one of AI-based mapping target object recognition, AID (Automatic Incident Detection)-based object or danger area recognition, and a step of designating an object or area that an operator recognizes as a dangerous situation with the naked eye as a danger area.

次に、座標推定段階（Ｓ７３０）は、マッピング対象客体（ＡＩで認識した客体や区域、ＡＩＤで認識した客体や危険区域、オペレータが手動で設定した客体や危険区域など）に対して精密道路地図に基づいて特徴点の座標を自動で推定しマッピングする段階であり得る。 Next, the coordinate estimation step (S730) may be a step of automatically estimating and mapping the coordinates of feature points based on a precise road map for the object to be mapped (objects or areas recognized by AI, objects or dangerous areas recognized by AID, objects or dangerous areas manually set by an operator, etc.).

前記マッピング対象客体の座標推定は、固定式ＣＣＴＶだけでなくＰＴＺカメラを利用する場合にも適用が可能であり、座標推定およびマッピング過程を自動化できる方法であって、精密道路地図上で座標を推定することによって、簡単であり正確度を高めることができる。特に、ＡＩ（ＡｒｔｉｆｉｃｉａｌＩｎｔｅｌｌｉｇｅｎｃｅ）を利用して車線などの路面客体の認識が可能である場合だけでなく、このような機能がない場合にも適用することができる。 The coordinate estimation of the object to be mapped can be applied when using not only fixed CCTV but also PTZ cameras. It is a method that can automate the coordinate estimation and mapping process, and by estimating coordinates on a precise road map, it is simple and highly accurate. In particular, it can be applied not only when it is possible to recognize road surface objects such as lanes using AI (Artificial Intelligence), but also when such functionality is not available.

次に、誤差補正段階（Ｓ７４０）は、カメラ姿勢推定段階（Ｓ７１０）やマッピング対象客体の座標推定段階（Ｓ７３０）で精密道路地図を利用して予め推定した位置やピクセル誤差およびカメラ姿勢を補正する段階であり得る。 Next, the error correction step (S740) may be a step of correcting the position, pixel error, and camera posture previously estimated using a precise road map in the camera posture estimation step (S710) and the coordinate estimation step (S730) of the mapping target object.

前記位置やピクセル誤差および／または姿勢補正段階は、座標推定結果やこの時に適用されたカメラ姿勢に対する誤差をすべての推定過程で確認して自動で補正することによって、正確度の高い推定を可能にした方法であって、特に、車線などの路面客体の認識が可能である場合、全過程を自動化することによって便利性を増大することができ、このような機能がない場合にもオペレータの簡単な操作だけで同じ効果を得ることができる技術であり得る。 The position, pixel error, and/or attitude correction step is a method that enables highly accurate estimation by checking and automatically correcting errors in the coordinate estimation results and the camera attitude applied at that time throughout the entire estimation process. In particular, if it is possible to recognize road objects such as lanes, automating the entire process can increase convenience. Even if such functionality is not available, this technology can achieve the same effect with just a simple operation by the operator.

次に、メッセージ生成および配布段階（Ｓ７５０）は、危険関連メッセージを自動で生成し配布する段階であり得る。 Next, the message generation and distribution step (S750) may be a step of automatically generating and distributing risk-related messages.

前述した過程に基づいて作成されたマッピング対象客体に対する情報を共有することにおいて、オペレータの追加手作業がないように自動でメッセージを生成し、その結果をオペレータが管理するモニタに表出して確認が可能であるようにしたし、配布も自動でなされるようにする技術であり得る。 When sharing information about the mapping target object created based on the above process, messages are automatically generated to eliminate additional manual work by the operator, and the results are displayed on a monitor managed by the operator so that they can be checked. This technology also allows for automatic distribution.

以下では、前述した各段階をより詳細に説明することにする。 Below, we'll explain each of the above steps in more detail.

まず、図４を参照すると、初期カメラ姿勢推定段階（Ｓ７１０）はＣＣＴＶ撮影映像上の路面客体の特徴点のピクセル座標を抽出し、精密道路地図とマッチングさせて前記路面客体の特徴点の空間座標を抽出し、ＰｎＰ（Ｐｅｒｓｐｅｃｔｉｖｅ－ｎ－Ｐｏｉｎｔ）を通じてカメラ姿勢を推定した後、推定されたカメラ姿勢を利用して精密道路地図を撮影映像に投影する。以後、前記特徴点別に映像内でのピクセル誤差を算出した後、前記ピクセル誤差と臨界値（Ｔｈｒｅｓｈｏｌｄ）を比較する。 First, referring to FIG. 4, the initial camera pose estimation step (S710) extracts pixel coordinates of feature points of road objects in CCTV-captured images, matches them with a precise road map to extract spatial coordinates of the feature points of the road objects, estimates the camera pose through Perspective-n-Point (PnP), and then projects the precise road map onto the captured image using the estimated camera pose. Then, pixel errors within the image are calculated for each feature point, and the pixel errors are compared with a threshold.

前述したように、ＰｎＰ（Ｐｅｒｓｐｅｃｔｉｖｅ－ｎ－Ｐｏｉｎｔ）とは座標位置が知られている３次元の特徴点座標および該当特徴点座標が投影されたイメージ上の２次元の特徴点座標に基づいて３次元座標系に対してイメージを獲得したカメラの正確な位置および姿勢を獲得する技法を意味する。 As mentioned above, PnP (Perspective-n-Point) refers to a technique for obtaining the exact position and orientation of a camera that captured an image relative to a 3D coordinate system based on the coordinates of 3D feature points whose coordinate positions are known and the 2D coordinates of feature points on the image onto which the corresponding feature points are projected.

この時、前記誤差が臨界値より小さければ終了するものの、誤差が臨界値（Ｔｈｒｅｓｈｏｌｄ）以上であれば、誤差特性を考慮してカメラ姿勢を微細に補正し、補正されたカメラ姿勢を利用して精密道路地図を撮影映像に投影する。それから特徴点別に映像内でのピクセル誤差が臨界値未満となるまで前記過程を引き続き繰り返すことができる．． If the error is less than the threshold, the process ends. However, if the error is greater than the threshold, the camera posture is finely corrected taking into account the error characteristics, and a precise road map is projected onto the captured image using the corrected camera posture. The process can then be repeated until the pixel error in the image for each feature point falls below the threshold.

一方、前記カメラ姿勢推定段階（Ｓ７１０）は、ＰｎＰを通じて１次姿勢を推定する段階以後、次のように進行され得る。 Meanwhile, the camera pose estimation step (S710) can be performed as follows after estimating the primary pose through PnP.

例えば、推定されたカメラ姿勢を基準として前記路面客体の特徴点を精密道路地図に逆投影し、逆投影された路面客体の特徴点の空間座標とそれに対応する精密道路地図上客体の空間座標の位置誤差を特徴点別に算出し、誤差が臨界値（Ｔｈｒｅｓｈｏｌｄ）より小さければ終了し、誤差が臨界値（Ｔｈｒｅｓｈｏｌｄ）以上である場合、誤差特性を考慮して推定されたカメラ姿勢を微細に補正した後、補正された姿勢を利用して精密道路地図に逆投影する過程を含むことができる。ここでも、特徴点別空間内での位置誤差が臨界値未満となるまで前記過程を引き続き繰り返すことができる。 For example, the method may include back-projecting the feature points of the road surface object onto a precise road map based on the estimated camera pose, calculating the position error between the spatial coordinates of the back-projected feature points of the road surface object and the corresponding spatial coordinates of the object on the precise road map for each feature point, and terminating the process if the error is less than a threshold. If the error is greater than or equal to the threshold, finely correcting the estimated camera pose in consideration of the error characteristics, and then back-projecting onto the precise road map using the corrected pose. Again, the process may be repeated until the position error in the space for each feature point becomes less than the threshold.

ここで、ＰｎＰ基盤の１次姿勢推定において、回転するＰＴＺカメラである場合、カメラを概略的に道路と直角方向にセッティングすることが効率的である。 Here, in PnP-based primary posture estimation, if a rotating PTZ camera is used, it is efficient to set the camera roughly perpendicular to the road.

概略的に分かっているＣＣＴＶカメラの位置情報、路面または道路脇の客体（精密道路地図に明示された）の情報を基準として、精密道路地図と比較して車線などの路面に存在する路面客体をマッチングし、概略的な位置情報と周辺客体を利用して肉眼でイメージと精密道路地図上の客体に対するマッチング対を探し（周辺客体：信号灯、表示板、車線以外の路面表示、橋梁の始終点、防護施設の始終点など）、３個以上のマッチング対に対するピクセル座標と空間座標を抽出した後、ＰｎＰを遂行する過程であり得る。 Based on the roughly known position information of the CCTV camera and information on objects on the road surface or roadside (shown on a precise road map), the process involves comparing the precise road map with road surface objects such as lanes, searching for matching pairs between the image and the precise road map with the naked eye using the rough position information and surrounding objects (surrounding objects: traffic lights, signs, road markings other than lanes, start and end points of bridges, start and end points of protective facilities, etc.), extracting pixel coordinates and spatial coordinates for three or more matching pairs, and then performing PnP.

また、誤差特性を考慮して姿勢を微細補正する過程の場合、ＰｎＰ基盤で１次カメラ姿勢を推定した後、このカメラ姿勢を利用して精密道路地図を撮影映像に投影した後、客体特徴点間のピクセル誤差を算出して誤差の程度を確認し、誤差の特性を反映した微細調整を通じてカメラ姿勢を最終補正する過程であり得る。 In addition, the process of fine-tuning the pose taking into account error characteristics may involve estimating the primary camera pose based on PnP, projecting a precise road map onto the captured image using this camera pose, calculating the pixel error between object feature points to confirm the degree of error, and finally correcting the camera pose through fine adjustments that reflect the error characteristics.

または、カメラ姿勢を利用して撮影映像の特定客体の特徴点を精密道路地図上に逆投影した後、該当客体の特徴点間の位置誤差に基づいてカメラ姿勢を補正できる方法であり得る。 Alternatively, it may be a method of back-projecting feature points of specific objects in the captured image onto a precise road map using the camera pose, and then correcting the camera pose based on the positional error between the feature points of the corresponding objects.

前記カメラ姿勢推定段階（Ｓ７１０）は、ＰＴＺカメラである場合、カメラキャリブレーションを遂行して真の値を決定した後、Ｐａｎ、Ｔｉｌｔ提供値と真の値の差を基準として絶対値に変換する。Ｚｏｏｍの場合、提供される値を利用して拡大比率を求めた後にこの逆数だけイメージを縮小すれば、Ｚｏｏｍ値が「０」であるイメージに変換されて、自動化されたマッピング過程で補正されたカメラ姿勢値を利用して持続的に差分を修正する過程を含むことができる。 In the camera posture estimation step (S710), if the camera is a PTZ camera, camera calibration is performed to determine true values, and then the difference between the provided Pan and Tilt values and the true values is converted into absolute values. In the case of zoom, the provided values are used to determine the enlargement ratio, and the image is reduced by the reciprocal of this ratio, thereby converting it into an image with a zoom value of '0'. This can include a process of continuously correcting the difference using the corrected camera posture values during the automated mapping process.

一方、マッピング対象客体設定段階（Ｓ７２０）は、ＡＩ基盤で認識された車両や人などの認識客体、ＡＩＤで認識された客体や危険区域、オペレータによって設定された客体や危険区域などを設定する方式により過程が異なり得る。 Meanwhile, the mapping target object setting step (S720) may vary depending on the method of setting recognized objects such as vehicles and people recognized based on AI, objects and danger areas recognized by AID, objects and danger areas set by an operator, etc.

まず、図６を参照すると、車線などの路面客体の特徴点のピクセル座標を抽出し、路面客体以外のマッピング対象客体（ＡＩＤ基盤危険区域を含む）を認識する。 First, referring to Figure 6, pixel coordinates of characteristic points of road surface objects such as lanes are extracted, and mapping targets other than road surface objects (including AID-based danger areas) are recognized.

以後、マッピング対象客体と路面客体の特徴点を精密道路地図に逆投影してマッピング対象客体の特徴点に対する空間座標を算出し、逆投影された客体と精密道路地図の同一客体間の位置誤差を算出し、マッピング対象の特徴点に対する空間座標を補正する。以後、マッピング対象客体の特徴点をイメージに再投影した後、最初に設定された点を基準として再投影された特徴点別ピクセル誤差を算出し、前記ピクセル誤差が臨界値（Ｔｈｒｅｓｈｏｌｄ）より小さければ、精密道路地図上にマッピング対象を表記（位置、属性）した後、メッセージを自動生成および表出する過程を含むことができる。 Then, the feature points of the object to be mapped and the road surface object are back-projected onto the precise road map to calculate the spatial coordinates of the feature points of the object to be mapped, and the position error between the back-projected object and the same object on the precise road map is calculated, and the spatial coordinates of the feature points of the object to be mapped are corrected. Then, the feature points of the object to be mapped are re-projected onto the image, and the pixel error for each re-projected feature point is calculated based on the initially set point. If the pixel error is smaller than a threshold, the method may include the steps of notating the object to be mapped (position, attributes) on the precise road map and automatically generating and displaying a message.

ここで、ピクセル誤差が臨界値以上であれば、特徴点別位置誤差を利用してカメラ姿勢（Ｐａｎ、Ｔｉｌｔ）を補正する過程をさらに含むことができる。 Here, if the pixel error is greater than or equal to a critical value, the method may further include a process of correcting the camera pose (pan, tilt) using the position error for each feature point.

車両、人、落下物、動物などの客体を認識した場合、該当客体の座標を自動で推定し、ＡＩＤ（ＡｕｔｏｍａｔｉｃＩｎｃｉｄｅｎｔＤｅｔｅｃｔｉｏｎ）により事故、逆走、停滞などの危険区域を自動で認識した場合、該当危険区域に対する座標を自動で推定してその結果を反映して自動でマッピングすることができる。 When an object such as a vehicle, person, fallen object, or animal is recognized, the coordinates of the object are automatically estimated. When a dangerous area such as an accident, wrong-way driving, or traffic jam is automatically recognized using AID (Automatic Incident Detection), the coordinates of the dangerous area are automatically estimated and the results are reflected in the automatic mapping.

ここで、ＡＩ基盤認識対象は座標推定が必要な客体、車両、人、動物などの移動客体、逆走車、落下物などの非定型障害物、停滞の後尾、カラーコーン（登録商標）、プレキャストバリアなどの工事関連客体などが含まれる。 Here, AI-based recognition targets include objects requiring coordinate estimation, moving objects such as vehicles, people, and animals, atypical obstacles such as wrong-way vehicles and fallen objects, the tail of traffic jams, traffic cones (registered trademark), precast barriers, and other construction-related objects.

座標推定が必要な危険区域は、ＡＩＤで認識した事故区域、連続したカラーコーン（登録商標）やプレキャストバリアなどを基準として認識した工事区間などの面で認識された危険区域であり得る。 Dangerous areas requiring coordinate estimation may be accident areas identified by AID, construction zones identified using a series of traffic cones (registered trademark) or precast barriers as reference points, or other dangerous areas identified by surface features.

また、誤差確認および姿勢補正のための路面客体は、車線、矢印、停止線などの精密道路地図に含まれた路面客体（必要に応じて表示板、信号灯を含む）であり得る。 Furthermore, road surface objects for error detection and posture correction may be road surface objects included in a precision road map, such as lanes, arrows, and stop lines (including signboards and traffic lights, if necessary).

ここで、初期カメラ姿勢推定状態はＰＴＺカメラである場合、提供値と真の値の差分を分かっているので、回転時にもリアルタイムでＰａｎとＴｉｌｔの絶対値（真値）の推定が可能である状態であり得る。 Here, if the initial camera posture estimation state is a PTZ camera, the difference between the provided value and the true value is known, so it may be possible to estimate the absolute values (true values) of Pan and Tilt in real time even during rotation.

また、位置および姿勢誤差の補正は精密道路地図を撮影映像に投影した後、撮影映像の路面客体と投影された精密道路地図の路面客体間の位置誤差を基準として事前に設定された基準によって自動で補正の有無を決定する。この時、映像で認識された路面客体を精密道路地図上に逆投影したり精密道路地図の同じ客体を撮影映像に投影した後、対応する客体の位置比較を通じて自動でマッピング対象客体や危険区域の座標を補正し、姿勢も補正する機能を含む。 In addition, position and orientation error correction is performed by projecting a precise road map onto the captured image, and then automatically determining whether or not to correct it based on pre-set criteria based on the position error between road objects in the captured image and those on the projected precise road map. This includes a function to back-project road objects recognized in the image onto the precise road map, or to project the same objects on the precise road map onto the captured image, and then automatically correct the coordinates of the objects being mapped and dangerous areas by comparing the positions of the corresponding objects, as well as correcting their orientation.

また、固定式の場合にも、時間の経過につれて変位による回転誤差が発生する可能性が高いので、マッピング時ごとの補正を通じて正確度を維持することができる。初期カメラ姿勢推定が完了した状態では位置（Ｘ、Ｙ、Ｚ）変化の可能性がほとんどないので、誤差がないと仮定するものの、誤差の種類をＰａｎとＴｉｌｔに区分することができる。 Even in the case of a fixed system, there is a high possibility that rotation errors will occur due to displacement over time, so accuracy can be maintained through corrections at each mapping. Once the initial camera posture estimation is complete, there is little possibility of position (X, Y, Z) changes, so it is assumed that there are no errors, but the types of errors can be classified as Pan and Tilt.

左右の誤差はＰａｎ、上下の誤差はＴｉｌｔを微細調整して補正が可能である。補正手続きは誤差の大きさを算出して調整単位を決定した後に反対方向の誤差が発生するまで繰り返し、逆方向に最初の単位の１０分の１に微細調整した後、再び反対方向の誤差が発生するまで繰り返す。各段階で臨界値（Ｔｈｒｅｓｈｏｌｄ）との比較を遂行して満足すれば終了する。 Left-right errors can be corrected by fine-tuning Pan, and up-down errors by fine-tuning Tilt. The correction procedure involves calculating the magnitude of the error and determining the adjustment unit, then repeating the process until an error in the opposite direction occurs, fine-tuning the adjustment in the opposite direction to 1/10 of the original unit, and then repeating the process again until an error in the opposite direction occurs. At each stage, a comparison with a threshold is performed and the process ends when a satisfactory result is reached.

ＡＩ基盤認識対象は座標推定が必要な客体であり、車両、人、動物などの移動客体（逆走車、落下物などの非定型障害物、停滞の後尾、カラーコーン（登録商標）、プレキャストバリアなどの工事関連客体がこれに該当し得る。 AI-based recognition targets are objects that require coordinate estimation, and include moving objects such as vehicles, people, and animals (including atypical obstacles such as wrong-way vehicles and fallen objects, the tail of traffic jams, traffic cones (registered trademark), precast barriers, and other construction-related objects).

また、座標推定が必要な危険区域は、ＡＩＤで認識した事故区域、連続したカラーコーン（登録商標）やプレキャストバリアなどを基準として認識した工事区間などの面で認識された危険区域であり得る。 In addition, danger areas that require coordinate estimation may be accident areas recognized by AID, construction areas recognized using a series of traffic cones (registered trademark) or precast barriers as reference points, or other danger areas recognized by surface.

一方、カメラ姿勢推定過程でＰＴＺカメラである場合、提供値と真の値の差分を分かっているので、回転時にもリアルタイムでＰａｎとＴｉｌｔの絶対値（真値）の推定が可能となり得る。 On the other hand, if a PTZ camera is used in the camera posture estimation process, the difference between the provided value and the true value is known, so it may be possible to estimate the absolute values (true values) of Pan and Tilt in real time even during rotation.

また、誤差補正過程で、提供されたＰＴＺ値のみを利用してオペレータが指定したマッピング対象客体や危険区域を精密道路地図上に逆投影した後、精密道路地図上で座標を算出する時、ＰＴ値に誤差がある場合、座標推定結果に誤差が大きく発生する可能性がある。 In addition, during the error correction process, the mapping object or danger zone designated by the operator is back-projected onto a precise road map using only the provided PTZ value, and then when calculating coordinates on the precise road map, if there is an error in the PT value, there is a possibility that a large error will occur in the coordinate estimation results.

このような誤差は精密道路地図上で危険区域の座標を推定した後、周辺の路面客体と共に推定されたカメラ姿勢を利用して撮影映像に再投影し、再投影された路面客体の映像での位置と既存の撮影映像で対応する同一の路面客体の位置をオペレータが肉眼で比較して誤差の大きさを確認することができる。オペレータが肉眼で確認した結果、誤差が大きいと判断されれば、再投影された路面客体に対応する撮影映像での路面客体の始終点をマーキングを通じて指定すれば、これを利用して自動でカメラ姿勢を補正することができ、これを利用して再び座標推定過程を遂行して座標推定誤差も補正することができる。 Such errors can be corrected by estimating the coordinates of the danger zone on a precise road map, then re-projecting the surrounding road objects onto the captured image using the estimated camera attitude. The operator can then visually compare the position of the re-projected road object in the image with the position of the corresponding identical road object in the existing captured image to check the magnitude of the error. If the operator determines that the error is significant after visually inspecting it, they can mark the start and end points of the road object in the captured image that correspond to the re-projected road object, which can then be used to automatically correct the camera attitude. Using this, the coordinate estimation process can be performed again to correct the coordinate estimation error.

すなわち、初期カメラ姿勢推定が完了した状態で位置（Ｘ、Ｙ、Ｚ）は変化の可能性が殆どないので、誤差がないと仮定し、誤差の種類をＰａｎとＴｉｌｔに区分することもできる。 In other words, once the initial camera pose estimation is complete, there is almost no possibility of the position (X, Y, Z) changing, so it can be assumed that there is no error, and the error type can be classified as Pan and Tilt.

左右の誤差はＰａｎ、上下の誤差はＴｉｌｔを微細調整して補正可能であり、補正手続きは誤差の大きさを算出して調整単位を決定→反対方向の誤差が発生するまで繰り返し→逆方向に最初の単位の１０分の１に微細調整→再び反対方向の誤差が発生するまで繰り返す。各段階で臨界値（Ｔｈｒｅｓｈｏｌｄ）との比較を遂行して満足すれば終了処理する。 Left-right errors can be corrected by fine-tuning Pan, and up-down errors by fine-tuning Tilt. The correction procedure involves calculating the magnitude of the error and determining the adjustment unit → repeating until an error in the opposite direction occurs → fine-tuning in the opposite direction to 1/10 of the original unit → repeating until an error in the opposite direction occurs again. At each stage, a comparison with a threshold is performed, and if satisfied, the process is terminated.

一方、オペレータによって撮影映像内の危険区域（特徴点３個以上）が設定されたり属性（オペレータ）が入力され得る。このような場合、危険区域に隣接した隣接路面客体の始終点を設定し、危険区域および隣接路面客体の特徴点を精密道路地図に逆投影した後、前記精密道路地図を利用して危険区域および隣接路面客体の特徴点の空間座標を算出することができる。この時、逆投影された隣接路面客体と精密道路地図上でそれに対応する客体間の位置誤差を算出することもできる。以後、位置誤差を利用した危険区域の特徴点の空間座標を補正し、危険区域の特徴点を撮影映像に再投影する。最初に設定された点を基準として再投影された特徴点別ピクセル誤差を算出した後、前記誤差が臨界値より小さければ、精密道路地図上にマッピング対象者危険区域（位置、属性）を表記し、メッセージを自動生成および表出する過程に進行される。 Meanwhile, an operator can set a danger zone (three or more feature points) within the captured image or input attributes (operator). In this case, the start and end points of adjacent road objects adjacent to the danger zone can be set, and the feature points of the danger zone and adjacent road objects can be back-projected onto a precise road map. The spatial coordinates of the feature points of the danger zone and adjacent road objects can then be calculated using the precise road map. At this time, the position error between the back-projected adjacent road objects and their corresponding objects on the precise road map can also be calculated. The spatial coordinates of the feature points of the danger zone are then corrected using the position error, and the feature points of the danger zone are re-projected onto the captured image. The pixel error for each re-projected feature point is calculated based on the initially set point. If the error is smaller than a critical value, the system proceeds to the process of notating the mapping target's danger zone (location, attributes) on the precise road map and automatically generating and displaying a message.

ここで、誤差が臨界値より大きければ、特徴点別位置誤差を利用したカメラ姿勢（Ｐａｎ、Ｔｉｌｔ）を補正する過程をさらに含むことができる。 Here, if the error is greater than a critical value, the method may further include a process of correcting the camera pose (pan, tilt) using the position error for each feature point.

本発明の一実施例に係る精密道路地図基盤のＣＣＴＶカメラ姿勢および客体座標推定方法は、測量や基準点設定および表記などの現場での作業なしに精密道路地図を利用する簡単な方法を通じてＣＣＴＶカメラのカメラ姿勢を推定することができる方法を提示することによって、座標推定やマッピングを自動化できる土台を提供し、また、既存の方法と比較して高い正確度と時間および費用削減などの効果を創り出すことができるという利点がある。 A method for estimating CCTV camera pose and object coordinates based on a precise road map according to one embodiment of the present invention provides a method for estimating the camera pose of a CCTV camera through a simple method using a precise road map without on-site work such as surveying, setting control points, and marking. This provides a foundation for automating coordinate estimation and mapping, and has the advantages of achieving higher accuracy and time and cost savings compared to existing methods.

また、ＰＴＺカメラである場合、装置で最初のセッティング位置を基準とした相対値を提供しているので、すぐにカメラ姿勢に変換が不可能な点を考慮して初期姿勢推定時に提供される値と真の値の差分を算出して反映することによって、以後提供されるＰＴＺ値を変換してすぐにカメラ姿勢に活用できるという利点がある。 In addition, in the case of a PTZ camera, since the device provides relative values based on the initial setting position, it is not possible to immediately convert them to camera posture. Taking this into consideration, the difference between the value provided during initial posture estimation and the true value is calculated and reflected, which has the advantage that PTZ values provided thereafter can be converted and immediately used as camera posture.

また、固定式ＣＣＴＶだけでなくＰＴＺカメラを利用する場合にも適用が可能であり、座標推定およびマッピング過程を自動化できる方法を提示し、精密道路地図上で座標を推定することによって、簡単であり正確度を向上させることができ、特に、ＡＩを利用して車線などの路面客体の認識が可能である場合だけでなくこのような機能がない場合にも適用可能であるという利点がある。 It can also be applied to cases where not only fixed CCTVs but also PTZ cameras are used, and it presents a method that can automate the coordinate estimation and mapping process, estimating coordinates on a precision road map, which is simple and improves accuracy. It has the advantage of being applicable not only when it is possible to recognize road objects such as lanes using AI, but also when such functionality is not available.

また、誤差および姿勢補正自動化が可能であるという利点、例えば、座標推定結果やこの時に適用されたカメラ姿勢に対する誤差をすべての推定過程で確認し自動で補正することによって、正確度の高い推定が可能であり、特に、車線などの路面客体の認識が可能である場合、全過程を自動化することによって便利性を増大することができる。このような機能がない場合にもオペレータの簡単な操作だけで同じ効果が得られるようにアルゴリズムの具現が可能であるという利点がある。 Another advantage is the ability to automate error and attitude correction. For example, errors in the coordinate estimation results and the camera attitude applied at that time are checked and automatically corrected throughout the entire estimation process, enabling highly accurate estimation. In particular, when road surface objects such as lanes can be recognized, the entire process can be automated, increasing convenience. Even if such functionality is not available, an algorithm can be implemented to achieve the same effect with simple operator operations.

また、メッセージ自動生成および配布が可能であるという利点、例えば、前記過程に基づいて作成されたマッピング情報を共有するにおいてオペレータの追加手作業がないように自動でメッセージを生成し、その結果をオペレータが管理するモニタに表出して確認が可能であるようにしたし、配布も自動でなされるという利点がある。 Another advantage is that messages can be automatically generated and distributed. For example, messages can be generated automatically to share mapping information created based on the above process, eliminating the need for additional manual work by the operator. The results can be displayed on a monitor managed by the operator, allowing them to be checked, and distribution is also automatic.

一方、本発明に開示された構成要素は、例えば、プロセッサ、コントローラ、ＡＬＵ（ａｒｉｔｈｍｅｔｉｃｌｏｇｉｃｕｎｉｔ）、デジタル信号プロセッサ（ｄｉｇｉｔａｌｓｉｇｎａｌｐｒｏｃｅｓｓｏｒ）、マイクロコンピュータ、ＦＰＡ（ｆｉｅｌｄｐｒｏｇｒａｍｍａｂｌｅａｒｒａｙ）、ＰＬＵ（ｐｒｏｇｒａｍｍａｂｌｅｌｏｇｉｃｕｎｉｔ）、マイクロプロセッサ、または命令（ｉｎｓｔｒｕｃｔｉｏｎ）を実行し応答できる他の何らかの装置とともに、一つ以上の汎用コンピュータまたは特殊目的コンピュータを利用して具現され得る。処理装置は運営体制（ＯＳ）および前記運営体制上で遂行される一つ以上のソフトウェアアプリケーションを遂行できる。また、処理装置はソフトウェアの実行に応答して、データを接近、保存、操作、処理および生成することもできる。理解の便宜のために、処理装置は一つが使われるものとして説明された場合もあるが、該当技術分野で通常の知識を有する者は、処理装置が複数個の処理要素（ｐｒｏｃｅｓｓｉｎｇｅｌｅｍｅｎｔ）および／または複数類型の処理要素を含むことができることが分かる。例えば、処理装置は複数個のプロセッサまたは一つのプロセッサおよび一つのコントローラを含むことができる。また、並列プロセッサ（ｐａｒａｌｌｅｌｐｒｏｃｅｓｓｏｒ）のような、他の処理構成（ｐｒｏｃｅｓｓｉｎｇｃｏｎｆｉｇｕｒａｔｉｏｎ）も可能である。 Meanwhile, the components disclosed herein may be implemented using one or more general-purpose or special-purpose computers, including, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA), a programmable logic unit (PLU), a microprocessor, or any other device capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the operating system. The processing device may also access, store, manipulate, process, and generate data in response to the execution of software. For ease of understanding, the description may assume that a single processing device is used. However, those skilled in the art will recognize that a processing device may include multiple processing elements and/or multiple types of processing elements. For example, a processing device may include multiple processors or one processor and one controller. Other processing configurations, such as parallel processors, are also possible.

ソフトウェアはコンピュータプログラム（ｃｏｍｐｕｔｅｒｐｒｏｇｒａｍ）、コード（ｃｏｄｅ）、命令（ｉｎｓｔｒｕｃｔｉｏｎ）、またはこれらのうち一つ以上の組合を含むことができ、望む通りに動作するように処理装置を構成したり独立的にまたは結合的に（ｃｏｌｌｅｃｔｉｖｅｌｙ）処理装置を命令することができる。ソフトウェアおよび／またはデータは、処理装置によって解釈されたり処理装置に命令またはデータを提供するために、何らかの類型の機械、構成要素（ｃｏｍｐｏｎｅｎｔ）、物理的装置、仮想の装置（ｖｉｒｔｕａｌｅｑｕｉｐｍｅｎｔ）、コンピュータ保存媒体または装置、または伝送される信号波（ｓｉｇｎａｌｗａｖｅ）に永久的に、または一時的に具体化（ｅｍｂｏｄｙ）できる。ソフトウェアはネットワークに連結されたコンピュータシステム上に分散されて、分散された方法で保存されたり実行されることもできる。ソフトウェアおよびデータは一つ以上のコンピュータ読み取り可能記録媒体に保存され得る Software may include computer programs, code, instructions, or a combination of one or more of these, and may configure a processing device to operate as desired or instruct the processing device, either independently or collectively. Software and/or data may be permanently or temporarily embodied in any type of machine, component, physical device, virtual device, computer storage medium or device, or transmitted signal wave, to be interpreted by the processing device or to provide instructions or data to the processing device. Software may also be distributed across computer systems connected to a network, where it is stored and executed in a distributed manner. Software and data may be stored on one or more computer-readable storage media.

本発明の実施例に係る方法は多様なコンピュータ手段を通じて遂行され得るプログラム命令形態で具現されてコンピュータ読み取り可能媒体に記録され得る。前記コンピュータ読み取り可能媒体はプログラム命令、データファイル、データ構造などを単独でまたは組み合わせて含むことができる。前記媒体に記録されるプログラム命令は実施例のために特別に設計され構成されたものであるか、コンピュータソフトウェア当業者に公知の使用可能なものであってもよい。コンピュータ読み取り可能記録媒体の例にはハードディスク、フロッピーディスクおよび磁気テープのような磁気媒体（ｍａｇｎｅｔｉｃｍｅｄｉａ）、ＣＤ－ＲＯＭ、ＤＶＤのような光記録媒体（ｏｐｔｉｃａｌｍｅｄｉａ）、フロプティカルディスク（ｆｌｏｐｔｉｃａｌｄｉｓｋ）のような磁気－光媒体（ｍａｇｎｅｔｏ－ｏｐｔｉｃａｌｍｅｄｉａ）、およびロム（ＲＯＭ）、ラム（ＲＡＭ）、フラッシュメモリなどのようなプログラム命令を保存し遂行するように特別に構成されたハードウェア装置が含まれる。プログラム命令の例にはコンパイラによって作られるような機械語コードだけでなく、インタープリタなどを使ってコンピュータによって実行され得る高級言語コードを含む。前記ハードウェア装置は実施例の動作を遂行するために一つ以上のソフトウェアモジュールとして作動するように構成され得、その逆も同様である。 Methods according to embodiments of the present invention may be embodied in the form of program instructions that can be executed by various computer means and recorded on a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions recorded on the medium may be specially designed and constructed for the embodiments, or may be readily available to those skilled in the art of computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape, optical media such as CD-ROMs and DVDs, magneto-optical media such as floptical disks, and hardware devices specially configured to store and execute program instructions, such as ROM, RAM, and flash memory. Examples of program instructions include not only machine language code, such as that produced by a compiler, but also high-level language code that can be executed by a computer using an interpreter, etc. The hardware devices may be configured to operate as one or more software modules to perform the operations of an embodiment, and vice versa.

以上、本発明は実施例を参照して詳細に説明されたが、本発明が属する技術分野で通常の知識を有する者であれば、前述された技術的思想を逸脱しない範囲内で多様な置換、付加および変形が可能であることは当然であり、このような変形された実施形態も下記の特許請求の範囲によって定められる本発明の保護範囲に属する。 The present invention has been described in detail above with reference to examples. However, it will be apparent to those skilled in the art that various substitutions, additions, and modifications may be made without departing from the technical concept described above, and such modified embodiments also fall within the scope of the present invention as defined by the following claims.

１００：システム
１０：ＣＣＴＶ
２０：情報処理サーバー
２１：カメラ姿勢推定部
２２：マッピング対象客体設定部
２３：座標推定部
２４：補正部
２５：メッセージ生成および配布部
３０：オペレータ端末
100: System 10: CCTV
20: Information processing server 21: Camera posture estimation unit 22: Mapping target object setting unit 23: Coordinate estimation unit 24: Correction unit 25: Message generation and distribution unit 30: Operator terminal

Claims

(a) estimating the camera posture by matching road objects in an image captured by a CCTV camera with a precise road map;
(b) selecting an object to be mapped within the captured image; and
(c) estimating coordinates in the precise road map of a mapping target object selected in the captured image based on the estimated camera pose;
The camera posture includes information on the position (X, Y, Z), pan, and tilt of the CCTV camera,
The step (a) comprises:
(a1) projecting the precise road map onto the photographed image using the estimated camera pose, or back-projecting an object in the photographed image onto the precise road map using the estimated camera pose;
(a2) calculating pixel errors of corresponding feature points of the object in the captured image and the object in the precise road map; and
(a3) if the calculated pixel error is equal to or greater than a critical value, correcting the estimated camera pose to reduce the calculated pixel error to less than the critical value;
The step (c)
back-projecting each feature point of the mapping target object onto the precise road map according to the estimated camera pose; and
and calculating spatial coordinates of each feature point of the back-projected mapping target object from the precise road map.
A method for estimating CCTV camera attitude and object coordinates based on a precise road map.

The step (a) comprises:
extracting pixel coordinates of feature points of the road surface object from the captured image;
extracting spatial coordinates of feature points of the road surface object by matching the road surface object with a precise road map; and
2. The method of claim 1, further comprising: estimating the camera pose using Perspective-n-Point (PnP) from a relationship between pixel coordinates and spatial coordinates of the extracted road surface object feature points.

If the CCTV camera is a PTZ camera, step (a) includes converting the pan and tilt values provided by the PTZ camera into absolute values based on true north and a vertical direction from the difference between the pan and tilt values of the estimated camera attitude and the pan and tilt values provided by the PTZ camera; and
3. The method of claim 2, further comprising: reducing the captured image by an inverse of the zoom value provided by the PTZ camera to convert it into a captured image with a zoom value of '0'.

In the step (b),
2. The method of claim 1, wherein the mapping target object is an area or object that is automatically recognized and extracted from the captured image according to a predetermined rule based on artificial intelligence (AI), or an area or object that is manually selected from the captured image by a user.