JP6953292B2

JP6953292B2 - Object identification device and method

Info

Publication number: JP6953292B2
Application number: JP2017228752A
Authority: JP
Inventors: 良亮渡邊; 敬介野中
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2017-11-29
Filing date: 2017-11-29
Publication date: 2021-10-27
Anticipated expiration: 2037-11-29
Also published as: JP2019101526A

Description

本発明は、複数のカメラで撮影した映像から、オブジェクトを高速かつロバストに認識、識別し、その結果を表示するオブジェクト識別装置および方法に関する。 The present invention relates to an object identification device and a method for rapidly and robustly recognizing and identifying an object from images taken by a plurality of cameras and displaying the result.

従来、単一のカメラで撮影した映像から、人物に代表される何らかのオブジェクトを抽出し、識別する技術が提案されてきた。この識別のために、例えば人物であればスポーツ選手の背番号や顔、車であればナンバープレートの番号を認識し、それを基にオブジェクトを識別、同定する手段がある。この技術はオブジェクトを切り出した画像に対し、何らかの手段を用いて認識を行う。 Conventionally, a technique has been proposed in which some object represented by a person is extracted and identified from an image taken by a single camera. For this identification, for example, there is a means of recognizing the athlete's uniform number and face in the case of a person, and the license plate number in the case of a car, and identifying and identifying an object based on the number. This technology recognizes an image obtained by cutting out an object by some means.

認識の手段としては、深層学習を用いる技術が非特許文献１に開示されている。この認識結果は、例えばスポーツ映像内の選手を対象とする場合、オブジェクトの追跡処理を行う際のオブジェクトの同定に用いることや、戦術分析等への応用が考えられる。 As a means of recognition, a technique using deep learning is disclosed in Non-Patent Document 1. For example, when targeting a player in a sports image, this recognition result can be used for identifying an object when tracking the object, or can be applied to tactical analysis or the like.

しかしながら、映像の中で認識対象となるオブジェクトが常に見え続けるということは少ない。例えば背番号であれば、カメラに対する選手の立つ角度や、選手同士の重なり等の問題から、常に背番号を正しく認識するということは困難であるし、車のナンバープレートであれば、ナンバープレートが見える角度というのは限定される。そのため、動画内の任意のフレームにおいて高い精度で認識するためには、フレームごとの認識結果だけでなく、認識した結果を追跡する処理を加えることや、複数のカメラを用いるというアプローチが考えられる。そこで、複数のカメラで撮影した画像間の相対的な方位関係を取得し、方位関係から得られる特徴を基に人物同定を行う既存技術が存在していた。 However, it is rare that the object to be recognized is always visible in the image. For example, in the case of a uniform number, it is difficult to always correctly recognize the uniform number due to problems such as the angle at which the athlete stands with respect to the camera and the overlap between the athletes, and in the case of a car license plate, the license plate is used. The viewing angle is limited. Therefore, in order to recognize an arbitrary frame in a moving image with high accuracy, it is conceivable to add a process of tracking the recognized result as well as the recognition result for each frame, or to use a plurality of cameras. Therefore, there has been an existing technique for acquiring the relative directional relationship between images taken by a plurality of cameras and identifying a person based on the characteristics obtained from the directional relationship.

特開2016-001447号公報Japanese Unexamined Patent Publication No. 2016-001447

Sebastian Gerke; Karsten Muller; Ralf Schafer, "Soccer Jersey Number Recognition Using Convolutional Neural Networks," The IEEE International Conference on Computer Vision (ICCV) Workshops, pp. 17-24, 2015.Sebastian Gerke; Karsten Muller; Ralf Schafer, "Soccer Jersey Number Recognition Using Convolutional Neural Networks," The IEEE International Conference on Computer Vision (ICCV) Workshops, pp. 17-24, 2015.

特許文献１は複数カメラの相対的な方位関係から得られる顔の向きを用いて人物同定の精度を高める技術であるが、候補領域画像を綺麗に抽出する必要がある。顔や背番号のみを完全に自動で切り取ることは技術的に困難であることは非特許文献１などでも述べられており、切り取りに失敗した場合は誤認識の原因となるため、精度の問題から適用しづらかった。 Patent Document 1 is a technique for improving the accuracy of person identification by using the orientation of a face obtained from the relative orientation relationship of a plurality of cameras, but it is necessary to clearly extract a candidate region image. It is also stated in Non-Patent Document 1 that it is technically difficult to cut out only the face and the uniform number completely automatically, and if the cutting fails, it causes erroneous recognition. It was difficult to apply.

上記の高い精度を出すことが難しいという問題について、認識精度を高めるためのアプローチとして、深層学習などの、画像の切り取り方が一様でなくても、一定の認識精度を出せる手法を適用するという手段がある。 Regarding the above-mentioned problem that it is difficult to obtain high accuracy, as an approach to improve recognition accuracy, it is said to apply a method such as deep learning that can obtain a certain recognition accuracy even if the image cropping method is not uniform. There is a means.

非特許文献１は、得られた背番号の画像に対して深層学習モデルを用いて認識を行っている。しかしながら、背番号が見える画像に対しての認識精度を示しているに過ぎず、背番号が見えない画像は手動で取り除いている。 Non-Patent Document 1 recognizes the obtained image of the uniform number using a deep learning model. However, it only shows the recognition accuracy for the image in which the uniform number can be seen, and the image in which the uniform number cannot be seen is manually removed.

しかしながら、このような手動処理を加えても認識精度は約83％と報告されている。非特許文献１は複数のカメラを使うことが想定されておらず、複数のカメラを用いてスタジアム等で認識を行う場合、ある人物に対して背番号の見えないカメラが多く存在するという状況が考えられ、このようなシーンに本手法を適用してしまうと、誤認識が多く発生するおそれがある。また、精度の高い機械学習等の手法を用いて認識を行う場合、認識自体の計算コストが大きいため、カメラ台数が多くなると処理時間の面でも課題が大きい。 However, it is reported that the recognition accuracy is about 83% even if such manual processing is added. Non-Patent Document 1 does not assume that a plurality of cameras are used, and when recognition is performed at a stadium or the like using a plurality of cameras, there are many cameras whose uniform numbers cannot be seen for a certain person. It is conceivable that if this method is applied to such a scene, many misrecognitions may occur. Further, when recognition is performed using a method such as machine learning with high accuracy, the calculation cost of the recognition itself is large, so that a large problem also arises in terms of processing time as the number of cameras increases.

本発明の目的は、上記の技術課題を解決し、複数のカメラで撮影した映像から、オブジェクトを高速かつロバストに認識、識別し、その結果を表示するオブジェクト識別装置および方法を提供することにある。 An object of the present invention is to solve the above technical problems and to provide an object identification device and a method for recognizing and identifying an object at high speed and robustly from images taken by a plurality of cameras and displaying the result. ..

上記の目的を達成するために、本発明は、オブジェクトに付されたIDに基づいて各オブジェクトを識別するオブジェクト識別装置において、以下の構成を具備した点に特徴がある。 In order to achieve the above object, the present invention is characterized in that the object identification device that identifies each object based on the ID attached to the object has the following configuration.

(1) 各オブジェクトをフレームごとに異なる視点で撮影する複数のカメラと、各オブジェクトの位置を推定する手段と、各オブジェクトの向きを推定する手段と、オブジェクトごとにその向きに基づいてID認識用のカメラを選定する手段と、選定した各カメラの画像からオブジェクト画像を抽出する手段と、オブジェクト画像上でID認識を実行する手段と、オブジェクトのID認識結果を統合してIDを識別する手段と、オブジェクトの位置とIDの識別結果とを対応付けて出力する手段とを具備した。 (1) Multiple cameras that shoot each object from different viewpoints for each frame, means for estimating the position of each object, means for estimating the orientation of each object, and for ID recognition based on the orientation of each object. A means for selecting a camera, a means for extracting an object image from the image of each selected camera, a means for performing ID recognition on the object image, and a means for integrating the ID recognition result of the object to identify the ID. , A means for outputting the position of the object and the ID identification result in association with each other is provided.

(2) 前記カメラを選定する手段は、オブジェクトごとに、その向きおよびオブジェクト上でのIDの位置に基づいてIDの指向方向を計算し、オブジェクトごとに、そのIDの指向方向に基づいてカメラの選定範囲を求め、その選定範囲内に位置するカメラを選定するようにした。 (2) The means for selecting the camera is to calculate the direction of the ID for each object based on its orientation and the position of the ID on the object, and for each object, the direction of the camera based on the direction of the ID. The selection range was calculated, and the cameras located within the selection range were selected.

(3) 前記カメラを選定する手段は、過去の選定実績に基づいて各カメラに重み付けを行って選定を行うようにした。 (3) The means for selecting the cameras is such that each camera is weighted based on the past selection results.

(4) 前記IDを識別する手段は、オブジェクトごとに得られる複数のID認識結果の多数決に基づいて当該オブジェクトのIDを識別するようにした。 (4) The means for identifying the ID is to identify the ID of the object based on a majority decision of a plurality of ID recognition results obtained for each object.

(5) 前記IDを識別する手段は、ID認識時の認識確率に基づいてIDの各認識結果に重み付けを行って前記多数決に反映するようにした。 (5) The means for identifying the ID is such that each recognition result of the ID is weighted based on the recognition probability at the time of ID recognition and reflected in the majority vote.

(6) 前記IDを識別する手段は、各オブジェクト画像を撮影したカメラに信頼度を設定する手段と、前記信頼度に基づいてIDの各認識結果に重み付けを行う手段とを具備した。 (6) The means for identifying the ID includes a means for setting the reliability of the camera that captured each object image and a means for weighting each recognition result of the ID based on the reliability.

(7) 前記信頼度を設定する手段は、前記IDの指向方向により近い位置のカメラに対してより高い信頼度を設定するようにした。 (7) The means for setting the reliability is to set a higher reliability for a camera located closer to the directivity direction of the ID.

(8) 前記信頼度を設定する手段は、IDの識別結果に対する認識結果の正答確率がより高い履歴を有するカメラにより高い信頼度を設定するようにした。 (8) The means for setting the reliability is to set a higher reliability for a camera having a history in which the probability of correct answer of the recognition result for the ID identification result is higher.

(9) フレーム間での各オブジェクト間の距離に基づいて各オブジェクトを追跡する追跡手段をさらに具備し、前記オブジェクトの向きを推定する手段は、前記オブジェクトの追跡結果に基づいて各オブジェクトの向きを推定するようにした。 (9) A tracking means for tracking each object based on the distance between each object between frames is further provided, and the means for estimating the orientation of the object determines the orientation of each object based on the tracking result of the object. I tried to estimate.

(10) 前記追跡手段による追跡結果を前記IDの識別結果に統合する追跡結果統合手段を更に具備した。 (10) A tracking result integration means for integrating the tracking result by the tracking means into the ID identification result is further provided.

(11) 過去フレームで抽出したオブジェクトのIDを今回フレームで識別できないと、過去フレームで抽出したオブジェクトの位置またはその近傍に前記IDを識別結果とするオブジェクトを割り当てる消失ID割当手段をさらに具備した。 (11) If the ID of the object extracted in the past frame cannot be identified in the frame this time, a vanishing ID allocation means for assigning the object whose identification result is the ID to the position of the object extracted in the past frame or its vicinity is further provided.

本発明によれば、以下のような効果が達成される。 According to the present invention, the following effects are achieved.

(1) 処理負荷の高いID認識を実行するオブジェクト画像を、オブジェクトの向き、あるいはIDの指向方向に基づいて予め選別したカメラから抽出したオブジェクト画像に限定するので、IDの認識確率が低いと推定されるオブジェクト画像を予め排除できる。したがって、ID認識に伴う処理負荷を低減しながらIDの識別確度を向上させることができる。 (1) Since the object images that perform ID recognition with a high processing load are limited to the object images extracted from the cameras selected in advance based on the orientation of the object or the orientation direction of the ID, it is estimated that the ID recognition probability is low. The object image to be created can be excluded in advance. Therefore, it is possible to improve the ID identification accuracy while reducing the processing load associated with ID recognition.

(2) ID認識用のオブジェクト画像を抽出するカメラを選定する際に、過去の選定実績に基づいて各カメラに重み付けを行うようにしたので、オブジェクトの向きやIDの指向方向とは別に、何らかの理由で潜在的に認識率の高いカメラを漏れなく選定できるようになる。 (2) When selecting a camera to extract an object image for ID recognition, each camera is weighted based on the past selection results, so something is different from the direction of the object and the direction of the ID. For this reason, it will be possible to select cameras with potentially high recognition rates without omission.

(3) オブジェクトごとに複数の得られるID認識の結果を統合して最終的にIDを識別する際に、IDの指向方向に対してより近いカメラや、正答履歴の高いカメラの優先度を高くして各認識結果に重み付けを行うようにしたので、ID識別の精度を向上させることができる。 (3) When integrating the results of multiple ID recognitions obtained for each object and finally identifying the ID, give priority to cameras that are closer to the directivity direction of the ID and cameras that have a high history of correct answers. Since each recognition result is weighted, the accuracy of ID identification can be improved.

(4) オブジェクトをフレーム間で追跡する手段を設け、フレーム間での追跡結果とフレームごとに得られるID認識の結果とを統合してIDを識別するようにしたので、ID識別の精度を更に向上させることができる。 (4) A means for tracking objects between frames is provided, and the tracking result between frames and the ID recognition result obtained for each frame are integrated to identify the ID, so that the accuracy of ID identification is further improved. Can be improved.

(5) 追跡および認識に失敗して消失したIDを、その喪失位置またはその近傍に割り当てるようにしたので、オブジェクト間にオクルージョンが発生してもID識別結果の連続的な出力が可能になる。 (5) Since the ID lost due to tracking and recognition failure is assigned to the lost position or its vicinity, continuous output of the ID identification result becomes possible even if occlusion occurs between objects.

本発明の一実施形態に係るオブジェクト識別装置の主要部の構成を示した機能ブロック図である。It is a functional block diagram which showed the structure of the main part of the object identification apparatus which concerns on one Embodiment of this invention. オブジェクトの向きを推定する方法を説明した図である。It is a figure explaining the method of estimating the orientation of an object. カメラの選定方法を説明した図（その１）である。It is a figure (the 1) which explained the selection method of a camera. カメラの選定方法を説明した図（その２）である。It is a figure (2) explaining the selection method of a camera. カメラごとのIDの認識結果を示した図である。It is a figure which showed the recognition result of the ID for each camera. 機械学習の一例を説明した図である。It is a figure explaining an example of machine learning. カメラ信頼度の設定方法を説明した図である。It is a figure explaining the setting method of the camera reliability. IDの認識結果に基づくIDの識別方法を説明した図である。It is a figure explaining the identification method of ID based on the recognition result of ID. 識別結果の表示例を示した図である。It is a figure which showed the display example of the identification result. 本発明の第２実施形態に係るオブジェクト識別装置の主要部の構成を示した機能ブロック図である。It is a functional block diagram which showed the structure of the main part of the object identification apparatus which concerns on 2nd Embodiment of this invention. 本発明の第３実施形態に係るオブジェクト識別装置の主要部の構成を示した機能ブロック図である。It is a functional block diagram which showed the structure of the main part of the object identification apparatus which concerns on 3rd Embodiment of this invention. 消失IDの割当方法を説明した図である。It is a figure explaining the allocation method of the disappearance ID.

以下、図面を参照して本発明の実施の形態について詳細に説明する。図１は、本発明の一実施形態に係るオブジェクト識別装置の主要部の構成を示した機能ブロック図である。本実施形態では、例えば自動車、二輪車またはスポーツの競技者のように、前部（正面）、後部（背面）および／または側面等に、ナンバープレート、背番号またはゼッケンのような識別子（ID）表示を備えたオブジェクトを認識対象としている。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. FIG. 1 is a functional block diagram showing a configuration of a main part of an object identification device according to an embodiment of the present invention. In this embodiment, an identifier (ID) such as a license plate, a number or a bib is displayed on the front (front), rear (rear) and / or side surface, for example, in a car, motorcycle or sports athlete. The object with is the recognition target.

複数のカメラ１０（Ca1，Ca2…）は、それぞれの設置の位置や向きが既知であり、複数のオブジェクトをそれぞれ異なる視点、視野で撮影して、そのカメラ映像をフレーム単位でそれぞれ出力する。 The positions and orientations of the plurality of cameras 10 (Ca1, Ca2 ...) Are known, and a plurality of objects are photographed from different viewpoints and fields of view, and the camera images are output in frame units.

オブジェクト位置推定部１１は、各カメラ映像からオブジェクトを抽出し、その位置座標を推定する。各オブジェクトの位置座標は、例えば、各カメラ映像からオブジェクトごとに抽出した複数のマスク画像に基づいて3Dモデル（例えば、Visual Hull）を生成して３次元空間に配置し、その重心位置を求めて各オブジェクトの位置座標とすることができる。 The object position estimation unit 11 extracts an object from each camera image and estimates its position coordinates. For the position coordinates of each object, for example, a 3D model (for example, Visual Hull) is generated based on a plurality of mask images extracted for each object from each camera image, arranged in a three-dimensional space, and the position of the center of gravity is obtained. It can be the position coordinates of each object.

あるいは、各オブジェクトに測位センサ等のデバイス装置を実装し、各デバイス装置が測位した位置信号に基づいて各オブジェクトの位置座標を求めても良い。各オブジェクトの位置座標は、２次元座標位置あるいは高さも含めた３次元座標位置のいずれであっても良い。 Alternatively, a device device such as a positioning sensor may be mounted on each object, and the position coordinates of each object may be obtained based on the position signal positioned by each device device. The position coordinates of each object may be either a two-dimensional coordinate position or a three-dimensional coordinate position including the height.

オブジェクト追跡部１２は、過去フレームのカメラ映像から抽出した各オブジェクトの位置座標と、今回フレームのカメラ映像から抽出した各オブジェクトの位置座標とに基づいて各オブジェクトを追跡する。すなわち、今回フレームから抽出した各オブジェクトが、過去フレームから抽出したオブジェクトといずれと一致するのかを突き止め、一致したオブジェクトに同一IDを付することで追跡を実現する。 The object tracking unit 12 tracks each object based on the position coordinates of each object extracted from the camera image of the past frame and the position coordinates of each object extracted from the camera image of the current frame. That is, tracking is realized by finding out which object each object extracted from the frame this time matches the object extracted from the past frame and assigning the same ID to the matched object.

本実施形態では、追跡を許容する最大追跡距離D_Tを予め設定しておき、今回フレームから抽出した全てのオブジェクトの現在の位置座標に対して、前回フレームから抽出した全てのオブジェクトの過去の位置座標との距離を計算し、最大追跡距離D_Tを下回る中で最も小さい距離を与えるオブジェクトペアを同一オブジェクトと推定する。 In the present embodiment, the maximum tracking distance D _T that allows tracking is set in advance, and the past positions of all the objects extracted from the previous frame with respect to the current position coordinates of all the objects extracted from the frame this time. The distance to the coordinates is calculated, and the object pair that gives the smallest distance below _{the maximum tracking distance D T is estimated to be the same object.}

この際、各オブジェクトからその色情報として、オブジェクトが例えばサッカー選手であればユニフォームや髪の色情報を取得し、色情報が大きく異なるオブジェクト同士は、たとえ距離が小さくても同一オブジェクトと判定しないなどの処理を加えて精度を高めてもよい。 At this time, as the color information from each object, for example, if the object is a soccer player, uniform or hair color information is acquired, and objects having significantly different color information are not determined to be the same object even if the distance is small. May be added to improve the accuracy.

他の追跡手法として、カルマンフィルタやパーティクルフィルタ等の時系列フィルタを適用し、追跡を行う手法も適用可能である。人物オブジェクトの追跡処理には、過去フレームから抽出した位置等の情報と、その情報に紐づくIDとが記録されている必要があるが、これは、後述する結果記録部２０がオブジェクト識別用データベース２１へ記録を行うことで実現できる。 As another tracking method, a method of tracking by applying a time series filter such as a Kalman filter or a particle filter can also be applied. In the tracking process of a person object, it is necessary to record information such as a position extracted from a past frame and an ID associated with the information. This is a database for object identification by the result recording unit 20 described later. This can be achieved by recording to 21.

オブジェクト向き推定部１３は、例えば前記オブジェクト追跡部１２による追跡結果に基づいてオブジェクトの向きを推定する。本実施形態では、図２に一例を示したように、オブジェクトごとに前回フレームにおける位置座標P0と今回フレームにおける位置座標P1とに基づいて移動ベクトルを求め、移動方向を正面（前部）、その反対方向を背面（後部）、移動方向と直交する方向を側面と推定することができる。 The object orientation estimation unit 13 estimates the orientation of an object based on, for example, the tracking result by the object tracking unit 12. In the present embodiment, as shown in FIG. 2, a movement vector is obtained for each object based on the position coordinate P0 in the previous frame and the position coordinate P1 in the current frame, and the movement direction is set to the front (front part). It can be estimated that the opposite direction is the back surface (rear part) and the direction orthogonal to the moving direction is the side surface.

あるいは、各オブジェクトに加速度センサを実装し、その出力を公知の姿勢推定アルゴリズムに適用することで各オブジェクトの姿勢を推定し、その推定結果に基づいてオブジェクトの向きを推定するようにしても良い。 Alternatively, an acceleration sensor may be mounted on each object, the output thereof may be applied to a known posture estimation algorithm to estimate the posture of each object, and the orientation of the object may be estimated based on the estimation result.

特徴位置取得部１４は、オブジェクト識別用データベース２１から、ID認識を行う際に優先すべきオブジェクトの特徴位置に関する情報を取得する。オブジェクトがサッカー選手であって、そのIDが背番号であれば、オブジェクトの向きと反対側（背中側）が特徴位置である旨の情報を取得する。同様に、オブジェクトが自動車であって、そのIDがナンバープレートであれば、オブジェクトの前方側及び後方側が特徴位置である旨の情報を取得する。 The feature position acquisition unit 14 acquires information on the feature position of the object that should be prioritized when performing ID recognition from the object identification database 21. If the object is a soccer player and its ID is a uniform number, information that the side opposite to the direction of the object (back side) is the feature position is acquired. Similarly, if the object is a car and its ID is a license plate, the information that the front side and the rear side of the object are the feature positions is acquired.

カメラ選定部１５は、ID指向方向推定部１５１および範囲設定部１５２を含み、各オブジェクトの向きの推定結果および特徴位置、さらには各オブジェクト位置座標に基づいて、オブジェクトごとにそのID認識を実行するカメラを選定する。 The camera selection unit 15 includes an ID direction direction estimation unit 151 and a range setting unit 152, and executes ID recognition for each object based on the estimation result of the orientation of each object, the feature position, and the position coordinates of each object. Select a camera.

前記ID指向方向推定部１５１は、オブジェクトに付されているIDの指向方向を推定する。例えば、オブジェクトがサッカー選手であれば、その特徴位置が背中（背番号）である旨の情報を取得できるので、オブジェクトの背面方向をIDの指向方向と推定できる。 The ID directivity direction estimation unit 151 estimates the directivity direction of the ID attached to the object. For example, if the object is a soccer player, information that the feature position is the back (number) can be acquired, so that the back direction of the object can be estimated as the direction of the ID.

範囲設定部１５２は、IDの指向方向を基準に所定の選定範囲内に位置するカメラをID認識に好適なカメラ群と推定する。オブジェクトがサッカー選手であれば、図３に示したように、IDの指向方向（背面方向）を基準に±θaの角度範囲がカメラの選定範囲とされる。 The range setting unit 152 estimates that a camera located within a predetermined selection range based on the directivity direction of the ID is a group of cameras suitable for ID recognition. If the object is a soccer player, as shown in FIG. 3, the angle range of ± θa with respect to the directivity direction (backward direction) of the ID is set as the camera selection range.

また、オブジェクトが自動車であれば、その特徴位置が車体の正面および後部である旨の情報を取得できるので、オブジェクトの背面方向のみならず正面方向もIDの指向方向と推定できる。したがって、図４に示したように、前記背面方向に加えて正面方向を基準に±θaの角度範囲がカメラの選定範囲とされる。 Further, if the object is an automobile, it is possible to acquire information that the feature positions are the front and the rear of the vehicle body, so that not only the back direction of the object but also the front direction can be estimated as the direction of the ID. Therefore, as shown in FIG. 4, the angle range of ± θa with respect to the front direction in addition to the back direction is set as the camera selection range.

前記カメラ選定部１５は、前記選定範囲内に位置するカメラを当該オブジェクトに関するID認識用のカメラとして選定する。オブジェクトごとに選定されたカメラの識別子はデータベース２１に蓄積される。 The camera selection unit 15 selects a camera located within the selection range as a camera for ID recognition related to the object. The camera identifiers selected for each object are stored in the database 21.

なお、カメラ選定部１５は前記IDの指向方向に対して所定の角度範囲内に位置する全てのカメラを選定する必要は無く、角度や過去の選定実績に基づいて各カメラに重み付けを行って選定を行うようにしても良い。例えば、IDの指向方向に近いカメラほど重み値を増す一方、過去に選定された実績の多いカメラほど重み値を増し、重み値の総和の大きい上位Nベストのカメラを選定するようにしても良い。 It is not necessary for the camera selection unit 15 to select all the cameras located within a predetermined angle range with respect to the directivity direction of the ID, and each camera is weighted and selected based on the angle and the past selection results. You may try to do. For example, the weight value may be increased as the camera is closer to the directivity direction of the ID, while the weight value may be increased as the camera has a large number of achievements selected in the past, and the top N best camera having a large sum of weight values may be selected. ..

また、上記のカメラ選定は全フレームで行う必要はなく、カメラの総数がn台であるとき、数ないし数十フレームに一度はｎ台全てのカメラを選定し、それ以外ではm（＜n）台のカメラのみを選定するようにしても良い。なお、カメラの選定数は固定値に限定されず、所定の条件を満たすカメラを全て選定するようにしても良い。 In addition, the above camera selection does not have to be performed for all frames. When the total number of cameras is n, all n cameras are selected once every several to several tens of frames, and m (<n) otherwise. You may choose only one camera. The number of selected cameras is not limited to a fixed value, and all cameras satisfying a predetermined condition may be selected.

オブジェクト画像抽出部１６は、オブジェクトごとに前記選定されたカメラのカメラ映像からオブジェクト画像を抽出する。オブジェクト画像は所定の画像フォーマット形式で出力してもよいし、単なる輝度値やカラー画像以外のデプス画像等として抽出してもよい。 The object image extraction unit 16 extracts an object image from the camera image of the camera selected for each object. The object image may be output in a predetermined image format, or may be extracted as a mere luminance value, a depth image other than a color image, or the like.

前記オブジェクト画像抽出部１６はさらに、各オブジェクト画像の抽出位置と前記オブジェクト位置推定部１１より推定された位置とを照合することで、同一オブジェクトの画像と位置とを紐付けて管理する。 The object image extraction unit 16 further collates the extraction position of each object image with the position estimated by the object position estimation unit 11 to manage the image and the position of the same object in association with each other.

ID取得部１７は、選定したカメラ映像から抽出したオブジェクト画像上でID認識を実行してオブジェクトIDを取得する。 The ID acquisition unit 17 acquires the object ID by executing ID recognition on the object image extracted from the selected camera image.

図５は、IDとして「２４」の背番号が付されたオブジェクトに対して、８台のカメラCa1〜Ca8でID認識を実行した際のカメラの位置と認識結果との関係を示した図であり、IDの指向方向に位置するカメラCa6，Ca7では正しい認識結果が得られているが、IDの指向方向から外れたカメラCa3，Ca8では誤った認識結果が得られている。なお、なお「認識失敗」とは、認識確率が極めて低いために認識結果を得られなかった場合を示している。本実施形態によれば、IDの指向方向に位置するカメラCa6，Ca7では高い認識結果が得られることが判る。前記認識結果は、その認識確率と共にオブジェクト識別用データベース２１に蓄積される。 FIG. 5 is a diagram showing the relationship between the position of the cameras and the recognition result when ID recognition is executed by the eight cameras Ca1 to Ca8 for the object with the uniform number "24" as the ID. Yes, the correct recognition results are obtained with the cameras Ca6 and Ca7 located in the direction of the ID, but the incorrect recognition results are obtained with the cameras Ca3 and Ca8 that are not in the direction of the ID. Note that "recognition failure" indicates a case where a recognition result cannot be obtained because the recognition probability is extremely low. According to this embodiment, it can be seen that high recognition results can be obtained with the cameras Ca6 and Ca7 located in the directivity direction of the ID. The recognition result is accumulated in the object identification database 21 together with the recognition probability.

ID取得部１７におけるID認識には機械学習を用いることができる。例えば、背番号をIDとして認識する際に機械学習を採用するのであれば、背番号の映ったオブジェクト画像を入力すると、予測した認識結果（背番号が何番であるかという推測結果）を取得できるモデルを作成する必要があるため、最初に学習画像を用いて背番号認識用のモデルを生成する。 Machine learning can be used for ID recognition in the ID acquisition unit 17. For example, if machine learning is adopted when recognizing a uniform number as an ID, when an object image showing the uniform number is input, the predicted recognition result (estimation result of what the uniform number is) is acquired. Since it is necessary to create a model that can be used, a model for number recognition is first generated using the training image.

このようなモデル作成は、例えば、大量の学習画像を用意し、畳み込みニューラルネットワークを用いて背番号認識用のモデルを作成する。学習画像の生成については、背番号が映っている画像を大量に用意して正解ラベルを手動で付与してもよいが、図６に一例を示したように、任意の背景画像にIDを表すフォント等を重ねて人工的に学習画像を生成うるようにしてもよい。 In such model creation, for example, a large number of training images are prepared, and a model for uniform number recognition is created using a convolutional neural network. Regarding the generation of the learning image, a large number of images showing the uniform number may be prepared and the correct answer label may be manually assigned, but as shown in FIG. 6, the ID is represented in an arbitrary background image. The learning image may be artificially generated by superimposing fonts and the like.

後者の方法によれば、自動で正解ラベルの付与された学習画像を生成できるため、手動で正解ラベルを割り付ける必要がなく効率的である。また、初めからフォントを回転させたり、歪ませたり、サイズを調節したりすることでさまざまな学習画像を生成できるので、オブジェクト画像上でIDが多少斜めを向いていたり、綺麗に切り取られていなくても、精度の高い認識が可能になる。 According to the latter method, since the learning image with the correct answer label can be automatically generated, it is not necessary to manually assign the correct answer label, which is efficient. Also, since various learning images can be generated by rotating, distorting, and adjusting the size of the font from the beginning, the ID is slightly diagonal on the object image, or it is not cut out neatly. However, highly accurate recognition becomes possible.

また、モデルの生成方法としては、例では畳み込みニューラルネットワークを取り上げているが、ID認識が可能であれば、どのような学習手法を用いてもよい。そして、生成したID認識用のモデルを用いて、カメラ映像より抽出されたオブジェクト画像を認識させることでIDを推測して出力する。このとき、IDの認識結果に加えて認識結果の確率（認識確率）を出力してもよい。 In addition, as a model generation method, a convolutional neural network is taken up in the example, but any learning method may be used as long as ID recognition is possible. Then, using the generated model for ID recognition, the ID is estimated and output by recognizing the object image extracted from the camera image. At this time, the probability of the recognition result (recognition probability) may be output in addition to the ID recognition result.

機械学習であれば、ID認識用モデルの出力層の活性化関数にsoftmax関数を用いることで認識確率を算出できる。また、全てのカメラからの結果が「認識失敗」と判定される場合には、ここでのIDそのものを「認識失敗」として、その結果を後段に渡すことも可能である。 In the case of machine learning, the recognition probability can be calculated by using the softmax function as the activation function of the output layer of the ID recognition model. Further, when the results from all the cameras are determined to be "recognition failure", the ID itself here can be regarded as "recognition failure" and the result can be passed to the subsequent stage.

認識結果統合部１８は、認識成否判定部１８１およびカメラ信頼度設定部１８２を含み、オブジェクトごとに選定した複数のカメラ映像上で実行したID認識の結果を一つに統合することでIDを識別する。 The recognition result integration unit 18 includes a recognition success / failure determination unit 181 and a camera reliability setting unit 182, and identifies IDs by integrating the results of ID recognition executed on a plurality of camera images selected for each object. do.

前記認識成否判定部１８１は、認識確率が所定の基準値を下回る結果や、実際には想定できない認識結果が得られると、これらを"認識失敗"に分類することで確度の低い認識結果がID識別に反映されることを防止する。 When the recognition success / failure determination unit 181 obtains a result in which the recognition probability falls below a predetermined reference value or a recognition result that cannot be actually expected, the recognition result with low accuracy is ID by classifying these as "recognition failure". Prevent it from being reflected in the identification.

例えば、誤った認識結果が多数得られるようなケースでは、個々の認識確率が低くても当該誤った認識結果が識別結果とされる可能性を否定できない。また、例えば野球やサッカー等の競技であれば、出現するIDのリスト（背番号やゼッケン）を予め取得できるので、当該リストにない認識結果は"認識失敗"に分類することで、当該認識結果が識別結果とされる可能性を低減できる。 For example, in a case where a large number of erroneous recognition results are obtained, it cannot be denied that the erroneous recognition results may be regarded as identification results even if the individual recognition probabilities are low. Also, in the case of competitions such as baseball and soccer, a list of appearing IDs (numbers and bibs) can be obtained in advance, so recognition results that are not on the list can be classified as "recognition failure". Can be reduced as an identification result.

カメラ信頼度設定部１８２は、前記オブジェクトごとに選定されたカメラごとに、ID指向方向とカメラ方向との角度差に基づいて信頼度を設定する。 The camera reliability setting unit 182 sets the reliability for each camera selected for each object based on the angle difference between the ID directivity direction and the camera direction.

図７は、カメラ信頼度の設定方法の一例を示した図であり、IDの指向方向とカメラの方向とが角度的にどれだけ離れているかを求め、角度差が小さいカメラほど、その信頼度を高く設定する。 FIG. 7 is a diagram showing an example of a method of setting the camera reliability. It is obtained how much the direction of the ID and the direction of the camera are angularly separated, and the smaller the angle difference, the higher the reliability. Is set high.

図示の例では、IDの指向方向とカメラCa7の方向との角度差θ7を、IDの指向方向とカメラCa6の方向との角度差θ6と比較し、ここではθ7＜θ6なので、カメラCa7の信頼度をカメラCa6の信頼度よりも高く設定する。なお、θ7＜θ6であれば、cosθ7＞cosθ6となるため、本実施形態ではIDの指向方向とカメラの方向とがなす角度のcos値をカメラ信頼度として採用する。 In the illustrated example, the angle difference θ7 between the direction of the ID and the direction of the camera Ca7 is compared with the angle difference θ6 between the direction of the ID and the direction of the camera Ca6. Here, θ7 <θ6, so the reliability of the camera Ca7 Set the degree higher than the reliability of the camera Ca6. If θ7 <θ6, then cosθ7> cosθ6. Therefore, in this embodiment, the cos value of the angle between the directivity direction of the ID and the direction of the camera is adopted as the camera reliability.

さらに、IDの識別結果に対する認識結果の正答確率がより高い履歴を有するカメラに対して、より高い信頼度が設定されるようにしても良い。 Further, a higher reliability may be set for a camera having a history in which the probability of correct answer of the recognition result for the ID identification result is higher.

前記認識結果統合部１８は、認識結果ごとに、前記認識成否の判定、ID認識における認識確率およびカメラ信頼度に基づいて最尤のIDを識別する。 The recognition result integration unit 18 identifies the maximum likelihood ID for each recognition result based on the determination of success or failure of recognition, the recognition probability in ID recognition, and the camera reliability.

図８は、認識結果統合部１８によるIDの識別方法を模式的に示した図であり、ここでは一つのオブジェクトに注目し、当該オブジェクトに関して８つのカメラCc1〜Cc8が選定されている場合を例にして説明する。 FIG. 8 is a diagram schematically showing an ID identification method by the recognition result integration unit 18. Here, attention is paid to one object, and an example is obtained in which eight cameras Cc1 to Cc8 are selected for the object. I will explain.

本実施形態では、前記認識成否判定部１８１によりカメラCa1，Ca2およびCa4については認識失敗と判別されているので統合対象から除外される。それ以外のカメラに関しては、認識確率とカメラ信頼度との積が評価値とされる。本実施形態では、カメラCa3はカメラ信頼度が"０"なので評価値は"０"となる。カメラCa5は、認識確率が"0.8"、カメラ信頼度が"0.4"なので、その評価値は"0.32"となっている。 In the present embodiment, the cameras Ca1, Ca2, and Ca4 are excluded from the integration target because they are determined to be recognition failures by the recognition success / failure determination unit 181. For other cameras, the product of the recognition probability and the camera reliability is used as the evaluation value. In the present embodiment, the camera Ca3 has a camera reliability of "0", so the evaluation value is "0". The camera Ca5 has a recognition probability of "0.8" and a camera reliability of "0.4", so its evaluation value is "0.32".

同様に、カメラCa6，Ca7，Ca8は、認識確率がそれぞれ"0.8"，"1.0"， "0.8"であり、カメラ信頼度がそれぞれ"0.7"，"0.9"，"0.6"なので、それぞれの評価値は"0.56"，"0.9"，"0.48"となっている。 Similarly, the cameras Ca6, Ca7, and Ca8 have recognition probabilities of "0.8", "1.0", and "0.8", respectively, and camera reliability of "0.7", "0.9", and "0.6", respectively. The values are "0.56", "0.9", and "0.48".

認識結果統合部１８はさらに、認識結果ごとに評価値の総和を求め、最大値を与える認識結果をID識別の結果として採用する。本実施形態では、認識結果「26」の評価値総和が０であり、認識結果「24」の評価値総和が1.78であり、認識結果「28」の評価値総和が0.48なので、認識結果「24」がIDの識別結果とされる。 The recognition result integration unit 18 further obtains the sum of the evaluation values for each recognition result, and adopts the recognition result that gives the maximum value as the result of the ID identification. In the present embodiment, the total evaluation value of the recognition result "26" is 0, the total evaluation value of the recognition result "24" is 1.78, and the total evaluation value of the recognition result "28" is 0.48. Therefore, the recognition result "24" Is the ID identification result.

結果表示部１９は、今回フレームから抽出した各オブジェクトの位置座標およびIDの識別結果を表示する。結果の表示方法には様々あり、コンソール上に位置座標とIDを数値として表示させるだけでもよいが、結果をグラフィカルに表現する一つの例として、図９に一例を示したように、オブジェクトの位置にIDを重畳表示する平面マップがある。 The result display unit 19 displays the identification result of the position coordinates and the ID of each object extracted from the frame this time. There are various ways to display the result, and it is sufficient to display the position coordinates and ID as numerical values on the console. However, as an example of graphically expressing the result, as shown in FIG. 9, the position of the object. There is a plane map that superimposes the ID on.

図９では、サッカーコートの半面を模した背景上に、各選手の位置座標に基づいた丸型の色付き（図では、丸型の線種）マーカを配置している。マーカ上に重ねて表示されている数字が背番号を示すIDである。 In FIG. 9, a round colored (round line type in the figure) markers based on the position coordinates of each player are arranged on a background imitating one side of a soccer court. The number displayed on top of the marker is the ID indicating the uniform number.

このような平面マップを、カメラ映像のフレームごとに出力して、動画的に動かすような表示方法も可能である。また、この表示の際に、例えば画像からユニフォームの色の情報を取得することで、選手の所属チームを判断し、その結果でマーカの色を変化させて平面マップに反映させてもよい。このとき、ユニフォームの色について、審判の色であると判断された人物については、選手ではないと判断して結果の表示から除外してもよい。 It is also possible to output such a plane map for each frame of the camera image and move it in a moving manner. Further, at the time of this display, for example, by acquiring the uniform color information from the image, the team to which the player belongs may be determined, and the color of the marker may be changed and reflected in the plane map as a result. At this time, regarding the color of the uniform, the person who is judged to be the color of the referee may be judged not to be a player and excluded from the display of the result.

結果記録部２０は、今回フレームから抽出したオブジェクトの位置座標や最終的なID識別結果等をデータベース２１に記録することで、次フレーム以降でのIDの推定に役立てることを目的としている。 The result recording unit 20 aims to use the result recording unit 20 for estimating the ID in the next frame and thereafter by recording the position coordinates of the object extracted from the frame this time, the final ID identification result, and the like in the database 21.

本実施形態では、現フレームにおいて得られたオブジェクトの位置座標と、そのオブジェクトに対応するIDの識別結果とが保存される。これは次フレームで追跡処理を行う際に不可欠となる情報である。 In the present embodiment, the position coordinates of the object obtained in the current frame and the identification result of the ID corresponding to the object are saved. This is indispensable information when performing the tracking process in the next frame.

また、後述する「追跡結果統合部２２」や「消失ID割当部２３」において、信頼度を用いて最適なIDを判断すべく信頼度を導入する場合には、信頼度をIDに紐づけてオブジェクト識別用データベース２１に保存し、次フレーム以降で追跡や消失ID割当が成功した場合に、現フレームの信頼度をそのまま引き継ぐ（あるいは、時間経過に応じて信頼度を割り引いて引き継ぐ）機能を付加してもよい。 Further, when introducing the reliability in order to determine the optimum ID by using the reliability in the "tracking result integration unit 22" and the "disappearing ID allocation unit 23" described later, the reliability is linked to the ID. Saved in the object identification database 21 and added a function to take over the reliability of the current frame as it is (or to take over by discounting the reliability according to the passage of time) when tracking or disappearance ID assignment is successful after the next frame. You may.

本実施形態によれば、処理負荷の高いID認識を実行するオブジェクト画像を、オブジェクトの向き、あるいはIDの指向方向に基づいて予め選別したカメラから抽出したオブジェクト画像に限定するので、IDの認識確率が低いと推定されるオブジェクト画像を予め排除できる。したがって、ID認識に伴う処理負荷を低減しながらIDの識別確度を向上させることができる。 According to the present embodiment, the object image that executes ID recognition with a high processing load is limited to the object image extracted from the camera selected in advance based on the orientation of the object or the orientation direction of the ID. Therefore, the ID recognition probability. Object images that are presumed to have a low value can be excluded in advance. Therefore, it is possible to improve the ID identification accuracy while reducing the processing load associated with ID recognition.

また、本実施形態によれば、ID認識用のオブジェクト画像を抽出するカメラを選定する際に、過去の選定実績に基づいて各カメラに重み付けを行うようにしたので、オブジェクトの向きやIDの指向方向とは別に、何らかの理由で潜在的に認識率の高いカメラを漏れなく選定できるようになる。 Further, according to the present embodiment, when selecting a camera for extracting an object image for ID recognition, each camera is weighted based on the past selection results, so that the orientation of the object and the orientation of the ID Apart from the direction, it will be possible to select a camera with a potentially high recognition rate for some reason without omission.

さらに、本実施形態によれば、オブジェクトごとに複数の得られるID認識の結果を統合して最終的にIDを識別する際に、IDの指向方向に対してより近いカメラや、正答履歴の高いカメラの優先度を高くして各認識結果に重み付けを行うようにしたので、ID識別の精度を向上させることができる。 Further, according to the present embodiment, when a plurality of ID recognition results obtained for each object are integrated to finally identify the ID, a camera closer to the direction in which the ID is directed and a history of correct answers are high. Since the priority of the camera is raised and each recognition result is weighted, the accuracy of ID identification can be improved.

図１０は、本発明の第２実施形態に係るオブジェクト識別装置の構成を示した機能ブロック図であり、前記と同一の符号は同一又は同等部分を表している。本実施形態は、認識結果統合部１８の後段に追跡結果統合部２２を更に設けた点に特徴がある。 FIG. 10 is a functional block diagram showing the configuration of the object identification device according to the second embodiment of the present invention, and the same reference numerals as those described above represent the same or equivalent parts. This embodiment is characterized in that a tracking result integration unit 22 is further provided after the recognition result integration unit 18.

追跡結果統合部２２は、前記認識結果統合部１８による統合結果と前記オブジェクト追跡部１２による追跡結果とを更に統合し、前記統合結果に追跡結果を反映することでID識別の確度を更に向上させる。 The tracking result integration unit 22 further integrates the integration result by the recognition result integration unit 18 and the tracking result by the object tracking unit 12, and reflects the tracking result in the integration result to further improve the accuracy of ID identification. ..

本実施形態では、IDの追跡結果と識別結果とが一致した場合を追跡成功と認識してその回数を記憶しておき、その後、追跡結果と識別結果とが不一致となったとき、追跡成功の回数や割合に応じて追跡結果および識別結果の一方を優先させる。 In the present embodiment, the case where the ID tracking result and the identification result match is recognized as the tracking success and the number of times is stored. After that, when the tracking result and the identification result do not match, the tracking is successful. Priority is given to either the tracking result or the identification result according to the number of times or the ratio.

たとえば、複数フレームに渡って追跡結果と識別結果とが「21」で一致していたオブジェクトに関して、たまたま今回フレームだけ「27」と認識された場合、追跡成功の回数が所定の閾値以上であれば、今回の識別結果に関わらず「21」と識別する一方、追跡成功の回数が所定の閾値未満であれば追跡結果に関わらず識別結果を優先して「27」と識別することができる。 For example, for an object whose tracking result and identification result match at "21" over multiple frames, if only the frame happens to be recognized as "27" this time, if the number of successful tracking is greater than or equal to a predetermined threshold. On the other hand, if the number of successful tracking is less than a predetermined threshold value, the identification result can be preferentially identified as "27" regardless of the tracking result.

このような実装を行うことで、精度の向上が可能である。ここで出力されるIDは、位置情報に対して1つのIDが出力される形でもよいし、信頼度と紐づけて複数のIDが出力されるような実装であってもよい。 By implementing such an implementation, it is possible to improve the accuracy. The ID output here may be a form in which one ID is output for the position information, or an implementation in which a plurality of IDs are output in association with the reliability.

本実施形態によれば、フレーム間での追跡結果とフレームごとに得られるID認識の結果とを統合してIDを識別するようにしたので、ID識別の精度を更に向上させることができる。 According to the present embodiment, since the ID is identified by integrating the tracking result between frames and the ID recognition result obtained for each frame, the accuracy of ID identification can be further improved.

図１１は、本発明の第３実施形態に係るオブジェクト識別装置の構成を示した機能ブロック図であり、前記と同一の符号は同一又は同等部分を表している。本実施形態は、追跡結果統合部２２の後段（または、第１実施形態における認識結果統合部１８の後段）に消失ID割当部２３を更に設けた点に特徴がある。 FIG. 11 is a functional block diagram showing the configuration of the object identification device according to the third embodiment of the present invention, and the same reference numerals as those described above represent the same or equivalent parts. The present embodiment is characterized in that the disappearance ID allocation unit 23 is further provided in the subsequent stage of the tracking result integration unit 22 (or the subsequent stage of the recognition result integration unit 18 in the first embodiment).

消失ID割当部２３は、前々回フレームまで認識できていたIDを前回フレームで認識できなくなると、今回フレームにおいて、オブジェクトは抽出できたがIDを識別できなかったオブジェクトの中から、最後に認識できた前々回フレームでのID認識位置近傍に位置しているオブジェクトを選択し、当該オブジェクトに対して、前記最後に認識できたIDを今回の認識結果とは無関係に割り当てる。 When the disappeared ID allocation unit 23 can no longer recognize the ID that could be recognized up to the previous frame in the previous frame, in this frame, the object could be extracted but the ID could not be identified at the end. An object located near the ID recognition position in the frame before the previous time is selected, and the last recognized ID is assigned to the object regardless of the recognition result this time.

図１２は、消失ID割当部２３の機能を示した図であり、ここでは、前々回フレームでは識別できていたID「10」を前回フレームでは識別できなかったため、IDの消失位置又はその近傍に、当該消失したID「10」を識別結果とするオブジェクトを認識結果とは無関係に割り当てる例を示している。 FIG. 12 is a diagram showing the function of the disappearance ID allocation unit 23. Here, since the ID “10” that could be identified in the previous frame could not be identified in the previous frame, the ID “10” could not be identified in the previous frame. An example is shown in which an object whose identification result is the lost ID "10" is assigned regardless of the recognition result.

オブジェクト同士がカメラ視点で接近したり重なったりするシーンでは、２つのオブジェクトが１つであると判定されてしまったり、重なって見えなくなってしまうことから、前回フレームでは存在していたオブジェクトが今回フレームのカメラ映像上では消失したように扱われてしまうことがある。 In a scene where objects approach or overlap from the camera's point of view, the two objects are judged to be one, or they overlap and become invisible, so the object that existed in the previous frame is now in the frame. It may be treated as if it disappeared on the camera image of.

このような場合、時間経過と共にオブジェクトの密集具合や重なりが解消されれば再び近傍にオブジェクトが認識されるが、オブジェクトが突然現れたように見えるため、認識や追跡をやり直す必要がある。 In such a case, if the density and overlap of the objects are resolved with the passage of time, the objects will be recognized in the vicinity again, but since the objects appear to appear suddenly, it is necessary to redo the recognition and tracking.

通常、このような状況が発生する場合、オブジェクトが再度登場する位置は、消失位置（最後に推定されたオブジェクトの位置：図１２(a)）の近傍であることが多い。そこで、本実施形態ではオブジェクトの消失位置を予め記録しておき、今回フレームで推定されたオブジェクトの位置と過去フレームで推定、記録された消失位置との距離を計算し、最大消失距離D_Vを下回る中で最も近い位置のオブジェクトに対して消失したIDを割り当てるようにしている。 Normally, when such a situation occurs, the position where the object reappears is often in the vicinity of the disappearance position (the position of the last estimated object: FIG. 12 (a)). Therefore, in the present embodiment may be recorded in advance disappearance position of the object, location and estimated in the past frame was estimated in the current frame object, to calculate the distance between the recorded erasure position, the maximum loss distance D _V The lost ID is assigned to the object at the closest position among the objects below.

最大消失距離D_Vは、ユーザが事前に不変のパラメータとして設定しても良いし、状況に応じて変化させてもよい。状況に応じて変化させる場合は、対象としているオブジェクトを中心に、ある領域内に他のオブジェクトがいくつ存在しているかをカウントし、それを密集度として算出する。そして、その密集度に基づいてオブジェクトごとに最大消失距離D_Vを動的に変化させることが考えられる。 Maximum loss distance D _V is, the user may set in advance as the unchanged parameters may be changed depending on the situation. When changing according to the situation, the number of other objects existing in a certain area is counted around the target object, and it is calculated as the density. Then, it is conceivable to dynamically change the maximum loss distance D _V for each object based on the density.

一般的に、オブジェクトが密集している状況であればあるほど、消失ID割当で間違ったIDが割り付けられる可能性が高まるため、最大消失距離D_Vを小さくすることで間違ったIDの割り当てを回避することが可能となる。 Generally, the more if the situation in which the object is dense, because the possibility of erroneous ID is allocated by the disappearance ID allocation is increased, avoiding the allocation of wrong ID by reducing the maximum loss distance D _V It becomes possible to do.

なお、消失ID割当部２３を具備する場合、前記結果記録部２０は、現フレームで識別したIDと、一つ前のフレームで識別したIDを比較することで、前フレームから現フレームの間で消失したIDを突き止め、その位置（消失位置）を記録しておく。これは、次フレーム以後の消失ID割当において、消失位置を必要とするために記録を行っている。また、消失しているとされていたIDが現フレームで現れた場合には、そのIDに関する消失位置情報は消去する。 When the vanishing ID allocation unit 23 is provided, the result recording unit 20 compares the ID identified in the current frame with the ID identified in the previous frame between the previous frame and the current frame. Find the lost ID and record its position (lost position). This is recorded because the disappearance position is required in the disappearance ID allocation after the next frame. If an ID that was supposed to be lost appears in the current frame, the lost position information related to that ID is deleted.

加えて、消失IDを記録する際に、ユニフォームの色などの情報から所属チームなどを解析できているのであれば、そのような付加情報を消失位置に紐づけて記録し、消失ID割当部２３にて割り当てを行う際の助けとしてもよい。 In addition, when recording the disappearance ID, if the team to which the team belongs can be analyzed from information such as the color of the uniform, such additional information is recorded in association with the disappearance position, and the disappearance ID assignment unit 23 It may be helpful when making an assignment at.

また、本装置を用いてIDの識別結果を表示する際に、認識と追跡、さらには消失ID割当も行われず、有意なIDが存在しないというケースも起こり得る。その場合には「認識結果なし」としてもよいし、何らかのランダムなIDを割り当ててもよい。 In addition, when displaying the ID identification result using this device, recognition and tracking, and further disappearance ID assignment are not performed, and there may be a case where a significant ID does not exist. In that case, "no recognition result" may be set, or some random ID may be assigned.

本実施形態によれば、追跡および認識に失敗して消失したIDを、その喪失位置またはその近傍に割り当てることができるので、オブジェクト間にオクルージョンが発生してもID識別結果の連続的な出力が可能になる。 According to the present embodiment, the ID lost due to the failure of tracking and recognition can be assigned to the lost position or its vicinity, so that the continuous output of the ID identification result can be obtained even if occlusion occurs between the objects. It will be possible.

１０…カメラ，１１…オブジェクト位置推定部，１２…オブジェクト追跡部，１３…オブジェクト向き推定部，１４…特徴位置取得部，１５…カメラ選定部，１６…オブジェクト画像抽出部，１７…ID取得部，１８…認識結果統合部，１９…結果表示部，２０…結果記録部，２１…オブジェクト識別用データベース，２２…追跡結果統合部，２３…消失ID割当部，１５１…ID指向方向推定部，１５２…範囲設定部，１８１…認識成否判定部，１８２…カメラ信頼度設定部 10 ... camera, 11 ... object position estimation unit, 12 ... object tracking unit, 13 ... object orientation estimation unit, 14 ... feature position acquisition unit, 15 ... camera selection unit, 16 ... object image extraction unit, 17 ... ID acquisition unit, 18 ... Recognition result integration unit, 19 ... Result display unit, 20 ... Result recording unit, 21 ... Object identification database, 22 ... Tracking result integration unit, 23 ... Loss ID allocation unit, 151 ... ID direction direction estimation unit, 152 ... Range setting unit, 181 ... Recognition success / failure judgment unit, 182 ... Camera reliability setting unit

Claims

In an object identification device that identifies each object based on the ID attached to the object
With multiple cameras that shoot objects from different perspectives,
A means of estimating the position of each object,
A means of estimating the orientation of each object,
A means of selecting a camera for ID recognition based on the orientation of each object,
A means for extracting an object image from the images of the selected cameras, and
A means to perform ID recognition for each object image,
A means of integrating the ID recognition results of objects to identify IDs,
It is provided with a means for outputting the position of the object and the identification result of the ID in association with each other .
The means for selecting a camera is an object identification device characterized in that each camera is weighted and selected based on a past selection record.

The object identification device according to claim 1, wherein the means for estimating the orientation of the object is to estimate the orientation of the object based on a change in the position of each object between frames.

It also has a means of retrieving information about the location of the ID on the object.
The means for selecting the camera is
For each object, a means of calculating the direction of the ID based on its orientation and the position of the ID,
Each object is provided with a means for obtaining a camera selection range based on the directivity direction of the ID.
The object identification device according to claim 1 or 2, wherein a camera located within the selection range is selected.

The object identification device according to claim 3 , wherein the means for identifying the ID is to identify the ID of the object based on a majority decision of a plurality of ID recognition results obtained for each object.

The object identification device according to claim 4 , wherein the means for identifying the ID is such that each recognition result of the ID is weighted based on the recognition probability at the time of ID recognition and reflected in the majority vote.

The means for identifying the ID is
A means to set the reliability of the camera that took each object image,
The object identification device according to claim 4 or 5 , further comprising a means for weighting each recognition result of an ID based on the reliability.

The object identification device according to claim 6 , wherein the means for setting the reliability is to set a higher reliability for a camera located closer to the directivity direction of the ID.

The object identification device according to claim 6 , wherein the means for setting the reliability is to set a high reliability by a camera having a history in which the probability of correct answer of the recognition result with respect to the ID identification result is higher.

Further equipped with a tracking means for tracking each object based on the distance between each object between frames,
The object identification device according to any one of claims 1 to 8 , wherein the means for estimating the orientation of the object estimates the orientation of each object based on the tracking result of the object.

The object identification device according to claim 9 , further comprising a tracking result integrating means for integrating the tracking result by the tracking means into the identification result of the ID.

If the ID of the object extracted in the past frame cannot be identified in the frame this time, the vanishing ID allocation means for assigning the object whose identification result is the ID to the position of the object extracted in the past frame or its vicinity is further provided. The object identification device according to claim 9 or 10.

In the object identification method that identifies each object based on the ID attached to the object,
Shoot objects with multiple cameras from different viewpoints,
Estimate the position of each object
Estimate the orientation of each object
For each object, select a camera for ID recognition based on its orientation,
An object image is extracted from the images of each of the selected cameras.
ID recognition is executed on the object image,
Identifies the ID by integrating the ID recognition results of the object,
Outputs the object position and ID identification result in association with each other .
An object identification method characterized in that when selecting a camera, each camera is weighted and selected based on a past selection record.