JP7488673B2

JP7488673B2 - MOVING OBJECT TRACKING DEVICE, MOVING OBJECT TRACKING METHOD, AND MOVING OBJECT TRACKING PROGRAM

Info

Publication number: JP7488673B2
Application number: JP2020050234A
Authority: JP
Inventors: 豪二水戸; 匠宗片
Original assignee: Secom Co Ltd
Current assignee: Secom Co Ltd
Priority date: 2020-03-19
Filing date: 2020-03-19
Publication date: 2024-05-22
Anticipated expiration: 2040-03-19
Also published as: JP2021149686A

Description

本発明は、順次撮影した撮影画像に基づいて移動物体を追跡する技術に関する。 This invention relates to a technology for tracking a moving object based on sequentially captured images.

警備等の目的で、カメラによって順次撮影した撮影画像を解析して人等の移動物体を追跡することが行われ、追跡の処理に用いられる手法のひとつとしてパーティクルフィルタ（Particle Filter）が知られている。パーティクルフィルタによれば、追跡中の移動物体それぞれの移動先の候補（候補位置）を表す多数の仮説を設定し、撮影画像において各候補位置に当該移動物体の特徴が現れている度合いに基づいて当該移動物体の位置を決定する。 For security purposes, moving objects such as people are tracked by analyzing images captured sequentially by a camera, and a particle filter is known as one of the techniques used in tracking processing. With a particle filter, multiple hypotheses are set that represent candidate destinations (candidate positions) for each moving object being tracked, and the position of the moving object is determined based on the degree to which the characteristics of the moving object appear at each candidate position in the captured images.

例えば、特許文献１には、移動物体を構成するパーツごとに候補位置を設定する移動物体追跡装置が記載されている。この移動物体追跡装置においては、各移動物体に設定する候補位置の総数が予め定められており、各パーツに割り当てる候補位置の個数を当該パーツの過去追跡信頼度に応じて定めることによって、姿勢変動によるオクルージョンが原因で追跡困難となったパーツの仮説を他のパーツに割り当てて追跡に有効な仮説の減少を防止している。 For example, Patent Document 1 describes a moving object tracking device that sets candidate positions for each part that constitutes a moving object. In this moving object tracking device, the total number of candidate positions to be set for each moving object is determined in advance, and the number of candidate positions to be assigned to each part is determined according to the past tracking reliability of that part, thereby allocating hypotheses for parts that have become difficult to track due to occlusion caused by pose change to other parts, preventing a decrease in hypotheses that are effective for tracking.

特開２０１６－１７０６０３号公報JP 2016-170603 A

しかしながら、従来技術においては、１移動物体当たりの候補位置の個数が固定値であったため、イベント会場のように人が多く集まるエリアでの追跡に適用した場合、混雑度が高まると１撮影画像当たりの候補位置の個数が増加しすぎて追跡処理の大幅な遅延を生じさせる問題があった。 However, in conventional technology, the number of candidate positions per moving object was a fixed value. Therefore, when applied to tracking in crowded areas such as event venues, as the level of congestion increases, the number of candidate positions per captured image increases too much, resulting in significant delays in the tracking process.

また、遅延防止のために、１撮影画像当たりの候補位置の個数を定め、当該個数を追跡中の移動物体で等分することも考えられる。しかし、その場合、混雑度が高まると１移動物体当たりの候補位置の個数が少なくなり過ぎて追跡精度が低下する問題があった。 To prevent delays, it is also possible to determine the number of candidate positions per captured image and divide this number equally among the moving objects being tracked. However, in that case, as the level of congestion increases, the number of candidate positions per moving object becomes too small, which can lead to a problem of reduced tracking accuracy.

本発明は、上記問題を鑑みてなされたものであり、移動物体の混雑度が高くなった場合でも追跡処理の遅延と精度低下を抑制できる移動物体追跡技術を提供することを目的とする。 The present invention has been made in consideration of the above problems, and aims to provide a moving object tracking technology that can suppress delays in tracking processing and reductions in accuracy even when the degree of congestion of moving objects becomes high.

（１）本発明に係る移動物体追跡装置は、順次撮影した撮影画像に基づいて１又は複数の移動物体を追跡する移動物体追跡装置であって、前記撮影画像に撮影された前記移動物体の混雑度を推定する混雑度推定手段と、前記混雑度に応じて各移動物体の候補位置数を割り当てる候補位置数割当手段と、前記各移動物体に割り当てられた前記候補位置数と同じ数だけ、過去の前記移動物体の位置から現時刻における前記移動物体の候補位置を求める候補位置設定手段と、前記各移動物体における前記複数の候補位置に基づいて、現時刻における前記移動物体の物体位置を求める物体位置判定手段と、を備える。 (1) The moving object tracking device according to the present invention is a moving object tracking device that tracks one or more moving objects based on sequentially captured images, and includes a congestion degree estimation means that estimates the congestion degree of the moving objects captured in the captured images, a candidate position number allocation means that allocates a number of candidate positions for each moving object according to the congestion degree, a candidate position setting means that determines candidate positions of the moving object at the current time from past positions of the moving object, the same number as the number of candidate positions assigned to each moving object, and an object position determination means that determines the object position of the moving object at the current time based on the multiple candidate positions of each moving object.

（２）上記（１）に記載する本発明に係る移動物体追跡装置において、前記混雑度推定手段は、前記撮影画像を入力されると当該撮影画像内の任意の位置の前記混雑度を出力するよう予め学習した推定器に前記撮影画像を入力して前記撮影画像内の任意の位置の前記混雑度を推定し、前記候補位置数割当手段は、前記撮影画像上の当該移動物体の物体位置または候補位置の前記混雑度に応じて各移動物体の候補位置数を割り当てる。 (2) In the moving object tracking device according to the present invention described in (1) above, the congestion degree estimation means estimates the congestion degree of an arbitrary position in the captured image by inputting the captured image to an estimator that has been trained in advance to output the congestion degree of an arbitrary position in the captured image when the captured image is input, and the candidate position number allocation means allocates the number of candidate positions for each moving object according to the congestion degree of the object position or candidate positions of the moving object in the captured image.

（３）上記（１）または（２）に記載する本発明に係る移動物体追跡装置において、前記候補位置数割当手段は、前記混雑度が高くなるにつれて前記移動物体の候補位置数を多く割り当てる。 (3) In the moving object tracking device according to the present invention described in (1) or (2) above, the candidate position number allocation means allocates a larger number of candidate positions for the moving object as the congestion degree increases.

（４）上記（３）に記載する本発明に係る移動物体追跡装置において、前記候補位置数割当手段は、１時刻における全追跡対象の前記移動物体の前記候補位置の総和が総数上限値に収まるように各移動物体の候補位置数を圧縮する。 (4) In the moving object tracking device according to the present invention described in (3) above, the candidate position number allocation means compresses the number of candidate positions of each moving object so that the sum of the candidate positions of all moving objects to be tracked at one time falls within the upper limit of the total number.

（５）上記（１）～（４）に記載する本発明に係る移動物体追跡装置において、前記候補位置設定手段は、前記混雑度が小さいほど前記移動物体の前記候補位置を設定する範囲を広くする。 (5) In the moving object tracking device according to the present invention described in (1) to (4) above, the candidate position setting means sets the range of the candidate positions of the moving object to a wider range as the congestion degree becomes smaller.

（６）上記（１）に記載する本発明に係る移動物体追跡装置において、前記混雑度が一定以上の領域では、前記候補位置数割当手段、前記候補位置設定手段、前記物体位置判定手段での処理を行わず、順次撮影した前記撮影画像上における移動物体群の位置を記憶する。 (6) In the moving object tracking device according to the present invention described in (1) above, in an area where the degree of congestion is equal to or higher than a certain level, the candidate position number allocation means, the candidate position setting means, and the object position determination means do not perform processing, and the positions of the group of moving objects on the sequentially captured images are stored.

本発明によれば、移動物体の混雑度が高くなった場合でも追跡処理の遅延と精度低下を抑制できる。 According to the present invention, it is possible to suppress delays in tracking processing and deterioration in accuracy even when the degree of congestion of moving objects increases.

移動物体追跡装置の概略の構成を示すブロック図である。1 is a block diagram showing a schematic configuration of a moving object tracking device. 移動物体追跡装置の概略を示す機能ブロック図である。1 is a functional block diagram showing an outline of a moving object tracking device. 混雑度推定部による処理を示す模式図である。FIG. 13 is a schematic diagram showing a process performed by a congestion degree estimation unit. 混雑度に比例した候補位置数の関数を示す図である。FIG. 13 is a diagram showing a function of the number of candidate locations proportional to the degree of congestion. 実施形態に係る移動物体追跡装置の全体的な処理を示す概略のフロー図である。1 is a schematic flow diagram showing the overall processing of a moving object tracking device according to an embodiment. 変形例における混雑度に応じた候補位置数の関数を示す図である。FIG. 13 is a diagram illustrating a function of the number of candidate locations according to the degree of congestion in a modified example.

以下、本発明の実施の形態（以下実施形態という）について、図面に基づいて説明する。本実施形態は、撮影部と表示部とがコンピュータに接続された移動物体追跡装置であり、撮影部により順次撮影された撮影画像をコンピュータが解析して移動物体である人の追跡を行う。 The following describes an embodiment of the present invention (hereinafter referred to as "embodiment") with reference to the drawings. In this embodiment, a moving object tracking device has a camera unit and a display unit connected to a computer, and the computer analyzes images captured sequentially by the camera unit to track a person, which is a moving object.

本実施形態にて一例として示す移動物体追跡装置１は、移動物体ごとに、過去の位置から現時刻における位置の候補を求め、現時刻の撮影画像において候補の位置に当該移動物体の画像特徴が現れている度合いに基づいて現時刻の位置を求めるものである。以下、移動物体の位置の候補を候補位置と称する。候補位置は１時刻、１移動物体当たり複数設定されるものとし、その個数を候補位置数と称する。また、移動物体の位置を物体位置と称する。物体位置は１時刻、１移動物体当たり１つ設定されるものとする。すなわち、候補位置は物体位置の候補であり、複数の候補位置を統合して１つの物体位置が決定される。なお移動物体追跡装置１は混雑度に応じて各移動物体の候補位置数を割り当てることを特徴としている。 The moving object tracking device 1 shown as an example in this embodiment determines, for each moving object, a candidate position at the current time from the past position, and determines the current position based on the degree to which the image features of the moving object appear at the candidate position in the captured image at the current time. Hereinafter, the candidate positions of the moving object are referred to as candidate positions. It is assumed that multiple candidate positions are set per moving object at one time, and the number of candidate positions is referred to as the number of candidate positions. In addition, the position of the moving object is referred to as the object position. It is assumed that one object position is set per moving object at one time. In other words, the candidate positions are candidates for the object position, and one object position is determined by integrating multiple candidate positions. It should be noted that the moving object tracking device 1 is characterized by assigning the number of candidate positions to each moving object according to the degree of congestion.

図１は移動物体追跡装置１の概略の構成を示すブロック図である。移動物体追跡装置１は撮影部２、通信部３、記憶部４、画像処理部５および表示部６からなる。 Figure 1 is a block diagram showing the general configuration of a moving object tracking device 1. The moving object tracking device 1 is composed of a photographing unit 2, a communication unit 3, a storage unit 4, an image processing unit 5, and a display unit 6.

撮影部２はいわゆる監視カメラである。撮影部２は通信部３を介して画像処理部５と接続され、監視空間を所定の時間間隔で撮影して画像を生成し、生成した画像を順次、画像処理部５に入力する。例えば、撮影部２は、監視空間であるイベント会場に立てられたポールに当該監視空間に存在する人を俯瞰する固定視野を有して設置され、監視空間を時間間隔１／５秒で撮影してカラー画像を生成する。なお、撮影部２はカラー画像の代わりにモノクロ画像を生成してもよい。 The photographing unit 2 is a so-called surveillance camera. The photographing unit 2 is connected to the image processing unit 5 via the communication unit 3, photographs the monitored space at a predetermined time interval to generate images, and sequentially inputs the generated images to the image processing unit 5. For example, the photographing unit 2 is installed on a pole erected in an event venue, which is the monitored space, with a fixed field of view that overlooks people present in the monitored space, photographs the monitored space at time intervals of 1/5 seconds, and generates color images. Note that the photographing unit 2 may generate monochrome images instead of color images.

通信部３は通信回路であり、その一端が画像処理部５に接続され、他端が撮影部２および表示部６と接続される。通信部３は撮影部２から画像を取得して画像処理部５に入力する。また、通信部３は画像処理部５から移動物体の追跡結果を表示部６へ出力する。 The communication unit 3 is a communication circuit, one end of which is connected to the image processing unit 5 and the other end of which is connected to the photographing unit 2 and the display unit 6. The communication unit 3 acquires images from the photographing unit 2 and inputs them to the image processing unit 5. The communication unit 3 also outputs the tracking results of the moving object from the image processing unit 5 to the display unit 6.

なお、撮影部２、通信部３、記憶部４、画像処理部５および表示部６の間は各部の設置場所に応じた形態で適宜接続される。例えば、撮影部２と通信部３および画像処理部５とが遠隔に設置される場合、撮影部２と通信部３との間をインターネット回線にて接続することができる。また、通信部３と画像処理部５との間はバスで接続する構成とすることができる。その他、接続手段として、ＬＡＮ（Local Area Network）、各種ケーブルなどを用いることができる。 The imaging unit 2, communication unit 3, storage unit 4, image processing unit 5, and display unit 6 are appropriately connected in a manner appropriate to the installation location of each unit. For example, if the imaging unit 2, communication unit 3, and image processing unit 5 are installed remotely, the imaging unit 2 and communication unit 3 can be connected via an Internet line. Also, the communication unit 3 and image processing unit 5 can be connected via a bus. Other connection means that can be used include a LAN (Local Area Network), various cables, etc.

記憶部４は、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）等のメモリ装置であり、各種プログラムや各種データを記憶する。例えば、記憶部４は学習用のデータや、学習済みモデルである推定器の情報を記憶し、画像処理部５との間でこれらの情報を入出力する。すなわち、推定器の学習に用いる情報や当該処理の過程で生じた情報などが記憶部４と画像処理部５との間で入出力される。 The storage unit 4 is a memory device such as a ROM (Read Only Memory) or a RAM (Random Access Memory), and stores various programs and various data. For example, the storage unit 4 stores data for learning and information on an estimator, which is a trained model, and inputs and outputs this information between the image processing unit 5. In other words, information used to train the estimator and information generated during the process are input and output between the storage unit 4 and the image processing unit 5.

画像処理部５は、ＣＰＵ（Central Processing Unit）、ＤＳＰ（Digital Signal Processor）、ＭＣＵ（Micro Control Unit）、ＧＰＵ(Graphics Processing Unit)等の演算装置で構成される。画像処理部５は記憶部４からプログラムを読み出して実行することにより各種の処理手段・制御手段として動作し、必要に応じて、各種データを記憶部４から読み出し、生成したデータを記憶部４に記憶させる。例えば、画像処理部５は推定器を学習し生成すると共に、生成した推定器を通信部３経由で記憶部４に記憶させる。 The image processing unit 5 is composed of arithmetic devices such as a CPU (Central Processing Unit), a DSP (Digital Signal Processor), an MCU (Micro Control Unit), and a GPU (Graphics Processing Unit). The image processing unit 5 operates as various processing means and control means by reading and executing programs from the storage unit 4, reading various data from the storage unit 4 as necessary, and storing the generated data in the storage unit 4. For example, the image processing unit 5 learns and generates an estimator, and stores the generated estimator in the storage unit 4 via the communication unit 3.

表示部６は、液晶ディスプレイまたは有機ＥＬ（Electro-Luminescence）ディスプレイ等であり、通信部３を経由して画像処理部５から入力される移動物体の追跡結果を表示する。 The display unit 6 is a liquid crystal display or an organic EL (Electro-Luminescence) display, etc., and displays the tracking results of the moving object input from the image processing unit 5 via the communication unit 3.

図２は移動物体追跡装置１の概略の機能ブロック図であり、記憶部４が混雑度記憶手段４０、物体情報記憶手段４１として機能し、画像処理部５が混雑度推定手段５０、候補位置数割当手段５１、候補位置設定手段５２、物体位置判定手段５３として機能する。また、通信部３が画像処理部５と協働し、画像取得手段３０および追跡結果出力手段３１として機能する。 Figure 2 is a schematic functional block diagram of the moving object tracking device 1, in which the memory unit 4 functions as a congestion level storage means 40 and an object information storage means 41, and the image processing unit 5 functions as a congestion level estimation means 50, a candidate position number allocation means 51, a candidate position setting means 52, and an object position determination means 53. In addition, the communication unit 3 cooperates with the image processing unit 5 and functions as an image acquisition means 30 and a tracking result output means 31.

画像取得手段３０は撮影部２から順次撮影した撮影画像を取得して混雑度推定手段５０および物体位置判定手段５３に出力する。 The image acquisition means 30 acquires images captured sequentially from the image capture unit 2 and outputs them to the congestion degree estimation means 50 and the object position determination means 53.

混雑度推定手段５０は、撮影画像に撮影された移動物体の混雑度を推定する。混雑度推定手段５０は、例えば、１時刻前の物体位置の数を撮影画像全体または撮影画像を複数のブロックに分割したブロックごとに計数することによって現時刻の混雑度（の近似値）を推定してもよい。しかし、追跡結果から混雑度を算出し、その混雑度を追跡に用いた場合、追跡精度が下がると追跡結果に基づいた混雑度の推定精度も低下してしまい、追跡精度の低下が倍化してしまう。そのため、追跡や混雑度推定の誤差が倍化してしまわないよう、追跡とは独立した処理によって混雑度を推定するのが望ましい。 The congestion degree estimation means 50 estimates the congestion degree of moving objects captured in the captured image. The congestion degree estimation means 50 may estimate (an approximation of) the congestion degree at the current time, for example, by counting the number of object positions one time before in the entire captured image or for each block obtained by dividing the captured image into multiple blocks. However, if the congestion degree is calculated from the tracking result and used for tracking, a decrease in tracking accuracy will also decrease the congestion degree estimation accuracy based on the tracking result, doubling the decrease in tracking accuracy. For this reason, it is desirable to estimate the congestion degree by a process independent of tracking, so as to prevent the errors in tracking and congestion degree estimation from doubling.

すなわち好適には混雑度推定手段５０は過去の撮影画像に基づかずに現時刻の撮影画像から現時刻の撮影画像上の任意の位置に撮影された移動物体の混雑度を推定する。本実施形態においては、混雑度推定手段５０は、撮影画像を入力されると当該撮影画像内の任意の位置の混雑度を出力するよう予め学習した推定器に撮影画像を入力して撮影画像内の任意の位置の混雑度を推定する。具体的には、混雑度推定手段５０は、撮影画像を入力されると各画素の混雑度を推定した混雑度マップを出力するよう予め学習した推定器に、撮影画像を入力して当該撮影画像の混雑度マップを出力させ、得られた混雑度マップを混雑度記憶手段４０に記憶させる。 That is, preferably, the congestion degree estimation means 50 estimates the congestion degree of a moving object photographed at an arbitrary position on the photographed image at the current time from the photographed image at the current time, without being based on past photographed images. In this embodiment, the congestion degree estimation means 50 inputs the photographed image to an estimator that has been trained in advance to output the congestion degree of an arbitrary position in the photographed image when the photographed image is input, and estimates the congestion degree of an arbitrary position in the photographed image. Specifically, the congestion degree estimation means 50 inputs the photographed image to an estimator that has been trained in advance to output a congestion degree map that estimates the congestion degree of each pixel when the photographed image is input, and causes the estimator to output a congestion degree map of the photographed image, and stores the obtained congestion degree map in the congestion degree storage means 40.

推定器は具体的にはディープラーニングの技術を用いて実現できる。すなわち推定器は撮影画像を入力されると当該撮影画像の混雑度マップを出力するＣＮＮ（畳み込みニューラルネット―ワーク；convolutional neural network）でモデル化することができる。学習のために、例えば、群衆が撮影された大量の学習用画像と、学習用画像それぞれにおける各人の頭部の重心位置を平均値とし当該頭部のサイズに応じた分散を有する確率密度関数を設定して頭部ごとの当該関数の値を画素ごとに加算した混雑度マップとが用意される。そして、モデルに学習用画像それぞれを入力したときの出力を当該画像に対応する混雑度マップに近づける学習が事前に行われる。こうして得られた学習済みモデルを混雑度推定手段５０のプログラムの一部をなす推定器として記憶部４に記憶させておく。例えば、“Single image crowd counting via multi-column convolutional neural network”, Zhang, Y. ,Zhou他, CVPR 2016に記載されているＭＣＮＮ（multi-column convolutional neural network）は推定器の一例であり、当該論文に記載されている群衆密度マップ（crowd density map）は混雑度マップの一例である。 Specifically, the estimator can be realized using deep learning technology. That is, the estimator can be modeled as a CNN (convolutional neural network) that receives a photographed image and outputs a congestion map of the photographed image. For learning, for example, a large number of learning images of a crowd are prepared, and a congestion map is prepared in which a probability density function having an average value of the center of gravity of each person's head in each learning image and a variance according to the size of the head is set and the value of the function for each head is added for each pixel. Then, learning is performed in advance to make the output when each learning image is input into the model closer to the congestion map corresponding to the image. The learned model obtained in this way is stored in the memory unit 4 as an estimator that is part of the program of the congestion degree estimation means 50. For example, the MCNN (multi-column convolutional neural network) described in "Single image crowd counting via multi-column convolutional neural network", Zhang, Y., Zhou et al., CVPR 2016, is an example of an estimator, and the crowd density map described in the same paper is an example of a congestion map.

また、混雑度推定手段５０は、混雑度が閾値Ｔ３以上である超高混雑領域を検出して、超高混雑領域の情報を混雑度記憶手段４０に記憶させる。超高混雑領域は俯角が小さな位置（横から撮影したようになる位置）に移動物体が密集した場合に生じる。例えば移動物体が人であれば超高混雑領域では頭と肩付近しか写らなくなる上に頭同士のオクルージョンが生じて追跡困難となる。閾値Ｔ３は事前の実験を通じて予め定めておく。 The congestion degree estimation means 50 also detects ultra-high congestion areas where the congestion degree is equal to or greater than the threshold T3, and stores information about the ultra-high congestion areas in the congestion degree storage means 40. Ultra-high congestion areas occur when moving objects are crowded together in a position with a small depression angle (a position that appears as if the object is photographed from the side). For example, if the moving object is a person, only the head and shoulders will be captured in the ultra-high congestion area, and occlusion of the heads will occur, making tracking difficult. The threshold T3 is determined in advance through prior experiments.

混雑度記憶手段４０は、撮影画像に対して推定された混雑度を記憶する。具体的には最新所定数の混雑度マップを循環記憶する。図３は混雑度推定手段５０の処理を示す模式図である。図３（ａ）の撮影画像に対し、出力された混雑度マップが図３（ｂ）である。また、図３（ｃ）は図３（ａ）の直線５１０上の混雑度を示したものである。混雑している場所ほど高い値が出力され、人物５０１，５０２，５０３の位置における混雑度はそれぞれ０．２，１．２，２．８と推定される。 The congestion degree storage means 40 stores the congestion degree estimated for the captured image. Specifically, it cyclically stores a predetermined number of the most recent congestion degree maps. Figure 3 is a schematic diagram showing the processing of the congestion degree estimation means 50. Figure 3(b) shows the congestion degree map output for the captured image in Figure 3(a). Figure 3(c) shows the congestion degree on line 510 in Figure 3(a). The more crowded the location, the higher the value output, and the congestion degrees at the positions of people 501, 502, and 503 are estimated to be 0.2, 1.2, and 2.8, respectively.

物体情報記憶手段４１は、追跡中の移動物体の情報（物体情報）を記憶する。具体的には、追跡中の移動物体それぞれを識別する物体ＩＤと対応付けて、当該移動物体のテンプレート、当該移動物体の物体位置、当該移動物体の仮説を記憶する。また、物体情報記憶手段４１は、移動物体の形状モデルを記憶する。 The object information storage means 41 stores information (object information) about moving objects being tracked. Specifically, it stores a template of the moving object, an object position of the moving object, and a hypothesis for the moving object in association with an object ID that identifies each moving object being tracked. The object information storage means 41 also stores a shape model of the moving object.

移動物体のテンプレートの一例は撮影画像から該当する人物の領域を切り出したテンプレート画像である。テンプレート画像に代えて、又はそれと併せて、テンプレート画像から算出される色ヒストグラムや輝度勾配等の特徴量などとしてもよい。移動物体の仮説は、物体ＩＤ、仮説ＩＤ、候補位置及び尤度などを対応付けた情報であり、移動物体と候補位置の組み合わせの数だけ生成される。なお、移動物体の物体位置や候補位置は、撮影画像の水平方向、垂直方向をそれぞれｘ軸，ｙ軸とし、撮影画像における位置をｘ座標，ｙ座標の組（ｘ，ｙ）で表す。移動物体の形状モデルの一例は立位の人を近似した楕円の図形データである。楕円に代えて矩形、又は頭部・胴部・脚部をそれぞれ近似した楕円を３つ連結した図形としてもよい。好適には形状モデルは撮影画像内の各位置の人の像の大きさの違いに対応させ、基準の図形データおよび位置と図形データの大きさを対応付けたデータで構成する。 An example of a template for a moving object is a template image obtained by cutting out the area of a person from a photographed image. Instead of or in addition to the template image, features such as a color histogram or brightness gradient calculated from the template image may be used. A hypothesis for a moving object is information that associates an object ID, a hypothesis ID, a candidate position, and a likelihood, and is generated for each combination of moving objects and candidate positions. The object position and candidate positions of a moving object are represented by the x-axis and y-axis in the horizontal and vertical directions of the photographed image, respectively, and the position in the photographed image is represented by a pair of x- and y-coordinates (x, y). An example of a shape model for a moving object is graphic data of an ellipse that approximates a person in a standing position. Instead of an ellipse, a rectangle or a figure of three connected ellipses that approximate the head, torso, and legs may be used. Preferably, the shape model corresponds to the difference in size of the image of a person at each position in the photographed image, and is composed of reference graphic data and data that associates the position with the size of the graphic data.

候補位置数割当手段５１は混雑度に応じて各移動物体に候補位置数を割り当てる。その一例として、候補位置数割当手段５１が、混雑度が高くなるにつれて移動物体の候補位置数を多く割り当てる。 The candidate position number allocation means 51 allocates a number of candidate positions to each moving object according to the degree of congestion. As an example, the candidate position number allocation means 51 allocates a larger number of candidate positions to a moving object as the degree of congestion increases.

基本的には、候補位置数を増やせば、真の物体位置からのずれ（位置ずれ）の小さな候補位置が含まれる可能性が高まるため移動物体間の比較を厳格化でき、追跡精度を高めることができる。例えば真の物体位置からずれたテンプレートマッチングは背景の特徴量との比較が混ざるため移動物体の特徴量同士の厳格な比較とならない。 Essentially, increasing the number of candidate positions increases the likelihood that candidate positions with small deviations (positional shifts) from the true object position will be included, allowing for stricter comparisons between moving objects and improved tracking accuracy. For example, template matching that is shifted from the true object position involves comparisons with background features, so it does not result in a strict comparison between the features of moving objects.

しかし、候補位置数を増やすほど処理負荷は増加する。そして、混雑度が高まるほど移動物体間の距離は近くなる上に近接移動物体の数も増えるため位置ずれによる追跡精度低下が生じやすくなる。他方、混雑度が低くなるほど移動物体間の距離は遠くなり近接移動物体の数は減るため位置ずれによる追跡精度低下は生じにくくなる。 However, the more candidate positions there are, the greater the processing load becomes. Furthermore, as the degree of congestion increases, the distance between moving objects becomes closer and the number of nearby moving objects also increases, making it easier for tracking accuracy to decrease due to position shifts. On the other hand, as the degree of congestion decreases, the distance between moving objects becomes farther and the number of nearby moving objects decreases, making it less likely for tracking accuracy to decrease due to position shifts.

よって、混雑度が高まるほど各移動物体の候補位置数を増やすことで混雑による追跡精度低下を防止でき、混雑度が低い場合は各移動物体の候補位置数を少なく抑えることで処理負荷を抑えることができるようになる。 Therefore, by increasing the number of candidate positions for each moving object as the degree of congestion increases, it is possible to prevent a decrease in tracking accuracy due to congestion, and by keeping the number of candidate positions for each moving object low when the degree of congestion is low, it is possible to reduce the processing load.

このようにすることで、例えば、撮影画像全体で一つの混雑度を推定する場合、混雑度が高い期間に処理の遅延が生じてもその後の混雑度が低下した期間に短時間で遅延を解消できる。また、例えば、複数の撮影部で撮影した撮影画像ごとに一つの混雑度を推定する場合、撮影画像間での移動物体の分布の偏りを利用して、混雑による追跡精度低下を防止しつつ処理負荷を抑えることができる。 In this way, for example, when estimating one congestion level for the entire captured image, even if a processing delay occurs during a period when congestion is high, the delay can be eliminated in a short time during the subsequent period when congestion levels decrease. Also, for example, when estimating one congestion level for each captured image captured by multiple image capture units, the bias in the distribution of moving objects among the captured images can be utilized to reduce the processing load while preventing a decrease in tracking accuracy due to congestion.

また、候補位置数割当手段５１が、撮影画像における移動物体それぞれの物体位置の混雑度に応じて当該移動物体の候補位置数を割り当てる構成としてもよい。つまり、物体位置における混雑度が高いと予測される移動物体については候補位置数を増やし、物体位置における混雑度が低いと予測される移動物体の候補位置数を少なく抑える。こうすることで、撮影画像内における移動物体の分布の偏りを利用して、混雑による追跡精度低下を防止しつつ処理負荷を抑えることができる。 The candidate position number allocation means 51 may also be configured to allocate the number of candidate positions for a moving object according to the degree of congestion at the object position of each moving object in the captured image. In other words, the number of candidate positions is increased for moving objects whose object positions are predicted to be highly crowded, and the number of candidate positions for moving objects whose object positions are predicted to be low crowded. In this way, the bias in the distribution of moving objects in the captured image can be utilized to prevent a decrease in tracking accuracy due to congestion while reducing the processing load.

具体的には、候補位置数割当手段５１は、混雑度記憶手段４０に記憶されている混雑度マップと物体情報記憶手段４１に記憶されている各移動物体の物体位置および移動物体の数とを参照して、各移動物体の候補位置数を決定し、決定した候補位置数を候補位置設定手段５２に出力する。 Specifically, the candidate position number allocation means 51 determines the number of candidate positions for each moving object by referring to the congestion map stored in the congestion degree storage means 40 and the object positions and number of moving objects for each moving object stored in the object information storage means 41, and outputs the determined number of candidate positions to the candidate position setting means 52.

候補位置数割当手段５１による移動物体それぞれに割り当てる候補位置数の算出方法の具体例を示す。図４に示す関数（ａ）～（ｄ）は、混雑度に応じた候補位置数を算出する関数であり、これに基づき混雑度から候補位置数を算出する。 A specific example of a method for calculating the number of candidate positions to be assigned to each moving object by the candidate position number allocation means 51 is shown below. Functions (a) to (d) shown in FIG. 4 are functions for calculating the number of candidate positions according to the congestion level, and the number of candidate positions is calculated from the congestion level based on these functions.

（ｄ）は最も単純な候補位置数を算出する関数であり、混雑度に比例して候補位置数が多くなる。これに対して（ｃ）では混雑度が閾値Ｔ１未満の場合は候補位置数は一定であり混雑度が閾値Ｔ１以上である場合は混雑度に比例して候補位置数が多くなる。混雑度が閾値Ｔ１未満の場合の候補位置数は１移動物体に割り当てる候補位置数の最小値であり事前の実験を通じて設定される。最小値を設定することで混雑度推定の誤差の影響で実際には移動物体が存在する位置の混雑度が低い値となり候補位置数が極端に少なく設定されてしまうことを防ぐことができる。一方で、追跡人物が移動可能な範囲は、ある程度限られており、候補位置数を必要以上に増やしても精度の向上は期待できない。このため（ｂ）では候補位置数が一定数（最大値）に到達する閾値Ｔ２以上の混雑度である場合は混雑度がさらに上昇したとしても候補位置数を最大値で一定としている。（ａ）は候補位置数に最小値と最大値の両方が設定されている関数である。また、図４中のＴ３は超高混雑領域を弁別するための閾値である。 (d) is the simplest function for calculating the number of candidate positions, and the number of candidate positions increases in proportion to the degree of congestion. In contrast, in (c), when the degree of congestion is less than the threshold T1, the number of candidate positions is constant, and when the degree of congestion is equal to or greater than the threshold T1, the number of candidate positions increases in proportion to the degree of congestion. When the degree of congestion is less than the threshold T1, the number of candidate positions is the minimum value of the number of candidate positions to be assigned to one moving object, and is set through prior experiments. By setting the minimum value, it is possible to prevent the degree of congestion at the position where the moving object actually exists from being low due to the influence of an error in the degree of congestion estimation, and the number of candidate positions being set to an extremely small number. On the other hand, the range in which a person to be tracked can move is limited to a certain extent, and even if the number of candidate positions is increased more than necessary, an improvement in accuracy cannot be expected. For this reason, in (b), when the degree of congestion is equal to or greater than the threshold T2 at which the number of candidate positions reaches a certain number (maximum value), the number of candidate positions is fixed at the maximum value even if the degree of congestion further increases. (a) is a function in which both the minimum and maximum values are set for the number of candidate positions. Also, T3 in FIG. 4 is a threshold value for discriminating ultra-high congestion areas.

［総数上限値による候補位置数の圧縮］
上記関数で追跡人物に対する候補位置数を定めた場合、高混雑領域に存在する人物の数が増えてくると、全追跡人物の候補位置数の合計が大きくなり処理に遅延が発生する。 [Compression of the number of candidate positions by the total upper limit]
When the number of candidate positions for a person to be tracked is determined by the above function, as the number of people existing in a highly congested area increases, the total number of candidate positions for all people to be tracked increases, causing a delay in processing.

例えば監視空間が交差点の信号待ち領域である場合、混雑度が信号の周期によって定期的に変動するため、一時的に高混雑状況に伴う遅延が発生しても、混雑解消以降に遅延を復旧することが可能である。一方で、イベント会場の入退場口などでは、混雑が長時間継続する可能性がある。このような監視空間では、遅延が発生すると復旧するのが困難であるため、あらかじめ候補位置数に総数上限値を設け遅延を発生させないことが望ましい。 For example, if the monitored space is an area where traffic is waiting for the light to turn at an intersection, the degree of congestion fluctuates periodically depending on the signal cycle, so even if delays occur temporarily due to high congestion, it is possible to recover from the delay once the congestion is resolved. On the other hand, congestion may continue for long periods at places such as the entrances and exits of an event venue. In such monitored spaces, recovery is difficult once a delay occurs, so it is desirable to set an upper limit on the total number of candidate locations in advance to prevent delays from occurring.

具体的には各人物に対し上記処理において１時刻における全追跡対象の移動物体の候補位置の総和が総数上限値に収まるように一度求めた各移動物体の候補位置数を圧縮する。ある追跡人物ｉに対する候補位置数は、混雑度ｄ_iと関数ｆ（ｘ）から求まる圧縮前の候補位置数をｆ（ｄ_i）とすると以下の（１）式とすることができる。なお、総数上限値は、１時刻に割り当てる候補位置の上限数である。換言すると候補位置数の総和の上限値である。総数上限値は、予め移動物体追跡装置の性能等に応じて処理の遅延が生じない範囲の値に定める。 Specifically, the number of candidate positions of each moving object once calculated for each person in the above process is compressed so that the sum of candidate positions of all moving objects to be tracked at one time falls within the total upper limit value. The number of candidate positions for a certain tracking person i can be expressed by the following formula (1), where f(d _i ) is the number of candidate positions before compression calculated from the congestion degree d _i and function f(x). The total upper limit value is the upper limit number of candidate positions to be assigned at one time. In other words, it is the upper limit value of the sum of the number of candidate positions. The total upper limit value is set in advance to a value within a range that does not cause processing delays depending on the performance of the moving object tracking device, etc.

ここで、ｍｉｎ（１．０，総数上限値／Σ_jｆ（ｄ_j））は候補位置数の圧縮率であり、１．０を最大値とすることで、総候補位置数Σ_jｆ（ｄ_j）が総数上限値に到達していない場合は、候補位置数を維持している。これは、圧縮前の候補位置数が必要十分数であるということを前提にしているためである。このように総数上限値を設けることで、総数上限値を超えない範囲で混雑度に応じた候補位置数の割り当てが可能となるため、混雑が長時間継続する監視空間であっても追跡精度低下を防止しつつ遅延の発生を防止できる。 Here, min(1.0,total upper limit value/ _Σjf ( _dj )) is the compression rate of the number of candidate locations, and by setting 1.0 as the maximum value, the number of candidate locations is maintained if the total number of candidate locations _Σjf ( _dj ) has not reached the total upper limit value. This is because it is assumed that the number of candidate locations before compression is a necessary and sufficient number. By setting a total upper limit value in this way, it becomes possible to allocate the number of candidate locations according to the degree of congestion without exceeding the total upper limit value, so that it is possible to prevent a decrease in tracking accuracy and the occurrence of delays even in a monitored space where congestion continues for long periods of time.

［各追跡人物に対する候補位置数の下限数］
総数上限値を用いて候補位置数を圧縮する場合、圧縮された候補位置数が著しく少ない数となり、極端に低混雑領域での追跡性能が低下する可能性がある。これを防ぐため、追跡人物に対する候補位置数の下限数を設定してもよい。（２）式は各移動物体に下限数を割り当てたうえで、総数上限値から割り当て済みの候補位置数を減じ残余数とし、各移動物体の圧縮前の候補位置数から下限数を除いた数の比に応じて分配する。なお、下限数は、１移動物体に割り当てる候補位置の下限数である。下限数は、予めの実験を通じ、混雑度が低い状態で目標値以上の追跡精度が得られる最小の割当数に定める。 [Minimum number of candidate positions for each tracked person]
When compressing the number of candidate positions using the total number upper limit, the number of compressed candidate positions becomes significantly smaller, and there is a possibility that tracking performance in low-congestion areas will be extremely degraded. To prevent this, a lower limit may be set for the number of candidate positions for a person to be tracked. In formula (2), the lower limit is assigned to each moving object, and the number of assigned candidate positions is subtracted from the total number upper limit to obtain the remaining number, which is then distributed according to the ratio of the number of candidate positions of each moving object before compression minus the lower limit. The lower limit is the minimum number of candidate positions to be assigned to one moving object. The lower limit is set to the minimum number of assignments that can obtain tracking accuracy equal to or higher than the target value in a low-congestion area through prior experiments.

候補位置数割当手段５１は、各移動物体について、当該移動物体の物体位置から当該移動物体が存在し得る領域を求め、混雑度マップから当該領域の混雑度を取得して当該移動物体が存在し得る領域についての混雑度とする。本実施形態においては、候補位置数割当手段５１は、人物ごとに、物体情報記憶手段４１に記憶されている過去の物体位置に現時刻の物体位置を外挿し、外挿した現時刻の物体位置の混雑度を取得する。ただし、過去の物体位置が２つ以上存在しない人物については１時刻前の物体位置の混雑度を取得する。その場合は混雑度マップも１時刻前のものを用いることが望ましい。別の実施形態においては、候補位置数割当手段５１は、人物ごとに、外挿した現時刻の物体位置とその周囲の混雑度を読み取ってそれらの代表値を移動物体が存在し得る領域についての混雑度とする。代表値は平均値、最頻値または最大値とすることができる。ただし、過去の物体位置が２つ以上存在しない人物については１時刻前の物体位置とその周囲の混雑度の代表値とする。さらに、１時刻前の混雑度マップに１時刻前の候補位置を適用して混雑度を取得しても良い。その場合、１時刻前の混雑度を１時刻前の候補位置の尤度で重み付け平均して移動物体の混雑度を算出する。 The candidate position number allocation means 51 determines, for each moving object, an area in which the moving object may exist from the object position of the moving object, and obtains the congestion degree of the area from the congestion degree map to determine the congestion degree of the area in which the moving object may exist. In this embodiment, the candidate position number allocation means 51 extrapolates the object position at the current time to the past object positions stored in the object information storage means 41 for each person, and obtains the congestion degree of the extrapolated object position at the current time. However, for a person who does not have two or more past object positions, the congestion degree of the object position one time ago is obtained. In that case, it is desirable to use the congestion degree map from one time ago as well. In another embodiment, the candidate position number allocation means 51 reads the extrapolated object position at the current time and the congestion degree around it for each person, and determines the representative value of these as the congestion degree of the area in which the moving object may exist. The representative value can be the average value, the mode value, or the maximum value. However, for a person who does not have two or more past object positions, the representative value of the object position one time ago and the congestion degree around it is used. Furthermore, the congestion degree may be obtained by applying the candidate position one time ago to the congestion degree map one time ago. In this case, the congestion degree of the moving objects is calculated by weighting the congestion degree from one time point before with the likelihood of the candidate position from one time point before.

候補位置設定手段５２は、各移動物体に割り当てられた候補位置数と同じ数だけ、過去の移動物体の位置から現時刻における移動物体の候補位置を求める。すなわち、追跡中の各移動物体について、当該移動物体の過去の物体位置、または、過去の候補位置、または、過去の物体位置および候補位置から当該移動物体の現時刻の候補位置を当該移動物体に割り当てられた候補位置数と同数だけ設定し、当該移動物体の各候補位置を含んだ仮説を設定する。 The candidate position setting means 52 determines candidate positions of the moving object at the current time from the past positions of the moving object, the same number as the number of candidate positions assigned to each moving object. That is, for each moving object being tracked, the means sets candidate positions of the moving object at the current time from the moving object's past object positions, or past candidate positions, or past object positions and candidate positions, the same number as the number of candidate positions assigned to the moving object, and sets hypotheses including each candidate position of the moving object.

具体的には、候補位置数割当手段５１から入力された候補位置数と物体情報記憶手段４１に記憶されている各移動物体の過去の物体位置または／および過去の仮説を必要に応じて参照して、各移動物体に対する現時刻の仮説を設定し、設定した仮説を物体情報記憶手段４１に記憶させる。さらに、混雑度記憶手段４０から混雑度マップを入力して、それを参照してもよい。 Specifically, the number of candidate positions input from the candidate position number allocation means 51 and the past object positions and/or past hypotheses of each moving object stored in the object information storage means 41 are referenced as necessary to set a hypothesis for each moving object at the current time, and the set hypothesis is stored in the object information storage means 41. Furthermore, a congestion degree map may be input from the congestion degree storage means 40 and referenced.

候補位置設定手段５２は、移動物体のそれぞれについて、現時刻において移動物体の存在が予測される候補位置を設定する。現時刻ｔにおける第ｉ番の候補位置Ｐ_i,tは過去の物体位置又は過去の候補位置に基づき、次の３つの方法のいずれかにより算出することができる。 The candidate position setting means 52 sets, for each moving object, a candidate position where the moving object is predicted to exist at the current time. The i-th candidate position P _i,t at the current time t can be calculated based on the past object position or the past candidate position by any of the following three methods:

〔第１方法〕
一時刻前の物体位置Ｑ_t-1を中心に候補位置Ｐ_i,tを分布させる方法である。候補位置設定手段５２は、移動物体ごとに、乱数に基づいて拡散量Δ０を設定しては物体位置Ｑ_t-1に拡散量Δ０を加算して候補位置Ｐ_i,tとする処理（Ｐ_i,t＝Ｑ_t-1＋Δ０）を候補位置数と同じ回数だけ行う。 [First Method]
This method distributes candidate positions P _i,t around the object position Q _t-1 from one time before. The candidate position setting means 52 sets a diffusion amount Δ0 for each moving object based on a random number and adds the diffusion amount Δ0 to the object position Q _t-1 to set the candidate position P _i,t (P _i,t = Q _t-1 + Δ0) the same number of times as the number of candidate positions.

〔第２方法〕
現在の物体位置を予測した予測値ｑ_tを中心に候補位置Ｐ_i,tを分布させる方法である。候補位置設定手段５２は、移動物体ごとに、まず過去の複数の物体位置に等速運動モデル等の運動モデル又はカルマンフィルタ等の予測フィルタを適用して予測値ｑ_tを予測し、次に乱数に基づいて拡散量Δ１を設定しては予測値ｑ_tに拡散量Δ１を加算して候補位置Ｐ_i,tとする処理（Ｐ_i,t＝ｑ_t＋Δ１）を候補位置数と同じ回数だけ行う。 [Second Method]
This method distributes candidate positions P _i,t around a predicted value _qt that predicts the current object position. The candidate position setting means 52 first predicts the predicted value _qt for each moving object by applying a motion model such as a constant velocity motion model or a prediction filter such as a Kalman filter to a plurality of past object positions, and then sets a diffusion amount Δ1 based on a random number and adds the diffusion amount Δ1 to the predicted value _qt to obtain the candidate position P _i,t (P _i,t = _qt + Δ1), performing this process the same number of times as the number of candidate positions.

〔第３方法〕
過去の候補位置から現在の候補位置を予測する方法である。まず、候補位置設定手段５２は、移動物体ごとに、候補位置数と同数の仮説を、重複を許容して１時刻前の仮説の中から乱数に基づいて選出する。１時刻前の各仮説の当選確率は、例えば、当該移動物体の１時刻前の仮説の尤度の総和に対する当該仮説の尤度の比とする。次に、候補位置設定手段５２は、移動物体ごとに、選出した各仮説について、当該仮説および当該仮説に対応する過去の複数の仮説の候補位置に上記運動モデル又は上記予測フィルタを適用して予測値ｑ_i,tを算出するとともに乱数に基づいて拡散量Δ２を設定し、予測値ｑ_i,tに拡散量Δ２を加算して候補位置Ｐ_i,tとする（Ｐ_i,t＝ｑ_i,t＋Δ２）。 [Third Method]
This is a method for predicting a current candidate position from a past candidate position. First, the candidate position setting means 52 selects, for each moving object, the same number of hypotheses as the number of candidate positions from the hypotheses one time ago, allowing overlapping, based on a random number. The winning probability of each hypothesis one time ago is, for example, the ratio of the likelihood of the hypothesis to the sum of the likelihoods of the hypotheses one time ago for the moving object. Next, for each selected hypothesis, the candidate position setting means 52 applies the above-mentioned motion model or the above-mentioned prediction filter to the candidate positions of the hypothesis and multiple past hypotheses corresponding to the hypothesis to calculate a predicted value q _i,t for each moving object, and sets a diffusion amount Δ2 based on a random number, and adds the diffusion amount Δ2 to the predicted value q _i, t to obtain a candidate position P _i,t (P _i,t = q _i,t + Δ2).

本実施形態においては、速度が算出可能になるまでは第１方法（物体位置判定手段５３が新規移動物体として登録した時刻を含め２時刻～５時刻）で候補位置が算出され、速度が算出可能になって以降は第３方法で候補位置が算出される。 In this embodiment, until the speed can be calculated, the candidate positions are calculated using the first method (times 2 to 5, including the time when the object position determination means 53 registers the object as a new moving object), and once the speed can be calculated, the candidate positions are calculated using the third method.

また、混雑度が小さいほど移動物体の候補位置を設定する範囲を広くしてもよい。すなわち、候補位置設定手段５２は、上記第１方法から第３方法に記載された拡散量Δ０～Δ２を混雑度が小さいほど大きな量とする。例えば、拡散量Δ０～Δ２は、混雑度の高い領域にいる人物に対しては０ｃｍ～６０ｃｍの範囲に設定され、混雑度の低い領域にいる人物に対しては、３０ｃｍ～１００ｃｍの範囲に設定される。このようにする理由は以下の通りである。すなわち混雑度が小さいと移動物体が動き得る範囲が広くなるため候補位置を広い範囲に設定しなければ対応づけのし損ねが生じやすくなる。そこで候補位置を設定する範囲を混雑度に基づいて制御することで混雑度が小さい場合の対応付けのし損ねを防止できる。特に、追跡とは独立して推定した混雑度で追跡のパラメータを制御できるため、信頼性の高い制御が可能となる。 The range in which the candidate positions of the moving object are set may be wider as the degree of congestion becomes smaller. That is, the candidate position setting means 52 sets the diffusion amount Δ0 to Δ2 described in the first to third methods to a larger amount as the degree of congestion becomes smaller. For example, the diffusion amount Δ0 to Δ2 is set to a range of 0 cm to 60 cm for a person in an area with a high degree of congestion, and is set to a range of 30 cm to 100 cm for a person in an area with a low degree of congestion. The reason for this is as follows. That is, when the degree of congestion is small, the range in which the moving object can move becomes wide, so if the candidate positions are not set to a wide range, matching failures are likely to occur. Therefore, by controlling the range in which the candidate positions are set based on the degree of congestion, it is possible to prevent matching failures when the degree of congestion is small. In particular, since the tracking parameters can be controlled by the degree of congestion estimated independently of tracking, highly reliable control is possible.

物体位置判定手段５３は、各移動物体における複数の候補位置に基づいて、現時刻における移動物体の物体位置を求める。すなわち、追跡中の各移動物体について、当該移動物体に対して現時刻に設定された複数の仮説それぞれが示す候補位置に対応する現時刻の撮影画像上の位置に当該移動物体の特徴量が現れている度合いを仮説の尤度として算出し、複数の仮説が示す候補位置を当該仮説の尤度の高さに応じて統合することによって当該移動物体の現時刻における物体位置を算出する。ただし、撮影画像外に位置している仮説および超高混雑度領域に位置している仮説については、尤度の信頼性が低くなるため予め算出対象から除外する。 The object position determination means 53 determines the object position of each moving object at the current time based on multiple candidate positions for each moving object. That is, for each moving object being tracked, the degree to which the feature of the moving object appears at a position on the captured image at the current time corresponding to the candidate position indicated by each of multiple hypotheses set for the moving object at the current time is calculated as the likelihood of the hypothesis, and the candidate positions indicated by the multiple hypotheses are integrated according to the likelihood of the hypothesis to calculate the object position of the moving object at the current time. However, hypotheses that are located outside the captured image and hypotheses that are located in an extremely crowded area are excluded from the calculation in advance because the reliability of the likelihood is low.

具体的には、物体位置判定手段５３は、物体情報記憶手段４１に記憶されている各移動物体の現時刻の仮説と形状モデルとテンプレートとを読み出し、各移動物体について、当該移動物体の仮説が示す候補位置に形状モデルを配置して形状モデルと重なる領域の部分画像を撮影画像から抽出し、部分画像の画像特徴と当該移動物体のテンプレートの画像特徴との類似度を算出し、類似度に応じた尤度を算出する。また、物体位置判定手段５３は物体情報記憶手段４１に尤度を仮説に対応付けて追加記憶させる。 Specifically, the object position determination means 53 reads out the current hypothesis, shape model, and template for each moving object stored in the object information storage means 41, and for each moving object, places the shape model at the candidate position indicated by the hypothesis for that moving object, extracts a partial image of the area overlapping with the shape model from the captured image, calculates the similarity between the image features of the partial image and the image features of the template for that moving object, and calculates a likelihood according to the similarity. The object position determination means 53 also causes the object information storage means 41 to additionally store the likelihood in association with the hypothesis.

その際、物体位置判定手段５３は、さらに、背景画像と撮影画像との差分処理を行って背景差分領域を検出し、各移動物体について、当該移動物体の部分画像において背景差分領域が占める割合に応じた補正値で尤度を補正してもよい。背景画像は後述する新規移動物体の検出処理のために記憶部４に記憶されている。 At this time, the object position determination means 53 may further perform a difference process between the background image and the captured image to detect a background difference region, and correct the likelihood for each moving object with a correction value according to the proportion of the background difference region in the partial image of the moving object. The background image is stored in the storage unit 4 for the new moving object detection process described later.

物体位置判定手段５３は、移動物体ごとに、当該移動物体の仮説が示す候補位置を当該仮説の尤度に基づいて統合して当該移動物体の物体位置を算出する。好適には、当該移動物体の仮説が示す候補位置を当該仮説の尤度で重みづけて平均した重み付け平均値を当該移動物体の物体位置とする。また、予め設定された閾値ＴＬ以上の尤度を複数選出し、選出された尤度と組をなしている候補位置を当該尤度で重み付けて平均した重み付け平均値を物体位置と判定することもできる。また、尤度の高い順に上位所定個数の尤度を選出し、選出された尤度と組をなしている候補位置を当該尤度で重み付けて平均した重み付け平均値を物体位置と判定することもできる。また、各移動物体について最大の尤度を選出するとともに当該尤度と閾値ＴＬとを比較し、最大の尤度が閾値ＴＬ以上であれば当該尤度と組をなしている候補位置を物体位置と判定してもよい。 The object position determination means 53 calculates the object position of each moving object by integrating the candidate positions indicated by the hypotheses of the moving object based on the likelihood of the hypotheses. Preferably, the weighted average value obtained by weighting the candidate positions indicated by the hypotheses of the moving object by the likelihood of the hypotheses is determined as the object position of the moving object. In addition, it is also possible to select a plurality of likelihoods equal to or greater than a preset threshold TL, and determine the weighted average value obtained by weighting the candidate positions paired with the selected likelihoods by the likelihoods and averaging them as the object position. It is also possible to select a predetermined number of likelihoods in descending order of likelihood, and determine the weighted average value obtained by weighting the candidate positions paired with the selected likelihoods by the likelihoods and averaging them as the object position. In addition, it is also possible to select the maximum likelihood for each moving object, compare the likelihood with the threshold TL, and determine the candidate position paired with the likelihood as the object position if the maximum likelihood is equal to or greater than the threshold TL.

算出した尤度を仮説に対応付けて物体情報記憶手段４１に記憶させ、算出した物体位置を移動物体と対応付けて物体情報記憶手段４１に記憶させる。 The calculated likelihood is associated with the hypothesis and stored in the object information storage means 41, and the calculated object position is associated with the moving object and stored in the object information storage means 41.

また、物体位置判定手段５３は、追跡中の移動物体について、物体位置やテンプレートの更新処理を行うと共に、新規移動物体の存在を判定し、当該新規移動物体について物体情報を登録する処理、及び消失移動物体についての処理を行う。以下、追跡中の移動物体についての処理、新規移動物体についての処理、及び消失移動物体についての処理を順次、説明する。 The object position determination means 53 also performs processing to update the object position and template for the moving object being tracked, determines the presence of a new moving object, registers object information for the new moving object, and processes a disappeared moving object. Below, the processing for the moving object being tracked, the processing for the new moving object, and the processing for the disappeared moving object will be explained in turn.

〔追跡中の移動物体〕
物体位置判定手段５３は物体位置が判定された移動物体について、物体情報記憶手段４１に、当該判定された物体位置を追加記憶させ、当該移動物体のテンプレートを現時刻の画像特徴により更新する。更新は、抽出された画像特徴を、記憶されている画像特徴と置き換えてもよいし、抽出された画像特徴と記憶されている画像特徴とを重み付け平均してもよい。 [Tracking moving object]
The object position determination means 53 additionally stores the determined object position of the moving object in the object information storage means 41, and updates the template of the moving object with the image features at the current time. The updating may be performed by replacing the extracted image features with the stored image features, or by taking a weighted average of the extracted image features and the stored image features.

〔新規移動物体〕
物体位置判定手段５３は、監視空間に移動物体が存在しないときに撮影された背景画像と撮影画像との差分処理を行って背景差分領域を検出するとともに、現時刻の物体位置それぞれに形状モデルを配置していずれの形状モデルとも重ならない背景差分領域を抽出する。そして、物体位置判定手段５３は、非重複の背景差分領域が移動物体（人）として有効な面積を有していれば、非重複の背景差分領域に新規移動物体が存在すると判定する。物体位置判定手段５３は、非重複の背景差分領域に形状モデルを当てはめて新規移動物体の物体位置および領域を決定し、物体ＩＤと対応付けて当該移動物体のテンプレート、当該移動物体の物体位置を物体情報記憶手段４１に記憶させる。また、物体位置判定手段５３は、移動物体が存在しないときの撮影画像を背景画像として記憶部４に記憶させ、背景差分領域が検出されなかった領域の撮影画像で背景画像を更新する。なお尤度の算出に際して背景差分領域を検出する場合は改めての検出を行わずに流用すればよい。 [New moving object]
The object position determination means 53 performs a difference process between a background image taken when there is no moving object in the monitoring space and the taken image to detect a background difference region, and also arranges a shape model at each object position at the current time to extract a background difference region that does not overlap with any shape model. Then, if the non-overlapping background difference region has an area that is effective as a moving object (person), the object position determination means 53 determines that a new moving object exists in the non-overlapping background difference region. The object position determination means 53 applies a shape model to the non-overlapping background difference region to determine the object position and region of the new moving object, and stores the template of the moving object and the object position of the moving object in the object information storage means 41 in association with the object ID. In addition, the object position determination means 53 stores the image taken when there is no moving object in the storage unit 4 as a background image, and updates the background image with the image taken in the region where the background difference region was not detected. Note that when detecting a background difference region in the calculation of the likelihood, it is sufficient to use it without performing a new detection.

〔消失移動物体〕
物体位置判定手段５３は、全候補位置の尤度が閾値ＴＬ以下となった移動物体を消失移動物体と判定し、当該移動物体の物体情報を削除する。遮蔽物により全体が隠蔽された移動物体や撮影画像外に移動した移動物体や超高混雑領域に移動した移動物体が消失移動物体と判定される。そして、追跡対象外の移動物体群として超高混雑領域記憶手段（図示省略）に記憶する。なお、当該移動物体は超高混雑領域から抜け出たときに新規移動物体として改めて追跡される。 [Disappearing Moving Object]
The object position determination means 53 determines a moving object whose likelihood of all candidate positions is equal to or less than the threshold TL as a lost moving object, and deletes the object information of the moving object. A moving object that is entirely hidden by an obstruction, a moving object that has moved outside the captured image, or a moving object that has moved into an ultra-highly congested area is determined to be a lost moving object. Then, the moving object is stored in an ultra-highly congested area storage means (not shown) as a group of moving objects not to be tracked. Note that when the moving object leaves the ultra-highly congested area, it is tracked again as a new moving object.

追跡結果出力手段３１は、例えば、追跡中の移動物体ごとの物体位置の系列をプロットした軌跡画像を生成するとともに、予め混雑度に対応する色を定めておき、混雑度マップの各画素と対応する画素に当該画素の混雑度に対応する色の画素値を設定した混雑度画像を生成して、軌跡画像と混雑度画像とを透過合成した画像を表示部６に出力する。さらに現時刻の撮影画像を重畳してもよい。 The tracking result output means 31, for example, generates a trajectory image in which a series of object positions for each moving object being tracked is plotted, and generates a congestion image in which a color corresponding to the congestion level is determined in advance and a pixel value of a color corresponding to the congestion level of each pixel in the congestion level map is set for each pixel corresponding to that pixel, and outputs an image in which the trajectory image and the congestion level image are transparently synthesized to the display unit 6. In addition, the captured image at the current time may be superimposed.

[動作例]
以下、移動物体追跡装置１の動作を説明する。図５は移動物体追跡装置１の動作の全体フロー図である。移動物体追跡装置１の動作が開始されると、撮影部２は予め規定したフレームレートで撮影画像を取得し、画像処理部５に撮影画像を出力する。画像処理部５は撮影画像が入力されるたびに（ステップＳ１）、ステップＳ２～Ｓ１１の一連の処理を繰り返す。 [Example]
The operation of the moving object tracking device 1 will be described below. Fig. 5 is an overall flow diagram of the operation of the moving object tracking device 1. When the operation of the moving object tracking device 1 is started, the image capture unit 2 acquires captured images at a predefined frame rate and outputs the captured images to the image processing unit 5. Every time a captured image is input (step S1), the image processing unit 5 repeats a series of processes from steps S2 to S11.

画像処理部５は撮影部２で取得した撮影画像に対し混雑度推定手段５０により混雑度マップを出力する。さらに物体情報記憶手段４１に記録された各追跡対象の移動物体の混雑度を追跡対象の現時刻の移動物体の物体位置を予測した位置に対応する混雑度マップの値を参照することにより求める。また、追跡が困難となる超高混雑領域を抽出する（ステップＳ２）。 The image processing unit 5 outputs a congestion map for the captured image acquired by the image capturing unit 2 using the congestion estimation means 50. Furthermore, the congestion degree of each moving object to be tracked recorded in the object information storage means 41 is calculated by referring to the value of the congestion map corresponding to the predicted object position of the moving object to be tracked at the current time. In addition, an ultra-high congestion area where tracking becomes difficult is extracted (step S2).

画像処理部５は各追跡対象の人物の混雑度から人物ごとの候補位置数の割り当てを候補位置数割当手段５１にて行う（ステップＳ３）。 The image processing unit 5 assigns the number of candidate positions to each person based on the degree of congestion of each tracked person using the candidate position number assignment means 51 (step S3).

画像処理部５は追跡対象の人物について追跡処理を行い現在の物体位置を推定する（ステップＳ４～Ｓ９）。画像処理部５は物体情報記憶手段４１に記録されている人物を順次、注目人物に設定して注目人物の追跡処理を行い、全ての人物について追跡処理が完了した場合は、画像処理部５は処理をステップＳ１０に進め、一方、未処理の人物が存在する場合は追跡処理を継続する（ステップＳ９）。 The image processing unit 5 performs a tracking process on the person to be tracked and estimates the current object position (steps S4 to S9). The image processing unit 5 sequentially sets the people recorded in the object information storage means 41 as people of interest and performs tracking process on the people of interest. When tracking process is complete for all people, the image processing unit 5 proceeds to step S10, whereas if there are unprocessed people, the tracking process continues (step S9).

以下、ステップＳ４～Ｓ９の追跡処理をさらに詳しく説明する。画像処理部５は候補位置設定手段５２として機能し、注目人物についてステップＳ３で割り当てられた候補位置数に基づき仮説の設定を行う（ステップＳ４）。すなわち、候補位置設定手段５２は、注目人物の過去の物体位置に現在の物体位置を外挿してその近傍に、割り当てられた個数と同じ数だけの候補位置を設定し、各候補位置を含んだ仮説を設定する。その後、候補位置が撮影画像外またはステップＳ２にて抽出された超高混雑領域内となっている仮説は削除する。 The tracking process of steps S4 to S9 will be explained in more detail below. The image processing unit 5 functions as a candidate position setting means 52, and sets hypotheses based on the number of candidate positions assigned to the person of interest in step S3 (step S4). That is, the candidate position setting means 52 extrapolates the current object position to the past object position of the person of interest, sets the same number of candidate positions in the vicinity as the assigned number, and sets hypotheses including each candidate position. Thereafter, hypotheses in which the candidate positions are outside the captured image or within the ultra-highly congested area extracted in step S2 are deleted.

画像処理部５は物体位置判定手段５３として機能し、物体位置判定手段５３はステップＳ４で設定された各仮説に対して尤度の算出を行う（ステップＳ５）。 The image processing unit 5 functions as an object position determination means 53, which calculates the likelihood for each hypothesis set in step S4 (step S5).

その後、物体位置判定手段５３は注目物体が消失移動物体か否かを判定し（ステップＳ６）、消失移動物体と判定した場合は追跡終了処理を行う（ステップＳ７）。すなわち物体位置判定手段５３はステップＳ４で削除されずに残った仮説においてすべての仮説の尤度が閾値ＴＬ未満であれば注目人物を消失移動物体と判定して物体情報記憶手段４１から注目人物に関する情報を削除する。 Then, the object position determination means 53 determines whether the target object is a vanishing moving object (step S6), and performs tracking termination processing if it is determined to be a vanishing moving object (step S7). That is, if the likelihood of all hypotheses remaining in step S4 that were not deleted is less than the threshold value TL, the object position determination means 53 determines that the target person is a vanishing moving object and deletes information about the target person from the object information storage means 41.

ステップＳ６にて追跡の継続が可能と判断された場合は、物体位置判定手段５３は、ステップＳ４で設定された仮説群の候補位置およびステップＳ５で算出された尤度に基づいて追跡人物の物体位置を推定する（ステップＳ８）。 If it is determined in step S6 that tracking can be continued, the object position determination means 53 estimates the object position of the tracked person based on the candidate positions of the hypothesis group set in step S4 and the likelihood calculated in step S5 (step S8).

上述の追跡処理Ｓ４～Ｓ９が物体情報記憶手段４１に登録された全ての人物に対して行われると、既に述べたように画像処理部５は処理をステップＳ１０に進め、物体位置判定手段５３により、撮影画像にてまだ追跡設定されていない人物の検出を行い、検出された場合は新規の追跡人物として追加する（ステップＳ１０）。 When the above-mentioned tracking processes S4 to S9 have been performed for all people registered in the object information storage means 41, as already mentioned, the image processing unit 5 advances the process to step S10, and the object position determination means 53 detects people in the captured image who have not yet been set for tracking, and if detected, adds them as new people to be tracked (step S10).

ステップＳ１で入力された撮影画像に対し上述した処理Ｓ２～Ｓ１０により人物の追跡が完了すると、画像処理部５は追跡結果を表示部６へ出力する（ステップＳ１１）。例えば、画像処理部５は追跡結果として全人物の推定位置を表示部６の表示装置等に表示させる。 When tracking of people is completed for the captured image input in step S1 by the above-mentioned processes S2 to S10, the image processing unit 5 outputs the tracking results to the display unit 6 (step S11). For example, the image processing unit 5 causes the display device of the display unit 6 to display the estimated positions of all people as the tracking results.

［変形例］
（１）上記実施形態では、混雑度推定手段５０が連続値を出力する推定器を用いた例を示したが、離散的な混雑度を出力する推定器を用いることもできる。 [Modification]
(1) In the above embodiment, an example was shown in which the congestion level estimation means 50 used an estimator that outputs a continuous value. However, an estimator that outputs a discrete congestion level may also be used.

例えば、推定器を多クラスＳＶＭ（Support Vector Machine）でモデル化し、混雑度の度合いに応じて「背景（無人）」、「低混雑度」、「中混雑度」、「高混雑度」、「超高混雑度」の５クラスに分類してラベル付けされた学習用画像を用いて当該モデルを学習させておく。そして、混雑度推定手段５０は、撮影画像の各画素を中心とする窓を設定して窓内の画像の特徴量を推定器に入力し、各画素のクラスを識別する。このように、混雑度推定手段５０で離散的な混雑度を推定した場合、候補位置数割当手段５１は、例えば図６に示す関数により候補位置数を算出する。図６は混雑度が５段階の離散的に推定される場合を想定したものであり、それぞれの混雑度の段階に応じて候補位置数も離散的に変化する。 For example, the estimator is modeled using a multi-class SVM (Support Vector Machine), and the model is trained using learning images that are classified and labeled into five classes according to the degree of congestion: "background (unmanned)", "low congestion", "medium congestion", "high congestion", and "very high congestion". The congestion degree estimation means 50 then sets a window centered on each pixel of the captured image, inputs the image features within the window to the estimator, and identifies the class of each pixel. In this way, when the congestion degree estimation means 50 estimates a discrete congestion degree, the candidate position number allocation means 51 calculates the number of candidate positions using, for example, the function shown in FIG. 6. FIG. 6 assumes a case in which the congestion degree is estimated discretely in five stages, and the number of candidate positions also changes discretely according to each congestion degree stage.

また、多クラスＳＶＭ以外にも、決定木型のランダムフォレスト法、多クラスのアダブースト（AdaBoost）法または多クラスロジスティック回帰法などにて学習した種々の多クラス識別器によっても推定器を実現できる。或いは識別型のＣＮＮによっても推定器を実現できる（ＣＮＮの場合、窓走査は不要）。また、クラス分類された学習用画像を用いる場合でも特徴量から混雑度を回帰する回帰型のモデルとすることによって連続値の混雑度を出力する推定器を実現することもできる。その場合、リッジ回帰法、サポートベクターリグレッション法、回帰木型のランダムフォレスト法またはガウス過程回帰（Gaussian Process Regression）などによって、特徴量から混雑度を求めるための回帰関数のパラメータを学習させる。或いは回帰型のＣＮＮを用いた推定器とすることもできる（ＣＮＮの場合、窓走査は不要）。 In addition to the multi-class SVM, the estimator can also be realized by various multi-class classifiers trained by the decision tree type random forest method, the multi-class AdaBoost method, or the multi-class logistic regression method. Alternatively, the estimator can be realized by a discriminative CNN (window scanning is not required for CNN). Even when using classified learning images, an estimator that outputs a continuous value of crowding degree can be realized by using a regression type model that regresses the crowding degree from the feature amount. In that case, the parameters of the regression function for calculating the crowding degree from the feature amount are learned by the ridge regression method, the support vector regression method, the regression tree type random forest method, or the Gaussian process regression. Alternatively, the estimator can be an estimator using a regression type CNN (window scanning is not required for CNN).

（２）上記各実施形態およびその各変形例においては、物体位置判定手段５３が背景差分処理に基づき新規移動物体を検出する例を示したが、その代わりに、追跡対象とする移動物体の画像を不特定多数機械学習した（例えば不特定多数の人の画像を深層学習した）学習済みモデルを用いて新規移動物体を検出してもよい。その場合、物体位置判定手段５３は、撮影画像を学習済みモデルに入力して移動物体の領域を検出し、いずれの形状モデルとも重複しない領域が閾値ＴＤ以上の大きさである移動物体の領域に新規移動物体が存在すると判定する。 (2) In each of the above embodiments and each of the modified examples, an example has been shown in which the object position determination means 53 detects a new moving object based on background subtraction processing. Instead, however, a new moving object may be detected using a trained model that has been machine-learned from an unspecified number of images of the moving object to be tracked (e.g., deep learning from an unspecified number of images of people). In this case, the object position determination means 53 inputs the captured image into the trained model to detect the area of the moving object, and determines that a new moving object exists in the area of the moving object where the area that does not overlap with any shape model is equal to or larger than the threshold value TD.

（３）上記実施形態およびその各変形例では１台の撮影部による追跡を例示したが、視野を共有した複数の撮影部を用いて追跡を行うこともできる。 (3) In the above embodiment and its modified examples, tracking using one image capture unit is exemplified, but tracking can also be performed using multiple image capture units that share a field of view.

まず、複数の撮影部に共通の３次元座標系を定義し、共通３次元座標系での各撮影部の位置と姿勢の関係を求め、３次元座標系から各撮影部の画像座標系への座標変換に必要なパラメータを算出する。物体位置および候補位置は、監視空間の地面等の水平平面上で直交するＸ軸、Ｙ軸と鉛直方向のＺ軸とで表される３次元座標系におけるＸ座標、Ｙ座標、Ｚ座標の組（Ｘ，Ｙ，Ｚ）で表される。 First, a three-dimensional coordinate system common to multiple image capture units is defined, the relationship between the position and orientation of each image capture unit in the common three-dimensional coordinate system is found, and the parameters required for coordinate conversion from the three-dimensional coordinate system to the image coordinate system of each image capture unit are calculated. The object position and candidate positions are represented by a set of X, Y, and Z coordinates (X, Y, Z) in a three-dimensional coordinate system represented by orthogonal X and Y axes and a vertical Z axis on a horizontal plane such as the ground in the monitored space.

混雑度推定手段５０は、まずすべての撮影部の撮影画像に対して混雑度マップを推定する。そして、共通３次元座標系での移動物体に対する混雑度を統合混雑度とし、各撮影部の混雑度マップに基づき算出する。具体的には、移動物体ごとに、共通３次元座標系での物体位置を各撮影部の画像座標系に変換し、各撮影部の混雑度マップ上での位置を求め、それぞれの撮影部の撮影画像上における混雑度を算出する。そして、移動物体ごとに、撮影部毎の混雑度のうち最も高い値を当該移動物体の総合混雑度とする。総合混雑度を撮影部毎の混雑度の最も高い値とするのは、総合混雑度に基づき候補位置数の配分を行う際に、最も候補位置数が多く必要となる混雑度の高い撮影部に合わせることで、いずれの撮影部についても追跡精度の維持に必要な候補位置数を満たすことができるためである。 The congestion degree estimation means 50 first estimates a congestion degree map for the images captured by all the imaging units. Then, the congestion degree for the moving object in the common three-dimensional coordinate system is calculated as an integrated congestion degree based on the congestion degree map of each imaging unit. Specifically, for each moving object, the object position in the common three-dimensional coordinate system is converted to the image coordinate system of each imaging unit, the position on the congestion degree map of each imaging unit is obtained, and the congestion degree on the image captured by each imaging unit is calculated. Then, for each moving object, the highest value of the congestion degrees for each imaging unit is calculated as the overall congestion degree of the moving object. The reason why the overall congestion degree is set to the highest value of the congestion degree for each imaging unit is that when allocating the number of candidate positions based on the overall congestion degree, the number of candidate positions required to maintain tracking accuracy can be met for each imaging unit by matching it to the imaging unit with the highest congestion degree, which requires the largest number of candidate positions.

なお、混雑度推定時点では現時刻の物体位置が不明であるが、混雑度推定手段５０は、各移動物体について、現時刻の物体位置を過去の物体位置、速度等から予測し、予測した現時刻の物体位置を用いて、各移動物体の総合混雑度を求める。 Although the object position at the current time is unknown at the time of congestion degree estimation, the congestion degree estimation means 50 predicts the object position at the current time for each moving object from the object's past positions, speed, etc., and calculates the overall congestion degree of each moving object using the predicted object position at the current time.

候補位置数割当手段５１は総合混雑度を用いて上記実施形態と同様に移動物体に対する候補位置数の割り当てを行い、候補位置設定手段５２は割り当て数に基づいて共通３次元座標系の候補位置を設定する。候補位置の尤度は総合混雑度の算出と同様に、各候補位置を撮影画像上の位置に撮影部毎に座標変換し、算出された撮影画像上の位置でテンプレートと特徴量の比較を行うことで撮影部毎の尤度を算出し、平均することで最終的な仮説（候補位置）の尤度とする。現時刻の物体位置は、各仮説（候補位置）の尤度から実施形態と同様に算出する。 The candidate position number allocation means 51 uses the overall congestion degree to allocate the number of candidate positions for moving objects in the same manner as in the above embodiment, and the candidate position setting means 52 sets candidate positions in a common three-dimensional coordinate system based on the allocated number. As with the calculation of the overall congestion degree, the likelihood of the candidate positions is calculated by converting the coordinates of each candidate position to a position on the captured image for each capture unit, and comparing the features with a template at the calculated position on the captured image to calculate the likelihood for each capture unit, and averaging them to determine the likelihood of the final hypothesis (candidate position). The object position at the current time is calculated from the likelihood of each hypothesis (candidate position) in the same manner as in the embodiment.

（４）本発明は、車両、動物等、混雑状態をなし得る人以外の移動物体にも適用できる。 (4) The present invention can also be applied to moving objects other than people that may cause congestion, such as vehicles and animals.

１…移動物体追跡装置、２…撮影部、３…通信部、４…記憶部、５…画像処理部、６…表示部、３０…画像取得部、３１…追跡結果出力手段、４０…混雑度記憶手段、４１…物体情報記憶手段、５０…混雑度推定手段、５１…候補位置数割当手段、５２…候補位置設定手段、５３…物体位置判定手段 1...moving object tracking device, 2...photographing unit, 3...communication unit, 4...storage unit, 5...image processing unit, 6...display unit, 30...image acquisition unit, 31...tracking result output means, 40...congestion level storage means, 41...object information storage means, 50...congestion level estimation means, 51...candidate position number allocation means, 52...candidate position setting means, 53...object position determination means

Claims

A moving object tracking device that tracks one or more moving objects based on sequentially captured images,
A congestion degree estimation means for estimating a congestion degree of the moving objects captured in the captured image;
a candidate position number allocation means for allocating a candidate position number for each moving object according to the congestion degree;
a candidate position setting means for determining candidate positions of the moving object at the current time from past positions of the moving object, the number of which is the same as the number of candidate positions assigned to each of the moving objects;
an object position determining means for determining an object position of the moving object at a current time based on the plurality of candidate positions for each of the moving objects;
Equipped with
The moving object tracking device according to claim 1, wherein the candidate position number allocating means allocates a larger number of candidate positions of the moving object as the degree of congestion increases .

the congestion degree estimation means inputs the photographed image to an estimator that has been trained in advance to output the congestion degree at an arbitrary position within the photographed image when the photographed image is input, and estimates the congestion degree at an arbitrary position within the photographed image;
2. The moving object tracking device according to claim 1, wherein said candidate position number allocating means allocates the number of candidate positions of each moving object in accordance with the degree of congestion of the object position of the moving object or the candidate positions on the captured image.

3. The moving object tracking device according to claim 1 , wherein the candidate position number allocation means compresses the number of candidate positions of each moving object so that the sum of the candidate positions of all the moving objects to be tracked at one time falls within a total number upper limit value.

4. The moving object tracking device according to claim 1, wherein the candidate position setting means sets the candidate positions of the moving object in a wider range as the degree of congestion becomes smaller.

A moving object tracking device that tracks one or more moving objects based on sequentially captured images,
A congestion degree estimation means for estimating a congestion degree of the moving objects captured in the captured image;
a candidate position number allocation means for allocating a candidate position number for each moving object according to the congestion degree;
a candidate position setting means for determining candidate positions of the moving object at the current time from past positions of the moving object, the number of which is the same as the number of candidate positions assigned to each of the moving objects;
an object position determining means for determining an object position of the moving object at a current time based on the plurality of candidate positions for each of the moving objects;
Equipped with
In an area where the degree of congestion is equal to or higher than a certain level, the candidate position number allocation means, the candidate position setting means, and the object position determination means do not perform processing,
A moving object tracking device characterized in that the positions of a group of moving objects on the sequentially captured images are stored.

A moving object tracking method using a moving object tracking device that tracks one or more moving objects based on sequentially captured images, comprising:
Estimating a congestion degree of the moving objects captured in the captured image;
the higher the congestion level, the greater the number of candidate positions of each of the moving objects is allocated;
determining candidate positions of the moving object at the current time from past positions of the moving object, the number of which is equal to the number of candidate positions assigned to each of the moving objects;
determining an object position of the moving object at a current time based on the plurality of candidate positions for each of the moving objects.

A moving object tracking program executed in a moving object tracking device that tracks one or more moving objects based on sequentially captured images, comprising:
A process of estimating a congestion degree of the moving objects captured in the captured image;
A process of allocating a number of candidate positions for each moving object according to the congestion degree;
A process of determining candidate positions of the moving object at the current time from past positions of the moving object, the number of which is equal to the number of candidate positions assigned to each of the moving objects;
A process of determining an object position of the moving object at a current time based on the plurality of candidate positions for each of the moving objects;
Run the command ,
A moving object tracking program, wherein the process of allocating the number of candidate positions allocates a larger number of candidate positions of the moving object as the degree of congestion increases.