JP6897095B2

JP6897095B2 - Image processing program, image processing device and image processing method

Info

Publication number: JP6897095B2
Application number: JP2016253900A
Authority: JP
Inventors: 敏浩藤本; 昌宏林; 剛美木村; 和幸宮田; 潤一植村; 浩之井出
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2016-12-27
Filing date: 2016-12-27
Publication date: 2021-06-30
Anticipated expiration: 2036-12-27
Also published as: JP2018106539A

Description

The present invention relates to an image processing program, an image processing device, and an image processing method.

For those who provide lectures, talks, or training courses attended by many students (hereinafter, such events are called "lectures" and those who provide them "lecturers"), it is important to understand which lecture content and lecture techniques are effective in keeping each student focused on the lecture. To grasp the correlation between lecture content or lecture technique and the state of each student, the state of each student during the lecture must first be analyzed quantitatively.

To address this, techniques have been proposed that process images captured during a lecture and analyze the state of each student. Specifically, an image processing system has been proposed that captures a scene containing a large number of students and analyzes the inclination of each person's face in the captured images, thereby quantitatively calculating the state of each person (for example, a nodding state or a dozing state).

Japanese Laid-open Patent Publication No. 2007-97668 (JP-A-2007-97668)

However, to analyze the state of a person in an image with the above image processing system, detailed facial information about the person (for example, eyebrow positions and the facial contour) must be calculated from the captured image. Applying the system to a lecture attended by a large number of students therefore requires installing many high-performance imaging devices, which is costly.

For this reason, there is a need to analyze the state of persons in images captured during a lecture using a simple image processing system (that is, efficiently).

In one aspect, an object is to enable efficient analysis of the state of a person included in an image.

According to one aspect, an image processing program causes a computer to execute a process of:
extracting, from each of a plurality of images obtained by capturing a lecture held in the same imaging range at different timings, the positions at which the faces of students attending the lecture are present;
identifying an image region for each of the plurality of images based on the extracted positions at which the faces are present; and
outputting a result that depends on the presence or absence of a face in the identified image region, in association with information indicating the timing at which each of the plurality of images was captured, at a plot cycle that depends on the format of the lecture.

The state of a person included in an image can be analyzed efficiently.

A diagram showing an example of the system configuration of the image processing system.
A diagram showing an example of the hardware configuration of the image processing device.
A diagram showing the functional configuration of the image processing unit of the image processing device.
A diagram showing a specific example of processing by the face candidate detection unit.
A diagram showing an example of detection result information.
A diagram showing a specific example of processing by the first narrowing unit.
A diagram showing a specific example of processing by the second narrowing unit.
A diagram showing a specific example of processing by the detection count time-series data generation unit.
A first diagram showing a specific example of processing by the change count time-series data generation unit.
A second diagram showing a specific example of processing by the change count time-series data generation unit.
A first flowchart of image processing by the image processing device.
A second flowchart of image processing by the image processing device.
A diagram showing another example of time-series data of the face detection count.

Each embodiment is described below with reference to the accompanying drawings. In this specification and the drawings, components having substantially the same functional configuration are given the same reference numerals, and duplicate description is omitted.

[First Embodiment]
<System configuration of the image processing system>
First, the system configuration of the entire image processing system will be described. FIG. 1 is a diagram showing an example of the system configuration of the image processing system.

As shown in FIG. 1, the image processing system 100 includes an imaging device 110, a moving image data storage device 120, and an image processing device 130. The moving image data storage device 120 and the image processing device 130 are connected via a network 140.

The imaging device 110 is installed facing the students in a space where many students attend a lecture, such as a university lecture room, and captures the area in which the students are seated as its imaging range. The imaging device 110 also transmits the moving image data obtained by this capture to the moving image data storage device 120.

The moving image data storage device 120 stores the moving image data transmitted from the imaging device 110. In response to a request from the image processing device 130, the moving image data storage device 120 also transmits the moving image data to the image processing device 130 via the network 140.

An image processing program is installed in the image processing device 130. By executing this program, the image processing device 130 functions as an image processing unit 131.

The image processing unit 131 processes the moving image data frame by frame. From the image data of each frame, it detects candidates for students' faces (referred to as face candidate data) and extracts the size of each piece of face candidate data and the position at which it is present. The image processing unit 131 also narrows down the detected face candidate data based on the extracted size and position, and calculates, for each piece of image data, a "face detection count" indicating the number of students who have their faces raised. A student with a raised face is regarded as concentrating on the lecture. Calculating the face detection count therefore makes it possible to analyze whether the students are concentrating on the lecture.

The image processing unit 131 associates the face detection count calculated for each piece of image data with time information indicating when that image data was captured, and stores the result in the analysis result storage unit 132 as time-series data.

Further, based on the face detection result for each piece of image data, the image processing unit 131 calculates a "change count" indicating how frequently students raise and lower their faces, associates it with the time information, and stores it in the analysis result storage unit 132 as time-series data. A student who frequently raises and lowers their face is regarded as taking notes while attending the lecture, and thus as concentrating on it. Calculating the change count therefore also makes it possible to analyze whether the students are concentrating on the lecture.

In this way, the image processing device 130 calculates the face detection count and the change count in order to analyze the state of the persons included in an image. That is, the image processing device 130 detects the faces of persons but does not go as far as calculating detailed facial information. The image processing system therefore does not require a high-performance imaging device, and the number of imaging devices to be installed can be reduced.

In other words, a simple image processing system suffices, and the state of the persons included in an image can be analyzed efficiently.

<Hardware configuration of the image processing device>
Next, the hardware configuration of the image processing device 130 will be described. FIG. 2 is a diagram showing an example of the hardware configuration of the image processing device. As shown in FIG. 2, the image processing device 130 includes a CPU (Central Processing Unit) 201, a ROM (Read Only Memory) 202, and a RAM (Random Access Memory) 203. The CPU 201, the ROM 202, and the RAM 203 form a so-called computer.

The image processing device 130 also includes an auxiliary storage device 204, a display device 205, an operation device 206, an I/F (Interface) device 207, and a drive device 208. The components of the image processing device 130 are connected to one another via a bus 209.

The CPU 201 is a device that executes the various programs (for example, the image processing program) installed in the auxiliary storage device 204.

The ROM 202 is a non-volatile memory. The ROM 202 functions as a main storage device that stores the programs, data, and the like that the CPU 201 needs in order to execute the various programs installed in the auxiliary storage device 204. Specifically, the ROM 202 stores boot programs such as a BIOS (Basic Input/Output System) or EFI (Extensible Firmware Interface).

The RAM 203 is a volatile memory such as a DRAM (Dynamic Random Access Memory) or an SRAM (Static Random Access Memory). The RAM 203 functions as a main storage device that provides the work area into which the various programs installed in the auxiliary storage device 204 are loaded when executed by the CPU 201.

The auxiliary storage device 204 stores the various programs, the information generated when those programs are executed, and the information used when they are executed. The analysis result storage unit 132 is implemented in the auxiliary storage device 204.

The display device 205 displays information indicating the result of image processing in the image processing device 130 (for example, "information indicating a result that depends on the presence or absence of a face" associated with time information, such as the face detection count time-series data and the change count time-series data described later). The operation device 206 is an input device with which an administrator of the image processing device 130 enters various instructions to the image processing device 130.

The I/F device 207 is a communication device that connects to the network 140 and communicates with the moving image data storage device 120.

The drive device 208 is a device into which a recording medium 210 is loaded. The recording medium 210 here includes media that record information optically, electrically, or magnetically, such as a CD-ROM, a flexible disk, or a magneto-optical disk. The recording medium 210 may also include semiconductor memory that records information electrically, such as a ROM or a flash memory.

The various programs stored in the auxiliary storage device 204 are installed, for example, by loading a distributed recording medium 210 into the drive device 208 and having the drive device 208 read the programs recorded on that recording medium 210.

<Functional configuration of the image processing device>
Next, the functional configuration of the image processing unit 131 of the image processing device 130 will be described. FIG. 3 is a diagram showing an example of the functional configuration of the image processing unit of the image processing device.

As shown in FIG. 3, the image processing unit 131 of the image processing device 130 includes a moving image data acquisition unit 301, a face candidate detection unit 302, a first narrowing unit 303, and a second narrowing unit 304. The image processing unit 131 also includes a detection count time-series data generation unit 305, a change count time-series data generation unit 306, and a display control unit 307.

The moving image data acquisition unit 301 acquires moving image data from the moving image data storage device 120 and passes the image data to the face candidate detection unit 302 frame by frame.

The face candidate detection unit 302 is an example of an extraction unit. The face candidate detection unit 302 receives image data frame by frame from the moving image data acquisition unit 301 and processes the received image data to detect face candidate data for the students (the persons included in the image). The face candidate detection unit 302 detects each piece of face candidate data as a rectangular region and extracts the position (x coordinate, y coordinate) and the size (width, height) of the detected rectangular region. The face candidate detection unit 302 also stores the extracted position and size (width, height) in the analysis result storage unit 132 as detection result information.

When detection result information is newly stored in the analysis result storage unit 132, the first narrowing unit 303 determines whether the size (width, height) of each piece of face candidate data included in that detection result information satisfies a predetermined condition. The first narrowing unit 303 treats any face candidate data that does not satisfy the condition as an erroneous detection and deletes it from the detection result information.

The second narrowing unit 304 is an example of an identification unit. Taking the detection result information from which the first narrowing unit 303 has deleted the face candidate data that did not satisfy the predetermined condition, the second narrowing unit 304 classifies the remaining face candidate data into predetermined image regions based on position (x coordinate, y coordinate). The second narrowing unit 304 also counts the pieces of face candidate data classified into each image region and stores the counts in the analysis result storage unit 132 as aggregation result information. The second narrowing unit 304 then calculates the face detection count by subtracting, in the aggregation result information, the number of pieces of face candidate data classified into image regions whose count is equal to or less than a predetermined number.
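The classification and subtraction performed by the second narrowing unit can be sketched as follows. This is only an illustration under stated assumptions: the source does not say how the predetermined image regions are laid out, so a uniform grid (`CELL_W`, `CELL_H`) is assumed here, and `MIN_COUNT` stands in for the unspecified "predetermined number". The function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Assumptions for illustration only: a uniform grid of regions and a
# threshold of 1. The source specifies neither.
CELL_W, CELL_H = 200, 150   # hypothetical region size in pixels
MIN_COUNT = 1               # hypothetical "predetermined number"

Region = Tuple[int, int]


def tally_by_region(candidates: List[Dict]) -> Dict[Region, int]:
    """Classify each candidate by its upper-left (x, y) and count per region."""
    return dict(Counter((c["x"] // CELL_W, c["y"] // CELL_H) for c in candidates))


def face_detection_count(tally: Dict[Region, int]) -> int:
    """Total candidates minus those in regions holding MIN_COUNT or fewer."""
    total = sum(tally.values())
    sparse = sum(n for n in tally.values() if n <= MIN_COUNT)
    return total - sparse


candidates = [
    {"id": "F001", "x": 120, "y": 80},
    {"id": "F002", "x": 150, "y": 100},
    {"id": "F005", "x": 310, "y": 90},
    {"id": "F006", "x": 350, "y": 120},
    {"id": "F007", "x": 390, "y": 60},
    {"id": "F008", "x": 700, "y": 500},  # isolated candidate in its own region
]
tally = tally_by_region(candidates)
count = face_detection_count(tally)
```

In this sketch the isolated candidate F008 falls in a region with only one candidate, so it is subtracted and the face detection count is 5 rather than 6.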

The detection count time-series data generation unit 305 generates time-series data of the face detection count by associating the face detection count that the second narrowing unit 304 calculated for each piece of image data with time information indicating when that image data was captured, and stores the time-series data in the analysis result storage unit 132.
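As a minimal sketch of this association step, each frame's capture time can be paired with its face detection count. The function name and the use of seconds as the time unit are assumptions for illustration; the source does not specify the format of the time information.

```python
from typing import List, Tuple


def detection_time_series(
    timestamps: List[float], counts: List[int]
) -> List[Tuple[float, int]]:
    """Pair each frame's capture time with its face detection count."""
    if len(timestamps) != len(counts):
        raise ValueError("one count is required per timestamp")
    return list(zip(timestamps, counts))


# Hypothetical capture times (seconds) and per-frame face detection counts.
series = detection_time_series([0.0, 1.0, 2.0], [18, 17, 12])
```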

The change count time-series data generation unit 306 compares the aggregation result information of different pieces of image data, calculates for each image region the difference (absolute value) in the number of pieces of face candidate data classified into that region, and stores the differences in the analysis result storage unit 132 as difference information.

The absolute value of the difference is used so that both kinds of transition are counted: students who went from having their face raised to having it lowered, and students who went from having their face lowered to having it raised.

The change count time-series data generation unit 306 also calculates the change count for a piece of image data by summing the differences (absolute values) calculated for the individual image regions in the difference information. Further, the change count time-series data generation unit 306 generates time-series data of the change count by associating the change count calculated for each piece of image data with the time information, and stores the time-series data in the analysis result storage unit 132.
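The change count calculation can be sketched as below. It assumes that each frame is compared with the immediately preceding frame, which the text here does not state explicitly, and that the per-region counts are held in a dictionary keyed by a region index; both are assumptions for illustration, as are the function names.

```python
from typing import Dict, List, Tuple

Region = Tuple[int, int]


def change_count(prev: Dict[Region, int], curr: Dict[Region, int]) -> int:
    """Sum over regions of the absolute difference in candidate counts."""
    regions = set(prev) | set(curr)
    return sum(abs(curr.get(r, 0) - prev.get(r, 0)) for r in regions)


def change_time_series(
    timestamps: List[float], tallies: List[Dict[Region, int]]
) -> List[Tuple[float, int]]:
    """Pair each frame (from the second onward) with its change count,
    assuming comparison against the immediately preceding frame."""
    return [
        (timestamps[i], change_count(tallies[i - 1], tallies[i]))
        for i in range(1, len(tallies))
    ]


# Hypothetical per-region counts for three consecutive frames.
tallies = [
    {(0, 0): 2, (1, 0): 3},
    {(0, 0): 1, (1, 0): 4},  # one face lowered in (0, 0), one raised in (1, 0)
    {(0, 0): 1, (1, 0): 4},  # no change
]
changes = change_time_series([0.0, 1.0, 2.0], tallies)
```

In the second frame the signed differences (-1 and +1) would cancel to zero, whereas the absolute values give a change count of 2, which is why the absolute value is used.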

The display control unit 307 is an example of an output unit. The display control unit 307 outputs the time-series data of the face detection count and the time-series data of the change count stored in the analysis result storage unit 132 to the display device 205 as "information indicating a result that depends on the presence or absence of a face" associated with time information. As a result, the display device 205 displays the time-series data of the face detection count and the time-series data of the change count.

<Specific examples of processing by each unit included in the image processing unit of the image processing device>
Next, specific examples of processing by the units included in the image processing unit 131 of the image processing device 130 (here, the face candidate detection unit 302, the first narrowing unit 303, the second narrowing unit 304, the detection count time-series data generation unit 305, and the change count time-series data generation unit 306) will be described.

(1) Specific example of processing by the face candidate detection unit
FIG. 4 is a diagram showing a specific example of processing by the face candidate detection unit. FIG. 4(a) shows an example of moving image data acquired by the moving image data acquisition unit 301. As shown in FIG. 4(a), the moving image data 400 includes image data 400_0 to 400_n captured by the imaging device 110 at the respective timings from time t_0 to time t_n. The face candidate detection unit 302 receives the image data 400_0 to 400_n included in the moving image data 400 one by one from the moving image data acquisition unit 301 and detects face candidate data in each.

FIG. 4(b) shows the image data 400_0 that the face candidate detection unit 302 received from the moving image data acquisition unit 301. As shown in FIG. 4(b), the face candidate detection unit 302 takes the upper-left vertex of the image data 400_0 as the origin, the horizontal direction as the x-axis, and the vertical direction as the y-axis, and detects face candidate data by specifying the coordinates of each pixel in this coordinate system.

FIG. 4(c) shows an example of the face candidate data that the face candidate detection unit 302 detected in the image data 400_0. The face candidate detection unit 302 detects face candidate data in the image data 400_0 as rectangular regions and assigns an identifier (ID) to each piece of detected face candidate data. The face candidate detection unit 302 also extracts, for each piece of face candidate data, its position (the coordinates of the upper-left vertex of the rectangular region), its height (the length of the rectangular region in the y direction), and its width (the length of the rectangular region in the x direction). Further, the face candidate detection unit 302 calculates the number of pieces of detected face candidate data (the face candidate detection count).

The example in FIG. 4(c) shows that the identifier (ID) "F001" was assigned to face candidate data 410_1 and that coordinates = (x_1, y_1), height = h_1, and width = w_1 were extracted. For the image data 400_0, the face candidate detection unit 302 detected m_0 pieces of face candidate data, 410_1 to 410_m_0.

Among the face candidate data shown in FIG. 4(c), pieces whose rectangular region does not contain a student's face (for example, face candidate data 410_3 and 410_4) are regions that are not faces but were erroneously detected as students' faces.

The face candidate detection unit 302 associates the information extracted for each piece of detected face candidate data (coordinates, height, width) with that piece's identifier (ID) and stores it in the analysis result storage unit 132 as detection result information. The face candidate detection unit 302 also stores the number of pieces of detected face candidate data (the face candidate detection count) in the analysis result storage unit 132 as part of the detection result information.

FIG. 5 is a diagram showing an example of detection result information. As shown in FIG. 5, the detection result information 500 includes the following items: "identifier (ID)", "horizontal origin (x)", "vertical origin (y)", "width (w)", "height (h)", and "face candidate detection count".

"Identifier (ID)" stores the identifier assigned to the face candidate data. "Horizontal origin (x)" stores the x coordinate of the face candidate data (the x coordinate of the upper-left vertex of the rectangular region). "Vertical origin (y)" stores the y coordinate of the face candidate data (the y coordinate of the upper-left vertex of the rectangular region).

"Width (w)" stores the width of the face candidate data (the length of the rectangular region in the x direction). "Height (h)" stores the height of the face candidate data (the length of the rectangular region in the y direction). "Face candidate detection count" stores the number of pieces of detected face candidate data.
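The items of the detection result information map naturally onto a small record type. The following is a minimal sketch, not the patent's actual data layout: the class name, the function name, and the dictionary shape are hypothetical, and the sequential "F001"-style numbering is inferred from the example identifiers in the text.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class FaceCandidate:
    """One row of the detection result information."""
    id: str   # identifier (ID), e.g. "F001"
    x: int    # horizontal origin: x coordinate of the upper-left vertex
    y: int    # vertical origin: y coordinate of the upper-left vertex
    w: int    # width: length of the rectangle in the x direction
    h: int    # height: length of the rectangle in the y direction


def build_detection_result(rects: List[Tuple[int, int, int, int]]) -> dict:
    """Assign sequential IDs to (x, y, w, h) rectangles and record the count."""
    candidates = [
        FaceCandidate(f"F{i:03d}", x, y, w, h)
        for i, (x, y, w, h) in enumerate(rects, start=1)
    ]
    # "count" corresponds to the face candidate detection count.
    return {"candidates": candidates, "count": len(candidates)}


# Hypothetical rectangles, as (x, y, w, h) in pixels from the upper-left origin.
result = build_detection_result([(120, 80, 32, 36), (300, 90, 30, 34)])
```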

FIG. 5 shows the information extracted for the face candidate data detected in the image data 400_0 at time t_0, stored as the detection result information 500. However, the detection result information stored by the face candidate detection unit 302 is not limited to the detection result information 500 shown in FIG. 5. The face candidate detection unit 302 likewise stores, one after another, the information extracted for the face candidate data detected in each of the image data 400_1 at time t_1 through the image data 400_n at time t_n in the analysis result storage unit 132 as detection result information.

(2) Specific example of processing by the first narrowing unit
FIG. 6 is a diagram showing a specific example of processing by the first narrowing unit. As described above, the first narrowing unit 303 deletes from the detection result information the information extracted for any face candidate data determined not to satisfy the predetermined condition, together with its identifier (ID).

Specifically, the first narrowing unit 303 holds in advance an upper limit and a lower limit for the width (w) and an upper limit and a lower limit for the height (h), and determines whether the width (w) and the height (h) of each piece of face candidate data fall within the respective ranges between the lower limit and the upper limit.

If the first narrowing unit 303 determines that at least one of the width (w) and the height (h) falls outside its range, it determines that the predetermined condition is not satisfied. The first narrowing unit 303 then deletes the information extracted for that face candidate data from the detection result information, together with its identifier (ID).

In the detection result information 500 of FIG. 6(a), the width (w) and height (h) values stored in the hatched cells are those that the first narrowing unit 303 determined to fall outside the respective ranges for width (w) and height (h).

In this case, the first narrowing-down unit 303 deletes the information extracted for the face candidate data with identifiers (ID) "F003" and "F004" from the detection result information 500, together with those identifiers.

FIG. 6(b) shows detection result information 600, that is, the detection result information after the first narrowing-down unit 303 has deleted the information extracted for the face candidate data determined not to satisfy the predetermined condition. In the example of FIG. 6(b), at least the information extracted for the face candidate data with identifiers (ID) "F003" and "F004" has been deleted together with the identifiers. As a result of this deletion, the "face candidate detection count" has changed from m_0 to m'_0.
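The size check described above can be sketched as follows. This is a minimal illustration rather than the implementation of the embodiment: the `FaceCandidate` record, the function name `filter_by_size`, the inclusive treatment of the limits, and all sample values are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class FaceCandidate:
    """Hypothetical record for one row of the detection result information."""
    id: str
    x: int
    y: int
    w: int  # width
    h: int  # height


def filter_by_size(candidates, w_limits, h_limits):
    """Keep candidates whose width and height both lie within their limits.

    Limits are (lower, upper) pairs, treated as inclusive (an assumption).
    """
    w_lo, w_hi = w_limits
    h_lo, h_hi = h_limits
    return [c for c in candidates
            if w_lo <= c.w <= w_hi and h_lo <= c.h <= h_hi]


# Illustrative values only: F003 is too wide and F004 is too short.
detected = [
    FaceCandidate("F001", 40, 30, 24, 26),
    FaceCandidate("F002", 45, 130, 22, 25),
    FaceCandidate("F003", 150, 60, 90, 28),
    FaceCandidate("F004", 210, 300, 20, 4),
]
remaining = filter_by_size(detected, w_limits=(10, 50), h_limits=(10, 50))
candidate_count = len(remaining)  # plays the role of m'_0 after deletion
```

Returning a new list instead of deleting rows in place keeps the original detection result available for comparison; either approach yields the same remaining candidates.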

(3) Specific example of processing by the second narrowing-down unit
FIG. 7 is a diagram showing a specific example of the processing by the second narrowing-down unit. As described above, once the information extracted for the face candidate data determined not to satisfy the predetermined condition has been deleted, the second narrowing-down unit 304 classifies the remaining face candidate data into predetermined image regions based on the position of each item. The second narrowing-down unit 304 also totals the number of face candidate data items classified into each image region, and calculates the face detection count by subtracting the number of items classified into image regions whose total is at or below a predetermined number.

FIG. 7(a) shows the detection result information 600 processed by the second narrowing-down unit 304. FIG. 7(b) shows aggregation result information 700, which is obtained by dividing the image data 400_0 into a plurality of image regions, classifying each item of face candidate data in the detection result information 600 into one of the image regions based on its position, and totaling the number of items classified into each image region.

The example of FIG. 7(b) shows the image data 400_0 divided into 12 predetermined image regions. It also shows that, of the 12 image regions, P_{0}11 items of face candidate data were classified into the image region specified by x-coordinate = 0 to X_1 and y-coordinate = 0 to Y_1. In the example of FIG. 7(b), the face candidate data with identifier (ID) "F001" in the detection result information 600 is classified into that image region (see arrow 701).

The example of FIG. 7(b) further shows that P_{0}12 items of face candidate data were classified into the image region specified by x-coordinate = 0 to X_1 and y-coordinate = Y_1 to Y_2. In the example of FIG. 7(b), the face candidate data with identifier (ID) "F002" in the detection result information 600 is classified into that image region (see arrow 702).

In the same way, the second narrowing-down unit 304 classifies every item of face candidate data in the detection result information 600 into one of the image regions based on its position, and totals the number of items classified into each image region (P_{0}11 to P_{0}34).

FIG. 7(c) shows the second narrowing-down unit 304 determining, for each image region, whether the totaled number of face candidate data items is at or below the predetermined number. In FIG. 7(c), the number of items (P_{0}24) classified into the image region specified by x-coordinate = X_1 to X_2 and y-coordinate = Y_3 to Y_4 is at or below the predetermined number.

In this case, the second narrowing-down unit 304 calculates the face detection count (C_0) by subtracting the number of face candidate data items (P_{0}24) classified into the image region specified by x-coordinate = X_1 to X_2 and y-coordinate = Y_3 to Y_4 from the face candidate detection count (m'_0).

The aggregation result information 710 of FIG. 7(c) shows that the second narrowing-down unit 304 calculated m'_0 - P_{0}24 as the face detection count (C_0) of the image data 400_0.
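The region-based narrowing described above can be sketched as follows. This is an illustration under stated assumptions: each face candidate is represented by a single (x, y) point inside the image, the regions form a regular 3 x 4 grid given by edge lists, and the names `count_per_region` and `face_detection_count` as well as the threshold value are hypothetical.

```python
from bisect import bisect_right
from collections import Counter


def count_per_region(positions, x_edges, y_edges):
    """Total the candidates per region; regions are keyed by 1-based (column, row).

    x_edges and y_edges list the region boundaries, e.g. [0, X_1, X_2, X_3].
    Positions are assumed to lie inside the image.
    """
    counts = Counter()
    for x, y in positions:
        col = min(bisect_right(x_edges, x), len(x_edges) - 1)
        row = min(bisect_right(y_edges, y), len(y_edges) - 1)
        counts[(col, row)] += 1
    return counts


def face_detection_count(counts, threshold):
    """Subtract candidates in regions whose total is at or below the threshold."""
    candidate_total = sum(counts.values())
    sparse_total = sum(n for n in counts.values() if n <= threshold)
    return candidate_total - sparse_total


# Illustrative 300 x 400 image split into 3 columns and 4 rows (12 regions).
x_edges = [0, 100, 200, 300]
y_edges = [0, 100, 200, 300, 400]
positions = [(10, 10), (20, 30), (50, 60),   # region (1, 1)
             (30, 150), (40, 160),           # region (1, 2)
             (150, 350)]                     # region (2, 4), isolated candidate
counts = count_per_region(positions, x_edges, y_edges)
c0 = face_detection_count(counts, threshold=1)
```

With these sample values the isolated candidate in region (2, 4) is treated as a likely false detection and removed from the count, in the same way that P_{0}24 is subtracted from m'_0 in FIG. 7(c).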

(4) Specific example of processing by the detection count time-series data generation unit
FIG. 8 is a diagram showing a specific example of the processing by the detection count time-series data generation unit. As described above, the detection count time-series data generation unit 305 associates the face detection count calculated for each item of image data by the second narrowing-down unit 304 with time information.

FIG. 8(a) shows the detection count time-series data generation unit 305 associating the face detection counts C_0 to C_n, calculated by the second narrowing-down unit 304 for the image data 400_0 to 400_n, with time information. FIG. 8(b) is a graph showing the time-series data of the face detection count. In FIG. 8(b), the horizontal axis represents the elapsed time from time t_0, and the vertical axis represents the face detection count at each elapsed time.

The detection count time-series data generation unit 305 stores the generated time-series data in the analysis result storage unit 132.
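The association of face detection counts with time information can be sketched as below. The function name `detection_time_series`, the use of integer millisecond timestamps, and the sample values are assumptions for illustration; the embodiment does not specify a data format.

```python
def detection_time_series(timestamps_ms, counts):
    """Pair each face detection count with the elapsed time since the first frame."""
    if len(timestamps_ms) != len(counts):
        raise ValueError("one count is required per timestamp")
    if not timestamps_ms:
        return []
    t0 = timestamps_ms[0]
    return [(t - t0, c) for t, c in zip(timestamps_ms, counts)]


# Illustrative values: three frames captured 300 ms apart.
series = detection_time_series([5000, 5300, 5600], [18, 20, 17])
```

Each pair corresponds to one plotted point in a graph such as FIG. 8(b), with elapsed time on the horizontal axis and the count on the vertical axis.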

(5) Specific example of processing by the change count time-series data generation unit
FIG. 9 is a first diagram showing a specific example of the processing by the change count time-series data generation unit. As described above, for the aggregation result information of each item of image data, the change count time-series data generation unit 306 calculates the difference (absolute value), for each image region, from the aggregation result information of the image data one frame earlier. It then calculates the change count by summing the per-region differences (absolute values).

FIGS. 9(a) and 9(b) show the aggregation result information 710 of the image data 400_0 and the aggregation result information 910 of the image data 400_1, respectively. FIG. 9(c) shows difference information 920 for the image data 400_1, calculated from the aggregation result information 710 and the aggregation result information 910. The example of FIG. 9(c) shows that ΔP_{0_1}11 was calculated as the difference (absolute value) for the image region specified by x-coordinate = 0 to X_1 and y-coordinate = 0 to Y_1.

Here, ΔP_{0_1}11 is obtained by calculating the difference (absolute value) between the number of face candidate data items totaled for that image region in the aggregation result information 710 (P_{0}11) and the number totaled for the same image region in the aggregation result information 910 (P_{1}11).

In the same way, the change count time-series data generation unit 306 calculates the difference (absolute value) for each image region to obtain ΔP_{0_1}11 to ΔP_{0_1}34, as shown in FIG. 9(c), and stores them in the analysis result storage unit 132 as the difference information 920. The change count time-series data generation unit 306 also calculates the change count (D_1) of the image data 400_1 by summing ΔP_{0_1}11 to ΔP_{0_1}34, and stores it in the analysis result storage unit 132 as the difference information 920.
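The change count calculation described above can be sketched as follows. The dictionary representation of the per-region totals, the function name `change_count`, and the sample values are assumptions for illustration only.

```python
def change_count(prev_counts, curr_counts):
    """Return the per-region absolute differences and their sum.

    Both arguments map a region key to the number of face candidates
    totaled for that region; a missing region is treated as zero.
    """
    regions = set(prev_counts) | set(curr_counts)
    diffs = {r: abs(curr_counts.get(r, 0) - prev_counts.get(r, 0))
             for r in regions}
    return diffs, sum(diffs.values())


# Illustrative totals for two consecutive frames (regions keyed by (column, row)).
frame0 = {(1, 1): 3, (1, 2): 2, (2, 4): 1}
frame1 = {(1, 1): 2, (1, 2): 2, (3, 3): 1}
diffs, d1 = change_count(frame0, frame1)
```

Here `diffs` plays the role of ΔP_{0_1}11 to ΔP_{0_1}34 and `d1` plays the role of D_1.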

FIG. 10 is a second diagram showing a specific example of the processing by the change count time-series data generation unit. As described above, the change count time-series data generation unit 306 associates the change counts (D_1 to D_n) calculated for each item of image data (excluding the image data at time t_0) with time information.

FIG. 10(a) shows the change counts D_1 to D_n, calculated for the image data 400_1 to 400_n, associated with time information. FIG. 10(b) is a graph showing the time-series data of the change count. In FIG. 10(b), the horizontal axis represents the elapsed time from time t_1, and the vertical axis represents the change count at each elapsed time.

The change count time-series data generation unit 306 stores the generated time-series data in the analysis result storage unit 132.

<Flow of image processing>
Next, the flow of image processing by the image processing unit 131 of the image processing device 130 will be described. FIGS. 11 and 12 are the first and second flowcharts of the image processing performed by the image processing device. The image processing shown in FIGS. 11 and 12 is executed when an administrator of the image processing device 130 starts the image processing program and specifies the moving image data to be processed.

In step S1101, the moving image data acquisition unit 301 acquires the specified moving image data from the moving image data storage device 120.

In step S1102, the face candidate detection unit 302 sets a counter i, which counts the frame-by-frame image data contained in the moving image data, to "0".

In step S1103, the face candidate detection unit 302 reads the i-th image data from the moving image data.

In step S1104, the face candidate detection unit 302 detects face candidate data in the i-th image data and stores the information extracted for the detected face candidate data in the analysis result storage unit 132 as detection result information. It also stores the number of detected face candidate data items (the face candidate detection count) in the analysis result storage unit 132 as detection result information.

In step S1105, the first narrowing-down unit 303 sets a counter j, which counts the face candidate data detected in the i-th image data, to "1".

In step S1106, the first narrowing-down unit 303 determines whether the size (width and height) of the j-th face candidate data satisfies the predetermined condition. If it determines that the condition is satisfied (Yes in step S1106), the process proceeds to step S1108.

If, on the other hand, it determines in step S1106 that the condition is not satisfied (No in step S1106), the process proceeds to step S1107. In step S1107, the first narrowing-down unit 303 deletes the information extracted for the j-th face candidate data from the detection result information and updates the face candidate detection count.

In step S1108, the first narrowing-down unit 303 determines whether all face candidate data stored in the detection result information have been checked. If it determines that some face candidate data have not yet been checked (No in step S1108), the process proceeds to step S1109.

In step S1109, the first narrowing-down unit 303 increments the counter j, and the process returns to step S1106.

If, on the other hand, it determines in step S1108 that all face candidate data have been checked (Yes in step S1108), the process proceeds to step S1201 of FIG. 12.

In step S1201, the second narrowing-down unit 304 classifies the face candidate data contained in the detection result information into the predetermined image regions based on position, and totals the number of face candidate data items in each image region. The second narrowing-down unit 304 also stores the totals for the i-th image data in the analysis result storage unit 132 as aggregation result information.

In step S1202, the second narrowing-down unit 304 identifies, in the aggregation result information, the image regions in which the number of face candidate data items is at or below the predetermined number.

In step S1203, the second narrowing-down unit 304 subtracts the number of face candidate data items contained in those image regions from the face candidate detection count to calculate the face detection count of the i-th image data. The second narrowing-down unit 304 also stores the calculated face detection count in the analysis result storage unit 132 as aggregation result information for the i-th image data.

In step S1204, the change count time-series data generation unit 306 reads the aggregation result information for the (i-1)-th image data from the analysis result storage unit 132.

In step S1205, the change count time-series data generation unit 306 calculates the change count of the i-th image data using the aggregation result information for the (i-1)-th image data and the aggregation result information for the i-th image data. It also stores the calculated change count in the analysis result storage unit 132 as difference information for the i-th image data.

In step S1206, the face candidate detection unit 302 determines whether all image data contained in the moving image data have been processed. If it determines that some image data have not yet been processed (No in step S1206), the process proceeds to step S1207. In step S1207, the face candidate detection unit 302 increments the counter i, and the process returns to step S1103 of FIG. 11.

If, on the other hand, it determines in step S1206 that all image data have been processed (Yes in step S1206), the process proceeds to step S1208.

In step S1208, the detection count time-series data generation unit 305 associates the face detection count calculated for each item of image data with the time information of that image data to generate time-series data of the face detection count. It also stores the generated time-series data of the face detection count in the analysis result storage unit 132.

In step S1209, the change count time-series data generation unit 306 associates the change count calculated for each item of image data with the time information of that image data to generate time-series data of the change count. It also stores the generated time-series data of the change count in the analysis result storage unit 132.
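The flow of FIGS. 11 and 12 can be condensed into a single loop, sketched below. This is not the implementation of the embodiment: `analyze` and its parameters are hypothetical, `detect` stands in for whatever detection method the face candidate detection unit 302 uses and is assumed to return (x, y, w, h) boxes, frames are assumed to arrive as (time, image) pairs, the change count is assumed to be computed from the per-region totals before sparse regions are subtracted, and no change count is produced for the first frame because it has no predecessor.

```python
from bisect import bisect_right
from collections import Counter


def analyze(frames, detect, w_limits, h_limits, x_edges, y_edges, threshold):
    """Return (detection_series, change_series) for a sequence of frames."""
    detection_series, change_series = [], []
    prev_counts = None
    for time, image in frames:                      # S1103, S1206, S1207
        boxes = detect(image)                       # S1104
        kept = [b for b in boxes                    # S1105 to S1109
                if w_limits[0] <= b[2] <= w_limits[1]
                and h_limits[0] <= b[3] <= h_limits[1]]
        counts = Counter()                          # S1201
        for x, y, _w, _h in kept:
            col = min(bisect_right(x_edges, x), len(x_edges) - 1)
            row = min(bisect_right(y_edges, y), len(y_edges) - 1)
            counts[(col, row)] += 1
        sparse = sum(n for n in counts.values() if n <= threshold)  # S1202
        detection_series.append((time, len(kept) - sparse))         # S1203, S1208
        if prev_counts is not None:                 # S1204, S1205
            regions = set(prev_counts) | set(counts)
            changed = sum(abs(counts[r] - prev_counts[r]) for r in regions)
            change_series.append((time, changed))   # S1209
        prev_counts = counts
    return detection_series, change_series


# Illustrative run with a stub detector keyed by frame name.
stub = {
    "a": [(10, 10, 20, 20), (20, 20, 20, 20), (150, 350, 20, 20), (50, 50, 200, 200)],
    "b": [(10, 10, 20, 20), (30, 150, 20, 20), (40, 160, 20, 20)],
}
detections, changes = analyze(
    frames=[(0, "a"), (300, "b")],
    detect=lambda image: stub[image],
    w_limits=(10, 50), h_limits=(10, 50),
    x_edges=[0, 100, 200, 300], y_edges=[0, 100, 200, 300, 400],
    threshold=1,
)
```

Passing the detector as a callable keeps the sketch independent of any particular face detection library.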

As is clear from the above description, the image processing device 130 of the first embodiment calculates the face detection count and the change count in order to analyze the state of the persons contained in an image. In analyzing the state of a person, the image processing device 130 of the first embodiment thus detects the person's face but does not go as far as calculating detailed facial information (for example, the position of the eyebrows or the contour of the face). For this reason, the image processing system does not need a high-performance imaging device, and the number of imaging devices installed can be reduced.

In other words, a simple image processing system suffices, and the state of the persons contained in an image can be analyzed efficiently.

[Second Embodiment]
The first embodiment described the case where the face detection count and the change count calculated for each item of image data are associated with time information, but the data associated with time information is not limited to the face detection count and the change count.

For example, factors that affect the state of the students may also be associated with time information. Doing so makes it possible to analyze which factors affect the state of the students.

FIG. 13 is a diagram showing other examples of time-series data of the face detection count. Of these, FIG. 13(a) shows the lecture content associated with time information in addition to the face detection count. With FIG. 13(a), it is possible to analyze whether there is a correlation between the state of the students and the lecture content. If there is a correlation, it is possible to identify which lecture content raises the students' concentration.

FIG. 13(b) shows the size of the characters written on the board by the lecturer associated with time information in addition to the face detection count. With FIG. 13(b), it is possible to analyze whether there is a correlation between the state of the students and the size of the characters written on the board. If there is a correlation, it is possible to identify an appropriate character size that raises the students' concentration.

FIG. 13(c) shows the standing position of the lecturer associated with time information in addition to the face detection count. With FIG. 13(c), it is possible to analyze whether there is a correlation between the state of the students and the standing position of the lecturer. If there is a correlation, it is possible to identify an appropriate standing position that raises the students' concentration.
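One simple way to compare the face detection count across values of such a factor is to group the time-series data by the factor in effect at each time, as sketched below. This is only a starting point for the kind of analysis described and not a method stated in the embodiment: the function name `mean_count_by_factor`, the `factor_at` lookup, the labels, and the sample values are all hypothetical.

```python
from collections import defaultdict
from statistics import mean


def mean_count_by_factor(series, factor_at):
    """Average the face detection count for each value of a time-varying factor.

    series is a list of (time, face_detection_count) pairs, and factor_at
    maps a time to the factor value in effect (for example, lecture content).
    """
    groups = defaultdict(list)
    for time, count in series:
        groups[factor_at(time)].append(count)
    return {label: mean(values) for label, values in groups.items()}


# Illustrative values: two lecture segments with hypothetical labels.
series = [(0, 20), (300, 22), (600, 10), (900, 12)]
averages = mean_count_by_factor(
    series, lambda t: "introduction" if t < 600 else "exercise")
```

A clear gap between the group averages would suggest that the factor is worth examining further with a proper statistical test.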

[Other Embodiments]
The above embodiments did not specify how the face candidate detection unit 302 detects face candidate data; any detection method can be used.

The above embodiments described the first narrowing-down unit 303 as holding a single set of upper and lower limits for the width (w) and for the height (h). However, the limits held by the first narrowing-down unit 303 are not restricted to a single set. For example, a plurality of sets of upper and lower limits may be held according to the distance from the installation position of the imaging device 110 (that is, the position in the image data), and a different set may be read out for the determination depending on where the face candidate data is located.
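A position-dependent size check of this kind can be sketched as follows. The band layout is an assumption made for illustration: the image is split into horizontal bands by y-coordinate, on the premise that faces farther from the imaging device appear higher in the image and smaller. The names `size_limits_for` and `passes_size_check` and all numeric values are hypothetical.

```python
def size_limits_for(y, bands):
    """Return the (width limits, height limits) for the band containing y.

    bands is a list of (y_upper, (w_lo, w_hi), (h_lo, h_hi)) sorted by y_upper.
    Positions beyond the last band fall back to the last band's limits.
    """
    for y_upper, w_limits, h_limits in bands:
        if y < y_upper:
            return w_limits, h_limits
    return bands[-1][1], bands[-1][2]


def passes_size_check(box, bands):
    """Check an (x, y, w, h) box against the limits for its position."""
    _x, y, w, h = box
    (w_lo, w_hi), (h_lo, h_hi) = size_limits_for(y, bands)
    return w_lo <= w <= w_hi and h_lo <= h <= h_hi


# Illustrative bands: small faces expected near the top, larger ones below.
bands = [
    (200, (10, 30), (10, 30)),
    (400, (25, 80), (25, 80)),
]
far_small = passes_size_check((50, 50, 20, 20), bands)
near_small = passes_size_check((50, 300, 20, 20), bands)
near_large = passes_size_check((50, 300, 40, 40), bands)
```

With these values a 20 x 20 candidate is accepted in the upper band but rejected in the lower band, where a face that small would be implausible.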

The above embodiments also described the first narrowing-down unit 303 as holding an upper limit and a lower limit for each of the width (w) and the height (h). However, the limits held by the first narrowing-down unit 303 are not restricted to these. For example, it may hold an upper limit and a lower limit for the area given by width (w) × height (h).

The above embodiments described the time-series data of the face detection count and of the change count as being generated by association with the time information of each item of image data. In other words, the time-series data were described as being generated so that the plot period of the face detection count and the change count equals the frame period of the image data (for example, 300 msec).

However, the time-series data need not be generated so that the plot period of the face detection count and the change count equals the frame period of the image data; the plot period may be longer than the frame period (for example, 1 second). The plot period may also be changed according to, for example, the lecture format (whether the lecture centers on board writing or on spoken explanation).
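Lengthening the plot period can be sketched as a simple binning step. The embodiment does not say how the values within one plot period are combined; averaging them is an assumption of this sketch, as are the function name `resample`, the millisecond timestamps, and the sample values.

```python
def resample(series_ms, period_ms):
    """Average the values that fall within each plot period.

    series_ms is a list of (elapsed_ms, value) pairs; the result has one
    (period_start_ms, mean_value) pair per non-empty plot period.
    """
    bins = {}
    for t, value in series_ms:
        bins.setdefault(t // period_ms, []).append(value)
    return [(k * period_ms, sum(v) / len(v)) for k, v in sorted(bins.items())]


# Illustrative values: frames every 300 ms, plotted once per second.
frame_series = [(0, 10), (300, 12), (600, 14), (900, 16), (1200, 20), (1500, 22)]
plotted = resample(frame_series, period_ms=1000)
```

Taking the maximum or the last value in each period would be equally easy to substitute if that better suits the lecture format being analyzed.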

The above embodiments described the image regions used by the second narrowing-down unit 304 to classify face candidate data as being partitioned into a predetermined shape and size. However, the image regions may instead be partitioned into shapes and sizes that suit, for example, the structure of the space in which the lecture is held.

Note that if the image regions are too large, the number of students who changed from a face-up state to a face-down state and the number who changed from a face-down state to a face-up state cancel each other out within a region, and the accuracy of the change count decreases. The image regions therefore need to be partitioned into an appropriate size.
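The cancellation effect can be seen in a small worked example. The sketch below assumes per-region totals stored in dictionaries keyed by (column, row); `change_count`, `merge_regions`, and the sample values are hypothetical.

```python
def change_count(prev_counts, curr_counts):
    """Sum of per-region absolute differences between two frames."""
    regions = set(prev_counts) | set(curr_counts)
    return sum(abs(curr_counts.get(r, 0) - prev_counts.get(r, 0))
               for r in regions)


def merge_regions(counts, group_of):
    """Combine fine regions into coarser ones using a grouping function."""
    merged = {}
    for region, n in counts.items():
        key = group_of(region)
        merged[key] = merged.get(key, 0) + n
    return merged


# Two faces disappear from region (1, 1) while two appear in region (1, 2).
prev_fine = {(1, 1): 3, (1, 2): 2}
curr_fine = {(1, 1): 1, (1, 2): 4}
fine_change = change_count(prev_fine, curr_fine)

# Merging each column into one large region hides the change entirely.
by_column = lambda region: region[0]
coarse_change = change_count(merge_regions(prev_fine, by_column),
                             merge_regions(curr_fine, by_column))
```

With the fine regions the change count is 4, whereas with one large region per column the two movements offset each other and the change count drops to 0.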

The above embodiments described the second narrowing-down unit 304 as calculating the change count by summing the differences (absolute values) calculated for each image region, with the change count time-series data being generated from that change count. However, the change count calculated by the second narrowing-down unit 304 is not limited to this.

For example, the second narrowing-down unit 304 may calculate the change count by summing only the positive-signed values among the differences calculated for each image region, and generate the change count time-series data from it. This yields time-series data of the number of students who changed from a face-down state to a face-up state.

Alternatively, the second narrowing-down unit 304 may calculate the change count by summing only the negative-signed values among the differences calculated for each image region, and generate the change count time-series data from it. This yields time-series data of the number of students who changed from a face-up state to a face-down state.
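The two signed variants can be computed in a single pass, as sketched below. The dictionary representation, the function name `signed_change_counts`, the choice to report the negative total as a positive magnitude, and the sample values are assumptions for illustration.

```python
def signed_change_counts(prev_counts, curr_counts):
    """Return (raised, lowered) totals from the signed per-region differences.

    raised sums the positive differences (faces newly detected), and
    lowered sums the magnitudes of the negative differences (faces no
    longer detected).
    """
    raised = lowered = 0
    for region in set(prev_counts) | set(curr_counts):
        diff = curr_counts.get(region, 0) - prev_counts.get(region, 0)
        if diff > 0:
            raised += diff
        elif diff < 0:
            lowered += -diff
    return raised, lowered


# Illustrative totals for two consecutive frames.
raised, lowered = signed_change_counts(
    {(1, 1): 3, (1, 2): 2, (2, 1): 4},
    {(1, 1): 1, (1, 2): 5, (2, 1): 4},
)
```

The sum of the two totals equals the absolute-value change count, so the signed variants split that figure into its two directions.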

The above embodiments described the second narrowing-down unit 304 as performing its processing after the first narrowing-down unit 303, but the order of processing of the first narrowing-down unit 303 and the second narrowing-down unit 304 may be reversed.

The above embodiments described the image processing device 130 as being connected to the moving image data storage device 120 via the network 140. However, when a portable terminal is used as the image processing device 130, it may be connected directly to the moving image data storage device 120. Alternatively, a removable recording medium may be mounted in the imaging device 110 to store the moving image data, and the image processing device 130 may acquire the moving image data by having that recording medium mounted in it.

The above embodiments described the case where the image processing system 100 includes one set of the imaging device 110 and the moving image data storage device 120, but the image processing system 100 may include a plurality of such sets.

In the disclosed technology, forms such as the appendices described below are conceivable.
(Appendix 1)
An image processing program causing a computer to execute processing comprising:
extracting, from each of a plurality of images obtained by capturing the same shooting range at different timings, a position where a face exists;
specifying an image area for each of the plurality of images based on the extracted position where the face exists; and
outputting a result according to the presence or absence of a face in the specified image area, in association with information indicating the timing at which each of the plurality of images was captured.
(Appendix 2)
The image processing program according to Appendix 1, wherein, when the result according to the presence or absence of the face is output, the number of faces detected in the specified image area is output.
(Appendix 3)
The image processing program according to Appendix 1, wherein, when the result according to the presence or absence of the face is output, the number of cases in which the presence or absence of a face in the specified image area differs from the presence or absence of a face in that image area of the image one frame earlier is output.
(Appendix 4)
The image processing program according to Appendix 3, wherein, when the result according to the presence or absence of the face is output, the number of cases in which, in the specified image area, no face was detected in the image one frame earlier and a face is detected in the image of the frame being processed is output.
(Appendix 5)
The image processing program according to Appendix 3, wherein, when the result according to the presence or absence of the face is output, the number of cases in which, in the specified image area, a face was detected in the image one frame earlier and no face is detected in the image of the frame being processed is output.
(Appendix 6)
An image processing method in which a computer executes processing comprising:
extracting, from each of a plurality of images obtained by capturing the same shooting range at different timings, a position where a face exists;
specifying an image area for each of the plurality of images based on the extracted position where the face exists; and
outputting a result according to the presence or absence of a face in the specified image area, in association with information indicating the timing at which each of the plurality of images was captured.
(Appendix 7)
An image processing device comprising:
an extraction unit that extracts, from each of a plurality of images obtained by capturing the same shooting range at different timings, a position where a face exists;
a specifying unit that specifies an image area for each of the plurality of images based on the extracted position where the face exists; and
an output unit that outputs a result according to the presence or absence of a face in the specified image area, in association with information indicating the timing at which each of the plurality of images was captured.
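The counting described in Appendices 1 to 5 can be illustrated with a minimal Python sketch. It is not the implementation of the embodiments: the `Region` rectangle, the `(timestamp, face positions)` frame format, and the function name `summarize_presence` are all hypothetical, and the image areas are assumed to have been specified already from the extracted face positions. For each capture timing, the sketch reports how many image areas contain a face, how many changed relative to the preceding frame, and how many faces newly appeared or disappeared.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Sequence, Tuple

Point = Tuple[float, float]        # (x, y) of an extracted face position
Frame = Tuple[float, List[Point]]  # (capture timestamp, face positions)


@dataclass(frozen=True)
class Region:
    """Axis-aligned image area (hypothetical representation)."""
    x: float
    y: float
    width: float
    height: float

    def contains(self, point: Point) -> bool:
        px, py = point
        return (self.x <= px < self.x + self.width
                and self.y <= py < self.y + self.height)


def summarize_presence(frames: Sequence[Frame],
                       regions: Sequence[Region]) -> List[Dict[str, float]]:
    """Return one row per frame, keyed by the frame's capture timestamp."""
    rows: List[Dict[str, float]] = []
    previous: Optional[List[bool]] = None
    for timestamp, faces in frames:
        # Presence or absence of a face in each specified image area.
        present = [any(region.contains(p) for p in faces) for region in regions]
        if previous is None:
            # The first frame has no preceding frame to compare against.
            appeared = disappeared = 0
        else:
            pairs = list(zip(previous, present))
            appeared = sum(1 for was, now in pairs if now and not was)
            disappeared = sum(1 for was, now in pairs if was and not now)
        rows.append({
            "timestamp": timestamp,
            "detected": sum(present),            # areas containing a face
            "changed": appeared + disappeared,   # areas whose state flipped
            "appeared": appeared,                # absent before, present now
            "disappeared": disappeared,          # present before, absent now
        })
        previous = present
    return rows
```

In this sketch, "detected" counts image areas that contain at least one face, which is one possible reading of the number output in Appendix 2. The first frame reports zero changes because no earlier frame exists; a different convention could be chosen.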

The present invention is not limited to the configurations shown here, such as the configurations described in the above embodiments or combinations of those configurations with other elements. These points may be changed without departing from the spirit of the present invention and can be determined appropriately according to the form of application.

100: Image processing system
110: Imaging device
120: Moving image data storage device
130: Image processing device
131: Image processing unit
132: Analysis result storage unit
301: Moving image data acquisition unit
302: Face candidate detection unit
303: First narrowing-down unit
304: Second narrowing-down unit
305: Detection count time-series data generation unit
306: Change count time-series data generation unit
307: Display control unit
500, 600: Detection result information
700, 710, 910: Aggregation result information
920: Difference information

Claims

An image processing program causing a computer to execute processing comprising:
extracting, from each of a plurality of images obtained by capturing, at different timings, the same shooting range in which a lecture is held, a position where a lecturer's face exists;
specifying an image area for each of the plurality of images based on the extracted position where the face exists; and
associating a result according to the presence or absence of a face in the specified image area with information indicating the timing at which each of the plurality of images was captured, and outputting the result at a plot cycle according to the format of the lecture.

The image processing program according to claim 1, wherein, when the result according to the presence or absence of the face is output, the number of faces detected in the specified image area is output.

The image processing program according to claim 1, wherein, when the result according to the presence or absence of the face is output, the number of cases in which the presence or absence of a face in the specified image area differs from the presence or absence of a face in that image area of the image one plot cycle earlier is output.

The image processing program according to claim 3, wherein, when the result according to the presence or absence of the face is output, the number of cases in which, in the specified image area, no face was detected in the image one plot cycle earlier and a face is detected in the image of the frame being processed is output.

The image processing program according to claim 3, wherein, when the result according to the presence or absence of the face is output, the number of cases in which, in the specified image area, a face was detected in the image one plot cycle earlier and no face is detected in the image of the frame being processed is output.

An image processing method in which a computer executes processing comprising:
extracting, from each of a plurality of images obtained by capturing, at different timings, the same shooting range in which a lecture is held, a position where a lecturer's face exists;
specifying an image area for each of the plurality of images based on the extracted position where the face exists; and
associating a result according to the presence or absence of a face in the specified image area with information indicating the timing at which each of the plurality of images was captured, and outputting the result at a plot cycle according to the format of the lecture.

An image processing device comprising:
an extraction unit that extracts, from each of a plurality of images obtained by capturing, at different timings, the same shooting range in which a lecture is held, a position where a lecturer's face exists;
a specifying unit that specifies an image area for each of the plurality of images based on the extracted position where the face exists; and
an output unit that associates a result according to the presence or absence of a face in the specified image area with information indicating the timing at which each of the plurality of images was captured, and outputs the result at a plot cycle according to the format of the lecture.
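As an illustration only, output "at a plot cycle according to the format of the lecture" can be read as choosing a sampling interval per lecture format and keeping one frame per interval, so that each kept frame is compared with the frame kept one plot cycle earlier. The sketch below assumes this reading. The format names, the interval values in `PLOT_CYCLE_SECONDS`, and the function `select_plot_frames` are hypothetical and do not come from the claims.

```python
from typing import Dict, List, Sequence

# Hypothetical mapping from lecture format to plot cycle, in seconds.
PLOT_CYCLE_SECONDS: Dict[str, float] = {
    "classroom": 60.0,
    "group_work": 10.0,
}


def select_plot_frames(timestamps: Sequence[float],
                       lecture_format: str) -> List[int]:
    """Return indices of frames to output, one per plot cycle.

    The first frame is always kept. After that, a frame is kept once at
    least one plot cycle has elapsed since the previously kept frame.
    Timestamps are assumed to be in ascending order.
    """
    cycle = PLOT_CYCLE_SECONDS[lecture_format]
    selected: List[int] = []
    next_due = None
    for index, timestamp in enumerate(timestamps):
        if next_due is None or timestamp >= next_due:
            selected.append(index)
            next_due = timestamp + cycle
    return selected
```

Under these assumptions, frames captured every 5 seconds would be plotted every second frame for the hypothetical "group_work" format, and consecutive selected frames would serve as the "processing target" and "one plot cycle earlier" images.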