JP7840022B2

JP7840022B2 - Information processing equipment, methods, programs and systems

Info

Publication number: JP7840022B2
Application number: JP2024083816A
Authority: JP
Inventors: 大丹下; 奨貴大倉; 元也小林; 要介菅原; 俊北原
Original assignee: SHIFT Inc
Current assignee: SHIFT Inc
Priority date: 2024-05-23
Filing date: 2024-05-23
Publication date: 2026-04-03
Anticipated expiration: 2044-05-23
Also published as: JP2025178127A; JP2025177207A

Description

本開示は、情報処理装置、方法、プログラム及びシステムに関する。 This disclosure relates to information processing devices, methods, programs, and systems.

意味のある会議が実施できたか否かの指標として、会議が盛り上がったこと、又は会議に活気があったことは重要な要素である。近年、会議の盛り上がり度合いを測定する技術が提案されている（例えば、特許文献１）。 A key indicator of whether a meeting was meaningful is whether it was engaging and lively. In recent years, technologies for measuring the level of engagement in a meeting have been proposed (for example, Patent Document 1).

特開２０１６―０１２２１６号公報Japanese Patent Publication No. 2016-012216

特許文献１では、例えば、音声及び／又は映像を含んで記録された会議データより、会議の盛り上がり度を算出する会議分析装置について記載されている。当該会議分析装置では、会議データより会議の進行に沿った時系列としての特徴量を抽出し、当該特徴量に基づいて、絶対評価値として、会議の進行に沿った時系列としての盛り上がり度を算出する。 Patent Document 1 describes a meeting analysis device that calculates the level of engagement in a meeting from meeting data recorded, including audio and/or video. This meeting analysis device extracts feature quantities as a time series along the progress of the meeting from the meeting data, and calculates the level of engagement as an absolute evaluation value, based on these feature quantities, along with the time series along the progress of the meeting.

しかしながら、特許文献１では、会議データを会議分析装置に送信するようにしており、会議で話された情報が会議分析装置で記憶される。このため、会議のみで留めておきたい情報が外部に漏れる可能性がゼロではない。 However, Patent Document 1 describes a system where meeting data is transmitted to a meeting analysis device, and the information discussed in the meeting is stored in the analysis device. Therefore, there is a non-zero possibility that information intended to remain within the meeting could be leaked externally.

本開示の目的は、セキュリティを保持しつつ、所定の空間における盛り上がりを定量化することである。 The purpose of this disclosure is to quantify the level of excitement in a given space while maintaining security.

所定の空間に設置され、空間で生じる音声を収集する手段と、空間に設置され、音声を収集する手段により収集された音声の種類を分類し、分類された種類と音量とに基づき、盛り上がりに係る状態及び当該状態の度合いを決定する手段と、状態及び度合いを集計し、空間の盛り上がりの時間推移を算出する手段と、空間の盛り上がりの時間推移に基づく情報をユーザに提示する手段と、を備えるシステムである。 This system comprises: means installed in a designated space to collect sounds generated in that space; means installed in the space to classify the types of sounds collected by the sound-collecting means, and to determine the state and degree of excitement based on the classified type and volume; means to aggregate the states and degrees and calculate the time progression of excitement in the space; and means to present information based on the time progression of excitement in the space to the user.

本開示によれば、セキュリティを保持しつつ、所定の空間における盛り上がりを定量化できる。 According to this disclosure, it is possible to quantify the level of excitement in a given space while maintaining security.

システム１の全体構成を示すブロック図である。The overall configuration of System 1 is shown in the block diagram. 端末装置１０の機能的な構成例を示すブロック図である。This is a block diagram showing an example of the functional configuration of the terminal device 10. サーバ２０の機能的な構成例を示すブロック図である。This is a block diagram showing a functional configuration example of server 20. 解析装置３０の機能的な構成例を示すブロック図である。This is a block diagram showing an example of the functional configuration of the analysis device 30. テーブルのデータ構造を示す図である。This diagram shows the data structure of a table. テーブルのデータ構造を示す図である。This diagram shows the data structure of a table. テーブルのデータ構造を示す図である。This diagram shows the data structure of a table. テーブルのデータ構造を示す図である。This diagram shows the data structure of a table. システム１における処理の流れの一例を示す図である。This figure shows an example of the processing flow in System 1. システム１における処理の流れの一例を示す図である。This figure shows an example of the processing flow in System 1. 本開示の画面例を示す図である。This figure shows an example of the screen in this disclosure. 本開示の画面例を示す図である。This figure shows an example of the screen in this disclosure. 本開示の画面例を示す図である。This figure shows an example of the screen in this disclosure. 本開示の画面例を示す図である。This figure shows an example of the screen in this disclosure. コンピュータ９０の基本的なハードウェア構成を表すブロック図である。This is a block diagram showing the basic hardware configuration of computer 90.

以下、本開示の実施形態について図面を参照して説明する。実施形態を説明する全図において、共通の構成要素には同一の符号を付し、繰り返しの説明を省略する。なお、以下の実施形態は、特許請求の範囲に記載された本開示の内容を不当に限定するものではない。また、実施形態に示される構成要素のすべてが、本開示の必須の構成要素であるとは限らない。また、各図は模式図であり、必ずしも厳密に図示されたものではない。 The embodiments of this disclosure will be described below with reference to the drawings. In all drawings illustrating the embodiments, common components are denoted by the same reference numerals, and repeated descriptions are omitted. The following embodiments are not intended to unduly limit the scope of the claims of this disclosure. Furthermore, not all components shown in the embodiments are necessarily essential components of this disclosure. Also, each drawing is a schematic diagram and not necessarily a strictly accurate representation.

また、以下の説明において、「プロセッサ」は、１以上のプロセッサである。少なくとも１つのプロセッサは、典型的には、ＣＰＵ（Central Processing Unit）のようなマイクロプロセッサであるが、ＧＰＵ（Graphics Processing Unit）のような他種のプロセッサでもよい。少なくとも１つのプロセッサは、シングルコアでもよいしマルチコアでもよい。 Furthermore, in the following explanation, "processor" refers to one or more processors. At least one processor is typically a microprocessor such as a CPU (Central Processing Unit), but may be another type of processor such as a GPU (Graphics Processing Unit). At least one processor may be single-core or multi-core.

また、少なくとも１つのプロセッサは、処理の一部又は全部を行うハードウェア回路（例えばＦＰＧＡ（Field-Programmable Gate Array）又はＡＳＩＣ（Application Specific Integrated Circuit））といった広義のプロセッサでもよい。 Furthermore, at least one processor may be a broad-sense processor, such as a hardware circuit that performs some or all of the processing (e.g., an FPGA (Field-Programmable Gate Array) or an ASIC (Application Specific Integrated Circuit)).

また、以下の説明において、「ｘｘｘテーブル」といった表現により、入力に対して出力が得られる情報を説明することがあるが、この情報は、どのような構造のデータでもよいし、入力に対する出力を発生するニューラルネットワークのような学習モデルでもよい。従って、「ｘｘｘテーブル」を「ｘｘｘ情報」と言うことができる。 Furthermore, in the following explanation, the term "xxx table" may be used to describe information that yields an output for a given input. This information can be data of any structure, or it can be a learning model such as a neural network that generates an output for a given input. Therefore, "xxx table" can be referred to as "xxx information."

また、以下の説明において、各テーブルの構成は一例であり、１つのテーブルは、２以上のテーブルに分割されてもよいし、２以上のテーブルの全部又は一部が１つのテーブルであってもよい。 Furthermore, in the following explanation, the structure of each table is merely an example; one table may be divided into two or more tables, or all or part of two or more tables may constitute a single table.

また、以下の説明において、「プログラム」を主語として処理を説明する場合があるが、プログラムは、プロセッサによって実行されることで、定められた処理を、適宜に記憶部及び／又はインタフェース部などを用いながら行うため、処理の主語が、プロセッサ（或いは、そのプロセッサを有するコントローラのようなデバイス）とされてもよい。 Furthermore, in the following explanation, while "program" may be used as the subject to describe processing, since a program, when executed by a processor, performs defined processing using the memory and/or interface units as appropriate, the subject of the processing could also be the processor (or a device such as a controller containing that processor).

プログラムは、計算機のような装置にインストールされてもよいし、例えば、プログラム配布サーバ又は計算機が読み取り可能な（例えば非一時的な）記録媒体にあってもよい。また、以下の説明において、２以上のプログラムが１つのプログラムとして実現されてもよいし、１つのプログラムが２以上のプログラムとして実現されてもよい。 The program may be installed on a device such as a computer, or it may reside on a program distribution server or a computer-readable (e.g., non-temporary) recording medium. Furthermore, in the following description, two or more programs may be implemented as a single program, or one program may be implemented as two or more programs.

また、以下の説明において、種々の対象の識別情報として、識別番号が使用されるが、識別番号以外の種類の識別情報（例えば、英字や符号を含んだ識別子）が採用されてもよい。 Furthermore, while identification numbers are used as identification information for various objects in the following explanation, other types of identification information (for example, identifiers containing letters or symbols) may also be used.

また、以下の説明において、同種の要素を区別しないで説明する場合には、参照符号（又は、参照符号のうちの共通符号）を使用し、同種の要素を区別して説明する場合は、要素の識別番号（又は参照符号）を使用することがある。 Furthermore, in the following explanation, when describing similar elements without distinction, a reference code (or a common code among reference codes) may be used. When describing similar elements with distinction, the element's identification number (or reference code) may be used.

また、以下の説明において、制御線や情報線は、説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。全ての構成が相互に接続されていてもよい。 Furthermore, in the following explanation, only control lines and information lines deemed necessary for the explanation are shown; not all control lines and information lines are necessarily shown in the actual product. All components may be interconnected.

各情報処理装置は演算装置と記憶装置とを備えたコンピュータにより構成されている。コンピュータの基本ハードウェア構成および、当該ハードウェア構成により実現されるコンピュータの基本機能構成は後述する。端末装置１０、サーバ２０のそれぞれについて、後述するコンピュータの基本ハードウェア構成およびコンピュータの基本機能構成と重複する説明は省略する。 Each information processing device consists of a computer equipped with an arithmetic unit and a memory device. The basic hardware configuration of the computer and the basic functional configuration of the computer realized by this hardware configuration will be described later. For each of the terminal device 10 and server 20, explanations that overlap with the basic hardware configuration and basic functional configuration of the computer described later will be omitted.

＜概要＞
本実施形態に係るシステムは、エッジＡＩで音声の種類を判別し、判別した種類と音量とに基づき、盛り上がりに係る数値を決定する。決定した数値を用い、空間の盛り上がりを推定する。 <Overview>
The system according to this embodiment uses edge AI to determine the type of sound, and based on the determined type and volume, determines a numerical value related to the level of excitement. Using the determined numerical value, it estimates the level of excitement in the space.

以下で、会議室の盛り上がりを推定する場合を例に説明する。但し、本実施形態が実施される空間は、会議室という閉じられた空間に限られず、オフィス内の開かれた空間であってもよい。オフィス内の開かれた空間は、例えば、ブース、サロン、食堂等のオフィス内のエリアであってもよい。以下で、オフィス内の開かれた空間をエリアと呼ぶ。 The following explanation uses the example of estimating the level of engagement in a conference room. However, the space in which this embodiment is implemented is not limited to a closed space such as a conference room; it may also be an open space within an office. An open space within an office may be, for example, a booth, a lounge, or a cafeteria. Hereafter, an open space within an office will be referred to as an "area."

＜１システム全体の構成図＞
図１は、システム１の全体構成の例を示すブロック図である。図１に示すシステム１は、例えば、端末装置１０、サーバ２０、解析装置３０、マイク３１、撮影装置３２、及びスピーカー３３を含む。端末装置１０、サーバ２０、解析装置３０、及びスピーカー３３は、例えば、ネットワーク４０を介して通信接続する。 <1. System Configuration Diagram>
Figure 1 is a block diagram showing an example of the overall configuration of System 1. System 1 shown in Figure 1 includes, for example, a terminal device 10, a server 20, an analysis device 30, a microphone 31, a camera 32, and a speaker 33. The terminal device 10, the server 20, the analysis device 30, and the speaker 33 are connected via, for example, a network 40.

図１において、システム１が端末装置１０を２台含む例を示しているが、システム１に含まれる端末装置１０の数は、２台に限定されない。システム１に含まれる端末装置１０は、３台未満であってもよいし、３台以上であってもよい。 Figure 1 shows an example where System 1 includes two terminal devices 10, but the number of terminal devices 10 included in System 1 is not limited to two. System 1 may include fewer than three terminal devices 10, or three or more.

本実施形態において、複数の装置の集合体を１つのサーバとしてもよい。１つ又は複数のハードウェアに対して本実施形態に係るサーバ２０を実現することに要する複数の機能の配分の仕方は、各ハードウェアの処理能力及び／又はサーバ２０に求められる仕様等に鑑みて適宜決定することができる。 In this embodiment, a collection of multiple devices may be treated as a single server. The method of allocating the multiple functions required to implement the server 20 according to this embodiment to one or more hardware components can be appropriately determined in consideration of the processing capabilities of each hardware component and/or the specifications required for the server 20.

図１に示す端末装置１０は、例えば、空間の盛り上がりを把握するユーザが使用する情報処理装置である。端末装置１０は、例えば、スマートフォン、タブレット等の携帯端末でもよいし、据え置き型のＰＣ（Personal Computer）、ラップトップＰＣであってもよい。また、ＨＭＤ（Head Mount Display）、腕時計型端末等のウェアラブル端末であってもよい。端末装置１０は、例えば、個人によって所有され携帯されてもよいし、従業員間で共有され会議室の壁に固定されてもよい。 The terminal device 10 shown in Figure 1 is, for example, an information processing device used by a user to grasp the elevation changes in a space. The terminal device 10 may be a mobile device such as a smartphone or tablet, or a stationary PC (Personal Computer) or laptop PC. It may also be a wearable device such as an HMD (Head-Mounted Display) or a smartwatch. The terminal device 10 may, for example, be owned and carried by an individual, or it may be shared among employees and fixed to the wall of a conference room.

端末装置１０は、通信ＩＦ（Interface）１２と、入力装置１３と、出力装置１４と、メモリ１５と、ストレージ１６と、プロセッサ１９とを備える。 The terminal device 10 comprises a communication interface (IF) 12, an input device 13, an output device 14, a memory 15, storage 16, and a processor 19.

通信ＩＦ１２は、端末装置１０が、例えば、サーバ２０等のシステム１内の装置と通信するため、信号を入出力するためのインタフェースである。 The communication interface 12 is an interface for inputting and outputting signals so that the terminal device 10 can communicate with a device within system 1, such as a server 20.

入力装置１３は、ユーザからの入力操作を受け付けるための装置（例えば、タッチパネル、タッチパッド、マウス等のポインティングデバイス、キーボード等）である。 The input device 13 is a device for receiving input operations from the user (for example, a touch panel, touchpad, pointing device such as a mouse, keyboard, etc.).

出力装置１４は、ユーザに対して情報を提示するための装置（ディスプレイ、スピーカー等）である。 The output device 14 is a device (display, speaker, etc.) for presenting information to the user.

メモリ１５は、プログラム、及びプログラム等で処理されるデータ等一時的に記憶するためのものであり、例えばＤＲＡＭ（Dynamic Random Access Memory）等の揮発性のメモリである。 Memory 15 is for temporarily storing programs and data processed by programs, etc., and is a volatile memory such as DRAM (Dynamic Random Access Memory).

ストレージ１６は、データを保存するためのものであり、例えばフラッシュメモリ、ＨＤＤ（Hard Disc Drive）である。 Storage 16 is for storing data and can be, for example, flash memory or an HDD (Hard Disk Drive).

プロセッサ１９は、プログラムに記述された命令セットを実行するためのハードウェアであり、演算装置、レジスタ、周辺回路等により構成される。 The processor 19 is hardware for executing the instruction set described in the program, and consists of an arithmetic unit, registers, peripheral circuits, etc.

サーバ２０は、例えば、解析装置３０から取得した情報に基づいて、盛り上がりを分析する情報処理装置である。サーバ２０は、例えば、ネットワーク４０に接続されたコンピュータにより実現される。図１に示すように、サーバ２０は、通信ＩＦ２２と、入出力ＩＦ２３と、メモリ２５と、ストレージ２６と、プロセッサ２９とを備える。 Server 20 is an information processing device that analyzes the rise based on information acquired from, for example, the analysis device 30. Server 20 is implemented, for example, by a computer connected to the network 40. As shown in Figure 1, Server 20 includes a communication interface 22, an input/output interface 23, a memory 25, a storage device 26, and a processor 29.

通信ＩＦ２２は、サーバ２０が、例えば、端末装置１０等のシステム１内の装置と通信するため、信号を入出力するためのインタフェースである。 The communication interface 22 is an interface for inputting and outputting signals so that the server 20 can communicate with devices within system 1, such as the terminal device 10.

入出力ＩＦ２３は、ユーザからの入力操作を受け付けるための入力装置、及び、ユーザに対して情報を提示するための出力装置とのインタフェースとして機能する。 The input/output interface 23 functions as an interface to an input device for receiving user input operations and an output device for presenting information to the user.

メモリ２５は、プログラム、及び、プログラム等で処理されるデータ等を一時的に記憶するためのものであり、例えばＤＲＡＭ等の揮発性のメモリである。 Memory 25 is for temporarily storing programs and data processed by programs, etc., and is a volatile memory such as DRAM.

ストレージ２６は、データを保存するためのものであり、例えばフラッシュメモリ、ＨＤＤである。 Storage 26 is for storing data and may be, for example, flash memory or an HDD.

プロセッサ２９は、プログラムに記述された命令セットを実行するためのハードウェアであり、演算装置、レジスタ、周辺回路等により構成される。 The processor 29 is hardware for executing the instruction set described in the program, and consists of an arithmetic unit, registers, peripheral circuits, etc.

解析装置３０は、例えば、盛り上がりの観点で音声および画像を解析する情報処理装置である。 The analysis device 30 is an information processing device that analyzes audio and images, for example, from the perspective of excitement.

１台の解析装置３０が、１の空間をカバーしてもよいし、複数の空間をカバーしてもよい。具体的には、１台の解析装置３０が、１つの会議室で得られた音声および画像を解析してもよいし、複数の会議室で得られた音声および画像を解析してもよい。解析装置３０は、情報解析が可能な場所に設置される。例えば、解析装置３０は、会議室の隅に置かれる。また、具体的には、１台の解析装置３０が、１のエリアで得られた音声および画像を解析してもよいし、複数のエリアで得られた音声および画像を解析してもよい。解析装置３０は、情報解析が可能な場所に設置される。例えば、解析装置３０は、エリアの隅に置かれる。 A single analysis device 30 may cover one space, or it may cover multiple spaces. Specifically, a single analysis device 30 may analyze audio and images obtained in one conference room, or it may analyze audio and images obtained in multiple conference rooms. The analysis device 30 is installed in a location where information analysis is possible. For example, the analysis device 30 is placed in a corner of the conference room. Furthermore, specifically, a single analysis device 30 may analyze audio and images obtained in one area, or it may analyze audio and images obtained in multiple areas. The analysis device 30 is installed in a location where information analysis is possible. For example, the analysis device 30 is placed in a corner of the area.

解析装置３０は、マイク３１、及び撮影装置３２から送信される情報を受信し、受信した情報を解析し、解析された情報をサーバ２０に送信する。また、解析装置３０は、サーバ２０から取得した情報をマイク３１、及び撮影装置３２へ送信する。サーバ２０から取得する情報には、例えば、マイク３１、又は撮影装置３２の設定を更新するための情報等が含まれる。 The analysis device 30 receives information transmitted from the microphone 31 and the imaging device 32, analyzes the received information, and transmits the analyzed information to the server 20. The analysis device 30 also transmits information acquired from the server 20 to the microphone 31 and the imaging device 32. The information acquired from the server 20 includes, for example, information for updating the settings of the microphone 31 or the imaging device 32.

マイク３１は、空間内の音声を収集できるように設置されている。例えば、マイク３１は、会議室全体の音声を収集できるように設置されている。具体的には、マイク３１は、例えば、会議室中央の机上に設置されている円盤型集音マイクである。また、例えば、マイク３１は、エリア全体の音声を収集できるように設置されている。具体的には、マイク３１は、例えば、エリア中央の天井に設置されている円盤型集音マイクである。マイク３１は、取得した音声信号を解析装置３０へ出力する。マイク３１は、１台で空間全体の音声を捉えきれない場合には、複数台設置される。このとき、マイク３１は、例えば、指向性が予め設定されているマイクであってもよい。 Microphone 31 is installed to collect sound within the space. For example, microphone 31 is installed to collect sound from the entire conference room. Specifically, microphone 31 is, for example, a disc-shaped sound-collecting microphone installed on a table in the center of the conference room. Alternatively, microphone 31 is installed to collect sound from the entire area. Specifically, microphone 31 is, for example, a disc-shaped sound-collecting microphone installed on the ceiling in the center of the area. Microphone 31 outputs the acquired audio signal to the analysis device 30. If one microphone 31 cannot capture the sound from the entire space, multiple microphones 31 are installed. In this case, microphone 31 may be, for example, a microphone with pre-set directivity.

撮影装置３２は、空間内の状況を撮影できるように設置されている。例えば、撮影装置３２は、会議室全体の状況を撮影できるように設置されている。具体的には、撮影装置３２は、例えば、会議室中央の机上に設置されている全方位カメラである。また、例えば、撮影装置３２は、エリア全体の状況を撮影できるように設置されている。具体的には、撮影装置３２は、例えば、エリア中央の天井に設置されている全方位カメラである。撮影装置３２は、取得した画像信号を解析装置３０へ出力する。撮影装置３２は、１台で空間全体の状況を捉えきれない場合には、複数台設置される。このとき、撮影装置３２は、例えば、撮影方向が予め設定されているカメラであってもよい。 The imaging device 32 is installed to capture the situation within the space. For example, the imaging device 32 is installed to capture the situation of the entire conference room. Specifically, the imaging device 32 is, for example, an omnidirectional camera installed on a table in the center of the conference room. Alternatively, the imaging device 32 is installed to capture the situation of the entire area. Specifically, the imaging device 32 is, for example, an omnidirectional camera installed on the ceiling in the center of the area. The imaging device 32 outputs the acquired image signal to the analysis device 30. If one imaging device 32 cannot capture the situation of the entire space, multiple devices are installed. In this case, the imaging device 32 may be, for example, a camera with a pre-set shooting direction.

スピーカー３３は、空間内に設置される、空間内で曲を流すためのスピーカーである。例えば、スピーカー３３は、会議室全体に音楽を届けるように設置されている。具体的には、スピーカー３３は、例えば、会議室中央の机上に設置されている全方位スピーカーである。また、例えば、スピーカー３３は、エリア全体に音楽を届けるように設置されている。具体的には、スピーカー３３は、例えば、エリア中央の天井に設置されている全方位スピーカーである。スピーカー３３は、サーバ２０から与えられる音声信号を音楽に変換して曲を空間内へ出力する。スピーカー３３は、１台で曲を空間全体に音楽を届けられない場合には、複数台設置される。 Speaker 33 is a speaker installed within a space to play music within that space. For example, speaker 33 is installed to deliver music throughout the entire conference room. Specifically, speaker 33 is, for example, an omnidirectional speaker installed on a table in the center of the conference room. Alternatively, speaker 33 is installed to deliver music to an entire area. Specifically, speaker 33 is, for example, an omnidirectional speaker installed on the ceiling in the center of the area. Speaker 33 converts the audio signal provided by server 20 into music and outputs the music into the space. If one speaker 33 is insufficient to deliver music to the entire space, multiple speakers 33 are installed.

＜２端末装置の機能的な構成＞
図２は、図１に示す端末装置１０の機能的な構成例を示すブロック図である。図２に示す端末装置１０は、携帯端末、ＰＣ、又はウェアラブル端末により実現される。図２に示すように、端末装置１０は、通信部１２０と、入力装置１３と、出力装置１４と、音声処理部１７と、マイク１７１と、スピーカー１７２と、カメラ１６１と、位置情報センサ１５０と、記憶部１８０と、制御部１９０とを備える。端末装置１０に含まれる各ブロックは、例えば、バス等により電気的に接続される。 <2. Functional Configuration of Terminal Devices>
Figure 2 is a block diagram showing a functional configuration example of the terminal device 10 shown in Figure 1. The terminal device 10 shown in Figure 2 can be implemented as a mobile terminal, a PC, or a wearable terminal. As shown in Figure 2, the terminal device 10 includes a communication unit 120, an input device 13, an output device 14, an audio processing unit 17, a microphone 171, a speaker 172, a camera 161, a location information sensor 150, a storage unit 180, and a control unit 190. Each block included in the terminal device 10 is electrically connected, for example, by a bus.

通信部１２０は、端末装置１０が他の装置と通信するための変復調処理等の処理を行う。通信部１２０は、制御部１９０で生成された信号に送信処理を施し、外部（例えば、サーバ２０）へ送信する。通信部１２０は、外部から受信した信号に受信処理を施し、制御部１９０へ出力する。 The communication unit 120 performs modulation and demodulation processing, etc., for the terminal device 10 to communicate with other devices. The communication unit 120 performs transmission processing on the signal generated by the control unit 190 and transmits it to an external source (e.g., server 20). The communication unit 120 performs reception processing on the signal received from the external source and outputs it to the control unit 190.

入力装置１３は、端末装置１０を操作するユーザが指示、又は情報を入力するための装置である。入力装置１３は、例えば、操作面へ触れることで指示が入力されるタッチ・センシティブ・デバイス１３１等により実現される。端末装置１０がＰＣ等である場合には、入力装置１３は、リーダー、キーボード、マウス等により実現されてもよい。入力装置１３は、ユーザから入力される指示を電気信号へ変換し、電気信号を制御部１９０へ出力する。なお、入力装置１３には、例えば、外部の入力機器から入力される電気信号を受け付ける受信ポートが含まれてもよい。 The input device 13 is a device for a user operating the terminal device 10 to input instructions or information. The input device 13 can be implemented, for example, by a touch-sensitive device 131 that inputs instructions by touching its operating surface. If the terminal device 10 is a PC, the input device 13 may be implemented by a reader, keyboard, mouse, etc. The input device 13 converts the instructions input by the user into electrical signals and outputs the electrical signals to the control unit 190. The input device 13 may also include, for example, a receiving port that accepts electrical signals input from an external input device.

出力装置１４は、端末装置１０を操作するユーザへ情報を提示するための装置である。出力装置１４は、例えば、ディスプレイ１４１等により実現される。ディスプレイ１４１は、制御部１９０の制御に応じたデータを表示する。ディスプレイ１４１は、例えば、ＬＣＤ（Liquid Crystal Display）、又は有機ＥＬ（Electro-Luminescence）ディスプレイ等によって実現される。 The output device 14 is a device for presenting information to the user operating the terminal device 10. The output device 14 is implemented, for example, by a display 141. The display 141 displays data according to the control of the control unit 190. The display 141 is implemented, for example, by an LCD (Liquid Crystal Display) or an organic EL (Electro-Luminescence) display.

音声処理部１７は、例えば、音声信号のデジタル－アナログ変換処理を行う。音声処理部１７は、マイク１７１から与えられる信号をデジタル信号に変換して、変換後の信号を制御部１９０へ与える。また、音声処理部１７は、音声信号をスピーカー１７２へ与える。音声処理部１７は、例えば音声処理用のプロセッサによって実現される。マイク１７１は、音声入力を受け付けて、当該音声入力に対応する音声信号を音声処理部１７へ与える。スピーカー１７２は、音声処理部１７から与えられる音声信号を音声に変換して当該音声を端末装置１０の外部へ出力する。 The audio processing unit 17 performs, for example, digital-to-analog conversion of the audio signal. The audio processing unit 17 converts the signal received from the microphone 171 into a digital signal and provides the converted signal to the control unit 190. The audio processing unit 17 also provides the audio signal to the speaker 172. The audio processing unit 17 is implemented, for example, by an audio processing processor. The microphone 171 receives audio input and provides the corresponding audio signal to the audio processing unit 17. The speaker 172 converts the audio signal received from the audio processing unit 17 into audio and outputs the audio to the outside of the terminal device 10.

カメラ１６１は、受光素子により光を受光し、画像信号として出力するためのデバイスである。 The camera 161 is a device that receives light using a photodetector and outputs it as an image signal.

位置情報センサ１５０は、端末装置１０の位置を検出するセンサであり、例えばＧＰＳ（Global Positioning System）モジュールである。ＧＰＳモジュールは、衛星測位システムで用いられる受信装置である。衛星測位システムでは、少なくとも３個または４個の衛星からの信号を受信し、受信した信号に基づいて、ＧＰＳモジュールが搭載される端末装置１０の現在位置を検出する。位置情報センサ１５０は、端末装置１０が接続する無線基地局の位置から、端末装置１０の現在の位置を検出してもよい。 The location information sensor 150 is a sensor that detects the position of the terminal device 10, and is, for example, a GPS (Global Positioning System) module. A GPS module is a receiving device used in a satellite positioning system. In a satellite positioning system, signals are received from at least three or four satellites, and based on the received signals, the current position of the terminal device 10, which is equipped with a GPS module, is detected. The location information sensor 150 may also detect the current position of the terminal device 10 from the position of the wireless base station to which the terminal device 10 is connected.

記憶部１８０は、例えば、メモリ１５、及びストレージ１６等により実現され、端末装置１０が使用するデータ、及びプログラムを記憶する。 The storage unit 180 is implemented, for example, by memory 15 and storage 16, and stores data and programs used by the terminal device 10.

制御部１９０は、プロセッサ１９が記憶部１８０に記憶されるプログラムを読み込み、プログラムに含まれる命令を実行することにより実現される。制御部１９０は、端末装置１０の動作を制御する。制御部１９０は、プログラムに従って動作することにより、操作受付部１９１と、送受信部１９２と、提示制御部１９３と、としての機能を発揮する。 The control unit 190 is realized when the processor 19 reads the program stored in the memory unit 180 and executes the instructions contained in the program. The control unit 190 controls the operation of the terminal device 10. By operating according to the program, the control unit 190 functions as an operation reception unit 191, a transmission/reception unit 192, and a presentation control unit 193.

操作受付部１９１は、入力装置１３から入力される指示、又は情報を受け付けるための処理を行う。具体的には、例えば、操作受付部１９１は、タッチ・センシティブ・デバイス１３１等から入力される指示に基づく情報を受け付ける。タッチ・センシティブ・デバイス１３１等を介して入力される指示は、例えば、ユーザからの編集指示である。 The operation reception unit 191 processes instructions or information input from the input device 13. Specifically, for example, the operation reception unit 191 receives information based on instructions input from a touch-sensitive device 131, etc. Instructions input via the touch-sensitive device 131, etc., are, for example, editing instructions from the user.

また、操作受付部１９１は、マイク１７１から入力される音声指示を受け付ける。具体的には、例えば、操作受付部１９１は、マイク１７１から入力され、音声処理部１７でデジタル信号に変換された音声信号を受信する。操作受付部１９１は、例えば、受信した音声信号を分析して所定の名詞を抽出することで、ユーザからの指示を取得する。 Furthermore, the operation reception unit 191 receives voice instructions input from the microphone 171. Specifically, for example, the operation reception unit 191 receives voice signals input from the microphone 171 and converted into digital signals by the voice processing unit 17. The operation reception unit 191 obtains instructions from the user by, for example, analyzing the received voice signals and extracting predetermined nouns.

送受信部１９２は、端末装置１０が、サーバ２０等の外部の装置と、通信プロトコルに従ってデータを送受信するための処理を行う。具体的には、例えば、送受信部１９２は、ユーザから入力された編集指示をサーバ２０へ送信する。また、送受信部１９２は、ユーザに関する情報を、サーバ２０から受信する。 The transmitting/receiving unit 192 performs processing to enable the terminal device 10 to send and receive data with external devices such as the server 20 according to a communication protocol. Specifically, for example, the transmitting/receiving unit 192 transmits editing instructions entered by the user to the server 20. The transmitting/receiving unit 192 also receives information about the user from the server 20.

提示制御部１９３は、サーバ２０から提供された情報をユーザに対して提示するため、出力装置１４を制御する。具体的には、例えば、提示制御部１９３は、サーバ２０から送信される情報をディスプレイ１４１に表示させる。また、提示制御部１９３は、サーバ２０から送信される情報をスピーカー１７２から出力させる。 The display control unit 193 controls the output device 14 to present information provided by the server 20 to the user. Specifically, for example, the display control unit 193 displays the information transmitted from the server 20 on the display 141. The display control unit 193 also outputs the information transmitted from the server 20 through the speaker 172.

＜３サーバの機能的な構成＞
図３は、サーバ２０の機能的な構成例を示すブロック図である。図３に示すように、サーバ２０は、通信部２０１と、記憶部２０２と、制御部２０３としての機能を発揮する。 <3. Functional Configuration of the Server>
Figure 3 is a block diagram showing an example of the functional configuration of server 20. As shown in Figure 3, server 20 functions as a communication unit 201, a storage unit 202, and a control unit 203.

通信部２０１は、サーバ２０が外部の装置と通信するための処理を行う。 The communication unit 201 performs processing to enable the server 20 to communicate with external devices.

記憶部２０２は、例えば、音声解析ログデータベース（ＤＢ）２０２１、画像解析ログデータベース（ＤＢ）２０２２等を有する。記憶部２０２で記憶されるテーブルは、これらに限定されない。 The storage unit 202 includes, for example, a voice analysis log database (DB) 2021, an image analysis log database (DB) 2022, etc. The tables stored in the storage unit 202 are not limited to these.

音声解析ログＤＢ２０２１は、各解析装置３０の音声解析ログＤＢ３０２１の内容を集積する。詳細は後述する。 The voice analysis log DB 2021 aggregates the contents of the voice analysis log DB 3021 from each analysis device 30. Details will be described later.

画像解析ログＤＢ２０２２は、各解析装置３０の画像解析ログＤＢ３０２２の内容を集積する。説明は後述する。 The image analysis log DB 2022 aggregates the contents of the image analysis log DB 3022 of each analysis device 30. The explanation will be given later.

制御部２０３は、プロセッサ２９が記憶部２０２に記憶されるプログラムを読み込み、プログラムに含まれる命令を実行することにより実現される。制御部２０３は、プログラムに従って動作することにより、受信制御モジュール２０３１、送信制御モジュール２０３２、分析制御モジュール２０３３、及び提示制御モジュール２０３４として示す機能を発揮する。 The control unit 203 is implemented when the processor 29 reads the program stored in the memory unit 202 and executes the instructions contained in the program. By operating according to the program, the control unit 203 performs the functions indicated as the receive control module 2031, the transmit control module 2032, the analysis control module 2033, and the presentation control module 2034.

受信制御モジュール２０３１は、サーバ２０が外部の装置から通信プロトコルに従って信号を受信する処理を制御する。 The receive control module 2031 controls the process by which the server 20 receives signals from an external device according to a communication protocol.

送信制御モジュール２０３２は、サーバ２０が外部の装置に対し通信プロトコルに従って信号を送信する処理を制御する。 The transmission control module 2032 controls the process by which the server 20 transmits signals to external devices according to a communication protocol.

分析制御モジュール２０３３は、解析装置３０から受信した情報を分析する処理を制御する。具体的には、分析制御モジュール２０３３は、例えば、空間の盛り上がりに関する情報を分析する処理を制御する。 The analysis control module 2033 controls the process of analyzing the information received from the analysis device 30. Specifically, the analysis control module 2033 controls the process of analyzing information related to spatial elevation, for example.

提示制御モジュール２０３４は、ユーザへ情報を提示する処理を制御する。 The presentation control module 2034 controls the process of presenting information to the user.

＜４解析装置の機能的な構成＞
図４は、解析装置３０の機能的な構成例を示すブロック図である。図４に示すように、解析装置３０は、通信部３０１と、記憶部３０２と、制御部３０３としての機能を発揮する。 <4. Functional Configuration of the Analysis Device>
Figure 4 is a block diagram showing an example of the functional configuration of the analysis device 30. As shown in Figure 4, the analysis device 30 functions as a communication unit 301, a storage unit 302, and a control unit 303.

通信部３０１は、解析装置３０が外部の装置と通信するための処理を行う。 The communication unit 301 performs processing to enable the analysis device 30 to communicate with an external device.

記憶部３０２は、例えば、音声解析ログデータベース（ＤＢ）３０２１、画像解析ログデータベース（ＤＢ）３０２２、第一学習済みモデル３０２３、第二学習済みモデル３０２４等を有する。記憶部３０２で記憶されるテーブルは、これらに限定されない。 The memory unit 302 includes, for example, a speech analysis log database (DB) 3021, an image analysis log database (DB) 3022, a first trained model 3023, a second trained model 3024, and the like. The tables stored in the memory unit 302 are not limited to these.

音声解析ログＤＢ３０２１は、マイク３１によって収集された空間内の音声についての情報を保持するためのデータベースである。詳細は後述する。 The audio analysis log DB 3021 is a database for storing information about the audio in the space collected by the microphone 31. Further details will be described later.

画像解析ログＤＢ３０２２は、撮影装置３２によって撮影された空間内の状況の画像についての情報を保持するためのデータベースである。詳細は後述する。 The image analysis log DB 3022 is a database for storing information about images of the spatial conditions captured by the imaging device 32. Further details will be described later.

第一学習済みモデル３０２３は、音声が入力されると、入力された音声に基づき、盛り上がりを評価するための情報を出力するように学習するモデルである。例えば、第一学習済みモデル３０２３は、音声が入力されると、空間内における音の種類、等を判別するように学習する。音声の種類は、例えば、会話声、歓声、笑い声、腹笑い、怒声、沈黙、叫び声、泣き声、うめき声、くしゃみ、鼻歌、タイピング、等である。 The first pre-trained model 3023 is a model that learns to output information for evaluating excitement based on the input audio. For example, the first pre-trained model 3023 learns to distinguish the type of sound in a given space when audio is input. Examples of sound types include conversation, cheers, laughter, snorting, shouting, silence, screams, crying, groans, sneezes, humming, typing, etc.

このとき、学習用データは、例えば、音声を入力データとし、この入力データに含まれる音の種類についての判断を正解出力データとする。 In this case, the training data would, for example, use audio as input data, and the correct output data would be the judgment regarding the type of sound contained in this input data.

第二学習済みモデル３０２４は、画像が入力されると、入力された画像に基づき、盛り上がりを評価するための情報を出力するように学習するモデルである。例えば、第二学習済みモデル３０２４は、画像が入力されると、空間内における人間の数、表情、等を判別するように学習する。表情の種類は、例えば、笑顔、真顔、泣き顔、等である。また、表情の種類は、例えば、うなずき、首振り、眉ひそめ、下を見る、等の仕草を含む。 The second pre-trained model 3024 is a model that, upon receiving an image as input, learns to output information for evaluating the level of excitement based on the input image. For example, the second pre-trained model 3024 learns to determine the number of people in a space, their facial expressions, etc., upon receiving an image as input. Examples of facial expressions include smiles, neutral faces, crying faces, etc. Furthermore, facial expressions include gestures such as nodding, shaking heads, frowning, and looking down.

このとき、学習用データは、例えば、人間の画像を入力データとし、この入力データに含まれる表情についての判断を正解出力データとする。 In this case, the training data would, for example, use images of humans as input data, and the judgments made about the facial expressions contained in this input data would be used as the correct output data.

第一学習済みモデル３０２３および第二学習済みモデル３０２４は、モデル学習プログラムに従い、機械学習モデルに機械学習を行わせることで生成されるモデルである。第一学習済みモデル３０２３および第二学習済みモデル３０２４は、例えば、入力されるデータに基づき、所定の推論を実施する、複数の関数が合成されたパラメータ付き合成関数である。パラメータ付き合成関数は、複数の調整可能な関数及びパラメータの組合せにより定義される。本実施形態に係る学習済みモデルは、上記の要請を満たす如何なるパラメータ付き合成関数であってもよい。 The first pre-trained model 3023 and the second pre-trained model 3024 are models generated by having a machine learning model perform machine learning according to a model training program. The first pre-trained model 3023 and the second pre-trained model 3024 are, for example, parameterized composite functions composed of multiple functions that perform predetermined inference based on input data. A parameterized composite function is defined by a combination of multiple tunable functions and parameters. The pre-trained models in this embodiment may be any parameterized composite function that satisfies the above requirements.

例えば、第一学習済みモデル３０２３および第二学習済みモデル３０２４が順伝播型の多層化ネットワークを用いて生成される場合、パラメータ付き合成関数は、例えば、重み行列を用いた各層間の線形関係、各層における活性化関数を用いた非線形関係（又は線形関係）、及びバイアスの組み合わせとして定義される。重み付行列、及びバイアスは、多層化ネットワークのパラメータと呼ばれる。パラメータ付き合成関数は、パラメータをどのように選ぶかで、関数としての形を変える。多層化ネットワークでは、構成するパラメータを適切に設定することで、出力層から好ましい結果を出力することが可能な関数を定義することができる。 For example, when the first trained model 3023 and the second trained model 3024 are generated using a feedforward multilayer network, the parameterized composite function is defined as a combination of, for example, linear relationships between layers using weight matrices, nonlinear (or linear) relationships using activation functions in each layer, and biases. The weight matrices and biases are called the parameters of the multilayer network. The parameterized composite function changes its form depending on how the parameters are chosen. In a multilayer network, by appropriately setting the constituent parameters, it is possible to define a function that can output desirable results from the output layer.

本実施形態に係る多層化ネットワークとしては、例えば、深層学習（Deep Learning）の対象となる多層ニューラルネットワークである深層ニューラルネットワーク（Deep Neural Network：ＤＮＮ）が用いられ得る。ＤＮＮとしては、例えば、画像を対象とする畳み込みニューラルネットワーク（Convolution Neural Network：ＣＮＮ）を用いてもよい。 As the multilayer network according to this embodiment, for example, a deep neural network (DNN), which is a multilayer neural network targeted by deep learning, may be used. As a DNN, for example, a convolutional neural network (CNN) for image processing may also be used.

制御部３０３は、解析装置３０内のプロセッサが記憶部３０２に記憶されるプログラムを読み込み、プログラムに含まれる命令を実行することにより実現される。制御部３０３は、プログラムに従って動作することにより、受信制御モジュール３０３１、送信制御モジュール３０３２、分析制御モジュール２０３３、及び提示制御モジュール３０３４として示す機能を発揮する。 The control unit 303 is realized when the processor within the analysis device 30 reads the program stored in the memory unit 302 and executes the instructions contained in the program. By operating according to the program, the control unit 303 performs the functions indicated as the receiving control module 3031, the transmitting control module 3032, the analysis control module 2033, and the presentation control module 3034.

受信制御モジュール３０３１は、解析装置３０が外部の装置から通信プロトコルに従って信号を受信する処理を制御する。 The reception control module 3031 controls the process by which the analysis device 30 receives signals from an external device according to a communication protocol.

送信制御モジュール３０３２は、解析装置３０が外部の装置に対し通信プロトコルに従って信号を送信する処理を制御する。 The transmission control module 3032 controls the process by which the analysis device 30 transmits signals to an external device according to a communication protocol.

解析制御モジュール３０３３は、マイク３１及び撮影装置３２から受信した情報を解析する処理を制御する。具体的には、解析制御モジュール３０３３は、盛り上がりの観点から音声と画像を解析する。より具体的には、例えば、解析制御モジュール３０３３は、第一学習済みモデル３０２３を用いて音声を解析する。また、例えば、解析制御モジュール３０３３は、第二学習済みモデル３０２４を用いて画像を解析する。 The analysis control module 3033 controls the process of analyzing the information received from the microphone 31 and the imaging device 32. Specifically, the analysis control module 3033 analyzes the audio and images from the perspective of excitement. More specifically, for example, the analysis control module 3033 analyzes the audio using the first trained model 3023. Also, for example, the analysis control module 3033 analyzes the images using the second trained model 3024.

提示制御モジュール３０３４は、ユーザへ情報を提示する処理を制御する。 The presentation control module 3034 controls the process of presenting information to the user.

＜５データ構造＞
テーブルのデータ構造を説明する。なお、説明するデータ構造は一例であり、記載されていないデータを除外するものではない。また、同一のテーブルに記載されるデータであっても、離れた記憶領域に記憶されていることもあり得る。 <5. Data Structure>
This section describes the data structure of a table. Note that the data structure described is just one example, and does not exclude data not listed. Furthermore, even data within the same table may be stored in separate memory areas.

図５は、音声解析ログＤＢ３０２１のデータ構造を示す図である。図５が示すように、音声解析ログＤＢ３０２１のレコードの各々は、例えば、項目「場所」と、項目「マイクＩＤ」と、項目「時間」と、項目「音声」と、項目「音量」と、項目「状態」と、項目「値」と、を含む。第一学習済みモデル３０２３により音声が出力されると、音声解析ログＤＢ３０２１に新たなレコードが作成され、該当する項目に各情報が記憶される。各解析装置３０の音声解析ログＤＢ３０２１は、当該解析装置３０がカバーする空間に関する情報を記録する。 Figure 5 shows the data structure of the voice analysis log DB 3021. As shown in Figure 5, each record in the voice analysis log DB 3021 includes, for example, the items "Location," "Microphone ID," "Time," "Voice," "Volume," "Status," and "Value." When voice is output by the first trained model 3023, a new record is created in the voice analysis log DB 3021, and the information is stored in the corresponding items. The voice analysis log DB 3021 of each analysis device 30 records information about the space covered by that analysis device 30.

項目「場所」は、解析装置３０がカバーする空間を示す。 The item "Location" indicates the space covered by the analysis device 30.

項目「マイクＩＤ」は、空間に設置されたマイク３１を識別するための識別情報を示す。 The item "Microphone ID" indicates identification information for identifying the microphone 31 installed in the space.

項目「時間」は、マイク３１によって音声が収集された時間を示す。 The "Time" field indicates the time when audio was collected by microphone 31.

項目「音声」は、音声の種類を示す。項目「音声」は、例えば、第一学習済みモデル３０２３により検出された音声の種類を示す。 The item "Speech" indicates the type of speech. For example, the item "Speech" indicates the type of speech detected by the first trained model 3023.

項目「音量」は、項目「音声」の音量を示す。項目「音量」は、例えば、３段階で表現される。サーバ２０は、マイク３１により取得された音の強度が設定している閾値を超える場合、超えた閾値に対応した値を設定する。なお、項目「音量」は、３段階に限定されない。 The "Volume" item indicates the volume of the "Sound" item. The "Volume" item can be expressed in, for example, three levels. If the sound intensity acquired by the microphone 31 exceeds a set threshold, the server 20 sets a value corresponding to the exceeded threshold. Note that the "Volume" item is not limited to three levels.

項目「状態」は、盛り上がりに係る状態を示す。項目「状態」は、例えば、「盛り上がり」と、「盛り下がり」と、「ニュートラル」と、を含む。「盛り上がり」には、笑い声などポジティブな音声が分類される。「盛り下がり」には、怒声などネガティブな音声が分類される。「ニュートラル」は、「盛り上がり」でも「盛り下がり」でもない状態を意味する。盛り上がりに係る状態は３種類に限定されない。例えば、「盛り上がり」と、「盛り下がり」との２種類であってもよい。 The "State" category indicates the state related to excitement. This category includes, for example, "Excitement," "Downturn," and "Neutral." "Excitement" includes positive sounds such as laughter. "Downturn" includes negative sounds such as shouting. "Neutral" means a state that is neither "Excitement" nor "Downturn." The states related to excitement are not limited to these three types. For example, there could be two types: "Excitement" and "Downturn."

項目「値」は、盛り上がりに係る状態の度合いを示す。項目「値」は、例えば、４段階で表現される。なお、項目「値」は、４段階に限定されない。 The "Value" item indicates the degree of the state related to the excitement. The "Value" item may be expressed on a scale of, for example, four levels. However, the "Value" item is not limited to four levels.

図７は、音声解析ログＤＢ２０２１のデータ構造を示す図である。図７が示すように、音声解析ログＤＢ２０２１のレコードの各々は、例えば、項目「場所」と、項目「マイクＩＤ」と、項目「時間」と、項目「音声」と、項目「音量」と、項目「状態」と、項目「値」と、を含む。後述するように、音声解析ログＤＢ２０２１は、各解析装置３０の音声解析ログＤＢ３０２１の内容を集積する。 Figure 7 shows the data structure of the voice analysis log DB 2021. As shown in Figure 7, each record in the voice analysis log DB 2021 includes, for example, the fields "Location," "Microphone ID," "Time," "Voice," "Volume," "Status," and "Value." As will be described later, the voice analysis log DB 2021 aggregates the contents of the voice analysis log DB 3021 of each analysis device 30.

音声解析ログＤＢ２０２１の各項目は音声解析ログＤＢ３０２１の各項目と同一のものを示すため、音声解析ログＤＢ２０２１の各項目の説明を省略する。 Since each item in the Voice Analysis Log DB 2021 represents the same information as each item in the Voice Analysis Log DB 3021, the explanations for each item in the Voice Analysis Log DB 2021 are omitted.

図６は、画像解析ログＤＢ３０２２のデータ構造を示す図である。図６が示すように、画像解析ログＤＢ３０２２のレコードの各々は、例えば、項目「場所」と、項目「カメラＩＤ」と、項目「時間」と、項目「表情」と、項目「人数」と、項目「状態」と、項目「値」と、を含む。第二学習済みモデル３０２４により表情が出力されると、画像解析ログＤＢ３０２２に新たなレコードが作成され、該当する項目に各情報が記憶される。各解析装置３０の画像解析ログＤＢ３０２２は、当該解析装置３０がカバーする空間に関する情報を記録する。 Figure 6 shows the data structure of the image analysis log DB 3022. As shown in Figure 6, each record in the image analysis log DB 3022 includes, for example, the items "Location," "Camera ID," "Time," "Facial Expression," "Number of People," "State," and "Value." When a facial expression is output by the second trained model 3024, a new record is created in the image analysis log DB 3022, and the information is stored in the corresponding items. The image analysis log DB 3022 of each analysis device 30 records information about the space covered by that analysis device 30.

項目「カメラＩＤ」は、空間に設置された撮影装置３２を識別するための識別情報を示す。 The item "Camera ID" indicates identification information for identifying the imaging device 32 installed in the space.

項目「時間」は、撮影装置３２によって空間が撮影された時間を示す。 The item "Time" indicates the time during which the space was photographed by the imaging device 32.

項目「表情」は、表情の種類を示す。項目「表情」は、例えば、第二学習済みモデル３０２４により検出された表情の種類を示す。項目「表情」は、うなずき、等の仕草に係る情報を含んでもよい。 The item "Facial Expression" indicates the type of facial expression. For example, it indicates the type of facial expression detected by the second trained model 3024. The item "Facial Expression" may also include information related to gestures such as nodding.

項目「人数」は、表情をとった人数を示す。項目「人数」は、例えば、第二学習済みモデル３０２４により検出された表情をとる人数を示す。 The "Number of People" field indicates the number of people who made facial expressions. For example, the "Number of People" field may indicate the number of people who made facial expressions as detected by the second trained model 3024.

項目「状態」は、盛り上がりに係る状態を示す。項目「状態」は、例えば、「盛り上がり」と、「盛り下がり」と、「ニュートラル」と、を含む。「盛り上がり」には、笑顔などポジティブな表情が分類される。「盛り下がり」には、眉ひそめなどネガティブな表情が分類される。「ニュートラル」は、「盛り上がり」でも「盛り下がり」でもない状態を意味する。盛り上がりに係る状態は３種類に限定されない。例えば、「盛り上がり」と、「盛り下がり」との２種類であってもよい。 The "State" item indicates the state related to excitement. The "State" item includes, for example, "Excitement," "Downturn," and "Neutral." "Excitement" includes positive expressions such as smiles. "Downturn" includes negative expressions such as frowning. "Neutral" means a state that is neither "Excitement" nor "Downturn." The states related to excitement are not limited to these three types. For example, there could be two types: "Excitement" and "Downturn."

図８は、画像解析ログＤＢ２０２２のデータ構造を示す図である。図８が示すように、画像解析ログＤＢ２０２２のレコードの各々は、例えば、項目「場所」と、項目「カメラＩＤ」と、項目「時間」と、項目「表情」と、項目「人数」と、項目「状態」と、項目「値」と、を含む。後述するように、画像解析ログＤＢ２０２２は、各解析装置３０の画像解析ログＤＢ３０２２の内容を集積する。 Figure 8 shows the data structure of the image analysis log DB 2022. As shown in Figure 8, each record in the image analysis log DB 2022 includes, for example, the fields "Location," "Camera ID," "Time," "Expression," "Number of People," "Status," and "Value." As will be described later, the image analysis log DB 2022 aggregates the contents of the image analysis log DB 3022 of each analysis device 30.

画像解析ログＤＢ２０２２の各項目は画像解析ログＤＢ３０２２の各項目と同一のものを示すため、画像解析ログＤＢ２０２２の各項目の説明を省略する。 Since each item in Image Analysis Log DB 2022 represents the same information as each item in Image Analysis Log DB 3022, the explanations for each item in Image Analysis Log DB 2022 are omitted.

＜６動作＞
システム１における処理の流れの一例を説明する。 <6 Operation>
An example of the processing flow in System 1 is described below.

図９は、解析装置３０が音声と画像を解析する際の動作の例を表すフローチャートである。 Figure 9 is a flowchart illustrating an example of the operation of the analysis device 30 when analyzing audio and images.

ステップＳ７０１において、解析装置３０は、音声および画像を受信する。具体的には、例えば、受信制御モジュール３０３１は、マイク３１によって収集された空間内の音声データをマイク３１から、および、撮影装置３２によって撮影された空間の画像データを撮影装置３２から、受信する。 In step S701, the analysis device 30 receives audio and images. Specifically, for example, the receiving control module 3031 receives audio data of the space collected by the microphone 31 from the microphone 31, and image data of the space captured by the imaging device 32 from the imaging device 32.

ステップＳ７０２において、制御部３０３は、音声データ、および、画像データを解析する。 In step S702, the control unit 303 analyzes the audio data and image data.

具体的には、解析制御モジュール３０３３は、例えば、受信された音声データを、すなわち、マイク３１によって収集された音声データを第一学習済みモデル３０２３に、周期的に入力する。第一学習済みモデル３０２３は、入力された音声に基づき、盛り上がりを評価するための情報を出力する。具体的には、第一学習済みモデル３０２３は、例えば、音声が入力されると、音声の種類を出力する。具体的には、第一学習済みモデル３０２３は、例えば、第２会議室における２０２４年４月１日１０時３０分１０秒の音声から笑い声を出力する。 Specifically, the analysis and control module 3033 periodically inputs received audio data, i.e., audio data collected by the microphone 31, to the first trained model 3023. The first trained model 3023 outputs information for evaluating the level of excitement based on the input audio. Specifically, the first trained model 3023 outputs the type of audio when audio is input. Specifically, the first trained model 3023 outputs laughter from audio recorded in the second conference room at 10:30:10 AM on April 1, 2024.

なお、入力の周期は、例えば、時間に基づく。当該時間は、例えば、人が一つのアクションを取るのにかかるおおよその時間に基づいて設定され得る。具体的には、人が、笑い声を立てて笑う１回の動作等にかかる時間に基づいて設定され得る。例えば、解析制御モジュール３０３３は、２～５秒毎に、音声データを第一学習済みモデル３０２３に入力する。 The input cycle is based on time, for example. This time can be set based on, for instance, the approximate time it takes for a person to perform a single action. Specifically, it can be set based on the time it takes for a person to laugh out loud, for example. For instance, the analysis control module 3033 inputs audio data to the first trained model 3023 every 2 to 5 seconds.

また、具体的には、解析制御モジュール３０３３は、例えば、受信された画像データを、すなわち、撮影装置３２によって撮影された画像データを第二学習済みモデル３０２４に、周期的に入力する。第二学習済みモデル３０２４は、入力された画像に基づき、盛り上がりを評価するための情報を出力する。具体的には、第二学習済みモデル３０２４は、例えば、画像が入力されると、表情の種類を出力する。また、第二学習済みモデル３０２４は、例えば、画像が入力されると、その表情の種類をとった人数を出力する。例えば、第二学習済みモデル３０２４は、例えば、第２会議室における２０２４年４月１日１０時３０分１０秒の画像から笑顔の二人を出力する。 Specifically, the analysis control module 3033 periodically inputs received image data, i.e., image data captured by the imaging device 32, to the second trained model 3024. The second trained model 3024 outputs information for evaluating excitement based on the input images. Specifically, the second trained model 3024 outputs, for example, the type of facial expression when an image is input. It also outputs the number of people with that type of facial expression when an image is input. For example, the second trained model 3024 outputs two smiling faces from an image taken in the second conference room at 10:30:10 AM on April 1, 2024.

なお、入力の周期は、例えば、時間に基づく。当該時間は、例えば、人が一つのアクションを取るのにかかるおおよその時間に基づいて設定され得る。具体的には、人が、１回のうなずき動作等にかかる時間に基づいて設定され得る。例えば、解析制御モジュール３０３３は、２～５秒毎に、画像データを第二学習済みモデル３０２４に入力する。第二学習済みモデル３０２４に入力される画像データは、１つの静止画であってもよいし、複数フレームについての複数の静止画であってもよい。 The input cycle is based on time, for example. This time can be set based on, for instance, the approximate time it takes for a person to perform a single action. Specifically, it can be set based on the time it takes for a person to perform a single nodding motion, for example. For instance, the analysis control module 3033 inputs image data to the second trained model 3024 every 2 to 5 seconds. The image data input to the second trained model 3024 may be a single still image or multiple still images from multiple frames.

次いで、具体的には、解析制御モジュール３０３３は、出力された情報に基づき、空間の盛り上がりを評価する。より具体的には、解析制御モジュール３０３３は、音声の種類および音量に基づいて、盛り上がりに係る状態、および、その度合いを決定する。例えば、解析制御モジュール３０３３は、音量２の笑い声を、状態「盛り上がり」の値「３」と評価する。また、より具体的には、解析制御モジュール３０３３は、表情およびその表情をとった人数に基づいて、盛り上がりに係る状態、および、その度合いを決定する。例えば、解析制御モジュール３０３３は、二人の眉ひそめを、状態「盛り下がり」の値「２」と評価する。 Next, the analysis control module 3033 evaluates the level of excitement in the space based on the outputted information. More specifically, the analysis control module 3033 determines the state and degree of excitement based on the type and volume of the sound. For example, the analysis control module 3033 evaluates laughter at volume 2 as a state of "excitement" with a value of "3". Furthermore, more specifically, the analysis control module 3033 determines the state and degree of excitement based on facial expressions and the number of people making those expressions. For example, the analysis control module 3033 evaluates two people frowning as a state of "decline" with a value of "2".

本実施形態において、例えば、音声の種類と、盛り上がりに係る状態とが予め関連付けられている。具体的には、例えば、歓声、笑い声、腹笑い等のポジティブな音声の種類は、盛り上がりに係る状態が「盛り上がり」と対応付けられている。また、例えば、怒声、沈黙、叫び声、泣き声、うめき声、鼻歌等のネガティブな音声の種類は、盛り上がりに係る状態が「盛り下がり」と対応付けられている。また、例えば、会話声、くしゃみ、タイピング等の一般的に発生し得る音声の種類は、盛り上がりに係る状態が「ニュートラル」と対応付けられている。なお、音声の種類と、盛り上がりに係る状態との関係はこれに限定されない。また、例えば、音量と、盛り上がり（盛り下がり）の度合いとが予め関連付けられている。具体的には、例えば、音量１と、盛り上がり（盛り下がり）の度合い１とが対応付けられ、音量２と、盛り上がり（盛り下がり）の度合い２とが対応付けられ、音量３と、盛り上がり（盛り下がり）の度合い３とが対応付けられている。 In this embodiment, for example, the type of sound and the state related to excitement are pre-associated. Specifically, for example, positive sound types such as cheers, laughter, and belly laughs are associated with the state of excitement as "excitement." Similarly, negative sound types such as shouting, silence, screaming, crying, groaning, and humming are associated with the state of excitement as "decline." Furthermore, commonly occurring sound types such as conversation, sneezing, and typing are associated with the state of excitement as "neutral." Note that the relationship between the type of sound and the state of excitement is not limited to these examples. Also, for example, volume and the degree of excitement (decline) are pre-associated. Specifically, for example, volume 1 is associated with degree of excitement (decline) 1, volume 2 is associated with degree of excitement (decline) 2, and volume 3 is associated with degree of excitement (decline) 3.

本実施形態において、例えば、表情と、盛り上がりに係る状態とが予め関連付けられている。具体的には、例えば、笑顔、うなずき等のポジティブな表情は、盛り上がりに係る状態が「盛り上がり」と対応付けられている。また、例えば、泣き顔、眉ひそめ、首振り等のネガティブな表情は、盛り上がりに係る状態が「盛り下がり」と対応付けられている。また、例えば、真顔等の一般的に発生し得る表情は、盛り上がりに係る状態が「ニュートラル」と対応付けられている。なお、表情と、盛り上がりに係る状態との関係はこれに限定されない。また、例えば、人数と、盛り上がり（盛り下がり）の度合いとが予め関連付けられている。具体的には、例えば、人数１と、盛り上がり（盛り下がり）の度合い１とが対応付けられ、人数２と、盛り上がり（盛り下がり）の度合い２とが対応付けられ、人数３と、盛り上がり（盛り下がり）の度合い３とが対応付けられている。 In this embodiment, for example, facial expressions and states related to excitement are pre-associated. Specifically, for example, positive facial expressions such as smiles and nods are associated with a state of "excitement." Negative facial expressions such as crying, frowning, and head shaking are associated with a state of "decline." Furthermore, commonly occurring facial expressions such as a neutral expression are associated with a state of "neutral." Note that the relationship between facial expressions and states of excitement is not limited to these examples. Also, for example, the number of people and the degree of excitement (decline) are pre-associated. Specifically, for example, person 1 is associated with degree of excitement (decline) 1, person 2 is associated with degree of excitement (decline) 2, and person 3 is associated with degree of excitement (decline) 3.

ステップＳ７０３において、制御部３０３は、ステップＳ７０２の解析に係る情報を保存する。具体的には、例えば、解析制御モジュール３０３３は、音声解析ログＤＢ３０２１において、項目「マイクＩＤ」、項目「場所」、項目「時間」、項目「音声」、項目「音量」、項目「状態」、項目「値」に対応する情報を記録する。また、具体的には、例えば、解析制御モジュール３０３３は、画像解析ログＤＢ３０２２において、項目「カメラＩＤ」、項目「場所」、項目「時間」、項目「表情」、項目「人数」、項目「状態」、項目「値」に対応する情報を記録する。 In step S703, the control unit 303 stores the information related to the analysis in step S702. Specifically, for example, the analysis control module 3033 records information corresponding to the items "microphone ID," "location," "time," "voice," "volume," "state," and "value" in the audio analysis log DB 3021. Furthermore, specifically, for example, the analysis control module 3033 records information corresponding to the items "camera ID," "location," "time," "facial expression," "number of people," "state," and "value" in the image analysis log DB 3022.

ステップＳ７０４において、制御部３０３は、ステップＳ７０２の解析に係る情報を送信する。具体的には、例えば、送信制御モジュール３０３２は、ステップＳ７０３で記録された音声解析ログＤＢ３０２１の内容と画像解析ログＤＢ３０２２の内容を、サーバ２０に送信する。 In step S704, the control unit 303 transmits information related to the analysis in step S702. Specifically, for example, the transmission control module 3032 transmits the contents of the audio analysis log DB 3021 and the image analysis log DB 3022 recorded in step S703 to the server 20.

送信制御モジュール３０３２は、間断なく、ステップＳ７０３で記録された情報をサーバ２０に送信する。また、送信制御モジュール３０３２は、未送信データの容量に基づいた一定周期で送信してもよい。具体的には、送信制御モジュール３０３２は、例えば、未送信データの容量が所定容量に達するたびに、未送信データをサーバ２０に送信する。また、送信制御モジュール３０３２は、未送信データの時間に基づいた一定周期で送信してもよい。具体的には、送信制御モジュール３０３２は、例えば、最新の送信から所定時間が経過するたびに、サーバ２０へ送信する。 The transmission control module 3032 continuously transmits the information recorded in step S703 to the server 20. Alternatively, the transmission control module 3032 may transmit at regular intervals based on the volume of untransmitted data. Specifically, for example, the transmission control module 3032 transmits untransmitted data to the server 20 each time the volume of untransmitted data reaches a predetermined volume. Alternatively, the transmission control module 3032 may transmit at regular intervals based on the time elapsed since the last transmission. Specifically, for example, the transmission control module 3032 transmits to the server 20 each time a predetermined time has elapsed since the most recent transmission.

図１０は、サーバ２０が空間の盛り上がりを分析する際の動作の例を表すフローチャートである。 Figure 10 is a flowchart illustrating an example of how server 20 operates when analyzing spatial elevation.

ステップＳ８０１において、制御部２０３は、各解析装置３０から空間の盛り上がりに係る情報を受信する。具体的には、例えば、受信制御モジュール２０３１は、ステップＳ７０４で送信された各解析装置３０の音声解析ログＤＢ３０２１の内容と画像解析ログＤＢ３０２２の内容を受信する。 In step S801, the control unit 203 receives information related to spatial elevation from each analysis device 30. Specifically, for example, the receiving control module 2031 receives the contents of the audio analysis log DB 3021 and the image analysis log DB 3022 transmitted in step S704 from each analysis device 30.

制御部２０３は、例えば、音声解析ログＤＢ３０２１の内容を音声解析ログＤＢ２０２１に、および、画像解析ログＤＢ３０２２の内容を画像解析ログＤＢ２０２２に、記録する。換言すれば、音声解析ログＤＢ２０２１は音声解析ログＤＢ３０２１の内容を、および、画像解析ログＤＢ２０２２は画像解析ログＤＢ３０２２の内容を、集積する。 The control unit 203 records, for example, the contents of the voice analysis log DB 3021 in the voice analysis log DB 2021, and the contents of the image analysis log DB 3022 in the image analysis log DB 2022. In other words, the voice analysis log DB 2021 stores the contents of the voice analysis log DB 3021, and the image analysis log DB 2022 stores the contents of the image analysis log DB 3022.

ステップＳ８０２において、制御部２０３は、受信された、空間内における盛り上がりに係る情報を分析する。 In step S802, the control unit 203 analyzes the received information regarding the elevation within the space.

（ステップＳ８０２Ａ進行中の１の集まりを評価）
具体的には、例えば、分析制御モジュール２０３３は、空間内で進行中の集まりの盛り上がりを評価する。例えば、分析制御モジュール２０３３は、空間内における１の集まりの盛り上がりに係る情報に基づいて、音声の観点で、または、表情の観点で、１の集まりの盛り上がりを示す指標値を算出する。具体的には、「状態」が「盛り上がり」でその「値」が大きいほど指標値は大きくなり、および、「状態」が「盛り下がり」でその「値」が大きいほど指標値は小さくなる。また、分析制御モジュール２０３３は、音声と表情の両観点で総合的に指標値を算出してもよい。総合的算出は、例えば、音声の値および表情の値の合算である。例えば、分析制御モジュール２０３３は、音声に係る状態「盛り上がり」の値「３」と表情に係る状態「盛り上がり」の値「１」に基づいて、指標値「４」を算出する。例えば、分析制御モジュール２０３３は、音声に係る状態「盛り上がり」の値「３」と表情に係る状態「盛り下がり」の値「１」に基づいて、指標値「２」を算出する。単純に合算するのみでなく、少なくともいずれか一方に重み付けをした上で、合算するようにしてもよい。 (Step S802A: Evaluate the ongoing set of 1)
Specifically, for example, the analysis control module 2033 evaluates the excitement of a gathering in progress within a space. For example, based on information relating to the excitement of a gathering in space, the analysis control module 2033 calculates an index value indicating the excitement of a gathering, either from a vocal perspective or from a facial expression perspective. Specifically, the larger the "value" for the "state" being "excited," the larger the index value, and the larger the "value" for the "state" being "decreased," the smaller the index value. The analysis control module 2033 may also calculate an index value comprehensively from both vocal and facial expression perspectives. Comprehensive calculation is, for example, the sum of the vocal value and the facial expression value. For example, the analysis control module 2033 calculates an index value "4" based on a vocal state "excited" value "3" and a facial state "excited" value "1". For example, the analysis control module 2033 calculates an index value "2" based on a vocal state "excited" value "3" and a facial state "decreased" value "1". Instead of simply adding them up, it would be better to weight at least one of the values before adding them up.

また、分析制御モジュール２０３３は、１の集まりの盛り上がりを継続的に評価することで、当該集まりの盛り上がりの指標値の時間的な推移を算出する。 Furthermore, the analysis and control module 2033 continuously evaluates the rise of a group of 1s and calculates the temporal change in the index value of that group's rise.

（ステップＳ８０２Ｂ盛り上がりに係る傾向を取得）
具体的には、例えば、分析制御モジュール２０３３は、盛り上がりに係る傾向を取得する。例えば、分析制御モジュール２０３３は、複数の過去の集まりにおける盛り上がりの時間的な推移を比較することで、盛り上がりに係る傾向を取得する。具体的には、分析制御モジュール２０３３は、例えば、時間、場所、主催者、参加者、テーマの観点で複数の集まりを比較することで、月曜日１０時から１１時、第３会議室、主催者としての営業部長、参加者としての営業部各課長、売上報告が集まりを盛り上げる要素であることを取得する。また、具体的には、分析制御モジュール２０３３は、例えば、発話の観点で複数の集まりを比較することで、集まり序盤におけるアイスブレイクおよびクロージングトークが集まりを盛り上げる要素であることを取得する。 (Step S802B Obtain the trend related to the surge)
Specifically, for example, the analysis control module 2033 acquires trends related to the level of excitement. For example, the analysis control module 2033 acquires trends related to excitement by comparing the temporal progression of excitement in multiple past gatherings. Specifically, for example, the analysis control module 2033 acquires that a gathering held on Monday from 10 to 11 a.m. in the third meeting room, with the sales manager as the organizer and sales department managers as participants, and that sales reports are elements that contribute to the excitement of the gathering, by comparing multiple gatherings from the perspective of speech. Furthermore, specifically, the analysis control module 2033 acquires that icebreakers and closing talks at the beginning of a gathering are elements that contribute to the excitement of the gathering, by comparing multiple gatherings from the perspective of speech.

比較される集まりは、同じ時間かつ異なる空間における集まりであってもよいし、異なる時間かつ同じ空間における集まりであってもよいし、異なる時間勝つ異なる空間における集まりであってもよい。 The groups being compared may be groups in the same time and different spaces, groups in different times and the same space, or groups in different times and different spaces.

（ステップＳ８０２Ｃ模範的推移を取得し、活用）
具体的には、例えば、分析制御モジュール２０３３は、盛り上がりの模範的推移を取得する。例えば、分析制御モジュール２０３３は、全般的に高い盛り上がりを呈した過去の集まりを参照することで、所定の模範的推移の軌跡を取得する。所定の模範的推移の軌跡は、例えば、周期的な盛り上がりのピークを迎えつつも全般的に右肩上がりの軌跡である。 (Step S802C Obtain and utilize exemplary transitions)
Specifically, for example, the analysis and control module 2033 obtains a model trend of the surge. For example, the analysis and control module 2033 obtains a predetermined model trend trajectory by referring to past collections that generally showed high surges. The predetermined model trend trajectory is, for example, a trajectory that generally slopes upward while reaching periodic peaks in the surge.

また、具体的には、例えば、分析制御モジュール２０３３は、盛り上がりの模範的推移を活用する。例えば、分析制御モジュール２０３３は、ステップＳ８０２Ａで評価されている進行中の集まりの盛り上がりの推移を模範的推移と比較することで、進行中の集まりの盛り上がりを採点する。進行中の集まりの盛り上がりの推移が模範的推移をなぞるほど、進行中の集まりの盛り上がりの推移は高く採点が算出される。また、集まりは、完了後に採点されてもよい。採点は、例えば、１００点満点の数的評価であってもよいし、ＡからＦまでのアルファベット評価でもよい。 Specifically, for example, the analysis and control module 2033 utilizes exemplary progressions of excitement. For instance, the analysis and control module 2033 scores the excitement of the ongoing gathering by comparing its progression, evaluated in step S802A, with an exemplary progression. The more closely the ongoing gathering's excitement progression mirrors the exemplary progression, the higher the score calculated. The gathering may also be scored after completion. The scoring could be, for example, a numerical evaluation out of 100 points, or an alphabetical evaluation from A to F.

（ステップＳ８０２Ｄ事業所を評価）
具体的には、例えば、分析制御モジュール２０３３は、事業所全体の活性を評価する。例えば、分析制御モジュール２０３３は、ステップ８０２Ａで算出された各空間内の集まりの盛り上がりを示す指標値に基づき、１の事業所を構成する全空間内の集まりの盛り上がりを表す指数を算出する。例えば、指数は、各空間における指標値の合計である。なお、合計に限らず、所定の統計学的手法により、指数を算出してもよい。指数は、各空間の大きさに応じて傾斜をかけられた指標値の合計でもよい。例えば、二つの空間の元の指標値が同じであっても、一方の空間の大きさが他方の空間の大きさの２倍である場合に、一方の空間の指標値は他方の空間の指標値の２倍である。 (Step S802D: Evaluate the business establishment)
Specifically, for example, the analysis and control module 2033 evaluates the activity of the entire business site. For example, based on the index values indicating the level of activity of the group in each space calculated in step 802A, the analysis and control module 2033 calculates an index representing the level of activity of the group in all the spaces that make up one business site. For example, the index is the sum of the index values in each space. However, the index may be calculated using a predetermined statistical method, not limited to a sum. The index may also be the sum of the index values that have been weighted according to the size of each space. For example, even if the original index values of two spaces are the same, if the size of one space is twice that of the other space, the index value of one space is twice that of the other space.

また、具体的には、例えば、分析制御モジュール２０３３は、事業所の活性に関して統計的データを算出する。例えば、分析制御モジュール２０３３は、区切られた時間における指数の平均値を算出する。区切られた時間は、たとえば、午前、曜日、一週、一月、一年、等である。さらに、例えば、分析制御モジュール２０３３は、区切られた時間における指数の中央値、最高値、最低値を算出してもよい。 Furthermore, specifically, the analysis and control module 2033 calculates statistical data regarding the activity of the business establishment. For example, the analysis and control module 2033 calculates the average value of the index over defined time periods. These defined time periods may be, for example, mornings, days of the week, weeks, months, or years. Additionally, the analysis and control module 2033 may calculate the median, maximum, and minimum values of the index over defined time periods.

（ステップＳ８０２Ｅアクションの効果を検証）
具体的には、例えば、分析制御モジュール２０３３は、盛り上がりを促すためのアクションの盛り上がりに関する効果を検証する。例えば、分析制御モジュール２０３３は、ステップＳ８０２Ａで算出された盛り上がりの指標値の推移においてアクションが行われた時点から一定時間指標値の推移を観察することで、当該アクションの盛り上がりに関する効果を検証する。アクションが行われた時点から一定時間指標値が上昇すれば、当該アクションはが上昇したことを判定することで、分析制御モジュール２０３３は当該アクションが盛り上がりに関して有効であると判定する。 (Step S802E Verify the effect of the action)
Specifically, for example, the analysis control module 2033 verifies the effect of an action on increasing excitement. For example, the analysis control module 2033 verifies the effect of an action on increasing excitement by observing the trend of the excitement index value calculated in step S802A for a certain period of time from the time the action is performed. If the index value rises for a certain period of time from the time the action is performed, the analysis control module 2033 determines that the action has increased excitement and that the action is effective in increasing excitement.

盛り上がりを促すためのアクションは、人間の活性にかかわる活動である。例えば、盛り上がりを促すためのアクションは、ラジオ体操、コーヒー配膳、プロジェクトキックオフのスピーチ、等である。 Actions designed to foster enthusiasm are those that stimulate human activity. For example, actions to encourage enthusiasm include morning exercises, coffee distribution, and project kickoff speeches.

（ステップＳ８０２Ｆ音楽と音量の選定）
具体的には、例えば、分析制御モジュール２０３３は、ステップ８０２Ａで算出された空間における集まりの盛り上がりの指標値に応じて、当該空間に流す音楽と音量を選定する。例えば、分析制御モジュール２０３３は、現在の指標値が小さい空間には、盛り上がりを促すために、アップテンポの曲と大きい音量を選定する。また、例えば、分析制御モジュール２０３３は、現在の指標値が大きい空間には、盛り上がりを維持するために、音量を下げる。 (Step S802F: Music and Volume Selection)
Specifically, for example, the analysis control module 2033 selects the music and volume to play in a space according to the index value of the excitement level of the gathering in that space, which was calculated in step 802A. For example, in spaces where the current index value is low, the analysis control module 2033 selects an up-tempo song and a high volume to encourage excitement. Also, for example, in spaces where the current index value is high, the analysis control module 2033 lowers the volume to maintain the excitement level.

曲は、例えば、選定の度にインターネットからダウンロードされてもよいし、記憶部２０２に保存されていてもよい。 The songs may, for example, be downloaded from the internet each time a selection is made, or they may be stored in the memory unit 202.

ステップＳ８０３において、制御部２０３は、ステップＳ８０２での分析に基づき、空間にフィードバッグする。 In step S803, the control unit 203 provides feedback to the space based on the analysis in step S802.

（ステップＳ８０３Ａ空間の端末に表示）
具体的には、例えば、送信制御モジュール３０３２は空間に関する分析結果を端末装置１０に送信する。例えば、送信制御モジュール３０３２は、ステップＳ８０２Ａで算出された１の空間における集まりの盛り上がりの指標値の推移を、および、ステップＳ８０２Ｃで取得された盛り上がりの模範的推移と盛り上がりに対する採点を、当該空間に設置された端末装置１０に送信する。送信制御モジュール３０３２は、集まりの進行中にリアルタイムで継続的に送信してもよいし、集まりの完了時に一回的に送信してもよい。 (Step S803A Displayed at the terminal in space)
Specifically, for example, the transmission control module 3032 transmits the analysis results regarding the space to the terminal device 10. For example, the transmission control module 3032 transmits to the terminal device 10 installed in the space the trend of the index value of the crowd's excitement in space, calculated in step S802A, and the exemplary trend of the excitement and the score for the excitement, acquired in step S802C. The transmission control module 3032 may transmit continuously in real time while the crowd is in progress, or it may transmit once when the crowd is completed.

（ステップＳ８０３Ｂ事業所の端末に表示）
具体的には、例えば、送信制御モジュール３０３２は事業所に関する分析結果を端末装置１０に送信する。例えば、送信制御モジュール３０３２は、ステップＳ８０２Ｄで算出された１の事業所全体の活性を表す指数を、当該事業所に設置された端末装置１０に送信する。 (Step S803B Displayed on the terminal at the business office)
Specifically, for example, the transmission control module 3032 transmits the analysis results regarding the business establishment to the terminal device 10. For example, the transmission control module 3032 transmits the index representing the overall activity of one business establishment, calculated in step S802D, to the terminal device 10 installed at that business establishment.

（ステップＳ８０３Ｃスピーカー）
具体的には、例えば、送信制御モジュール３０３２は空間に流す音楽をスピーカー３３に送信する。例えば、送信制御モジュール３０３２は、ステップＳ８０２Ｆで選定された音楽と音量を、当該空間に設置されたスピーカー３３に送信する。 (Step S803C speaker)
Specifically, for example, the transmission control module 3032 transmits the music to be played in the space to the speaker 33. For example, the transmission control module 3032 transmits the music and volume selected in step S802F to the speaker 33 installed in the space.

＜７画面例＞
本開示の画面例を説明する。 <7. Screen Example>
An example of the screen shown in this disclosure is explained below.

図１１は、端末装置１０のディスプレイ１４１の表示画面の一例を表す模式図である。図１１では、２０１会議室における進行中の集まりの盛り上がりを表示する場合を例に説明している。 Figure 11 is a schematic diagram showing an example of the display screen of the terminal device 10's display 141. Figure 11 illustrates the case where the progress of an ongoing meeting in conference room 201 is displayed.

テキスト９００１は、集まりが行われている空間を示すテキストである。 Text 9001 is a text that describes the space where the gathering is taking place.

テキスト９００２は、現在の時間を示すテキストである。 Text 9002 is the text indicating the current time.

グラフ９００３は、集まりの盛り上がりの指標値の推移を表現するためのグラフである。グラフ９００３は、盛り上がりの指標値を示す縦軸および時間推移を示す横軸で構成される。縦軸の値は大きいほど、盛り上がりの高い指標値を示す。横軸の原点は、例えば、進行中の集まりの開始時間を示してもよいし、事業所の始業時間を示してよい。 Graph 9003 is a graph that shows the trend of the indicator value of the level of enthusiasm at a gathering. Graph 9003 consists of a vertical axis showing the indicator value of enthusiasm and a horizontal axis showing the trend over time. A higher value on the vertical axis indicates a higher indicator value of enthusiasm. The origin of the horizontal axis may, for example, represent the start time of the ongoing gathering or the start time of the business establishment.

実線９００４は、ステップＳ８０２Ａで算出された進行中の盛り上がりの指標値の推移を描く実線である。実線９００４は、リアルタイム処理で指標値を描いてもよいし、バッチ処理で指標値を描いてもよいし、集まりの完了後に指標値をまとめて描いてもよい。バッチ処理は、例えば、３０秒毎に行われる。 The solid line 9004 represents the trend of the ongoing surge index value calculated in step S802A. The solid line 9004 may plot the index value in real-time processing, in batch processing, or all at once after the collection is complete. Batch processing is performed, for example, every 30 seconds.

破線９００５は、実線９００４との比較用の破線である。具体的には、例えば、破線９００５は、過去における同時間の集まりに関する指標値の平均値の推移を描く。また、具体的には、例えば、破線９００５は、ステップＳ８０２Ｃで取得された模範的推移を描いてもよい。 The dashed line 9005 is a dashed line for comparison with the solid line 9004. Specifically, for example, the dashed line 9005 depicts the trend of the average index value for the same time period in the past. Alternatively, for example, the dashed line 9005 may depict the exemplary trend obtained in step S802C.

破線９００５は、現在時刻までの推移だけを描いてもよいし、現在時刻より先の推移を描いてもよい。 The dashed line 9005 may depict only the time progression up to the current time, or it may depict the time progression beyond the current time.

図１２は、端末装置１０のディスプレイ１４１の表示画面の一例を表す模式図である。図１２では、２０１会議室における完了した集まりの採点を表示する場合を例に説明している。 Figure 12 is a schematic diagram showing an example of the display screen of the terminal device 10's display 141. Figure 12 illustrates the case where the scores of completed meetings in conference room 201 are displayed.

テキスト１００１は、集まりが行われた空間を示すテキストである。 Text 1001 is a text that describes the space where the gathering took place.

テキスト１００２は、集まりが行われた時間を示すテキストである。 Text 1002 is the text indicating the time the gathering took place.

アイコン１００３は、集まりの盛り上がりの内容を形容するアイコンである。分析制御モジュール２０３３は、今回の集まりを時間で区切ったうえで、各区切りにおいて当該集まりの推移を過去の平均的推移と比較する。分析制御モジュール２０３３は、当該集まりが全般的に平均的推移を上回っていれば当該集まりが盛り上がっていると判定し、当該集まりが全般的に平均的推移を下回っていれば当該集まりが盛り下がっていると判定する。分析制御モジュール２０３３は、例えば、集まりの前半は盛り上がったが集まりの後半は盛り下がった集まりに、晴のち曇アイコンを出力する。提示制御モジュール２０３４は、出力されたアイコンをユーザに提示する。 Icon 1003 is an icon that describes the level of excitement at a gathering. The analysis control module 2033 divides the current gathering into time segments and compares the progress of each segment with the average progress of past gatherings. The analysis control module 2033 determines that the gathering is exciting if it generally exceeds the average progress, and that it is unexciting if it generally falls below the average progress. For example, the analysis control module 2033 outputs a "sunny then cloudy" icon for a gathering that was exciting in the first half but unexciting in the second half. The presentation control module 2034 presents the outputted icons to the user.

テキスト１００４は、集まりの盛り上がりに対する採点を表すテキストである。例えば、テキスト１００４は、ステップＳ８０２Ｃにおいて算出された採点を表す。 Text 1004 represents the score given for the level of enthusiasm at the gathering. For example, text 1004 represents the score calculated in step S802C.

テキスト１００５は、集まりの盛り上がりに対するアドバイスを表すテキストである。分析制御モジュール２０３３は、今回の集まりの推移を模範的推移と比較し、当該集まりの盛り上がりが、模範的推移に対してどれだけ足りていないかを分析する。分析制御モジュール２０３３は、盛り上がりの際に基づくアドバイスを出力する。例えば、分析制御モジュール２０３３は、前半は盛り上がったが、後半は模範的推移に対して盛り下がった集まりに対して、「次はクロージングトークで盛り上げてみよう」とのアドバイスを出力する。提示制御モジュール２０３４は、出力されたアドバイスをユーザに提示する。 Text 1005 is text representing advice regarding the engagement level of the gathering. The analysis control module 2033 compares the progress of the current gathering to a model progress and analyzes how much the engagement level of the gathering falls short of the model progress. The analysis control module 2033 outputs advice based on the engagement level. For example, for a gathering that was engaging in the first half but declined in the second half compared to the model progress, the analysis control module 2033 would output the advice, "Next time, try to liven things up with the closing talk." The presentation control module 2034 then presents the outputted advice to the user.

図１３は、端末装置１０のディスプレイ１４１の表示画面の一例を表す模式図である。図１３では、麻布台事業所における活況を表示する場合を例に説明している。 Figure 13 is a schematic diagram showing an example of the display screen of the terminal device 10's display 141. Figure 13 illustrates the case where activity levels at the Azabudai business site are displayed.

テキスト１１０１は、事業所、および、当該事業所への本日の出社人数を示すテキストである。 Text 1101 is a text indicating the business location and the number of employees present at that location today.

グラフ１１０２は、エモーショナルビート、すなわち、事業所における盛り上がりを表現するためのグラフである。グラフ１１０２は、盛り上がりの指数を示す縦軸および時間推移を示す横軸で構成される。縦軸の値は大きいほど、盛り上がりの高い指数を示す。横軸は、例えば、一日、一週、一月、三月、半年、一年等の時間を示す。横軸の時間は、ユーザによってインタフェースの操作を介して切り替えられてもよい。 Graph 1102 is a graph that represents emotional beat, or excitement, within a workplace. Graph 1102 consists of a vertical axis showing an index of excitement and a horizontal axis showing the progression over time. A higher value on the vertical axis indicates a higher index of excitement. The horizontal axis represents time periods such as one day, one week, one month, three months, six months, or one year. The time period on the horizontal axis may be switched by the user through interface operations.

グラフ１１０２は、エモーションを表すアイコンが付されてもよい。例えば、グラフ１１０２で所定の指数を超えた場面は、笑顔を表すアイコンが付される。 Graph 1102 may be accompanied by icons representing emotions. For example, scenes in Graph 1102 that exceed a predetermined index may be accompanied by an icon representing a smile.

実線１１０３は、ステップＳ８０２Ｄで算出された１の事業所の盛り上がりの指数の推移を描く実線である。実線は、リアルタイム処理で指数を描いてもよいし、バッチ処理で指数を描いてもよい。バッチ処理は、例えば、３０秒毎に行われる。 The solid line 1103 represents the trend of the index of activity for one business establishment, calculated in step S802D. The index may be plotted in real-time or in batch processing. Batch processing is performed, for example, every 30 seconds.

破線１１０４は、実線１１０３との比較用の破線である。具体的には、例えば、破線１１０４は、過去における同時間の事業所に関する指数の平均値の推移を描く。 The dashed line 1104 is a comparison line with the solid line 1103. Specifically, for example, the dashed line 1104 shows the trend of the average index for the same time period in the past.

破線１１０４は、現在時刻までの推移だけを描いてもよいし、現在時刻より先の推移を描いてもよい。 The dashed line 1104 may depict only the progression up to the current time, or it may depict the progression beyond the current time.

円グラフ１１０５は、エモーショナルレート、すなわち、事業所に係る表情の割合を示す。具体的には、例えば、円グラフ１１０５は、出社人数のうち特定の表情をとる人数の割合を示す円グラフである。具体的には、分析制御モジュール２０３３は、画像解析ログＤＢ２０２２において項目「時間」が現在時刻であるレコードの中から、項目「状態」が状態「盛り上がり」である人数の和を、項目「状態」が状態「盛り下がり」である人数の和を、および、項目「状態」が状態「ニュートラル」である人数の和を算出する。また、分析制御モジュール２０３３は、現在の出社人数から上記三つの和の和を引くことによって、「その他」の人数を算出する。次いで、具体的には、分析制御モジュール２０３３は、「盛り上がり」の人数の出社人数に対する割合を、「盛り下がり」の人数の出社人数に対する割合を、「ニュートラル」の人数の出社人数に対する割合を、および、「その他」の人数の出社人数に対する割合を、算出する。次いで、具体的には、分析制御モジュール２０３３は、笑顔アイコンを付して「盛り上がり」の人数の割合を、不安顔アイコンを付して「盛り下がり」の人数の割合を、真顔アイコンを付して「ニュートラル」の人数の割合を、および、文書アイコンを付して「その他」の人数の割合を、出力する。提示制御モジュール２０３４は、出力された割合およびアイコンをユーザに提示する。 Pie chart 1105 shows the emotional rate, that is, the proportion of facial expressions related to the workplace. Specifically, for example, pie chart 1105 is a pie chart that shows the proportion of people with a specific facial expression among the number of people who are present at work. Specifically, the analysis control module 2033 calculates the sum of the number of people whose "State" is "Excited", the sum of the number of people whose "State" is "Depressed", and the sum of the number of people whose "State" is "Neutral", from the records in the image analysis log DB 2022 where the item "Time" is the current time. The analysis control module 2033 also calculates the number of "Other" people by subtracting the sum of the above three sums from the current number of people present at work. Then, specifically, the analysis control module 2033 calculates the proportion of people who are "Excited" relative to the number of people present at work, the proportion of people who are "Depressed", the proportion of people who are "Neutral", and the proportion of people who are "Other" relative to the number of people present at work. Specifically, the analysis control module 2033 outputs the percentage of people who are "excited" with a smiling face icon, the percentage of people who are "disappointed" with an anxious face icon, the percentage of people who are "neutral" with a serious face icon, and the percentage of people who are "other" with a document icon. The presentation control module 2034 then presents the output percentages and icons to the user.

リスト１１０６は、事業所における盛り上がりを促すためのアクションとその参加者を示すリストである。 List 1106 is a list of actions and participants to encourage enthusiasm within the workplace.

図１４は、端末装置１０のディスプレイ１４１の表示画面の一例を表す模式図である。図１４では、事業所における各空間における集まりのリアルタイムの盛り上がりを表示する場合を例に説明している。 Figure 14 is a schematic diagram showing an example of the display screen of the terminal device 10's display 141. Figure 14 illustrates an example where the real-time activity level of gatherings in various spaces within a business premises is displayed.

マップ１２０１は、事業所全体および事業所を構成する空間を示すマップである。空間は、集まりが行われ盛り上がりが評価される場所であり、その空間の名前を表すテキストが付される。空間の名前は、例えば、会議室Ａ、サロン卓１、等である。マップ１２０１は、平面的に描かれもよいし、立体的に描かれてもよい。 Map 1201 is a map showing the entire business premises and the spaces that comprise them. Each space is a place where gatherings take place and where activity is evaluated, and it is labeled with text indicating its name. Examples of space names include "Meeting Room A," "Salon Table 1," etc. Map 1201 may be drawn in two dimensions or three dimensions.

ボリュームメーター１２０２－１からボリュームメーター１２０２－３で以て例示されるボリュームメーター１２０２は、ステップＳ８０２Ａで算出されたリアルタイムの盛り上がりを示す指標値に基づいて、空間における集まりのリアルタイムの盛り上がりを示すボリュームメーターである。ボリュームメーター１２０２は、例えば、指標値の存在しうる最高値から最低値までを５段階に区切ったうえで、５段階に相応する５つのバーでリアルタイムの指標値を表現する。指標値が高いほど、すなわち、集まりが盛り上がるほど、ボリュームメーター１２０２のバーは積み上がる。 The volume meters 1202, as exemplified by volume meters 1202-1 to 1202-3, are volume meters that indicate the real-time rise of a cluster in space based on the real-time index value of the cluster calculated in step S802A. For example, volume meter 1202 divides the possible range of the index value from the highest to the lowest value into five stages, and represents the real-time index value with five bars corresponding to these five stages. The higher the index value, that is, the more the cluster rises, the more the bars on volume meter 1202 stack up.

ボリュームメーター１２０２は、盛り上がりのトレンドを示すアイコンが付されてもよい。例えば、直近５分間で盛り上がりが上昇トレンドにある空間のボリュームメーターは、上向きの矢印が付される。 The volume meter 1202 may be marked with an icon indicating the trend of the rise. For example, the volume meter for a space where the rise has been on an upward trend over the past five minutes may be marked with an upward arrow.

＜８小括＞
以上のように、上記実施形態では、システム１は、所定の空間（オフィス内の空間、又は、会議室）に設置され空間で生じる音声を収集する手段と、所定の空間に設置され音声を収集する手段により収集された音声の種類を分類し分類された種類と音量とに基づき盛り上がりに係る状態及び当該状態の度合いを決定する手段と、状態及び度合いを集計し空間の盛り上がりの時間推移を算出する手段と、空間の盛り上がりの時間推移に基づく情報をユーザに提示する手段と、を備える。これにより、未加工の音声データと画像データをクラウドに送信することはなく、空間の盛り上がりを算出することが可能となる。したがって、本実施形態に係るプログラム、方法、情報処理装置、システムによれば、セキュリティを保持しつつ、所定の空間における盛り上がりを音声の観点で自動的に定量化できる。 <8 Summary>
As described above, in the above embodiment, System 1 includes means for collecting sounds generated in a predetermined space (a space within an office or a conference room) that is installed in the predetermined space; means for classifying the types of sounds collected by the means for collecting sounds installed in the predetermined space and determining the state and degree of excitement based on the classified type and volume; means for aggregating the states and degrees and calculating the time progression of excitement in the space; and means for presenting information based on the time progression of excitement in the space to the user. As a result, it is possible to calculate the excitement in the space without transmitting raw audio and image data to the cloud. Therefore, according to the program, method, information processing device, and system of this embodiment, it is possible to automatically quantify the excitement in a predetermined space from an audio perspective while maintaining security.

また、上記実施形態では、システム１は、所定の空間に設置され空間内の画像を撮影する手段を備え、決定する手段は、撮影する手段により収集された画像に含まれる表情の種類を分類し、分類された種類と当該種類の表情の人数とに基づき、盛り上がりに係る状態及び当該状態の度合いを決定する。これにより、未加工の音声データと画像データをクラウドに送信することはなく、セキュリティを保持しつつ、所定の空間における盛り上がりを参加者の表情の観点で自動的に定量化することができる。 Furthermore, in the above embodiment, System 1 is installed in a predetermined space and includes means for capturing images within that space. The determination means classifies the types of facial expressions contained in the images collected by the capturing means, and determines the state of excitement and its degree based on the classified types and the number of people with each type of expression. This eliminates the need to transmit raw audio and image data to the cloud, maintaining security while automatically quantifying the level of excitement in a predetermined space from the perspective of participants' facial expressions.

また、上記実施形態では、システム１は、空間における複数の集まりの盛り上がりの推移を比較し、盛り上がりに関する所定の傾向を取得する手段を備える。これにより、サーバ２０は、複数の集まりを比較しながら、盛り上がりを促進するための施策を取得することが可能となる。また、上記実施形態では、システム１は、取得された傾向に基づく盛り上がりに関する改善提案を提示する手段を備える。これにより、ユーザは盛り上がりを促進するための施策を知ることが可能となり、自身が参加する集まりの活気を向上させることが可能となる。 Furthermore, in the above embodiment, System 1 includes means for comparing the progression of excitement levels in multiple gatherings within a space and acquiring predetermined trends regarding excitement. This allows Server 20 to acquire measures to promote excitement while comparing multiple gatherings. Additionally, in the above embodiment, System 1 includes means for presenting improvement suggestions regarding excitement based on the acquired trends. This allows users to learn about measures to promote excitement and improve the vitality of the gatherings they participate in.

また、上記実施形態では、システム１は、盛り上がりを促すためのアクションを受けた盛り上がりを示す指標値に基づき、アクションの盛り上がりに関する効果を検証する手段を備える。これにより、ユーザは、どのようなアクションが盛り上がりに貢献するかを認識することが可能となる。従来では、ユーザは盛り上がりを促進するための施策は定量的検証が難しかったが、これによりユーザは定量的検証の裏付けを得た施策を知れる。 Furthermore, in the above embodiment, System 1 includes means for verifying the effect of an action on increasing excitement based on an index value indicating the level of excitement achieved by the action. This allows users to recognize which actions contribute to increasing excitement. Previously, it was difficult for users to quantitatively verify measures to promote excitement, but this allows users to learn about measures that have been supported by quantitative verification.

また、上記実施形態では、システム１は、複数の空間における盛り上がりを示す指標値に基づき、事業体全体の活性を表す指数を統計的に算出する手段を備える。これにより、事業所全体の活性を自動的に定量化することができる。従来では、事業体の活性度は視覚化が難しかったが、本実施形態により、事業体の活性度を可視化することが可能となる。これにより、ユーザは可視化された活性度に基づき、事業体の状態を推定することが可能となる。例えば、事業体の活性度が低く示されている場合、事業体の経営者は、新たなプロジェクトを立ち上げることで、事業体の活性度を上げるというような判断をすることが可能となる。 Furthermore, in the above embodiment, System 1 includes means for statistically calculating an index representing the overall activity of the business entity based on indicator values showing the level of activity in multiple spaces. This allows for the automatic quantification of the overall activity of the business entity. While visualizing the activity level of a business entity was previously difficult, this embodiment makes it possible to visualize the activity level of the business entity. This allows users to estimate the state of the business entity based on the visualized activity level. For example, if the activity level of the business entity is shown to be low, the business entity's manager can make a decision to increase the activity level of the business entity by launching a new project.

また、上記実施形態では、システム１は、空間の盛り上がり指標値に応じた音楽を空間に流す手段を備える。これにより、空間の盛り上がりを促進および高い盛り上がりを維持することができる。 Furthermore, in the above embodiment, System 1 includes means for playing music in the space according to the spatial excitement index value. This allows for the promotion and maintenance of a high level of excitement in the space.

＜９変形例＞
上記実施形態では、盛り上がりを測定するためにマイクで空間内の音声を収集し、撮影装置で空間内の様子を撮影する場合を例に説明した。しかしながら、盛り上がりを測定するために、ビーコンを用いて空間内の人の数と人の位置を取得してもよい。 <9. Variant>
In the above embodiment, we described an example where a microphone is used to collect sound within the space and a camera is used to capture images of the space in order to measure the level of excitement. However, beacons may also be used to obtain the number of people and their locations within the space in order to measure the level of excitement.

例えば、ビーコン専用アプリをインストールしたユーザ端末は、空間内に設置された解析装置３０から送信されるビーコン信号を受信する。次いで、例えば、ユーザ端末は、ユーザ端末の位置情報を付近の解析装置３０に送信し、解析装置３０は、ユーザ端末の位置情報をサーバ２０に送信する。もしくは、ユーザ端末は、ユーザ端末の位置情報をサーバ２０に直接送信してもよい。これにより、サーバ２０は、空間内の人の数と人の位置を取得する。ユーザがビーコン信号を送信するデバイスを有し、解析装置３０が当該デバイスから送信されるビーコン信号に応答するようにしてもよい。 For example, a user terminal with a dedicated beacon application installed receives a beacon signal transmitted from an analysis device 30 installed in the space. Then, for example, the user terminal transmits its location information to a nearby analysis device 30, and the analysis device 30 transmits the user terminal's location information to the server 20. Alternatively, the user terminal may transmit its location information directly to the server 20. This allows the server 20 to obtain the number of people and their locations in the space. Alternatively, the user may have a device that transmits beacon signals, and the analysis device 30 may respond to beacon signals transmitted from that device.

解析装置３０は、空間内の数、位置等を解析する機能を有してもよい。解析装置３０は、検出された数に基づき、盛り上がりに係る状態及び当該状態の度合いを決定する。具体的には、解析装置３０は、検出されたユーザの数、密度、変化量又はこれらのうち少なくとも２つの組み合わせ等に基づき、盛り上がりに係る状態及び当該状態の度合いを決定する。解析装置３０は、例えば、検出されたユーザの数が短時間のうちに所定人数以上増えている場合、状態「盛り上がり」の値「３」と評価する。解析装置３０は、解析結果をサーバ２０へ送信する。 The analysis device 30 may have functions to analyze the number, position, etc., within the space. Based on the detected numbers, the analysis device 30 determines the state related to the "boom" and the degree of that state. Specifically, the analysis device 30 determines the state related to the "boom" and the degree of that state based on the number, density, change, or at least two combinations of these detected users. For example, if the number of detected users increases by a predetermined number or more in a short period of time, the analysis device 30 evaluates the state "boom" to a value of "3". The analysis device 30 transmits the analysis results to the server 20.

この変形例では、システム１は、例えば、所定の空間に設置され空間内のユーザの位置を検出する手段を備え、決定する手段は、検出された数に基づき、盛り上がりに係る状態及び当該状態の度合いを決定する。これにより、所定の空間における盛り上がりをユーザの位置の観点で自動的に定量化することができる。 In this modified version, System 1, for example, is installed in a predetermined space and includes means for detecting the user's position within that space. The determination means determines the state of elevation and the degree of that state based on the number of detected points. This allows for the automatic quantification of elevation in a predetermined space from the perspective of the user's position.

また、ビーコンを用いて各人に割りてられる識別番号を用いて空間内の人を特定してもよい。 Alternatively, individuals within a space may be identified using identification numbers assigned to each person via beacons.

例えば、ビーコン専用アプリをインストールしたユーザ端末は、空間内に設置された解析装置３０から送信されるビーコン信号を受信する。次いで、例えば、ユーザ端末は、ユーザ端末の位置情報を付近の解析装置３０に送信する。解析装置３０は、ユーザ端末の位置情報をサーバ２０に送信する。もしくは、ユーザ端末は、ユーザ端末の位置情報をサーバ２０に直接送信してもよい。送受信される情報は、各人の当該アプリのアカウントごとに割当てられる識別番号を含む。サーバ２０は、空間内の人物を識別番号に基づいて特定する。ユーザがビーコン信号を送信するデバイスを有し、解析装置３０が当該デバイスから送信されるビーコン信号に応答するようにしてもよい。 For example, a user terminal with a dedicated beacon application installed receives a beacon signal transmitted from an analysis device 30 installed in the space. Then, for example, the user terminal transmits its location information to a nearby analysis device 30. The analysis device 30 transmits the user terminal's location information to the server 20. Alternatively, the user terminal may transmit its location information directly to the server 20. The transmitted and received information includes an identification number assigned to each person's account in the application. The server 20 identifies individuals in the space based on their identification numbers. Alternatively, a user may have a device that transmits beacon signals, and the analysis device 30 may respond to beacon signals transmitted from that device.

サーバ２０は、空間における盛り上がりと、識別番号に基づいて特定される人物との関連性を分析する。これにより、例えば、特定の人物が空間にいると、当該空間が盛り上がる等、空間における盛り上がりと空間内の人物との相関関係を取得できる。 Server 20 analyzes the correlation between the level of activity in a space and individuals identified based on their identification numbers. This allows for the acquisition of correlations between the level of activity in a space and the individuals within that space; for example, the presence of a specific individual in a space can cause that space to become more active.

また、上記実施形態では、解析装置３０がディスプレイを有さない場合を例に説明した。しかしながら、解析装置３０は、解析装置３０が設置された空間の盛り上がりに係る情報をユーザに示すためのディスプレイを有してもよい。例えば、提示制御モジュール３０３４は、ステップＳ８０２Ａで算出された１の空間における集まりの盛り上がりの指標値の推移を、および、ステップＳ８０２Ｃで取得された盛り上がりの模範的推移と盛り上がりに対する採点を、解析装置３０のディスプレイに表示させる。 Furthermore, the above embodiment described an example where the analysis device 30 does not have a display. However, the analysis device 30 may have a display for showing the user information related to the elevation of the space in which the analysis device 30 is installed. For example, the display control module 3034 causes the analysis device 30's display to show the trend of the elevation index value of the cluster in space 1, calculated in step S802A, and the exemplary trend of the elevation and the score for the elevation obtained in step S802C.

＜１０コンピュータの基本ハードウェア構成＞
図１５は、コンピュータ９０の基本的なハードウェア構成を表すブロック図である。コンピュータ９０は、プロセッサ９０１、主記憶装置９０２、補助記憶装置９０３、通信ＩＦ９９１（インタフェース、Interface）を少なくとも備える。これらは通信バス９２１により相互に電気的に接続される。 <10. Basic Hardware Configuration of a Computer>
Figure 15 is a block diagram showing the basic hardware configuration of computer 90. Computer 90 comprises at least a processor 901, main memory 902, auxiliary storage 903, and a communication interface 991. These are electrically connected to each other by a communication bus 921.

プロセッサ９０１とは、プログラムに記述された命令セットを実行するためのハードウェアである。プロセッサ９０１は、演算装置、レジスタ、周辺回路等から構成される。 The processor 901 is hardware for executing the instruction set described in the program. The processor 901 consists of an arithmetic unit, registers, peripheral circuits, etc.

主記憶装置９０２とは、プログラム、及びプログラム等で処理されるデータ等を一時的に記憶するためのものである。例えば、ＤＲＡＭ（Dynamic Random Access Memory）等の揮発性のメモリである。 The main memory 902 is for temporarily storing programs and data processed by programs, etc. For example, it is a volatile memory such as DRAM (Dynamic Random Access Memory).

補助記憶装置９０３とは、データ及びプログラムを保存するための記憶装置である。例えば、フラッシュメモリ、ＨＤＤ（Hard Disc Drive）、光磁気ディスク、ＣＤ－ＲＯＭ、ＤＶＤ－ＲＯＭ、半導体メモリ等である。 The auxiliary storage device 903 is a storage device for storing data and programs. Examples include flash memory, HDD (Hard Disc Drive), magneto-optical disk, CD-ROM, DVD-ROM, and semiconductor memory.

通信ＩＦ９９１とは、有線又は無線の通信規格を用いて、他のコンピュータとネットワークを介して通信するための信号を入出力するためのインタフェースである。
ネットワークは、インターネット、ＬＡＮ、無線基地局等によって構築される各種移動通信システム等で構成される。例えば、ネットワークには、３Ｇ、４Ｇ、５Ｇ移動通信システム、ＬＴＥ（Long Term Evolution）、所定のアクセスポイントによってインターネットに接続可能な無線ネットワーク（例えばWi-Fi（登録商標））等が含まれる。無線で接続する場合、通信プロトコルとして例えば、Ｚ－Ｗａｖｅ（登録商標）、ＺｉｇＢｅｅ（登録商標）、Ｂｌｕｅｔｏｏｔｈ（登録商標）等が含まれる。有線で接続する場合は、ネットワークには、ＵＳＢ（Universal Serial Bus）ケーブル等により直接接続するものも含む。 The communication interface IF991 is an interface for inputting and outputting signals for communication with other computers via a network using wired or wireless communication standards.
A network consists of various mobile communication systems, such as the Internet, LANs, and wireless base stations. For example, a network includes 3G, 4G, and 5G mobile communication systems, LTE (Long Term Evolution), and wireless networks that can connect to the Internet via designated access points (e.g., Wi-Fi®). When connecting wirelessly, communication protocols include, for example, Z-Wave®, ZigBee®, and Bluetooth®. When connecting via a wired connection, the network also includes connections made directly via USB (Universal Serial Bus) cables, etc.

なお、各ハードウェア構成の全部または一部を複数のコンピュータ９０に分散して設け、ネットワークを介して相互に接続することによりコンピュータ９０を仮想的に実現することができる。このように、コンピュータ９０は、単一の筐体、ケースに収納されたコンピュータ９０だけでなく、仮想化されたコンピュータシステムも含む概念である。 Furthermore, by distributing all or part of each hardware configuration across multiple computers 90 and connecting them to each other via a network, a virtual computer 90 can be realized. Thus, the concept of computer 90 includes not only a computer 90 housed in a single enclosure or case, but also a virtualized computer system.

＜コンピュータ９０の基本機能構成＞
コンピュータ９０の基本ハードウェア構成により実現されるコンピュータの機能構成を説明する。コンピュータは、制御部、記憶部、通信部の機能ユニットを少なくとも備える。 <Basic Functional Configuration of Computer 90>
The functional configuration of the computer realized by the basic hardware configuration of computer 90 will be described. The computer comprises at least one functional unit: a control unit, a memory unit, and a communication unit.

なお、コンピュータ９０が備える機能ユニットは、それぞれの機能ユニットの全部または一部を、ネットワークで相互に接続された複数のコンピュータ９０に分散して設けても実現することができる。コンピュータ９０は、単一のコンピュータ９０だけでなく、仮想化されたコンピュータシステムも含む概念である。 Furthermore, the functional units of computer 90 can also be realized by distributing all or part of each functional unit across multiple computers 90 interconnected via a network. The concept of computer 90 includes not only a single computer 90 but also a virtualized computer system.

制御部は、プロセッサ９０１が補助記憶装置９０３に記憶された各種プログラムを読み出して主記憶装置９０２に展開し、当該プログラムに従って処理を実行することにより実現される。制御部は、プログラムの種類に応じて様々な情報処理を行う機能ユニットを実現することができる。これにより、コンピュータは情報処理を行う情報処理装置として実現される。 The control unit is realized when the processor 901 reads various programs stored in the auxiliary storage device 903, loads them into the main memory device 902, and executes processing according to those programs. The control unit can realize various functional units that perform information processing depending on the type of program. In this way, the computer is realized as an information processing device that performs information processing.

記憶部は、主記憶装置９０２、補助記憶装置９０３により実現される。記憶部は、データ、各種プログラム、各種データベースを記憶する。また、プロセッサ９０１は、プログラムに従って記憶部に対応する記憶領域を主記憶装置９０２または補助記憶装置９０３に確保することができる。また、制御部は、各種プログラムに従ってプロセッサ９０１に、記憶部に記憶されたデータの追加、更新、削除処理を実行させることができる。 The memory unit is implemented by the main memory 902 and the auxiliary memory 903. The memory unit stores data, various programs, and various databases. The processor 901 can allocate memory areas corresponding to the memory unit in the main memory 902 or the auxiliary memory 903 according to the program. The control unit can also cause the processor 901 to perform addition, update, and deletion operations on data stored in the memory unit according to the various programs.

データベースは、リレーショナルデータベースを指し、行と列によって構造的に規定された表形式のテーブル、マスタと呼ばれるデータ集合を、互いに関連付けて管理するためのものである。データベースでは、表をテーブル、マスタ、表の列をカラム、表の行をレコードと呼ぶ。リレーショナルデータベースでは、テーブル、マスタ同士の関係を設定し、関連付けることができる。
通常、各テーブル、各マスタにはレコードを一意に特定するための主キーとなるカラムが設定されるが、カラムへの主キーの設定は必須ではない。制御部は、各種プログラムに従ってプロセッサ９０１に、記憶部に記憶された特定のテーブル、マスタにレコードを追加、削除、更新を実行させることができる。
また、記憶部に、データ、各種プログラム、各種データベースを記憶させることにより、本開示にかかる情報処理装置、情報処理システムが製造されたものとして捉えることができる。 A database, specifically a relational database, is used to manage and associate data sets called masters, which are structured in a tabular format defined by rows and columns. In a database, tables are called tables, masters are called masters, the columns of tables are called columns, and the rows of tables are called records. In a relational database, relationships can be established and linked between tables and masters.
Typically, each table and master has a primary key column to uniquely identify records, but setting a primary key column is not mandatory. The control unit can instruct the processor 901 to add, delete, or update records in specific tables and masters stored in the storage unit, according to various programs.
Furthermore, by storing data, various programs, and various databases in the memory unit, the information processing device and information processing system related to this disclosure can be considered to have been manufactured.

なお、本開示におけるデータベース、マスタは、情報が構造的に規定された任意のデータ構造体（リスト、辞書、連想配列、オブジェクトなど）を含み得る。データ構造体には、データと、任意のプログラミング言語により記述された関数、クラス、メソッドなどを組み合わせることにより、データ構造体と見なし得るデータも含むものとする。 Furthermore, the databases and masters in this disclosure may include any data structures (lists, dictionaries, associative arrays, objects, etc.) in which information is structurally defined. Data structures also include data that can be considered as data structures by combining data with functions, classes, methods, etc., written in any programming language.

通信部は、通信ＩＦ９９１により実現される。通信部は、ネットワークを介して他のコンピュータ９０と通信を行う機能を実現する。通信部は、他のコンピュータ９０から送信された情報を受信し、制御部へ入力することができる。制御部は、各種プログラムに従ってプロセッサ９０１に、受信した情報に対する情報処理を実行させることができる。また、通信部は、制御部から出力された情報を他のコンピュータ９０へ送信することができる。 The communication unit is implemented by the communication IF 991. The communication unit provides the functionality to communicate with other computers 90 via the network. The communication unit can receive information transmitted from other computers 90 and input it to the control unit. The control unit can cause the processor 901 to perform information processing on the received information according to various programs. Furthermore, the communication unit can transmit information output from the control unit to other computers 90.

また、上記の各構成、機能、処理部、処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。また、本発明は、実施例の機能を実現するソフトウェアのプログラムコードによっても実現できる。この場合、プログラムコードを記録した記憶媒体をコンピュータに提供し、そのコンピュータが備えるプロセッサが記憶媒体に格納されたプログラムコードを読み出す。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施例の機能を実現することになり、そのプログラムコード自体、及びそれを記憶した記憶媒体は本発明を構成することになる。このようなプログラムコードを供給するための記憶媒体としては、例えば、フレキシブルディスク、ＣＤ－ＲＯＭ、ＤＶＤ－ＲＯＭ、ハードディスク、ＳＳＤ、光ディスク、光磁気ディスク、ＣＤ－Ｒ、磁気テープ、不揮発性のメモリカード、ＲＯＭなどが用いられる。 Furthermore, some or all of the above-described configurations, functions, processing units, and processing means may be implemented in hardware, for example, by designing them as integrated circuits. The present invention can also be implemented by software program code that realizes the functions of the embodiments. In this case, a storage medium containing the program code is provided to a computer, and the computer's processor reads the program code stored on the storage medium. In this case, the program code read from the storage medium itself realizes the functions of the embodiments described above, and the program code itself, and the storage medium storing it, constitute the present invention. Examples of storage media used to supply such program code include flexible disks, CD-ROMs, DVD-ROMs, hard disks, SSDs, optical disks, magneto-optical disks, CD-Rs, magnetic tapes, non-volatile memory cards, and ROMs.

また、本実施例に記載の機能を実現するプログラムコードは、例えば、アセンブラ、Ｃ／Ｃ＋＋、ｐｅｒｌ、Ｓｈｅｌｌ、ＰＨＰ、Ｊａｖａ（登録商標）等の広範囲のプログラム又はスクリプト言語で実装できる。 Furthermore, the program code that implements the functions described in this embodiment can be implemented in a wide range of programming or scripting languages, such as assembler, C/C++, Perl, Shell, PHP, and Java®.

さらに、実施例の機能を実現するソフトウェアのプログラムコードを、ネットワークを介して配信することによって、それをコンピュータのハードディスクやメモリ等の記憶手段又はＣＤ－ＲＷ、ＣＤ－Ｒ等の記憶媒体に格納し、コンピュータが備えるプロセッサが当該記憶手段や当該記憶媒体に格納されたプログラムコードを読み出して実行するようにしてもよい。 Furthermore, the program code for the software that implements the functionality of the embodiment may be distributed via a network and stored in a storage means such as a computer's hard disk or memory, or in a storage medium such as a CD-RW or CD-R. The computer's processor may then read and execute the program code stored in the storage means or storage medium.

本明細書中に記載されている構成要素により実現される機能は、当該記載された機能を実現するようにプログラムされた、汎用プロセッサ、特定用途プロセッサ、集積回路、ASICs (Application Specific Integrated Circuits)、CPU (a Central Processing Unit)、従来型の回路、および／又はそれらの組合せを含む、circuitry又はprocessing circuitryにおいて実装されてもよい。プロセッサは、トランジスタやその他の回路を含み、circuitry又はprocessing circuitryとみなされる。プロセッサは、メモリに格納されたプログラムを実行する、programmed processorであってもよい。
本明細書において、circuitry、ユニット、手段は、記載された機能を実現するようにプログラムされたハードウェア、又は実行するハードウェアである。当該ハードウェアは、本明細書に開示されているあらゆるハードウェア、又は、当該記載された機能を実現するようにプログラムされた、又は、実行するものとして知られているあらゆるハードウェアであってもよい。
当該ハードウェアがcircuitryのタイプであるとみなされるプロセッサである場合、当該circuitry、手段、又はユニットは、ハードウェアと、当該ハードウェア及び又はプロセッサを構成するために用いられるソフトウェアの組合せである。 The functions realized by the components described herein may be implemented in a circuit or processing circuitry, including general-purpose processors, application-specific processors, integrated circuits, ASICs (Application Specific Integrated Circuits), CPUs (a Central Processing Unit), conventional circuits, and/or combinations thereof, programmed to realize the functions described herein. A processor is considered to be a circuit or processing circuitry, including transistors and other circuits. A processor may be a programmed processor that executes a program stored in memory.
In this specification, circuitry, unit, and means are hardware programmed to perform or execute the functions described herein. Such hardware may be any hardware disclosed herein, or any hardware known to be programmed to perform or execute the functions described herein.
If the hardware is a processor that is considered to be a type of circuitry, then the circuitry, means, or unit is a combination of hardware and software used to constitute the hardware and/or processor.

以上、本開示のいくつかの実施形態を説明したが、これら実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれると同様に、特許請求の範囲に記載された発明とその均等の範囲に含まれるものとする。 While several embodiments of this disclosure have been described above, these embodiments can be implemented in various other forms, and various omissions, substitutions, and modifications are permitted without departing from the spirit of the invention. These embodiments and their variations are included within the scope and spirit of the invention, as well as within the scope of the claims and its equivalents.

＜付記＞
以上の各実施形態で説明した事項を以下に付記する。
（付記１）
所定の空間に設置され、空間で生じる音声を収集する手段と、空間に設置され、音声を収集する手段により収集された音声の種類を分類し、分類された種類と音量とに基づき、盛り上がりに係る状態及び当該状態の度合いを決定する手段と、状態及び度合いを集計し、空間の盛り上がりの時間推移を算出する手段と、空間の盛り上がりの時間推移に基づく情報をユーザに提示する手段とを備えるシステム。
（付記２）
空間に設置され、空間内の画像を撮影する手段を備え、決定する手段は、撮影する手段により収集された画像に含まれる表情の種類を分類し、分類された種類と当該種類の表情の人数とに基づき、盛り上がりに係る状態及び当該状態の度合いを決定する、（付記１）に記載のシステム。
（付記３）
空間に設置され、空間内のユーザの位置を検出する手段を備え、決定する手段は、検出された数に基づき、盛り上がりに係る状態及び当該状態の度合いを決定する、（付記１）又は（付記２）に記載のシステム。
（付記４）
所定の空間はオフィス内の空間である、（付記１）から（付記３）のいずれかに記載のシステム。
（付記５）
所定の空間は会議室である、（付記１）から（付記３）のいずれかに記載のシステム。
（付記６）
空間における複数の集まりの盛り上がりの推移を比較し、盛り上がりに関する所定の傾向を取得する手段を備える、（付記１）から（付記５）のいずれかに記載のシステム。
（付記７）
傾向を取得する手段は、取得された傾向に基づく盛り上がりに関する改善提案を提示する、（付記６）に記載のシステム。
（付記８）
盛り上がりを促すためのアクションを受けた盛り上がりを示す指標値に基づき、アクションの盛り上がりに関する効果を検証する手段を備える、（付記１）から（付記７）のいずれかに記載のシステム。
（付記９）
複数の空間における盛り上がりを示す指標値に基づき、事業体全体の活性を表す指数を統計的に算出する手段を備える、（付記１）から（付記８）のいずれかにに記載のシステム。
（付記１０）
空間の盛り上がり指標値に応じた音楽を空間に流す手段を備える、（付記１）から（付記９）のいずれかにに記載のシステム。
（付記１１）
マイクと、解析装置と、サーバとを備えるシステムに実行される方法であって、マイクが、所定の空間に設置され、空間で生じる音声を収集するステップと、解析装置が、空間に設置され、音声を収集するステップおいて収集された音声の種類を分類し、分類された種類と音量とに基づき、盛り上がりに係る状態及び当該状態の度合いを決定するステップと、サーバが、状態及び度合いを集計し、空間の盛り上がりの時間推移を算出するステップと、サーバが、空間の盛り上がりの時間推移に基づく情報をユーザに提示するステップとを実行する方法。
（付記１２）
プロセッサと、メモリとを備える、所定の空間に設置されるコンピュータを動作させるためのプログラムであって、プログラムは、プロセッサに、空間に設置されるマイクによって収集される、空間で生じる音声を取得するステップと、取得された音声の種類を分類し、分類された種類と音量とに基づき、盛り上がりに係る状態及び当該状態の度合いを決定するステップと、を実行させるプログラム。
（付記１３）
制御部と、記憶部とを備える、所定の空間に設置される情報処理装置であって、制御部が、空間に設置されるマイクによって収集される、空間で生じる音声を取得するステップと、取得された音声の種類を分類し、分類された種類と音量とに基づき、盛り上がりに係る状態及び当該状態の度合いを決定するステップと、を実行する情報処理装置。 <Note>
The details described in each of the above embodiments are noted below.
(Note 1)
A system comprising: means installed in a predetermined space to collect sounds generated in that space; means installed in the space to classify the types of sounds collected by the sound-collecting means, and determining the state of excitement and the degree of that state based on the classified type and volume; means to aggregate the state and degree and calculate the time progression of excitement in the space; and means to present information based on the time progression of excitement in the space to the user.
(Note 2)
The system described in (Appendix 1), which is installed in a space and includes means for capturing images within the space, wherein the determining means classifies the types of facial expressions contained in the images collected by the capturing means, and determines the state of excitement and the degree of that state based on the classified types and the number of people with that type of facial expression.
(Note 3)
The system described in (Appendix 1) or (Appendix 2), which is installed in space and includes means for detecting the position of a user in the space, wherein the means for determining the position determines the state of elevation and the degree of said elevation based on the number of detected positions.
(Note 4)
The designated space is a space within the office, and the system is one of the systems described in (Appendix 1) to (Appendix 3).
(Note 5)
The designated space is a conference room, and the system is as described in any of (Appendix 1) to (Appendix 3).
(Note 6)
A system according to any one of (Appendix 1) to (Appendix 5), comprising means for comparing the progression of excitement of multiple gatherings in space and obtaining a predetermined trend regarding the excitement.
(Note 7)
The means for acquiring trends is the system described in (Appendix 6), which presents improvement suggestions regarding the excitement based on the acquired trends.
(Note 8)
A system according to any one of (Appendix 1) to (Appendix 7), comprising means for verifying the effect of an action on the level of excitement, based on an index value indicating the level of excitement in response to an action taken to promote excitement.
(Note 9)
A system as described in any of (Appendix 1) to (Appendix 8), comprising means for statistically calculating an index representing the activity of the entire business entity based on indicator values showing the level of activity in multiple spaces.
(Note 10)
A system described in any of (Appendix 1) to (Appendix 9), comprising means for playing music in a space according to an index value of the space's excitement.
(Note 11)
A method for performing operations on a system comprising a microphone, an analysis device, and a server, the method comprising: the steps of: the microphone being installed in a predetermined space and collecting sounds occurring in the space; the analysis device being installed in the space and classifying the types of sounds collected in the sound collection step, and determining the state of excitement and the degree of said state based on the classified type and volume; the server aggregating the states and degrees and calculating the time progression of excitement in the space; and the server presenting information based on the time progression of excitement in the space to the user.
(Note 12)
A program for operating a computer installed in a predetermined space, comprising a processor and memory, the program causing the processor to perform the steps of: acquiring sound generated in the space, collected by a microphone installed in the space; classifying the type of sound acquired; and determining a state related to excitement and the degree of said state based on the classified type and volume.
(Note 13)
An information processing device installed in a predetermined space, comprising a control unit and a storage unit, wherein the control unit performs the steps of: acquiring sound generated in the space, collected by a microphone installed in the space; classifying the type of sound acquired; and determining a state related to excitement and the degree of said state based on the classified type and volume.

１…システム
１０…端末装置
１２…通信ＩＦ
１３…入力装置
１４…出力装置
１５…メモリ
１６…ストレージ
１９…プロセッサ
２０…サーバ
２２…通信ＩＦ
２３…入出力ＩＦ
２５…メモリ
２６…ストレージ
２９…プロセッサ
３０…情報処理装置
３１…マイク
３２…撮影装置
３３…スピーカー
４０…ネットワーク
1...System 10...Terminal device 12...Communication IF
13...Input device 14...Output device 15...Memory 16...Storage 19...Processor 20...Server 22...Communication interface
23... Input/Output Interface
25...Memory 26...Storage 29...Processor 30...Information Processing Unit 31...Microphone 32...Photography Device 33...Speaker 40...Network

Claims

A system comprising a microphone, an analysis device, and a server,
The aforementioned microphone is
It is installed in a predetermined space and collects sound generated in the said space .
The aforementioned analysis device is
The aforementioned space is installed
A means for classifying the type of sound by inputting the collected sound into a trained model that has been trained to output the type of sound,
A means for determining the state of excitement and the degree of said state based on the classified type and the volume of the collected sound ,
A means for transmitting information regarding the determined state of the rise and the degree of said state to the server,
Equipped with,
The aforementioned server,
A means for aggregating the state and degree and calculating the time progression of the rise in the space,
A system comprising means for presenting to the user information based on the time progression of the rise in the aforementioned space.

The space is equipped with a camera that is installed in the space and captures images of the space,
The system according to claim 1, wherein the means for determining the above-mentioned means classifies the types of facial expressions contained in the image captured by the imaging device , and determines the state of excitement and the degree of that state based on the classified types and the number of people with that type of facial expression.

Installed in the aforementioned space and equipped with means for detecting the position of a user within the aforementioned space,
The system according to claim 1, wherein the means for determining the above-mentioned state of elevation and the degree of said state are determined based on the number detected.

The system according to claim 1, wherein the predetermined space is a space within an office.

The system according to claim 1, wherein the predetermined space is a conference room.

The system according to claim 1, wherein the server comprises means for comparing the progression of the rise of multiple gatherings in the space and obtaining a predetermined trend regarding the rise.

The system according to claim 6, wherein the means for acquiring the aforementioned trend presents improvement suggestions regarding the surge based on the acquired trend.

The system according to claim 1, wherein the server includes means for verifying the effect of an action on the level of excitement, based on an index value indicating the level of excitement after an action has been taken to promote excitement.

The system according to claim 1, wherein the server comprises means for statistically calculating an index representing the activity of the entire business entity based on indicator values showing the level of activity in multiple spaces.

The system according to claim 1, further comprising a speaker that plays music into the space according to an index value of the excitement level of the space.

A method performed on a system comprising a microphone, an analysis device, and a server,
The steps include: installing the microphone in a predetermined space and collecting sound generated in the space;
The analysis device is installed in the space, and the steps include: inputting the collected audio into a trained model that has been trained to output types of sounds, thereby classifying the types of sounds collected;
The analysis device performs the steps of determining the state of excitement and the degree of that state based on the classified type and the volume of the collected sound ,
The analysis device transmits information regarding the determined state of the elevation and the degree of that state to the server.
The server performs the steps of aggregating the state and degree and calculating the time progression of the rise in the space,
The server provides the user with information based on the time progression of the rise in the space,
How to do it.

A program for operating a computer installed in a predetermined space, comprising a processor and memory, wherein the program is configured to operate the processor,
The steps include acquiring sound generated in the space, collected by a microphone installed in the space,
The steps include: inputting the acquired audio into a trained model that has been trained to output the type of sound, thereby classifying the type of sound acquired;
A step of determining the state related to excitement and the degree of said state based on the classified type and the volume of the acquired sound ,
The steps include transmitting information regarding the determined state of the elevation and the degree of that state to an external device that aggregates and analyzes the information,
A program that executes the command.

An information processing device installed in a predetermined space, comprising a control unit and a storage unit, wherein the control unit is
The steps include acquiring sound generated in the space, collected by a microphone installed in the space,
The steps include: inputting the acquired audio into a trained model that has been trained to output the type of sound, thereby classifying the type of sound acquired;
A step of determining the state related to excitement and the degree of said state based on the classified type and the volume of the acquired sound ,
The steps include transmitting information regarding the determined state of the elevation and the degree of that state to an external device that aggregates and analyzes the information,
An information processing device that performs the following actions.