JP7586148B2

JP7586148B2 - Information processing device, method, and program

Info

Publication number: JP7586148B2
Application number: JP2022118310A
Authority: JP
Inventors: 航加来
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2022-07-25
Filing date: 2022-07-25
Publication date: 2024-11-19
Anticipated expiration: 2042-07-25
Also published as: JP2024015924A

Description

本開示は、情報処理装置、方法、及びプログラムに関する。 This disclosure relates to an information processing device, method, and program.

従来、音場シミュレーションに関する技術が知られている。例えば、特許文献１には、音波の反射経路の解析において、音波の反射方向の演算を高速化するため、所定の面は考慮せずに反射方向を演算する技術が開示されている。 Technologies related to sound field simulation are known in the past. For example, Patent Document 1 discloses a technology that calculates the reflection direction of a sound wave without taking into account a specific surface in order to speed up the calculation of the reflection direction of the sound wave in the analysis of the reflection path of the sound wave.

特開２００４－０７７３５２号公報JP 2004-077352 A

音場、特に街レベルの広い屋外音場を仮想三次元空間内で再現する音場シミュレーションでは、音源として多様な物体が存在するため計算量が膨大になる。しかし、従来は計算量の低減が不十分であり、音場シミュレーションに関する技術には改善の余地があった。 In sound field simulations that reproduce sound fields, particularly large outdoor sound fields at the level of a city, in a virtual three-dimensional space, the amount of calculations required is enormous due to the presence of a wide variety of objects as sound sources. However, in the past, there was insufficient reduction in the amount of calculations required, and there was room for improvement in sound field simulation technology.

かかる事情に鑑みてなされた本開示の目的は、音場シミュレーションに関する技術を改善することにある。 In light of these circumstances, the purpose of this disclosure is to improve technology related to sound field simulation.

本開示の一実施形態に係る情報処理装置は、
制御部を備える情報処理装置であって、
前記制御部は、
音場に対応する仮想三次元空間内の複数の音源のうち、複数の音源グループに分類された音源について、観察期間中に前記音場で取得された録音データに基づいて、各音源グループの周波数ごとの音圧レベルを示す音源データを取得し、
取得された音源データで示される音圧レベルに応じて、前記複数の音源グループの各音源グループにつき代表周波数帯を設定し、
前記複数の音源グループに分類された音源とは異なる評価対象音源から前記仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートするとともに、設定された各音源グループの代表周波数帯に限定して、各音源グループに属する音源から前記仮想三次元空間内を伝搬して前記１つの観測点に到達する音をシミュレートする。 An information processing device according to an embodiment of the present disclosure includes:
An information processing device including a control unit,
The control unit is
For a plurality of sound sources in a virtual three-dimensional space corresponding to a sound field, sound source data indicating a sound pressure level for each frequency of each sound source group is acquired based on recording data acquired in the sound field during an observation period, for the sound sources classified into a plurality of sound source groups.
setting a representative frequency band for each of the plurality of sound source groups according to a sound pressure level indicated by the acquired sound source data;
A sound is simulated that propagates through the virtual three-dimensional space from a sound source to be evaluated that is different from the sound sources classified into the plurality of sound source groups and reaches one observation point, and a sound is simulated that propagates through the virtual three-dimensional space from a sound source belonging to each sound source group and reaches the one observation point, limited to a representative frequency band of each set sound source group.

本開示の一実施形態に係る方法は、
情報処理装置が実行する方法であって、
音場に対応する仮想三次元空間内の複数の音源のうち、複数の音源グループに分類された音源について、観察期間中に前記音場で取得された録音データに基づいて、各音源グループの周波数ごとの音圧レベルを示す音源データを取得すること、
取得された音源データで示される音圧レベルに応じて、前記複数の音源グループの各音源グループの代表周波数帯を設定すること、及び
前記複数の音源グループに分類された音源とは異なる評価対象音源から前記仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートするとともに、設定された各音源グループの代表周波数帯に限定して、各音源グループに属する音源から前記仮想三次元空間内を伝搬して前記１つの観測点に到達する音をシミュレートすること
を含む。 According to one embodiment of the present disclosure, the method comprises:
A method executed by an information processing device, comprising:
acquiring sound source data indicating a sound pressure level for each frequency of each sound source group based on recording data acquired in the sound field during an observation period, for sound sources classified into a plurality of sound source groups among a plurality of sound sources in a virtual three-dimensional space corresponding to the sound field;
setting a representative frequency band of each of the plurality of sound source groups according to a sound pressure level indicated by the acquired sound source data; and simulating a sound that propagates through the virtual three-dimensional space from a sound source to be evaluated that is different from the sound sources classified into the plurality of sound source groups and reaches one observation point, and simulating a sound that propagates through the virtual three-dimensional space from a sound source belonging to each sound source group and reaches the one observation point, limited to the representative frequency band of each of the set sound source groups.

本開示の一実施形態に係るプログラムは、
情報処理装置に、
音場に対応する仮想三次元空間内の複数の音源のうち、複数の音源グループに分類された音源について、観察期間中に前記音場で取得された録音データに基づいて、各音源グループの周波数ごとの音圧レベルを示す音源データを取得すること、
取得された音源データで示される音圧レベルに応じて、前記複数の音源グループの各音源グループの代表周波数帯を設定すること、及び
前記複数の音源グループに分類された音源とは異なる評価対象音源から前記仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートするとともに、設定された各音源グループの代表周波数帯に限定して、各音源グループに属する音源から前記仮想三次元空間内を伝搬して前記１つの観測点に到達する音をシミュレートすること
を実行させる。 A program according to an embodiment of the present disclosure includes:
In the information processing device,
acquiring sound source data indicating a sound pressure level for each frequency of each sound source group based on recording data acquired in the sound field during an observation period, for sound sources classified into a plurality of sound source groups among a plurality of sound sources in a virtual three-dimensional space corresponding to the sound field;
setting a representative frequency band of each of the plurality of sound source groups according to a sound pressure level indicated by the acquired sound source data; and simulating a sound that propagates through the virtual three-dimensional space from a sound source to be evaluated that is different from the sound sources classified into the plurality of sound source groups and reaches one observation point, and simulating a sound that propagates through the virtual three-dimensional space from a sound source belonging to each sound source group and reaches the one observation point, limited to the representative frequency band of each of the set sound source groups.

本開示の一実施形態によれば、音場シミュレーションに関する技術が改善される。 According to one embodiment of the present disclosure, technology relating to sound field simulation is improved.

本開示の一実施形態に係るシステムの概略構成を示すブロック図である。1 is a block diagram showing a schematic configuration of a system according to an embodiment of the present disclosure. 記録装置の概略構成を示すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of a recording apparatus. 情報処理装置の概略構成を示すブロック図である。1 is a block diagram showing a schematic configuration of an information processing device; 情報処理装置の動作を示すフローチャートである。4 is a flowchart showing an operation of the information processing device. 仮想三次元空間の具体例を示す図である。FIG. 1 is a diagram showing a specific example of a virtual three-dimensional space.

以下、本開示の実施形態について説明する。 The following describes an embodiment of the present disclosure.

＜実施形態の概要＞
図１を参照して、本開示の実施形態に係るシステム１の概要について説明する。システム１は、記録装置１０と、情報処理装置２０とを備える。記録装置１０及び情報処理装置２０は、例えばインターネット及び移動体通信網等を含むネットワーク３０と通信可能に接続される。 <Overview of the embodiment>
An overview of a system 1 according to an embodiment of the present disclosure will be described with reference to Fig. 1. The system 1 includes a recording device 10 and an information processing device 20. The recording device 10 and the information processing device 20 are communicatively connected to a network 30 including, for example, the Internet and a mobile communication network.

記録装置１０は、例えばマイクロフォンを備えた録音装置であるが、これに限られず任意の装置であってよい。記録装置１０は、本実施形態では、複数の音源４１ａ，・・・，４１ｎが存在する音場４０に設置される。以下では、複数の音源４１ａ，・・・，４１ｎを「複数の音源４１」とまとめて表記することもある。音場４０は、本実施形態では屋外音場（例えば、街中の広場）であるが、屋内音場（例えば、工場の建屋内）であってもよい。記録装置１０が設置される音場４０内の位置は、本実施形態に係る１つの観測点に相当する。システム１が備える記録装置１０の数は、任意に定め得る。 The recording device 10 is, for example, a sound recording device equipped with a microphone, but is not limited to this and may be any device. In this embodiment, the recording device 10 is installed in a sound field 40 in which multiple sound sources 41a, ..., 41n exist. In the following, the multiple sound sources 41a, ..., 41n may be collectively referred to as "multiple sound sources 41". In this embodiment, the sound field 40 is an outdoor sound field (e.g., a city square), but it may also be an indoor sound field (e.g., inside a factory building). The position in the sound field 40 where the recording device 10 is installed corresponds to one observation point in this embodiment. The number of recording devices 10 provided in the system 1 may be determined arbitrarily.

情報処理装置２０は、例えばサーバ装置等のコンピュータである。情報処理装置２０は、ネットワーク３０を介して記録装置１０と通信可能である。情報処理装置２０は、ネットワーク３０を介して記録装置１０から、取得された音源データを受信する。 The information processing device 20 is, for example, a computer such as a server device. The information processing device 20 is capable of communicating with the recording device 10 via the network 30. The information processing device 20 receives acquired sound source data from the recording device 10 via the network 30.

まず、本実施形態の概要について説明し、詳細については後述する。情報処理装置２０は、音場に対応する仮想三次元空間内の複数の音源４１のうち、複数の音源グループに分類された音源について、観察期間中に音場４０で取得された、周波数ごとの音圧レベルを示す音源データを取得する。情報処理装置２０は、取得された音源データで示される音圧レベルに応じて、複数の音源グループの各音源グループの代表周波数帯を設定する。情報処理装置２０は、複数の音源グループに分類された音源とは異なる評価対象音源４２から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートするとともに、設定された各音源グループの代表周波数帯に限定して、各音源グループに属する音源から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートする。 First, an overview of this embodiment will be described, and details will be described later. The information processing device 20 acquires sound source data indicating the sound pressure level for each frequency acquired in the sound field 40 during an observation period for sound sources classified into multiple sound source groups among multiple sound sources 41 in a virtual three-dimensional space corresponding to a sound field. The information processing device 20 sets a representative frequency band for each of the multiple sound source groups according to the sound pressure level indicated by the acquired sound source data. The information processing device 20 simulates a sound that propagates through the virtual three-dimensional space from an evaluation target sound source 42 that is different from the sound sources classified into the multiple sound source groups and reaches one observation point, and also simulates a sound that propagates through the virtual three-dimensional space from a sound source belonging to each sound source group and reaches one observation point, limited to the representative frequency band of each set sound source group.

このように、本実施形態によれば、評価対象音源４２以外の音源である、各音源グループに属する音源の発する音を代表周波数帯に限定して、音場４０のシミュレーションが実行される。このため、例えば街レベルの広い屋外音場を仮想三次元空間内で再現する際、評価対象音源４２以外の音源の発する音について計算量を低減できる。したがって、音場の再現性を担保しつつ計算量の増加を抑制しやすくなる点で、音場シミュレーションに関する技術が改善される。 In this way, according to this embodiment, the sound emitted by sound sources belonging to each sound source group, which are sound sources other than the sound source 42 to be evaluated, is limited to a representative frequency band, and the sound field 40 is simulated. Therefore, when reproducing a wide outdoor sound field at the level of a city, for example, in a virtual three-dimensional space, the amount of calculations for sounds emitted by sound sources other than the sound source 42 to be evaluated can be reduced. Therefore, technology related to sound field simulation is improved in that it becomes easier to suppress an increase in the amount of calculations while ensuring the reproducibility of the sound field.

次に、システム１の各構成について詳細に説明する。 Next, each component of System 1 will be described in detail.

＜記録装置の構成＞
図２に示すように、記録装置１０は、音源データ取得部１１と、測位部１２と、通信部１３と、入力部１４と、出力部１５と、記憶部１６と、制御部１７と、を備える。 <Configuration of Recording Device>
As shown in FIG. 2, the recording device 10 includes a sound source data acquisition unit 11 , a positioning unit 12 , a communication unit 13 , an input unit 14 , an output unit 15 , a storage unit 16 , and a control unit 17 .

音源データ取得部１１は、例えばマイクロフォンであるが、これに限られず任意の装置であってよい。マイクロフォンは、モノラル録音又はステレオ録音等、任意の録音方式のマイクロフォンである。ステレオ録音は、バイノーラル録音を含む。バイノーラル録音とは、人間の頭部の音響効果を再現するダミーヘッド等の機器を用いて鼓膜に届く状態で音を記録する録音方式のことである。バイノーラル録音方式を採用する場合、音源データ取得部１１は、人間の頭部を模したダミーヘッド等の機器の両耳に組み込まれるマイクロフォンカプセル、又は人間の両耳に装着されるイヤフォン型マイクロフォンであり得る。 The sound source data acquisition unit 11 is, for example, a microphone, but is not limited to this and may be any device. The microphone is a microphone of any recording method, such as monaural recording or stereo recording. Stereo recording includes binaural recording. Binaural recording is a recording method in which sound is recorded in a state where it reaches the eardrum using equipment such as a dummy head that reproduces the acoustic effects of a human head. When the binaural recording method is adopted, the sound source data acquisition unit 11 may be a microphone capsule incorporated into both ears of equipment such as a dummy head that mimics a human head, or an earphone-type microphone worn on both human ears.

測位部１２は、記録装置１０の位置情報を取得する１つ以上の装置を含む。具体的には、測位部１２は、例えばＧＰＳ（Global Positioning System）に対応する受信機を含むが、これに限られず、任意の衛星測位システムに対応する受信機を含んでもよい。 The positioning unit 12 includes one or more devices that acquire location information of the recording device 10. Specifically, the positioning unit 12 includes, for example, a receiver compatible with the Global Positioning System (GPS), but is not limited to this and may include a receiver compatible with any satellite positioning system.

通信部１３は、ネットワーク３０に接続する１つ以上の通信インタフェースを含む。当該通信インタフェースは、例えば４Ｇ（4th Generation）又は５Ｇ（5th Generation）等の移動体通信規格に対応するが、これらに限られない。本実施形態において、記録装置１０は、通信部１３及びネットワーク３０を介して情報処理装置２０と通信する。 The communication unit 13 includes one or more communication interfaces that connect to the network 30. The communication interfaces correspond to mobile communication standards such as, but are not limited to, 4G (4th Generation) or 5G (5th Generation). In this embodiment, the recording device 10 communicates with the information processing device 20 via the communication unit 13 and the network 30.

入力部１４は、少なくとも１つの入力用インタフェースを含む。入力用インタフェースは、例えば、物理キー、静電容量キー、ポインティングデバイス、又はディスプレイと一体的に設けられたタッチスクリーンである。入力部１４は、記録装置１０の動作に用いられるデータを入力する操作を受け付ける。入力部１４は、記録装置１０に備えられる代わりに、外部の入力機器として記録装置１０に接続されてもよい。接続用インタフェースとしては、例えば、ＵＳＢ（Universal Serial Bus）、ＨＤＭＩ（High-Definition Multimedia Interface）（登録商標）、又はBluetooth（登録商標）等の規格に対応したインタフェースを用いることができる。また、入力部１４は、音場４０の状況を示す情報を取得可能な任意のセンサモジュールも含む。音場４０の状況を示す情報は、例えば複数の音源４１のそれぞれに関する、音場４０内の位置（以下、「音の発出位置」ともいう。）及び１つの観測点との距離（以下、「音の到達距離」ともいう。）等を示す情報を含む。例えば、センサモジュールは、カメラ、赤外線センサ、ＬｉＤＡＲ（light detection and ranging）等の距離センサ、又はこれらの組合せを含む。 The input unit 14 includes at least one input interface. The input interface is, for example, a physical key, a capacitance key, a pointing device, or a touch screen integral with a display. The input unit 14 accepts an operation to input data used in the operation of the recording device 10. The input unit 14 may be connected to the recording device 10 as an external input device instead of being provided in the recording device 10. As the connection interface, for example, an interface corresponding to a standard such as USB (Universal Serial Bus), HDMI (High-Definition Multimedia Interface) (registered trademark), or Bluetooth (registered trademark) can be used. The input unit 14 also includes any sensor module capable of acquiring information indicating the state of the sound field 40. The information indicating the state of the sound field 40 includes, for example, information indicating the position in the sound field 40 (hereinafter also referred to as the "sound emission position") and the distance from one observation point (hereinafter also referred to as the "sound reach distance") for each of the multiple sound sources 41. For example, the sensor module may include a camera, an infrared sensor, a distance sensor such as LiDAR (light detection and ranging), or a combination thereof.

出力部１５は、情報を出力してユーザに通知する１つ以上の出力装置を含んでもよい。当該出力装置は、例えば情報を音声で出力するスピーカ、又は情報を画像若しくは映像で出力するディスプレイ等であるが、これらに限られない。出力部１５は、外部の出力装置を接続するためのインタフェースを含んでもよい。 The output unit 15 may include one or more output devices that output information to notify the user. Examples of the output devices include, but are not limited to, speakers that output information as audio, or displays that output information as images or videos. The output unit 15 may include an interface for connecting an external output device.

記憶部１６は、１つ以上のメモリを含む。メモリは、例えば半導体メモリ、磁気メモリ、又は光メモリ等であるが、これらに限られない。記憶部１６に含まれる各メモリは、例えば主記憶装置、補助記憶装置、又はキャッシュメモリとして機能してもよい。記憶部１６は、記録装置１０の動作に用いられる任意の情報を記憶する。例えば、記憶部１６は、システムプログラム、アプリケーションプログラム、及び組み込みソフトウェア等を記憶してもよい。記憶部１６に記憶された情報は、例えば通信部１３を介してネットワーク３０から取得される情報で更新可能であってもよい。 The storage unit 16 includes one or more memories. The memories are, for example, but not limited to, semiconductor memories, magnetic memories, or optical memories. Each memory included in the storage unit 16 may function, for example, as a main storage device, an auxiliary storage device, or a cache memory. The storage unit 16 stores any information used in the operation of the recording device 10. For example, the storage unit 16 may store system programs, application programs, embedded software, and the like. The information stored in the storage unit 16 may be updatable, for example, with information obtained from the network 30 via the communication unit 13.

制御部１７は、１つ以上のプロセッサ、１つ以上のプログラマブル回路、１つ以上の専用回路、又はこれらの組合せを含む。プロセッサは、例えばＣＰＵ（Central Processing Unit）若しくはＧＰＵ（Graphics Processing Unit）等の汎用プロセッサ、又は特定の処理に特化した専用プロセッサであるがこれらに限られない。プログラマブル回路は、例えばＦＰＧＡ（Field-Programmable Gate Array）であるがこれに限られない。専用回路は、例えばＡＳＩＣ（Application Specific Integrated Circuit）であるがこれに限られない。制御部１７は、記録装置１０全体の動作を制御する。 The control unit 17 includes one or more processors, one or more programmable circuits, one or more dedicated circuits, or a combination of these. The processor is, for example, but is not limited to, a general-purpose processor such as a CPU (Central Processing Unit) or a GPU (Graphics Processing Unit), or a dedicated processor specialized for a specific process. The programmable circuit is, for example, but is not limited to, an FPGA (Field-Programmable Gate Array). The dedicated circuit is, for example, but is not limited to, an ASIC (Application Specific Integrated Circuit). The control unit 17 controls the operation of the entire recording device 10.

＜情報処理装置の構成＞
図３に示すように、情報処理装置２０は、通信部２１と、入力部２２と、出力部２３と、記憶部２４と、制御部２５と、を備える。 <Configuration of information processing device>
As shown in FIG. 3, the information processing device 20 includes a communication unit 21, an input unit 22, an output unit 23, a storage unit 24, and a control unit 25.

通信部２１は、ネットワーク３０に接続する１つ以上の通信インタフェースを含む。当該通信インタフェースは、例えば移動体通信規格、有線ＬＡＮ（Local Area Network）規格、又は無線ＬＡＮ規格に対応するが、これらに限られず、任意の通信規格に対応してもよい。本実施形態において、情報処理装置２０は、通信部２１及びネットワーク３０を介して記録装置１０と通信する。なお、情報処理装置２０は、通信部２１を介して、本実施形態に係る仮想三次元空間内の複数の音源のうち１つ以上の音源と、ネットワーク３０を介して又は直接通信してもよい。 The communication unit 21 includes one or more communication interfaces that connect to the network 30. The communication interface corresponds to, for example, a mobile communication standard, a wired LAN (Local Area Network) standard, or a wireless LAN standard, but is not limited to these, and may correspond to any communication standard. In this embodiment, the information processing device 20 communicates with the recording device 10 via the communication unit 21 and the network 30. Note that the information processing device 20 may communicate with one or more of the multiple sound sources in the virtual three-dimensional space according to this embodiment via the communication unit 21, via the network 30 or directly.

入力部２２は、少なくとも１つの入力用インタフェースを含む。入力用インタフェースは、例えば、物理キー、静電容量キー、ポインティングデバイス、又はディスプレイと一体的に設けられたタッチスクリーンである。入力部２２は、情報処理装置２０の動作に用いられるデータを入力する操作を受け付ける。入力部２２は、情報処理装置２０に備えられる代わりに、外部の入力機器として情報処理装置２０に接続されてもよい。接続用インタフェースとしては、例えば、ＵＳＢ、ＨＤＭＩ（登録商標）、又はBluetooth（登録商標）等の規格に対応したインタフェースを用いることができる。 The input unit 22 includes at least one input interface. The input interface is, for example, a physical key, a capacitive key, a pointing device, or a touch screen that is integral with a display. The input unit 22 accepts an operation to input data used in the operation of the information processing device 20. Instead of being provided in the information processing device 20, the input unit 22 may be connected to the information processing device 20 as an external input device. As the connection interface, for example, an interface compatible with standards such as USB, HDMI (registered trademark), or Bluetooth (registered trademark) can be used.

出力部２３は、情報を出力してユーザに通知する１つ以上の出力装置を含んでもよい。当該出力装置は、例えば情報を音声で出力するスピーカ、又は情報を画像若しくは映像で出力するディスプレイ等であるが、これらに限られない。出力部２３は、外部の出力装置を接続するためのインタフェースを含んでもよい。 The output unit 23 may include one or more output devices that output information to notify the user. Examples of the output devices include, but are not limited to, speakers that output information as audio, or displays that output information as images or videos. The output unit 23 may include an interface for connecting an external output device.

記憶部２４は、１つ以上のメモリを含む。記憶部２４に含まれる各メモリは、例えば主記憶装置、補助記憶装置、又はキャッシュメモリとして機能してもよい。記憶部２４は、情報処理装置２０の動作に用いられる任意の情報を記憶する。例えば、記憶部２４は、システムプログラム、アプリケーションプログラム、データベース、及び地図情報等を記憶してもよい。記憶部２４に記憶された情報は、例えば通信部２１を介してネットワーク３０から取得される情報で更新可能であってもよい。 The storage unit 24 includes one or more memories. Each memory included in the storage unit 24 may function, for example, as a main storage device, an auxiliary storage device, or a cache memory. The storage unit 24 stores any information used in the operation of the information processing device 20. For example, the storage unit 24 may store system programs, application programs, databases, map information, and the like. The information stored in the storage unit 24 may be updatable, for example, with information obtained from the network 30 via the communication unit 21.

制御部２５は、１つ以上のプロセッサ、１つ以上のプログラマブル回路、１つ以上の専用回路、又はこれらの組合せを含む。制御部２５は、情報処理装置２０全体の動作を制御する。 The control unit 25 includes one or more processors, one or more programmable circuits, one or more dedicated circuits, or a combination of these. The control unit 25 controls the operation of the entire information processing device 20.

本実施形態において、情報処理装置２０の記憶部２４には、音源データベースが記憶される。音源データベースには、複数の音源グループに分類された音源について、観察期間中に音場４０で取得された、周波数ごとの音圧レベルを示す音源データが記憶される。 In this embodiment, a sound source database is stored in the storage unit 24 of the information processing device 20. The sound source database stores sound source data indicating the sound pressure level for each frequency acquired in the sound field 40 during the observation period for sound sources classified into multiple sound source groups.

音源データベースにおいて、各音源は、対応する音源グループに分類される。音源の分類の粒度は、任意に設定可能であるが、本実施形態では計算量低減の観点から粗く設定される。例えば、情報処理装置２０の制御部２５は、「車両」、「自転車」、「人」、及び「自然音」という音源グループを設定してもよい。「自然音」は、例えば風雨等の音が属する音源グループである。一方、音源の分類の粒度は、音場４０の再現性の観点からは、細かいほど好ましい。例えば、音源グループ「人」を年齢又は性別ごとに細分化してもよい（「子供」及び「大人」又は「男性」及び「女性」等）。ただし、細分化するほど計算量が増加する傾向があるので、音源の分類の粒度は、音場４０の再現性と計算量とのバランスを考慮して適宜決定し得る。そして、制御部２５は、設定された音源グループの何れかに各音源を分類し、各音源にそれぞれの属する音源グループを示すフラグを付けた情報を音源データベースに記憶することで、各音源を対応する音源グループに分類し得る。 In the sound source database, each sound source is classified into a corresponding sound source group. The granularity of the sound source classification can be set arbitrarily, but in this embodiment, it is set roughly from the viewpoint of reducing the amount of calculation. For example, the control unit 25 of the information processing device 20 may set sound source groups of "vehicles", "bicycles", "people", and "natural sounds". "Natural sounds" is a sound source group to which sounds such as wind and rain belong. On the other hand, from the viewpoint of reproducibility of the sound field 40, the finer the granularity of the sound source classification, the more preferable it is. For example, the sound source group "people" may be subdivided by age or gender (such as "children" and "adults" or "males" and "females"). However, since the amount of calculation tends to increase as the classification is subdivided, the granularity of the sound source classification can be appropriately determined taking into consideration the balance between the reproducibility of the sound field 40 and the amount of calculation. Then, the control unit 25 classifies each sound source into one of the set sound source groups, and stores information in which each sound source is flagged to indicate the sound source group to which it belongs in the sound source database, thereby classifying each sound source into the corresponding sound source group.

このように設定された音源グループごとに、その音源グループを代表する音源データが作成される。音源データの作成には、任意の手法が採用可能である。例えば、記録装置１０の制御部１７は、観察期間中に音源データ取得部１１に入力された信号をアナログデジタル変換して複数の音源４１のそれぞれから発せられた音を表す録音データを作成してもよい。制御部１７は、ネットワーク３０及び通信部１３を介して情報処理装置２０の通信部２１に、各音源の録音データを送信してもよい。情報処理装置２０の制御部２５は、通信部２１を介して各音源の録音データが受信される度に、各音源の録音データを、各音源の属する音源グループの音サンプルとして音源データベースに蓄積してもよい。そして、制御部２５は、各音源グループの音サンプルを使用した実験に基づいて、各音源グループの音源データを作成してもよい。実験として、例えば、制御部２５は、ある音源グループについて蓄積された複数の音サンプルを、出力部２３を介して出力してもよい。ユーザ（実験参加者）は、各音サンプルと、実際の音場４０で聴こえる音とを聴き比べて、音場４０で聴こえる音に最も近い音サンプルを特定し、入力部２２を介して、特定された音サンプルの識別情報を情報処理装置２０に入力してもよい。制御部２５は、入力された識別情報で特定される音サンプルを、対応する音源グループの音源データとして、音源データベースに記憶する。この実験を各音源グループについて繰り返すことで、各音源グループの音源データが作成され、音源データベースに記憶され得る。本実施形態のように音源の分類の粒度を粗くする場合、各音源グループの音源データは、その音源グループに含まれる音源の典型的な音を示す。例えば、製造業者の異なる様々な車種の自転車が１つの音源グループ「自転車」に属する場合、当該音源グループの音は、種々の自転車の典型的な音を示す。 For each sound source group set in this way, sound source data representative of the sound source group is created. Any method can be adopted for creating the sound source data. For example, the control unit 17 of the recording device 10 may convert the signal input to the sound source data acquisition unit 11 during the observation period into analog digital to create recording data representing the sounds emitted from each of the multiple sound sources 41. The control unit 17 may transmit the recording data of each sound source to the communication unit 21 of the information processing device 20 via the network 30 and the communication unit 13. The control unit 25 of the information processing device 20 may accumulate the recording data of each sound source in the sound source database as a sound sample of the sound source group to which each sound source belongs each time the recording data of each sound source is received via the communication unit 21. Then, the control unit 25 may create sound source data of each sound source group based on an experiment using the sound sample of each sound source group. As an experiment, for example, the control unit 25 may output a plurality of sound samples accumulated for a certain sound source group via the output unit 23. The user (experiment participant) may compare each sound sample with the sound heard in the actual sound field 40, identify the sound sample that is closest to the sound heard in the sound field 40, and input the identification information of the identified sound sample to the information processing device 20 via the input unit 22. The control unit 25 stores the sound sample identified by the input identification information in the sound source database as sound source data of the corresponding sound source group. By repeating this experiment for each sound source group, sound source data for each sound source group can be created and stored in the sound source database. When the granularity of the sound source classification is coarse as in this embodiment, the sound source data of each sound source group indicates a typical sound of the sound source included in that sound source group. For example, when various models of bicycles from different manufacturers belong to one sound source group "bicycles", the sound of the sound source group indicates a typical sound of various bicycles.

このようにして、複数の音源グループに分類された音源について、観察期間中に音場４０で取得された録音データに基づいて、各音源グループの周波数ごとの音圧レベルを示す音源データが作成され、音源データベースに記憶される。制御部２５は、以下で説明する図４の動作を実行する度に、音源グループをクエリとして音源データベースを検索することで、音場４０のシミュレーションに使用される音源データを取得し得る。 In this way, for sound sources classified into multiple sound source groups, sound source data indicating the sound pressure level for each frequency of each sound source group is created based on the recording data acquired in the sound field 40 during the observation period, and stored in the sound source database. Each time the control unit 25 executes the operation of FIG. 4 described below, it can obtain the sound source data used in the simulation of the sound field 40 by searching the sound source database using the sound source group as a query.

＜情報処理装置の動作フロー＞
図４を参照して、本実施形態に係る情報処理装置２０の動作について説明する。図４の動作は、本実施形態に係る方法に相当する。図４の動作は、例えば所定の周期で、繰り返し実行される。所定の周期は、任意に定め得る。 <Operation flow of information processing device>
The operation of the information processing device 20 according to the present embodiment will be described with reference to Fig. 4. The operation of Fig. 4 corresponds to a method according to the present embodiment. The operation of Fig. 4 is repeatedly executed, for example, at a predetermined cycle. The predetermined cycle can be arbitrarily determined.

ステップＳ１００：情報処理装置２０の制御部２５は、音場４０に対応する仮想三次元空間内の複数の音源のうち、複数の音源グループに分類された音源について、観察期間中に音場４０で取得された録音データに基づいて、各音源グループの周波数ごとの音圧レベルを示す音源データを取得する。 Step S100: The control unit 25 of the information processing device 20 acquires sound source data indicating the sound pressure level for each frequency of each sound source group based on the recording data acquired in the sound field 40 during the observation period for the sound sources classified into multiple sound source groups among the multiple sound sources in the virtual three-dimensional space corresponding to the sound field 40.

具体的には、まず、制御部２５は、音場４０に対応する仮想三次元空間を作成する。仮想三次元空間の作成には、任意の手法が採用可能である。例えば、制御部２５は、ネットワーク３０及び通信部２１を介して記録装置１０の通信部１３から、第１情報及び第２情報を受信する。第１情報は、記録装置１０の測位部１２を介して取得された、記録装置１０が設置された音場４０内の位置、すなわち本実施形態に係る１つの観測点の位置を示す位置情報である。第２情報は、記録装置１０の入力部１４を介して取得された、音場４０の状況を示す情報、具体的には、複数の音源４１の位置情報（すなわち、音の発出位置を示す情報）である。制御部２５は、第１情報及び第２情報に基づいて、複数の音源４１のそれぞれにつき、１つの観測点との距離（すなわち、音の到達距離）を算出する。音の発出位置及び音の到達距離は、本実施形態では三次元座標から算出されるが、これに限られず任意の手法が採用可能である。制御部２５は、１つの観測点の位置並びに算出された音の発出位置及び音の到達距離に基づいて、音場４０に対応する仮想三次元空間を作成する。例えば、制御部２５は、図５に示すような仮想三次元空間ＶＳを作成してもよい。点Ｐは、本実施形態の１つの観測点に相当する。音源４１ａ、４１ｂ、４１ｃは、本実施形態の複数の音源に相当する。 Specifically, first, the control unit 25 creates a virtual three-dimensional space corresponding to the sound field 40. Any method can be adopted to create the virtual three-dimensional space. For example, the control unit 25 receives the first information and the second information from the communication unit 13 of the recording device 10 via the network 30 and the communication unit 21. The first information is position information indicating the position in the sound field 40 where the recording device 10 is installed, that is, the position of one observation point according to this embodiment, acquired via the positioning unit 12 of the recording device 10. The second information is information indicating the situation of the sound field 40, specifically, position information of the multiple sound sources 41 (i.e., information indicating the emission position of the sound), acquired via the input unit 14 of the recording device 10. The control unit 25 calculates the distance (i.e., the reach of the sound) from one observation point for each of the multiple sound sources 41 based on the first information and the second information. In this embodiment, the emission position and the reach of the sound are calculated from three-dimensional coordinates, but any method can be adopted without being limited to this. The control unit 25 creates a virtual three-dimensional space corresponding to the sound field 40 based on the position of one observation point and the calculated sound emission position and sound arrival distance. For example, the control unit 25 may create a virtual three-dimensional space VS as shown in FIG. 5. Point P corresponds to one observation point in this embodiment. Sound sources 41a, 41b, and 41c correspond to multiple sound sources in this embodiment.

次に、制御部２５は、記憶部２４に格納された音源データベースを参照して、音場４０を再現するシミュレーションに使用される複数の音源を選択する。図５に示す例では、シミュレーションに使用される音源として、音源４１ａ（自動運転車両）、音源４１ｂ（ロードバイク）、及び音源４１ｃ（男性）を含めた複数の音源が選択される。制御部２５は、各音源に対応する音源グループを特定する。図５に示す例では、音源４１ａの音源グループとして「車両」、音源４１ｂの音源グループとして「自転車」、音源４１ｃの音源グループとして「人」が特定される。上述したとおり、音源データベースには、音場４０に対応する仮想三次元空間内の複数の音源４１のうち、複数の音源グループに分類された音源について、観察期間中に音場４０で取得された録音データに基づいて作成された、各音源グループの周波数ごとの音圧レベルを示す音源データが蓄積されている。したがって、制御部２５は、特定された音源グループをクエリとして音源データベースを検索することで、仮想三次元空間内の各音源について、音源データを取得し得る。図５に示す例では、仮想三次元空間ＶＳ内の音源４１ａ、４１ｂ、４１ｃを含む各音源について、音源データが取得される。 Next, the control unit 25 refers to the sound source database stored in the storage unit 24 and selects multiple sound sources to be used in the simulation to reproduce the sound field 40. In the example shown in FIG. 5, multiple sound sources including sound source 41a (automatic driving vehicle), sound source 41b (road bike), and sound source 41c (man) are selected as the sound sources to be used in the simulation. The control unit 25 identifies a sound source group corresponding to each sound source. In the example shown in FIG. 5, "vehicle" is identified as the sound source group of sound source 41a, "bicycle" is identified as the sound source group of sound source 41b, and "person" is identified as the sound source group of sound source 41c. As described above, the sound source database accumulates sound source data indicating the sound pressure level for each frequency of each sound source group, which was created based on the recording data acquired in the sound field 40 during the observation period, for the sound sources classified into multiple sound source groups among the multiple sound sources 41 in the virtual three-dimensional space corresponding to the sound field 40. Therefore, the control unit 25 can obtain sound source data for each sound source in the virtual three-dimensional space by searching the sound source database using the identified sound source group as a query. In the example shown in FIG. 5, sound source data is obtained for each sound source, including sound sources 41a, 41b, and 41c, in the virtual three-dimensional space VS.

このようにして、制御部２５は、音場４０に対応する仮想三次元空間内の複数の音源のうち、複数の音源グループに分類された音源について、観察期間中に音場４０で取得された録音データに基づいて、各音源グループの周波数ごとの音圧レベルを示す音源データを取得する。これにより、音場４０を再現するシミュレーションに使用される、評価対象音源４２以外の各音源の音源データが取得される。音源データベースには、複数の音源４１に加えて、観察期間中に音場４０で取得された、評価対象音源４２の人工音以外の音（後述）の音源データも蓄積し、取得されてもよい。しかしながら、音源データの取得には、これらの例に限られず任意の手法が採用可能である。 In this way, the control unit 25 acquires sound source data indicating the sound pressure level for each frequency of each sound source group for the sound sources classified into multiple sound source groups out of the multiple sound sources in the virtual three-dimensional space corresponding to the sound field 40, based on the recording data acquired in the sound field 40 during the observation period. In this way, sound source data for each sound source other than the evaluation target sound source 42, which is used in the simulation of reproducing the sound field 40, is acquired. In addition to the multiple sound sources 41, the sound source database may also store and acquire sound source data for sounds (described later) other than the artificial sound of the evaluation target sound source 42, which was acquired in the sound field 40 during the observation period. However, the acquisition of sound source data is not limited to these examples and any method can be adopted.

ステップＳ１０１：制御部２５は、ステップＳ１００で取得された音源データで示される音圧レベルに応じて、複数の音源グループの各音源グループの代表周波数帯を設定する。 Step S101: The control unit 25 sets a representative frequency band for each of the multiple sound source groups according to the sound pressure level indicated by the sound source data acquired in step S100.

具体的には、制御部２５は、記憶部２４に格納された音源データベースを参照して、ステップＳ１００で設定された各音源グループにつき、各音源グループの音源データの全体の周波数範囲から特徴的な周波数帯を抽出する。制御部２５は、抽出された周波数帯を各音源グループの代表周波数帯として設定する。特徴的な周波数帯の抽出には、任意の手法が採用可能である。例えば、制御部２５は、音源データベースを検索して、ある音源グループの音源データで示される周波数ごとの音圧レベルを参照する。そして、制御部２５は、音圧レベルのピークが現れる周波数を中心とした所定の範囲の周波数を、当該音源グループの特徴的な周波数帯として設定してもよい。図５に示す例では、制御部２５は、音源４１ａの属する音源グループである「車両」につき、音源データで示される周波数ごとの音圧レベルを参照する。そして、制御部２５は、音圧レベルのピークが現れる周波数を中心とした所定の範囲の周波数を、音源グループ「車両」の特徴的な周波数帯として設定してもよい。所定の範囲は、任意に設定し得るが、以下に具体例を示す。 Specifically, the control unit 25 refers to the sound source database stored in the storage unit 24 and extracts a characteristic frequency band from the entire frequency range of the sound source data of each sound source group for each sound source group set in step S100. The control unit 25 sets the extracted frequency band as the representative frequency band of each sound source group. Any method can be adopted for extracting the characteristic frequency band. For example, the control unit 25 searches the sound source database and refers to the sound pressure level for each frequency indicated in the sound source data of a certain sound source group. Then, the control unit 25 may set a predetermined range of frequencies centered on a frequency at which a peak of the sound pressure level appears as the characteristic frequency band of the sound source group. In the example shown in FIG. 5, the control unit 25 refers to the sound pressure level for each frequency indicated in the sound source data for the sound source group "vehicle" to which the sound source 41a belongs. Then, the control unit 25 may set a predetermined range of frequencies centered on a frequency at which a peak of the sound pressure level appears as the characteristic frequency band of the sound source group "vehicle". The predetermined range can be set arbitrarily, but a specific example is shown below.

図５には、縦軸を音圧、横軸を周波数とする２つのグラフが示されている。右側のグラフは、音源４１ａが属する音源グループ「車両」の音源データで示される、周波数ごとの音圧レベルの値をプロットしたグラフである。グラフの右側半分に相当する周波数帯（以下、「高い方の周波数帯」ともいう。）の音圧レベルと、グラフの左側半分に相当する周波数帯（以下、「低い方の周波数帯」ともいう。）の音圧レベルとを比較すると、低い方の周波数帯（ハッチング範囲）に音圧レベルのピークが現れている。したがって、制御部２５は、低い方の周波数帯を、音源グループ「車両」の特徴的な周波数帯として設定し得る。なお、周波数帯の境界は、例えば、最大の音圧レベルから所定のデシベル（ｄＢ）下がった周波数として設定することができる。左側のグラフは、音源４１ｂの属する音源グループである「自転車」の音源データで示される、周波数ごとの音圧レベルの値をプロットしたグラフである。高い方の周波数帯の音圧レベルと、低い方の周波数帯の音圧レベルとを比較すると、高い方の周波数帯（ハッチング範囲）に音圧レベルのピークが現れている。したがって、制御部２５は、高い方の周波数帯を、音源グループ「自転車」の特徴的な周波数帯として設定し得る。図示の便宜上、音源４１ｃのグラフは省略しているが、音源４１ａ、４１ｂと同様に特徴的な周波数帯を設定し得る。 5 shows two graphs with sound pressure on the vertical axis and frequency on the horizontal axis. The graph on the right plots the sound pressure level values for each frequency indicated by the sound source data of the sound source group "vehicle" to which the sound source 41a belongs. When comparing the sound pressure level of the frequency band corresponding to the right half of the graph (hereinafter also referred to as the "higher frequency band") with the sound pressure level of the frequency band corresponding to the left half of the graph (hereinafter also referred to as the "lower frequency band"), a peak in the sound pressure level appears in the lower frequency band (hatched range). Therefore, the control unit 25 can set the lower frequency band as the characteristic frequency band of the sound source group "vehicle". Note that the boundary between the frequency bands can be set, for example, as a frequency that is a predetermined decibel (dB) lower than the maximum sound pressure level. The graph on the left plots the sound pressure level values for each frequency indicated by the sound source data of the sound source group "bicycle" to which the sound source 41b belongs. When comparing the sound pressure level of the higher frequency band with the sound pressure level of the lower frequency band, a peak in the sound pressure level appears in the higher frequency band (hatched area). Therefore, the control unit 25 can set the higher frequency band as the characteristic frequency band of the sound source group "bicycle." For convenience of illustration, the graph for sound source 41c is omitted, but a characteristic frequency band can be set in the same way as for sound sources 41a and 41b.

このようにして、制御部２５は、取得された音源データで示される音圧レベルに応じて、複数の音源グループの各音源グループの代表周波数帯を設定する。 In this way, the control unit 25 sets the representative frequency band of each of the multiple sound source groups according to the sound pressure level indicated by the acquired sound source data.

ステップＳ１０２：制御部２５は、複数の音源４１のうち、複数の音源グループに分類された音源とは異なる音源である評価対象音源４２から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートする。 Step S102: The control unit 25 simulates a sound that propagates through a virtual three-dimensional space from an evaluation target sound source 42, which is a sound source that is different from the sound sources classified into the multiple sound source groups among the multiple sound sources 41, and reaches one observation point.

本実施形態において、評価対象音源４２は、人工音及び人工音以外の音を発するロボットである。図５に示す例では、評価対象音源４２であるロボットは、仮想三次元空間ＶＳ内の点Ｑに位置する自動搬送ロボットである。しかしながら、ロボットは、これに限られず任意のロボットであってよい。制御部２５は、ロボットについて作成された音を人工音としてシミュレートする。図５に示す例では、制御部２５は、点Ｑに位置する自動搬送ロボットから仮想三次元空間ＶＳ内を伝搬して１つの観測点に到達する音として、自動搬送ロボットについて作成された音Ｓ０を、人工音としてシミュレートする。人工音は、本実施形態ではロボットの挙動（例えば、発進、停止、曲がる方向等）を人間に理解させやすくする音であるが、これに限られず、例えば人間の声を模した合成音声を含んでもよい。人工音の作成には、任意の手法が採用可能である。例えば、人工音は、数学計算に基づいて人間が気付きやすい音色を求め、当該音色を元に作成されたメロディであってもよい。 In this embodiment, the sound source 42 to be evaluated is a robot that emits artificial sounds and sounds other than artificial sounds. In the example shown in FIG. 5, the robot that is the sound source 42 to be evaluated is an automatic transport robot located at point Q in the virtual three-dimensional space VS. However, the robot is not limited to this and may be any robot. The control unit 25 simulates the sound created for the robot as an artificial sound. In the example shown in FIG. 5, the control unit 25 simulates the sound S0 created for the automatic transport robot as an artificial sound, as a sound that propagates from the automatic transport robot located at point Q through the virtual three-dimensional space VS to reach one observation point. In this embodiment, the artificial sound is a sound that makes it easier for humans to understand the behavior of the robot (e.g., starting, stopping, turning direction, etc.), but is not limited to this and may include, for example, a synthetic voice that imitates a human voice. Any method can be used to create the artificial sound. For example, the artificial sound may be a melody created based on a tone that is easily noticeable to humans based on mathematical calculations.

また、ロボットについて作成された音を人工音としてシミュレートする際は、複数の人工音をシミュレートすることが望ましい。多数の人工音をシミュレートするほど、実際の音場４０でロボットからどのような音を人間に表出すれば安全が図りやすくなるか（例えば、人間がロボットの接近に気付きやすくする音）を判定する手掛かりが増える。その結果、人工音のシミュレーションの信頼性を高めやすくなる。さらに、ロボットが発する人工音以外の音（例えば、走行音又は冷蔵システムの音）の影響も考慮することが望ましい。人工音以外の音の影響も考慮した音場シミュレーションを行えば、音場４０の再現性がより高まり、音場４０の再現性が高まるほど、実際の音場４０で人間に理解されやすい人工音の周波数が特定しやすくなる。 When simulating sounds created for a robot as artificial sounds, it is desirable to simulate multiple artificial sounds. The more artificial sounds that are simulated, the more clues there are to determine what kind of sounds the robot should emit to humans in the actual sound field 40 to ensure safety (for example, sounds that make it easier for humans to notice the robot's approach). As a result, it becomes easier to increase the reliability of the artificial sound simulation. Furthermore, it is desirable to take into account the influence of sounds other than the artificial sounds emitted by the robot (for example, the sound of the robot running or the sound of the refrigeration system). If a sound field simulation is performed that takes into account the influence of sounds other than the artificial sounds, the reproducibility of the sound field 40 will be improved, and the higher the reproducibility of the sound field 40, the easier it will be to identify the frequencies of artificial sounds that are easily understood by humans in the actual sound field 40.

したがって、制御部２５は、ロボットについて作成された音を人工音としてシミュレートする際、複数の人工音及び人工音以外の音をシミュレートし、１つの観測点における各人工音の識別性の高さを評価してもよい。 Therefore, when simulating a sound created for a robot as an artificial sound, the control unit 25 may simulate multiple artificial sounds and sounds other than artificial sounds, and evaluate the degree of discriminability of each artificial sound at one observation point.

また、制御部２５は、評価対象音源４２から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートする際、当該音の距離減衰量を算出してもよい。具体的には、制御部２５は、評価対象音源４２であるロボットから仮想三次元空間内を伝搬して１つの観測点に到達する人工音及び人工音以外の音を示すデータに対して、算出された距離減衰量を適用して、人工音及び人工音以外の音をシミュレートしてもよい。距離減衰量の算出には、任意の手法が採用可能である。例えば、制御部２５は、周波数及び音場４０の環境条件（温度分布、温湿度、風向き等）を含めた所定の条件に基づいて、空気吸収による減衰の推定値を算出してもよい。制御部２５は、算出された推定値を、当該音を示すデータに適用することで、ロボットから発せられる音について距離減衰量を算出し得る。 In addition, when simulating a sound propagating from the sound source 42 to be evaluated in the virtual three-dimensional space and reaching an observation point, the control unit 25 may calculate the distance attenuation of the sound. Specifically, the control unit 25 may apply the calculated distance attenuation to data representing the artificial sound and the sound other than the artificial sound propagating from the robot, which is the sound source 42 to be evaluated, in the virtual three-dimensional space and reaching an observation point, to simulate the artificial sound and the sound other than the artificial sound. Any method may be used to calculate the distance attenuation. For example, the control unit 25 may calculate an estimate of the attenuation due to air absorption based on predetermined conditions including the frequency and the environmental conditions of the sound field 40 (temperature distribution, temperature and humidity, wind direction, etc.). The control unit 25 may calculate the distance attenuation of the sound emitted from the robot by applying the calculated estimate to the data representing the sound.

ステップＳ１０３：制御部２５は、ステップＳ１０１で設定された各音源グループの代表周波数帯に限定して、各音源グループに属する音源から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートする。 Step S103: The control unit 25 simulates the sound that propagates through the virtual three-dimensional space from the sound sources belonging to each sound source group and reaches one observation point, limited to the representative frequency band of each sound source group set in step S101.

具体的には、制御部２５は、仮想三次元空間内の複数の音源４１のそれぞれが属する各音源グループの音源データのうち、代表周波数帯以外の周波数に対応する部分は計算対象から除外し、代表周波数帯に対応する部分のみを計算対象とする。換言すると、各音源グループの音源データのうち、各音源グループに属する音源から仮想三次元空間内を伝搬して１つの観測点に到達する音のシミュレーションに使用される部分が、ステップＳ１０１で設定された代表周波数帯に対応する部分に限定される。例えば、図５に示す音源４１ａについて、制御部２５は、音源グループ「車両」の音源データのうち、低い方の周波数帯に対応する部分のみを計算対象として作成した音Ｓ１を、音源４１ａの音としてシミュレートする。また、音源４１ｂについて、制御部２５は、音源グループ「自転車」の音源データのうち、高い方の周波数帯に対応する部分のみを計算対象として作成した音Ｓ２を、音源４１ｂの音としてシミュレートする。便宜上、音源４１ｃは図示を省略するが、音源４１ｃの音も同様に代表周波数帯に限定してシミュレートし得る。 Specifically, the control unit 25 excludes from the calculation the parts of the sound source data of each sound source group to which each of the multiple sound sources 41 in the virtual three-dimensional space belongs that correspond to frequencies other than the representative frequency band, and calculates only the parts that correspond to the representative frequency band. In other words, the parts of the sound source data of each sound source group that are used to simulate the sound that propagates through the virtual three-dimensional space from the sound sources belonging to each sound source group and reaches one observation point are limited to the parts that correspond to the representative frequency band set in step S101. For example, for the sound source 41a shown in FIG. 5, the control unit 25 simulates the sound S1 created using only the parts of the sound source data of the sound source group "vehicle" that correspond to the lower frequency band as the sound of the sound source 41a. For the sound source 41b, the control unit 25 simulates the sound S2 created using only the parts of the sound source data of the sound source group "bicycle" that correspond to the higher frequency band as the sound of the sound source 41b. For convenience, the sound source 41c is not shown in the figure, but the sound of the sound source 41c can also be simulated by limiting it to a representative frequency band.

なお、説明の便宜上、ステップＳ１０２の実行後にステップＳ１０３を実行するかのように記述しているが、ステップＳ１０２とステップＳ１０３は、逆順又は同時に実行されてもよい。ステップＳ１０２及びＳ１０３を経て、プロセスは終了する。 For ease of explanation, step S103 is described as being executed after step S102, but steps S102 and S103 may be executed in the reverse order or simultaneously. After steps S102 and S103, the process ends.

このようにして、制御部２５は、複数の音源グループに分類された音源とは異なる評価対象音源４２から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートするとともに、設定された各音源グループの代表周波数帯に限定して、各音源グループに属する音源から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートする。これにより、複数の音源グループに分類された音源については周波数帯を限定して、評価対象音源４２については周波数帯を限定せずに、音場シミュレーションを実行することができる。その結果、計算量低減と音場の再現性とのバランスを考慮した音場シミュレーションが実現しやすくなる。 In this way, the control unit 25 simulates sound that propagates through the virtual three-dimensional space from the evaluation target sound source 42 that is different from the sound sources classified into multiple sound source groups and reaches one observation point, and also simulates sound that propagates through the virtual three-dimensional space from the sound sources belonging to each sound source group and reaches one observation point, limited to the representative frequency band of each set sound source group. This makes it possible to perform a sound field simulation by limiting the frequency band for the sound sources classified into multiple sound source groups, and without limiting the frequency band for the evaluation target sound source 42. As a result, it becomes easier to realize a sound field simulation that takes into account the balance between reducing the amount of calculation and reproducibility of the sound field.

なお、ステップＳ１０２で述べた人工音の識別性とは、複数の音源４１が存在する音場４０においてある人工音を再生した際の、当該人工音の再生事実及び所期の意図の理解しやすさのことである。人工音の識別性の評価には、任意の手法が採用可能である。例えば、ロボットの発する人工音以外の音及び各音源グループに属する音源の音を再生しながら、作業中のユーザ（実験参加者）に対して抜き打ちで人工音のサンプルを再生し、サンプルの再生事実及び所期の意図がユーザにどの程度理解されるかを調査する実験を繰り返す。ユーザは、各サンプルの識別性のスコアリングを行う。例えば、ユーザが各サンプルの再生に気付くまでの時間が短いほど、又は各サンプルの所期の意図の理解度が高いほど、高いスコアを付与してもよい。スコアの集計結果は、人工音の各サンプルとスコアとを紐付けた情報として、音源データベースに記憶されてもよい。制御部２５は、音源データベースに記憶された集計結果に基づいて、例えば最も合計スコアの高いサンプルを最も識別性の高いサンプルとして抽出してもよい。制御部２５は、出力部２３を介して、抽出されたサンプルをユーザに出力してもよい。このようにして、制御部２５は、ロボットについて作成された音を人工音としてシミュレートする際、複数の人工音及び人工音以外の音をシミュレートし、１つの観測点における各人工音の識別性の高さを評価し得る。その結果、実際の音場４０でロボットから人間に表出される音として、より人間の安全が図りやすい音を特定しやすくなる。 The distinguishability of the artificial sound mentioned in step S102 refers to the ease of understanding the fact of reproduction of the artificial sound and the intended intention when an artificial sound is reproduced in a sound field 40 in which multiple sound sources 41 exist. Any method can be adopted for evaluating the distinguishability of the artificial sound. For example, while reproducing sounds other than the artificial sound emitted by the robot and sounds of the sound sources belonging to each sound source group, a sample of the artificial sound is reproduced randomly for a user (experiment participant) who is working, and an experiment is repeated to investigate the degree to which the user understands the fact of reproduction of the sample and the intended intention. The user scores the distinguishability of each sample. For example, the shorter the time it takes for the user to notice the reproduction of each sample, or the higher the degree of understanding of the intended intention of each sample, the higher the score may be given. The score compilation result may be stored in the sound source database as information linking each sample of the artificial sound with the score. Based on the compilation result stored in the sound source database, the control unit 25 may extract, for example, the sample with the highest total score as the most distinguishable sample. The control unit 25 may output the extracted samples to the user via the output unit 23. In this way, when simulating the sound created for the robot as an artificial sound, the control unit 25 can simulate multiple artificial sounds and sounds other than artificial sounds and evaluate the degree of discriminability of each artificial sound at one observation point. As a result, it becomes easier to identify sounds that are more likely to ensure human safety as sounds that are expressed by the robot to humans in the actual sound field 40.

以上述べたように、本実施形態に係る情報処理装置２０は、音場に対応する仮想三次元空間内の複数の音源４１のうち、複数の音源グループに分類された音源について、観察期間中に音場４０で取得された、周波数ごとの音圧レベルを示す音源データを取得する。情報処理装置２０は、取得された音源データで示される音圧レベルに応じて、複数の音源グループの各音源グループの代表周波数帯を設定する。情報処理装置２０は、複数の音源グループに分類された音源とは異なる評価対象音源４２から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートするとともに、設定された各音源グループの代表周波数帯に限定して、各音源グループに属する音源から仮想三次元空間内を伝搬して１つの観測点に到達する音をシミュレートする。 As described above, the information processing device 20 according to this embodiment acquires sound source data indicating the sound pressure level for each frequency acquired in the sound field 40 during an observation period for sound sources classified into multiple sound source groups among multiple sound sources 41 in a virtual three-dimensional space corresponding to a sound field. The information processing device 20 sets a representative frequency band for each of the multiple sound source groups according to the sound pressure level indicated by the acquired sound source data. The information processing device 20 simulates a sound that propagates through the virtual three-dimensional space from an evaluation target sound source 42 that is different from the sound sources classified into the multiple sound source groups and reaches one observation point, and also simulates a sound that propagates through the virtual three-dimensional space from a sound source belonging to each sound source group and reaches one observation point, limited to the representative frequency band of each set sound source group.

かかる構成によれば、評価対象音源４２以外の音源である、各音源グループに属する音源の発する音を代表周波数帯に限定して、音場４０のシミュレーションが実行される。このため、例えば街レベルの広い屋外音場を仮想三次元空間内で再現する際、評価対象音源４２以外の音源の発する音について計算量を低減できる。したがって、音場の再現性を担保しつつ計算量の増加を抑制しやすくなる点で、音場シミュレーションに関する技術が改善される。 According to this configuration, the sound field 40 is simulated by limiting the sound emitted by sound sources belonging to each sound source group, which are sound sources other than the sound source 42 to be evaluated, to a representative frequency band. Therefore, when reproducing a wide outdoor sound field at the level of a city, for example, in a virtual three-dimensional space, the amount of calculations for the sound emitted by sound sources other than the sound source 42 to be evaluated can be reduced. This improves the technology related to sound field simulation in that it becomes easier to suppress an increase in the amount of calculations while ensuring the reproducibility of the sound field.

本開示を諸図面及び実施例に基づき説明してきたが、当業者であれば本開示に基づき種々の変形及び改変を行ってもよいことに注意されたい。したがって、これらの変形及び改変は本開示の範囲に含まれることに留意されたい。例えば、各構成部又は各ステップ等に含まれる機能等は論理的に矛盾しないように再配置可能であり、複数の構成部又はステップ等を１つに組み合わせたり、或いは分割したりすることが可能である。 Although the present disclosure has been described based on the drawings and examples, it should be noted that a person skilled in the art may make various modifications and alterations based on the present disclosure. Therefore, it should be noted that these modifications and alterations are included in the scope of the present disclosure. For example, the functions included in each component or step can be rearranged so as not to cause logical inconsistencies, and multiple components or steps can be combined into one or divided.

例えば、上述した実施形態において、情報処理装置２０の構成及び動作を、互いに通信可能な複数のコンピュータに分散させた実施形態も可能である。また例えば、情報処理装置２０の一部又は全部の構成要素を記録装置１０に設けた実施形態も可能である。 For example, in the above-described embodiment, the configuration and operation of the information processing device 20 may be distributed among multiple computers that can communicate with each other. Also, for example, an embodiment in which some or all of the components of the information processing device 20 are provided in the recording device 10 may be possible.

また、例えば汎用のコンピュータを、上述した実施形態に係る情報処理装置２０として機能させる実施形態も可能である。具体的には、上述した実施形態に係る情報処理装置２０の各機能を実現する処理内容を記述したプログラムを、汎用のコンピュータのメモリに格納し、プロセッサによって当該プログラムを読み出して実行させる。したがって、本開示は、プロセッサが実行可能なプログラム、又は当該プログラムを記憶する非一時的なコンピュータ可読媒体としても実現可能である。 In addition, for example, an embodiment is also possible in which a general-purpose computer functions as the information processing device 20 according to the above-described embodiment. Specifically, a program describing the processing contents for realizing each function of the information processing device 20 according to the above-described embodiment is stored in the memory of the general-purpose computer, and the program is read and executed by a processor. Thus, the present disclosure can also be realized as a program executable by a processor, or a non-transitory computer-readable medium that stores the program.

１システム
１０記録装置
１１音源データ取得部
１２測位部
１３通信部
１４入力部
１５出力部
１６記憶部
１７制御部
２０情報処理装置
２１通信部
２２入力部
２３出力部
２４記憶部
２５制御部
３０ネットワーク
４０音場
４１複数の音源
４１ａ自動運転車両
４１ｂロードバイク
４１ｃ男性
４２評価対象音源
Ｐ、Ｑ点
Ｓ０、Ｓ１、Ｓ２音
ＶＳ仮想三次元空間 REFERENCE SIGNS LIST 1 System 10 Recording device 11 Sound source data acquisition unit 12 Positioning unit 13 Communication unit 14 Input unit 15 Output unit 16 Memory unit 17 Control unit 20 Information processing device 21 Communication unit 22 Input unit 23 Output unit 24 Memory unit 25 Control unit 30 Network 40 Sound field 41 Multiple sound sources 41a Self-driving vehicle 41b Road bike 41c Man 42 Sound sources to be evaluated P, Q Points S0, S1, S2 Sound VS Virtual three-dimensional space

Claims

An information processing device including a control unit,
The control unit is
For a plurality of sound sources in a virtual three-dimensional space corresponding to a sound field, sound source data indicating a sound pressure level for each frequency of each sound source group is acquired based on recording data acquired in the sound field during an observation period, for the sound sources classified into a plurality of sound source groups.
setting a representative frequency band for each of the plurality of sound source groups according to a sound pressure level indicated by the acquired sound source data;
an information processing device that simulates a sound that propagates through the virtual three-dimensional space from a sound source to be evaluated that is different from the sound sources classified into the plurality of sound source groups and reaches one observation point, and that simulates a sound that propagates through the virtual three-dimensional space from a sound source belonging to each sound source group, limited to a representative frequency band of each set sound source group, and reaches the one observation point.

2. The information processing device according to claim 1,
the evaluation target sound source is a robot that emits an artificial sound and a sound other than an artificial sound,
The control unit is an information processing device that simulates a sound created for the robot as the artificial sound.

3. The information processing device according to claim 2,
the control unit, when simulating the sound created for the robot as the artificial sound, further includes simulating a plurality of artificial sounds and sounds other than the artificial sounds, and evaluating a level of discriminability of each artificial sound at the one observation point.

A method executed by an information processing device, comprising:
acquiring sound source data indicating a sound pressure level for each frequency of each sound source group based on recording data acquired in the sound field during an observation period, for sound sources classified into a plurality of sound source groups among a plurality of sound sources in a virtual three-dimensional space corresponding to the sound field;
setting a representative frequency band for each of the plurality of sound source groups according to a sound pressure level indicated by the acquired sound source data; and simulating a sound that propagates through the virtual three-dimensional space from a sound source to be evaluated that is different from the sound sources classified into the plurality of sound source groups and reaches one observation point, and simulating a sound that propagates through the virtual three-dimensional space from a sound source belonging to each sound source group and reaches the one observation point, limited to the representative frequency band of each of the set sound source groups.

In the information processing device,
acquiring sound source data indicating a sound pressure level for each frequency of each sound source group based on recording data acquired in the sound field during an observation period, for sound sources classified into a plurality of sound source groups among a plurality of sound sources in a virtual three-dimensional space corresponding to the sound field;
setting a representative frequency band of each of the plurality of sound source groups according to a sound pressure level indicated by the acquired sound source data; and simulating a sound that propagates through the virtual three-dimensional space from a sound source to be evaluated that is different from the sound sources classified into the plurality of sound source groups and reaches one observation point, and simulating a sound that propagates through the virtual three-dimensional space from a sound source belonging to each sound source group and reaches the one observation point, limited to the representative frequency band of each of the set sound source groups.