JP6127792B2

JP6127792B2 - Sound reproduction apparatus and sound reproduction program

Info

Publication number: JP6127792B2
Application number: JP2013148746A
Authority: JP
Inventors: 洋平関; 桂樹岡林
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2013-07-17
Filing date: 2013-07-17
Publication date: 2017-05-17
Anticipated expiration: 2033-07-17
Also published as: JP2015023354A

Description

本件開示は、音響再生装置及び音響再生プログラムに関する。 The present disclosure relates to a sound reproduction device and a sound reproduction program.

ヘッドホンなどで音響を再生する際に、人物の左右の耳に対応する頭部伝達関数(ＨＲＴＦ：Head Related Transfer Function)を用いた音像定位処理を施すことで、音源が人物に対して所定の方向にあることを知覚させる技術がある。 When sound is reproduced with headphones or the like, sound source localization processing using a head related transfer function (HRTF) corresponding to the left and right ears of the person is performed, so that the sound source has a predetermined direction with respect to the person. There is a technology that makes you feel that

音像定位処理は、例えば、３次元空間を表現する映像に含まれる物体の動きとともに、動きに伴う音を、人物と物体との間の仮想的な距離に応じた遅延で聴取させることで臨場感の高い映像を与える技術などへの適用が提案されている(例えば、特許文献１参照)。 The sound image localization processing is, for example, by listening to the sound accompanying the movement with a delay corresponding to the virtual distance between the person and the object together with the movement of the object included in the video representing the three-dimensional space. Application to a technique for providing a high-quality video has been proposed (see, for example, Patent Document 1).

特開平９−１６０５４９号公報JP-A-9-160549

ところで、特許文献１の技術は、写実的な画像で表される物体の動きを人物に見せるタイミングと当該物体の動きに伴う音響を人物に聴かせるタイミングとのズレによって、物体までの距離感を人物に知覚させている。このため、例えば、人物に提示している映像によって表現された３次元空間の外側にある物体が音源である場合、つまり、人物の視界に音源またはその画像が含まれていない場合に、特許文献１の技術により人物に音源までの距離感を知覚させることは困難である。 By the way, the technique of Patent Document 1 gives a sense of distance to an object by a difference between a timing at which a person sees the movement of an object represented by a realistic image and a timing at which a person hears the sound accompanying the movement of the object. Let people perceive. For this reason, for example, when an object outside the three-dimensional space represented by a video presented to a person is a sound source, that is, when a sound source or an image thereof is not included in the person's field of view, It is difficult for a person to perceive a sense of distance to a sound source by the technique 1.

本件開示の音響再生装置及び音響再生プログラムは、人物の視界に音源が含まれているか否かにかかわらず、音源までの距離を人物に直感的に認識させる技術を提供することを目的とする。 It is an object of the present disclosure to provide a sound reproduction device and a sound reproduction program that allow a person to intuitively recognize the distance to a sound source regardless of whether the sound field is included in the person's field of view.

一つの観点によれば、音響再生装置は、音源から発せられる音を示す音響信号を受け、受けた音響信号から再生した音を人物の聴覚に与える再生部と、音響信号で示される音の強さの変化に応じて変化する強度を持つ刺激を人物の視覚に与える提示部と、音源の位置を示す情報と人物の位置を示す情報とに基づいて、音源と人物との距離を算出する第１算出部と、再生部によって人物の聴覚に音を与える第１タイミングと提示部によって人物の視覚に刺激を与える第２タイミングとの間の時間差を、第１算出部で算出された距離に応じて変更する制御を行う制御部とを有する。 According to one aspect, the sound reproducing device receives an acoustic signal indicating a sound emitted from a sound source, and gives a sound reproduced from the received acoustic signal to a person's hearing, and a sound intensity indicated by the acoustic signal. The distance between the sound source and the person is calculated based on the presentation unit that gives the person's vision a stimulus having a strength that changes according to the change of the height, the information indicating the position of the sound source, and the information indicating the position of the person. The time difference between the first calculation unit and the first timing at which the sound is given to the person's hearing by the reproduction unit and the second timing at which the presentation unit gives the stimulus to the person's vision depends on the distance calculated by the first calculation unit And a control unit that performs control to be changed.

別の観点によれば、音響再生プログラムは、音源から発せられる音を示す音響信号を受け、音響信号で示される音の強さに基づいて、人物の視覚に提示する刺激の強度を決定し、音源の位置を示す情報と人物の位置を示す情報とに基づいて、音源と人物との距離を算出し、音響信号から再生される音を人物の聴覚に与える第１タイミングと決定された強度を持つ刺激を人物の視覚に与える第２タイミングとの間の時間差を、算出された距離に応じて変更する制御を行う、処理をコンピュータに実行させる。 According to another aspect, the sound reproduction program receives an acoustic signal indicating the sound emitted from the sound source, determines the intensity of the stimulus to be presented to the human vision based on the intensity of the sound indicated by the acoustic signal, Based on the information indicating the position of the sound source and the information indicating the position of the person, the distance between the sound source and the person is calculated, and the first timing for giving the sound reproduced from the acoustic signal to the person's hearing is determined as the first timing. The computer is caused to execute a process of performing control to change the time difference from the second timing for giving the stimulus to the person's vision according to the calculated distance.

本件開示の音響再生装置及び音響再生プログラムは、人物の視界に音源が含まれているか否かにかかわらず、音源までの距離を人物に直感的に認識させることができる。 The sound reproduction device and the sound reproduction program of the present disclosure can make a person intuitively recognize the distance to a sound source regardless of whether or not the sound field is included in the person's field of view.

音響再生装置の一実施形態を示す図である。It is a figure which shows one Embodiment of a sound reproducing device. 図１に示した提示部によって与える視覚刺激の例を示す図である。It is a figure which shows the example of the visual stimulus given by the presentation part shown in FIG. 図１に示した音源と人物との位置関係の例を示す図である。It is a figure which shows the example of the positional relationship of the sound source shown in FIG. 1, and a person. 図３に示した音源から発せられた音の強さの変化と視覚刺激として表示される図形などの輝度の変化及び再生される音の強さの変化との関係を示す図である。It is a figure which shows the relationship between the change of the intensity | strength of the sound emitted from the sound source shown in FIG. 3, the change of the brightness | luminance of the figure etc. which are displayed as a visual stimulus, and the change of the intensity | strength of the sound reproduced | regenerated. 図１に示した音響再生装置の動作を示す図である。It is a figure which shows operation | movement of the sound reproduction apparatus shown in FIG. 音声再生装置の別実施形態を示す図である。It is a figure which shows another embodiment of an audio | voice reproduction apparatus. 図６に示した遅延部で設定される遅延時間の例を示す図である。It is a figure which shows the example of the delay time set by the delay part shown in FIG. 図６に示した提示部の別実施形態を示す図である。It is a figure which shows another embodiment of the presentation part shown in FIG. 図６に示した音響再生装置を適用した案内システムの例を示す図である。It is a figure which shows the example of the guidance system to which the sound reproduction apparatus shown in FIG. 6 is applied. 音響再生装置のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of a sound reproduction apparatus. 図１０に示した音響再生装置の動作を示す図である。It is a figure which shows operation | movement of the sound reproduction apparatus shown in FIG. 注目されている音源を判別する処理の例を示す図である。It is a figure which shows the example of the process which discriminates the sound source which attracts attention. 音及び視覚刺激を与える処理の例を示す図である。It is a figure which shows the example of the process which gives a sound and a visual stimulus.

以下、図面に基づいて、実施形態を説明する。 Hereinafter, embodiments will be described with reference to the drawings.

図１は、音響再生装置の一実施形態を示す。 FIG. 1 shows an embodiment of a sound reproducing device.

図１に示した音響再生装置１０は、例えば、眼鏡型の表示画面１１１ａ，１１１ｂを有する提示部１１と、ヘッドホンなどの再生部１２と、第１算出部１３と、制御部１４とを含んでいる。 The sound reproduction device 10 illustrated in FIG. 1 includes, for example, a presentation unit 11 having glasses-type display screens 111a and 111b, a reproduction unit 12 such as headphones, a first calculation unit 13, and a control unit 14. Yes.

提示部１１は、人物Ｐ１の頭部に装着され、表示画面１１１ａ，１１１ｂに表示させた画像を人物Ｐ１の視覚に与える。提示部１１は、左右の目に対応する表示画面１１１ａ，１１１ｂを有する眼鏡型に限らず、両眼を覆う形状の一つの表示画面を有していてもよい。なお、表示画面１１１ａ，１１１ｂは、表示画面１１１ａ，１１１ｂを通して外部の環境が観察可能な程度の透明度を持っていることが望ましい。また、提示部１１は、表示画面１１１ａ，１１１ｂの代わりに、人物Ｐ１の視野内に配置された少なくとも一つの発光素子を含んでいてもよい。なお、発光素子を含む提示部１１の例については、図８を用いて後述する。 The presentation unit 11 is attached to the head of the person P1, and gives the image displayed on the display screens 111a and 111b to the vision of the person P1. The presentation unit 11 is not limited to the glasses type having the display screens 111a and 111b corresponding to the left and right eyes, and may have a single display screen that covers both eyes. It is desirable that the display screens 111a and 111b have such transparency that the external environment can be observed through the display screens 111a and 111b. In addition, the presentation unit 11 may include at least one light emitting element arranged in the field of view of the person P1 instead of the display screens 111a and 111b. In addition, the example of the presentation part 11 containing a light emitting element is later mentioned using FIG.

また、再生部１２は、人物Ｐ１の頭部に装着された状態で、人物Ｐ１の左右の耳にそれぞれ対向するスピーカを内蔵しており、内蔵したスピーカによって出力される音響を人物Ｐ１の聴覚に与える。 In addition, the playback unit 12 has built-in speakers facing the left and right ears of the person P1 while being worn on the head of the person P1, and the sound output from the built-in speakers is used as the hearing of the person P1. give.

図１に示した音源Ｓは、例えば、壁ＷＬなどにより、人物Ｐ１から隔てられた場所、あるいは、音源Ｓから発せられる音を人物Ｐ１が聞き取れる距離よりも人物Ｐ１から離れた場所に設置されている。 The sound source S shown in FIG. 1 is installed in a place separated from the person P1 by a wall WL or the like, or in a place further away from the person P1 than the distance at which the person P1 can hear the sound emitted from the sound source S. Yes.

音源Ｓから発せられた音は、例えば、音源Ｓに近接した位置に配置されたマイクロホンＭＣにより、音響信号に変換される。また、マイクロホンＭＣによる変換で得られた音響信号は、例えば、サーバ装置ＳＶに搭載された無線通信機能を用いて音響信号を示す情報を送信することで、音源Ｓから離れた場所にいる人物Ｐ１に装着された音響再生装置１０の制御部１４に渡される。つまり、音響再生装置１０は、人物Ｐ１から離れた場所にある音源Ｓによって発せされる音を、マイクロホンＭＣで得られた音響信号に基づいて再生し、再生部１２により、人物Ｐ１の聴覚に与える。なお、サーバ装置ＳＶは、人物Ｐ１によって携帯されているスマートホンやタブレット型端末などの携帯端末ＵＥに搭載された無線通信機能を利用することで、音響信号を携帯端末ＵＥに渡し、携帯端末ＵＥ経由で音響信号を音響再生装置１０に渡してもよい。 The sound emitted from the sound source S is converted into an acoustic signal by, for example, the microphone MC disposed at a position close to the sound source S. The acoustic signal obtained by the conversion by the microphone MC is, for example, a person P1 who is away from the sound source S by transmitting information indicating the acoustic signal using a wireless communication function installed in the server device SV. To the control unit 14 of the sound reproducing device 10 attached to the. That is, the sound reproducing device 10 reproduces the sound emitted by the sound source S located at a location away from the person P1 based on the acoustic signal obtained by the microphone MC, and gives the sound to the hearing of the person P1 by the reproducing unit 12. . Note that the server device SV uses the wireless communication function installed in the mobile terminal UE such as a smart phone or a tablet terminal carried by the person P1, thereby passing the acoustic signal to the mobile terminal UE. The sound signal may be passed to the sound reproduction device 10 via the route.

また、第１算出部１３は、例えば、人物Ｐ１によって携帯されているスマートホンやタブレット型端末などの携帯端末ＵＥから、携帯端末ＵＥに含まれているＧＰＳ(Global Positioning System)センサＧｐなどで得られた人物Ｐ１の位置を示す位置情報Ｐｄｐを受ける。人物Ｐ１の位置を示す位置情報Ｐｄｐは、携帯端末ＵＥに搭載されたＧＰＳセンサＧｐで得られた位置情報に限られない。例えば、人物Ｐ１の位置を示す位置情報Ｐｄｐは、携帯端末ＵＥに内蔵された加速度センサとジャイロセンサとで得られた情報に基づいて人物Ｐ１の動きを追跡することで得られる位置情報でもよい。また、人物Ｐ１の位置を示す位置情報Ｐｄｐは、複数の電波源から携帯端末ＵＥが受信した信号の強さに基づいて人物Ｐ１の位置を測定することで得られる位置情報でもよい。なお、第１算出部１３は、携帯端末ＵＥに搭載されたＧＰＳセンサＧｐから位置情報Ｐｄｐを受ける代わりに、例えば、提示部１１あるいは再生部１２とともに人物Ｐ１の頭部に装着された別のＧＰＳセンサなどで得られた位置情報を受けてもよい。 Further, the first calculation unit 13 is obtained from, for example, a GPS (Global Positioning System) sensor Gp included in the mobile terminal UE from a mobile terminal UE such as a smart phone or a tablet terminal carried by the person P1. The position information Pdp indicating the position of the given person P1 is received. The position information Pdp indicating the position of the person P1 is not limited to the position information obtained by the GPS sensor Gp mounted on the mobile terminal UE. For example, the position information Pdp indicating the position of the person P1 may be position information obtained by tracking the movement of the person P1 based on information obtained by an acceleration sensor and a gyro sensor built in the mobile terminal UE. The position information Pdp indicating the position of the person P1 may be position information obtained by measuring the position of the person P1 based on the strength of signals received by the mobile terminal UE from a plurality of radio wave sources. The first calculation unit 13 receives, for example, another GPS mounted on the head of the person P1 together with the presentation unit 11 or the playback unit 12 instead of receiving the position information Pdp from the GPS sensor Gp mounted on the mobile terminal UE. You may receive the positional information obtained with the sensor etc.

また、第１算出部１３は、例えば、サーバ装置ＳＶから、サーバ装置ＳＶに搭載された無線通信機能などを用いて送信された情報を受信することで、音源Ｓについて予め登録された位置を示す位置情報Ｐｄｓを受ける。なお、位置情報Ｐｄｓは、第１算出部１３の内部に設けられたメモリに予め保持されていてもよい。 Moreover, the 1st calculation part 13 shows the position registered beforehand about the sound source S by receiving the information transmitted, for example from the server apparatus SV using the wireless communication function etc. which were mounted in the server apparatus SV. Receives position information Pds. The position information Pds may be held in advance in a memory provided in the first calculation unit 13.

第１算出部１３は、受けた位置情報Ｐｄｐ，Ｐｄｓに基づいて、人物Ｐ１と音源Ｓとの間の距離Ｌ１を算出し、算出した距離Ｌ１を示す情報を制御部１４に渡す。 The first calculation unit 13 calculates a distance L1 between the person P1 and the sound source S based on the received position information Pdp and Pds, and passes information indicating the calculated distance L1 to the control unit 14.

制御部１４は、サーバ装置ＳＶから受けた音響信号に基づいて、音源Ｓから発せられる音の強さを示す音圧を算出する。制御部１４は、例えば、マイクロホンＭＣから受けた音響信号の振幅をサンプリングすることで得られた電圧値から、数ミリ秒〜十数ミリ秒に設定された期間毎に、音圧の大きさを示す振幅の実効値を算出する。例えば、制御部１４で算出される実効値Ｖは、平均化を行う期間内のサンプリングで得られたＮ個(Ｎは正の整数)の電圧値Ｖ(ｊ)（ｊはサンプル数Ｎ以下の正の整数）を用いて、式(１)で示される。 The control unit 14 calculates a sound pressure indicating the intensity of the sound emitted from the sound source S based on the acoustic signal received from the server device SV. For example, from the voltage value obtained by sampling the amplitude of the acoustic signal received from the microphone MC, the control unit 14 sets the magnitude of the sound pressure for each period set to several milliseconds to several tens of milliseconds. The effective value of the indicated amplitude is calculated. For example, the effective value V calculated by the control unit 14 is N (N is a positive integer) voltage values V (j) (j is the number of samples N or less) obtained by sampling within the averaging period. (Positive integer) is used to express the equation (1).

また、制御部１４は、式(１)を用いて算出した実効値Ｖを用いて、提示部１１の表示画面１１１ａ，１１１ｂの少なくとも一方に、音響信号で示される音源Ｓからの音の強さに応じて明るさが変化する画像を表示させる画像データを生成する。制御部１４で生成された画像データに基づいて表示画面１１１ａ，１１１ｂの少なくとも一方に表示された画像は、音響信号で示される音の強さに応じた強度を持ち、人物Ｐ１の視覚に与えられる刺激の一例である。また、表示画面１１１ａ，１１１ｂの少なくとも一方に表示された画像の明るさは、人物Ｐ１の視覚に与える刺激の強度の一例である。以下の説明において、「視覚に与える刺激」は、視覚刺激と称される場合がある。

Moreover, the control part 14 uses the effective value V calculated using Formula (1), and the intensity | strength of the sound from the sound source S shown with an acoustic signal on at least one of the

display screens

111a and 111b of the presentation part 11 is shown. Image data for displaying an image whose brightness changes in accordance with is generated. An image displayed on at least one of the

display screens

111a and 111b based on the image data generated by the control unit 14 has an intensity corresponding to the intensity of the sound indicated by the acoustic signal, and is given to the vision of the person P1. It is an example of a stimulus. The brightness of the image displayed on at least one of the

display screens

111a and 111b is an example of the intensity of the stimulus given to the vision of the person P1. In the following description, “stimulation given to vision” may be referred to as visual stimulation.

制御部１４は、例えば、円や矩形、矢印などの図形や文字列あるいは文字列と図形とを組み合わせたマークを表す画像データの輝度値を実効値Ｖに応じて変化させることで、提示部１１によって与える視覚刺激の強度を変化させる。また、制御部１４は、提示部１１によって視覚刺激として人物Ｐ１に与える図形などの色の濃度や大きさを実効値Ｖに応じて変化させることで、人物Ｐ１に与える視覚刺激の強度を変化させてもよい。以下では、提示部１１によって人物Ｐ１の視覚に与える刺激の強度を制御する手法の一例として、視覚刺激として表示画面１１１ａ，１１１ｂに表示する図形などの明るさを制御部１４によって制御する場合が説明される。 For example, the control unit 14 changes the luminance value of image data representing a graphic such as a circle, a rectangle, or an arrow, a character string, or a combination of a character string and a graphic according to the effective value V, thereby providing the presentation unit 11. The intensity of the visual stimulus given by is changed. Further, the control unit 14 changes the intensity of the visual stimulus given to the person P1 by changing the density and size of the color of the figure or the like given to the person P1 as the visual stimulus by the presentation unit 11 according to the effective value V. May be. In the following, as an example of a method for controlling the intensity of the stimulus given to the vision of the person P1 by the presentation unit 11, the case where the control unit 14 controls the brightness of a graphic or the like displayed on the display screens 111a and 111b as a visual stimulus will be described. Is done.

図２(Ａ)，(Ｂ)，(Ｃ)は、図１に示した提示部１１によって人物Ｐ１の視覚に与える刺激の例を示す。なお、図２に示した構成要素のうち、図１に示した構成要素と同等のものについては、同一の符号を付して示し、その説明は省略する。図２(Ａ)，(Ｂ)，(Ｃ)は、提示部１１を装着した人物Ｐ１から見た表示画面１１１ａ，１１１ｂの例を示す。 2A, 2B, and 2C show examples of stimuli given to the vision of the person P1 by the presentation unit 11 shown in FIG. 2 that are the same as those shown in FIG. 1 are given the same reference numerals, and descriptions thereof are omitted. 2A, 2 B, and 2 C show examples of display screens 111 a and 111 b viewed from the person P 1 wearing the presentation unit 11.

図２(Ａ)は、提示部１１に含まれる２つの表示画面１１１ａ，１１１ｂの一方（この例では表示画面１１１ｂ）に、図１に示した音源Ｓに対応する視覚刺激として、図形（例えば、円形１１２ｃ）を図１に示した制御部１４により表示させる例を示す。 FIG. 2A shows a graphic (for example, a visual stimulus corresponding to the sound source S shown in FIG. 1 on one of the two display screens 111a and 111b (in this example, the display screen 111b) included in the presentation unit 11. An example in which the circle 112c) is displayed by the control unit 14 shown in FIG.

制御部１４は、例えば、図２(Ａ)に示した円形１１２ｃの輝度を、式(１)で求めた実効値Ｖの変化に応じて変化させる。円形１１２ｃの輝度は、円形１１２ｃを表示させる画像データの輝度値を変化させることで変化する。これにより、制御部１４は、音源Ｓから発せられる音の強さの変化を、視覚刺激として人物Ｐ１に与える円形１１２ｃの輝度の変化に反映させる。 For example, the control unit 14 changes the luminance of the circle 112c illustrated in FIG. 2A in accordance with the change in the effective value V obtained by Expression (1). The luminance of the circle 112c is changed by changing the luminance value of the image data for displaying the circle 112c. Thereby, the control part 14 reflects the change of the intensity of the sound emitted from the sound source S in the change of the luminance of the circle 112c given to the person P1 as a visual stimulus.

図２(Ｂ)は、提示部１１に含まれる２つの表示画面１１１ａ，１１１ｂの一方（この例では表示画面１１１ｂ）に、図１に示した音源Ｓに対応する視覚刺激として、文字列「Ｆｊ」を示すテキスト１１２ｔを制御部１４により表示させる例を示す。 2B shows a character string “Fj” as a visual stimulus corresponding to the sound source S shown in FIG. 1 on one of the two display screens 111a and 111b (in this example, the display screen 111b) included in the presentation unit 11. An example in which a text 112t indicating "is displayed by the control unit 14 is shown.

制御部１４は、例えば、図２(Ｂ)に示したテキスト１１２ｔの輝度を、式(１)で求めた実効値Ｖの変化に応じて変化させる。テキスト１１２ｔの輝度は、テキスト１１２ｔを表示させる画像データの輝度値を変化させることで変化する。これにより、制御部１４は、音源Ｓから発せられる音の強さの変化を、視覚刺激として人物Ｐ１に与えるテキスト１１２ｔの輝度の変化に反映させる。 For example, the control unit 14 changes the luminance of the text 112t shown in FIG. 2B according to the change in the effective value V obtained by the equation (1). The luminance of the text 112t is changed by changing the luminance value of the image data for displaying the text 112t. Thereby, the control part 14 reflects the change of the intensity of the sound emitted from the sound source S in the change of the luminance of the text 112t given to the person P1 as a visual stimulus.

図２(Ｃ)は、提示部１１に含まれる２つの表示画面１１１ａ，１１１ｂの一方（この例では表示画面１１１ｂ）に、図１に示した音源Ｓに対応する視覚刺激として、文字列と図形とを組み合わせたマーク１１２ｍを制御部１４により表示させる例を示す。 FIG. 2C shows a character string and a graphic as a visual stimulus corresponding to the sound source S shown in FIG. 1 on one of the two display screens 111a and 111b (in this example, the display screen 111b) included in the presentation unit 11. An example is shown in which the control unit 14 displays a mark 112m combining the above.

図２(Ｃ)に示したマーク１１２ｍは、文字列「Ｆｊ」を表すテキスト１１２ｔと、テキスト１１２ｔの背景１１２ｂと、テキスト１１２ｔ及び背景１１２ｂを囲む枠１１２ｆとを含んでいる。 The mark 112m shown in FIG. 2C includes a text 112t representing the character string “Fj”, a background 112b of the text 112t, and a frame 112f surrounding the text 112t and the background 112b.

図１に示した制御部１４は、図２(Ｃ)に示したテキスト１１２ｔ、背景１１２ｂ、枠１１２ｆのいずれかの輝度を、式(１)で求めた実効値Ｖの変化に応じて変化させることで、音源Ｓから発せられる音の強さの変化をマーク１１２ｍの明るさに反映させる。また、制御部１４は、図２(Ｃ)に示した視覚刺激であるマーク１１２ｍに含まれる要素から選んだ２つ以上の要素の輝度を、式(１)で求めた実効値Ｖに応じて変化させることで、音源Ｓから発せられる音の強さの変化を視覚刺激の明るさに反映させてもよい。マーク１１２ｍの要素であるテキスト１１２ｔ、背景１１２ｂ、枠１１２ｆのそれぞれの輝度は、テキスト１１２ｔ、背景１１２ｂ、枠１１２ｆのそれぞれを表示させる画像データの輝度値を変化させることで変化する。例えば、制御部１４は、式(１)で求めた実効値Ｖに応じて変化する輝度値を用いて、テキスト１１２ｔと枠１１２ｆとをそれぞれ表す画像データを生成してもよい。 The control unit 14 illustrated in FIG. 1 changes the luminance of any of the text 112t, the background 112b, and the frame 112f illustrated in FIG. 2C in accordance with the change in the effective value V obtained by Expression (1). Thus, the change in the intensity of the sound emitted from the sound source S is reflected in the brightness of the mark 112m. Further, the control unit 14 determines the brightness of two or more elements selected from the elements included in the mark 112m that is the visual stimulus shown in FIG. 2C according to the effective value V obtained by the expression (1). By changing, the change in the intensity of the sound emitted from the sound source S may be reflected in the brightness of the visual stimulus. The brightness of the text 112t, the background 112b, and the frame 112f, which are elements of the mark 112m, is changed by changing the brightness value of the image data that displays the text 112t, the background 112b, and the frame 112f. For example, the control unit 14 may generate image data representing each of the text 112t and the frame 112f using a luminance value that changes in accordance with the effective value V obtained by Expression (1).

なお、制御部１４は、式(１)で求められる実効値Ｖの変化に応じて、提示部１１によって表示が可能な範囲で音源Ｓに対応する視覚刺激の明るさを変化させることが望ましい。例えば、制御部１４は、音源Ｓから発せられる音の音圧として想定される最大の音圧に対応する実効値Ｖｍと、提示部１１によって表示が可能な最大の輝度Ｂｍとを用いて、式(２)により、実効値Ｖに対応する輝度値Ｂを求めてもよい。
Ｂ＝Ｂｍ×Ｖ／Ｖｍ・・・（２）
また、制御部１４は、２つの表示画面１１１ａ，１１１ｂのどちらに音源Ｓに対応する視覚刺激を表示させる場合でも、人物Ｐ１の視界を妨げない位置に表示させることが望ましい。 Note that the control unit 14 desirably changes the brightness of the visual stimulus corresponding to the sound source S within a range that can be displayed by the presentation unit 11 in accordance with a change in the effective value V obtained by Expression (1). For example, the control unit 14 uses the effective value Vm corresponding to the maximum sound pressure assumed as the sound pressure of the sound emitted from the sound source S and the maximum luminance Bm that can be displayed by the presentation unit 11, and The luminance value B corresponding to the effective value V may be obtained from (2).
B = Bm × V / Vm (2)
Moreover, it is desirable that the control unit 14 display the visual stimulus corresponding to the sound source S on either of the two display screens 111a and 111b at a position that does not obstruct the view of the person P1.

また、制御部１４は、第１算出部１３から受けた距離Ｌ１を示す情報に基づいて、距離Ｌ１を音が伝搬する時間を求める。また、制御部１４は、サーバ装置ＳＶから受けた音響信号に、求めた時間に相当する遅延を与え、遅延を与えた音響信号を再生部１２に渡す。これにより、制御部１４は、提示部１１によって人物Ｐ１に与えられる視覚刺激の強度が音響信号で示される音の強さに応じて変化するタイミングから、距離Ｌ１を音が伝播する時間を遅れさせたタイミングで、再生部１２に音響信号で示される音を再生させる。即ち、制御部１４は、距離Ｌ１が大きいほど大きい遅延時間を再生部１２に渡す音響信号に与えることで、再生部１２により人物Ｐ１に音を与える第１タイミングと、提示部１１により人物Ｐ１に視覚刺激を与える第２タイミングとの間の時間差を大きく設定する。音響信号を遅延させて再生部１２に渡す制御は、再生部１２で再生される音の強さが変化する第１タイミングと提示部１１により人物Ｐ１に与える視覚刺激の明るさが音の強さの変化に応じて変化する第２タイミングとに時間差を設定する制御の一例である。 Further, the control unit 14 obtains the time for sound to propagate through the distance L1 based on the information indicating the distance L1 received from the first calculation unit 13. In addition, the control unit 14 gives a delay corresponding to the obtained time to the acoustic signal received from the server device SV, and passes the delayed acoustic signal to the reproduction unit 12. As a result, the control unit 14 delays the time during which the sound propagates the distance L1 from the timing at which the intensity of the visual stimulus given to the person P1 by the presentation unit 11 changes according to the sound intensity indicated by the acoustic signal. The sound indicated by the acoustic signal is played back by the playback unit 12 at the determined timing. In other words, the control unit 14 gives a larger delay time to the acoustic signal passed to the reproduction unit 12 as the distance L1 is larger, thereby causing the reproduction unit 12 to give a sound to the person P1 and the presentation unit 11 to the person P1. A large time difference from the second timing for applying the visual stimulus is set. In the control of delaying the acoustic signal and passing it to the playback unit 12, the first timing when the strength of the sound played back by the playback unit 12 changes and the brightness of the visual stimulus given to the person P1 by the presentation unit 11 is the strength of the sound. It is an example of the control which sets a time difference to the 2nd timing which changes according to the change of.

ここで、図１に示した提示部１１により人物Ｐ１の視覚に与える刺激は、音源Ｓとなる物体を表す画像ではなく、音源Ｓから発せられる音の強さに応じた強度を持つ刺激である。また、提示部１１により人物Ｐ１の視覚に与える刺激の強度は、再生部１２によって人物Ｐ１の聴覚に与えられる音の強さの変化とともに、音源Ｓから発せられる同一の音の強さの変化に応じて変化する。このため、提示部１１から与えられる視覚への刺激と、再生部１２によって再生される音とは、人物Ｐ１により、対応付けて認識される。そして、例えば、人物Ｐ１の視覚に与える刺激が変化する第２タイミングと、人物Ｐ１の聴覚に与える音が変化する第１タイミングとの間の時間差を大きくするほど、再生された音を発した音源Ｓまでの距離が大きい印象を人物Ｐ１に与えることができる。すなわち、図１に示した音響再生装置１０は、人物Ｐ１の視覚への刺激と聴覚への刺激とを時間差を持たせて与えることにより、音源Ｓとなる物体を表す画像を表示する機能を用いずに、人物Ｐ１に、音源Ｓまでの距離Ｌ１を認識させることができる。つまり、図１に示した音響再生装置１０は、人物Ｐ１の視界に音源Ｓとなる物体を表す画像が含まれていない場合であっても、人物Ｐ１による音源Ｓまでの距離Ｌ１の認識を支援することができる。 Here, the stimulus given to the vision of the person P1 by the presentation unit 11 shown in FIG. 1 is not an image representing an object that becomes the sound source S but a stimulus having an intensity corresponding to the intensity of the sound emitted from the sound source S. . The intensity of the stimulus given to the visual of the person P1 by the presentation unit 11 is a change in the intensity of the same sound emitted from the sound source S together with the change in the intensity of the sound given to the hearing of the person P1 by the reproduction unit 12. Will change accordingly. For this reason, the visual stimulus given from the presentation unit 11 and the sound reproduced by the reproduction unit 12 are recognized in association with each other by the person P1. For example, a sound source that emits the reproduced sound as the time difference between the second timing at which the stimulus given to the visual of the person P1 changes and the first timing at which the sound given to the hearing of the person P1 changes is increased. The impression that the distance to S is large can be given to the person P1. That is, the sound reproducing device 10 shown in FIG. 1 uses a function of displaying an image representing an object serving as the sound source S by giving a stimulus to the visual and auditory stimuli of the person P1 with a time difference. Instead, the person P1 can recognize the distance L1 to the sound source S. That is, the sound reproducing device 10 illustrated in FIG. 1 supports the recognition of the distance L1 to the sound source S by the person P1 even when the image representing the object serving as the sound source S is not included in the field of view of the person P1. can do.

なお、制御部１４により、第１タイミングと第２タイミングとの間に時間差を設定する手法は、音響信号を遅延させて再生部１２に渡すことで、第１タイミングを第２タイミングに対して遅れさせる制御に限られない。例えば、制御部１４は、再生部１２によって音響信号から音を再生する第１タイミングに対して、提示部１１により、音響信号で示される音の強さに応じた強度を持つ刺激を人物Ｐ１の視覚に与える第２タイミングを遅れさせてもよい。 Note that the method of setting the time difference between the first timing and the second timing by the control unit 14 is to delay the first timing with respect to the second timing by delaying the acoustic signal and passing it to the reproduction unit 12. It is not limited to the control. For example, the control unit 14 causes the presentation unit 11 to apply a stimulus having an intensity corresponding to the intensity of the sound indicated by the acoustic signal to the person P1 with respect to the first timing at which the reproduction unit 12 reproduces the sound from the acoustic signal. You may delay the 2nd timing given to vision.

また、人物Ｐ１の移動に伴って、図１に示した音響再生装置１０は、音響信号で示される音源Ｓからの音とともに、人物Ｐ１と音源Ｓとの距離の変化に応じて調整された時間差で、音響信号から生成された視覚刺激を人物Ｐ１に与える。したがって、図１に示した再生部１２及び提示部１１を装着した状態で移動する人物Ｐ１は、再生部１２で再生される音響と提示部１１によって人物Ｐ１の視覚に与えられる刺激との時間差の変化に基づいて、音源Ｓまでの距離の変化を認識することが可能である。 In addition to the movement of the person P1, the sound reproducing device 10 shown in FIG. 1 adjusts the time difference adjusted according to the change in the distance between the person P1 and the sound source S together with the sound from the sound source S indicated by the sound signal. The visual stimulus generated from the acoustic signal is given to the person P1. Therefore, the person P1 who moves while wearing the reproduction unit 12 and the presentation unit 11 shown in FIG. 1 has a time difference between the sound reproduced by the reproduction unit 12 and the stimulus given to the visual of the person P1 by the presentation unit 11. It is possible to recognize a change in the distance to the sound source S based on the change.

図３は、図１に示した音源Ｓと人物Ｐ１との位置関係の例を示す。なお、図３は、展示会場ＥＨ内を移動中の人物Ｐ１に装着された音響再生装置１０により、展示物ＥＸ１及び人物Ｐ２，Ｐ３，Ｐ４を含む音源Ｓから発せられる音(例えば、音声)を聴取させる例を示す。 FIG. 3 shows an example of the positional relationship between the sound source S and the person P1 shown in FIG. FIG. 3 shows a sound (for example, sound) emitted from the sound source S including the exhibit EX1 and the persons P2, P3, and P4 by the sound reproducing device 10 attached to the person P1 moving in the exhibition hall EH. An example of listening is shown.

図３に示した音源Ｓは、人物Ｐ１とは壁ＷＬによって隔てられた位置にある。すなわち、図３に示した人物Ｐ１と音源Ｓとの位置関係は、人物Ｐ１が音源Ｓを直接に視認することが困難な位置関係の一例である。 The sound source S shown in FIG. 3 is located at a position separated from the person P1 by the wall WL. That is, the positional relationship between the person P1 and the sound source S shown in FIG. 3 is an example of a positional relationship in which it is difficult for the person P1 to visually recognize the sound source S directly.

また、図３の例において、音源Ｓからの音を示す音響信号は、展示物ＥＸ１に近接して配置されたマイクロホンＭＣからサーバ装置ＳＶに送られ、更に、サーバ装置ＳＶからアクセスポイントＡＰ及び携帯端末ＵＥを介して音響再生装置１０に渡される。サーバ装置ＳＶは、マイクロホンＭＣから受けた音響信号を符号化することで、音源Ｓから発せられる音を示す符号化された音情報を生成し、生成した音情報を、アクセスポイントＡＰを介して携帯端末ＵＥに渡してもよい。また、携帯端末ＵＥは、符号化された音情報を受けた場合に、受けた音情報を復号することで音源Ｓからの音を示す音響信号を復元し、復元した音響信号を音響再生装置１０に含まれる図１に示した制御部１４に渡す。 In the example of FIG. 3, the acoustic signal indicating the sound from the sound source S is sent to the server device SV from the microphone MC disposed close to the exhibit EX1, and the server device SV further accesses the access point AP and the mobile phone. It is passed to the sound reproduction device 10 via the terminal UE. The server device SV generates encoded sound information indicating the sound emitted from the sound source S by encoding the acoustic signal received from the microphone MC, and carries the generated sound information via the access point AP. You may pass to the terminal UE. Further, when the mobile terminal UE receives the encoded sound information, the mobile terminal UE decodes the received sound information to restore the acoustic signal indicating the sound from the sound source S, and the restored sound signal is restored to the acoustic reproduction device 10. 1 is transferred to the control unit 14 shown in FIG.

制御部１４は、音源Ｓからの音を示す音響信号に基づいて、図１に示した提示部１１に音源Ｓに対応して明るさが変化する図形などの視覚刺激を表示させるとともに、距離Ｌ１に応じた時間差で図１に示した再生部１２に音源Ｓからの音を再生させる。 Based on the acoustic signal indicating the sound from the sound source S, the control unit 14 causes the presentation unit 11 illustrated in FIG. 1 to display a visual stimulus such as a graphic whose brightness changes corresponding to the sound source S, and the distance L1. The sound from the sound source S is reproduced by the reproducing unit 12 shown in FIG.

図４は、図３に示した音源Ｓから発せられた音の強さの変化と視覚刺激として表示される図形などの輝度の変化及び再生される音の強さの変化との関係を示す。図４において、横軸ｔは時間の経過を示す。 FIG. 4 shows the relationship between the change in the intensity of the sound emitted from the sound source S shown in FIG. 3, the change in the luminance of the graphic displayed as a visual stimulus, and the change in the intensity of the reproduced sound. In FIG. 4, the horizontal axis t indicates the passage of time.

図４(Ａ)は、音源Ｓから発せされる音の音圧の変化の例を示す。図４(Ａ)の縦軸ＳＰは、音圧を示す。 FIG. 4A shows an example of a change in sound pressure of sound emitted from the sound source S. The vertical axis SP in FIG. 4A indicates sound pressure.

図４(Ｂ)は、音源Ｓに対応する視覚刺激として、図１に示した提示部１１の表示画面１１１ａ，１１１ｂの少なくとも一方に表示された図形などの画像の輝度の変化の例を示す。図４(Ｂ)の縦軸Ｂｒは、音源Ｓに対応する視覚刺激として表示された画像の輝度値を示す。 FIG. 4B shows an example of a change in luminance of an image such as a graphic displayed on at least one of the display screens 111a and 111b of the presentation unit 11 shown in FIG. 1 as a visual stimulus corresponding to the sound source S. The vertical axis Br in FIG. 4B represents the luminance value of the image displayed as the visual stimulus corresponding to the sound source S.

図４(Ｃ)は、図１に示した再生部１２によって再生される音の音圧の変化の例を示す。図４(Ｃ)の縦軸ＳＰは、音圧を示す。なお、図４(Ｃ)は、図１に示した制御部１４により、音源Ｓからの音を示す音響信号に対して、人物Ｐ１と音源Ｓとの距離に応じた遅延が付加された後に再生部１２に渡された音響信号から再生された音の音圧の変化を示している。 FIG. 4C shows an example of a change in sound pressure of the sound reproduced by the reproducing unit 12 shown in FIG. The vertical axis SP in FIG. 4C indicates the sound pressure. 4C is reproduced after a delay corresponding to the distance between the person P1 and the sound source S is added to the acoustic signal indicating the sound from the sound source S by the control unit 14 shown in FIG. The change of the sound pressure of the sound reproduced | regenerated from the acoustic signal passed to the part 12 is shown.

例えば、図４(Ａ)，(Ｂ)，(Ｃ)に示した時刻Ｔ０において、図３に示した人物Ｐ１が音源Ｓから距離Ｌ１だけ離れている場合に、時刻Ｔ０における音源Ｓからの音の音圧の変化は、図４(Ｃ)に示すように、距離Ｌ１に対応する時間ＤＬ１だけ遅れて再生される。一方、図４(Ａ)，(Ｂ)の比較から分かるように、提示部１１により、音源Ｓに対応する視覚刺激として表示される画像の輝度は、音源Ｓからの音の音圧の変化とほぼ同期して変化する。 For example, when the person P1 shown in FIG. 3 is separated from the sound source S by the distance L1 at the time T0 shown in FIGS. 4A, 4B, and 4C, the sound from the sound source S at the time T0. The change in sound pressure is reproduced with a delay of time DL1 corresponding to the distance L1, as shown in FIG. On the other hand, as can be seen from the comparison between FIGS. 4A and 4B, the luminance of the image displayed as the visual stimulus corresponding to the sound source S by the presentation unit 11 is the change in the sound pressure of the sound from the sound source S. It changes almost synchronously.

その後、図３に示した人物Ｐ１が展示会場ＥＨ内で移動し、図４(Ａ)，(Ｂ)，(Ｃ)に示した時刻Ｔ１において、人物Ｐ１の位置が図３に破線で示した位置Ｐ１’に変化した場合に、図１に示した再生部１２で再生される音のタイミングは、次のように変化する。位置Ｐ１’と音源Ｓとの距離Ｌ１’が距離Ｌ１よりも短い場合に、時刻Ｔ１における音源Ｓからの音の音圧の変化は、図４(Ｃ)に示すように、時間ＤＬ１よりも短い時間ＤＬ１’だけ遅れて再生される。 Thereafter, the person P1 shown in FIG. 3 moves in the exhibition hall EH, and the position of the person P1 is indicated by a broken line in FIG. 3 at time T1 shown in FIGS. 4 (A), (B), and (C). When it changes to position P1 ', the timing of the sound reproduced | regenerated by the reproducing part 12 shown in FIG. 1 changes as follows. When the distance L1 ′ between the position P1 ′ and the sound source S is shorter than the distance L1, the change in the sound pressure of the sound from the sound source S at time T1 is shorter than the time DL1, as shown in FIG. Playback is delayed by time DL1 '.

図１に示した制御部１４は、例えば、音源Ｓから人物Ｐ１までの距離Ｌ１と、空気中の音速とを用いて、式(３)により、第１タイミングと第２タイミングとの間に設定する時間差を求めてもよい。なお、式(３)において、符号ＤＬは、第１タイミングと第２タイミングとの間の時間差を示し、符号γｓは空気中の音速を示す。また、式(３)において、符号αは、例えば、１以上の実数に設定される係数を示す。
ＤＬ＝α×Ｌ１／γｓ・・・(３)
例えば、制御部１４は、係数αを含む式(３)を用いて算出した時間差ＤＬを遅延させた音響信号を再生部１２に渡す。これにより、聴覚へ刺激を与える第１タイミングと視覚へ刺激を与える第２タイミングとの間の時間差を、距離Ｌ１と距離Ｌ１’との差以上に強調することができる。例えば、人物Ｐ１と音源Ｓとの距離Ｌ１から距離Ｌ１’への変化に応じて、制御部１４によって第１タイミングと第２タイミングとの間に設定される時間差ＤＬは、距離Ｌ１、Ｌ１’の変化分を音が伝播する時間よりも大きく変化させられる。これにより、式(３)において係数αの値を数値「１」として時間差ＤＬを求める場合に比べて、人物Ｐ１は、視覚への刺激と聴覚への刺激との間の時間差ＤＬの変化に基づいて、音源Ｓまでの距離の変化を認識しやすくなる。なお、式(３)に含まれる係数αは、例えば、音源Ｓから人物Ｐ１までの距離Ｌ１が想定される最大値となった場合に、時間差ＤＬの値が数秒程度となる値に設定されることが望ましい。例えば、人物Ｐ１が図３に示した展示会場ＥＨの内部を移動する場合に、係数αは、展示会場ＥＨを示す矩形の対角線の長さで音速γｓを除算した値に、時間差の上限を示す時間(例えば、２秒)を乗じた値に設定される。 The control unit 14 illustrated in FIG. 1 is set between the first timing and the second timing by using the distance L1 from the sound source S to the person P1 and the speed of sound in the air, using Equation (3), for example. You may obtain | require the time difference to do. In Equation (3), the symbol DL indicates the time difference between the first timing and the second timing, and the symbol γs indicates the speed of sound in the air. Moreover, in Formula (3), code | symbol (alpha) shows the coefficient set to 1 or more real numbers, for example.
DL = α × L1 / γs (3)
For example, the control unit 14 passes the acoustic signal obtained by delaying the time difference DL calculated using Expression (3) including the coefficient α to the reproduction unit 12. Thereby, the time difference between the 1st timing which gives a stimulus to hearing, and the 2nd timing which gives a stimulus to vision can be emphasized more than the difference of distance L1 and distance L1 '. For example, the time difference DL set between the first timing and the second timing by the control unit 14 according to the change from the distance L1 between the person P1 and the sound source S to the distance L1 ′ is the distance L1, L1 ′. The amount of change can be changed larger than the time for sound to propagate. As a result, the person P1 is based on the change of the time difference DL between the visual stimulus and the auditory stimulus compared to the case where the time difference DL is obtained by setting the value of the coefficient α to the numerical value “1” in Equation (3). Thus, it becomes easy to recognize a change in the distance to the sound source S. Note that the coefficient α included in the expression (3) is set to a value at which the value of the time difference DL is about several seconds when the distance L1 from the sound source S to the person P1 is an assumed maximum value, for example. It is desirable. For example, when the person P1 moves inside the exhibition hall EH shown in FIG. 3, the coefficient α indicates the upper limit of the time difference to a value obtained by dividing the sound speed γs by the length of the rectangular diagonal line indicating the exhibition hall EH. A value obtained by multiplying time (for example, 2 seconds) is set.

図５は、図１に示した音響再生装置１０の動作を示す。図５に示したステップＳ３０１〜ステップＳ３０５の処理は、図１に示した音響再生装置１０の動作を示すとともに、音響再生プログラムの例を示す。例えば、図５に示す処理は、音響再生装置１０に搭載されたプロセッサが音響再生プログラムを実行することで実現される。なお、図５に示す処理は、音響再生装置１０に搭載されるハードウェアによって実行されてもよい。すなわち、図５は、音響再生プログラムの一実施形態を示す。 FIG. 5 shows the operation of the sound reproducing device 10 shown in FIG. The process of step S301 to step S305 shown in FIG. 5 shows the operation of the sound reproduction device 10 shown in FIG. 1 and an example of the sound reproduction program. For example, the process shown in FIG. 5 is realized by a processor mounted in the sound reproduction device 10 executing a sound reproduction program. Note that the processing shown in FIG. 5 may be executed by hardware mounted on the sound reproduction device 10. That is, FIG. 5 shows an embodiment of a sound reproduction program.

ステップＳ３０１において、図１に示した制御部１４は、図１に示した音源Ｓから発せされる音を示す音響信号を受ける。制御部１４は、図１に示したマイクロホンＭＣにより生成された音響信号をサーバ装置ＳＶから受けてもよいし、図３に示したように、サーバ装置ＳＶ及び携帯端末ＵＥを介して受けてもよい。また、マイクロホンＭＣに無線通信機能が搭載されている場合は、マイクロホンＭＣの無線通信機能により音響信号を音響再生装置１０に送信することで、音響再生装置１０に音響信号を渡してもよい。 In step S301, the control unit 14 shown in FIG. 1 receives an acoustic signal indicating a sound emitted from the sound source S shown in FIG. The control unit 14 may receive the acoustic signal generated by the microphone MC shown in FIG. 1 from the server device SV, or may receive it via the server device SV and the mobile terminal UE as shown in FIG. Good. When the microphone MC has a wireless communication function, the sound signal may be passed to the sound reproduction device 10 by transmitting the sound signal to the sound reproduction device 10 by the wireless communication function of the microphone MC.

ステップＳ３０２において、制御部１４は、受けた音響信号に基づいて、音響信号で示される音の強さに応じて、人物Ｐ１に与える視覚刺激の強度を決定する。制御部１４は、例えば、図２(Ａ)，(Ｂ)，(Ｃ)を用いて説明したように、図形や文字あるいは図形と文字とを組み合わせたマークを表す画像の少なくとも一部の明るさを示す輝度値を、音響信号で示される音の強さに応じて設定する。 In step S302, the control unit 14 determines the intensity of the visual stimulus to be given to the person P1 based on the received acoustic signal according to the intensity of the sound indicated by the acoustic signal. For example, as described with reference to FIGS. 2A, 2 B, and 2 C, the control unit 14 brightness of at least a part of an image representing a graphic, a character, or a mark that combines a graphic and a character. Is set according to the sound intensity indicated by the acoustic signal.

ステップＳ３０３において、図１に示した第１算出部１３は、人物Ｐ１の位置を示す位置情報Ｐｄｐと音源Ｓの位置を示す位置情報Ｐｄｓとを受け、受けた位置情報Ｐｄｓ，Ｐｄｐに基づいて、人物Ｐ１と音源Ｓとの間の距離を算出する。 In step S303, the first calculation unit 13 shown in FIG. 1 receives the position information Pdp indicating the position of the person P1 and the position information Pds indicating the position of the sound source S, and based on the received position information Pds and Pdp, The distance between the person P1 and the sound source S is calculated.

ステップＳ３０４において、制御部１４は、算出された距離に基づいて、例えば、距離に比例する大きさの時間差で、再生部１２によって音響信号で示される音を再生させるとともに、提示部１１によって音響信号で示される音に対応する視覚刺激を与える。 In step S304, based on the calculated distance, the control unit 14 reproduces the sound indicated by the acoustic signal by the reproduction unit 12 with a time difference having a magnitude proportional to the distance, and the presentation unit 11 reproduces the acoustic signal. A visual stimulus corresponding to the sound indicated by is given.

ステップＳ３０５において、制御部１４は、例えば、人物Ｐ１の操作などにより、音源Ｓから発せられた音を示す音響信号に基づいて、音響信号で示される音を再生する処理である音響再生処理の終了が指示されたか否かを判定する。 In step S305, the control unit 14 ends the sound reproduction process that is a process of reproducing the sound indicated by the sound signal based on the sound signal indicating the sound emitted from the sound source S by, for example, the operation of the person P1. Whether or not is instructed is determined.

音響再生処理の終了が指示されていない場合に(ステップＳ３０５の否定判定(ＮＯ))、制御部１４はステップＳ３０１の処理に戻り、音源Ｓから発せられる音から新たに生成される音響信号について、ステップＳ３０１〜ステップＳ３０５の処理を繰り返す。 When the end of the sound reproduction process is not instructed (No determination in step S305 (NO)), the control unit 14 returns to the process of step S301, and for the sound signal newly generated from the sound emitted from the sound source S, Steps S301 to S305 are repeated.

一方、音響再生処理の終了が指示された場合に(ステップＳ３０５の肯定判定(ＹＥＳ))、制御部１４は、音声を再生する処理を終了する。 On the other hand, when the end of the sound reproduction process is instructed (Yes determination in step S305 (YES)), the control unit 14 ends the process of reproducing the sound.

図１に示した音響再生装置１０は、例えば、数ミリ秒〜十数ミリ秒毎に、図５に示したステップＳ３０１〜ステップＳ３０５の処理を繰り返し実行することが望ましい。これにより、音響再生装置１０は、人物Ｐ１の移動に伴う音源Ｓまでの距離の変化を、聴覚へ刺激を与える第１タイミングと視覚へ刺激を与える第２タイミングとの間の時間差の変化に反映させ、人物Ｐ１による音源Ｓまでの距離の把握を支援することができる。 The sound reproducing device 10 shown in FIG. 1 desirably executes the processing of steps S301 to S305 shown in FIG. 5 repeatedly every several milliseconds to several tens of milliseconds, for example. As a result, the sound reproducing device 10 reflects the change in the distance to the sound source S accompanying the movement of the person P1 in the change in the time difference between the first timing that gives the stimulus to the auditory sense and the second timing that gives the stimulus to the visual sense. Thus, it is possible to support the grasp of the distance to the sound source S by the person P1.

なお、図５に示したステップＳ３０１及びステップＳ３０２の処理とステップＳ３０３の処理とは、図５に示したフローチャートとは逆の順序で実行されてもよい。また、ステップＳ３０１及びステップＳ３０２の処理とステップＳ３０３の処理とは、並行して実行されてもよい。 Note that the processing in steps S301 and S302 and the processing in step S303 shown in FIG. 5 may be executed in the reverse order of the flowchart shown in FIG. Moreover, the process of step S301 and step S302 and the process of step S303 may be performed in parallel.

図６は、音声再生装置１０の別実施形態を示す。なお、図６に示した構成要素のうち、図１に示した構成要素と同等のものは、同一の符号で示され、その説明が省略される場合がある。 FIG. 6 shows another embodiment of the audio reproduction device 10. 6 that are equivalent to the components shown in FIG. 1 are denoted by the same reference numerals, and the description thereof may be omitted.

図６に示した音響再生装置１０は、図１に示した提示部１１、再生部１２、第１算出部１３及び制御部１４に加えて、メモリ１３１と第２算出部１５とを含んでいる。 The sound reproduction device 10 illustrated in FIG. 6 includes a memory 131 and a second calculation unit 15 in addition to the presentation unit 11, the reproduction unit 12, the first calculation unit 13, and the control unit 14 illustrated in FIG. .

メモリ１３１は、例えば、音響再生装置１０の外部にあるサーバ装置ＳＶから、サーバ装置ＳＶに音源Ｓについて予め登録された位置を示す位置情報Ｐｄｓを受け、受けた位置情報Ｐｄｓを保持する。また、メモリ１３１は、保持した位置情報Ｐｄｓを第１算出部１３と第２算出部１５との双方に渡す。 For example, the memory 131 receives position information Pds indicating a position registered in advance with respect to the sound source S in the server apparatus SV from the server apparatus SV outside the sound reproducing apparatus 10, and holds the received position information Pds. Further, the memory 131 passes the held position information Pds to both the first calculation unit 13 and the second calculation unit 15.

第２算出部１５は、人物Ｐ１が携帯している携帯端末ＵＥなどから、人物Ｐ１の位置を示す位置情報Ｐｄｐを受けるとともに、例えば、人物Ｐ１の頭部に装着されたジャイロセンサＧｙｒから、人物Ｐ１の頭部の姿勢を示す角度情報Ｐｄａを受ける。また、第２算出部１５は、受けた位置情報Ｐｄｓ、位置情報Ｐｄｐ及び角度情報Ｐｄａに基づいて、人物Ｐ１の頭部の向きを基準とした音源Ｓの方向を算出する。例えば、第２算出部１５は、位置情報Ｐｄｓで示される音源Ｓの位置と位置情報Ｐｄｐで示される人物Ｐ１の位置とを結ぶ線分と、角度情報Ｐｄａで示される人物Ｐ１の頭部の向きとがなす角を、音源Ｓの方向として算出する。 The second calculation unit 15 receives position information Pdp indicating the position of the person P1 from a portable terminal UE or the like carried by the person P1 and, for example, from a gyro sensor Gyr attached to the head of the person P1 The angle information Pda indicating the posture of the head of P1 is received. In addition, the second calculation unit 15 calculates the direction of the sound source S based on the orientation of the head of the person P1 based on the received position information Pds, position information Pdp, and angle information Pda. For example, the second calculation unit 15 uses a line segment connecting the position of the sound source S indicated by the position information Pds and the position of the person P1 indicated by the position information Pdp, and the orientation of the head of the person P1 indicated by the angle information Pda. The angle formed by and is calculated as the direction of the sound source S.

なお、第２算出部１５に渡される角度情報Ｐｄａは、ジャイロセンサＧｙｒによって検知された情報に限らず、人物Ｐ１の頭部の姿勢を示す情報であればよい。例えば、第２算出部１５は、人物Ｐ１の頭部の向きの変化を示す角速度を受けてもよい。角度情報Ｐｄａとして人物Ｐ１の頭部の向きの変化を示す角速度を受けた場合に、第２算出部１５は、受けた角速度から人物Ｐ１の頭部の向きを算出する処理を行い、算出した頭部の向きに基づいて、人物Ｐ１の頭部の向きを基準とした音源Ｓの方向を算出する。 The angle information Pda passed to the second calculation unit 15 is not limited to information detected by the gyro sensor Gyr, and may be information indicating the posture of the head of the person P1. For example, the second calculation unit 15 may receive an angular velocity indicating a change in the orientation of the head of the person P1. When the angular velocity indicating the change in the orientation of the head of the person P1 is received as the angle information Pda, the second calculation unit 15 performs a process of calculating the orientation of the head of the person P1 from the received angular velocity, and the calculated head Based on the direction of the part, the direction of the sound source S with respect to the direction of the head of the person P1 is calculated.

また、図６に示した制御部１４は、生成部１４１と、遅延部１４２と、変換部１４３と、判定部１４４と、調整部１４５とを含んでいる。マイクロホンＭＣにより、音源Ｓから発せられる音から生成された音響信号は、サーバ装置ＳＶに搭載された無線通信機能を介して、生成部１４１と遅延部１４２との双方に渡されている。また、第１算出部１３によって算出された人物Ｐ１と音源Ｓとの距離は、調整部１４５に渡される。また、第２算出部１５によって算出された音源Ｓの方向は、生成部１４１と判定部１４４と変換部１４３とに渡される。 The control unit 14 illustrated in FIG. 6 includes a generation unit 141, a delay unit 142, a conversion unit 143, a determination unit 144, and an adjustment unit 145. The acoustic signal generated from the sound emitted from the sound source S by the microphone MC is passed to both the generation unit 141 and the delay unit 142 via the wireless communication function mounted on the server device SV. The distance between the person P1 and the sound source S calculated by the first calculation unit 13 is passed to the adjustment unit 145. Further, the direction of the sound source S calculated by the second calculation unit 15 is passed to the generation unit 141, the determination unit 144, and the conversion unit 143.

生成部１４１は、マイクロホンＭＣから受けた音響信号をサンプリングすることで得られた電圧値に対して式(１)を用いることにより、数ミリ秒〜十数ミリ秒に設定される所定の期間ごとに、音響信号で示される音の強さを示す実効値を求める。また、生成部１４１は、求めた実効値に基づいて、例えば、式(２)により、人物Ｐ１の視覚に与える刺激の強さを示す輝度値を算出する。また、生成部１４１は、視覚刺激として、提示部１１により、人物Ｐ１の視覚に与える視覚刺激として、表示画面１１１ａ，１１１ｂに図２(Ａ)，(Ｂ)，(Ｃ)に示した図形などを表示するための画像データの輝度値として、算出した輝度値を設定する。 The generation unit 141 uses the expression (1) for the voltage value obtained by sampling the acoustic signal received from the microphone MC, so that the generation unit 141 has a predetermined period set to several milliseconds to several tens of milliseconds. Then, an effective value indicating the intensity of the sound indicated by the acoustic signal is obtained. In addition, based on the calculated effective value, the generation unit 141 calculates, for example, a luminance value indicating the strength of the stimulus given to the vision of the person P1 by Expression (2). Further, the generation unit 141 displays the graphics shown in FIGS. 2A, 2B, and 2C on the display screens 111a and 111b as visual stimuli given to the visual of the person P1 by the presentation unit 11 as visual stimuli. The calculated luminance value is set as the luminance value of the image data for displaying.

また、生成部１４１は、第２算出部１５から渡された音源Ｓの方向を示す情報に基づいて、例えば、音源Ｓが人物Ｐ１の右側にあるか左側にあるかを判断し、判断結果に応じて、表示画面１１１ａ，１１１ｂのいずれかに視覚刺激となる図形等を表示させてもよい。例えば、生成部１４１は、音源Ｓが人物Ｐ１の右側にあると判断した場合に、表示画面１１１ａに視覚刺激となる図形等を表示する画像データを生成する。一方、音源Ｓが人物Ｐ１の左側にあると判断した場合に、生成部１４１は、表示画面１１１ｂに視覚刺激となる図形等を表示する画像データを生成する。 In addition, the generation unit 141 determines, for example, whether the sound source S is on the right side or the left side of the person P1 based on the information indicating the direction of the sound source S passed from the second calculation unit 15, and determines the determination result. In response, a graphic or the like that becomes a visual stimulus may be displayed on either of the display screens 111a and 111b. For example, when the generation unit 141 determines that the sound source S is on the right side of the person P1, the generation unit 141 generates image data that displays a graphic or the like that serves as a visual stimulus on the display screen 111a. On the other hand, when determining that the sound source S is on the left side of the person P1, the generation unit 141 generates image data for displaying a graphic or the like serving as a visual stimulus on the display screen 111b.

また、生成部１４１は、音源Ｓの方向を示す情報に基づいて、音源Ｓの方向を示す角度に対応する表示画面１１１ａ，１１１ｂにおける位置に、視覚刺激を示す図形などを表示させてもよい。例えば、生成部１４１は、人物Ｐ１の頭部の正面から反時計回りに測った所定の角度の範囲を表示画面１１１ａの左右方向の長さとを対応付け、時計回りに図った所定の角度の範囲を表示画面１１１ｂの左右方向の長さとを対応付けてもよい。そして、生成部１４１は、第２算出部１５で算出された角度に対応する表示画面１１１ａ、１１１ｂのいずれかの位置に、視覚刺激となる図形等を表示する画像データを生成する。 Further, the generation unit 141 may display a graphic or the like indicating a visual stimulus at a position on the display screen 111a or 111b corresponding to the angle indicating the direction of the sound source S based on the information indicating the direction of the sound source S. For example, the generation unit 141 associates a range of a predetermined angle measured counterclockwise from the front of the head of the person P1 with a length in the left-right direction of the display screen 111a, and a range of the predetermined angle designed clockwise. May be associated with the length in the left-right direction of the display screen 111b. Then, the generation unit 141 generates image data that displays a graphic or the like that becomes a visual stimulus at any position on the display screens 111 a and 111 b corresponding to the angle calculated by the second calculation unit 15.

生成部１４１によって生成された画像データは、提示部１１に渡され、提示部１１に含まれる表示画面１１１ａ，１１１ｂによる図形やマークなどの表示に用いられる。そして、表示画面１１１ａ，１１１ｂに表示された図形やマークなどは、図形やマークなどが表示された位置に対応する方向からの刺激として人物Ｐ１の視覚に与えられる。したがって、人物Ｐ１は、視野内に表示された図形やマークなどの位置に基づいて、音源Ｓのおおよその方向を認識することができる。 The image data generated by the generation unit 141 is transferred to the presentation unit 11 and used for displaying graphics, marks, and the like on the display screens 111a and 111b included in the presentation unit 11. The figures, marks, etc. displayed on the display screens 111a, 111b are given to the vision of the person P1 as stimuli from the direction corresponding to the positions where the figures, marks, etc. are displayed. Therefore, the person P1 can recognize the approximate direction of the sound source S based on the positions of figures and marks displayed in the field of view.

判定部１４４は、例えば、第２算出部１５によって算出された音源Ｓの方向が、所定の期間以上にわたって人物Ｐ１の頭部の正面を示す方向を示す場合に、人物Ｐ１が音源Ｓに注目していると判定する。ここで、第２算出部１５で得られる音源Ｓの方向は、人物Ｐ１の頭部の向きの変化を反映する情報であり、人物Ｐ１の動きを示す情報の一例である。すなわち、判定部１４４は、人物Ｐ１の動きを示す情報に基づいて、人物Ｐ１が音源Ｓに注目しているか否かを判定する。 For example, the determination unit 144 pays attention to the sound source S when the direction of the sound source S calculated by the second calculation unit 15 indicates a direction indicating the front of the head of the person P1 for a predetermined period or longer. It is determined that Here, the direction of the sound source S obtained by the second calculation unit 15 is information that reflects a change in the orientation of the head of the person P1, and is an example of information that indicates the movement of the person P1. That is, the determination unit 144 determines whether the person P1 is paying attention to the sound source S based on the information indicating the movement of the person P1.

調整部１４５は、判定部１４４により、人物Ｐ１が音源Ｓに注目していると判定された場合に、第１算出部１３によって算出された音源Ｓと人物Ｐ１との距離を短縮する調整を行い、調整後の距離を遅延部１４２に渡す。 The adjustment unit 145 performs adjustment to shorten the distance between the sound source S calculated by the first calculation unit 13 and the person P1 when the determination unit 144 determines that the person P1 is paying attention to the sound source S. Then, the adjusted distance is passed to the delay unit 142.

調整部１４５は、例えば、音源Ｓと人物Ｐ１との距離と、人物Ｐ１が音源Ｓに注目していることを示す判定結果が判定部１４４から得られている時間とに基づいて、式(４)により、調整された距離を算出する。なお、式(４)において、符号Ｌ１ａは、音源Ｓと人物Ｐ１との間の調整された距離を示し、符号Ｌ１は、第１算出部１３によって算出された距離を示す。また、式(４)において、符号ｔｆは、判定部１４４により、人物Ｐ１が音源Ｓに注目しているとされた期間の長さ、即ち注目時間を示す。また、式(４)において、符号Ｒ１は、注目時間ｔｆに応じて、調整された距離Ｌ１ａの短縮される傾向の大きさを示す係数である。また、式(４)において、関数ｍａｘ(Ａ，Ｂ)は、数Ａと数Ｂとのうち大きい方を選択する関数である。
Ｌ１ａ＝ｍａｘ（０，Ｌ１−ｔｆ×Ｒ１）・・・（４）
また、調整部１４５は、調整された距離Ｌ１ａの算出に、式(４)に代えて、式(５)を用いてもよい。なお、式(５)において、式(４)と同等の要素は式(４)と同じ符号で示される。また、式(５)において、符号Ｒ２は、調整された距離Ｌ１ａが値０となるまでの時間を示す係数である。
Ｌ１ａ＝ｍａｘ（０，Ｌ１（１−ｔｆ／Ｒ２））・・・（５）
一方、調整部１４５は、判定部１４４により、人物Ｐ１が音源Ｓに注目していないと判定された場合に、第１算出部１３によって算出された音源Ｓと人物Ｐ１との距離をそのまま遅延部１４２に渡す。 For example, the adjustment unit 145 uses the equation (4) based on the distance between the sound source S and the person P1 and the time when the determination result indicating that the person P1 is paying attention to the sound source S is obtained from the determination unit 144. ) To calculate the adjusted distance. In Expression (4), the symbol L1a indicates the adjusted distance between the sound source S and the person P1, and the symbol L1 indicates the distance calculated by the first calculation unit 13. Further, in Expression (4), the symbol tf indicates the length of the period in which the person P1 is paying attention to the sound source S, that is, the attention time. In the equation (4), the symbol R1 is a coefficient indicating the magnitude of the tendency of the distance L1a adjusted according to the attention time tf to be shortened. In the formula (4), the function max (A, B) is a function for selecting the larger one of the numbers A and B.
L1a = max (0, L1-tf × R1) (4)
Further, the adjustment unit 145 may use Expression (5) instead of Expression (4) for calculation of the adjusted distance L1a. In addition, in Formula (5), the element equivalent to Formula (4) is shown with the same code | symbol as Formula (4). Further, in the equation (5), the symbol R2 is a coefficient indicating the time until the adjusted distance L1a becomes 0.
L1a = max (0, L1 (1-tf / R2)) (5)
On the other hand, when the determination unit 144 determines that the person P1 is not paying attention to the sound source S, the adjustment unit 145 determines the distance between the sound source S calculated by the first calculation unit 13 and the person P1 as it is as a delay unit. 142.

遅延部１４２は、調整部１４５から渡された調整前あるいは調整後の音源Ｓと人物Ｐ１との距離に基づいて、例えば、式(３)を用いることで、音響信号に基づき、音源Ｓからの音を再生する第１タイミングと視覚刺激を与える第２タイミングとの時間差を算出する。また、遅延部１４２は、サーバ装置ＳＶから渡される音響信号に、算出した時間差に相当する遅延を与え、遅延させた音響信号を変換部１４３に渡す。 The delay unit 142 uses the expression (3), for example, based on the distance between the pre-adjustment or post-adjustment sound source S passed from the adjustment unit 145 and the person P1, and based on the acoustic signal, The time difference between the first timing for reproducing the sound and the second timing for applying the visual stimulus is calculated. In addition, the delay unit 142 gives a delay corresponding to the calculated time difference to the acoustic signal delivered from the server device SV, and passes the delayed acoustic signal to the conversion unit 143.

変換部１４３は、例えば、第２算出部１５によって求められた音源Ｓの方向に対応する頭部伝達関数を用いた変換処理を、遅延部１４２から受けた遅延させられた音響信号に対して実行する。すなわち、変換部１４３は、人物Ｐ１の頭部の向きを基準とした音源Ｓの方向に対応する頭部伝達関数を、遅延させられた音響信号に畳み込むことで、人物Ｐ１から見た音源Ｓの方向への音像定位処理を行う。 For example, the conversion unit 143 performs conversion processing using the head-related transfer function corresponding to the direction of the sound source S obtained by the second calculation unit 15 for the delayed acoustic signal received from the delay unit 142. To do. In other words, the conversion unit 143 convolves the head-related transfer function corresponding to the direction of the sound source S with respect to the head direction of the person P1 into the delayed acoustic signal, so that the sound source S viewed from the person P1. Sound image localization processing in the direction.

また、変換部１４３は、音像定位処理後の音響信号を再生部１２に渡し、再生部１２に音像定位処理後の音響信号から音響を再生させることで、人物Ｐ１から見た音源Ｓの方向に音像定位された音響を人物Ｐ１に聴取させる。 In addition, the conversion unit 143 passes the sound signal after the sound image localization process to the reproduction unit 12, and causes the reproduction unit 12 to reproduce sound from the sound signal after the sound image localization process, thereby causing the sound source S to be viewed from the person P1. The person P1 listens to the sound that has been localized.

図６に示した制御部１４は、生成部１４１により、人物Ｐ１の頭部の向きを基準とした音源Ｓの方向に応じた位置に視覚刺激となる図形やマークなどを表示する画像データを生成し、変換部１４３により、人物Ｐ１から見た音源Ｓの方向への音像定位処理を行う。そして、制御部１４は、再生部１２により、音像定位された音響を人物Ｐ１に聴かせるとともに、提示部１１により、音源Ｓの方向に応じた位置に視覚刺激となる図形やマークを表示させる。すなわち、生成部１４１及び変換部１４３を有する制御部１４は、音像定位させた音響と視覚刺激となる図形等を表示させる位置によって音源Ｓの方向を示すとともに、視覚への刺激と音響の再生との間の時間差で音源Ｓまでの距離を示す。これにより、図６に示した制御部１４を含む音響再生装置１０は、音源Ｓの実際の位置が人物Ｐ１の視界に含まれるか否かにかかわらず、人物Ｐ１による音源Ｓの位置の直感的な把握を支援することができる。 The control unit 14 shown in FIG. 6 uses the generation unit 141 to generate image data that displays a graphic or mark that becomes a visual stimulus at a position corresponding to the direction of the sound source S with respect to the head direction of the person P1. Then, the conversion unit 143 performs sound image localization processing in the direction of the sound source S viewed from the person P1. Then, the control unit 14 causes the person P1 to listen to the sound image-localized sound by the reproduction unit 12, and causes the presentation unit 11 to display a graphic or mark that becomes a visual stimulus at a position according to the direction of the sound source S. That is, the control unit 14 having the generation unit 141 and the conversion unit 143 indicates the direction of the sound source S according to the position where the sound image localized sound and the graphic or the like serving as the visual stimulus are displayed, as well as visual stimulation and sound reproduction. The distance to the sound source S is indicated by the time difference between. Thereby, the sound reproducing device 10 including the control unit 14 illustrated in FIG. 6 intuitively determines the position of the sound source S by the person P1 regardless of whether or not the actual position of the sound source S is included in the field of view of the person P1. Can help you understand.

また、図６に示した制御部１４において、調整部１４５は、判定部１４４にて人物Ｐ１が音源Ｓに注目しているとされた場合に、音源Ｓと人物Ｐ１との間の距離を第１算出部１３で算出された距離よりも短くすることで、遅延部１４２で設定される遅延を短縮する。 In the control unit 14 illustrated in FIG. 6, the adjustment unit 145 determines the distance between the sound source S and the person P1 when the determination unit 144 determines that the person P1 is paying attention to the sound source S. The delay set by the delay unit 142 is shortened by making the distance shorter than the distance calculated by the 1 calculating unit 13.

図７は、図６に示した遅延部１４２で設定される遅延時間の例を示す。図７(Ａ)は、調整部１４５において、調整後の距離の算出に式(４)が用いられた場合に遅延部１４２で設定される遅延時間の変化の例を示す。また、図７(Ｂ)は、調整部１４５において、調整後の距離の算出に式(５)が用いられた場合に遅延部１４２で設定される遅延時間の変化の例を示す。なお、図７(Ａ)，(Ｂ)において、横軸ｔは時間を示し、縦軸ＤＬは、遅延時間の大きさを示す。 FIG. 7 shows an example of the delay time set by the delay unit 142 shown in FIG. FIG. 7A shows an example of a change in delay time set by the delay unit 142 when the adjustment unit 145 uses Equation (4) to calculate the adjusted distance. FIG. 7B shows an example of a change in the delay time set by the delay unit 142 when the adjustment unit 145 uses Equation (5) to calculate the adjusted distance. In FIGS. 7A and 7B, the horizontal axis t indicates time, and the vertical axis DL indicates the magnitude of the delay time.

図７(Ａ)に示した遅延時間ＤＬ、即ち、調整部１４５において、調整後の距離の算出に式(４)が用いられた場合の遅延時間ＤＬは、人物Ｐ１が音源Ｓに注目していることが検出された時刻Ｔａから単調に減少する。そして、時刻Ｔａから、第１算出部１３で算出された距離Ｌ１を式(４)に示した係数Ｒ１で除算した値で示される時間Ｌ１／Ｒ１が経過した時刻Ｔａ＋Ｌ１／Ｒ１以降において、遅延時間ＤＬの値は数値０となる。 The delay time DL shown in FIG. 7A, that is, the delay time DL when the adjustment unit 145 uses the equation (4) for calculating the adjusted distance, pays attention to the sound source S by the person P1. It decreases monotonically from the time Ta when it is detected. Then, after time Ta + L1 / R1 when a time L1 / R1 indicated by a value obtained by dividing the distance L1 calculated by the first calculation unit 13 by the coefficient R1 shown in Equation (4) from the time Ta, the delay time is reached. The value of DL is 0.

同様に、図７(Ｂ)に示した遅延時間ＤＬ、即ち、調整部１４５において、調整後の距離の算出に式(５)が用いられた場合の遅延時間ＤＬは、人物Ｐ１が音源Ｓに注目していることが検出された時刻Ｔａから単調に減少する。一方、図７(Ｂ)に示した遅延時間ＤＬは、第１算出部１３で算出された距離Ｌ１にかかわらず、式(５)に示した係数Ｒ２で示される時間が経過した時刻Ｔａ＋Ｒ２以降において数値０となる。 Similarly, the delay time DL shown in FIG. 7B, that is, the delay time DL when the adjustment unit 145 uses the equation (5) to calculate the adjusted distance, It decreases monotonously from the time Ta when it is detected that attention is paid. On the other hand, the delay time DL shown in FIG. 7B is after the time Ta + R2 when the time indicated by the coefficient R2 shown in the equation (5) has passed, regardless of the distance L1 calculated by the first calculation unit 13. The value is 0.

遅延部１４３によって設定される遅延時間は、提示部１１によって音響信号で示される音に対応する強さを持つ刺激が人物Ｐ１の視覚に与えられる第２タイミングと再生部１２により音響信号で示される音が再生される第１タイミングとの時間差に相当する。すなわち、図６に示した制御部１４は、人物Ｐ１が音源Ｓに注目している場合に、第２タイミングと第１タイミングとの時間差を、注目していない場合に比べて短縮することで、人物Ｐ１が音響信号によって示される音源Ｓからの音に集中しやすくすることができる。 The delay time set by the delay unit 143 is indicated by an acoustic signal by the reproduction unit 12 and the second timing when a stimulus having a strength corresponding to the sound indicated by the acoustic signal by the presentation unit 11 is given to the visual sense of the person P1. This corresponds to the time difference from the first timing at which the sound is reproduced. That is, the control unit 14 illustrated in FIG. 6 reduces the time difference between the second timing and the first timing when the person P1 is paying attention to the sound source S as compared with the case where the person P1 is not paying attention, The person P1 can easily concentrate on the sound from the sound source S indicated by the acoustic signal.

なお、調整部１４５により調整後の距離を算出する手法は、式(４)あるいは式(５)を用いる手法に限らず、判定部１４４により、人物Ｐ１が音源Ｓに注目しているとされた場合に、第１算出部１３で算出された距離を短く調整する手法であればよい。例えば、調整部１４５は、注目時間が所定値だけ増大する毎に、調整後の距離を段階的に短縮してもよいし、また、人物Ｐ１が音源Ｓに注目していることが検出された場合に、注目時間の長さにかかわらず、調整後の距離を数値０としてもよい。 The method for calculating the adjusted distance by the adjustment unit 145 is not limited to the method using the formula (4) or the formula (5), and the determination unit 144 has determined that the person P1 is paying attention to the sound source S. In this case, any method may be used as long as the distance calculated by the first calculation unit 13 is adjusted to be short. For example, the adjustment unit 145 may reduce the adjusted distance step by step each time the attention time increases by a predetermined value, or it is detected that the person P1 is paying attention to the sound source S. In this case, the adjusted distance may be set to 0 regardless of the length of the attention time.

図８は、図６に示した提示部１１の別実施形態を示す。図８(Ａ)は、人物Ｐ１の頭部に装着された提示部１１を示す。また、図８(Ｂ)は、人物Ｐ１の頭部の向きを基準とする方向と提示部１１に含まれる発光素子との対応関係を示す。 FIG. 8 shows another embodiment of the presentation unit 11 shown in FIG. FIG. 8A shows the presentation unit 11 worn on the head of the person P1. FIG. 8B shows a correspondence relationship between a direction based on the orientation of the head of the person P 1 and the light emitting elements included in the presentation unit 11.

図８(Ａ)に示した提示部１１において、発光ダイオードなどの複数の発光素子１１３−１，１１３−２，…，１１３−ｋ(ｋは２以上の整数)は、板状の部材１１４上に例えばほぼ等間隔で配置されている。また、発光素子１１３−１，…，１１３−ｋが配置された板状の部材１１４は、例えば、再生部１２から伸びる蔓状の部材１１５により、人物Ｐ１の顔の正面に保持される。 In the presentation unit 11 shown in FIG. 8A, a plurality of light emitting elements 113-1, 113-2,..., 113-k (k is an integer of 2 or more) such as light emitting diodes For example, they are arranged at almost equal intervals. Further, the plate-like member 114 on which the light emitting elements 113-1,..., 113-k are arranged is held in front of the face of the person P1 by a vine-like member 115 extending from the reproducing unit 12, for example.

図８(Ｂ)に示した提示部１１は、７個の発光素子１１３−１，１１３−２，１１３−３，１１３−４，１１３−５，１１３−６，１１３−７を含んでいる。すなわち、図８(Ｂ)は、図８(Ａ)に示した提示部１１において、ｋ＝７である場合を示す。７個の発光素子１１３−１〜１１３−７のそれぞれは、人物Ｐ１の頭部の正面を基準とする所定の角度の範囲を例えば７等分することで得られる７つの範囲φ１，φ２，φ３，φ４，φ５，φ６，φ７のそれぞれに対応付けられている。以下の説明において、ｋ個の発光素子１１３−１〜１１３−ｋは、単に、発光素子１１３と総称される場合がある。 The presentation unit 11 illustrated in FIG. 8B includes seven light emitting elements 113-1, 113-2, 113-3, 113-4, 113-5, 113-6, and 113-7. That is, FIG. 8B illustrates a case where k = 7 in the presentation unit 11 illustrated in FIG. Each of the seven light emitting elements 113-1 to 113-7 has seven ranges φ1, φ2, and φ3 obtained by dividing a range of a predetermined angle with respect to the front of the head of the person P1, for example, into seven equal parts. , Φ4, φ5, φ6, and φ7. In the following description, the k light emitting elements 113-1 to 113-k may be simply referred to as the light emitting element 113 in some cases.

図８(Ｂ)の例において、範囲φ１は、人物Ｐ１の頭部の正面を角度０とし、時計回り方向の角度を正の値で示す角度座標にて、マイナス７０度からマイナス５０度の範囲である。同様に、図８(Ｂ)の例において、範囲φ２は、マイナス５０度からマイナス３０度の範囲であり、範囲φ３は、マイナス３０度からマイナス１０度の範囲である。また、図８(Ｂ)の例において、範囲φ４は、マイナス１０度からプラス１０度の範囲であり、範囲φ５は、プラス１０度からプラス３０度の範囲である。そして、図８(Ｂ)の例において、範囲φ６は、プラス３０度からプラス５０度の範囲であり、範囲φ７は、プラス５０度からプラス７０度の範囲である。 In the example of FIG. 8B, the range φ1 is a range from minus 70 degrees to minus 50 degrees in an angle coordinate in which the front of the head of the person P1 is an angle 0 and the clockwise angle is a positive value. It is. Similarly, in the example of FIG. 8B, the range φ2 is a range from −50 degrees to −30 degrees, and the range φ3 is a range from −30 degrees to −10 degrees. In the example of FIG. 8B, the range φ4 is a range from minus 10 degrees to plus 10 degrees, and the range φ5 is a range from plus 10 degrees to plus 30 degrees. In the example of FIG. 8B, the range φ6 is a range from plus 30 degrees to plus 50 degrees, and the range φ7 is a range from plus 50 degrees to plus 70 degrees.

なお、提示部１１に含まれる発光素子１１３のそれぞれと、人物Ｐ１の頭部の正面を基準とする角度の範囲との対応関係は、図８(Ｂ)の例に限られない。例えば、各発光素子１１３に対応付けられる角度の範囲は、範囲φ１〜φ７のそれぞれで示した角度の範囲よりも大きい範囲でもよいし、逆に、範囲φ１〜φ７のそれぞれで示した角度の範囲よりも小さい範囲でもよい。 Note that the correspondence relationship between each of the light emitting elements 113 included in the presentation unit 11 and the range of angles based on the front of the head of the person P1 is not limited to the example in FIG. 8B. For example, the range of angles associated with each light emitting element 113 may be larger than the range of angles indicated by each of the ranges φ1 to φ7, or conversely, the range of angles indicated by each of the ranges φ1 to φ7. A smaller range may be used.

図８(Ｂ)に示した提示部１１を用いる場合に、図６に示した制御部１４の生成部１４１は、第２算出部１５によって算出された音源Ｓの方向に基づいて、図８(Ｂ)に示した範囲φ１〜φ７の中から、算出された音源Ｓの方向を含む範囲を判別する。また、生成部１４１は、式(２)を用いて算出した輝度値で示される光量を発光素子１１３に放出させるための駆動信号を生成し、判別した範囲に対応付けられた発光素子１１３に対して、生成した駆動信号を供給する。例えば、生成部１４１は、図８に示した発光素子１１３−５を音響信号で示される音の強さに応じた輝度で選択的に発光させることで、人物Ｐ１に対して、音源Ｓの方向が人物Ｐ１の頭部の向きを基準として１０度〜３０度の方向であることを示すことができる。 When the presentation unit 11 shown in FIG. 8B is used, the generation unit 141 of the control unit 14 shown in FIG. 6 uses the direction of the sound source S calculated by the second calculation unit 15 as shown in FIG. A range including the calculated direction of the sound source S is determined from the ranges φ1 to φ7 shown in B). In addition, the generation unit 141 generates a drive signal for causing the light emitting element 113 to emit the light amount indicated by the luminance value calculated using the expression (2), and for the light emitting element 113 associated with the determined range. The generated drive signal is supplied. For example, the generation unit 141 causes the light emitting element 113-5 illustrated in FIG. 8 to selectively emit light at a luminance according to the intensity of the sound indicated by the acoustic signal, so that the direction of the sound source S with respect to the person P1. Can be shown to be a direction of 10 degrees to 30 degrees with respect to the head direction of the person P1.

以上に説明した音響再生装置１０は、例えば、展示会場などにおいて、予め指定された展示の位置を利用者に案内する案内システムなどにおいて有用である。 The sound reproducing device 10 described above is useful, for example, in a guidance system that guides a user to a pre-designated exhibition position in an exhibition hall or the like.

図９は、図６に示した音響再生装置１０を適用した案内システムの例を示す。図９に示した案内システムＧＳは、展示会場ＥＨにおいて、利用者である人物Ｐ１に対して、人物Ｐ１によって予め指定された複数の展示の位置を案内するために、音響再生装置１０の機能を利用する。 FIG. 9 shows an example of a guidance system to which the sound reproducing device 10 shown in FIG. 6 is applied. The guidance system GS shown in FIG. 9 has the function of the sound reproduction device 10 for guiding the positions of a plurality of exhibitions designated in advance by the person P1 to the person P1 who is a user at the exhibition hall EH. Use.

案内システムＧＳは、例えば、音響再生装置１０と、人物Ｐ１が携帯する携帯端末ＵＥと、サーバ装置ＳＶと、展示会場ＥＨに展示されている各展示の位置を示す位置情報及び展示内容を示す情報を蓄積する展示データベースＤＢとを含んでいる。サーバ装置ＳＶは、展示データベースＤＢに接続されており、展示データベースＤＢに蓄積された情報を参照可能である。また、サーバ装置ＳＶと携帯端末ＵＥとは、例えば、ネットワークＮＷ及び展示会場ＥＨに設けられたアクセスポイントＡＰを介して接続されており、互いに情報の授受が可能である。また、携帯端末ＵＥは、サーバ装置ＳＶから受けた情報を音響再生装置１０に渡す機能を有している。 The guidance system GS includes, for example, the position information indicating the position of each exhibit displayed at the exhibition hall EH, and the information indicating the contents of the exhibition, the sound reproducing device 10, the portable terminal UE carried by the person P1, the server device SV, and the like. And an exhibition database DB for storing information. The server device SV is connected to the exhibition database DB and can refer to information stored in the exhibition database DB. In addition, the server device SV and the mobile terminal UE are connected via, for example, the network NW and the access point AP provided in the exhibition hall EH, and can exchange information with each other. Further, the mobile terminal UE has a function of passing information received from the server device SV to the sound reproduction device 10.

展示会場ＥＨにおいて、展示Ｅｘ１−１，Ｅｘ１−２，Ｅｘ１−３，Ｅｘ１−４は、壁ＷＬ１に沿って配置されている。また、展示会場ＥＨにおいて、展示Ｅｘ２−１，Ｅｘ２−２，Ｅｘ２−３，Ｅｘ２−４は、壁ＷＬ２に沿って配置されている。 In the exhibition hall EH, the exhibitions Ex1-1, Ex1-2, Ex1-3, and Ex1-4 are arranged along the wall WL1. In the exhibition hall EH, the exhibits Ex2-1, Ex2-2, Ex2-3, and Ex2-4 are arranged along the wall WL2.

展示データベースＤＢは、展示Ｅｘ１−１〜Ｅｘ１−４及び展示Ｅｘ２−１〜Ｅｘ２−４のそれぞれの位置を示す情報とともに、それぞれの展示内容を示す情報の一部として、展示内容を説明する音声を表す音声情報を蓄積している。 The exhibition database DB includes information indicating the positions of the exhibitions Ex1-1 to Ex1-4 and the exhibitions Ex2-1 to Ex2-4, as well as a sound for explaining the exhibition contents as part of the information indicating the exhibition contents. The voice information that represents is stored.

図９に示した案内システムＧＳは、各展示から発せられる音を表す音響信号の代わりに、展示データベースＤＢに蓄積された音声情報に基づいて生成した音響信号を音響再生装置１０に渡す。そして、音響再生装置１０は、受け取った音響信号から再生される音響を人物Ｐ１の聴覚に与えるとともに、音響信号で示される音の強さに対応する強度を持つ刺激を人物Ｐ１の視覚に与える。 The guidance system GS shown in FIG. 9 passes the acoustic signal generated based on the audio information stored in the exhibition database DB to the acoustic reproduction device 10 instead of the acoustic signal representing the sound emitted from each exhibition. Then, the sound reproducing device 10 gives the sound reproduced from the received sound signal to the hearing of the person P1, and gives a stimulus having an intensity corresponding to the intensity of the sound indicated by the sound signal to the vision of the person P1.

例えば、人物Ｐ１によって予め展示Ｅｘ１−３と展示Ｅｘ２−４とが案内の対象として指定された場合に、音響再生装置１０は、展示データベースＤＢに展示Ｅｘ１−３及び展示Ｅｘ２−４に対応して蓄積された音声情報及び位置情報をサーバ装置ＳＶから受ける。 For example, when the exhibition Ex1-3 and the exhibition Ex2-4 are designated in advance by the person P1 as guidance targets, the sound reproducing device 10 corresponds to the exhibition Ex1-3 and the exhibition Ex2-4 in the exhibition database DB. The stored voice information and position information are received from the server device SV.

例えば、音響再生装置１０は、展示Ｅｘ１−３の位置情報に基づいて、サーバ装置ＳＶから受けた音声情報から再生される音響に対する音像定位処理を制御するとともに、音声情報から再生される音に対応する視覚刺激を示す図形などを表示する位置を制御する。これにより、音響再生装置１０は、サーバ装置ＳＶから受けた音声情報から再生される音に対応する音像が定位された方向及び人物Ｐ１の視覚への刺激として光を放出させる発光素子１１３の位置により、人物Ｐ１に展示Ｅｘ１−３の方向を示すことができる。また、サーバ装置ＳＶから受けた音声情報から音を再生するタイミングと当該音に対応する視覚刺激を与えるタイミングとに時間差を設けることで、人物Ｐ１に展示Ｅｘ１−３までの距離Ｌ１を示すことができる。 For example, the sound reproducing device 10 controls sound image localization processing for sound reproduced from the sound information received from the server device SV based on the position information of the exhibition Ex1-3, and supports sound reproduced from the sound information. Controls the position to display the graphic showing the visual stimulus. As a result, the sound reproducing device 10 determines the sound image corresponding to the sound reproduced from the sound information received from the server device SV in the direction in which the sound image is localized and the position of the light emitting element 113 that emits light as a visual stimulus for the person P1. The direction of the exhibition Ex1-3 can be shown to the person P1. Further, by providing a time difference between the timing of reproducing the sound from the audio information received from the server device SV and the timing of applying the visual stimulus corresponding to the sound, the person P1 can be shown the distance L1 to the exhibition Ex1-3. it can.

同様に、音響再生装置１０は、展示Ｅｘ２−４の位置情報に基づいて、サーバ装置ＳＶから受けた音声情報から再生される音響に対する音像定位処理を制御するとともに、音響情報から再生される音に対応する視覚刺激を示す図形などを表示する位置を制御する。これにより、音響再生装置１０は、サーバ装置ＳＶから受けた音声情報から再生される音に対応する音像が定位された方向及び人物Ｐ１の視覚への刺激として発光させられる発光素子１１３の位置により、人物Ｐ１に展示Ｅｘ２−４の方向を示すことができる。また、サーバ装置ＳＶから受けた音声情報から音を再生するタイミングと当該音に対応する視覚刺激を与えるタイミングとに時間差を設けることで、人物Ｐ１に展示Ｅｘ２−４までの距離Ｌ２を示すことができる。 Similarly, the sound reproduction device 10 controls the sound image localization processing for sound reproduced from the sound information received from the server device SV based on the position information of the exhibition Ex2-4, and converts the sound reproduced from the sound information. Controls the position to display a figure showing the corresponding visual stimulus. As a result, the sound reproducing device 10 can determine the sound image corresponding to the sound reproduced from the sound information received from the server device SV by the direction in which the sound image is localized and the position of the light emitting element 113 that is caused to emit light as a visual stimulus for the person P1. The direction of the exhibition Ex2-4 can be shown to the person P1. Further, by providing a time difference between the timing of reproducing the sound from the audio information received from the server device SV and the timing of applying the visual stimulus corresponding to the sound, the person P1 can be shown the distance L2 to the exhibition Ex2-4. it can.

つまり、音響再生装置１０により展示Ｅｘ１−３及び展示Ｅｘ２−４のそれぞれに対応して再生される音響及び視覚に与えられる刺激に基づいて、人物Ｐ１は、展示Ｅｘ１−３及び展示Ｅｘ２−４のそれぞれの位置を直感的に把握することができる。すなわち、人物Ｐ１は、展示Ｅｘ１−３と展示Ｅｘ２−４との双方が、人物Ｐ１から別の展示Ｅｘ１−４や壁ＷＬ２によって隔てられていても、各展示Ｅｘ１−３、Ｅｘ２−４のおおよその方向と各展示Ｅｘ１−３、Ｅｘ２−４までのおおよその距離を把握できる。例えば、人物Ｐ１は、展示Ｅｘ２−４の方向を示す視覚刺激の明るさが変化するタイミングと対応する音声の強さが変化するタイミングとのズレが、展示Ｅｘ１−３に対応するズレよりも小さいことから、展示Ｅｘ２−４の方が近傍にあることを知ることができる。 That is, based on the sound and the stimuli given to the vision that are reproduced by the sound reproducing device 10 corresponding to each of the exhibition Ex1-3 and the exhibition Ex2-4, the person P1 has the exhibition Ex1-3 and the exhibition Ex2-4. Each position can be grasped intuitively. In other words, the person P1 has approximately the exhibition Ex1-3 and Ex2-4 even though both the exhibition Ex1-3 and the exhibition Ex2-4 are separated from the person P1 by another exhibition Ex1-4 and the wall WL2. And the approximate distance to each exhibition Ex1-3, Ex2-4. For example, for the person P1, the difference between the timing at which the brightness of the visual stimulus indicating the direction of the exhibition Ex2-4 changes and the timing at which the corresponding sound intensity changes is smaller than the deviation corresponding to the exhibition Ex1-3. From this, it can be known that the exhibition Ex2-4 is closer.

なお、図８(Ｂ)に示した提示部１１は、例えば、複数の発光素子を用いて実現可能であり、音源となる展示Ｅｘ１−３，Ｅｘ２−４の高精細な画像を表示する機能を有するヘッドマウントディスプレイ装置などに比べて、ハードウェアが小規模である。また、図２に示した図形やマークなどを表す画像を視覚刺激として人物Ｐ１に与える場合でも、音源となる物体の高精細な画像を表示する機能を有するヘッドマウントディスプレイ装置に比べて、低解像度の表示画面１１１ａ，１１１ｂを用いて実現可能である。すなわち、図９に示した音響再生装置１０を適用した案内システムは、指定された展示(例えば、展示Ｅｘ１−３，Ｅｘ２−４)の写実的な画像を人物Ｐ１に提示するための表示装置を用いる場合に比べて、低コストで実現可能である。 Note that the presentation unit 11 illustrated in FIG. 8B can be realized using, for example, a plurality of light emitting elements, and has a function of displaying high-definition images of the exhibitions Ex1-3 and Ex2-4 serving as sound sources. Compared with the head-mounted display apparatus etc. which have, hardware is small. Further, even when an image representing a figure, a mark, or the like shown in FIG. 2 is given to the person P1 as a visual stimulus, the resolution is lower than that of a head-mounted display device having a function of displaying a high-definition image of an object serving as a sound source. The display screens 111a and 111b can be used. That is, the guidance system to which the sound reproducing device 10 shown in FIG. 9 is applied has a display device for presenting a realistic image of a designated exhibition (for example, exhibition Ex1-3, Ex2-4) to the person P1. Compared with the case where it uses, it is realizable at low cost.

以上に説明した本件開示の音響再生装置１０は、例えば、スマートホンやタブレット型端末などの携帯端末を含むコンピュータ装置を用いて実現することができる。 The sound reproduction device 10 of the present disclosure described above can be realized by using a computer device including a mobile terminal such as a smart phone or a tablet terminal, for example.

図１０は、音響再生装置１０のハードウェア構成の一例を示す。なお、図１０に示した構成要素のうち、図１、図６及び図９に示した構成要素と同等のものは、同一の符号で示され、その説明は省略される場合がある。 FIG. 10 shows an example of a hardware configuration of the sound reproducing device 10. 10 that are equivalent to the components shown in FIGS. 1, 6, and 9 are denoted by the same reference numerals, and the description thereof may be omitted.

図１０に示した携帯端末ＵＥは、人物Ｐ１に所持されているスマートホンやタブレット型端末などである。携帯端末ＵＥは、プロセッサ２１と、メモリ２２と、ＧＰＳセンサＧｐと、汎用インタフェース２４と、音声処理部２５と、ネットワークインタフェース２６と、表示制御部２７と、タッチパネル２８と、入力制御部２９とを含んでいる。図１０に示したプロセッサ２１と、メモリ２２と、ＧＰＳセンサＧｐと、汎用インタフェース２４と、音声処理部２５と、ネットワークインタフェース２６と、表示制御部２７と、タッチパネル２８と、入力制御部２９とは、バスを介して互いに接続されている。 The mobile terminal UE shown in FIG. 10 is a smart phone or a tablet-type terminal possessed by the person P1. The mobile terminal UE includes a processor 21, a memory 22, a GPS sensor Gp, a general-purpose interface 24, an audio processing unit 25, a network interface 26, a display control unit 27, a touch panel 28, and an input control unit 29. Contains. The processor 21, the memory 22, the GPS sensor Gp, the general-purpose interface 24, the voice processing unit 25, the network interface 26, the display control unit 27, the touch panel 28, and the input control unit 29 illustrated in FIG. Are connected to each other via a bus.

携帯端末ＵＥは、ネットワークインタフェース２６を介して、ネットワークＮＷに接続されており、更に、ネットワークＮＷを介してサーバ装置ＳＶに接続されている。また、携帯端末ＵＥは、汎用インタフェース２４を介して、例えば、図８(Ｂ)に示した発光素子１１３−１〜１１３−７を含む提示部１１に接続されている。また、携帯端末ＵＥに含まれる音声処理部２５は、例えば、携帯端末ＵＥの利用者である人物Ｐ１の頭部に装着されたヘッドホン型の再生部１２に接続されている。また、図１０に示した提示部１１は、再生部１２に含まれる耳当て部に蔓状の部材で固定されている。また、ジャイロセンサＧｙｒは、再生部１２に含まれる耳当て部に固定されている。ジャイロセンサＧｙｒによって検知される角度情報Ｐｄａは、汎用インタフェース２４を介してプロセッサ２１に渡される。 The mobile terminal UE is connected to the network NW via the network interface 26, and is further connected to the server device SV via the network NW. The mobile terminal UE is connected to the presentation unit 11 including, for example, the light emitting elements 113-1 to 113-7 illustrated in FIG. 8B via the general-purpose interface 24. In addition, the audio processing unit 25 included in the mobile terminal UE is connected to, for example, a headphone-type playback unit 12 attached to the head of a person P1 who is a user of the mobile terminal UE. In addition, the presentation unit 11 illustrated in FIG. 10 is fixed to the ear pad included in the reproduction unit 12 with a vine-like member. The gyro sensor Gyr is fixed to an ear pad included in the playback unit 12. The angle information Pda detected by the gyro sensor Gyr is passed to the processor 21 via the general-purpose interface 24.

携帯端末ＵＥに含まれるプロセッサ２１と、メモリ２２と、ＧＰＳセンサＧｐと、汎用インタフェース２４と、音声処理部２５と、ネットワークインタフェース２６とは、音響再生装置１０に含まれる。 The processor 21, the memory 22, the GPS sensor Gp, the general-purpose interface 24, the sound processing unit 25, and the network interface 26 included in the mobile terminal UE are included in the sound reproduction device 10.

また、携帯端末ＵＥに含まれる表示制御部２７は、プロセッサ２１からの指示に従って、タッチパネル２８に含まれる表示画面に画像を表示するための制御を行う。また、入力制御部２９は、人物Ｐ１がタッチパネル２８を操作することで入力した指示をプロセッサ２１に渡す機能を持っている。したがって、人物Ｐ１は、タッチパネル２８を操作することで、音響再生装置１０に対して、例えば、提示の対象となる音源を指定する指示や、指定した音源に対応する音響及び視覚刺激を与える処理の開始あるいは終了させる指示などを入力することができる。例えば、人物Ｐ１は、図９に示した展示会場ＥＨ内の見取り図などをタッチパネル２８の表示画面に表示させ、見取り図に含まれる所望の展示を示すアイコン等をタッチすることで、音響再生装置１０によって音響を再生させる音源として指定する。 In addition, the display control unit 27 included in the mobile terminal UE performs control for displaying an image on the display screen included in the touch panel 28 in accordance with an instruction from the processor 21. The input control unit 29 has a function of passing an instruction input by the person P1 operating the touch panel 28 to the processor 21. Therefore, the person P1 operates the touch panel 28 to perform, for example, an instruction for designating a sound source to be presented to the sound reproducing device 10 or a process of giving sound and visual stimulus corresponding to the designated sound source. An instruction to start or end can be input. For example, the person P1 displays the floor plan in the exhibition hall EH shown in FIG. 9 on the display screen of the touch panel 28, and touches an icon or the like indicating a desired exhibition included in the floor plan, so that the sound reproduction device 10 Specify as a sound source to play sound.

図１０に示したメモリ２２は、携帯端末ＵＥのオペレーティングシステムとともに、プロセッサ２１が、指定された音源から発せされる音を再生する処理である音響再生処理を実行するためのアプリケーションプログラムを格納している。なお、音響再生処理を実行するためのアプリケーションプログラムは、例えば、ネットワークインタフェース２６及びネットワークＮＷを介して、サーバ装置ＳＶなどからダウンロードすることで、メモリ２２に読み込ませてもよい。 The memory 22 shown in FIG. 10 stores an application program for the processor 21 to execute an acoustic reproduction process, which is a process for reproducing a sound emitted from a designated sound source, together with the operating system of the mobile terminal UE. Yes. Note that the application program for executing the sound reproduction process may be downloaded to the memory 22 by downloading from the server device SV or the like via the network interface 26 and the network NW, for example.

プロセッサ２１は、音響再生処理のためのアプリケーションプログラムを実行することにより、例えば、図６に示した第１算出部１３と、第２算出部１５と、制御部１４とのそれぞれの機能を果たす。また、図６に示したメモリ１３１の機能は、図１０に示したメモリ２２の記憶領域の一部を用いて実現される。 The processor 21 performs the functions of the first calculation unit 13, the second calculation unit 15, and the control unit 14 illustrated in FIG. 6, for example, by executing an application program for sound reproduction processing. Further, the function of the memory 131 shown in FIG. 6 is realized by using a part of the storage area of the memory 22 shown in FIG.

また、図１０に示した展示データベースＤＢは、図９に示した展示会場ＥＨ内に配置された展示Ｅｘ１−１〜Ｅｘ１−４及び展示Ｅｘ２−１〜Ｅｘ２−４に対応して、それぞれの位置を示す位置情報と展示内容を示す音声情報を蓄積している。また、図１０に示した音響再生装置１０は、例えば、ネットワークＮＷ及びサーバ装置ＳＶを介して、展示データベースＤＢから所望の展示に対応付けられた音声情報を取得することができる。 The exhibition database DB shown in FIG. 10 corresponds to the exhibition Ex1-1 to Ex1-4 and the exhibition Ex2-1 to Ex2-4 arranged in the exhibition hall EH shown in FIG. The position information indicating the voice information indicating the contents of the exhibition is stored. 10 can acquire audio information associated with a desired exhibition from the exhibition database DB via the network NW and the server apparatus SV, for example.

図１１は、図１０に示した音響再生装置１０の動作を示す。図１１に示したステップＳ３０１〜ステップＳ３０５及びステップＳ３１１〜ステップＳ３１６の処理は、図１０に示した音響再生装置１０の動作を示すとともに音響再生プログラムの別実施形態を示す。図１１に示したステップＳ３０１〜ステップＳ３０５及びステップＳ３１１〜ステップＳ３１６の処理は、音響再生処理のためのアプリケーションプログラムに含まれる処理の一例である。また、ステップＳ３０１〜ステップＳ３０５及びステップＳ３１１〜ステップＳ３１６の処理は、図１０に示したプロセッサ２１によって実行される。 FIG. 11 shows the operation of the sound reproducing device 10 shown in FIG. The process of step S301 to step S305 and step S311 to step S316 shown in FIG. 11 shows the operation of the sound reproduction device 10 shown in FIG. 10 and another embodiment of the sound reproduction program. The process of step S301 to step S305 and step S311 to step S316 illustrated in FIG. 11 is an example of the process included in the application program for the sound reproduction process. Further, the processing of step S301 to step S305 and step S311 to step S316 is executed by the processor 21 shown in FIG.

ステップＳ３１１において、プロセッサ２１は、例えば、図１０に示した入力制御部２９から、利用者によるタッチパネル２８への操作で指定された音源を示す情報を受ける。例えば、プロセッサ２１は、表示制御部２７により、図９に示した展示会場ＥＨ内の見取り図などをタッチパネル２８の表示画面に表示させ、利用者に対して、所望の展示を示すアイコン等をタッチする操作を促す。また、プロセッサ２１は、入力制御部２９から利用者がタッチしたアイコン等を示す情報を受けた場合に、受けた情報で示されるアイコン等に対応付けられた展示を、音響再生装置１０によって音響を再生する対象となる音源とする。 In step S311, the processor 21 receives, for example, information indicating the sound source designated by the user's operation on the touch panel 28 from the input control unit 29 illustrated in FIG. For example, the processor 21 causes the display control unit 27 to display a floor plan or the like in the exhibition hall EH shown in FIG. 9 on the display screen of the touch panel 28 and touches an icon or the like indicating a desired exhibition to the user. Encourage operation. Further, when the processor 21 receives information indicating an icon touched by the user from the input control unit 29, the processor 21 displays the display associated with the icon indicated by the received information with the sound reproducing device 10. The sound source to be played back.

また、ステップＳ３１１において、プロセッサ２１は、ステップＳ３１４の処理で用いる変数である静止時間、注目時間及び注目フラグに、それぞれ初期値「０」を設定する。 In step S311, the processor 21 sets initial values “0” for the still time, the attention time, and the attention flag, which are variables used in the process of step S314.

ステップＳ３１２において、プロセッサ２１は、音響再生装置１０によって音響を再生する対象となる音源として指定された展示の位置を示す位置情報を音源の位置を示す位置情報として受ける。例えば、プロセッサ２１は、図１０に示したネットワークインタフェース２６を介して、サーバ装置ＳＶに対して、指定された展示に対応して展示データベースＤＢに蓄積された位置情報を照会する。また、プロセッサ２１は、サーバ装置ＳＶへの照会で得られた位置情報を、音源として指定された展示に対応してメモリ２２に格納する。なお、ステップＳ３１１の処理で、提示の対象となる音源として複数の展示が指定された場合に、プロセッサ２１は、各展示についてそれぞれ位置情報をサーバ装置ＳＶに照会し、照会で得られた位置情報を各展示に対応してメモリ２２に格納する。例えば、図９に示した展示Ｅｘ１−３と展示Ｅｘ２−４とが人物Ｐ１によって指定された場合に、プロセッサ２１は、指定された音源の位置情報として、展示Ｅｘ１−３，Ｅｘ２−４のそれぞれについて展示データベースＤＢに保持された位置情報を取得する。 In step S312, the processor 21 receives position information indicating the position of an exhibition designated as a sound source to be reproduced by the sound reproducing apparatus 10 as position information indicating the position of the sound source. For example, the processor 21 inquires of the server apparatus SV of the location information stored in the exhibition database DB corresponding to the designated exhibition via the network interface 26 shown in FIG. Further, the processor 21 stores the position information obtained by the inquiry to the server device SV in the memory 22 corresponding to the exhibition designated as the sound source. When a plurality of exhibitions are specified as the sound source to be presented in the process of step S311, the processor 21 inquires the server device SV for position information for each exhibition, and the position information obtained by the inquiry. Are stored in the memory 22 corresponding to each exhibition. For example, when the exhibition Ex1-3 and the exhibition Ex2-4 shown in FIG. 9 are designated by the person P1, the processor 21 uses each of the exhibitions Ex1-3 and Ex2-4 as the position information of the designated sound source. The position information held in the exhibition database DB is acquired.

図１１の例では、プロセッサ２１は、ステップＳ３０１の処理に先立って、ステップＳ３０３、ステップＳ３１３〜ステップＳ３１６の処理を実行する。また、プロセッサ２１は、図１１に示したステップＳ３０３、ステップＳ３１３〜ステップＳ３１６、ステップＳ３０１、ステップＳ３０２及びステップＳ３０４を含むループの処理を、数ミリ秒〜十数ミリ秒に設定された所定の時間間隔で実行することが望ましい。 In the example of FIG. 11, the processor 21 executes the processes of step S303 and steps S313 to S316 prior to the process of step S301. Further, the processor 21 performs the processing of the loop including step S303, step S313 to step S316, step S301, step S302, and step S304 shown in FIG. 11 for a predetermined time set to several milliseconds to tens of milliseconds. It is desirable to run at intervals.

ステップＳ３０３において、プロセッサ２１は、ステップＳ３１２の処理で受けた各音源の位置情報に基づいて、音源である展示のそれぞれと人物Ｐ１との間の距離を算出する。例えば、プロセッサ２１は、まず、人物Ｐ１の位置を示す位置情報として、図１０に示したＧＰＳセンサＧｐから携帯端末ＵＥの位置を示す情報を受ける。また、プロセッサ２１は、ＧＰＳセンサＧｐから受けた人物Ｐ１の位置情報とステップＳ３１２の処理で各音源の位置情報として受けた展示Ｅｘ１−３，Ｅｘ２−４の位置情報とに基づいて、人物と展示Ｅｘ１−３，Ｅｘ２−４のそれぞれとの間の距離を算出する。 In step S303, the processor 21 calculates the distance between each of the exhibits that are sound sources and the person P1 based on the position information of each sound source received in the process of step S312. For example, the processor 21 first receives information indicating the position of the mobile terminal UE from the GPS sensor Gp illustrated in FIG. 10 as position information indicating the position of the person P1. The processor 21 also displays the person and the exhibition based on the position information of the person P1 received from the GPS sensor Gp and the position information of the exhibits Ex1-3 and Ex2-4 received as the position information of each sound source in the process of step S312. The distance between each of Ex1-3 and Ex2-4 is calculated.

ステップＳ３１３において、プロセッサ２１は、図１０に示したジャイロセンサＧｙｒから人物Ｐ１の頭部の向きを示す角度情報Ｐｄａを取得し、取得した角度情報Ｐｄａに基づいて、人物Ｐ１の頭部の向きを基準として、各音源の方向を算出する。プロセッサ２１は、例えば、ジャイロセンサＧｙｒから取得した角度情報と、各音源に対応する位置情報と、人物Ｐ１の位置を示す位置情報とに基づいて、人物Ｐ１の頭部の正面方向から測った各音源の方向を示す角度を算出する。 In step S313, the processor 21 acquires angle information Pda indicating the orientation of the head of the person P1 from the gyro sensor Gyr shown in FIG. 10, and determines the orientation of the head of the person P1 based on the acquired angle information Pda. As a reference, the direction of each sound source is calculated. For example, the processor 21 measures each from the front direction of the head of the person P1 based on the angle information acquired from the gyro sensor Gyr, the position information corresponding to each sound source, and the position information indicating the position of the person P1. An angle indicating the direction of the sound source is calculated.

ステップＳ３１４において、プロセッサ２１は、図１０に示した人物Ｐ１の動きを示す情報に基づいて、指定された音源のうち、人物Ｐ１によって注目されている音源を判別する。プロセッサ２１は、人物Ｐ１によって注目されている音源を判別する処理に、例えば、ステップＳ３１３の処理で取得した角度情報Ｐｄａを人物Ｐ１の動きを示す情報として用いる。また、プロセッサ２１は、角度情報Ｐｄａで示される人物Ｐ１の頭部の向きが所定の期間以上にわたって維持され、かつ、指定された音源のいずれかが人物Ｐ１の正面にある場合に、人物Ｐ１の正面方向にある音源を注目されている音源として判別する。プロセッサ２１は、例えば、図１２に示すステップＳ３２１〜ステップＳ３３０の処理を実行することで、注目されている音源を判別してもよい。 In step S 314, the processor 21 determines a sound source that is noticed by the person P 1 among the designated sound sources based on the information indicating the movement of the person P 1 shown in FIG. 10. The processor 21 uses, for example, the angle information Pda acquired in the process of step S313 as information indicating the movement of the person P1 in the process of determining the sound source focused by the person P1. In addition, the processor 21 maintains the head direction of the person P1 indicated by the angle information Pda for a predetermined period or more, and when any of the designated sound sources is in front of the person P1, A sound source in the front direction is determined as a sound source that has received attention. For example, the processor 21 may determine the sound source that has been noticed by executing the processing of steps S321 to S330 shown in FIG.

図１２は、図１１にステップＳ３１４で示した注目されている音源を判別する処理の例を示す。図１２に示したステップＳ３２１〜ステップＳ３３０の処理は、音響再生処理のためのアプリケーションプログラムに含まれる注目されている音源を判別する処理の一例である。また、ステップＳ３２１〜ステップＳ３３０の処理は、図１０に示したプロセッサ２１によって実行される。 FIG. 12 shows an example of the process of determining the focused sound source shown in step S314 in FIG. The process of step S321 to step S330 illustrated in FIG. 12 is an example of a process of determining a focused sound source included in an application program for sound reproduction processing. Further, the processing in steps S321 to S330 is executed by the processor 21 shown in FIG.

ステップＳ３２１において、プロセッサ２１は、図１０に示したジャイロセンサＧｙｒによって過去τ秒間に得られた角度情報Ｐｄａを取得する。プロセッサ２１は、例えば、汎用インタフェース２４を介してジャイロセンサＧｙｒから受けた角度情報Ｐｄａを、１秒程度に設定される時間τにわたってメモリ２２に保持させ、メモリ２２に保持させた角度情報Ｐｄａを取得する。 In step S321, the processor 21 acquires angle information Pda obtained in the past τ seconds by the gyro sensor Gyr shown in FIG. For example, the processor 21 holds the angle information Pda received from the gyro sensor Gyr via the general-purpose interface 24 in the memory 22 for a time τ set to about 1 second, and acquires the angle information Pda held in the memory 22. To do.

ステップＳ３２２において、プロセッサ２１は、ステップＳ３２１の処理で取得した角度情報Ｐｄａを互いに比較することで、角度情報Ｐｄａで示される人物Ｐ１の頭部の向きが所定の閾値で示される範囲に収まっているか否かを判定する。プロセッサ２１は、ステップＳ３２２の処理として、例えば、過去τ秒間の角度情報Ｐｄａで示される人物Ｐ１の頭部の向きが、図８(Ｂ)に示した範囲φ１〜φ７のそれぞれに対応する角度の範囲(例えば２０度)に収まっているか否かを判定してもよい。 In step S322, the processor 21 compares the angle information Pda acquired in the process of step S321 with each other, so that the head direction of the person P1 indicated by the angle information Pda is within the range indicated by the predetermined threshold value. Determine whether or not. For example, the processor 21 performs processing in step S322 as follows. For example, the head direction of the person P1 indicated by the angle information Pda for the past τ seconds is an angle corresponding to each of the ranges φ1 to φ7 illustrated in FIG. You may determine whether it is in the range (for example, 20 degree | times).

人物Ｐ１の頭部の向きの変化が、例えば２０度程度に設定された範囲内に収まっていた場合に(ステップＳ３２２の肯定判定(ＹＥＳ))、プロセッサ２１は、人物Ｐ１の頭部の向きはほぼ一定であると判断し、ステップＳ３２３の処理に進む。 If the change in the orientation of the head of the person P1 is within a range set to about 20 degrees, for example (Yes in step S322 (YES)), the processor 21 determines the orientation of the head of the person P1. It is determined that it is substantially constant, and the process proceeds to step S323.

ステップＳ３２３において、プロセッサ２１は、人物Ｐ１の頭部の向きが一定している時間の長さを示す変数である静止時間に、図１１に示したステップＳ３０３からステップＳ３０５までのループの処理が繰り返される間隔に相当する所定値τ１を加算する。 In step S323, the processor 21 repeats the processing of the loop from step S303 to step S305 shown in FIG. 11 during a stationary time that is a variable indicating the length of time that the head direction of the person P1 is constant. A predetermined value τ1 corresponding to the interval is added.

ステップＳ３２４において、プロセッサ２１は、それまでの処理により、注目フラグに、人物Ｐ１が音源のいずれかに注目している可能性があることを示す値「１」が設定されているか否かを判定する。 In step S324, the processor 21 determines whether or not the value “1” indicating that there is a possibility that the person P1 is paying attention to any of the sound sources is set in the attention flag by the processing so far. To do.

まだ、注目フラグに値「１」が設定されていない場合に（ステップＳ３２４の否定判定(ＮＯ)）、プロセッサ２１は、ステップＳ３２５の処理に進む。 If the value “1” has not yet been set for the attention flag (No at Step S324), the processor 21 proceeds to the process at Step S325.

ステップＳ３２５において、プロセッサ２１は、プロセッサ２１は、静止時間の値が、例えば、数秒程度に設定される所定の閾値以上であるか否かを判定する。 In step S325, the processor 21 determines whether or not the value of the stationary time is equal to or greater than a predetermined threshold set to, for example, about several seconds.

まだ、静止時間が閾値未満である場合に(ステップＳ３２５の否定判定(ＮＯ))、プロセッサ２１は、ステップＳ３２７の処理に進む。 If the still time is still less than the threshold value (No determination in step S325 (NO)), the processor 21 proceeds to the process of step S327.

ステップＳ３２７において、プロセッサ２１は、人物Ｐ１によって注目されている音源はないことを示す情報を出力する。その後、プロセッサ２１は、注目されている音源を判別する処理を終了し、図１１に示したステップ３１５の処理に進む。 In step S327, the processor 21 outputs information indicating that there is no sound source focused by the person P1. Thereafter, the processor 21 ends the process of determining the sound source of interest, and proceeds to the process of step 315 shown in FIG.

ここで、プロセッサ２１は、ステップＳ３０３〜ステップＳ３０５のループに含まれるステップＳ３１４の処理として、図１２に示したステップＳ３２１〜ステップＳ３２９の処理を、例えば、数ミリ秒〜十数ミリ秒毎に実行する。これに伴って、静止時間は、ステップＳ３１４の処理が繰り返される過程で、ステップＳ３２２において、人物Ｐ１の頭部の向きはほぼ一定であると判断されるごとに、ステップＳ３２３の処理で所定値τ１ずつ加算される。 Here, the processor 21 executes the processing of step S321 to step S329 shown in FIG. 12, for example, every few milliseconds to several tens of milliseconds as the processing of step S314 included in the loop of steps S303 to S305. To do. Accordingly, the stationary time is a predetermined value τ1 in the process of step S323 every time it is determined in step S322 that the head direction of the person P1 is substantially constant in the process in which the process of step S314 is repeated. It is added one by one.

そして、ステップＳ３２３の処理において、所定値τ１が加算された静止時間が、静止時間について設定された閾値以上となった場合に（ステップＳ３２５の肯定判定（ＹＥＳ））、プロセッサ２１は、ステップＳ３２６の処理に進む。 Then, in the process of step S323, when the stationary time to which the predetermined value τ1 is added becomes equal to or greater than the threshold set for the stationary time (Yes determination in step S325) (YES in step S325), the processor 21 proceeds to step S326. Proceed to processing.

ステップＳ３２６において、プロセッサ２１は、注目フラグに値「１」を設定することで、人物Ｐ１が音源のいずれかに注目している可能性があることを示す。ステップＳ３２６の処理の終了後に、プロセッサ２１は、ステップＳ３２７の処理に進み、人物Ｐ１によって注目されている音源はないことを示す情報を出力する。その後、プロセッサ２１は、注目されている音源を判別する処理を終了し、図１１に示したステップ３１５の処理に進む。 In step S326, the processor 21 sets a value “1” in the attention flag to indicate that the person P1 may be paying attention to any of the sound sources. After the process of step S326 is completed, the processor 21 proceeds to the process of step S327, and outputs information indicating that there is no sound source focused by the person P1. Thereafter, the processor 21 ends the process of determining the sound source of interest, and proceeds to the process of step 315 shown in FIG.

ステップＳ３２６の処理により、注目フラグに値「１」が設定された後に、再び、ステップＳ３１４の処理を行う過程で、プロセッサ２１は、ステップＳ３２４の肯定判定ルート(ＹＥＳ)に従って、ステップＳ３２８の処理に進む。 After the value “1” is set in the attention flag by the process of step S326, the processor 21 proceeds to the process of step S328 according to the affirmative determination route (YES) of step S324 in the process of performing the process of step S314 again. move on.

ステップＳ３２８において、プロセッサ２１は、図１１に示したステップＳ３１３の処理により、指定された音源のいずれかについて、人物Ｐ１の頭部の正面を示す方向が算出されたか否かを判定する。例えば、プロセッサ２１は、図１１に示したステップＳ３１３の処理で算出された各音源の方向のいずれかが、図８(Ｂ)に示した範囲φ４に含まれるか否かによって、ステップＳ３２５の判定処理を行ってもよい。ここで、図８(Ｂ)に示した範囲φ４は、人物Ｐ１の頭部の正面に相当している。 In step S328, the processor 21 determines whether or not a direction indicating the front of the head of the person P1 has been calculated for any of the designated sound sources by the process of step S313 illustrated in FIG. For example, the processor 21 determines whether the direction of each sound source calculated in the process of step S313 illustrated in FIG. 11 is included in the range φ4 illustrated in FIG. Processing may be performed. Here, the range φ4 shown in FIG. 8B corresponds to the front of the head of the person P1.

指定された音源のそれぞれについて算出された方向が、いずれも人物Ｐ１の頭部の正面を示す方向ではない場合に(ステップＳ３２８の否定判定(ＮＯ))、プロセッサ２１は、ステップＳ３２７の処理に進む。そして、プロセッサ２１は、ステップＳ３２７において、人物Ｐ１によって注目されている音源はないことを示す情報を出力した後に、注目されている音源を判別する処理を終了し、図１１に示したステップ３１５の処理に進む。 When none of the directions calculated for each of the designated sound sources is a direction indicating the front of the head of the person P1 (No determination in step S328 (NO)), the processor 21 proceeds to the process of step S327. . Then, in step S327, the processor 21 outputs the information indicating that there is no sound source that has been noticed by the person P1, and then ends the process of determining the sound source that has been noticed. In step 315 shown in FIG. Proceed to processing.

一方、例えば、図９に示した展示Ｅｘ１−３についてステップＳ３１３の処理で算出した方向が、人物Ｐ１の頭部の正面を示す場合に(ステップＳ３２８の肯定判定(ＹＥＳ))、プロセッサ２１は、ステップＳ３２９の処理に進む。 On the other hand, for example, when the direction calculated in the process of step S313 for the exhibition Ex1-3 illustrated in FIG. 9 indicates the front of the head of the person P1 (Yes determination in step S328 (YES)), the processor 21 The process proceeds to step S329.

ステップＳ３２９において、プロセッサ２１は、ステップＳ３２８の処理で、人物Ｐ１の頭部の正面に当たる方向にあるとされた音源(例えば、展示Ｅｘ１−３)を、人物Ｐ１によって注目されている音源として判別する。また、プロセッサ２１は、人物Ｐ１が音源のいずれかに注目している時間を示す変数である注目時間に、ステップＳ３２３において静止時間に加算された所定値τ１と同じ所定値τ１を加算する。その後、プロセッサ２１は、注目されている音源を判別する処理を終了し、図１１に示したステップＳ３１５の処理に進む。 In step S329, the processor 21 determines the sound source (for example, exhibition Ex1-3) determined to be in the direction hitting the front of the head of the person P1 in the process of step S328 as the sound source attracting attention by the person P1. . Further, the processor 21 adds the predetermined value τ1 that is the same as the predetermined value τ1 added to the stationary time in step S323 to the attention time that is a variable indicating the time during which the person P1 is paying attention to any of the sound sources. Thereafter, the processor 21 ends the process of determining the sound source of interest, and proceeds to the process of step S315 illustrated in FIG.

ところで、人物Ｐ１の頭部の向きが、例えば２０度程度に設定された範囲よりも大きく変化した場合に(ステップＳ３２２の否定判定(ＮＯ))、プロセッサ２１は、ステップＳ３３０の処理に進む。 By the way, when the head direction of the person P1 changes more than a range set to, for example, about 20 degrees (No determination in step S322 (NO)), the processor 21 proceeds to the process of step S330.

ステップＳ３３０において、プロセッサ２１は、静止時間及び注目時間に初期値「０」を設定するとともに、注目フラグをクリアする。その後、プロセッサ２１は、ステップＳ３２７において、人物Ｐ１によって注目されている音源はないことを示す情報を出力した後に、注目されている音源を判別する処理を終了し、図１１に示したステップ３１５の処理に進む。 In step S330, the processor 21 sets initial values “0” for the stationary time and the attention time, and clears the attention flag. After that, in step S327, the processor 21 outputs information indicating that there is no sound source that has been noticed by the person P1, and then ends the process of determining the sound source that has been noticed. In step 315 shown in FIG. Proceed to processing.

以上に説明したステップＳ３２１〜ステップＳ３３０の処理を実行することにより、プロセッサ２１は、静止時間についての閾値以上にわたって、人物Ｐ１の頭部の向きが一定しており、かつ、頭部の正面に音源がある場合に、当該音源が注目されていると判断する。すなわち、プロセッサ２１は、以上に説明したステップＳ３２１〜ステップＳ３３０の処理を実行することにより、図６に示した判定部１４４の機能を果たしている。 By executing the processing of steps S321 to S330 described above, the processor 21 has the head orientation of the person P1 constant over the threshold value for the stationary time, and the sound source in front of the head. If there is, it is determined that the sound source is receiving attention. That is, the processor 21 fulfills the function of the determination unit 144 illustrated in FIG. 6 by executing the processes of steps S321 to S330 described above.

図１１に示したステップＳ３１５において、プロセッサ２１は、ステップＳ３１４の処理の過程で、人物Ｐ１によって注目されている音源が判別されたか否かを判定する。 In step S315 illustrated in FIG. 11, the processor 21 determines whether or not the sound source focused by the person P1 is determined in the process of step S314.

例えば、図１２に示したステップＳ３２９において、展示Ｅｘ１−３が人物Ｐ１によって注目されている音源として判別された場合に、プロセッサ２１は、ステップＳ３１５の肯定判定ルート(ＹＥＳ)に従ってステップＳ３１６に進む。 For example, in step S329 shown in FIG. 12, when the exhibition Ex1-3 is determined as a sound source attracting attention by the person P1, the processor 21 proceeds to step S316 according to the affirmative determination route (YES) of step S315.

ステップＳ３１６において、プロセッサ２１は、式(４)あるいは式(５)を用いて、人物Ｐ１によって注目されている音源までの距離を調整する。例えば、注目されている音源として、図９に示した展示Ｅｘ１−３が判別された場合に、プロセッサ２１は、ステップＳ３１３の処理で展示Ｅｘ１−３と人物Ｐ１との間の距離は、式(４)または式(５)を用いた調整により、元の距離Ｌ１よりも短縮される。その後、プロセッサ２１は、ステップＳ３０１の処理に進む。即ち、プロセッサ２１は、ステップＳ３１６の処理を実行することで、図６に示した調整部１４５の機能を果たす。 In step S316, the processor 21 adjusts the distance to the sound source attracting attention by the person P1 using the formula (4) or the formula (5). For example, when the exhibition Ex1-3 shown in FIG. 9 is determined as the sound source of interest, the processor 21 determines the distance between the exhibition Ex1-3 and the person P1 in the process of step S313 using the formula ( By adjustment using 4) or equation (5), the distance is shorter than the original distance L1. Thereafter, the processor 21 proceeds to the process of step S301. That is, the processor 21 performs the function of the adjustment unit 145 illustrated in FIG. 6 by executing the process of step S316.

一方、ステップＳ３１４の処理により、人物Ｐ１によって注目されている音源が判別されなかった場合に(ステップＳ３１５の否定判定(ＮＯ))、プロセッサ２１は、ステップＳ３１６の処理を行うことなくステップＳ３０１の処理に進む。 On the other hand, if the sound source focused by the person P1 is not determined by the process of step S314 (No determination of step S315 (NO)), the processor 21 performs the process of step S301 without performing the process of step S316. Proceed to

ステップＳ３０１において、プロセッサ２１は、指定された音源のそれぞれから発せられる音を示す音響信号として、サーバ装置ＳＶから、音源として指定された展示のそれぞれに対応して展示データベースＤＢに蓄積された音声情報の一部を受ける。例えば、プロセッサ２１は、展示データベースＤＢに蓄積された音声情報を数ミリ秒〜十数ミリ秒で再生可能な長さで区切り、ステップＳ３０１の処理を実行する毎に、区切られた音声情報を順次に受ける。なお、展示データベースＤＢに蓄積された音声情報が符号化された音声情報である場合に、プロセッサ２１は、例えば、受けた音声情報から、符号化される前の音声を示す音響信号あるいは音響信号の振幅の変化を示す情報を取得する。プロセッサ２１は、符号化される前の音声を示す音響信号あるいは音響信号の振幅の変化を示す情報を取得する際に、図１０に示した音声処理部２５の機能を用いてもよい。また、プロセッサ２１は、取得した音響信号あるいは音響信号の振幅の変化を示す情報を、ステップＳ３０２以下の処理において用いる。 In step S301, the processor 21 stores the audio information stored in the exhibition database DB corresponding to each of the exhibits designated as sound sources from the server device SV as an acoustic signal indicating the sound emitted from each designated sound source. Receive a part of. For example, the processor 21 divides the audio information stored in the exhibition database DB by a length that can be reproduced in several milliseconds to tens of milliseconds, and sequentially executes the divided audio information every time the process of step S301 is executed. To receive. When the audio information stored in the exhibition database DB is encoded audio information, for example, the processor 21 determines whether an audio signal indicating an audio before encoding or an audio signal from the received audio information. Information indicating a change in amplitude is acquired. The processor 21 may use the function of the sound processing unit 25 illustrated in FIG. 10 when acquiring the sound signal indicating the sound before encoding or the information indicating the change in the amplitude of the sound signal. In addition, the processor 21 uses the acquired acoustic signal or information indicating a change in the amplitude of the acoustic signal in the processing of step S302 and subsequent steps.

ステップＳ３０２において、プロセッサ２１は、ステップＳ３０１の処理で受けた音響信号のそれぞれに基づいて、図１０に示した提示部１１によって人物Ｐ１の視覚に与える刺激の強度を決定する。例えば、プロセッサ２１は、各音響信号の振幅の変化を示す情報と式(１)及び式(２)とを用いて、音響信号で示される音の強さに応じた輝度値を算出し、算出した輝度値を視覚刺激の強度とする。即ち、プロセッサ２１は、ステップＳ３０２の処理を実行することで、図６に示した生成部１４１の機能を果たす。 In step S302, the processor 21 determines the intensity of the stimulus given to the vision of the person P1 by the presentation unit 11 illustrated in FIG. 10 based on each of the acoustic signals received in the process of step S301. For example, the processor 21 uses the information indicating the change in the amplitude of each acoustic signal and the expressions (1) and (2) to calculate a luminance value corresponding to the intensity of the sound indicated by the acoustic signal. The obtained luminance value is set as the intensity of the visual stimulus. That is, the processor 21 performs the function of the generation unit 141 illustrated in FIG. 6 by executing the process of step S302.

ステップＳ３０４において、プロセッサ２１は、人物Ｐ１と各音源との間の距離に応じた時間差で、ステップＳ３０１の処理で各音源に対応して取得した音響信号から再生した音響と、音響信号で示される音の強さに対応する強度を持つ刺激とを人物Ｐ１に与える。プロセッサ２１は、指定された音源のうち、ステップＳ３１４の処理で人物Ｐ１によって注目されていることが示された音源については、ステップＳ３１６の処理で調整された距離に応じた時間差を適用する。一方、人物Ｐ１によって注目されていることが示された音源以外の他の音源について、プロセッサ２１は、ステップＳ３０３の処理で算出された距離に応じた時間差を適用する。プロセッサ２１は、例えば、図１３に示すステップＳ３３１〜ステップＳ３３５の各処理を実行することにより、図１０に示した提示部１１による視覚刺激の提示と再生部１２による音の再生とを制御する。 In step S304, the processor 21 is indicated by the sound and the sound reproduced from the sound signal acquired corresponding to each sound source in the process of step S301 with the time difference according to the distance between the person P1 and each sound source. A stimulus having an intensity corresponding to the intensity of the sound is given to the person P1. The processor 21 applies the time difference corresponding to the distance adjusted in the process of step S316 to the sound source indicated as being noticed by the person P1 in the process of step S314 among the designated sound sources. On the other hand, the processor 21 applies a time difference according to the distance calculated in the process of step S303 for sound sources other than the sound source indicated to be noticed by the person P1. For example, the processor 21 controls the presentation of the visual stimulus by the presentation unit 11 and the reproduction of the sound by the reproduction unit 12 illustrated in FIG. 10 by executing the processes of steps S331 to S335 illustrated in FIG.

図１３は、図１１にステップＳ３０４で示した音及び視覚刺激を与える処理の例を示す。図１３に示したステップＳ３３１〜ステップＳ３３５の処理は、音響再生処理のためのアプリケーションプログラムに含まれる音及び視覚刺激を与える処理の一例である。また、ステップＳ３３１〜ステップＳ３３５の処理は、図１０に示したプロセッサ２１によって実行される。 FIG. 13 shows an example of the process for giving the sound and visual stimulus shown in step S304 in FIG. The process of step S331 to step S335 shown in FIG. 13 is an example of a process for providing sound and visual stimulus included in the application program for the sound reproduction process. Further, the processing in steps S331 to S335 is executed by the processor 21 shown in FIG.

ステップＳ３３１において、プロセッサ２１は、例えば、式(３)を用いて、図１１に示したステップＳ３０３の処理で算出された距離、あるいは、図１１に示したステップＳ３１６の処理で調整された距離から、距離に応じた時間差を算出する。例えば、図９に示した展示Ｅｘ１−３に人物Ｐ１が注目しているとされた場合に、プロセッサ２１は、ステップＳ３１６の処理で調整された距離に基づいて、式(３)で示される時間差を求める。このため、展示Ｅｘ１−３に対応して取得した音響信号を再生する第１タイミングと、当該音響信号に対応する視覚刺激を与える第２タイミングとの間の時間差は、図９に示した距離Ｌ１に対応する時間差に比べて短い値となる。一方、プロセッサ２１は、展示Ｅｘ２−４についてステップＳ３０３の処理で得られた距離を用いて、展示Ｅｘ２−４に対応して取得した音響信号を再生する第１タイミングと、当該音響信号に対応する視覚刺激を与える第２タイミングとの間の時間差を求める。 In step S331, the processor 21 uses, for example, the equation (3) to calculate the distance calculated in the process of step S303 shown in FIG. 11 or the distance adjusted in the process of step S316 shown in FIG. The time difference according to the distance is calculated. For example, when it is determined that the person P1 is paying attention to the exhibition Ex1-3 illustrated in FIG. 9, the processor 21 calculates the time difference represented by Expression (3) based on the distance adjusted in the process of step S316. Ask for. For this reason, the time difference between the 1st timing which reproduces the sound signal acquired corresponding to exhibition Ex1-3, and the 2nd timing which gives the visual stimulus corresponding to the sound signal concerned is distance L1 shown in FIG. The time difference is shorter than the time difference corresponding to. On the other hand, the processor 21 uses the distance obtained in the process of step S303 for the exhibition Ex2-4, and corresponds to the first timing for reproducing the acoustic signal acquired corresponding to the exhibition Ex2-4 and the acoustic signal. A time difference from the second timing for applying the visual stimulus is obtained.

ステップＳ３３２において、プロセッサ２１は、ステップＳ３３１の処理で算出した時間差に基づき、ステップＳ３０１で取得した各音声情報に対応する音響を再生する第１タイミングと、当該音響に対応する視覚刺激を与える第２タイミングとを設定する。例えば、プロセッサ２１は、図９に示した展示Ｅｘ１−３に対応して取得した音声情報と、展示Ｅｘ２−４に対応して取得した音声情報とについて、それぞれ第１タイミングと第２タイミングとを設定する。プロセッサ２１は、例えば、ステップＳ３０２の処理により、音響の強さに応じて強度が決定された視覚刺激を与える第２タイミングに対して、ステップＳ３２３の処理で算出した時間差だけ遅れさせたタイミングを、音響を再生する第１タイミングとして設定する。 In step S332, based on the time difference calculated in step S331, the processor 21 reproduces the sound corresponding to each piece of audio information acquired in step S301, and the second timing for giving the visual stimulus corresponding to the sound. Set the timing. For example, the processor 21 sets the first timing and the second timing for the audio information acquired corresponding to the exhibition Ex1-3 shown in FIG. 9 and the audio information acquired corresponding to the exhibition Ex2-4, respectively. Set. For example, the processor 21 delays the timing delayed by the time difference calculated in the process of step S323 with respect to the second timing that gives the visual stimulus whose intensity is determined according to the intensity of the sound by the process of step S302. It is set as the first timing for reproducing sound.

即ち、プロセッサ２１は、ステップＳ３３１及びステップＳ３３２の処理を実行することで、図６に示した遅延部１４２の機能を果たす。 That is, the processor 21 performs the functions of the delay unit 142 illustrated in FIG. 6 by executing the processes of steps S331 and S332.

ステップＳ３３３において、プロセッサ２１は、図１１のステップＳ３１３で求めた各音源の方向に基づいて、それぞれ音像定位処理を実行することで、人物Ｐ１の頭部の向きを基準とした音源の方向に音像定位された音響を表す音響信号を生成する。つまり、プロセッサ２１は、展示Ｅｘ１−３に対応して取得した音声情報から生成された音響信号に対して、ステップＳ３１３で算出された展示Ｅｘ１−３の方向への音像定位処理を行う。同様に、プロセッサ２１は、展示Ｅｘ２−４に対応して取得した音声情報から生成された音響信号に対して、ステップＳ３１３で算出された展示Ｅｘ２−４の方向への音像定位処理を行う。なお、プロセッサ２１は、音像定位処理を実行する際に、図１０に示した音声処理部２５に搭載された機能を利用してもよい。 In step S333, the processor 21 performs a sound image localization process based on the direction of each sound source obtained in step S313 of FIG. 11, thereby obtaining a sound image in the direction of the sound source with reference to the head direction of the person P1. An acoustic signal representing the localized sound is generated. That is, the processor 21 performs sound image localization processing in the direction of the exhibition Ex1-3 calculated in step S313 on the acoustic signal generated from the audio information acquired corresponding to the exhibition Ex1-3. Similarly, the processor 21 performs sound image localization processing in the direction of the exhibition Ex2-4 calculated in step S313 on the acoustic signal generated from the audio information acquired corresponding to the exhibition Ex2-4. The processor 21 may use a function installed in the audio processing unit 25 shown in FIG. 10 when executing the sound image localization process.

即ち、プロセッサ２１は、ステップＳ３３３の処理を実行することで、図６に示した変換部１４３の機能を果たす。 That is, the processor 21 performs the function of the conversion unit 143 illustrated in FIG. 6 by executing the process of step S333.

ステップＳ３３４において、プロセッサ２１は、図１１のステップＳ３１３の処理で求めた音源のそれぞれの方向に基づいて、図８(Ｂ)に示した発光素子１１３−１〜１１３−７の中から、音源に対応する視覚刺激として発光させる発光素子１１３を選択する。プロセッサ２１は、図８を用いて説明したように、ステップＳ３１３の処理で求めた音源の方向を含む範囲に対応付けられた発光素子１１３を選択する。例えば、プロセッサ２１は、図９に示した展示Ｅｘ１−３の方向として、人物Ｐ１の頭部のほぼ正面を示す角度を得た場合に、図８(Ｂ)に示した範囲φ４に対応する発光素子１１３−４を展示Ｅｘ１−３に対応して選択する。一方、プロセッサ２１は、図９に示した展示Ｅｘ２−４の方向として、例えば、人物Ｐ１の頭部の正面を基準として、図８(Ｂ)に示した範囲φ２に含まれる角度を得た場合に、範囲φ２に対応する発光素子１１３−２を展示Ｅｘ２−４に対応して選択する。 In step S334, the processor 21 selects a sound source from among the light emitting elements 113-1 to 113-7 shown in FIG. 8B based on the directions of the sound sources obtained in the process of step S313 in FIG. The light emitting element 113 that emits light as the corresponding visual stimulus is selected. As described with reference to FIG. 8, the processor 21 selects the light emitting element 113 associated with the range including the direction of the sound source obtained in step S 313. For example, the processor 21 emits light corresponding to the range φ4 shown in FIG. 8B when the angle indicating the front of the head of the person P1 is obtained as the direction of the exhibition Ex1-3 shown in FIG. The element 113-4 is selected corresponding to the exhibition Ex1-3. On the other hand, when the processor 21 obtains an angle included in the range φ2 shown in FIG. 8B with respect to the front of the head of the person P1, for example, as the direction of the exhibition Ex2-4 shown in FIG. In addition, the light emitting element 113-2 corresponding to the range φ2 is selected corresponding to the exhibition Ex2-4.

ステップＳ３３５において、プロセッサ２１は、ステップＳ３３２の処理で音源毎に設定された第１タイミングで、図１０に示した再生部１２に音響を再生させるとともに、音源毎に設定された第２タイミングで提示部１１に人物Ｐ１の視覚への刺激を出力させる。プロセッサ２１は、例えば、内部のクロックで示される時刻が、展示Ｅｘ１−３につき設定した第１タイミングを示したときに、展示Ｅｘ１−３に対応して取得した音声情報から生成した音響信号の再生部１２による再生を開始させる。また、プロセッサ２１は、展示Ｅｘ１−３につき設定した第２タイミングにおいて、展示Ｅｘ１−３に対応して選択した発光素子１１３に、展示Ｅｘ１−３に対応して決定した輝度の光を放出させる駆動信号を供給することで、人物Ｐ１の視覚に刺激を与える。同様に、プロセッサ２１は、例えば、時刻が、展示Ｅｘ２−４につき設定した第１タイミングを示したときに、展示Ｅｘ２−４に対応して取得した音声情報から生成した音響信号の再生部１２による再生を開始させる。また、プロセッサ２１は、展示Ｅｘ２−４につき設定した第２タイミングにおいて、展示Ｅｘ２−４に対応して選択した発光素子１１３に、展示Ｅｘ２−４に対応して決定した輝度の光を放出させる駆動信号を供給することで、人物Ｐ１の視覚に刺激を与える。 In step S335, the processor 21 causes the reproduction unit 12 illustrated in FIG. 10 to reproduce sound at the first timing set for each sound source in the process of step S332, and presents it at the second timing set for each sound source. The unit 11 is caused to output a visual stimulus of the person P1. For example, when the time indicated by the internal clock indicates the first timing set for the exhibition Ex1-3, the processor 21 reproduces the acoustic signal generated from the audio information acquired corresponding to the exhibition Ex1-3. Playback by the unit 12 is started. Further, the processor 21 causes the light emitting element 113 selected corresponding to the exhibition Ex1-3 to emit light having the luminance determined corresponding to the exhibition Ex1-3 at the second timing set for the exhibition Ex1-3. By supplying a signal, a stimulus is given to the vision of the person P1. Similarly, for example, when the time indicates the first timing set for the exhibition Ex2-4, the processor 21 uses the acoustic signal reproducing unit 12 generated from the audio information acquired corresponding to the exhibition Ex2-4. Start playback. Further, the processor 21 causes the light emitting element 113 selected corresponding to the exhibition Ex2-4 to emit light having the luminance determined corresponding to the exhibition Ex2-4 at the second timing set for the exhibition Ex2-4. By supplying a signal, a stimulus is given to the vision of the person P1.

また、プロセッサ２１は、ステップＳ３３１〜ステップＳ３３５の処理の終了後に、図１１に示したステップＳ３０５の処理に進む。 The processor 21 proceeds to the process of step S305 illustrated in FIG. 11 after the processes of step S331 to step S335 are completed.

ステップＳ３０５において、プロセッサ２１は、音響の再生及び音響に対応する視覚刺激の提示を終了するか否かを判定する。 In step S305, the processor 21 determines whether or not to end the reproduction of the sound and the presentation of the visual stimulus corresponding to the sound.

まだ、音響の再生及び音響に対応する視覚刺激の提示を終了することを示す指示を受けていない場合に(ステップＳ３０５の否定判定(ＮＯ))、プロセッサ２１は、ステップＳ３０３の処理に戻り、各音源に対応して新たに受ける音響信号についての処理を開始する。プロセッサ２１は、例えば、ステップＳ３０１の処理で取得する音声情報の一つの区切りを再生する時間(例えば、数ミリ秒〜十数ミリ秒)ごとに、ステップＳ３０３〜ステップＳ３０５の処理を繰り返すことが望ましい。 When the instruction indicating that the reproduction of the sound and the presentation of the visual stimulus corresponding to the sound are not yet received has been received (No determination in step S305 (NO)), the processor 21 returns to the process of step S303, The processing for the newly received sound signal corresponding to the sound source is started. For example, the processor 21 desirably repeats the processing in steps S303 to S305 every time (for example, several milliseconds to several tens of milliseconds) for reproducing one segment of the audio information acquired in the processing in step S301. .

一方、図１０に示した入力制御部２９から、音響の再生及び音響に対応する視覚刺激の提示を終了することを示す指示を受けた場合に(ステップＳ３０５の肯定判定(ＹＥＳ))、プロセッサ２１は、音響再生処理のためのアプリケーションプログラムの処理を終了する。 On the other hand, when receiving an instruction from the input control unit 29 shown in FIG. 10 to end the reproduction of the sound and the presentation of the visual stimulus corresponding to the sound (Yes in step S305 (YES)), the processor 21 Ends the processing of the application program for the sound reproduction process.

図９から図１３を用いて説明したように、図１０に示した音響再生装置１０は、例えば、展示Ｅｘ１−３，Ｅｘ２−４に対応する説明音声の再生と説明音声に対応する視覚刺激の提示とに時間差を設定することで、人物Ｐ１に距離の違いを直感的に知覚させる。また、図１０に示した音響再生装置１０は、再生部１２に再生させる音響に音像定位処理を適用し、また、音源の方向に応じて選択された発光素子を点灯させることで、人物Ｐ１に対して音源として指定された展示Ｅｘ１−３，Ｅｘ２−４の方向を示すことができる。すなわち、図１０に示した音響再生装置１０は、展示会場ＥＨなどにおける複数の展示Ｅｘ１−３，Ｅｘ２−４の位置を、人物Ｐ１に直感的に把握させることができる。 As described with reference to FIGS. 9 to 13, for example, the sound reproducing device 10 illustrated in FIG. 10 reproduces the explanation sound corresponding to the exhibitions Ex1-3 and Ex2-4 and performs visual stimulation corresponding to the explanation sound. By setting a time difference to the presentation, the person P1 intuitively perceives the difference in distance. 10 applies a sound image localization process to the sound to be reproduced by the reproduction unit 12, and turns on the light emitting element selected according to the direction of the sound source, thereby causing the person P1 to turn on. On the other hand, the directions of the exhibits Ex1-3 and Ex2-4 designated as sound sources can be shown. That is, the sound reproducing device 10 illustrated in FIG. 10 can make the person P1 intuitively grasp the positions of the plurality of exhibitions Ex1-3 and Ex2-4 in the exhibition hall EH and the like.

ここで、図１０に示した音響再生装置１０は、音響が再生されるタイミングと視覚刺激の強度が音響の強さに応じて変化するタイミングとの時間差により、人物Ｐ１に音源までの距離を示す。したがって、図１０に示した音響再生装置１０は、人物Ｐ１の視界に、音源となる展示Ｅｘ１−３、Ｅｘ２−４が含まれているか否かにかかわらず、音源となる展示Ｅｘ１−３、Ｅｘ２−４までの距離の人物Ｐ１による直感的な把握を支援することができる。図１０に示した音響再生装置１０が有するこの特徴は、広い展示会場などで、人物Ｐ１に、所望の展示の位置を案内するシステムなどへの適用において有用である。 Here, the sound reproducing device 10 illustrated in FIG. 10 indicates the distance to the sound source from the person P1 by the time difference between the timing at which the sound is reproduced and the timing at which the intensity of the visual stimulus changes according to the intensity of the sound. . Therefore, the sound reproducing device 10 shown in FIG. 10 has the exhibits Ex1-3 and Ex2 as sound sources regardless of whether or not the views Ex1-3 and Ex2-4 as sound sources are included in the field of view of the person P1. Intuitive grasping by the person P1 at a distance up to -4 can be supported. This characteristic of the sound reproducing device 10 shown in FIG. 10 is useful in application to a system or the like for guiding a person P1 to a desired display position in a large exhibition hall or the like.

更に、図１０に示した音響再生装置１０は、人物Ｐ１が注目している音源を判別し、判別した音源に対応する音響信号から生成される視覚への刺激と聴覚への刺激との時間差を短縮する調整を行う。すなわち、図１０に示した音響再生装置１０は、人物Ｐ１が注目している音源に対応する音響信号から再生された音響を、実際の音源までの距離よりも近くにある音源からの音響として再生する。そして、人物Ｐ１が注目している音源に対応する音響信号から生成される視覚への刺激と聴覚への刺激との時間差が短縮されたことにより、人物Ｐ１は、注目している音源からの音響を聴くことに集中することが可能となる。 Furthermore, the sound reproducing device 10 shown in FIG. 10 determines the sound source that the person P1 is paying attention to, and calculates the time difference between the visual stimulus and the auditory stimulus generated from the acoustic signal corresponding to the determined sound source. Make adjustments to shorten. That is, the sound reproducing device 10 shown in FIG. 10 reproduces the sound reproduced from the sound signal corresponding to the sound source focused on by the person P1 as the sound from the sound source closer to the actual sound source. To do. Then, the time difference between the visual stimulus and the auditory stimulus generated from the acoustic signal corresponding to the sound source focused on by the person P1 is shortened, so that the person P1 receives the sound from the focused sound source. It becomes possible to concentrate on listening.

以上の詳細な説明により、実施形態の特徴点及び利点は明らかになるであろう。これは、特許請求の範囲が、その精神及び権利範囲を逸脱しない範囲で、前述のような実施形態の特徴点及び利点にまで及ぶことを意図するものである。また、当該技術分野において通常の知識を有する者であれば、あらゆる改良及び変更を容易に想到できるはずである。したがって、発明性を有する実施形態の範囲を前述したものに限定する意図はなく、実施形態に開示された範囲に含まれる適当な改良物及び均等物に拠ることも可能である。 From the above detailed description, features and advantages of the embodiment will become apparent. It is intended that the scope of the claims extend to the features and advantages of the embodiments described above without departing from the spirit and scope of the right. Further, any improvement and change should be easily conceived by those having ordinary knowledge in the technical field. Therefore, there is no intention to limit the scope of the inventive embodiments to those described above, and appropriate improvements and equivalents included in the scope disclosed in the embodiments can be used.

以上の説明に関して、更に、以下の各項を開示する。
(付記１)
音源から発せられる音を示す音響信号を受け、受けた音響信号から再生した音を人物の聴覚に与える再生部と、
前記音響信号で示される音の強さの変化に応じて変化する強度を持つ刺激を前記人物の視覚に与える提示部と、
前記音源の位置を示す情報と前記人物の位置を示す情報とに基づいて、前記音源と前記人物との距離を算出する第１算出部と、
前記再生部によって前記人物の聴覚に音を与える第１タイミングと前記提示部によって前記人物の視覚に前記刺激を与える第２タイミングとの間の時間差を、前記第１算出部で算出された距離に応じて変更する制御を行う制御部と、
を有することを特徴とする音響再生装置。
(付記２)
付記１に記載の音響再生装置において、
前記制御部は、
前記人物の動きを示す情報に基づいて、前記人物が前記音源に注目しているか否かを判定する判定部を有し、
前記判定部により、前記人物が前記音源に注目しているとされた場合に、前記音源と前記人物との距離に応じて設定される値よりも、前記時間差を小さくする制御を行う
ことを特徴とする音響再生装置。
(付記３)
付記２に記載の音響再生装置において、
前記制御部は、
前記人物が前記音源に注目している時間が長いほど前記時間差を小さくする制御を行う
ことを特徴とする音響再生装置。
(付記４)
付記１乃至付記３のいずれか１に記載の音響再生装置において、
前記音源の位置を示す情報と前記人物の位置を示す情報とに基づいて、前記人物の位置を基準とする前記音源の方向を求める第２算出部を有し、
前記提示部は、前記第２算出部によって求められた前記音源の方向から前記人物の視覚に前記刺激を与え、
前記再生部は、前記第２算出部によって求められた前記音源の方向に音像定位された音響信号から再生した音を前記人物の聴覚に与える
ことを特徴とする音響再生装置。
(付記５)
付記１乃至付記４のいずれか１に記載の音響再生装置において、
前記制御部は、
前記第２タイミングよりも第１タイミングを遅れさせる方向に前記時間差を設定する
ことを特徴とする音響再生装置。
(付記６)
付記１に記載の音響再生装置において、
前記制御部は、前記第１タイミングと前記第２タイミングとの間の時間差を、前記第１算出部で算出された前記距離を音が伝搬する時間よりも大きく設定する
ことを特徴とする音響再生装置。
(付記７)
付記１乃至付記６のいずれか１に記載の音響再生装置において、
前記提示部は、前記音の強さの変化に応じて、前記人物の視覚に対して与える前記刺激の輝度を変化させる
ことを特徴とする音響再生装置。
(付記８)
音源から発せられる音を示す音響信号を受け、
前記音響信号で示される音の強さに基づいて、人物の視覚に与える刺激の強度を決定し、
前記音源の位置を示す情報と前記人物の位置を示す情報とに基づいて、前記音源と前記人物との距離を算出し、
前記音響信号から再生される音を前記人物の聴覚に与える第１タイミングと前記決定された強度を持つ刺激を前記人物の視覚に与える第２タイミングとの間の時間差を、前記算出された距離に応じて変更する制御を行う、
処理をコンピュータに実行させることを特徴とする音響再生プログラム。 Regarding the above description, the following items are further disclosed.
(Appendix 1)
A playback unit that receives an acoustic signal indicating a sound emitted from a sound source and gives a sound reproduced from the received acoustic signal to a person's hearing;
A presentation unit that gives the person's vision a stimulus having an intensity that changes in accordance with a change in sound intensity indicated by the acoustic signal;
A first calculation unit that calculates a distance between the sound source and the person based on information indicating the position of the sound source and information indicating the position of the person;
A time difference between a first timing at which sound is given to the hearing of the person by the reproducing unit and a second timing at which the stimulus is given to the visual of the person by the presentation unit is set to a distance calculated by the first calculating unit. A control unit that performs control to change according to the
A sound reproducing device comprising:
(Appendix 2)
In the sound reproduction device according to attachment 1,
The controller is
A determination unit that determines whether the person is paying attention to the sound source based on information indicating the movement of the person;
When the determination unit determines that the person is paying attention to the sound source, control is performed to make the time difference smaller than a value set according to a distance between the sound source and the person. A sound reproducing device.
(Appendix 3)
In the sound reproduction device according to attachment 2,
The controller is
The sound reproducing apparatus is characterized in that control is performed to reduce the time difference as the time during which the person focuses on the sound source is longer.
(Appendix 4)
In the sound reproduction device according to any one of appendix 1 to appendix 3,
Based on information indicating the position of the sound source and information indicating the position of the person, a second calculation unit for determining the direction of the sound source with respect to the position of the person,
The presenting unit gives the stimulus to the visual sense of the person from the direction of the sound source obtained by the second calculation unit,
The sound reproduction device, wherein the reproduction unit gives sound reproduced from the sound signal localized in the direction of the sound source obtained by the second calculation unit to the hearing of the person.
(Appendix 5)
In the sound reproduction device according to any one of appendix 1 to appendix 4,
The controller is
The sound reproduction apparatus according to claim 1, wherein the time difference is set in a direction in which the first timing is delayed from the second timing.
(Appendix 6)
In the sound reproduction device according to attachment 1,
The control unit sets the time difference between the first timing and the second timing to be greater than the time for sound to propagate through the distance calculated by the first calculation unit. apparatus.
(Appendix 7)
In the sound reproduction device according to any one of supplementary notes 1 to 6,
The sound reproducing device, wherein the presenting unit changes the luminance of the stimulus given to the person's vision according to a change in the intensity of the sound.
(Appendix 8)
Receive an acoustic signal indicating the sound emitted from the sound source,
Based on the intensity of the sound indicated by the acoustic signal, determine the intensity of the stimulus given to the human vision,
Based on the information indicating the position of the sound source and the information indicating the position of the person, the distance between the sound source and the person is calculated,
A time difference between a first timing at which a sound reproduced from the acoustic signal is given to the auditory sense of the person and a second timing at which a stimulus having the determined intensity is given to the visual sense of the person is defined as the calculated distance. Control to change accordingly,
A sound reproduction program that causes a computer to execute processing.

１０…音響再生装置；１１…提示部；１２…再生部；１３…第１算出部；１４…制御部；１５…第２算出部；１１１ａ，１１１ｂ…表示画面；１１２ｃ…円形；１１２ｔ…テキスト；１１２ｍ…マーク；１１２ｂ…背景；１１２ｆ…枠；１１３…発光素子；１１４…板状の部材；１１５…蔓状の部材；１４１…生成部；１４２…遅延部；１４３…変換部；１４４…判定部；１４５…調整部；２１…プロセッサ；２２…メモリ；２４…汎用インタフェース；２５…音声処理部；２６…ネットワークインタフェース；２７…表示制御部；２８…タッチパネル；２９…入力制御部；Ｐ１…人物；ＵＥ…携帯端末；Ｇｐ…ＧＰＳセンサ；Ｓ…音源；ＭＣ…マイクロホン；ＳＶ…サーバ装置；ＡＰ…アクセスポイント；Ｅｘ１，Ｅｘ１−１，Ｅｘ１−２，Ｅｘ１−３，Ｅｘ１−４，Ｅｘ２−１，Ｅｘ２−２，Ｅｘ２−３，Ｅｘ２−４…展示；ＤＢ…展示データベース；ＮＷ…ネットワーク；ＧＳ…案内システム
DESCRIPTION OF SYMBOLS 10 ... Sound reproduction apparatus; 11 ... Presentation part; 12 ... Reproduction part; 13 ... 1st calculation part; 14 ... Control part; 15 ... 2nd calculation part; 111a, 111b ... Display screen; 112m ... mark; 112b ... background; 112f ... frame; 113 ... light emitting element; 114 ... plate-like member; 115 ... vine-like member; 141 ... generating part; 142 ... delaying part; 143 ... converting part; 145: Adjustment unit; 21 ... Processor; 22 ... Memory; 24 ... General-purpose interface; 25 ... Audio processing unit; 26 ... Network interface; 27 ... Display control unit; 28 ... Touch panel; 29 ... Input control unit; UE ... portable terminal; Gp ... GPS sensor; S ... sound source; MC ... microphone; SV ... server apparatus; AP ... access point; Ex1, Ex1-1, Ex1-2 Ex1-3, Ex1-4, Ex2-1, Ex2-2, Ex2-3, Ex2-4 ... exhibitions; DB ... exhibition database; NW ... network; GS ... guidance system

Claims

A playback unit that receives an acoustic signal indicating a sound emitted from a sound source and gives a sound reproduced from the received acoustic signal to a person's hearing;
A presentation unit that gives the person's vision a stimulus having an intensity that changes in accordance with a change in sound intensity indicated by the acoustic signal;
A first calculation unit that calculates a distance between the sound source and the person based on information indicating the position of the sound source and information indicating the position of the person;
A time difference between a first timing at which sound is given to the hearing of the person by the reproducing unit and a second timing at which the stimulus is given to the visual of the person by the presentation unit is set to a distance calculated by the first calculating unit. A control unit that performs control to change according to the
A sound reproducing apparatus comprising:

The sound reproducing device according to claim 1,
The controller is
A determination unit that determines whether the person is paying attention to the sound source based on information indicating the movement of the person;
When the determination unit determines that the person is paying attention to the sound source, control is performed to make the time difference smaller than a value set according to a distance between the sound source and the person. A sound reproducing device.

The sound reproducing device according to claim 1 or 2,
Based on information indicating the position of the sound source and information indicating the position of the person, a second calculation unit for determining the direction of the sound source with respect to the position of the person,
The presenting unit gives the stimulus to the visual sense of the person from the direction of the sound source obtained by the second calculation unit,
The sound reproduction device, wherein the reproduction unit gives sound reproduced from the sound signal localized in the direction of the sound source obtained by the second calculation unit to the hearing of the person.

The sound reproducing device according to claim 1, wherein
The controller is
The sound reproduction apparatus according to claim 1, wherein the time difference is set in a direction in which the first timing is delayed from the second timing.

Receive an acoustic signal indicating the sound emitted from the sound source,
Based on the sound intensity indicated by the acoustic signal, determine the intensity of the stimulus to be presented to the human vision,
Based on the information indicating the position of the sound source and the information indicating the position of the person, the distance between the sound source and the person is calculated,
A time difference between a first timing at which a sound reproduced from the acoustic signal is given to the auditory sense of the person and a second timing at which a stimulus having the determined intensity is given to the visual sense of the person is defined as the calculated distance. Control to change accordingly,
A sound reproduction program that causes a computer to execute processing.