JP7188545B2

JP7188545B2 - Out-of-head localization system and out-of-head localization method

Info

Publication number: JP7188545B2
Application number: JP2021194105A
Authority: JP
Inventors: 寿子村田; 正也小西; 優美藤井; 敬洋下条; 邦明高地; 俊明永井
Original assignee: JVCKenwood Corp
Current assignee: JVCKenwood Corp
Priority date: 2018-09-28
Filing date: 2021-11-30
Publication date: 2022-12-13
Anticipated expiration: 2038-09-28
Also published as: JP2022024154A

Description

本発明は、頭外定位処理システム及び頭外定位処理方法に関する。 The present invention relates to an out-of-head localization processing system and out-of-head localization processing method.

特許文献１には、ディスプレイ装置で画像を見ながら、イヤホンやヘッドホンによって
音声を聴く場合において、音像定位を制御するシステムが開示されている。特許文献１の
システムでは、スピーカから両耳に至る伝達関数を時間領域に変換したインパルス応答を
畳み込むフィルタを用いている。さらに、イヤホン又はヘッドホンの左右の出力毎に加算
回路が設けられている。加算回路は、デジタルフィルタからの音声信号を加算している。
加算回路からの音声信号が、ＤＡＣ(Digital to Analog Converter)でアナログ信号に変
換される。変換後の音声信号は、増幅回路を介して、トランスデューサに供給される。さ
らに、特許文献１では、ディスプレイ又はリスナー頭部の移動距離と回転とに応じて、音
像の定位位置を制御している。 Patent Literature 1 discloses a system for controlling sound image localization when listening to audio through earphones or headphones while viewing an image on a display device. The system of Patent Document 1 uses a filter that convolves an impulse response obtained by transforming a transfer function from a speaker to both ears into a time domain. Further, an adder circuit is provided for each of the left and right outputs of the earphone or headphone. The adding circuit adds the audio signals from the digital filters.
An audio signal from the adder circuit is converted into an analog signal by a DAC (Digital to Analog Converter). The converted audio signal is supplied to the transducer through an amplifier circuit. Furthermore, in Patent Document 1, the localization position of the sound image is controlled according to the movement distance and rotation of the display or the listener's head.

ところで、音像定位技術として、ヘッドホンを用いて受聴者の頭部の外側に音像を定位
させる頭外定位技術がある。頭外定位技術では、ヘッドホンから耳までの特性をキャンセ
ルし、ステレオスピーカから耳までの４本の特性（空間音響伝達特性）を与えることによ
り、音像を頭外に定位させている。 By the way, as a sound image localization technique, there is an out-of-head localization technique in which a sound image is localized outside the listener's head using headphones. In the out-of-head localization technology, the sound image is localized out of the head by canceling the characteristics from the headphones to the ears and giving the four characteristics (spatial sound transfer characteristics) from the stereo speakers to the ears.

頭外定位再生においては、例えば、２チャンネル（以下、ｃｈと記載）のスピーカから
発した測定信号（インパルス音等）を聴取者（リスナー）本人の耳に設置したマイクロフ
ォン（以下、マイクとする）で録音する。これにより、スピーカからマイクまでの特性（
空間音響伝達特性、空間音響伝達関数、頭部伝達関数ＨＲＴＦ等とも称する）そして、測
定信号を集音して得られた空間音響伝達特性に基づいて、処理装置がフィルタを生成する
。頭外定位処理装置は、フィルタを２ｃｈのオーディオ信号に畳み込む。 In out-of-head localization reproduction, for example, a measurement signal (impulse sound, etc.) emitted from a two-channel (hereinafter referred to as ch) speaker is placed in the ear of the listener (hereinafter referred to as a microphone). to record. This allows the characteristics from the speaker to the microphone (
Also referred to as spatial acoustic transfer characteristics, spatial acoustic transfer function, head-related transfer function HRTF, etc.), and based on the spatial acoustic transfer characteristics obtained by collecting the measured signals, a processing unit generates a filter. The out-of-head localization processing device convolves the filter with the 2ch audio signal.

さらに、ヘッドホンから耳までの特性をキャンセルするフィルタを生成するために、ヘ
ッドホンから耳元乃至鼓膜までの特性（外耳道伝達関数ＥＣＴＦ、外耳道伝達特性とも称
する）を聴取者本人の耳に設置したマイクで測定する。 Furthermore, in order to generate a filter that cancels the characteristics from the headphones to the ear, the characteristics from the headphones to the ear to the eardrum (also called the external auditory canal transfer function ECTF, external auditory canal transfer characteristics) are measured with a microphone placed in the listener's own ear. do.

特開２０１０－１４７５２９号公報JP 2010-147529 A

頭外定位処理を行う場合、聴取者本人の耳に設置したマイクで特性を測定することが好
ましい。しかしながら、聴取者本人の耳にマイクを適切に設置することは困難である。さ
らに、スピーカからマイクまでの空間音響伝達特性を測定する測定場所も制限される。一
方、聴取者以外の特性を用いた場合、適切に頭外定位処理を行うことができないおそれが
ある。特に、人毎、あるいは耳毎に外耳道の形状が異なるため、他人の外耳道伝達特性に
応じたフィルタを用いた場合、処理を適切に行なうことができない場合がある。 When performing out-of-head localization processing, it is preferable to measure the characteristics with a microphone placed in the listener's own ear. However, it is difficult to properly place the microphone in the listener's own ear. Furthermore, the measurement location for measuring the spatial sound transfer characteristics from the speaker to the microphone is also limited. On the other hand, when characteristics other than those of the listener are used, there is a possibility that out-of-head localization processing cannot be appropriately performed. In particular, since the shape of the ear canal differs from person to person or from ear to ear, it may not be possible to perform appropriate processing when using a filter that corresponds to another person's ear canal transfer characteristics.

本発明は上記の点に鑑みなされたものであり、適切に処理することができる頭外定位処
理装置、処理方法、及びプログラムを提供することを目的とする。 SUMMARY OF THE INVENTION It is an object of the present invention to provide an out-of-head localization processing apparatus, a processing method, and a program capable of performing appropriate processing.

本実施形態にかかる頭外定位処理システムは、ユーザが座席に着席する前に、前記ユーザの耳に装着されたマイクを用いて、伝達特性を測定する測定装置と、前記伝達特性に応じたフィルタを用いて頭外定位処理を行う頭外定位処理装置と、前記ユーザの識別情報に基づいて、前記伝達特性に応じたフィルタを、前記頭外定位処理装置に送信するサーバと、を備えたものである。 The out-of-head localization processing system according to the present embodiment includes a measuring device for measuring transfer characteristics using a microphone attached to the ear of the user, and a filter corresponding to the transfer characteristics before the user sits on the seat. and a server for transmitting a filter corresponding to the transfer characteristic to the out-of-head localization processing apparatus based on the identification information of the user. is.

本実施形態にかかる頭外定位処理方法は、ユーザが乗り物に搭乗する前に、前記ユーザ
の耳に装着されたマイクを用いて、伝達特性を測定するステップと、前記ユーザの識別情
報に基づいて、前記乗り物に設置された頭外定位処理装置に前記伝達特性に応じたフィル
タを送信するステップと、前記頭外定位処理装置が、再生信号に対して前記フィルタを用
いた頭外定位処理を行うステップと、前記識別情報に対応する前記ユーザに対して、前記
頭外定位処理された再生信号をヘッドホン又はイヤホンから出力するステップと、を備え
たものである。 The out-of-head localization processing method according to this embodiment comprises the steps of measuring transmission characteristics using a microphone worn on the user's ear before the user boards a vehicle, and a step of transmitting a filter corresponding to the transfer characteristics to an out-of-head localization processing device installed in the vehicle; and outputting the out-of-head localization-processed reproduction signal from headphones or earphones to the user corresponding to the identification information.

本実施形態にかかるフィルタ生成装置は、ユーザが装着したヘッドホン又はイヤホンか
らマイクまでの外耳道伝達特性を取得する外耳道伝達特性取得部と、スピーカからマイク
までの空間音響伝達特性に対応する第１の特性データと、前記外耳道伝達特性に対応する
第２の特性データとを１セットとして、複数セット分を格納するデータベースを参照する
ことで、前記ユーザの外耳道伝達特性の第１の周波数帯域における周波数特性に基づいて
、第１のセットを選択する第１の選択部と、前記第１の選択部で選択された前記第１のセ
ットに含まれる第２の特性データを取得する第１の取得部と、前記データベースを参照す
ることで、前記ユーザの外耳道伝達特性の第２の周波数帯域における周波数特性に基づい
て、第２のセットを選択する第２の選択部と、前記第２の選択部で選択された前記第２の
セットに含まれる第２の特性データを取得する第２の取得部と、予め設定されたプリセッ
トデータを取得する第３の取得部と、前記第１のセットの前記第２の特性データと、前記
第２のセットの第２の特性データと、前記プリセットデータとに基づいて、前記ユーザの
空間音響伝達特性に応じたフィルタを生成するフィルタ生成部と、を備えたものである。 The filter generation device according to the present embodiment includes an ear canal transfer characteristic acquisition unit that acquires the ear canal transfer characteristic from the headphone or earphone worn by the user to the microphone, and a first characteristic corresponding to the spatial sound transfer characteristic from the speaker to the microphone. Data and the second characteristic data corresponding to the ear canal transfer characteristics are set as one set, and by referring to a database storing a plurality of sets, the frequency characteristics in the first frequency band of the ear canal transfer characteristics of the user are obtained. a first selection unit that selects a first set based on, a first acquisition unit that acquires second characteristic data included in the first set selected by the first selection unit; A second selection unit that selects a second set based on frequency characteristics in a second frequency band of the ear canal transfer characteristics of the user by referring to the database, and selected by the second selection unit a second acquisition unit that acquires second characteristic data included in the second set; a third acquisition unit that acquires preset data set in advance; and the second acquisition unit of the first set. a filter generation unit that generates a filter corresponding to the spatial sound transfer characteristic of the user based on the characteristic data, the second characteristic data of the second set, and the preset data. .

本実施形態にかかるフィルタ生成方法は、ユーザが装着したヘッドホン又はイヤホンか
らマイクまでの外耳道伝達特性を取得するステップと、スピーカからマイクまでの空間音
響伝達特性に対応する第１の特性データと、前記外耳道伝達特性に対応する第２の特性デ
ータとを１セットとして、複数セット分を格納するデータベースを参照することで、前記
ユーザの外耳道伝達特性の第１の周波数帯域における周波数特性に基づいて、第１のセッ
トを選択するステップと、前記第１のセットに含まれる第２の特性データを取得するステ
ップと、前記データベースを参照することで、前記ユーザの外耳道伝達特性の第２の周波
数帯域における周波数特性に基づいて、第２のセットを選択するステップと、前記第２の
セットに含まれる第２の特性データを取得するステップと、予め設定されたプリセットデ
ータを取得するステップと、前記第１のセットの前記第２の特性データと、前記第２のセ
ットの第２の特性データと、前記プリセットデータとに基づいて、前記ユーザの空間音響
伝達特性に応じたフィルタを生成するステップと、を備えたフィルタものである。 The filter generation method according to the present embodiment includes the step of acquiring the ear canal transfer characteristic from the headphone or earphone worn by the user to the microphone, the first characteristic data corresponding to the spatial sound transfer characteristic from the speaker to the microphone, and By referring to a database that stores a plurality of sets of second characteristic data corresponding to the ear canal transfer characteristic as one set, based on the frequency characteristic in the first frequency band of the ear canal transfer characteristic of the user, the first selecting one set; obtaining second characteristic data included in the first set; and referring to the database to determine the frequency in the second frequency band of the user's ear canal transfer characteristic. selecting a second set based on the characteristics; obtaining second characteristic data included in the second set; obtaining preset data; generating a filter according to the spatial sound transfer characteristics of the user based on the set of second characteristic data, the second set of second characteristic data, and the preset data. It is a filter thing.

本発明によれば、適切な処理を行うことができる頭外定位処理システム、フィルタ生成
装置、方法、及びプログラムを提供することを目的とする。 An object of the present invention is to provide an out-of-head localization processing system, a filter generation device, a method, and a program capable of performing appropriate processing.

本実施の形態に係る頭外定位処理装置を示すブロック図である。1 is a block diagram showing an out-of-head localization processing apparatus according to this embodiment; FIG. 外耳道伝達特性を測定するための測定装置を示す模式図である。It is a schematic diagram which shows the measuring apparatus for measuring an external auditory canal transfer characteristic. システムの全体構成を示す図である。It is a figure which shows the whole structure of a system. サーバ端末の構成を示す図である。It is a figure which shows the structure of a server terminal. 航空機内に設置された搭乗席と座席端末を示す図である。1 is a diagram showing a boarding seat and a seat terminal installed in an aircraft; FIG. 測定装置における処理を示すフローチャートである。It is a flowchart which shows the process in a measuring device. サーバ端末における処理を示すフローチャートである。4 is a flowchart showing processing in a server terminal; 実施の形態２にかかるフィルタ生成装置の構成を示すブロック図である。2 is a block diagram showing the configuration of a filter generation device according to a second embodiment; FIG. データベースに格納されている第１及び第２の特性データを示す表である。4 is a table showing first and second characteristic data stored in a database; 収音信号における直接音及び反射音を説明するための図である。FIG. 2 is a diagram for explaining direct sound and reflected sound in a collected sound signal; FIG. 実施の形態３にかかる処理装置の構成を示すブロック図である。FIG. 11 is a block diagram showing the configuration of a processing device according to a third embodiment; FIG. ５．１ｃｈの再生信号での畳み込み処理と音量調整を説明するための図である。FIG. 10 is a diagram for explaining convolution processing and volume adjustment in a 5.1 ch reproduction signal; 実施の形態４の例１において、ＬＰＦをかける処理前後のフィルタを示す図である。FIG. 10 is a diagram showing filters before and after applying an LPF in example 1 of the fourth embodiment; FIG. 実施の形態４の例２において、残響成分を付加したフィルタを示す図である。FIG. 20 is a diagram showing a filter to which a reverberation component is added in Example 2 of Embodiment 4;

本実施の形態にかかる頭外定位処理の概要について説明する。本実施形態にかかる頭外
定位処理は、空間音響伝達特性と外耳道伝達特性を用いて頭外定位処理を行うものである
。空間音響伝達特性は、スピーカなどの音源から外耳道までの伝達特性である。外耳道伝
達特性は、ヘッドホン又はイヤホンのスピーカユニットから鼓膜までの伝達特性である。
本実施形態では、ヘッドホン又はイヤホンを装着していない状態で測定された空間音響伝
達特性に応じたフィルタを用いて、頭外定位処理を行うことができる。さらに、ヘッドホ
ン又はイヤホンを装着した状態で測定された外耳道伝達特性に応じたフィルタを用いて、
頭外定位処理を行うことができる。 An outline of out-of-head localization processing according to the present embodiment will be described. The out-of-head localization processing according to this embodiment uses the spatial sound transfer characteristics and the ear canal transfer characteristics to perform the out-of-head localization processing. Spatial sound transfer characteristics are transfer characteristics from a sound source such as a speaker to the ear canal. The ear canal transfer characteristic is the transfer characteristic from the speaker unit of the headphone or earphone to the eardrum.
In this embodiment, out-of-head localization processing can be performed using a filter corresponding to the spatial sound transfer characteristics measured without wearing headphones or earphones. Furthermore, using a filter according to the ear canal transfer characteristics measured while wearing headphones or earphones,
Out-of-head localization processing can be performed.

本実施の形態にかかる頭外定位処理は、パーソナルコンピュータ、スマートホン、タブ
レットＰＣなどのユーザ端末で実行される。さらに、頭外定位処理を行うユーザ端末は、
航空機、電車（鉄道車両）、船舶、バス等の乗り物に搭載された再生装置であってもよい
。この場合、乗り物の搭乗席に搭乗した搭乗者が頭外定位受聴を行う。ユーザ端末は、プ
ロセッサ等の処理手段、メモリやハードディスクなどの記憶手段、液晶モニタ等の表示手
段、タッチパネル、ボタン、キーボード、マウスなどの入力手段を有する情報処理装置で
ある。ユーザ端末は、データを送受信する通信機能を有していてもよい。さらに、ユーザ
端末には、ヘッドホン又はイヤホンを有する出力手段（出力ユニット）が接続される。以
下の説明ではヘッドホンを用いる例について説明するが、ヘッドホンの代わりにイヤホン
を用いてもよい。
実施の形態１．
（頭外定位処理装置）
本実施の形態にかかる音場再生装置の一例である頭外定位処理装置１００を図１に示す
。図１は、頭外定位処理装置１００のブロック図である。頭外定位処理装置１００は、ヘ
ッドホン４３を装着するユーザＵに対して音場を再生する。そのため、頭外定位処理装置
１００は、ＬｃｈとＲｃｈのステレオ入力信号ＸＬ、ＸＲについて、頭外定位処理を行う
。ＬｃｈとＲｃｈのステレオ入力信号ＸＬ、ＸＲは、ＣＤ（Compact Disc）プレイヤーな
どから出力されるアナログのオーディオ再生信号、又は、mp3(MPEG Audio Layer-3)等の
デジタルオーディオデータである。なお、オーディオ再生信号、又はデジタルオーディオ
データをまとめて再生信号と称する。すなわち、ＬｃｈとＲｃｈのステレオ入力信号ＸＬ
、ＸＲが再生信号となっている。 The out-of-head localization processing according to this embodiment is executed by a user terminal such as a personal computer, a smart phone, or a tablet PC. Furthermore, the user terminal that performs out-of-head localization processing is
It may be a playback device mounted on a vehicle such as an aircraft, a train (railway vehicle), a ship, or a bus. In this case, the passenger in the passenger seat of the vehicle performs out-of-head localized listening. A user terminal is an information processing device having processing means such as a processor, storage means such as a memory and a hard disk, display means such as a liquid crystal monitor, and input means such as a touch panel, buttons, keyboard, and mouse. A user terminal may have a communication function for transmitting and receiving data. Furthermore, output means (output unit) having headphones or earphones are connected to the user terminal. Although an example using headphones will be described below, earphones may be used instead of headphones.
Embodiment 1.
(Out-of-head stereotactic processing device)
FIG. 1 shows an out-of-head localization processing device 100, which is an example of a sound field reproducing device according to the present embodiment. FIG. 1 is a block diagram of an out-of-head localization processing apparatus 100. As shown in FIG. The out-of-head localization processing device 100 reproduces a sound field for the user U wearing the headphones 43 . Therefore, the out-of-head localization processing apparatus 100 performs out-of-head localization processing on the Lch and Rch stereo input signals XL and XR. The Lch and Rch stereo input signals XL and XR are analog audio reproduction signals output from a CD (Compact Disc) player or the like, or digital audio data such as mp3 (MPEG Audio Layer-3). Note that the audio reproduction signal or digital audio data will be collectively referred to as a reproduction signal. That is, Lch and Rch stereo input signal XL
, and XR are reproduction signals.

なお、頭外定位処理装置１００は、物理的に単一な装置に限られるものではなく、一部
の処理が異なる装置で行われてもよい。例えば、一部の処理がスマートホンなどにより行
われ、残りの処理がヘッドホン４３に内蔵されたＤＳＰ(Digital Signal Processor)など
により行われてもよい。 It should be noted that the out-of-head localization processing apparatus 100 is not limited to a physically single apparatus, and a part of the processing may be performed by a different apparatus. For example, part of the processing may be performed by a smart phone or the like, and the rest of the processing may be performed by a DSP (Digital Signal Processor) built in the headphones 43 or the like.

頭外定位処理装置１００は、頭外定位処理部１０、フィルタ部４１、フィルタ部４２、
及びヘッドホン４３を備えている。頭外定位処理部１０、フィルタ部４１、及びフィルタ
部４２は、具体的にはプロセッサ等により実現可能である。 The out-of-head localization processing device 100 includes an out-of-head localization processing unit 10, a filter unit 41, a filter unit 42,
and headphones 43 are provided. The out-of-head localization processing unit 10, filter unit 41, and filter unit 42 can be specifically realized by a processor or the like.

頭外定位処理部１０は、畳み込み演算部１１～１２、２１～２２、及び加算器２４、２
５を備えている。畳み込み演算部１１～１２、２１～２２は、空間音響伝達特性を用いた
畳み込み処理を行う。頭外定位処理部１０には、ＣＤプレイヤーなどからのステレオ入力
信号ＸＬ、ＸＲが入力される。頭外定位処理部１０には、空間音響伝達特性が設定されて
いる。頭外定位処理部１０は、各ｃｈのステレオ入力信号ＸＬ、ＸＲに対し、空間音響伝
達特性のフィルタ（以下、空間音響フィルタとも称する）を畳み込む。空間音響伝達特性
は被測定者の頭部や耳介で測定した頭部伝達関数ＨＲＴＦでもよいし、ダミーヘッドまた
は第三者の頭部伝達関数であってもよい。 The out-of-head localization processing unit 10 includes convolution calculation units 11 to 12, 21 to 22, and adders 24, 2
5. The convolution calculation units 11 to 12 and 21 to 22 perform convolution processing using spatial acoustic transfer characteristics. Stereo input signals XL and XR from a CD player or the like are input to the out-of-head localization processing unit 10 . Spatial sound transfer characteristics are set in the out-of-head localization processing unit 10 . The out-of-head localization processing unit 10 convolves a spatial acoustic transfer characteristic filter (hereinafter, also referred to as a spatial acoustic filter) with stereo input signals XL and XR of each channel. The spatial sound transfer characteristic may be a head-related transfer function HRTF measured from the head or pinna of the person to be measured, or may be a head-related transfer function of a dummy head or a third person.

４つの空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓを１セットとしたものを空間
音響伝達関数とする。畳み込み演算部１１、１２、２１、２２で畳み込みに用いられるデ
ータが空間音響フィルタとなる。空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓを所
定のフィルタ長で切り出すことで、空間音響フィルタが生成される。 A set of four spatial sound transfer characteristics Hls, Hlo, Hro, and Hrs is defined as a spatial sound transfer function. The data used for convolution in the convolution calculation units 11, 12, 21, and 22 serve as spatial acoustic filters. A spatial acoustic filter is generated by cutting out the spatial acoustic transfer characteristics Hls, Hlo, Hro, and Hrs with a predetermined filter length.

空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓのそれぞれは、インパルス応答測定
などにより、事前に取得されている。例えば、ユーザＵが左右の耳にマイクをそれぞれ装
着する。ユーザＵの前方に配置された左右のスピーカが、インパルス応答測定を行うため
の、インパルス音をそれぞれ出力する。そして、スピーカから出力されたインパルス音等
の測定信号をマイクで収音する。マイクでの収音信号に基づいて、空間音響伝達特性Ｈｌ
ｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓが取得される。左スピーカと左マイクとの間の空間音響伝達
特性Ｈｌｓ、左スピーカと右マイクとの間の空間音響伝達特性Ｈｌｏ、右スピーカと左マ
イクとの間の空間音響伝達特性Ｈｒｏ、右スピーカと右マイクとの間の空間音響伝達特性
Ｈｒｓが測定される。 Spatial sound transfer characteristics Hls, Hlo, Hro, and Hrs are obtained in advance by impulse response measurement or the like. For example, the user U wears microphones on the left and right ears, respectively. Left and right speakers placed in front of the user U output impulse sounds for impulse response measurement. Then, a measurement signal such as an impulse sound output from the speaker is picked up by a microphone. Spatial sound transfer characteristic Hl
s, Hlo, Hro, Hrs are obtained. Spatial sound transfer characteristics Hls between the left speaker and the left microphone, Spatial sound transfer characteristics Hlo between the left speaker and the right microphone, Spatial sound transfer characteristics Hro between the right speaker and the left microphone, Right speaker and the right microphone The spatial sound transfer characteristic Hrs between is measured.

そして、畳み込み演算部１１は、Ｌｃｈのステレオ入力信号ＸＬに対して空間音響伝達
特性Ｈｌｓに応じた空間音響フィルタを畳み込む。畳み込み演算部１１は、畳み込み演算
データを加算器２４に出力する。畳み込み演算部２１は、Ｒｃｈのステレオ入力信号ＸＲ
に対して空間音響伝達特性Ｈｒｏに応じた空間音響フィルタを畳み込む。畳み込み演算部
２１は、畳み込み演算データを加算器２４に出力する。加算器２４は２つの畳み込み演算
データを加算して、フィルタ部４１に出力する。 Then, the convolution calculation unit 11 convolves the Lch stereo input signal XL with a spatial acoustic filter corresponding to the spatial acoustic transfer characteristic Hls. The convolution calculation unit 11 outputs the convolution calculation data to the adder 24 . The convolution calculation unit 21 converts the Rch stereo input signal XR
A spatial acoustic filter corresponding to the spatial acoustic transfer characteristic Hro is convoluted with respect to . The convolution calculation unit 21 outputs the convolution calculation data to the adder 24 . The adder 24 adds the two pieces of convolution operation data and outputs the result to the filter section 41 .

畳み込み演算部１２は、Ｌｃｈのステレオ入力信号ＸＬに対して空間音響伝達特性Ｈｌ
ｏに応じた空間音響フィルタを畳み込む。畳み込み演算部１２は、畳み込み演算データを
、加算器２５に出力する。畳み込み演算部２２は、Ｒｃｈのステレオ入力信号ＸＲに対し
て空間音響伝達特性Ｈｒｓに応じた空間音響フィルタを畳み込む。畳み込み演算部２２は
、畳み込み演算データを、加算器２５に出力する。加算器２５は２つの畳み込み演算デー
タを加算して、フィルタ部４２に出力する。 The convolution calculation unit 12 calculates the spatial sound transfer characteristic Hl for the Lch stereo input signal XL.
Convolve the spatial acoustic filter according to o. The convolution calculation unit 12 outputs the convolution calculation data to the adder 25 . The convolution calculation unit 22 convolves a spatial acoustic filter corresponding to the spatial acoustic transfer characteristic Hrs with respect to the Rch stereo input signal XR. The convolution calculation unit 22 outputs the convolution calculation data to the adder 25 . The adder 25 adds the two pieces of convolution operation data and outputs the result to the filter section 42 .

フィルタ部４１、４２にはヘッドホン特性（ヘッドホンの再生ユニットとマイク間の特
性）をキャンセルする逆フィルタＬｉｎｖ、Ｒｉｎｖが設定されている。そして、頭外定
位処理部１０での処理が施された再生信号（畳み込み演算信号）に逆フィルタＬｉｎｖ、
Ｒｉｎｖを畳み込む。フィルタ部４１で加算器２４からのＬｃｈ信号に対して、Ｌｃｈ側
のヘッドホン特性の逆フィルタＬｉｎｖを畳み込む。同様に、フィルタ部４２は加算器２
５からのＲｃｈ信号に対して、Ｒｃｈ側のヘッドホン特性の逆フィルタＲｉｎｖを畳み込
む。逆フィルタＬｉｎｖ、Ｒｉｎｖは、ヘッドホン４３を装着した場合に、ヘッドホンユ
ニットからマイクまでの特性をキャンセルする。マイクは、外耳道入口から鼓膜までの間
ならばどこに配置してもよい。 Inverse filters Linv and Rinv for canceling headphone characteristics (characteristics between the reproduction unit of the headphones and the microphone) are set in the filter units 41 and 42 . Then, an inverse filter Linv,
Convolve Rinv. In the filter unit 41, the Lch signal from the adder 24 is convoluted with an inverse filter Linv of headphone characteristics on the Lch side. Similarly, filter section 42 includes adder 2
The Rch signal from 5 is convolved with an inverse filter Rinv of headphone characteristics on the Rch side. The inverse filters Linv and Rinv cancel the characteristics from the headphone unit to the microphone when the headphones 43 are worn. The microphone can be placed anywhere between the ear canal entrance and the eardrum.

フィルタ部４１は、処理されたＬｃｈ信号ＹＬをヘッドホン４３の左ユニット４３Ｌに
出力する。フィルタ部４２は、処理されたＲｃｈ信号ＹＲをヘッドホン４３の右ユニット
４３Ｒに出力する。ユーザＵは、ヘッドホン４３を装着している。ヘッドホン４３は、Ｌ
ｃｈ信号ＹＬとＲｃｈ信号ＹＲ（以下、Ｌｃｈ信号ＹＬとＲｃｈ信号をまとめてステレオ
信号ともいう）をユーザＵに向けて出力する。これにより、ユーザＵの頭外に定位された
音像を再生することができる。 Filter section 41 outputs processed Lch signal YL to left unit 43L of headphone 43 . The filter section 42 outputs the processed Rch signal YR to the right unit 43R of the headphone 43 . A user U wears headphones 43 . Headphones 43 are L
A ch signal YL and an Rch signal YR (hereinafter, the Lch signal YL and the Rch signal are collectively referred to as a stereo signal) are output to the user U. Thereby, a sound image localized outside the head of the user U can be reproduced.

このように、頭外定位処理装置１００は、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、
Ｈｒｓに応じた空間音響フィルタと、ヘッドホン特性の逆フィルタＬｉｎｖ，Ｒｉｎｖを
用いて、頭外定位処理を行っている。以下の説明において、空間音響伝達特性Ｈｌｓ、Ｈ
ｌｏ、Ｈｒｏ、Ｈｒｓに応じた空間音響フィルタと、ヘッドホン特性の逆フィルタＬｉｎ
ｖ，Ｒｉｎｖとをまとめて頭外定位処理フィルタとする。２ｃｈのステレオ再生信号の場
合、頭外定位フィルタは、４つの空間音響フィルタと、２つの逆フィルタとから構成され
ている。そして、頭外定位処理装置１００は、ステレオ再生信号に対して合計６個の頭外
定位フィルタを用いて畳み込み演算処理を行うことで、頭外定位処理を実行する。頭外定
位フィルタは、ユーザＵ個人の測定に基づくものであることが好ましい。例えば，ユーザ
Ｕの耳に装着されたマイクが収音した収音信号に基づいて、頭外定位フィルタが設定され
ている。なお、メモリ容量を削減するため、フィルタ同士を演算し、４つのフィルタにま
とめてもよい。 In this way, the out-of-head localization processing apparatus 100 uses the spatial sound transfer characteristics Hls, Hlo, Hro,
Out-of-head localization processing is performed using a spatial acoustic filter corresponding to Hrs and inverse filters Linv and Rinv of headphone characteristics. In the following description, spatial sound transfer characteristics Hls, H
Spatial acoustic filter corresponding to lo, Hro, Hrs and inverse filter Lin for headphone characteristics
v and Rinv are collectively used as an out-of-head localization processing filter. In the case of a 2ch stereo reproduction signal, the out-of-head localization filter is composed of four spatial acoustic filters and two inverse filters. Then, the out-of-head localization processing apparatus 100 performs out-of-head localization processing by performing convolution processing on the stereo reproduction signal using a total of six out-of-head localization filters. The out-of-head localization filter is preferably based on user U's individual measurements. For example, an out-of-head localization filter is set based on a sound signal picked up by a microphone attached to the user's U ear. In order to reduce the memory capacity, the filters may be calculated and combined into four filters.

このように空間音響フィルタと、ヘッドホン特性の逆フィルタはオーディオ信号用のフ
ィルタである。これらのフィルタが再生信号（ステレオ入力信号ＸＬ、ＸＲ）に畳み込ま
れることで、頭外定位処理装置１００が、頭外定位処理を実行する。つまり、頭外定位処
理装置１００は、頭外に音像が定位された音場を再生する再生装置となる。
（測定装置）
次に、図２を用いて、伝達特性を測定する測定装置について説明する。図２は、測定装
置２００の測定構成を模式的に示す図である。なお、測定装置２００は、図１に示す頭外
定位処理装置１００と共通の装置であってもよい。あるいは、測定装置２００の一部又は
全部が頭外定位処理装置１００と異なる装置となっていてもよい。 Thus, the spatial acoustic filter and the inverse headphone characteristic filter are filters for audio signals. By convolving these filters with the reproduced signals (stereo input signals XL and XR), the out-of-head localization processing apparatus 100 executes out-of-head localization processing. In other words, the out-of-head localization processing device 100 serves as a reproduction device that reproduces a sound field in which a sound image is localized out-of-head.
(measuring device)
Next, a measuring device for measuring transfer characteristics will be described with reference to FIG. FIG. 2 is a diagram schematically showing the measurement configuration of the measurement device 200. As shown in FIG. Note that the measuring device 200 may be a common device with the out-of-head localization processing device 100 shown in FIG. Alternatively, part or all of the measurement device 200 may be a device different from the out-of-head localization processing device 100 .

図２に示すように、測定装置２００は、ステレオスピーカ５と、ステレオマイク２と、
ヘッドホン４３と、処理装置２０１とを有している。ステレオスピーカ５が測定環境に設
置されている。測定環境は、被測定者１の自宅の部屋やオーディオシステムの販売店舗や
ショールーム等でもよい。また、測定環境は空港、駅、港湾、バスターミナルなど、乗り
物に搭乗する際に利用する各種施設に設置されていてもよい。 As shown in FIG. 2, the measuring device 200 includes a stereo speaker 5, a stereo microphone 2,
It has a headphone 43 and a processing device 201 . A stereo speaker 5 is installed in the measurement environment. The measurement environment may be a room in the person to be measured 1's home, an audio system store, a showroom, or the like. Also, the measurement environment may be installed in various facilities used when boarding a vehicle, such as airports, stations, harbors, and bus terminals.

本実施の形態では、測定装置２００の処理装置２０１が、測定結果に応じて、頭外定位
フィルタを適切に生成するための演算処理を行っている。処理装置２０１は、測定信号生
成部２１１と、収音信号取得部２１２と、フィルタ生成部２１３と、を備えている。処理
装置２０１は、パーソナルコンピュータ（ＰＣ）、タブレット端末、スマートホン等であ
り、メモリ、及びＣＰＵを備えている。メモリは、処理プログラムや各種パラメータや測
定データなどを記憶している。ＣＰＵは、メモリに格納された処理プログラムを実行する
。ＣＰＵが処理プログラムを実行することで、測定信号生成部２１１、収音信号取得部２
１２、フィルタ生成部２１３の各処理が実行される。 In this embodiment, the processing device 201 of the measuring device 200 performs arithmetic processing for appropriately generating an out-of-head localization filter according to the measurement result. The processing device 201 includes a measurement signal generation section 211 , a collected sound signal acquisition section 212 and a filter generation section 213 . The processing device 201 is a personal computer (PC), tablet terminal, smart phone, or the like, and includes a memory and a CPU. The memory stores processing programs, various parameters, measurement data, and the like. The CPU executes a processing program stored in memory. By executing the processing program by the CPU, the measurement signal generation unit 211 and the collected sound signal acquisition unit 2
12, each process of the filter generator 213 is executed.

測定信号生成部２１１は、外耳道伝達特性又は空間音響伝達特性を測定するための測定
信号を生成する。測定信号は、例えば、インパルス信号やＴＳＰ（ＴｉｍｅＳｔｒｅｃ
ｈｅｄＰｕｌｓｅ）信号等である。ここでは、測定信号としてインパルス音を用いて、
測定装置２００がインパルス応答測定を実施している。 The measurement signal generator 211 generates a measurement signal for measuring ear canal transfer characteristics or spatial sound transfer characteristics. The measurement signal is, for example, an impulse signal or a TSP (Time Stroke
hed pulse) signal and the like. Here, using an impulse sound as the measurement signal,
A measurement device 200 is performing an impulse response measurement.

空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓを測定する場合、測定装置２００は
、ステレオスピーカ５を用いた測定を行う。つまり、被測定者１がヘッドホン４３を装着
せずに、ステレオマイク２のみを装着する。そして、ステレオスピーカ５から測定信号を
出力し、ステレオマイク２が測定信号を収音する。空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈ
ｒｏ、Ｈｒｓを測定する場合、ヘッドホン４３は使用されない。 When measuring the spatial sound transfer characteristics Hls, Hlo, Hro, and Hrs, the measurement device 200 performs measurement using the stereo speakers 5 . That is, the person 1 to be measured does not wear the headphones 43 but only the stereo microphone 2 . Then, a measurement signal is output from the stereo speaker 5 and the measurement signal is picked up by the stereo microphone 2 . Spatial sound transfer characteristics Hls, Hlo, H
Headphones 43 are not used when measuring ro, Hrs.

外耳道伝達特性を測定する場合、測定装置２００は、ヘッドホン４３を用いた測定を行
う。つまり、被測定者１がステレオマイク２、及びヘッドホン４３を装着する。そして、
ヘッドホン４３から測定信号を出力し、ステレオマイク２が測定信号を収音する。外耳道
伝達特性を測定する場合、ステレオスピーカ５は使用されない。 When measuring the ear canal transfer characteristics, the measurement device 200 performs measurement using the headphones 43 . That is, the subject 1 wears the stereo microphone 2 and the headphones 43 . and,
A measurement signal is output from the headphone 43, and the stereo microphone 2 picks up the measurement signal. Stereo speakers 5 are not used when measuring ear canal transfer characteristics.

まず、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓ（以下、単に伝達特性Ｈｌｓ
、Ｈｌｏ、Ｈｒｏ、Ｈｒｓともいう）の測定と、空間音響フィルタの生成について説明す
る。ステレオスピーカ５は、左スピーカ５Ｌと右スピーカ５Ｒを備えている。例えば、被
測定者１の前方に左スピーカ５Ｌと右スピーカ５Ｒが設置されている。左スピーカ５Ｌと
右スピーカ５Ｒは、インパルス応答測定を行うための測定信号を出力する。以下、本実施
の形態では、音源となるスピーカの数を２（ステレオスピーカ）として説明するが、測定
に用いる音源の数は２に限らず、１以上であればよい。すなわち、1chのモノラル、また
は、5.1ch、7.1ch等の、いわゆるマルチチャンネル環境においても同様に、本実施の形態
を適用することができる。 First, spatial sound transfer characteristics Hls, Hlo, Hro, Hrs (hereinafter simply transfer characteristics Hls
, Hlo, Hro, and Hrs) and generating a spatial acoustic filter. The stereo speaker 5 has a left speaker 5L and a right speaker 5R. For example, a left speaker 5L and a right speaker 5R are installed in front of the person 1 to be measured. The left speaker 5L and the right speaker 5R output measurement signals for impulse response measurement. In the following description of the present embodiment, the number of speakers serving as sound sources is two (stereo speakers), but the number of sound sources used for measurement is not limited to two, and may be one or more. That is, the present embodiment can be similarly applied in a so-called multi-channel environment such as 1ch monaural, 5.1ch, 7.1ch, and the like.

ステレオマイク２は、左のマイク２Ｌと右のマイク２Ｒを有している。左のマイク２Ｌ
は、被測定者１の左耳９Ｌに設置され、右のマイク２Ｒは、被測定者１の右耳９Ｒに設置
されている。具体的には、左耳９Ｌ、右耳９Ｒの外耳道入口から鼓膜までの位置にマイク
２Ｌ、２Ｒを設置することが好ましい。マイク２Ｌ、２Ｒは、ステレオスピーカ５から出
力された測定信号を収音して、収音信号を取得する。マイク２Ｌ、２Ｒは収音信号を処理
装置２０１に出力する。被測定者１は、人でもよく、ダミーヘッドでもよい。すなわち、
本実施形態において、被測定者１は人だけでなく、ダミーヘッドを含む概念である。 The stereo microphone 2 has a left microphone 2L and a right microphone 2R. Left mic 2L
is installed on the subject's 1 left ear 9L, and the right microphone 2R is installed on the subject's 1 right ear 9R. Specifically, it is preferable to install the microphones 2L and 2R at positions from the entrance of the ear canal of the left ear 9L and the right ear 9R to the eardrum. The microphones 2L and 2R pick up the measurement signal output from the stereo speaker 5 to acquire the picked-up sound signal. The microphones 2L and 2R output picked-up sound signals to the processing device 201. FIG. The person 1 to be measured may be a person or a dummy head. i.e.
In this embodiment, the person to be measured 1 is a concept including not only a person but also a dummy head.

ステレオマイク２の左マイク２Ｌ、右マイク２Ｒがそれぞれ測定信号を収音し、収音信
号を処理装置２０１に出力する。収音信号取得部２１２は、左マイク２Ｌ、右マイク２Ｒ
で収音された収音信号を取得する。なお、収音信号取得部２１２は、マイク２Ｌ、２Ｒか
らの収音信号をＡ／Ｄ変換するＡ／Ｄ変換器を備えていてもよい。収音信号取得部２１２
は、複数回の測定により得られた信号を同期加算してもよい。 The left microphone 2 L and the right microphone 2 R of the stereo microphone 2 each pick up the measurement signal and output the picked-up sound signal to the processing device 201 . The collected sound signal acquisition unit 212 receives the left microphone 2L, the right microphone 2R,
Acquire the collected sound signal. The collected sound signal acquisition unit 212 may include an A/D converter that A/D converts the collected sound signals from the microphones 2L and 2R. Acquired sound signal acquisition unit 212
may synchronously add signals obtained by multiple measurements.

左スピーカ５Ｌがインパルス音を出力することで、収音信号取得部２１２は、伝達特性
Ｈｌｓに対応する収音信号と、伝達特性Ｈｌｏに対応する収音信号を取得する。その後、
右スピーカ５Ｒがインパルス音を出力することで、収音信号取得部２１２は、伝達特性Ｈ
ｒｓに対応する収音信号と、伝達特性Ｈｒｏに対応する収音信号を取得する。なお、左ス
ピーカ５Ｌによる測定と、右スピーカ５Ｒによる測定との順番は反対でもよい。 When the left speaker 5L outputs the impulse sound, the collected sound signal acquisition unit 212 acquires a collected sound signal corresponding to the transfer characteristic Hls and a collected sound signal corresponding to the transfer characteristic Hlo. after that,
By outputting the impulse sound from the right speaker 5R, the collected sound signal acquisition unit 212 acquires the transfer characteristic H
A picked-up sound signal corresponding to rs and a picked-up sound signal corresponding to the transfer characteristic Hro are obtained. The order of the measurement by the left speaker 5L and the measurement by the right speaker 5R may be reversed.

上記のように、左右のスピーカ５Ｌ、５Ｒで出力されたインパルス音をマイク２Ｌ、２
Ｒで測定することでインパルス応答が測定される。処理装置２０１は、インパルス応答測
定に基づいて取得した収音信号をメモリなどに記憶する。これにより、左スピーカ５Ｌと
左マイク２Ｌとの間の伝達特性Ｈｌｓ、左スピーカ５Ｌと右マイク２Ｒとの間の伝達特性
Ｈｌｏ、右スピーカ５Ｒと左マイク２Ｌとの間の伝達特性Ｈｒｏ、右スピーカ５Ｒと右マ
イク２Ｒとの間の伝達特性Ｈｒｓが測定される。すなわち、左スピーカ５Ｌから出力され
た測定信号を左マイク２Ｌが収音することで、伝達特性Ｈｌｓが取得される。左スピーカ
５Ｌから出力された測定信号を右マイク２Ｒが収音することで、伝達特性Ｈｌｏが取得さ
れる。右スピーカ５Ｒから出力された測定信号を左マイク２Ｌが収音することで、伝達特
性Ｈｒｏが取得される。右スピーカ５Ｒから出力された測定信号を右マイク２Ｒが収音す
ることで、伝達特性Ｈｒｓが取得される。 As described above, the impulse sounds output by the left and right speakers 5L and 5R are transferred to the microphones 2L and 2
Measuring with R measures the impulse response. The processing device 201 stores the picked-up sound signal acquired based on the impulse response measurement in a memory or the like. As a result, the transfer characteristic Hls between the left speaker 5L and the left microphone 2L, the transfer characteristic Hlo between the left speaker 5L and the right microphone 2R, the transfer characteristic Hro between the right speaker 5R and the left microphone 2L, the right speaker A transfer characteristic Hrs between 5R and the right microphone 2R is measured. That is, the measurement signal output from the left speaker 5L is picked up by the left microphone 2L to obtain the transfer characteristic Hls. The transfer characteristic Hlo is acquired by the right microphone 2R picking up the measurement signal output from the left speaker 5L. The measurement signal output from the right speaker 5R is picked up by the left microphone 2L to obtain the transfer characteristic Hro. The transfer characteristic Hrs is acquired by the right microphone 2R picking up the measurement signal output from the right speaker 5R.

そして、フィルタ生成部２１３は、収音信号に基づいて、左右のスピーカ５Ｌ、５Ｒか
ら左右のマイク２Ｌ、２Ｒまでの伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに応じたフィ
ルタを生成する。フィルタ生成部２１３は、伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓを
所定のフィルタ長で切り出して、所定の演算処理を行う。このようにすることで、処理装
置２０１は、頭外定位処理装置１００の畳み込み演算に用いられる空間音響フィルタを生
成する。 Then, the filter generation unit 213 generates filters according to the transfer characteristics Hls, Hlo, Hro, Hrs from the left and right speakers 5L, 5R to the left and right microphones 2L, 2R based on the collected sound signals. The filter generator 213 cuts out the transfer characteristics Hls, Hlo, Hro, and Hrs with a predetermined filter length and performs predetermined arithmetic processing. By doing so, the processing device 201 generates a spatial acoustic filter used in the convolution operation of the out-of-head localization processing device 100 .

次に、外耳道伝達特性の測定と、逆フィルタＬｉｎｖ、Ｒｉｎｖの生成について説明す
る。被測定者１が、左右の耳９Ｌ、９Ｒにマイク２Ｌ、２Ｒを装着した状態で、ヘッドホ
ン４３を装着する。すなわち、被測定者１は左右のマイク２Ｌ、２Ｒの上から、ヘッドホ
ン４３を装着する。左マイク２Ｌ、及び右マイク２Ｒは、ヘッドホン４３に干渉しないよ
うに構成されている。すなわち、左マイク２Ｌ、及び右マイク２Ｒは左耳９Ｌ、右耳９Ｒ
の適切な位置に配置された状態で、被測定者１がヘッドホン４３を装着することができる
。 Next, the measurement of the ear canal transfer characteristics and the generation of the inverse filters Linv and Rinv will be described. The subject 1 wears the headphones 43 with the microphones 2L and 2R attached to the left and right ears 9L and 9R. That is, the subject 1 wears the headphones 43 over the left and right microphones 2L and 2R. The left microphone 2L and the right microphone 2R are configured so as not to interfere with the headphone 43. FIG. That is, the left microphone 2L and the right microphone 2R are connected to the left ear 9L and the right ear 9R.
The person to be measured 1 can wear the headphones 43 while the headphones 43 are arranged at appropriate positions.

測定信号生成部２１１が生成した測定信号は、ヘッドホン４３の左ユニット４３Ｌ、右
ユニット４３Ｒからそれぞれ出力される。左マイク２Ｌは、ヘッドホン４３の左ユニット
４３Ｌから出力された音を収音する。右マイク２Ｒは、ヘッドホン４３の右ユニット４３
Ｒから出力された音を収音する。 The measurement signal generated by the measurement signal generator 211 is output from the left unit 43L and the right unit 43R of the headphone 43, respectively. The left microphone 2L picks up sound output from the left unit 43L of the headphone 43 . The right microphone 2R is connected to the right unit 43 of the headphone 43.
Sound output from R is collected.

このように、マイク２Ｌ、２Ｒは、ヘッドホン４３から出力された測定信号を収音して
、収音信号を検出する。収音信号取得部２１２は、マイク２Ｌ、２Ｒからの収音信号を取
得する。なお、外耳道伝達特性と空間音響伝達特性の測定は、別の処理装置２０１を用い
て、別の場所で行われてもよい。したがって、ステレオスピーカ５が設けられている測定
環境以外の場所でも、外耳道伝達特性を測定することが可能である。 In this way, the microphones 2L and 2R pick up the measurement signal output from the headphone 43 and detect the picked-up sound signal. The collected sound signal acquisition unit 212 acquires the collected sound signals from the microphones 2L and 2R. It should be noted that the measurement of the ear canal transfer characteristics and the spatial sound transfer characteristics may be performed using another processing device 201 and at another location. Therefore, it is possible to measure the ear canal transfer characteristics even in a place other than the measurement environment where the stereo speakers 5 are provided.

処理装置２０１は、インパルス応答測定に基づく収音信号をメモリなどに記憶する。こ
れにより、左ユニット４３Ｌと左マイク２Ｌとの間の伝達特性（すなわち、左耳９Ｌの外
耳道伝達特性ＥＣＴＦＬ）と、右ユニット４３Ｒと右マイク２Ｒとの間の伝達特性（すな
わち、右耳９Ｒの外耳道伝達特性ＥＣＴＦＲ）が取得される。処理装置２０１は、測定デ
ータを記憶するメモリなどを有している。 The processing device 201 stores the picked-up sound signal based on the impulse response measurement in a memory or the like. As a result, the transfer characteristic between the left unit 43L and the left microphone 2L (that is, the ear canal transfer characteristic ECTFL of the left ear 9L) and the transfer characteristic between the right unit 43R and the right microphone 2R (that is, the Ear canal transfer characteristics (ECTFR) are obtained. The processing device 201 has a memory or the like for storing measurement data.

処理装置２０１は、外耳道伝達特性ＥＣＴＦＬ、ＥＣＴＦＲに基づいて、逆フィルタＬ
ｉｎｖ、Ｒｉｎｖをそれぞれ算出する。例えば、処理装置２０１は、離散フーリエ変換や
離散コサイン変換などにより、外耳道伝達特性の周波数振幅特性及び周波数位相特性を算
出する。そして、処理装置２０１は、周波数振幅特性の逆特性を求める。なお、処理装置
２０１は、周波数帯域毎に、周波数振幅特性、又はその逆特性等を補正してもよい。処理
装置２０１は、逆離散フーリエ変換等により、逆特性と位相特性とを用いて時間信号を算
出する。処理装置２０１は、時間信号を所定のフィルタ長で切り出すことで、逆フィルタ
を算出する。上記のように、逆フィルタはヘッドホン特性（ヘッドホンの再生ユニットと
マイク間の特性）をキャンセルするフィルタである。なお、逆フィルタの算出方法につい
ては、公知の手法を用いることができるため、詳細な説明を省略する。 The processing device 201 applies an inverse filter L
Calculate inv and Rinv respectively. For example, the processing device 201 calculates frequency-amplitude characteristics and frequency-phase characteristics of the ear canal transfer characteristics by discrete Fourier transform, discrete cosine transform, or the like. Then, the processing device 201 obtains the inverse characteristic of the frequency-amplitude characteristic. Note that the processing device 201 may correct the frequency amplitude characteristic or its inverse characteristic for each frequency band. The processing unit 201 calculates a time signal using the inverse characteristic and the phase characteristic by inverse discrete Fourier transform or the like. The processing device 201 calculates an inverse filter by cutting out the time signal with a predetermined filter length. As described above, the inverse filter is a filter that cancels the headphone characteristics (the characteristics between the headphone playback unit and the microphone). In addition, since a well-known method can be used for the calculation method of the inverse filter, detailed description thereof will be omitted.

処理装置２０１は、空間音響伝達特性Ｈｌｓ、Ｈｌｏ、Ｈｒｏ、Ｈｒｓに応じたフィル
タ、及び左右の逆フィルタＬｉｎｖ、Ｒｉｎｖを保存する。処理装置２０１と頭外定位処
理装置１００とが異なる装置の場合、処理装置２０１は、フィルタ、及び逆フィルタを頭
外定位処理装置１００に送信する。なお、ヘッドホン４３又はステレオスピーカ５と、処
理装置２０１との接続は、Ｂｌｕｅｔｏｏｔｈ（登録商標）などを用いた無線接続であっ
てもよい。 The processing unit 201 stores filters according to the spatial sound transfer characteristics Hls, Hlo, Hro, Hrs and left and right inverse filters Linv, Rinv. If the processing device 201 and the out-of-head localization processing device 100 are different devices, the processing device 201 transmits the filter and the inverse filter to the out-of-head localization processing device 100 . Note that the connection between the headphones 43 or the stereo speakers 5 and the processing device 201 may be a wireless connection using Bluetooth (registered trademark) or the like.

ここで、頭外定位処理受聴を行うユーザＵに対して、測定装置２００が測定を行うこと
で、ユーザＵに適した頭外定位フィルタを生成することができる。つまり、ユーザＵを被
測定者１として、測定装置２００が、空間音響伝達特性、及び外耳道伝達特性を行うこと
で、ユーザＵ固有の頭外定位フィルタ（以下、単にフィルタとも称する）を生成すること
ができる。
（システム構成）
次に、フィルタ生成、及び頭外定位処理を行うシステム構成について、図３を用いて説
明する。図３は、システムの全体構成を模式的に示す図である。システム１０００は測定
装置２００と、サーバ端末６００と、を備えている。図３では、航空機５００の搭乗者に
対して、頭外定位処理を行うためのシステム１０００が示されている。つまり、図１で示
したユーザＵが航空機５００の搭乗者となる。ここでは、フィルタと搭乗者の識別情報と
を対応付けて格納することで、搭乗者毎にフィルタを設定することができる。従って、搭
乗者毎に異なるフィルタを用いて、頭外定位処理を行うことができる。 Here, an out-of-head localization filter suitable for the user U can be generated by measuring the user U who performs out-of-head localization processing listening with the measurement device 200 . That is, with the user U as the person to be measured 1, the measurement apparatus 200 performs spatial acoustic transfer characteristics and ear canal transfer characteristics to generate an out-of-head localization filter (hereinafter simply referred to as a filter) unique to the user U. can be done.
(System configuration)
Next, a system configuration for filter generation and out-of-head localization processing will be described with reference to FIG. FIG. 3 is a diagram schematically showing the overall configuration of the system. A system 1000 includes a measuring device 200 and a server terminal 600 . Referring to FIG. 3, a system 1000 for performing out-of-head localization for occupants of aircraft 500 is shown. In other words, user U shown in FIG. 1 becomes a passenger on aircraft 500 . Here, the filter can be set for each passenger by storing the filter and the identification information of the passenger in association with each other. Therefore, out-of-head localization processing can be performed using a different filter for each passenger.

測定装置２００は、図２で示した測定装置２００と同様である。ここでは、測定装置２
００は、空港に設置されている。例えば、測定装置２００は、航空会社のラウンジ等に設
置されていてもよい。ラウンジを測定環境とする場合、ラウンジには、ステレオスピーカ
５、ヘッドホン４３、及び処理装置２０１等が設置されている。さらに、ラウンジには、
測定装置２００を設置するためのデスクなどが設けられていてもよい。処理装置２０１は
、図１で示した頭外定位処理装置１００の頭外定位処理機能を有していてもよい。 The measuring device 200 is similar to the measuring device 200 shown in FIG. Here, the measuring device 2
00 is installed at the airport. For example, the measuring device 200 may be installed in an airline lounge or the like. When the lounge is used as the measurement environment, stereo speakers 5, headphones 43, processing device 201, and the like are installed in the lounge. In addition, the lounge has
A desk or the like for installing the measuring device 200 may be provided. The processing device 201 may have the out-of-head localization processing function of the out-of-head localization processing device 100 shown in FIG.

測定装置２００には、入力手段２２１と、表示手段２２２と、通信手段２２３と、アン
テナ２２４が設けられている。入力手段２２１は、キーボード、マウス、タッチパネルな
どを備えており、搭乗者又は操作者からの入力を受け付ける。あるいは、音声入力を受け
付ける場合、入力手段２２１は、マイク及び音声認識機能を備えている。表示手段２２２
は、モニタなどを備えており、入力画面や測定画面を表示する。 The measuring device 200 is provided with input means 221 , display means 222 , communication means 223 and an antenna 224 . The input means 221 includes a keyboard, mouse, touch panel, etc., and receives input from the passenger or operator. Alternatively, when accepting voice input, the input means 221 has a microphone and a voice recognition function. display means 222
has a monitor, etc., and displays an input screen and a measurement screen.

搭乗者は、表示手段２２２に表示された入力画面を確認しながら、入力手段２２１を操
作することで、搭乗者を識別するための識別情報を入力する。識別情報は搭乗者毎に与え
られている固有の情報である。例えば、搭乗者が搭乗する航空機５００の航空会社におけ
るマイレージクラブの会員番号（会員ＩＤ）を、識別情報として用いることができる。こ
の場合、入力手段２２１は、マイレージカードをスキャンするカードスキャナなどであっ
てもよい。入力手段２２１は、スマートホンの画面に表示した識別情報やスマートホンに
内蔵されたＩＣチップに記憶された識別情報などを読み取る装置であってもよい。 The passenger operates the input means 221 while checking the input screen displayed on the display means 222 to input identification information for identifying the passenger. The identification information is unique information given to each passenger. For example, a membership number (member ID) of a mileage club of an airline company of aircraft 500 boarded by a passenger can be used as identification information. In this case, the input means 221 may be a card scanner or the like for scanning a mileage card. The input unit 221 may be a device that reads identification information displayed on the screen of the smartphone or identification information stored in an IC chip built into the smartphone.

図２で説明したように、測定装置２００は、測定を行って、フィルタを生成する。ここ
で、ヘッドホン４３は、航空機５００に搭載されたヘッドホンと同じタイプのものである
。ヘッドホン４３は、航空会社から貸与される。あるいは、搭乗者がヘッドホン４３を購
入することも可能となっている。また、複数のヘッドホン４３を用意しておいて、測定に
使用したヘッドホン４３を搭乗者が機内に持ち込むようにしてもよい。フライト毎にヘッ
ドホン４３の機種が異なる場合、測定環境に、それぞれの機種のヘッドホン４３を用意し
ておくことが好ましい。 As described in FIG. 2, the measurement device 200 makes measurements and generates filters. Here, the headphones 43 are of the same type as the headphones carried on the aircraft 500 . Headphones 43 are loaned from airlines. Alternatively, the passenger can purchase the headphones 43 . Alternatively, a plurality of headphones 43 may be prepared and passengers may carry the headphones 43 used for measurement into the cabin. When the model of the headphones 43 differs for each flight, it is preferable to prepare the headphones 43 of each model in the measurement environment.

測定装置２００は、生成したフィルタをサーバ端末６００に送信する。測定装置２００
は、識別情報に対応付けてフィルタをサーバ端末６００に送信する。これにより、サーバ
端末６００は、搭乗者の識別情報とフィルタとを受信する。サーバ端末６００は、機内に
配置されていてもよく、機外に配置されていてもよい。 Measuring device 200 transmits the generated filter to server terminal 600 . Measuring device 200
transmits the filter to the server terminal 600 in association with the identification information. Accordingly, server terminal 600 receives the identification information of the passenger and the filter. The server terminal 600 may be placed inside the aircraft or outside the aircraft.

例えば、通信手段２２３は、フィルタや識別情報のデータに対して、変調などを行う変
調回路等を有している。通信手段２２３で変調されたデータが、アンテナ２２４から送信
される。サーバ端末６００のアンテナ６０１は、アンテナ２２４から送信されたデータを
受信する。 For example, the communication means 223 has a modulation circuit or the like that modulates data of filters and identification information. Data modulated by communication means 223 is transmitted from antenna 224 . Antenna 601 of server terminal 600 receives data transmitted from antenna 224 .

図４は、サーバ端末６００を模式的に示す図である。受信手段６０２は、受信したデー
タを復調する復調回路等を備えている。サーバ端末６００は、識別情報に応じて、フィル
タを航空機５００内の座席端末５１１に設定する。 FIG. 4 is a diagram schematically showing the server terminal 600. As shown in FIG. The receiving means 602 includes a demodulation circuit or the like for demodulating the received data. Server terminal 600 sets a filter to seat terminal 511 in aircraft 500 according to the identification information.

航空機５００は、複数の搭乗席と、複数の座席端末５１１と、が備えている。つまり、
航空機５００内では、搭乗席毎に座席端末５１１が設けられている。図５に搭乗席と座席
端末５１１の一例を示す。図５では、搭乗席５２１に搭乗者であるユーザＵ（搭乗者）が
座っている。ユーザＵがヘッドホン４３を装着している。機内においてユーザＵが装着す
るヘッドホン４３は、測定を行ったヘッドホン４３と同じ機種のものとなっている。 Aircraft 500 includes multiple passenger seats and multiple seat terminals 511 . in short,
In the aircraft 500, a seat terminal 511 is provided for each passenger seat. FIG. 5 shows an example of a passenger seat and a seat terminal 511. As shown in FIG. In FIG. 5 , a user U (passenger) who is a passenger is sitting in a passenger seat 521 . A user U is wearing headphones 43 . The headphones 43 worn by the user U in the cabin are of the same model as the headphones 43 used for the measurement.

座席端末５１１は、例えば、搭乗席５２１の下や肘掛け等に設置されている。ここでは
、搭乗席５２１毎に座席端末５１１が設置されている。そして、座席端末５１１に設けら
れたイヤホンジャック（不図示）には、ヘッドホン４３が接続されている。座席端末５１
１が図１で示した頭外定位処理装置１００に対応する。 The seat terminal 511 is installed, for example, under the passenger seat 521 or on an armrest. Here, a seat terminal 511 is installed for each passenger seat 521 . A headphone 43 is connected to an earphone jack (not shown) provided on the seat terminal 511 . seat terminal 51
1 corresponds to the out-of-head localization processing apparatus 100 shown in FIG.

座席端末５１１は、搭乗席５２１毎に設置されている。つまり、複数の搭乗席５２１と
複数の座席端末５１１は、１対１に対応付けられている。サーバ端末６００は、搭乗席５
２１の座席番号と、座席端末５１１の端末番号を対応付けて、保存している。座席端末５
１１の端末番号は、例えば、座席端末５１１のＩＰアドレス等の固有の情報である。さら
に、航空機５００では、搭乗者毎に搭乗する搭乗席５２１が指定されている。つまり、サ
ーバ端末６００は、搭乗者の識別情報と搭乗席とを対応付けて、保存している。 A seat terminal 511 is installed for each passenger seat 521 . That is, the plurality of boarding seats 521 and the plurality of seat terminals 511 are associated one-to-one. The server terminal 600 is connected to the passenger seat 5
21 and the terminal number of the seat terminal 511 are associated and stored. seat terminal 5
The terminal number 11 is unique information such as the IP address of the seat terminal 511, for example. Furthermore, in aircraft 500, a boarding seat 521 is designated for each passenger. In other words, the server terminal 600 associates and stores the identification information of the passenger and the boarding seat.

例えば、航空会社は、搭乗者の識別情報と、フライト番号と、座席番号と、端末番号と
、を管理している。よって、サーバ端末６００は、識別情報を参照することで、搭乗者が
搭乗する航空機とその搭乗席を特定することができる。そして、サーバ端末６００は、特
定した搭乗席５２１に対応する座席端末５１１にフィルタを送信する。 For example, airlines manage passenger identification information, flight numbers, seat numbers, and terminal numbers. Therefore, the server terminal 600 can specify the aircraft and the boarding seat of the passenger by referring to the identification information. The server terminal 600 then transmits the filter to the seat terminal 511 corresponding to the specified boarding seat 521 .

このように、測定装置２００は、搭乗者を識別するための識別情報に対応付けて、フィ
ルタをサーバ端末６００に送信する。そして、サーバ端末６００は、識別情報を参照して
、搭乗者の搭乗席５２１の座席端末５１１にフィルタを送信する。これにより、システム
１０００は、搭乗者毎に適切なフィルタを設定することができる。 In this way, the measuring device 200 transmits the filter to the server terminal 600 in association with the identification information for identifying the passenger. Then, the server terminal 600 refers to the identification information and transmits the filter to the seat terminal 511 of the passenger's boarding seat 521 . This allows system 1000 to set an appropriate filter for each passenger.

次に、測定装置２００における処理について、図６を用いて説明する。図６は、測定装
置２００における処理を示すフローチャートである。まず、搭乗者（ユーザＵ）の操作に
よって、処理装置２０１に搭乗者のＩＤ（識別情報）が入力される（Ｓ１）。処理装置２
０１は、識別情報を記憶する。そして、処理装置２０１が識別情報に対応する搭乗者のフ
ライト番号と座席番号を確認する（Ｓ２）。例えば、処理装置２０１は、サーバ端末６０
０の情報を参照して、フライト番号と座席番号を確認する。処理装置２０１が、フライト
番号又は座席番号を確認できない場合（Ｓ２のＮＯ）、エラーメッセージを表示し（Ｓ３
）、終了する。つまり、処理装置２０１は、搭乗者の識別情報に対応するフライト番号と
座席番号を特定することができないため、エラーメッセージを表示した上で、処理を終了
する。 Next, processing in the measuring device 200 will be described using FIG. FIG. 6 is a flowchart showing processing in the measuring device 200. As shown in FIG. First, the ID (identification information) of the passenger is input to the processing device 201 by the operation of the passenger (user U) (S1). Processing device 2
01 stores identification information. Then, the processing device 201 confirms the passenger's flight number and seat number corresponding to the identification information (S2). For example, the processing device 201 may use the server terminal 60
Check the flight number and seat number by referring to the information in 0. If the processing device 201 cannot confirm the flight number or seat number (NO in S2), it displays an error message (S3
),finish. That is, since the processing device 201 cannot specify the flight number and seat number corresponding to the passenger identification information, the processing ends after displaying an error message.

処理装置２０１が、フライト番号又は座席番号を確認できた場合（Ｓ２のＹＥＳ）、測
定装置２００が、空間音響伝達特性及び外耳道伝達特性の測定を行う（Ｓ４）。なお、空
間音響伝達特性及び外耳道伝達特性を測定する順番は、特に限定されるものではない。そ
して、処理装置２０１が、測定結果に基づいて、フィルタを生成する（Ｓ５）。ここでは
、処理装置２０１は、４つの空間音響フィルタ、及び左右の逆フィルタＬｉｎｖ、Ｒｉｎ
ｖを含む頭外定位処理フィルタを保存する。 If the processing device 201 can confirm the flight number or seat number (YES in S2), the measuring device 200 measures the spatial sound transfer characteristics and the ear canal transfer characteristics (S4). The order of measuring the spatial sound transfer characteristics and the ear canal transfer characteristics is not particularly limited. Then, the processing device 201 generates a filter based on the measurement result (S5). Here, the processing unit 201 includes four spatial acoustic filters and left and right inverse filters Linv, Rin
Save the out-of-head localization filter containing v.

次に、Ｓ５で生成されたフィルタを用いて、頭外定位処理装置１００がユーザＵに頭外
定位処理された再生信号を試聴させる（Ｓ６）。つまり、ユーザＵがヘッドホン４３を装
着して、頭外定位受聴を行なう。これにより、頭外に音像が定位された音場を再生するこ
とができる。ここで、頭外定位処理装置１００は、Ｓ４での測定やＳ５でのフィルタ生成
を行った処理装置２０１と共通の装置であってもよく、別の装置であってもよい。処理装
置２０１と頭外定位処理装置１００とが、物理的に別の装置となっている場合、処理装置
２０１は、無線又は有線で、頭外定位処理装置１００にフィルタを送信する。 Next, using the filter generated in S5, the out-of-head localization processing device 100 causes the user U to listen to the reproduced signal after out-of-head localization processing (S6). That is, the user U wears the headphones 43 and performs out-of-head localization listening. As a result, a sound field in which the sound image is localized outside the head can be reproduced. Here, the out-of-head localization processing device 100 may be the same device as the processing device 201 that performed the measurement in S4 and the filter generation in S5, or may be a separate device. If the processing device 201 and the out-of-head localization processing device 100 are physically separate devices, the processing device 201 transmits the filter to the out-of-head localization processing device 100 wirelessly or by wire.

そして、処理装置２０１又は頭外定位処理装置１００が、この音場を採用するか否かを
判定する（Ｓ７）。処理装置２０１又は頭外定位処理装置１００は、ユーザ入力に応じて
、この音場を採用するか否かを判定する。例えば、表示手段２２２が、試聴した音場でよ
いか否かを確認するためのメッセージと選択ボタンなどをモニタ上に表示させる。あるい
は、処理装置２０１は、音声メッセージで問い合わせを行うようにしてもよい。 Then, the processing device 201 or the out-of-head localization processing device 100 determines whether or not to adopt this sound field (S7). The processing device 201 or the out-of-head localization processing device 100 determines whether or not to adopt this sound field according to user input. For example, the display means 222 displays on the monitor a message and a selection button for confirming whether or not the auditioned sound field is acceptable. Alternatively, the processing unit 201 may ask for a voice message.

ユーザＵは、入力手段２２１を操作することで、試聴した音場を採用するか否かを選択
することができる。つまり、ユーザＵは、頭外定位受聴の受聴結果に応じて、音場が適切
であるか否かを指定する。ユーザＵは、頭外定位処理された音場に違和感などを覚えた場
合、この音場を採用しないように、入力を行う。頭外定位再生された音場が好みの音場で
ある場合、ユーザＵは、この音場を採用するように、入力を行う。ユーザＵは、音場を試
聴した聴感に応じて、音場（フィルタ）の採用又は不採用のボタンを選択（例えばクリッ
クやタップなど）する。そして、処理装置２０１がユーザ入力に基づいて、音場を採用す
るか否かを判定する。 By operating the input means 221, the user U can select whether or not to adopt the sound field that has been auditioned. That is, the user U specifies whether the sound field is appropriate or not according to the listening result of the out-of-head localization listening. If the user U feels uncomfortable with the sound field subjected to out-of-head localization processing, he or she makes an input so as not to adopt this sound field. If the out-of-head localization-reproduced sound field is the desired sound field, the user U makes an input to adopt this sound field. The user U selects (for example, clicks or taps) a button for adopting or not adopting the sound field (filter) according to the auditory sense of the sound field. Then, based on the user input, the processing device 201 determines whether or not to adopt the sound field.

この音場を採用しないと判定した場合（Ｓ７のＮＯ）。処理装置２０１は、再測定の要
求があるか否かを判定する（Ｓ８）。処理装置２０１又は頭外定位処理装置１００は、ユ
ーザ入力に応じて、再測定の要求があるか否かを判定する。例えば、表示手段２２２が、
再測定ボタンと終了ボタンなどをモニタ上に表示させる。あるいは、処理装置２０１は、
音声メッセージで再測定の問い合わせを行うようにしてもよい。 When it is determined that this sound field is not adopted (NO in S7). The processing device 201 determines whether or not there is a request for remeasurement (S8). The processing device 201 or the out-of-head localization processing device 100 determines whether or not there is a request for re-measurement according to user input. For example, the display means 222
Display a remeasurement button, an end button, etc. on the monitor. Alternatively, the processing device 201
An inquiry about remeasurement may be made by a voice message.

ユーザＵは再測定を行なう場合、再測定ボタンを選択する。ユーザＵは、再測定を行な
わない場合、終了ボタンを選択する。処理装置２０１は、ユーザ入力を受け付けると、ユ
ーザ入力に基づいて、再測定を行なうか否かを判定する。もちろん、音声メッセージを用
いた入出力であってもよい。 The user U selects the remeasurement button when performing remeasurement. The user U selects the end button when not re-measuring. Upon receiving the user input, the processing device 201 determines whether or not to perform re-measurement based on the user input. Of course, input/output using a voice message may be used.

Ｓ８において、再測定要求が無いと判定された場合（Ｓ８のＮＯ）、処理装置２０１は
、処理を終了する。Ｓ８において、再測定要求が有ると判定された場合（Ｓ８のＹＥＳ）
、Ｓ４に戻り、再測定を行う。そして、再測定結果に基づいて、測定装置２００がフィル
タを生成する（Ｓ５）。そして、頭外定位処理装置１００が再測定により得られたフィル
タを用いて、再度、試聴を実施する。（Ｓ６）。 If it is determined in S8 that there is no remeasurement request (NO in S8), the processing device 201 ends the process. If it is determined in S8 that there is a remeasurement request (YES in S8)
, and returns to S4 for re-measurement. Then, based on the re-measurement result, the measuring device 200 generates a filter (S5). Then, the out-of-head localization processing apparatus 100 uses the filter obtained by the remeasurement to perform trial listening again. (S6).

Ｓ７において、処理装置２０１が、この音場を採用すると判定した場合（Ｓ７のＹＥＳ
）、ユーザＵによる料金の支払いを受け付ける（Ｓ９）。ここで、料金はクレジットカー
ド、又は現金による支払いに限らず、航空会社のマイル（マイレージ）等のポイントによ
る支払いでもよい。そして、処理装置２０１は、フィルタ及び識別情報を含む個人データ
をサーバ端末６００に送信する（Ｓ１０）。これにより、処理が終了する。 In S7, when the processing device 201 determines to adopt this sound field (YES in S7
), and accepts the payment of the fee by the user U (S9). Here, the fee is not limited to payment by credit card or cash, but payment by points such as airline miles (mileage) may be used. Then, the processing device 201 transmits personal data including the filter and identification information to the server terminal 600 (S10). This ends the processing.

次に、サーバ端末６００における処理について、図７を用いて、説明する。図７は、サ
ーバ端末６００における処理を示すフローチャートである。まず、サーバ端末６００は、
搭乗者のＩＤ（識別情報）を受信したか否かを判定する（Ｓ１１）。ここでは、サーバ端
末６００が、図６のＳ１において入力された識別情報を処理装置２０１から受信したか否
かを判定する。サーバ端末６００が、識別情報を受信していない場合（Ｓ１１のＮＯ）、
受信するまで処理を繰り返す。 Next, processing in server terminal 600 will be described using FIG. FIG. 7 is a flow chart showing processing in the server terminal 600. As shown in FIG. First, the server terminal 600
It is determined whether or not the passenger's ID (identification information) has been received (S11). Here, it is determined whether or not the server terminal 600 has received the identification information input in S1 of FIG. If the server terminal 600 has not received the identification information (NO in S11),
Repeat the process until received.

サーバ端末６００は、識別情報を受信した場合（Ｓ１１のＹＥＳ）、識別情報に対応す
るフライト番号、座席番号、座席端末番号を検索して、処理装置２０１に送信する（Ｓ１
２）。サーバ端末６００は、航空会社が管理しているフライト情報等を参照して、識別情
報の搭乗者が搭乗するフライト番号、座席番号、座席端末番号を特定する。なお、処理装
置２０１では、Ｓ１２で送信されたフライト番号、座席番号に基づいて、図６のＳ２での
判定を行っている。 When the server terminal 600 receives the identification information (YES in S11), it searches for the flight number, seat number, and seat terminal number corresponding to the identification information, and transmits them to the processing device 201 (S1
2). The server terminal 600 identifies the flight number, seat number, and seat terminal number of the passenger of the identification information by referring to flight information managed by the airline company. Note that the processing device 201 performs the determination in S2 of FIG. 6 based on the flight number and seat number transmitted in S12.

次に、サーバ端末６００は、支払い意思通知を受信したか否かを判定する（Ｓ１３）。
ここでは、サーバ端末６００は、図６のＳ９での料金支払いが行なわれているか否かを判
定する。サーバ端末６００は、料金の支払い意思通知を受信していない場合（Ｓ１３のＮ
Ｏ）、支払い意思通知を受信するまで待機する。 Next, the server terminal 600 determines whether or not the notice of intention to pay has been received (S13).
Here, server terminal 600 determines whether or not payment has been made in S9 of FIG. If the server terminal 600 has not received the notice of intention to pay the fee (N in S13
O), wait until receiving notification of intention to pay;

サーバ端末６００は、料金の支払い意思通知を受信した場合（Ｓ１３のＹＥＳ）、支払
い処理を実行する（Ｓ１４）。そして、サーバ端末６００は、支払いを完了したか否かを
判定する（Ｓ１５）。サーバ端末６００は、支払いを完了していない場合（Ｓ１５のＮＯ
）、エラーメッセージを表示手段２２２に表示させる（Ｓ１６）。つまり、サーバ端末６
００は、ポイント残高が不足している場合などに、エラーメッセージを処理装置２０１に
送信する。これにより、処理装置２０１の表示手段２２２がエラーメッセージをユーザＵ
に対して表示する。 When the server terminal 600 receives the notice of intention to pay the fee (YES in S13), it executes the payment process (S14). Then, the server terminal 600 determines whether or not the payment has been completed (S15). If the payment has not been completed (NO in S15), the server terminal 600
), and an error message is displayed on the display means 222 (S16). That is, the server terminal 6
00 transmits an error message to the processing device 201 when the point balance is insufficient. As a result, the display means 222 of the processing device 201 displays the error message to the user U.
display for

支払いを完了した場合（Ｓ１５のＹＥＳ）、サーバ端末６００は、個人データを受信す
る（Ｓ１７）。つまり、図６のＳ１０で処理装置２０１が送信した個人データをサーバ端
末６００が受信する。個人データは、識別情報とフィルタとを含んでいる。そして、サー
バ端末６００は、識別情報を参照して、座席端末５１１にフィルタを転送する。つまり、
識別情報に対応する搭乗席の座席端末５１１に、フィルタを送信する（Ｓ１８）。これに
より、搭乗者に応じたフィルタが、座席端末５１１に設定される。 If the payment has been completed (YES in S15), the server terminal 600 receives personal data (S17). That is, the server terminal 600 receives the personal data transmitted by the processing device 201 in S10 of FIG. Personal data includes identification information and filters. Then, server terminal 600 refers to the identification information and transfers the filter to seat terminal 511 . in short,
The filter is transmitted to the seat terminal 511 of the boarding seat corresponding to the identification information (S18). As a result, a filter corresponding to the passenger is set in the seat terminal 511. FIG.

搭乗者が航空機に搭乗した後に、頭外定位処理機能をオンとすると、座席端末５１１が
フィルタを用いて頭外定位処理を行う。つまり、座席端末５１１が図１の頭外定位処理装
置１００として機能する。これにより、ヘッドホン４３が、頭外定位処理された再生信号
を搭乗者に対して出力する。このように、搭乗者に対して測定された空間音響伝達特性、
及び外耳道伝達特性に応じたフィルタを用いて、搭乗者が頭外定位受聴を行なうことがで
きる。このようにすることで、適切なフィルタを用いて、頭外定位受聴を行うことができ
る。 When the passenger turns on the out-of-head localization processing function after boarding the aircraft, the seat terminal 511 performs out-of-head localization processing using a filter. In other words, the seat terminal 511 functions as the out-of-head localization processing device 100 in FIG. As a result, the headphone 43 outputs the reproduction signal subjected to the out-of-head localization processing to the passenger. Thus, the spatial sound transfer characteristics measured with respect to the occupant,
And, using a filter according to the ear canal transfer characteristics, the passenger can perform out-of-head stereotactic listening. By doing so, out-of-head localization listening can be performed using an appropriate filter.

よって、搭乗者は、搭乗席５２１に着席している間において、リラックスして、再生信
号を受聴することができるため、長時間の移動でも快適に過ごすことができる。識別情報
とフィルタが紐付いているため、乗り継ぎ後の航空機でも、搭乗者は、同様に頭外定位受
聴を楽しむことができる。さらに、フライト到着後において、生成されたフィルタを識別
情報に対応付けて、サーバ端末６００に保存しておくことも可能である。このようにする
ことで、次回以降のフライト時において、搭乗者に対する測定の一部又は全部を省略する
ことができる。さらに、残存するマイルの利用を促進することができる。 Therefore, the passenger can relax and listen to the reproduced signal while sitting on the passenger seat 521, so that the passenger can spend a long time comfortably. Since the identification information and the filter are linked, the passenger can enjoy out-of-head stereophonic listening in the same way even after connecting flights. Furthermore, after flight arrival, the generated filter can be associated with the identification information and stored in the server terminal 600 . By doing so, it is possible to omit part or all of the measurement of the passengers in the next and subsequent flights. In addition, the use of remaining miles can be encouraged.

さらに、航空機に搭載されているエンターテインメントシステムにおいて、頭外定位処
理を行なうことができる。よって、搭乗者は、音楽の再生信号だけでなく、映画やゲーム
などの再生信号を頭外定位受聴することができる。 Furthermore, out-of-head localization processing can be performed in the entertainment system on board the aircraft. Therefore, the passenger can hear not only the reproduction signal of music but also the reproduction signal of movies and games out of head.

以上まとめると、本実施の形態１にかかるシステム１０００は、測定装置２００と、座
席端末５１１と、サーバ端末６００とを備えている。測定装置２００は、ユーザＵが乗り
物に搭乗する前に、ユーザの耳に装着されたマイクを用いて、伝達特性を測定する。座席
端末５１１は、乗り物に設置され、伝達特性に応じたフィルタを用いて頭外定位処理を行
う頭外定位処理装置である。サーバ端末は、ユーザの識別情報に基づいて、伝達特性に応
じたフィルタを、座席端末５１１に送信する。これにより、搭乗者（ユーザＵ）に対して
適切に頭外定位処理を行うことができる。 In summary, the system 1000 according to the first embodiment includes the measuring device 200, the seat terminal 511, and the server terminal 600. The measuring device 200 measures transfer characteristics using a microphone attached to the user's ear before the user U gets on the vehicle. The seat terminal 511 is an out-of-head localization processing device that is installed in a vehicle and performs out-of-head localization processing using a filter according to transfer characteristics. The server terminal transmits a filter according to the transfer characteristics to the seat terminal 511 based on the user's identification information. As a result, the out-of-head localization process can be appropriately performed for the passenger (user U).

なお、上記の説明では、航空機の搭乗者が頭外定位受聴を行う例について説明したが、
搭乗者が搭乗する乗り物は、航空機に限られるものではない。先に述べた電車、バス、船
舶等の搭乗者に対して、ヘッドホン４３が頭外定位処理された再生信号を出力してもよい
。電車やバス、船舶などの場合は、駅やバスターミナル、港湾の待合室などに、測定装置
２００を設置しておけばよい。さらには、乗り物は、アミューズメントパークのアトラク
ション等であってもよい。この場合、頭外定位処理装置１００は、実際に移動する乗り物
に限らず、その場に滞在したままの乗り物に搭載されていてもよい。 In the above description, an example of out-of-head localization listening performed by an aircraft passenger was described.
Vehicles boarded by passengers are not limited to aircraft. The headphone 43 may output a reproduced signal subjected to out-of-head localization processing to passengers on trains, buses, ships, etc., as described above. In the case of trains, buses, ships, etc., the measuring device 200 may be installed in stations, bus terminals, port waiting rooms, and the like. Furthermore, the ride may be an amusement park attraction or the like. In this case, the out-of-head localization processing device 100 may be installed in a vehicle that is not limited to a vehicle that actually moves, but a vehicle that stays in place.

頭外定位処理装置となる端末が搭乗者の識別情報と対応付けられていればよい。そして
、それぞれの搭乗者が搭乗席において、ヘッドホンやイヤホンを装着する。そして、再生
装置が搭乗席毎に設置されており、再生装置が搭乗者に適したフィルタを用いて、頭外定
位処理を行う。もちろん、一部の搭乗者については、頭外定位処理を行わなくてもよい。
さらには、複数の搭乗者に対して共通の再生信号を用いてもよい。この場合、再生信号を
再生する装置は共通となっており、頭外定位フィルタを用いた処理が、搭乗者毎に実施さ
れていればよい。また、ヘッドホン又はイヤホンは、測定環境に設置されたものに限らず
、ユーザＵが持参したものでもよい。
実施の形態２．
（空間音響フィルタの生成）
また、実施の形態１では、測定装置２００が、空間音響伝達特性と、外耳道伝達特性の
両方を測定するものとして説明したが、一部の測定を行うことができない場合がある。測
定環境の制限などから、頭外定位受聴を行うユーザＵに対して、空間音響伝達特性Ｈｌｓ
、Ｈｌｏ、Ｈｒｏ、Ｈｒｓと、左右の外耳道伝達特性ＥＣＴＦＬ、ＥＣＴＦＲの全てを測
定できないことがある。特に、空間音響伝達特性の測定では、ユーザＵから離れた位置に
１台又は複数台のスピーカを設置する必要がある。よって、空港等に広くて静かな測定環
境を用意できない場合がある。 It is sufficient that the terminal serving as the out-of-head localization processing device is associated with the identification information of the passenger. Then, each passenger wears headphones or earphones in the passenger seat. A playback device is installed for each passenger seat, and the playback device performs out-of-head localization processing using a filter suitable for the passenger. Of course, it is not necessary to perform out-of-head localization processing for some passengers.
Furthermore, a common reproduced signal may be used for a plurality of passengers. In this case, the apparatus for reproducing the reproduced signal is common, and the processing using the out-of-head localization filter may be performed for each passenger. Moreover, the headphones or earphones are not limited to those installed in the measurement environment, and may be those brought by the user U.
Embodiment 2.
(Generation of spatial acoustic filter)
Also, in the first embodiment, the measurement apparatus 200 measures both the spatial sound transfer characteristic and the ear canal transfer characteristic, but there are cases where part of the measurement cannot be performed. Due to limitations in the measurement environment, etc., the spatial sound transfer characteristic Hls
, Hlo, Hro, Hrs, and the left and right ear canal transfer characteristics ECTFL, ECTFR may not all be measured. In particular, in the measurement of spatial sound transfer characteristics, it is necessary to install one or more speakers at positions distant from the user U. Therefore, it may not be possible to provide a wide and quiet measurement environment at an airport or the like.

一方、外耳道伝達特性は、ヘッドホンを装着した状態で測定される。このため、外耳道
伝達特性の測定では、空間音響伝達特性の測定ほど、広くて静かな測定環境は要求されな
い。よって、搭乗者に対して、外耳道伝達特性の測定のみを行うことで、頭外定位フィル
タの全てを取得できることが好ましい。つまり、スピーカを用いた空間音響伝達特性の測
定を行わずに、搭乗者に適した空間音響フィルタを生成することが望まれる。以下、外耳
道伝達特性の測定結果から、搭乗者に適した空間音響フィルタを生成する方法について、
説明する。 On the other hand, the ear canal transfer characteristics are measured while wearing headphones. For this reason, measurement of ear canal transfer characteristics does not require a large and quiet environment as much as measurement of spatial sound transfer characteristics. Therefore, it is preferable that all of the out-of-head localization filters can be obtained by only measuring the ear canal transfer characteristics of the passenger. In other words, it is desired to generate a spatial acoustic filter suitable for a passenger without measuring spatial acoustic transfer characteristics using a speaker. The method for generating a spatial acoustic filter suitable for passengers from the measurement results of the ear canal transfer characteristics is described below.
explain.

外耳道伝達特性ＥＣＴＦＬ、ＥＣＴＦＲの測定結果に基づいて、空間音響フィルタを生
成するフィルタ生成装置について、図８を用いて説明する。図８は、フィルタ生成装置９
００の構成を示すブロック図である。なお、フィルタ生成装置９００は、処理装置２０１
と同一の装置であってもよく、異なる装置であってもよい。さらには、フィルタ生成装置
９００は、物理的に単一な装置に限られるものではない。例えば、フィルタ生成装置９０
０と、処理装置２０１とが異なる装置の場合、後述する処理の一部が処理装置２０１にお
いて実施されていてもよい。また、データベース９０１は、異なる装置に格納されていて
もよく、複数の装置に分散して格納されていてもよい。 A filter generation device that generates a spatial acoustic filter based on the measurement results of the ear canal transfer characteristics ECTFL and ECTFR will be described with reference to FIG. FIG. 8 shows the filter generation device 9
00 is a block diagram showing the configuration of FIG. It should be noted that the filter generation device 900 is the processing device 201
It may be the same device as , or it may be a different device. Furthermore, filter generation device 900 is not limited to a physically single device. For example, the filter generator 90
0 and the processing device 201 are different devices, the processing device 201 may perform part of the processing described later. Also, the database 901 may be stored in different devices, or may be distributed and stored in a plurality of devices.

フィルタ生成装置９００は、データベース９０１と、第１の選択部９０２と、第１の取
得部９０３と、第２の選択部９０４と、第２の取得部９０５と、第３の取得部９０６と、
第１の調整部９０７と、第１の合成部９０８と、第２の調整部９１１と、第２の合成部９
１２と、生成部９２０と、外耳道伝達特性取得部９３０と、を備えている。 The filter generation device 900 includes a database 901, a first selection unit 902, a first acquisition unit 903, a second selection unit 904, a second acquisition unit 905, a third acquisition unit 906,
First adjuster 907 , first combiner 908 , second adjuster 911 , and second combiner 9
12 , a generation unit 920 , and an ear canal transfer characteristic acquisition unit 930 .

外耳道伝達特性取得部９３０は、ユーザＵの外耳道伝達特性の測定結果を取得する。な
お。フィルタ生成装置９００が処理装置２０１と別の装置とする場合、外耳道伝達特性取
得部９３０は、有線通信又は無線通信により、ユーザＵの外耳道伝達特性が送信されてい
る。 The ear canal transfer characteristic acquisition unit 930 acquires the measurement result of the user's U ear canal transfer characteristic. note that. When the filter generation device 900 is a device different from the processing device 201, the ear canal transfer characteristics of the user U are transmitted to the ear canal transfer characteristics acquisition unit 930 by wired communication or wireless communication.

データベース９０１は、複数人分の特性データを格納している。つまり、複数の被測定
者１に対して、予め、空間音響伝達特性及び外耳道伝達特性の測定が行われている。そし
て、データベース９０１は、複数人に対する測定結果に基づく空間音響伝達特性及び外耳
道伝達特性のデータを特性データとして格納している。具体的には、データベース９０１
は、第１の特性データと、第２の特性データと、を１セットとして、複数セット分のデー
タを格納している。例えば、Ｎ人（Ｎは２以上の整数）の被測定者１に対して、外耳道伝
達特性の測定が事前に行われている。よって、データベース９０１は、左耳に関してＮセ
ット分の特性データを格納し、右耳に関してＮセット分の特性データを格納する。 A database 901 stores characteristic data for a plurality of persons. That is, the spatial sound transfer characteristics and the ear canal transfer characteristics are measured in advance for a plurality of persons 1 to be measured. The database 901 stores data of spatial sound transfer characteristics and ear canal transfer characteristics based on measurement results for a plurality of people as characteristic data. Specifically, the database 901
stores a plurality of sets of data, with first characteristic data and second characteristic data as one set. For example, the ear canal transfer characteristics are measured in advance for N subjects 1 (N is an integer equal to or greater than 2). Therefore, the database 901 stores N sets of characteristic data for the left ear and N sets of characteristic data for the right ear.

第１の特性データは、音源となるスピーカからマイクまでの空間音響伝達特性に対応す
るデータである。第１の特性データは、例えば、空間音響伝達特性の周波数特性である。
具体的には、第１の特性データは、周波数領域の振幅特性を備えている。もちろん、第１
の特性データは、振幅特性の代わりにパワー特性を備えていてもよい。また、第１の特性
データは、空間音響伝達特性の直接音部分の周波数振幅特性を有していることが好ましい
。第１の特性データは時間領域の信号を有していてもよい。例えば、時間領域の信号は測
定装置２００のマイク２Ｌ、２Ｒで収音された収音信号である。あるいは、時間領域の信
号は、マイク２Ｌ、２Ｒで収音された収音信号を所定のフィルタ長で切りだした信号であ
ってもよい。 The first characteristic data is data corresponding to spatial sound transfer characteristics from a speaker, which is a sound source, to a microphone. The first characteristic data is, for example, frequency characteristics of spatial sound transfer characteristics.
Specifically, the first characteristic data includes amplitude characteristics in the frequency domain. Of course, the first
The characteristic data of may include power characteristics instead of amplitude characteristics. Moreover, it is preferable that the first characteristic data have the frequency amplitude characteristic of the direct sound portion of the spatial sound transfer characteristic. The first characteristic data may comprise time domain signals. For example, the time domain signals are sound signals picked up by the microphones 2L and 2R of the measuring device 200. FIG. Alternatively, the time-domain signal may be a signal obtained by extracting the sound signals picked up by the microphones 2L and 2R with a predetermined filter length.

第２の特性データは、外耳道伝達特性に対応するデータである。第２の特性データは、
例えば、外耳道伝達特性の周波数特性である。具体的には、第２の特性データは、周波数
領域の振幅特性を備えている。もちろん、第２の特性データは、振幅特性の代わりにパワ
ー特性を備えていてもよい。さらには、第２の特性データは、周波数領域の位相特性を有
していてもよい。第２の特性データは時間領域の信号を有していてもよい。例えば、時間
領域の信号は測定装置２００のマイク２Ｌ、２Ｒで収音された収音信号である。あるいは
、時間領域の信号は、マイク２Ｌ、２Ｒで収音された収音信号を所定のフィルタ長で切り
だした信号であってもよい。 The second characteristic data is data corresponding to the ear canal transfer characteristic. The second characteristic data are
For example, it is the frequency characteristic of the ear canal transfer characteristic. Specifically, the second characteristic data includes amplitude characteristics in the frequency domain. Of course, the second characteristic data may have power characteristics instead of amplitude characteristics. Furthermore, the second characteristic data may have phase characteristics in the frequency domain. The second characteristic data may comprise time domain signals. For example, the time domain signals are sound signals picked up by the microphones 2L and 2R of the measuring device 200 . Alternatively, the time-domain signal may be a signal obtained by extracting the sound signals picked up by the microphones 2L and 2R with a predetermined filter length.

処理装置２０１又はフィルタ生成装置９００等が、時間領域の収音信号に対して、離散
フーリエ変換や離散コサイン変換などを施すことで、周波数振幅特性等が求められる。ま
た、収音信号を所定のフィルタ長で切り出すことで得られたフィルタに対して、離散フー
リエ変換や離散コサイン変換などを施すことで、周波数振幅特性を求めてもよい。あるい
は、データベース９０１は、第１の特性データ、及び第２の特性データとして、時間領域
の収音信号やフィルタを記憶しており、フィルタ生成処理を行う毎に高速フーリエ変換（
ＦＦＴ）等を行うことで、周波数振幅特性を求めてもよい。 The processing device 201, the filter generation device 900, or the like applies discrete Fourier transform, discrete cosine transform, or the like to the collected sound signal in the time domain, thereby obtaining the frequency amplitude characteristics and the like. Further, the frequency-amplitude characteristic may be obtained by performing discrete Fourier transform, discrete cosine transform, or the like on a filter obtained by cutting out the picked-up sound signal with a predetermined filter length. Alternatively, the database 901 stores time-domain picked-up signals and filters as the first characteristic data and the second characteristic data, and fast Fourier transform (
FFT) or the like may be performed to obtain the frequency-amplitude characteristic.

データベース９０１に格納された第１及び第２の特性データについて、図９を用いて説
明する。１人目（１セット目）の被測定者１について、空間音響伝達特性Ｈｌｓ、Ｈｌｏ
、Ｈｒｏ、Ｈｒｓに関するデータをそれぞれ第１の特性データＨｌｓ＿ＤＢ１、Ｈｌｏ＿
ＤＢ１、Ｈｒｏ＿ＤＢ１、Ｈｒｓ＿ＤＢ１とする。Ｎ人目等についても、それぞれ第１の
特性データＨｌｓ＿ＤＢＮ、Ｈｌｏ＿ＤＢＮ、Ｈｒｏ＿ＤＢＮ、Ｈｒｓ＿ＤＢＮ等と称す
る。データベース９０１に格納されたＮ人分の第１の特性データＨｌｓ＿ＤＢ１～Ｈｌｓ
＿ＤＢＮをまとめて、第１の特性データＨｌｓ＿ＤＢと称する。同様に、第１の特性デー
タＨｌｏ＿ＤＢ１～Ｈｌｏ＿ＤＢＮ、Ｈｒｏ＿ＤＢ１～Ｈｒｏ＿ＤＢＮ、Ｈｒｓ＿ＤＢ１
～Ｈｒｓ＿ＤＢＮについても、同様に、Ｎ人分のデータをまとめて、第１の特性データＨ
ｌｏ＿ＤＢ、Ｈｒｏ＿ＤＢ、Ｈｒｓ＿ＤＢと称する。 The first and second characteristic data stored in database 901 will be described with reference to FIG. Spatial sound transfer characteristics Hls, Hlo for the first person (first set) subject 1
, Hro and Hrs as first characteristic data Hls_DB1 and Hlo_
Let DB1, Hro_DB1, and Hrs_DB1. The N-th person is also referred to as the first characteristic data Hls_DBN, Hlo_DBN, Hro_DBN, Hrs_DBN, etc., respectively. First characteristic data Hls_DB1 to Hls for N persons stored in database 901
_DBN are collectively referred to as first characteristic data Hls_DB. Similarly, first characteristic data Hlo_DB1 to Hlo_DBN, Hro_DB1 to Hro_DBN, Hrs_DB1
~Hrs_DBN, similarly, the data for N persons are put together to obtain the first characteristic data H
They are referred to as lo_DB, Hro_DB, and Hrs_DB.

１人目の被測定者１について、外耳道伝達特性ＥＣＴＦＬ、ＥＣＴＦＲに関するデータ
をそれぞれ第２の特性データＥＣＴＦＬ＿ＤＢ１、ＥＣＴＦＲ＿ＤＢ１とする。Ｎ人目等
の被測定者１についても、外耳道伝達特性ＥＣＴＦＬ、ＥＣＴＦＲに関するデータを第２
の特性データＥＣＴＦＬ＿ＤＢＮ、ＥＣＴＦＲ＿ＤＢＮ等と称する。また、データベース
９０１に格納されたＮ人分の第２の特性データＥＣＴＦＬ＿ＤＢ１～ＥＣＴＦＬ＿ＤＢＮ
をまとめて、第２の特性データＥＣＴＦＬ＿ＤＢと称する。Ｎ人分の第２の特性データＥ
ＣＴＦＲ＿ＤＢ１～ＥＣＴＦＲ＿ＤＢＮをまとめて、第２の特性データＥＣＴＦＲ＿ＤＢ
と称する。 For the first person to be measured 1, the data on the ear canal transfer characteristics ECTFL and ECTFR are assumed to be second characteristic data ECTFL_DB1 and ECTFR_DB1, respectively. Data on the ear canal transfer characteristics ECTFL and ECTFR for the person 1 to be measured such as the N-th person are also sent to the second
characteristic data ECTFL_DBN, ECTFR_DBN, and the like. In addition, the second characteristic data ECTFL_DB1 to ECTFL_DBN for N persons stored in the database 901
are collectively referred to as second characteristic data ECTFL_DB. Second characteristic data E for N people
CTFR_DB1 to ECTFR_DBN are put together to form second characteristic data ECTFR_DB
called.

データベース９０１は、１人目の被測定者１の左耳に関する第１の特性データＨｌｓ＿
ＤＢ１、Ｈｒｏ＿ＤＢ１と、第２の特性データＥＣＴＦＬ＿ＤＢ１を１セットにして記憶
する。同様に、データベース９０１は、Ｎ人目の被測定者１の左耳に関する第１の特性デ
ータＨｌｓ＿ＤＢＮ、Ｈｒｏ＿ＤＢＮと、第２の特性データＥＣＴＦＬ＿ＤＢＮとを、１
セットとして記憶する。また、データベース９０１は、１人目の被測定者１の右耳に関す
る第１の特性データＨｌｏ＿ＤＢ１、Ｈｒｓ＿ＤＢ１と、第２の特性データＥＣＴＦＲ＿
ＤＢ１を１セットにして記憶する。データベース９０１は、Ｎ人目の被測定者１の右耳に
関する第１の特性データＨｌｏ＿ＤＢＮ、Ｈｒｓ＿ＤＢＮと、第２の特性データＥＣＴＦ
Ｒ＿ＤＢＮとを１セットとして記憶する。 The database 901 stores the first characteristic data Hls_
DB1, Hro_DB1, and second characteristic data ECTFL_DB1 are stored as one set. Similarly, the database 901 stores the first characteristic data Hls_DBN, Hro_DBN and the second characteristic data ECTFL_DBN regarding the left ear of the N-th subject 1 as 1
Store as a set. The database 901 also stores first characteristic data Hlo_DB1 and Hrs_DB1 related to the right ear of the first subject 1, and second characteristic data ECTFR_
Store DB1 as one set. The database 901 stores first characteristic data Hlo_DBN, Hrs_DBN and second characteristic data ECTF regarding the right ear of the N-th person 1 to be measured.
and R_DBN are stored as one set.

したがって、１セットは、少なくとも３つの周波数振幅特性を備えている。データベー
ス９０１は、同じ被測定者１であっても異なる耳の特性データは異なるセットとして格納
する。もちろん、スピーカのチャネル数に応じて、１セットに含まれる第１の特性データ
の数が変化する。また、データベース９０１は、第１及び第２の特性データを識別情報に
対応付けて記憶してもよい。 A set therefore comprises at least three frequency-amplitude characteristics. The database 901 stores different ear characteristic data for the same subject 1 as different sets. Of course, the number of first characteristic data included in one set changes according to the number of speaker channels. Also, the database 901 may store the first and second characteristic data in association with the identification information.

さらに、第１の特性データＨｌｓ＿ＤＢ、Ｈｌｏ＿ＤＢ、Ｈｒｏ＿ＤＢ、Ｈｒｓ＿ＤＢ
は、それぞれ２種類の周波数振幅特性を備えていることが好ましい。例えば、第１の特性
データＨｌｓ＿ＤＢ１は、直接音部分の周波数振幅特性と、直接音部分及び反射音部分の
周波数振幅特性を有していることが好ましい。直接音部分及び反射音部分の周波数振幅特
性は、直接音と反射音とを含む時間領域の収音信号をＦＦＴすることで求めることができ
る。直接音部分の周波数振幅特性は、反射音を含まずに直接音のみを含む時間領域の収音
信号をＦＦＴすることで求めることができる。なお、直接音は、音源（スピーカ）から直
接耳（マイク）に到達する音であり、反射音は、音源から壁面などで反射して、耳に到達
する音である。反射音は、直接音の後にマイクに到達する。他の第１の特性データＨｌｏ
＿ＤＢ１、Ｈｒｏ＿ＤＢ１、Ｈｒｓ＿ＤＢ１等についても同様とする。 Further, first characteristic data Hls_DB, Hlo_DB, Hro_DB, Hrs_DB
preferably have two types of frequency-amplitude characteristics. For example, the first characteristic data Hls_DB1 preferably has frequency amplitude characteristics of the direct sound portion and frequency amplitude characteristics of the direct sound portion and the reflected sound portion. The frequency-amplitude characteristics of the direct sound portion and the reflected sound portion can be obtained by performing FFT on the collected sound signal in the time domain including the direct sound and the reflected sound. The frequency-amplitude characteristics of the direct sound portion can be obtained by performing FFT on the collected sound signal in the time domain that contains only the direct sound and does not contain the reflected sound. The direct sound is the sound that reaches the ear (microphone) directly from the sound source (speaker), and the reflected sound is the sound that reaches the ear after being reflected from the sound source by a wall or the like. Reflected sound reaches the microphone after the direct sound. Other first characteristic data Hlo
The same applies to _DB1, Hro_DB1, Hrs_DB1, and the like.

例えば、図１０のように、空間音響伝達特性の測定で、０～４０９５サンプルの収音信
号を収音している場合について説明する。この場合、０～４０９５サンプルの収音信号の
全体をフーリエ変換することで直接音及び反射音の周波数振幅特性が得られる。０～４０
９５サンプルの収音信号から０～Ｘ（Ｘは１以上の整数）サンプルの直接音信号（図１０
の点線部分）を切り出して、切り出した直接音信号をフーリエ変換することで、直接音の
周波数振幅特性が得られる。 For example, as shown in FIG. 10, a case where 0 to 4095 samples of sound signals are collected in the measurement of spatial sound transfer characteristics will be described. In this case, the frequency amplitude characteristics of the direct sound and the reflected sound can be obtained by Fourier transforming the entire collected sound signal of 0 to 4095 samples. 0-40
A direct sound signal of 0 to X (X is an integer of 1 or more) samples from the 95 samples of the collected sound signal (Fig. 10
) is cut out, and the cut-out direct sound signal is Fourier-transformed to obtain the frequency-amplitude characteristics of the direct sound.

このように、データベース９０１において、第１の特性データＨｌｓ＿ＤＢ１は、直接
音の周波数振幅特性と、直接音及び反射音の周波数振幅特性とをそれぞれ含んでいること
が好ましい。同様に、第１の特性データＨｌｏ＿ＤＢ１、Ｈｒｏ＿ＤＢ１、Ｈｒｓ＿ＤＢ
１は、直接音部分の周波数振幅特性と、直接音及び反射音部分の周波数振幅特性とをそれ
ぞれ含んでいることが好ましい。もちろん、２～Ｎ人目についても同様とする。 Thus, in the database 901, the first characteristic data Hls_DB1 preferably includes the frequency-amplitude characteristic of the direct sound and the frequency-amplitude characteristics of the direct sound and the reflected sound. Similarly, the first characteristic data Hlo_DB1, Hro_DB1, Hrs_DB
1 preferably includes the frequency-amplitude characteristics of the direct sound portion and the frequency-amplitude characteristics of the direct and reflected sound portions, respectively. Of course, the same applies to the 2nd to Nth persons.

図２で示した測定装置２００は、頭外定位受聴を行うユーザＵに対して外耳道伝達特性
ＥＣＴＦＬ、ＥＣＴＦＲを測定する。ユーザＵに対する測定結果が外耳道伝達特性取得部
９３０に入力される。以下、搭乗者となるユーザＵに対して測定された外耳道伝達特性Ｅ
ＣＴＦＬ、ＥＣＴＦＲを外耳道伝達特性ＥＣＴＦＬ＿Ｕ、外耳道伝達特性ＥＣＴＦＲ＿Ｕ
とする。 The measuring device 200 shown in FIG. 2 measures the external auditory canal transfer characteristics ECTFL and ECTFR for a user U who performs out-of-head stereotactic listening. A measurement result for the user U is input to the ear canal transfer characteristic acquisition unit 930 . Hereinafter, the ear canal transfer characteristics E measured for the user U who is a passenger
CTFL and ECTFR are defined as ear canal transfer characteristics ECTFL_U and ear canal transfer characteristics ECTFR_U.
and

フィルタ生成装置９００は、外耳道伝達特性ＥＣＴＦＬ＿Ｕに対して、フィルタ生成処
理を行う。これにより、フィルタ生成装置９００は、ユーザＵの左耳に対する空間音響伝
達特性Ｈｌｓ、Ｈｒｏに関するフィルタ（以下、フィルタＦ＿Ｈｌｓ＿Ｕ、Ｆ＿Ｈｒｏ＿
Ｕと称する）をそれぞれ生成する。つまり、フィルタ生成装置９００は、外耳道伝達特性
ＥＣＴＦＬ＿Ｕに基づいて、２つのフィルタＦ＿Ｈｌｓ＿Ｕ及びフィルタＦ＿Ｈｒｏ＿Ｕ
を生成する。このとき、フィルタ生成装置９００は、複数の被測定者１の左耳に関するセ
ットのみを参照してもよく、両耳に関するセットを参照してもよい。 The filter generation device 900 performs filter generation processing on the ear canal transfer characteristic ECTFL_U. As a result, the filter generation device 900 generates filters related to the spatial acoustic transfer characteristics Hls and Hro for the left ear of the user U (hereinafter referred to as filters F_Hls_U and F_Hro_
U) respectively. That is, the filter generation device 900 generates two filters F_Hls_U and F_Hro_U based on the ear canal transfer characteristic ECTFL_U.
to generate At this time, the filter generation device 900 may refer to only the set relating to the left ears of the plurality of subjects 1, or may refer to the set relating to both ears.

同様に、フィルタ生成装置９００は、外耳道伝達特性ＥＣＴＦＲ＿Ｕに対して、フィル
タ生成処理を行う。これにより、フィルタ生成装置９００は、ユーザＵの右耳に対する空
間音響伝達特性Ｈｌｏ、Ｈｒｓに関するフィルタ（以下、フィルタＦ＿Ｈｌｏ＿Ｕ、Ｆ＿
Ｈｒｓ＿Ｕと称する）をそれぞれ生成する。つまり、フィルタ生成装置９００は、外耳道
伝達特性ＥＣＴＦＲ＿Ｕに基づいて、２つのフィルタＦ＿Ｈｌｏ＿Ｕ、Ｆ＿Ｈｒｓ＿Ｕを
生成する。このとき、フィルタ生成装置９００は、複数の被測定者１の右耳に関するセッ
トのみを参照してもよく、両耳に関するセットを参照してもよい。 Similarly, the filter generation device 900 performs filter generation processing on the ear canal transfer characteristic ECTFR_U. As a result, the filter generation device 900 generates filters related to the spatial acoustic transfer characteristics Hlo and Hrs for the right ear of the user U (hereinafter referred to as filters F_Hlo_U, F_
Hrs_U) respectively. That is, the filter generating device 900 generates two filters F_Hlo_U and F_Hrs_U based on the ear canal transfer characteristic ECTFR_U. At this time, the filter generation device 900 may refer to only the set related to the right ear of the plurality of subjects 1, or may refer to the set related to both ears.

なお、フィルタＦ＿Ｈｌｓ＿Ｕ、Ｆ＿Ｈｒｏ＿Ｕの処理と、フィルタＦ＿Ｈｌｏ＿Ｕ、
Ｆ＿Ｈｒｓ＿Ｕの処理は同様である。よって、以下の説明では、外耳道伝達特性ＥＣＴＦ
Ｌ＿Ｕに基づいて、フィルタＦ＿Ｈｌｓ＿Ｕ、Ｆ＿Ｈｒｏ＿Ｕを生成する処理について説
明する。 Note that the processing of filters F_Hls_U and F_Hro_U and the processing of filters F_Hlo_U and
The processing of F_Hrs_U is similar. Therefore, in the following description, the ear canal transfer characteristic ECTF
Processing for generating filters F_Hls_U and F_Hro_U based on L_U will be described.

第１の選択部９０２は、データベース９０１を参照することで、外耳道伝達特性ＥＣＴ
ＦＬ＿Ｕに基づいて、第１のセットを選択する。例えば、第１の選択部９０２は、第１の
周波数帯域（例えば、１ｋＨｚ～４ｋＨｚ）において、外耳道伝達特性ＥＣＴＦＬ＿Ｕの
周波数振幅特性を、第２の特性データＥＣＴＦＬ＿ＤＢと比較する。具体的には、第１の
選択部９０２は、第２の特性データＥＣＴＦＬ＿ＤＢ１～ＥＣＴＦＬ＿ＤＢＮのそれぞれ
について、外耳道伝達特性ＥＣＴＦＬ＿Ｕとの相関値を算出する。第１の選択部９０２は
、第１の周波数帯域における周波数振幅特性の相関値を求める。そして、第１の選択部９
０２は、最も相関値が大きい第２の特性データＥＣＴＦＬ＿ＤＢｋ（ｋは１以上Ｎ以下の
任意の整数）を含むセットを選択する。第１の選択部９０２が選択したセットを第１のセ
ットとする。 The first selection unit 902 refers to the database 901 to select the ear canal transfer characteristics ECT.
Based on FL_U, select the first set. For example, the first selection unit 902 compares the frequency amplitude characteristic of the ear canal transfer characteristic ECTFL_U with the second characteristic data ECTFL_DB in a first frequency band (eg, 1 kHz to 4 kHz). Specifically, the first selection unit 902 calculates a correlation value between each of the second characteristic data ECTFL_DB1 to ECTFL_DBN and the ear canal transfer characteristic ECTFL_U. First selection section 902 obtains the correlation value of the frequency-amplitude characteristic in the first frequency band. And the first selection unit 9
02 selects a set including the second characteristic data ECTFL_DBk (k is an arbitrary integer from 1 to N) having the largest correlation value. The set selected by the first selection unit 902 is referred to as the first set.

第１の取得部９０３は、データベース９０１から、第１のセットに含まれる第１の特性
データＨｌｓ＿ＤＢｋ、Ｈｒｏ＿ＤＢｋを取得する。第１の取得部９０３は、直接音部分
の周波数振幅特性を第１の合成部９０８に出力し、直接音及び反射音部分の周波数振幅特
性を第１の調整部９０７に出力する。 A first acquisition unit 903 acquires first characteristic data Hls_DBk and Hro_DBk included in the first set from the database 901 . First acquisition section 903 outputs the frequency amplitude characteristics of the direct sound portion to first synthesizing section 908 , and outputs the frequency amplitude characteristics of the direct sound and reflected sound portions to first adjustment section 907 .

第２の選択部９０４は、データベース９０１を参照することで、外耳道伝達特性ＥＣＴ
ＦＬ＿Ｕに基づいて、第２のセットを選択する。例えば、第２の選択部９０４は、第２の
周波数帯域（例えば、４ｋＨｚ～１５ｋＨｚ）において、外耳道伝達特性ＥＣＴＦＬ＿Ｕ
の周波数振幅特性を、第２の特性データＥＣＴＦＬ＿ＤＢと比較する。第２の選択部９０
４は、第２の特性ＥＣＴＦＬ＿ＤＢ１～ＥＣＴＦＬ＿ＤＢ１のそれぞれについて、外耳道
伝達特性ＥＣＴＦＬ＿Ｕとの相関値を算出する。第２の選択部９０４は、第２の周波数帯
域における周波数振幅特性の相関値を求める。そして、第２の選択部９０４は、最も相関
値が大きい第２の特性データＥＣＴＦＬ＿ＤＢｍ（ｍは１以上Ｎ以下の整数）を含むセッ
トを選択する。第２の選択部９０４が選択したセットを第２のセットとする。 A second selection unit 904 refers to the database 901 to select the ear canal transfer characteristics ECT.
Select the second set based on FL_U. For example, the second selection unit 904 selects the ear canal transfer characteristic ECTFL_U in the second frequency band (eg, 4 kHz to 15 kHz).
is compared with the second characteristic data ECTFL_DB. Second selection unit 90
4 calculates a correlation value between each of the second characteristics ECTFL_DB1 to ECTFL_DB1 and the ear canal transfer characteristic ECTFL_U. Second selection section 904 obtains the correlation value of the frequency-amplitude characteristic in the second frequency band. Then, second selection section 904 selects a set including second characteristic data ECTFL_DBm (m is an integer equal to or greater than 1 and equal to or less than N) having the largest correlation value. The set selected by the second selection unit 904 is referred to as the second set.

第２の取得部９０５は、データベース９０１から、第２のセットに含まれる第１の特性
データＨｌｓ＿ＤＢｍ、Ｈｒｏ＿ＤＢｍを取得する。第２の取得部９０５は、直接音部分
の周波数振幅特性を第１の合成部９０８に出力し、直接音及び反射音部分の周波数振幅特
性を第１の調整部９０７に出力する。 A second acquisition unit 905 acquires the first characteristic data Hls_DBm and Hro_DBm included in the second set from the database 901 . The second acquisition unit 905 outputs the frequency amplitude characteristics of the direct sound portion to the first synthesizing unit 908 and outputs the frequency amplitude characteristics of the direct sound and reflected sound portions to the first adjustment unit 907 .

第１の調整部９０７は、第１の特性データＨｌｓ＿ＤＢｋ、Ｈｒｏ＿ＤＢｋと、第１の
特性データＨｌｓ＿ＤＢｍ、Ｈｒｏ＿ＤＢｍとの振幅レベルを調整するためのゲイン値を
求める。例えば、調整用周波数帯域（２００Ｈｚ～１ｋＨｚ）において、第１の特性デー
タＨｌｓ＿ＤＢｋと第１の特性データＨｌｓ＿ＤＢｍとの間で振幅のレベルが等しくなる
ようなゲイン値を第１の調整部９０７が求める。そして、一方、または両方の第１の特性
データにゲイン値を乗じることで、第１の特性データの振幅レベルを上下させることがで
きる。具体的には、調整用周波数帯域における離散的な振幅の総和が等しくなるように、
振幅特性にゲイン値（係数）を乗じることで、振幅レベルが調整される。 A first adjustment unit 907 obtains gain values for adjusting the amplitude levels of the first characteristic data Hls_DBk, Hro_DBk and the first characteristic data Hls_DBm, Hro_DBm. For example, in the frequency band for adjustment (200 Hz to 1 kHz), the first adjuster 907 obtains a gain value that makes the amplitude level equal between the first characteristic data Hls_DBk and the first characteristic data Hls_DBm. By multiplying one or both of the first characteristic data by a gain value, the amplitude level of the first characteristic data can be increased or decreased. Specifically, so that the sum of discrete amplitudes in the adjustment frequency band is equal,
The amplitude level is adjusted by multiplying the amplitude characteristic by the gain value (coefficient).

具体的には、調整用周波数帯域において、第１の特性データＨｌｓ＿ＤＢｋの振幅レベ
ルが第１の特性データＨｌｓ＿ＤＢｍの振幅レベルよりも高い場合、第１の調整部９０７
は、第１の特性データＨｌｓ＿ＤＢｋの振幅レベルを下げるためのゲイン値を求める。あ
るいは、第１の特性データＨｌｓ＿ＤＢｋの振幅レベルが第１の特性データＨｌｓ＿ＤＢ
ｍの振幅レベルよりも高い場合、第１の調整部９０７は、第１の特性データＨｌｓ＿ＤＢ
ｍの振幅レベルを上げるためのゲイン値を求める。もちろん、第１の調整部９０７は、両
方の振幅レベルが所定の範囲に含まれるように、２つのゲイン値を算出してもよい。この
場合、第１の特性データＨｌｓ＿ＤＢｍ、第１の特性データＨｌｓ＿ＤＢｋのそれぞれに
ゲイン値が乗じられる。なお、第１の調整部９０７は、第１の特性データＨｌｓ＿ＤＢｋ
、Ｈｒｏ＿ＤＢｋについて、ゲイン値を共通としてもよく、それぞれに異なるゲイン値を
求めてもよい。 Specifically, in the adjustment frequency band, when the amplitude level of the first characteristic data Hls_DBk is higher than the amplitude level of the first characteristic data Hls_DBm, the first adjusting section 907
obtains a gain value for lowering the amplitude level of the first characteristic data Hls_DBk. Alternatively, the amplitude level of the first characteristic data Hls_DBk is the first characteristic data Hls_DB
m amplitude level, the first adjustment unit 907 adjusts the first characteristic data Hls_DB
Find a gain value to increase the amplitude level of m. Of course, the first adjusting section 907 may calculate two gain values so that both amplitude levels are within a predetermined range. In this case, each of the first characteristic data Hls_DBm and the first characteristic data Hls_DBk is multiplied by the gain value. Note that the first adjustment unit 907 uses the first characteristic data Hls_DBk
, Hro_DBk may have a common gain value, or different gain values may be obtained for each.

上記のように、第１の特性データは、複数の被測定者１に対して行われた測定結果によ
るものである。それぞれの測定において、スピーカの音量やマイク感度が同じとなってい
るとは限らない。さらには、測定環境が異なっていたり、異なるスピーカやマイクが使用
されていたりすることもある。したがって、第１の調整部９０７は、異なるセットの第１
の特性データの振幅レベルを調整する。例えば、第１の調整部９０７は２つの周波数振幅
特性のバランスを調整するためのゲイン値を算出する。これにより、異なるセットの第１
の特性データの振幅レベルのバランスを調整することができる。ここでは、直接音及び反
射音の周波数振幅特性において、ゲイン値を算出している。 As described above, the first characteristic data are based on the results of measurements performed on a plurality of subjects 1 . In each measurement, the loudspeaker volume and microphone sensitivity are not always the same. Furthermore, the measurement environment may differ, or different speakers and microphones may be used. Therefore, the first adjuster 907 may select different sets of first
Adjust the amplitude level of the characteristic data. For example, the first adjuster 907 calculates a gain value for adjusting the balance between the two frequency amplitude characteristics. This allows the first
The amplitude level balance of the characteristic data can be adjusted. Here, the gain value is calculated for the frequency amplitude characteristics of the direct sound and the reflected sound.

そして、第１の調整部９０７は、振幅レベルを調整するためのゲイン値を第１の合成部
９０８に出力する。第１の合成部９０８は、第１の特性データにゲイン値を乗じて、レベ
ル調整を行った後、第１の特性データＨｌｓ＿ＤＢｋと、第１の特性データＨｌｓ＿ＤＢ
ｍとを合成する。第１の合成部９０８は、第１の特性データにゲイン値を乗じて、レベル
調整を行った後、第１の特性データＨｒｏ＿ＤＢｋと、第１の特性データＨｒｏ＿ＤＢｍ
とを合成する。 First adjusting section 907 then outputs a gain value for adjusting the amplitude level to first synthesizing section 908 . The first synthesizing unit 908 multiplies the first characteristic data by the gain value to perform level adjustment, and then combines the first characteristic data Hls_DBk and the first characteristic data Hls_DB.
Composite with m. The first synthesizing unit 908 multiplies the first characteristic data by the gain value, adjusts the level, and then combines the first characteristic data Hro_DBk and the first characteristic data Hro_DBm.
Synthesize with

ここでは、第１の合成部９０８は、直接音を合成する。すなわち、合成する第１の特性
データＨｌｓ＿ＤＢｋ、Ｈｒｏ＿ＤＢｋ及び第１の特性データＨｌｓ＿ＤＢｍ、Ｈｒｏ＿
ＤＢｍは、直接音の周波数振幅特性となっている。第１の合成部９０８は、第１の周波数
帯域（１ｋＨｚ～４ｋＨｚ）における第１の特性データＨｌｓ＿ＤＢｍの振幅値を第１の
特性データＨｌｓ＿ＤＢｋの振幅値に置き換える。第１の合成部９０８は、第１の周波数
帯域における第１の特性データＨｒｏ＿ＤＢｍの振幅値を第１の特性データＨｒｏ＿ＤＢ
ｋの振幅値に置き換える。第１の合成部９０８は、置換後の周波数振幅特性を第１の合成
データＨｌｓ＿ｃｏｍ１、Ｈｒｏ＿ｃｏｍ１として、第２の調整部９１１に出力する。 Here, the first synthesizing unit 908 synthesizes the direct sound. That is, the first characteristic data Hls_DBk, Hro_DBk and the first characteristic data Hls_DBm, Hro_
DBm is the frequency amplitude characteristic of the direct sound. The first synthesizing unit 908 replaces the amplitude value of the first characteristic data Hls_DBm in the first frequency band (1 kHz to 4 kHz) with the amplitude value of the first characteristic data Hls_DBk. First synthesizing section 908 converts the amplitude value of first characteristic data Hro_DBm in the first frequency band into first characteristic data Hro_DB
Replace with the amplitude value of k. First synthesizing section 908 outputs the frequency-amplitude characteristic after replacement to second adjusting section 911 as first synthesized data Hls_com1 and Hro_com1.

第１の合成データＨｌｓ＿ｃｏｍ１、Ｈｒｏ＿ｃｏｍ１において、第１の周波数帯域の
振幅値は、第１の特性データＨｌｓ＿ＤＢｋ、Ｈｒｏ＿ＤＢｋに基づくものとなっており
、第１の周波数帯域以外の振幅値は、第２の特性データＨｌｓ＿ＤＢｍ、Ｈｒｏ＿ＤＢｍ
に基づくものとなっている。あるいは、第１の合成データＨｌｓ＿ｃｏｍ１、Ｈｒｏ＿ｃ
ｏｍ１において、第２の周波数帯域の振幅値は、第１の特性データＨｌｓ＿ＤＢｍ、Ｈｒ
ｏ＿ＤＢｍに基づくものとし、第２の周波数帯域以外の振幅値は、第１の特性データＨｌ
ｓ＿ＤＢｋ、Ｈｒｏ＿ＤＢｋに基づくものとするようにしてもよい。第１の合成部９０８
は、第１の特性データＨｌｓ＿ＤＢｍ、Ｈｒｏ＿ＤＢｍと第１の特性データＨｌｓ＿ＤＢ
ｋ、Ｈｒｏ＿ＤＢｋのバランスを取りながら、すなわち、調整用周波数帯域のレベルが所
定の範囲に含まれるように周波数振幅特性を合成してもよい。 In the first synthesized data Hls_com1 and Hro_com1, the amplitude values of the first frequency band are based on the first characteristic data Hls_DBk and Hro_DBk, and the amplitude values of other than the first frequency band are the second Characteristic data Hls_DBm, Hro_DBm
It is based on Alternatively, the first combined data Hls_com1, Hro_c
In om1, the amplitude value of the second frequency band is the first characteristic data Hls_DBm, Hr
o_DBm, and amplitude values other than the second frequency band are based on the first characteristic data Hl
It may be based on s_DBk and Hro_DBk. First combiner 908
are the first characteristic data Hls_DBm, Hro_DBm and the first characteristic data Hls_DB
The frequency-amplitude characteristics may be synthesized while balancing k and Hro_DBk, that is, so that the level of the adjustment frequency band is included in a predetermined range.

第３の取得部９０６は、データベース９０１を参照して、予め設定された第１の特性デ
ータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓ（以下、プリセットデータＨｌｓ＿ＤＢｐｓ、
Ｈｒｏ＿ＤＢｐｓとする）を取得する。プリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿Ｄ
Ｂｐｓは、第１の特性データＨｌｓ＿ＤＢ、Ｈｒｏ＿ＤＢのうちの、代表的な１人分（１
セット分）のデータである。 The third acquisition unit 906 refers to the database 901 and obtains preset first characteristic data Hls_DBps, Hro_DBps (hereinafter referred to as preset data Hls_DBps,
Hro_DBps). Preset data Hls_DBps, Hro_D
Bps is a representative one-person (1
set).

ここで、プリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓは、左右の位相特性や
振幅レベルのバランスが取れている被測定者１の第１の特性データであることが好ましい
。つまり、プリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｌｏ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓ、
Ｈｒｓ＿ＤＢｐｓは同一の被測定者１の第１の特性データとなる。さらに、１ｋＨｚ以下
の周波数振幅特性に大きなディップがない第１の特性データがプリセットデータとして設
定されていることが好ましい。また、１００ｋＨｚの４つの空間音響伝達特性Ｈｌｓ、Ｈ
ｌｏ、Ｈｒｏ、Ｈｒｓにおいて、１００ｋＨｚ以下の周波数振幅特性が揃っている被測定
者１の第１の特性データをプリセットデータとすることが好ましい。プリセットデータは
、システムの管理者等により予め設定されている。 Here, the preset data Hls_DBps and Hro_DBps are preferably the first characteristic data of the subject 1 in which the left and right phase characteristics and amplitude levels are balanced. That is, the preset data Hls_DBps, Hlo_DBps, Hro_DBps,
Hrs_DBps is the first characteristic data of the same subject 1 . Furthermore, it is preferable that the first characteristic data having no large dip in the frequency amplitude characteristic of 1 kHz or less is set as the preset data. In addition, the four spatial acoustic transfer characteristics Hls at 100 kHz, H
In lo, Hro, and Hrs, it is preferable to use the first characteristic data of the person to be measured 1 having uniform frequency amplitude characteristics of 100 kHz or less as the preset data. The preset data is set in advance by a system administrator or the like.

さらに、プリセットデータの候補を複数セット分用意して、外耳道伝達特性ＥＣＴＦＬ
＿Ｕに基づいて、１セット分のプリセットデータを選択してもよい。この場合、プリセッ
トデータの候補となる複数セットの中で、外耳道伝達特性ＥＣＴＦＬ＿Ｕとの相関値が最
も高くなる第２の特性データを有する１セットを選択してもよい。 Furthermore, multiple sets of preset data candidates are prepared, and the ear canal transfer characteristic ECTFL
One set of preset data may be selected based on _U. In this case, one set having the second characteristic data having the highest correlation value with the ear canal transfer characteristic ECTFL_U may be selected from among the multiple sets as preset data candidates.

第３の取得部９０６は、プリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓを取得
して、第２の合成部９１２、及び第２の調整部９１１に出力する。プリセットデータＨｌ
ｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓは直接音部分の周波数振幅特性であるが、直接音及び反
射音部分の周波数振幅特性となっていてもよい。 The third acquisition unit 906 acquires the preset data Hls_DBps and Hro_DBps and outputs them to the second synthesis unit 912 and the second adjustment unit 911 . Preset data Hl
s_DBps and Hro_DBps are frequency amplitude characteristics of the direct sound portion, but may be frequency amplitude characteristics of the direct sound and reflected sound portions.

第１の合成データＨｌｓ＿ｃｏｍ１、Ｈｒｏ＿ｃｏｍ１と第１のプリセットデータＨｌ
ｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓの振幅レベルを調整するためのゲイン値を算出する。第
２の調整部９１１は、第１の調整部９０７と同様に、調整用周波数帯域（２００Ｈｚ～１
ｋＨｚ）において、第１の合成データＨｌｓ＿ｃｏｍ１とプリセットデータＨｌｓ＿ＤＢ
ｐｓとの間で振幅のレベルが等しくなるようなゲイン値を第２の調整部９１１が求める。
そして、ゲイン値が第１の合成データ及び第１のプリセットデータの少なくとも一方に乗
じられることで、振幅レベルが調整される。第２の調整部９１１は、第１の調整部９０７
と同様の処理を行うため、説明を省略する。 First combined data Hls_com1, Hro_com1 and first preset data Hl
Gain values for adjusting the amplitude levels of s_DBps and Hro_DBps are calculated. Similar to the first adjustment unit 907, the second adjustment unit 911 adjusts the adjustment frequency band (200 Hz to 1
kHz), the first combined data Hls_com1 and the preset data Hls_DB
The second adjuster 911 obtains a gain value that makes the amplitude level equal to ps.
Then, the amplitude level is adjusted by multiplying at least one of the first synthesized data and the first preset data by the gain value. The second adjuster 911 adjusts the first adjuster 907
Since the same processing as is performed, the description is omitted.

第２の調整部９１１は、ゲイン値を第２の合成部９１２に出力する。第２の合成部９１
２は、ゲイン値を用いてレベル調整を行った後、第１の合成データＨｌｓ＿ｃｏｍ１と、
プリセットデータＨｌｓ＿ＤＢｐｓとを合成する。第２の合成部９１２は、ゲイン値を用
いてレベル調整を行った後、第１の合成データＨｒｏ＿ｃｏｍ１と、プリセットデータＨ
ｒｏ＿ＤＢｐｓとを合成する。これにより、第２の合成データＨｒｏ＿ｃｏｍ２、Ｈｒｏ
＿ｃｏｍ２が生成される。 Second adjusting section 911 outputs the gain value to second synthesizing section 912 . Second synthesizing unit 91
2 is the first combined data Hls_com1 after performing level adjustment using the gain value;
Synthesize with preset data Hls_DBps. After performing level adjustment using the gain value, the second combining unit 912 combines the first combined data Hro_com1 and the preset data H
ro_DBps. As a result, the second combined data Hro_com2, Hro
_com2 is generated.

第２の合成部９１２は、直接音を合成する。すなわち、第１の合成データＨｌｓ＿ｃｏ
ｍ１、Ｈｒｏ＿ｃｏｍ１及びプリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓは、
直接音の周波数振幅特性となっている。第２の合成部９１２は、第３の周波数帯域（最低
周波数～１ｋＨｚ）における第１の合成データＨｌｓ＿ｃｏｍ１の振幅値をプリセットデ
ータＨｌｓ＿ＤＢｐｓの振幅値に置き換える。第２の合成部９１２は、第３の周波数帯域
（最低周波数～１ｋＨｚ）における第１の合成データＨｒｏ＿ｃｏｍ１の振幅値をプリセ
ットデータＨｒｏ＿ＤＢｐｓの振幅値に置き換える。なお、最低周波数はＦＦＴで得られ
る周波数特性における最も低い周波数であり、例えば、１Ｈｚとなる。 A second synthesizing unit 912 synthesizes the direct sound. That is, the first combined data Hls_co
m1, Hro_com1 and preset data Hls_DBps, Hro_DBps are
It is the frequency amplitude characteristic of the direct sound. The second synthesizing unit 912 replaces the amplitude value of the first synthesized data Hls_com1 in the third frequency band (lowest frequency to 1 kHz) with the amplitude value of the preset data Hls_DBps. The second combining unit 912 replaces the amplitude value of the first combined data Hro_com1 in the third frequency band (lowest frequency to 1 kHz) with the amplitude value of the preset data Hro_DBps. The lowest frequency is the lowest frequency in frequency characteristics obtained by FFT, and is 1 Hz, for example.

第２の合成データＨｌｓ＿ｃｏｍ２、Ｈｒｏ＿ｃｏｍ２において、第３の周波数帯域（
最低周波数～１ｋＨｚ）の振幅値は、プリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢ
ｐｓに基づくものとなっている。第２の合成データＨｌｓ＿ｃｏｍ２、Ｈｒｏ＿ｃｏｍ２
において、第１の周波数帯域（１ｋＨｚ～４ｋＨｚ）の振幅値は、第１の特性データＨｌ
ｓ＿ＤＢｋ、Ｈｒｏ＿ＤＢｋに基づくものとなっている。第２の合成データＨｌｓ＿ｃｏ
ｍ２、Ｈｒｏ＿ｃｏｍ２において、第２の周波数帯域（４ｋＨｚ～１５ｋＨｚ）の振幅値
は、第２の特性データＨｌｓ＿ＤＢｍ、Ｈｒｏ＿ＤＢｍに基づくものとなっている。なお
、第１の周波数帯域、第２の周波数帯域、及び第３の周波数帯域以外の第４の周波数帯域
（１５ｋＨｚ～最高周波数）の振幅値は、第２の特性データＨｌｓ＿ＤＢｍ、Ｈｒｏ＿Ｄ
Ｂｍに基づくものとすることができるが、プリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿
ＤＢｐｓに基づくものとしてもよい。最高周波数はＦＦＴで得られる周波数特性における
最も高い周波数である。ここで、ＦＦＴのフレームサイズをｆｒａｍｅ＿ｓｉｚｅとする
と、最も高い周波数は、(ＦＳ／ｆｒａｍｅ＿ｓｉｚｅ）*(ｆｒａｍｅ＿ｓｉｚｅ／２－
１)で求めることができる。また、各周波数帯域における振幅レベルは第１の調整部９０
７、又は第２の調整部９１１で求めたゲイン値で調整されている。よって、適切に周波数
振幅特性を合成することができる。 In the second synthesized data Hls_com2, Hro_com2, the third frequency band (
(minimum frequency to 1 kHz) are preset data Hls_DBps, Hro_DB
It is based on ps. Second synthetic data Hls_com2, Hro_com2
, the amplitude value of the first frequency band (1 kHz to 4 kHz) is the first characteristic data Hl
It is based on s_DBk and Hro_DBk. Second synthetic data Hls_co
In m2 and Hro_com2, the amplitude values of the second frequency band (4 kHz to 15 kHz) are based on the second characteristic data Hls_DBm and Hro_DBm. The amplitude value of the fourth frequency band (15 kHz to highest frequency) other than the first frequency band, the second frequency band, and the third frequency band is the second characteristic data Hls_DBm, Hro_D
Bm, but preset data Hls_DBps, Hro_
It may be based on DBps. The highest frequency is the highest frequency in frequency characteristics obtained by FFT. Here, if the FFT frame size is frame_size, the highest frequency is (FS/frame_size)*(frame_size/2-
1). Also, the amplitude level in each frequency band is adjusted by the first adjusting section 90
7 or the gain value obtained by the second adjustment unit 911 . Therefore, it is possible to appropriately synthesize frequency-amplitude characteristics.

第２の合成部９１２は、第２の合成データＨｌｓ＿ｃｏｍ２、Ｈｒｏ＿ｃｏｍ２は生成
部９２０に出力する。生成部９２０は、第２の合成データＨｌｓ＿ｃｏｍ２、Ｈｒｏ＿ｃ
ｏｍ２に基づいて、フィルタＦ＿Ｈｌｓ＿Ｕ、Ｆ＿Ｈｒｏ＿Ｕを生成する。例えば、生成
部９２０は、第２の合成データＨｌｓ＿ｃｏｍ２、Ｈｒｏ＿ｃｏｍ２を、それぞれ逆フー
リエ変換等することで、時間領域における第２の合成データＨｌｓ＿ｃｏｍ２＿Ｔｉｍｅ
、Ｈｒｏ＿ｃｏｍ２＿Ｔｉｍｅを算出する。なお、逆フーリエ変換において用いられる位
相特性は、第２のセットのものとすることができるが、プリセットデータのものであって
もよい。 The second synthesizer 912 outputs the second synthesized data Hls_com2 and Hro_com2 to the generator 920 . The generation unit 920 generates the second combined data Hls_com2, Hro_c
Based on om2, generate filters F_Hls_U, F_Hro_U. For example, the generating unit 920 performs an inverse Fourier transform on the second combined data Hls_com2 and Hro_com2, respectively, to obtain the second combined data Hls_com2_Time in the time domain.
, Hro_com2_Time. It should be noted that the phase characteristics used in the inverse Fourier transform may be those of the second set, but may also be those of the preset data.

これにより、生成部９２０は、フィルタＦ＿Ｈｌｓ＿Ｕ、Ｆ＿Ｈｒｏ＿Ｕの直接音部分
を求めることができる。そして、生成部９２０は、第２の合成データＨｌｓ＿ｃｏｍ２＿
ＴｉｍｅにプリセットデータＨｌｓ＿ＤＢｐｓを合成する。また、生成部９２０は、第２
の合成データＨｒｏ＿ｃｏｍ２＿Ｔｉｍｅに、プリセットデータＨｒｏ＿ＤＢｐｓの反射
音部分を合成する。例えば、フィルタＦ＿Ｈｌｓ＿Ｕ、Ｆ＿Ｈｒｏ＿Ｕの直接音部分（つ
まり、０～Ｘサンプル）は、第２の合成データＨｌｓ＿ｃｏｍ２＿Ｔｉｍｅ、Ｈｒｏ＿ｃ
ｏｍ２＿Ｔｉｍｅとなっている。反射音部分（（Ｘ＋１）～４０９５サンプル）は、時間
領域のプリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓの反射音部分（（Ｘ＋１）
～４０９５サンプル）を切り出したものとなっている。なお、フィルタＦ＿Ｈｌｓ＿Ｕ、
Ｆ＿Ｈｒｏ＿Ｕの反射音部分は、プリセットデータＨｌｓ＿ＤＢｐｓ、Ｈｒｏ＿ＤＢｐｓ
のものとするが、第２のセットのものとなっていてもよい。なお、プリセットデータにつ
いては、データベース９０１が、予め時間領域の第１の特性データを格納しておくことが
好ましい。 Thereby, the generator 920 can obtain the direct sound portion of the filters F_Hls_U and F_Hro_U. Then, the generation unit 920 generates the second synthesized data Hls_com2_
Preset data Hls_DBps is combined with Time. In addition, the generating unit 920 generates the second
The reflected sound portion of the preset data Hro_DBps is synthesized with the synthesized data Hro_com2_Time of the above. For example, the direct sound portion (i.e., 0-X samples) of filters F_Hls_U, F_Hro_U is the second synthesized data Hls_com2_Time, Hro_c
om2_Time. The reflected sound portion ((X+1) to 4095 samples) is the reflected sound portion ((X+1)
~ 4095 samples). Note that the filters F_Hls_U,
The reflected sound portion of F_Hro_U is the preset data Hls_DBps, Hro_DBps
but may be of the second set. As for the preset data, it is preferable that the database 901 stores the first characteristic data in the time domain in advance.

上記の処理により、フィルタ生成装置９００が、フィルタＦ＿Ｈｌｓ＿Ｕ、Ｆ＿Ｈｒｏ
＿Ｕを生成することができる。また、同様の処理によって、フィルタ生成装置９００は、
フィルタＦ＿Ｈｌｏ＿Ｕ、Ｆ＿Ｈｒｓ＿Ｕを生成する。これにより、ユーザＵが外耳道伝
達特性のみしか測定できない場合であっても、ユーザＵに適したフィルタＦ＿Ｈｌｓ＿Ｕ
、Ｆ＿Ｈｌｏ＿Ｕ、Ｆ＿Ｈｒｏ＿Ｕ、Ｆ＿Ｈｒｓ＿Ｕを生成することができる。 Through the above processing, the filter generation device 900 generates the filters F_Hls_U, F_Hro
_U can be generated. Further, through similar processing, the filter generation device 900
Generate filters F_Hlo_U, F_Hrs_U. As a result, even if the user U can only measure the ear canal transfer characteristic, the filter F_Hls_U suitable for the user U
, F_Hlo_U, F_Hro_U, F_Hrs_U can be generated.

なお、上記説明における第１～第３の周波数帯域の上限周波数及び下限周波数、信号の
サンプル数、最高周波数、最低周波数の値は、例示的な値であり、特に限定されるもので
はない。また、第１の合成部９０８と第２の合成部９１２との処理順は特に限定されるも
のではない。例えば、プリセットデータと第２のセットの第１の特性データとを合成した
後に、第１のセットの第１の特性データを合成してもよい。あるいは、プリセットデータ
と第２のセットの第１の特性データと第１のセットの第１の特性データとをまとめて合成
してもよい。また、周波数領域における振幅値の代わりにパワー値を用いてもよい。 Note that the values of the upper limit frequency and lower limit frequency of the first to third frequency bands, the number of signal samples, the highest frequency, and the lowest frequency in the above description are exemplary values, and are not particularly limited. Also, the processing order of the first synthesizing unit 908 and the second synthesizing unit 912 is not particularly limited. For example, the first set of first characteristic data may be synthesized after the preset data and the second set of first characteristic data are synthesized. Alternatively, the preset data, the second set of first characteristic data, and the first set of first characteristic data may be combined together. Also, power values may be used instead of amplitude values in the frequency domain.

なお、実施の形態２では、頭外定位受聴を行うユーザは、航空機などの乗り物の搭乗者
に限られるものではない。つまり、スマートホンやタブレット端末などのユーザが所有す
るユーザ端末において、頭外定位処理を行う場合に、実施の形態２の処理を適用すること
ができる。従って、識別情報は不要となる。 In the second embodiment, users who perform out-of-head localized listening are not limited to passengers in vehicles such as airplanes. In other words, the process of the second embodiment can be applied to a user terminal such as a smart phone or a tablet terminal owned by the user, when the out-of-head localization process is performed. Therefore, no identification information is required.

以上まとめると、実施の形態２に係るフィルタ生成装置９００は、外耳道伝達特性取得
部９３０と、第１の選択部９０２と、第１の取得部９０３と、第２の選択部９０４と、第
２の取得部９０５と、第３の取得部９０６と、生成部９２０と、を備えている。外耳道伝
達特性取得部９３０は、ユーザが装着したヘッドホン又はイヤホンからマイクまでの外耳
道伝達特性を取得する。第１の選択部９０２は、スピーカからマイクまでの空間音響伝達
特性に対応する第１の特性データと、外耳道伝達特性に対応する第２の特性データとを１
セットとして、複数セット分を格納するデータベース９０１を参照することで、ユーザの
外耳道伝達特性の第１の周波数帯域における周波数特性に基づいて、第１のセットを選択
する。第１の取得部９０３は、第１の選択部で選択された第１のセットに含まれる第２の
特性データを取得する。第２の選択部９０４は、データベース９０１を参照することで、
ユーザの外耳道伝達特性の第２の周波数帯域における周波数特性に基づいて、第２のセッ
トを選択する。第２の取得部９０５は、第２の選択部９０３で選択された第２のセットに
含まれる第２の特性データを取得する。第３の選択部９０６は、予め設定されたプリセッ
トデータを取得する。生成部９２０は、第１のセットの第２の特性データと、第２のセッ
トの第２の特性データと、プリセットデータとに基づいて、ユーザの空間音響伝達特性に
応じたフィルタを生成する。これにより、適切な処理を行うことができるフィルタを生成
することができる。
実施の形態３．
実施の形態３では、５．１ｃｈの再生信号を用いて、頭外定位処理を行っている。５．
１ｃｈの場合、６個のスピーカがある。つまり、測定装置２００の測定環境には、センタ
ースピーカ（正面スピーカ）、右前方スピーカ、左前方スピーカ、右後方スピーカ、左後
方スピーカ、低音サブウーファースピーカが配置されている。従って、図２に示した測定
装置２００に、センタースピーカ、左後方スピーカ、右後方スピーカ、サブウーファース
ピーカが追加されている。センタースピーカは、被測定者１の正面前方に配置される。セ
ンタースピーカは、例えば、左前方スピーカと右前方スピーカとの間に配置される。 In summary, the filter generation device 900 according to Embodiment 2 includes an ear canal transfer characteristic acquisition unit 930, a first selection unit 902, a first acquisition unit 903, a second selection unit 904, a second acquisition unit 905 , a third acquisition unit 906 , and a generation unit 920 . The ear canal transfer characteristic acquisition unit 930 acquires the ear canal transfer characteristic from the headphone or earphone worn by the user to the microphone. A first selection unit 902 selects the first characteristic data corresponding to the spatial sound transfer characteristic from the speaker to the microphone and the second characteristic data corresponding to the ear canal transfer characteristic.
By referring to the database 901 that stores a plurality of sets as sets, the first set is selected based on the frequency characteristics in the first frequency band of the ear canal transfer characteristics of the user. A first acquisition unit 903 acquires second characteristic data included in the first set selected by the first selection unit. The second selection unit 904 refers to the database 901 to
A second set is selected based on frequency characteristics in a second frequency band of the user's ear canal transfer characteristics. A second acquisition unit 905 acquires second characteristic data included in the second set selected by the second selection unit 903 . A third selection unit 906 acquires preset data set in advance. Based on the first set of second characteristic data, the second set of second characteristic data, and the preset data, the generation unit 920 generates a filter according to the spatial sound transfer characteristic of the user. This makes it possible to generate a filter capable of performing appropriate processing.
Embodiment 3.
In the third embodiment, out-of-head localization processing is performed using a 5.1ch reproduced signal. 5.
In the case of 1ch, there are 6 speakers. That is, a center speaker (front speaker), a right front speaker, a left front speaker, a right rear speaker, a left rear speaker, and a bass subwoofer speaker are arranged in the measurement environment of the measurement apparatus 200 . Therefore, a center speaker, a left rear speaker, a right rear speaker, and a subwoofer speaker are added to the measurement apparatus 200 shown in FIG. A center speaker is arranged in front of the person 1 to be measured. The center speaker is arranged, for example, between the left front speaker and the right front speaker.

左前方スピーカから左耳、及び右耳までの空間音響伝達特性を、実施の形態１と同様に
Ｈｌｓ、Ｈｌｏとする。右前方スピーカから左耳、及び右耳までの空間音響伝達特性を、
実施の形態１と同様にＨｒｏ、Ｈｒｓとする。センタースピーカから左耳及び右耳までの
空間音響伝達特性をＣＨｌ、ＣＨｒとする。左後方スピーカから左耳及び右耳までの空間
音響伝達特性をＳＨｌｓ、ＳＨｌｏとする。右後方スピーカから左耳、及び右耳までの空
間音響伝達特性を、ＳＨｒｏ、ＳＨｒｓとする。低音出力用のサブウーファースピーカか
ら左耳及び右耳までの空間音響伝達特性をＳＷＨｌ、ＳＷＨｒとする。 Let Hls and Hlo be the spatial sound transfer characteristics from the left front speaker to the left ear and the right ear, as in the first embodiment. Spatial sound transfer characteristics from the right front speaker to the left and right ears are
Let Hro and Hrs be the same as in the first embodiment. Let CHl and CHr be the spatial sound transfer characteristics from the center speaker to the left and right ears. Let SHls and SHlo be the spatial sound transfer characteristics from the left rear speaker to the left and right ears. Let SHro and SHrs be the spatial sound transfer characteristics from the right rear speaker to the left and right ears. Let SWHl and SWHr be the spatial sound transfer characteristics from the subwoofer speaker for bass output to the left and right ears.

従って、空間音響伝特性Ｈｌｓ、Ｈｌｏ、ＣＨｌ、ＣＨｒ、Ｈｒｏ、Ｈｒｓ、ＳＨｌｓ
、ＳＨｌｏ、ＳＨｒｏ、ＳＨｒｓ、ＳＷＨｌ、ＳＷＨｒに対応する１２個のフィルタを用
いて、畳み込み演算処理が実施される。空間音響伝特性Ｈｌｓ、Ｈｌｏ、ＣＨｌ、ＣＨｒ
、Ｈｒｏ、Ｈｒｓ、ＳＨｌｓ、ＳＨｌｏ、ＳＨｒｏ、ＳＨｒｓ、ＳＷＨｌ、ＳＷＨｒに対
応するフィルタを、Ｆ＿Ｈｌｓ、Ｆ＿Ｈｌｏ、Ｆ＿ＣＨｌ、Ｆ＿ＣＨｒ、Ｆ＿Ｈｒｏ、Ｆ
＿Ｈｒｓ、Ｆ＿ＳＨｌｓ、Ｆ＿ＳＨｌｏ、Ｆ＿ＳＨｒｏ、Ｆ＿ＳＨｒｓ、Ｆ＿ＳＷＨｌ、
Ｆ＿ＳＷＨｒとする。 Therefore, the spatial acoustic transfer characteristics Hls, Hlo, CHl, CHr, Hro, Hrs, SHls
, SHlo, SHro, SHrs, SWHl, and SWHr are used to perform the convolution operation. Spatial Acoustic Transfer Hls, Hlo, CHl, CHr
, Hro, Hrs, SHls, SHlo, SHro, SHrs, SWHl, SWHr corresponding to F_Hls, F_Hlo, F_CHl, F_CHr, F_Hro, F
_Hrs, F_SHls, F_SHlo, F_SHro, F_SHrs, F_SWHl,
F_SWHr.

図１１は、本実施の形態にかかる処理装置７００の構成を示すブロック図である。処理
装置７００は、図２に示す処理装置２０１に対応するものであり、フィルタを生成する。
さらに、処理装置７００は、図１で示したような頭外定位処理を行うものである。ここで
は、処理装置７００は、１２個のフィルタＦ＿Ｈｌｓ、Ｆ＿Ｈｌｏ、Ｆ＿ＣＨｌ、Ｆ＿Ｃ
Ｈｒ、Ｆ＿Ｈｒｏ、Ｆ＿Ｈｒｓ、Ｆ＿ＳＨｌｓ、Ｆ＿ＳＨｌｏ、Ｆ＿ＳＨｒｏ、Ｆ＿ＳＨ
ｒｓ、Ｆ＿ＳＷＨｌ、Ｆ＿ＳＷＨｒを用いて、畳み込み演算処理を行う。もちろん、実施
の形態１のシステムのように、フィルタを生成する装置と、頭外定位処理を行う装置が異
なる装置となっていてもよい
処理装置７００は、音源ファイル７０１と、測定手段７０２と、フィルタ生成手段７０
３と、畳み込み手段７０４と、再生手段７０５と、送受信手段７０６と、メモリ７０７と
、センタｃｈ用残響除去手段７０８と、センタｃｈ用音量可変手段７０９と、を備えてい
る。さらに、処理装置７００には、セリフ音量制御手段７１１が接続されている。 FIG. 11 is a block diagram showing the configuration of a processing device 700 according to this embodiment. The processing device 700 corresponds to the processing device 201 shown in FIG. 2 and generates a filter.
Further, the processing device 700 performs out-of-head localization processing as shown in FIG. Here, the processing unit 700 includes 12 filters F_Hls, F_Hlo, F_CHl, F_C
Hr, F_Hro, F_Hrs, F_SHls, F_SHlo, F_SHro, F_SH
rs, F_SWHl, and F_SWHr are used to perform convolution operation processing. Of course, as in the system of Embodiment 1, the device that generates the filter and the device that performs the out-of-head localization processing may be different devices. Filter generating means 70
3, convolution means 704, reproduction means 705, transmission/reception means 706, memory 707, center channel dereverberation means 708, and center channel volume varying means 709. Further, the processing device 700 is connected with dialogue volume control means 711 .

測定手段７０２は、空間音響伝達特性を測定する。測定手段７０２は、６個のスピーカ
に対して、それぞれインパルス測定を行う。具体的には、図２に示した測定信号生成部２
１１のように、各スピーカにインパルス音を出力する。さらに、測定手段７０２は、収音
信号取得部２１２のように、マイク２Ｌ、２Ｒからの収音信号を取得する。 Measuring means 702 measures spatial sound transfer characteristics. The measurement means 702 performs impulse measurement for each of the six speakers. Specifically, the measurement signal generator 2 shown in FIG.
Like 11, an impulse sound is output to each speaker. Furthermore, the measurement means 702 acquires sound signals from the microphones 2L and 2R, like the sound signal acquisition unit 212 .

フィルタ生成手段７０３は、フィルタ生成部２１３と同様に、収音信号に基づいて、フ
ィルタを生成する。ここでは、６個のスピーカ及び２個のマイクを用いて測定が行われて
いるため、１２個のフィルタが生成される。メモリ７０７は、１２個のフィルタを格納す
る。なお、送受信手段７０６は、例えば、図３で示したサーバ端末６００にフィルタを送
信してもよい。これにより、データベースにフィルタが格納される。 Similar to the filter generation unit 213, the filter generation unit 703 generates a filter based on the collected sound signal. Here, 12 filters are generated because the measurements are made with 6 loudspeakers and 2 microphones. Memory 707 stores 12 filters. Note that the transmitting/receiving means 706 may transmit the filter to the server terminal 600 shown in FIG. 3, for example. This will store the filter in the database.

５．１ｃｈの再生信号を再生している場合、ユーザＵがセリフ音量を個別に調整するこ
とができるようになっている。つまり、セリフの音声信号を出力するセンタースピーカの
音量のみが独立して調整可能になっている。セリフ音量制御手段７１１は、ユーザＵから
の入力を受け付けて、セリフ音量（センタｃｈの音量）を制御する。例えば、セリフ音量
制御手段７１１は、音量調整用のボタンやレバーを表示させる。そして、頭外定位受聴結
果に応じて、ユーザＵが、セリフ音量を上げたり、下げたりする。セリフ音量制御手段７
１１は、セリフ音量を示す音量信号をセンタｃｈ用音量可変手段７０９に出力する。セン
タｃｈ用音量可変手段７０９は可変増幅器を有しており、入力に応じて増幅率を可変する
。音量信号は、例えば音量の大きさを示すＶｏｌの数値または音量の増幅率を用いてもよ
い。 When reproducing a 5.1ch reproduction signal, the user U can individually adjust the dialogue volume. In other words, only the volume of the center speaker that outputs the audio signal of the dialogue can be adjusted independently. The dialogue volume control means 711 receives an input from the user U and controls the dialogue volume (the volume of the center channel). For example, the dialogue volume control means 711 displays buttons and levers for volume adjustment. Then, the user U raises or lowers the dialogue volume according to the out-of-head localization listening result. Dialogue volume control means 7
11 outputs a volume signal indicating the dialogue volume to the center channel volume varying means 709 . The center channel volume varying means 709 has a variable amplifier and varies the amplification factor according to the input. For the volume signal, for example, a numerical value of Vol indicating the magnitude of the volume or an amplification factor of the volume may be used.

センタｃｈ用残響除去手段７０８は、音量信号に基づいて、センタｃｈの残響を削除す
るための処理を行う。ここでは、センタｃｈ用残響除去手段７０８は、センタｃｈのフィ
ルタＦ＿ＣＨｌ、Ｆ＿ＣＨｒに対して窓掛けを行う。窓掛け後のフィルタをＦ＿ＷＣＨｌ
、Ｆ＿ＷＣＨｒとする。 Center channel dereverberation means 708 performs processing for deleting center channel reverberation based on the volume signal. Here, the center channel dereverberation unit 708 performs windowing on the center channel filters F_CHl and F_CHr. The filter after windowing is F_WCHl
, F_WCHr.

例えば、センタｃｈの音量が閾値以上の場合、センタｃｈ用残響除去手段７０８は、フ
ィルタＦ＿ＣＨｌ、Ｆ＿ＣＨｒの後半部分のデータをゼロにする窓関数を用いて窓掛けを
行う。このようにすることでノイズを減少することができ、自然な響きを得ることができ
る。また、所定時間までは一定であり、所定時間後、徐々に減少するような窓関数をセン
タｃｈ用残響除去手段７０８が用いてもよい。窓掛け後のフィルタＦ＿ＷＣＨｌ、Ｆ＿Ｗ
ＣＨｒを用いることで、セリフ部分の残響を抑制することができる。また、音量が閾値未
満の場合、窓掛けを行わなくてもよいが、便宜上矩形窓を用いて窓掛けを行ったものとす
る。 For example, when the sound volume of the center channel is equal to or higher than the threshold, the center channel dereverberation unit 708 performs windowing using a window function that zeroes the data in the latter half of the filters F_CHl and F_CHr. By doing so, noise can be reduced and natural sound can be obtained. Further, the center channel dereverberation means 708 may use a window function that is constant until a predetermined time and gradually decreases after the predetermined time. Filter F_WCHl, F_W after windowing
By using CHr, it is possible to suppress the reverberation of the dialogue portion. Also, if the sound volume is less than the threshold, windowing may not be performed, but for the sake of convenience, it is assumed that windowing is performed using a rectangular window.

さらに、セリフ音量に応じて、窓関数を変化させるようにしてもよい。セリフ音量が大
きくなるにつれて、窓の長さが長くなるような窓関数を用いることが可能である。或いは
、セリフ音量が大きくなるにつれて、窓の長さが短くなるような窓関数を用いることも可
能である。 Furthermore, the window function may be changed according to the speech volume. It is possible to use a window function such that the length of the window increases as the volume of the dialogue increases. Alternatively, it is possible to use a window function in which the length of the window becomes shorter as the speech volume increases.

音源ファイル７０１には、５．１ｃｈの再生信号が格納されている。５．１ｃｈの再生
信号は、畳み込み手段７０４に入力される。畳み込み手段７０４は、６個の再生信号に対
して、１２個のフィルタを用いて畳み込み演算処理を行う。 A sound source file 701 stores a 5.1ch reproduction signal. The 5.1ch reproduced signal is input to convolution means 704 . Convolution means 704 performs convolution arithmetic processing on the six reproduced signals using 12 filters.

畳み込み手段７０４は、フィルタＦ＿Ｈｌｓ、Ｆ＿Ｈｌｏ、Ｆ＿ＷＣＨｌ、Ｆ＿ＷＣＨ
ｒ、Ｆ＿Ｈｒｏ、Ｆ＿Ｈｒｓ、Ｆ＿ＳＨｌｓ、Ｆ＿ＳＨｌｏ、Ｆ＿ＳＨｒｏ、Ｆ＿ＳＨｒ
ｓ、Ｆ＿ＳＷＨｌ、Ｆ＿ＳＷＨｒを用いて、畳み込み演算を行う。図１２は、５．１ｃｈ
の再生信号の場合の畳み込み演算とセリフ音量調整を説明するための図である。 The convolution means 704 includes filters F_Hls, F_Hlo, F_WCHl, F_WCH
r, F_Hro, F_Hrs, F_SHls, F_SHlo, F_SHro, F_SHr
A convolution operation is performed using s, F_SWHl, and F_SWHr. FIG. 12 shows the 5.1ch
FIG. 10 is a diagram for explaining the convolution operation and dialogue volume adjustment in the case of the reproduction signal of .

左前方ｃｈの再生信号をＬ（ｔ）、センタｃｈの再生信号をＣ（ｔ）、右前方ｃｈの再
生信号をＲ（ｔ）とする。左後方ｃｈの再生信号をＳＬ（ｔ）、右後方チャネルの再生信
号をＳＲ（ｔ）、サブウーファーｃｈの再生信号をＬＦＥ（ｔ）とする。そして、それぞ
れの再生信号には、対応するフィルタを畳み込む。例えば、センタｃｈの再生信号Ｃ（ｔ
）には、フィルタＦ＿ＷＣＨｌ、Ｆ＿ＷＣＨｒがそれぞれ畳み込まれている。 Let L(t) be the reproduction signal of the left front channel, C(t) be the reproduction signal of the center channel, and R(t) be the reproduction signal of the right front channel. Let SL(t) be the reproduced signal of the left rear channel, SR(t) be the reproduced signal of the right rear channel, and LFE(t) be the reproduced signal of the subwoofer channel. Then, each reproduced signal is convoluted with a corresponding filter. For example, the reproduction signal C(t
) are convoluted with filters F_WCHl and F_WCHr, respectively.

そして、加算器２４は、フィルタＦ＿Ｈｌｓ、Ｆ＿ＷＣＨｌ、Ｆ＿Ｈｒｏ、Ｆ＿ＳＨｌ
ｓ、Ｆ＿ＳＨｒｏ、Ｆ＿ＳＷＨｌが畳み込まれた６つの畳み込み信号を加算して、加算信
号ＨＲｌ（ｔ）を生成する。加算器２４は、加算信号ＨＲｌ（ｔ）をフィルタ部４１（図
１参照）に出力する。加算器２５は、フィルタＦ＿Ｈｌｏ、Ｆ＿ＷＣＨｒ、Ｆ＿Ｈｒｓ、
Ｆ＿ＳＨｌｏ、Ｆ＿ＳＨｒｓ、Ｆ＿ＳＷＨｒが畳み込まれた畳み込み信号を加算して、加
算信号ＨＲｒ（ｔ）を生成する。加算器２５は、加算信号ＨＲｒ（ｔ）をフィルタ部４２
（図１参照）に出力する。再生手段７０５は、加算信号ＨＲｌ（ｔ）、ＨＲｒ（ｔ）にそ
れぞれ逆フィルタＬｉｎｖ，Ｒｉｎｖを畳み込む。そして、逆フィルタが畳み込まれた加
算信号ＨＲｌ（ｔ）、ＨＲｒ（ｔ）がヘッドホン４３から出力される。 Adder 24 then adds filters F_Hls, F_WCHl, F_Hro, F_SHl
The six convolved signals with s, F_SHro, and F_SWHl are summed to generate summed signal HRl(t). The adder 24 outputs the addition signal HRl(t) to the filter section 41 (see FIG. 1). Adder 25 adds filters F_Hlo, F_WCHr, F_Hrs,
The convolved signals with F_SHlo, F_SHrs, and F_SWHr are added to generate summed signal HRr(t). The adder 25 passes the addition signal HRr(t) to the filter section 42
(See FIG. 1). The reproducing means 705 convolves the summed signals HRl(t) and HRr(t) with the inverse filters Linv and Rinv, respectively. Then, the added signals HRl(t) and HRr(t) convoluted with the inverse filters are output from the headphone 43 .

ここで、フィルタＦ＿ＷＣＨｌ、Ｆ＿ＷＣＨｒが畳み込まれた畳み込み信号は、可変増
幅器７２１を介して、加算器２４、２５に入力されている。セリフ音量制御手段７１１は
、入力された音量に応じて、可変増幅器７２１の振幅増幅率を変化させる。これにより、
センタｃｈによるセリフ音量をユーザＵの好みに応じて調整することができる。よって、
頭外定位処理をより適切に行うことができる。
実施の形態４.
実施の形態４では、頭外定位受聴において、頭外定位処理装置１００がユーザをよりリ
ラックスさせるための処理を行っている。例えば、頭外定位処理装置１００がフィルタの
ダイナミックレンジを圧縮したり、再生信号の供給方法を変更したりすることで、ＢＧＭ
（Back-Ground Music）のように聴こえる様な処理を行っている。
例１.
例１では、フィルタの高周波数帯域を圧縮することで、フィルタが生成されている。具
体的には、フィルタ生成装置が、測定した空間音響伝達特性のそれぞれに対して、ローパ
スフィルタ（ＬＰＦ）をかけている。例えば、２ｋＨｚをカットオフ周波数とするＬＰＦ
をかけた例を図１３に示す。図１３には、フィルタＦ＿ＨｌｓにＬＰＦをかける処理前後
の音圧レベルを示している。ＬＰＦをかけたフィルタを用いて頭外定位処理を行うことで
、ユーザＵはよりリラックスすることができる。
例２.
例２では、フィルタ生成装置が、各フィルタにおいて、直接音の振幅を変えたものを残
響成分として付加している。具体的には、フィルタ生成装置が、測定された伝達特性の直
接音信号を切り出し、直接音信号の振幅を変えた信号を残響成分として、直接音信号の後
に付加することでフィルタが生成されている。図１４は、図１０に示した収音信号に対し
て、残響成分を付加した後のフィルタＦ＿Ｈｌｓを示すものである。図１０では、７個の
残響成分Ｄ１～Ｄ７が追加されている。それぞれの残響成分は、直接音信号の振幅を個別
に調整したものである。なお、追加する残響成分の数は、特に限定されるものではない。
このようなフィルタを用いて頭外定位処理を行うことで、ユーザＵはよりリラックスする
ことができる。 Here, convoluted signals obtained by convoluting the filters F_WCH1 and F_WCHr are input to the adders 24 and 25 via the variable amplifier 721 . The dialogue volume control means 711 changes the amplitude amplification factor of the variable amplifier 721 according to the input volume. This will
The speech volume by the center channel can be adjusted according to user U's preference. Therefore,
Out-of-head localization processing can be performed more appropriately.
Embodiment 4.
In Embodiment 4, the out-of-head localization processing apparatus 100 performs processing to make the user more relaxed in out-of-head localization listening. For example, the out-of-head localization processing device 100 compresses the dynamic range of the filter or changes the method of supplying the reproduced signal, thereby
(Back-Ground Music).
Example 1.
In example 1, the filter is generated by compressing the high frequency band of the filter. Specifically, the filter generator applies a low-pass filter (LPF) to each of the measured spatial acoustic transfer characteristics. For example, an LPF with a cutoff frequency of 2 kHz
is shown in FIG. FIG. 13 shows the sound pressure levels before and after applying the LPF to the filter F_Hls. By performing the out-of-head localization process using the LPF-applied filter, the user U can be more relaxed.
Example 2.
In Example 2, the filter generation device adds a reverberation component obtained by changing the amplitude of the direct sound in each filter. Specifically, the filter generation device cuts out the direct sound signal with the measured transfer characteristics, and adds a signal obtained by changing the amplitude of the direct sound signal as a reverberation component after the direct sound signal to generate the filter. there is FIG. 14 shows the filter F_Hls after adding the reverberation component to the collected sound signal shown in FIG. In FIG. 10, seven reverberation components D1-D7 are added. Each reverberation component is a separate adjustment of the amplitude of the direct sound signal. Note that the number of reverberation components to be added is not particularly limited.
By performing out-of-head localization processing using such a filter, the user U can be more relaxed.

上記処理のうちの一部又は全部は、コンピュータプログラムによって実行されてもよい
。上述したプログラムは、様々なタイプの非一時的なコンピュータ可読媒体（ｎｏｎ－ｔ
ｒａｎｓｉｔｏｒｙｃｏｍｐｕｔｅｒｒｅａｄａｂｌｅｍｅｄｉｕｍ）を用いて格
納され、コンピュータに供給することができる。非一時的なコンピュータ可読媒体は、様
々なタイプの実体のある記録媒体（ｔａｎｇｉｂｌｅｓｔｏｒａｇｅｍｅｄｉｕｍ）
を含む。非一時的なコンピュータ可読媒体の例は、磁気記録媒体（例えばフレキシブルデ
ィスク、磁気テープ、ハードディスクドライブ）、光磁気記録媒体（例えば光磁気ディス
ク）、ＣＤ－ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＣＤ－Ｒ、ＣＤ－Ｒ／Ｗ、
半導体メモリ（例えば、マスクＲＯＭ、ＰＲＯＭ（ＰｒｏｇｒａｍｍａｂｌｅＲＯＭ)
、ＥＰＲＯＭ（ＥｒａｓａｂｌｅＰＲＯＭ)、フラッシュＲＯＭ、ＲＡＭ（Ｒａｎｄｏ
ｍＡｃｃｅｓｓＭｅｍｏｒｙ））を含む。また、プログラムは、様々なタイプの一時
的なコンピュータ可読媒体（ｔｒａｎｓｉｔｏｒｙｃｏｍｐｕｔｅｒｒｅａｄａｂｌ
ｅｍｅｄｉｕｍ)によってコンピュータに供給されてもよい。一時的なコンピュータ可
読媒体の例は、電気信号、光信号、及び電磁波を含む。一時的なコンピュータ可読媒体は
、電線及び光ファイバ等の有線通信路、又は無線通信路を介して、プログラムをコンピュ
ータに供給できる。 A part or all of the above processes may be executed by a computer program. The programs described above may be stored on various types of non-transitory computer-readable media (non-t
stored using a random computer readable medium) and delivered to a computer. Non-transitory computer readable media include various types of tangible storage media.
including. Examples of non-transitory computer-readable media include magnetic recording media (eg, flexible discs, magnetic tapes, hard disk drives), magneto-optical recording media (eg, magneto-optical discs), CD-ROMs (Read Only Memory), CD-Rs, CD-R/W,
Semiconductor memory (e.g., mask ROM, PROM (Programmable ROM)
, EPROM (Erasable PROM), Flash ROM, RAM (Rando
m Access Memory)). Programs may also be stored on various types of transitory computer readable media.
e medium) to the computer. Examples of transitory computer-readable media include electrical signals, optical signals, and electromagnetic waves. Transitory computer-readable media can deliver the program to the computer via wired channels, such as wires and optical fibers, or wireless channels.

以上、本発明者によってなされた発明を実施の形態に基づき具体的に説明したが、本発
明は上記実施の形態に限られたものではなく、その要旨を逸脱しない範囲で種々変更可能
であることは言うまでもない。 The invention made by the present inventor has been specifically described above based on the embodiments, but the present invention is not limited to the above embodiments, and various modifications can be made without departing from the scope of the invention. Needless to say.

Ｕユーザ
１被測定者
１０頭外定位処理部
１１畳み込み演算部
１２畳み込み演算部
２１畳み込み演算部
２２畳み込み演算部
２４加算器
２５加算器
４１フィルタ部
４２フィルタ部
４３ヘッドホン
２００測定装置
２０１処理装置
２１１測定信号生成部
２１２収音信号取得部
２１３フィルタ生成部
５１１座席端末
５２１搭乗席 U User 1 Subject 10 Out-of-Head Localization Processor 11 Convolution Calculator 12 Convolution Calculator 21 Convolution Calculator 22 Convolution Calculator 24 Adder 25 Adder 41 Filter 42 Filter 43 Headphone 200 Measurement Device 201 Processing Device 211 Measurement Signal generation unit 212 Collected sound signal acquisition unit 213 Filter generation unit 511 Seat terminal 521 Boarding seat

Claims

A measuring device that measures transfer characteristics using a microphone attached to the ear of the user before the user sits on the seat;
an out-of-head localization processing device that performs out-of-head localization processing using a filter according to the transfer characteristics;
an out-of-head localization processing system, comprising: a server that transmits a filter corresponding to the transfer characteristics to the out-of-head localization processing device based on the identification information of the user.

a plurality of seats and a plurality of out-of-head stereotactic processing devices are installed in association with each other;
2. The out-of-head localization processing system according to claim 1, wherein the server identifies the seat of the user by referring to the identification information, and transmits the filter to the out-of-head localization processing device corresponding to the identified seat. .

3. The out-of-head localization processing system according to claim 1, wherein payment by the user is accepted after the user performs trial listening.

measuring the ear canal transfer characteristics from the headphones or earphones worn by the user to the microphone;
By referring to a database that stores a plurality of sets of first characteristic data corresponding to spatial sound transfer characteristics from a speaker to the microphone and second characteristic data corresponding to the ear canal transfer characteristics as one set. 4. The out-of-head localization processing system according to any one of claims 1 to 3, wherein a filter corresponding to said spatial sound transfer characteristics of said user is acquired based on said ear canal transfer characteristics of said user.

before the user is seated in the seat, using a microphone worn in the user's ear to perform a transfer characteristic;
a step of transmitting a filter corresponding to the transfer characteristic to an out-of-head localization processing device installed in the seat based on the identification information of the user;
a step in which the out-of-head localization processing device performs out-of-head localization processing on the reproduced signal using the filter;
and outputting the reproduced signal after the out-of-head localization processing from headphones or earphones to the user corresponding to the identification information.