JP7583638B2

JP7583638B2 - Object-based audio rendering device and program

Info

Publication number: JP7583638B2
Application number: JP2021023419A
Authority: JP
Inventors: 陽佐々木; 岳大杉本; 敏行西口; 弘樹久保; 知美小倉; 洋幸大久保
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2021-02-17
Filing date: 2021-02-17
Publication date: 2024-11-14
Anticipated expiration: 2041-02-17
Also published as: JP2022125686A

Description

本発明は、オブジェクトベース音響の再生装置に関し、特に、ＷＦＳ（Wave Field Synthesis：波面合成方式）の考え方に基づいてスピーカの駆動信号を算出するオブジェクトベース音響レンダリング装置及びプログラムに関する。 The present invention relates to an object-based audio playback device, and in particular to an object-based audio rendering device and program that calculates speaker drive signals based on the concept of WFS (Wave Field Synthesis).

近年、映画音響を中心にオブジェクトベース音響技術が脚光を浴びつつある。従来は、複数の音声素材を規定のチャンネルフォーマットへミックスダウンして記録するチャンネルベース音響が主流であった。 In recent years, object-based audio technology has been gaining attention, particularly in the field of movie sound. Previously, channel-based audio was the mainstream, in which multiple audio materials were mixed down to a specified channel format and then recorded.

オブジェクトベース音響では、個々の音声素材を音響オブジェクトとして別々に記録しておき、音響オブジェクトのレベル及び座標等が記述された音響メタデータに基づいてレンダリングを行う。 In object-based audio, each piece of audio material is recorded separately as an audio object, and rendering is performed based on audio metadata that describes the level and coordinates of the audio object.

以下、このようなオブジェクトベース音響方式の再生装置をオブジェクトベース音響レンダリング装置という。オブジェクトベース音響レンダリング装置は、音響オブジェクトに対し、音響メタデータと配置された複数のスピーカ位置に基づいて音響オブジェクトのレンダリングを行い、それぞれのスピーカの位置に応じた駆動信号を生成することで、試聴環境に応じた音響再生を行う。したがって、コンテンツの制作時とは異なるスピーカ配置で再生する場合であっても、それに適応した再生が可能となる。 Hereinafter, such an object-based audio playback device will be referred to as an object-based audio rendering device. The object-based audio rendering device renders audio objects based on audio metadata and the positions of multiple speakers arranged for the audio objects, and generates drive signals according to the positions of each speaker, thereby playing audio according to the listening environment. Therefore, even if the content is played back with a speaker arrangement different from that used when it was created, playback that is adapted to the listening environment is possible.

〔ＶＢＡＰ〕
従来、オブジェクトベース音響のレンダリングには、ＶＢＡＰ（Vector Based Amplitude Panning）と呼ばれるアルゴリズムが多く利用される（例えば、非特許文献１を参照）。 [VBAP]
Conventionally, an algorithm called Vector Based Amplitude Panning (VBAP) is often used for rendering object-based audio (see, for example, Non-Patent Document 1).

ＶＢＡＰでは、再生空間を３個のスピーカからなる三角形領域で分割し、音源座標を含む三角形の各頂点に位置するスピーカに対して重みを算出する。そして、音源信号を前記算出した重みで分配することにより、振幅パンニングを行う。 In VBAP, the playback space is divided into a triangular region consisting of three speakers, and weights are calculated for the speakers located at each vertex of the triangle that contains the sound source coordinates. Amplitude panning is then performed by distributing the sound source signal using the calculated weights.

しかしながら、ＶＢＡＰでは、スピーカがある特定の半径の球面状に配置され、音源がその球面上に配置されているものとして処理が行われる。そのため、音源の距離による遠近の表現をすることができない。 However, VBAP processes the sound as if the speakers are arranged on a sphere of a specific radius, and the sound source is placed on that sphere. Therefore, it is not possible to express the sense of perspective based on the distance of the sound source.

ＶＢＡＰを用いて遠近も含めた音場の表現を行う際には、距離減衰及び残響等を付加して表現することがあるが（例えば、非特許文献２を参照）、それらは心理音響に基づく手法であり、音場を物理的に再現することを目的とした方法ではない。 When using VBAP to represent a sound field that includes both near and far, distance attenuation and reverberation may be added (see, for example, Non-Patent Document 2), but these are methods based on psychoacoustics and are not intended to physically reproduce the sound field.

また、ＶＢＡＰの重み算出アルゴリズムによれば、スイートスポットで聴取することを想定して重みが算出されるため、そこから離れた位置で試聴する場合において、再生品質は保証されるものではない。 In addition, VBAP's weight calculation algorithm calculates weights assuming listening in the sweet spot, so playback quality is not guaranteed when listening from a position away from that sweet spot.

〔ＷＦＳ〕
一方、波動音響に基づく音場の表現方法として、ＷＦＳと呼ばれる手法が知られている。ＷＦＳは、以下の式にて表されるキルヒホッフ－ヘルムホルツ積分方程式によって波動音響的に裏付けられた音場再現手法である（例えば、非特許文献３を参照）。

[WFS]
On the other hand, a method called WFS is known as a method for expressing a sound field based on wave acoustics. WFS is a sound field reproduction method supported by wave acoustics using the Kirchhoff-Helmholtz integral equation expressed by the following formula (see, for example, Non-Patent Document 3):

ここで、ｐ（ｒ）は、境界∂Ｖで閉じられたある領域Ｖ内の任意の点ｒでの音圧を示し、Ｇ（ｒ｜ｒ_o）は、境界∂Ｖ上の点ｒ_oから境界∂Ｖ内の任意の点ｒまでのグリーン関数を示す。ｎ（ｒ_o）は、点ｒ_oにおける境界∂Ｖの法線方向内向きの単位ベクトルを示す。ｐ（ｒ_o）は、境界∂Ｖ上の点ｒ_oにおける音圧を示し、∂ｐ（ｒ_o）／∂ｎは、境界∂Ｖ上の点ｒ_oにおける単位ベクトルｎ（ｒ_o）方向の音圧勾配ベクトルを示す。dＳ_Oは、境界∂Ｖ上の微小面積である。 Here, p(r) denotes the sound pressure at any point r in a region V enclosed by the boundary ∂V, and G(r|r _o ) denotes the Green's function from point r _o on the boundary ∂V to any point r within the boundary ∂V. n(r _o ) denotes the unit vector inward in the normal direction of the boundary ∂V at point r _o . p(r _o ) denotes the sound pressure at point r _o on the boundary ∂V, and ∂p(r _o )/∂n denotes the sound pressure gradient vector in the direction of unit vector n(r _o ) at point r _o on the boundary ∂V. dS _o is an infinitesimal area on the boundary ∂V.

またここで、境界が任意の形状をしている場合に、境界∂Ｖにおける境界条件として、∂Ｇ（ｒ｜ｒ_o）／∂ｎ＝０なるノイマン条件を想定したグリーン関数を設定する。そして、点ｒ_oにおける平均音響インテンシティＩ（ｒ_o）及び単位ベクトルｎ（ｒ_o）によって決定される以下の窓関数ａ（ｒ_o）を導入する。

In addition, when the boundary has an arbitrary shape, a Green's function assuming the Neumann condition of ∂G(r|r _o )/∂n=0 is set as the boundary condition at the boundary ∂V. Then, the following window function a(r _o ) is introduced, which is determined by the average acoustic intensity I(r _o ) at the point r _o and the unit vector n(r _o ):

これにより、以下の等式が成り立つ。

This results in the following equality:

Ｇ_o（ｒ｜ｒ_o）は、点ｒ_oから点ｒまでの自由音場グリーン関数を示す。キルヒホッフ－ヘルムホルツ積分方程式は、ある領域の外部に音源が存在する場合、その領域内部の音場は、境界面の境界法線方向の音圧勾配分布によって決定されるという事実を示している。 G _o (r|r _o ) denotes the free-field Green's function from point r _o to point r. The Kirchhoff-Helmholtz integral equation expresses the fact that when a sound source exists outside a certain region, the sound field inside the region is determined by the sound pressure gradient distribution in the boundary normal direction of the boundary surface.

ＷＦＳでは、音場を再現したい領域の境界面上に密にスピーカを配置し、音源（点音源を想定）の座標を与えることで、キルヒホッフ－ヘルムホルツ積分方程式に基づいて、スピーカの駆動信号を決定する。 In WFS, speakers are densely arranged on the boundary surfaces of the area in which the sound field is to be reproduced, and the coordinates of the sound source (assumed to be a point source) are given, and the driving signals for the speakers are determined based on the Kirchhoff-Helmholtz integral equation.

VlLLE PULKKI,“Virtual Sound Source Positioning Using Vector Base Amplitude Panning”,J. Audio Eng. Soc., Vol. 45, No.6, 1997 JuneVlLLE PULKKI, “Virtual Sound Source Positioning Using Vector Base Amplitude Panning”, J. Audio Eng. Soc., Vol. 45, No.6, 1997 June 勧告ITU-R. BS.2127-0Recommendation ITU-R.BS.2127-0 Ahrens Jens, Rabenstein Rudolph, Spors Sascha,“The Theory of wave field synthesis revisited”, in 124th Conv. Audio Eng. Soc., 2008Ahrens Jens, Rabenstein Rudolph, Spors Sascha, “The Theory of wave field synthesis revisited”, in 124th Conv. Audio Eng. Soc., 2008

このように、ＷＦＳを用いることにより、再生領域の内側で波面を再合成することができるため、ＶＢＡＰとは異なり、スイートスポットから離れた点において再生品質が低下するという問題点は生じない。 In this way, by using WFS, the wavefront can be resynthesized inside the playback area, so unlike VBAP, there is no problem with the playback quality deteriorating at points away from the sweet spot.

しかしながら、ＷＦＳを用いて厳密に波面を合成するためには、境界面に密にスピーカを配置する必要があり、また、所定の周波数特性を持つフィルタ及び遅延等の要素を加える必要がある。このため、ＷＦＳを用いた場合には、ＶＢＡＰに比べてシステムの規模が大きくなるという課題があった。 However, in order to precisely synthesize wavefronts using WFS, it is necessary to place speakers closely on the boundary surfaces, and it is also necessary to add elements such as filters with specific frequency characteristics and delays. For this reason, when using WFS, there is an issue that the system size is larger than that of VBAP.

そこで、本発明は前記課題を解決するためになされたものであり、その目的は、システムの規模を拡大することなく、ＷＦＳに基づき音源の距離の遠近を表現する駆動信号を生成可能なオブジェクトベース音響レンダリング装置及びプログラムを提供することにある。 The present invention has been made to solve the above problems, and its purpose is to provide an object-based audio rendering device and program that can generate drive signals that represent the distance of a sound source based on WFS without expanding the scale of the system.

前記課題を解決するために、請求項１のオブジェクトベース音響レンダリング装置は、音響メタデータに記述された位置に配置された音源オブジェクトに対しレンダリングを行うことで、試聴環境に配置された複数のスピーカのそれぞれに対する駆動信号を生成するオブジェクトベース音響レンダリング装置において、前記複数のスピーカの個数をＬ、前記スピーカの番号をｌ（＝１，・・・，Ｌ）、第ｌ番目の前記スピーカを第ｌ番目スピーカ、音源信号が出力される仮想音源の座標を仮想音源座標ｒ_S、前記第ｌ番目スピーカの座標をスピーカ座標ｒ_l、前記第ｌ番目スピーカの位置での境界面内向き法線ベクトルをｎ（ｒ_l）、前記第ｌ番目スピーカの位置を中心とする面積要素をΔＳ_l、前記第ｌ番目スピーカに対する前記駆動信号をｄ（ｒ_l）として、前記複数のスピーカのそれぞれについて、前記仮想音源座標ｒ_Sの位置から前記第ｌ番目スピーカの前記スピーカ座標ｒ_lの位置までのベクトルと前記第ｌ番目スピーカの前記境界面内向き法線ベクトルｎ（ｒ_l）との成す角θ_lを算出する角度算出部、並びに、前記複数のスピーカのそれぞれに対応する信号増幅部、ＨＰＦ（ハイパスフィルタ）、減衰付加部、面積要素付加部及び第１角度重み付加部を備え、前記信号増幅部が、前記音源信号を定数にて増幅し、前記ＨＰＦが、前記仮想音源座標ｒ_S及び前記第ｌ番目スピーカの前記スピーカ座標ｒ_lに基づいて、ＨＰＦｈ（ｒ_l，ｒ_S）を設定し、前記信号増幅部により増幅された信号に対し、前記ＨＰＦｈ（ｒ_l，ｒ_S）を用いてフィルタ処理を施し、前記減衰付加部が、前記仮想音源座標ｒ_Sの位置から前記第ｌ番目スピーカの前記スピーカ座標ｒ_lの位置までのベクトルの絶対値を求め、前記絶対値を２乗した結果の逆数を減衰係数１／||ｒ_l－ｒ_S||²として設定し、前記フィルタ処理が施された信号に対し前記減衰係数１／||ｒ_l－ｒ_S||²を乗算し、前記面積要素付加部が、前記減衰付加部により乗算された信号に対し前記面積要素ΔＳ_lを乗算し、前記第１角度重み付加部が、前記角度算出部により算出された前記第ｌ番目スピーカの前記成す角θ_lに基づき、０から１までの範囲において、前記成す角θ_lの絶対値が０°に近いほど１に近い値をとり、前記成す角θ_lの絶対値が９０°に近いほど０に近い値をとり、前記成す角θ_lの絶対値が０°以下または９０°以上のときに０の値をとる角度重みｗ_C（θ_l）を設定し、前記面積要素付加部により乗算された信号に対し前記角度重みｗ_C（θ_l）を乗算することで、前記第ｌ番目スピーカに対する前記駆動信号ｄ（ｒ_l）を生成する、ことを特徴とする。 In order to solve the above problem, an object-based audio rendering device according to claim 1 is an object-based audio rendering device that generates a drive signal for each of a plurality of speakers arranged in a listening environment by performing rendering on a sound source object arranged at a position described in audio metadata, the object-based audio rendering device generating a drive signal for each of a plurality of speakers arranged in a listening environment, the object-based audio rendering _device generating a drive signal for each of a plurality of speakers arranged in a listening environment by performing rendering on a sound source object arranged at a position described in audio _metadata , the object-based audio rendering device generating a drive signal for each of a plurality of speakers by performing rendering on a sound source object arranged at a position described in audio _metadata , the object- _based audio rendering device generating a drive signal for each of a plurality of speakers by performing _rendering on a sound source object arranged at a position described in audio _metadata , the object- _based audio rendering device generating a drive signal for each of _a plurality of speakers by performing rendering on ), as well as signal amplification units, HPFs (high pass filters), attenuation application units, area element application units and first angle weighting units corresponding to each of the plurality of speakers, wherein the signal amplification units amplify the sound source signal by _a constant, the HPF sets HPFh(r l , r S ) based on the virtual sound source coordinates r _S and the speaker coordinates _r _l of the first speaker, and performs filtering on the signal amplified by the signal amplification units using the HPFh(r _l , r _S ), the attenuation application units obtain an absolute value of a vector from a position of the virtual sound source _{coordinates r S} _to a position of the speaker coordinates r _l of the first speaker, set the reciprocal of the result of squaring the absolute value as an attenuation coefficient 1/||r _l -r _S || ² , and apply the attenuation coefficient 1/||r _l -r _S || to the signal that has been subjected to the filtering processing. ² , the area element adding unit multiplies the signal multiplied by the attenuation adding unit by the area element ΔS _l , the first angle weighting adding unit sets an angle weight w C (θ l ) which, in a range from 0 to 1 based on the angle θ _l of the first speaker calculated by the angle calculation unit, takes a value closer to 1 as the absolute value of the angle θ _l approaches 0°, takes a value closer to 0 as the absolute value of the angle θ _l approaches 90°, and takes a value of 0 when the absolute value of the angle _θ _l is 0° or less or 90° or more, and generates the drive signal d _{(r l} ₎ for the first speaker by multiplying the signal multiplied by the area element adding unit by the angle weight w _C (θ _l ).

また、請求項２のオブジェクトベース音響レンダリング装置は、請求項１に記載のオブジェクトベース音響レンダリング装置において、さらに、前記仮想音源座標ｒ_Sへの仮想音源方向の単位ベクトル及び全てのスピーカにおける前記スピーカ座標ｒ_lへのスピーカ方向の単位ベクトルに基づいて、前記仮想音源を内部に含む３つのスピーカをＶＢＡＰ（Vector Based Amplitude Panning）対象の３つのスピーカとして特定し、前記ＶＢＡＰ対象の３つのスピーカの単位ベクトルｒ_n1，ｒ_n2，ｒ_n3を取得し、前記仮想音源座標ｒ_Sへの仮想音源方向の単位ベクトル及び前記ＶＢＡＰ対象の３つのスピーカの単位ベクトルｒ_n1，ｒ_n2，ｒ_n3に基づいて、前記ＶＢＡＰ対象の３つのスピーカに対応する係数ｇ_l＝ｇ_n1，ｇ_n2，ｇ_n3を求め、前記全てのスピーカのうち、前記ＶＢＡＰ対象の３つのスピーカ以外のスピーカに対応する係数ｇ_l＝０を設定し、前記係数ｇ_lを出力するＶＢＡＰ係数算出部、並びに、前記複数のスピーカのそれぞれに対応するＶＢＡＰ係数乗算部、第２角度重み付加部及び加算部を備え、前記ＶＢＡＰ係数乗算部が、前記面積要素付加部により乗算された信号に対し、前記ＶＢＡＰ係数算出部により出力された前記係数ｇ_lを乗算し、前記第２角度重み付加部が、前記角度算出部により算出された前記第ｌ番目スピーカの前記成す角θ_lに基づき、０から１までの範囲において、前記成す角θ_lの絶対値が０°に近いほど０に近い値をとり、前記成す角θ_lの絶対値が９０°に近いほど１に近い値をとり、前記成す角θ_lの絶対値が０°以下または９０°以上のときに０の値をとる角度重みｗ_S（θ_l）を設定し、前記ＶＢＡＰ係数乗算部により乗算された信号に対し前記角度重みｗ_S（θ_l）を乗算することで、第２駆動信号を生成し、前記第１角度重み付加部が、前記角度重みｗ_C（θ_l）を設定し、前記面積要素付加部により乗算された信号に対し前記角度重みｗ_C（θ_l）を乗算することで、第１駆動信号を生成し、前記加算部が、前記第１駆動信号及び前記第２駆動信号を加算することで、前記第ｌ番目スピーカに対する前記駆動信号ｄ（ｒ_l）を生成する、ことを特徴とする。 The object-based acoustic rendering device of claim 2 is the object-based acoustic rendering device of claim 1, further comprising: identifying three speakers including the virtual sound source therein as three speakers to be subjected to Vector Based Amplitude Panning ( _VBAP ) based on a unit vector of a virtual sound source direction toward the virtual sound source coordinate _rS and a unit vector of a speaker direction toward the speaker coordinate r1 of all speakers; obtaining unit vectors _rn1 , _rn2 , _rn3 of the three speakers to be subjected to VBAP; determining coefficients gl = _gn1 , _gn2 , _gn3 corresponding to the three speakers to be subjected to VBAP based on the unit vector of the virtual sound source direction toward _{the virtual} sound source coordinate rS and the unit vectors _rn1 , _rn2 , _rn3 of the three speakers to be subjected to VBAP; setting coefficient _gl = 0 corresponding to speakers other than the three speakers to be subjected to _VBAP among all the speakers; the VBAP coefficient multiplication unit multiplies the signal multiplied by the area element addition unit by the coefficient _{g output by the VBAP coefficient calculation unit, the second angle weighting unit sets an angle weight wS(θl) which, in a range from 0 to 1} _, takes a value closer to 0 as the absolute value of the angle θl approaches 0°, takes a value closer to 1 as the absolute value of the angle _θl approaches 90°, and takes a value of 0 when the absolute value of the angle _θl is 0° or less or 90° or more, and generates a second drive signal by multiplying the signal multiplied by the VBAP coefficient multiplication unit, and the first angle weighting unit sets the angle weight _wS ( _θl ) which takes a value closer to 0 as the absolute value of the angle _θl approaches 0°, takes a value closer to 1 as the absolute value of the angle _θl approaches 90°, and takes a value of 0 when the absolute value of the angle _θl is 0° or less or 90° or more, based on the angle _θl of the first speaker calculated by the angle _calculation unit _. ), and multiplying the signal multiplied by the area element adding unit by the angle weight w _C (θ _l ) to generate a first drive signal, and the adder adds the first drive signal and the second drive signal to generate the drive signal d(r _l ) for the l-th speaker.

また、請求項３のオブジェクトベース音響レンダリング装置は、請求項２に記載のオブジェクトベース音響レンダリング装置において、前記ＶＢＡＰ係数乗算部が、前記面積要素付加部により乗算された信号の代わりに、前記音源信号を入力し、前記音源信号に対し前記係数ｇ_lを乗算する、ことを特徴とする。 The object-based acoustic rendering apparatus of claim 3 is characterized in that, in the object-based acoustic rendering apparatus of claim 2, the VBAP coefficient multiplication unit inputs the sound source signal instead of the signal multiplied by the area element addition unit, and multiplies the sound source signal by the coefficient g _l .

また、請求項４のオブジェクトベース音響レンダリング装置は、請求項１から３までのいずれか一項に記載のオブジェクトベース音響レンダリング装置において、前記ＨＰＦが、予め設定されたパラメータをα、予め設定されたフィルタ次数をＮ、及びｎ＝０，１，・・・，Ｎとして、以下の式：

にて、ＨＰＦ係数ｈ_n（ｒ_l，ｒ_S）を設定し、
ｘ［ｔ］及びｙ［ｔ］を時刻ｔにおける当該ＨＰＦの入力及び出力として、入出力特性が以下の式：

となるように、前記信号増幅部により増幅された信号に対し、前記ＨＰＦ係数ｈ_n（ｒ_l，ｒ_S）を用いてフィルタ処理を施す、ことを特徴とする。 The object-based acoustic rendering apparatus of claim 4 is the object-based acoustic rendering apparatus of any one of claims 1 to 3, wherein the HPF is expressed by the following formula: where α is a preset parameter, N is a preset filter order, and n=0, 1, . . . , N:

Set the HPF coefficients h _n (r _l , r _S ) in
Let x[t] and y[t] be the input and output of the HPF at time t, and the input/output characteristics are expressed by the following formula:

The signal amplified by the signal amplifier is filtered using the HPF coefficients h _n (r ₁ , r _s ) so that:

また、請求項５のオブジェクトベース音響レンダリング装置は、請求項１から４までのいずれか一項に記載のオブジェクトベース音響レンダリング装置において、前記減衰付加部が、予め設定されたパラメータをβ₁，β₂として、以下の式：

にて減衰係数ｇ（ｒ_l，ｒ_S）を求め、前記フィルタ処理が施された信号に対し前記減衰係数ｇ（ｒ_l，ｒ_S）を乗算する、ことを特徴とする。 According to a fifth aspect of the present invention, there is provided an object-based sound rendering apparatus according to any one of the first to fourth aspects, wherein the attenuation adding section calculates a value of β ₁ and β ₂ by the following equation:

and multiplying the filtered signal _by the attenuation coefficient g(r _l , r _S ₎ .

さらに、請求項６のプログラムは、コンピュータを、請求項１から５までのいずれか一項に記載のオブジェクトベース音響レンダリング装置として機能させることを特徴とする。 Furthermore, the program of claim 6 is characterized in that it causes a computer to function as an object-based audio rendering device according to any one of claims 1 to 5.

以上のように、本発明によれば、システムの規模を拡大することなく、ＷＦＳに基づき音源の距離の遠近を表現する駆動信号を生成することができる。 As described above, according to the present invention, it is possible to generate a drive signal that expresses the distance of a sound source based on WFS without expanding the scale of the system.

実施例１のオブジェクトベース音響レンダリング装置の構成例を示すブロック図である。1 is a block diagram showing an example of the configuration of an object-based acoustic rendering device according to a first embodiment. 実施例１のオブジェクトベース音響レンダリング装置の処理例を示す疑似コードである。4 is a pseudo code showing an example of processing of the object-based audio rendering device of the first embodiment. 角度算出部の構成例を示すブロック図である。4 is a block diagram showing an example of the configuration of an angle calculation unit; FIG. スピーカが疎に配置されている場合を説明する図である。FIG. 13 is a diagram illustrating a case where speakers are sparsely arranged. 実施例２のオブジェクトベース音響レンダリング装置の構成例を示すブロック図である。FIG. 11 is a block diagram showing an example of the configuration of an object-based acoustic rendering device according to a second embodiment. 実施例２のオブジェクトベース音響レンダリング装置の処理例を示す疑似コードである。11 is a pseudo code showing an example of processing of the object-based audio rendering device of the second embodiment. ＶＢＡＰ係数算出部の構成例を示すブロック図である。11 is a block diagram showing an example of the configuration of a VBAP coefficient calculation unit. FIG. 実施例３のオブジェクトベース音響レンダリング装置の構成例を示すブロック図である。FIG. 11 is a block diagram showing an example of the configuration of an object-based acoustic rendering device according to a third embodiment. 実施例３のオブジェクトベース音響レンダリング装置の処理例を示す疑似コードである。13 is a pseudo code showing an example of processing of the object-based audio rendering device of the third embodiment. ＶＢＡＰを説明するための概念図である。FIG. 1 is a conceptual diagram for explaining VBAP. ＷＦＳを説明するための概念図である。FIG. 1 is a conceptual diagram for explaining WFS.

以下、本発明を実施するための形態について図面を用いて詳細に説明する。本発明の実施例１のオブジェクトベース音響レンダリング装置は、ＷＦＳの構成を近似により単純化することで、システムの規模を拡大することなく、ＷＦＳに基づき音源の距離の遠近を表現する駆動信号を生成する。また、実施例２，３のオブジェクトベース音響レンダリング装置は、実施例１の構成に対してＶＢＡＰの構成を組み込むことで、スピーカが疎に配置された場合に駆動信号が小さくなる問題を解決する。 The following describes in detail the embodiments of the present invention with reference to the drawings. The object-based acoustic rendering device of the first embodiment of the present invention generates a drive signal that expresses the distance of a sound source based on the WFS without expanding the scale of the system by simplifying the WFS configuration through approximation. The object-based acoustic rendering device of the second and third embodiments incorporates a VBAP configuration into the configuration of the first embodiment, thereby solving the problem of the drive signal becoming small when speakers are sparsely arranged.

本発明の実施例１，２，３のオブジェクトベース音響レンダリング装置について説明する前に、ＶＢＡＰ及びＷＦＳによる音場再現手法の概要について説明する。 Before explaining the object-based acoustic rendering devices of the first, second, and third embodiments of the present invention, we will provide an overview of the sound field reproduction methods using VBAP and WFS.

〔ＶＢＡＰの概要〕
まず、ＶＢＡＰの概要について説明する。前述のとおり、ＶＢＡＰは、再生空間を３個のスピーカからなる三角形領域で分割し、音源座標を含む三角形の各頂点に位置するスピーカに対して重みを算出し、音源信号を分配することにより、振幅パンニングを行う振幅パンニング手法である。 [Outline of VBAP]
First, an overview of VBAP will be described. As described above, VBAP is an amplitude panning technique that divides a reproduction space into triangular regions each consisting of three speakers, calculates weights for the speakers located at each vertex of a triangle including the sound source coordinates, and distributes the sound source signal to perform amplitude panning.

図１０は、ＶＢＡＰを説明するための概念図である。ｘｙｚ空間において、受音点１００を原点とし、仮想音源１０１方向の単位ベクトルをｒ_nS＝（ｘ_nS，ｙ_nS，ｚ_nS）^Tとする。また、分割された三角形領域のうち、仮想音源１０１を内部に含む３つのスピーカ１０２－１，１０２－２，１０２－３方向の単位ベクトルをそれぞれ、ｒ_n1＝（ｘ_n1，ｙ_n1，ｚ_n1）^T，ｒ_n2＝（ｘ_n2，ｙ_n2，ｚ_n2）^T，ｒ_n3＝（ｘ_n3，ｙ_n3，ｚ_n3）^Tとする。 10 is a conceptual diagram for explaining VBAP. In the xyz space, the sound receiving point 100 is set as the origin, and the unit vector in the direction of the virtual sound source 101 is set as r _nS = (x _nS , _ynS , z _nS ) ^T. In addition, among the divided triangular regions, the unit vectors in the directions of the three speakers 102-1, 102-2, and 102-3 that include the virtual sound source 101 inside are set as r _n1 = (x _n1 , _yn1 , z _n1 ) ^T , r _n2 = (x _n2 , _yn2 , z _n2 ) ^T , and r _n3 = (x _n3 , _yn3 , z _n3 ) ^T , respectively.

このとき、仮想音源１０１方向の単位ベクトルｒ_nSは、スピーカ１０２－１，１０２－２，１０２－３方向の単位ベクトルｒ_n1，ｒ_n2，ｒ_n3及び係数ベクトルｇ＝（ｇ_n1，ｇ_n2，ｇ_n3）^Tを用いて以下のように表される。

ここで、Ｒ＝[ｒ_n1 ｒ_n2 ｒ_n3]^Tとする。 In this case, the unit vector r _nS in the direction of the virtual sound source 101 is expressed as follows using the unit vectors r _n1 , r _n2 , and r _n3 in the directions of the speakers 102-1, 102-2, and 102-3 and the coefficient vector g=(g _n1 , g _n2 , g _n3 ) ^T :

Here, let R ₌ [ _rn1rn2rn3 _] ^T .

係数ベクトルｇは、以下の式にて算出することができる。

Ｒ^-1はＲの逆行列である。 The coefficient vector g can be calculated by the following formula.

R ⁻¹ is the inverse matrix of R.

前記式（５）にて算出された係数ベクトルｇ＝（ｇ_n1，ｇ_n2，ｇ_n3）^Tを用いて重み付けが行われる。そして、重み付けが行われた信号を３つのスピーカ１０２－１，１０２－２，１０２－３から再生することにより、聴取者は、単位ベクトルｒ_nSの方向に音像を定位することができる。尚、仮想音源１０１を内部に含まない三角形を成すスピーカの重みは全て０とする。 Weighting is performed using the coefficient vector g = (g _n1 , g _n2 , g _n3 ) ^T calculated by the above formula (5). Then, by playing back the weighted signals from the three speakers 102-1, 102-2, and 102-3, the listener can localize the sound image in the direction of the unit vector r _nS . Note that the weights of all speakers forming a triangle that does not include the virtual sound source 101 inside are set to 0.

〔ＷＦＳの概要〕
次に、ＷＦＳの概要について説明する。前述のとおり、ＷＦＳは、音場を再現したい領域の境界面上に密にスピーカを配置し、前記式（１）のキルヒホッフ－ヘルムホルツ積分方程式に基づいて、スピーカ位置において、再現したい音場である原音場での境界面の法線方向音圧勾配を再現することで、境界面内部の音場を再現する音場再現手法である。 [Outline of WFS]
Next, an overview of WFS will be described. As described above, WFS is a sound field reproduction method in which speakers are densely arranged on the boundary surface of an area in which a sound field is to be reproduced, and the sound pressure gradient in the normal direction of the boundary surface in the original sound field, which is the sound field to be reproduced, is reproduced at the speaker positions based on the Kirchhoff-Helmholtz integral equation of the above formula (1), thereby reproducing the sound field inside the boundary surface.

図１１は、ＷＦＳを説明するための概念図であり、（１）は原音場を示し、（２）は再現音場を示す。 Figure 11 is a conceptual diagram to explain WFS, where (1) shows the original sound field and (2) shows the reproduced sound field.

（１）の原音場は、ｘｙｚ空間において、領域Ｃ１の外側の座標ｒ_S＝（ｘ_S，ｙ_S，ｚ_S）^Tの点に配置されている音源１０３から放射される音波によって形成される。このときの領域Ｃ１の境界面上の座標ｒ_l＝（ｘ_l，ｙ_l，ｚ_l）^Tにおける音圧ｐは、以下の式にて表される。ｌ（エル）は、座標ｒ_lの点の番号を示し、後述するスピーカ１０６－ｌの番号（系統の番号）に相当する。

ここで、ｋは波数である。尚、図１１の座標ｒ₁は、ｌ＝１の場合を示している。 The original sound field of (1) is formed in the xyz space by sound waves radiated from a sound source 103 located at a point with coordinate r _S = (x _S , y _S , z _S ) ^T outside the area C1. The sound pressure p at coordinate r _l = (x _l , y _l , z _l ) ^T on the boundary surface of the area C1 in this case is expressed by the following formula: l (el) indicates the number of the point with coordinate r _l , and corresponds to the number (system number) of the speaker 106-l described later.

Here, k is the wave number. Note that the coordinate _r1 in FIG. 11 indicates the case where l=1.

さらに、領域Ｃ１の境界面上の座標ｒ_l＝（ｘ_l，ｙ_l，ｚ_l）^Tの点における境界法線方向内向きの単位ベクトル（境界面内向き法線ベクトル）をｎ（ｒ_l）とする。また、座標ｒ_Sの点から座標ｒ_lの点までを結んだベクトル（ｒ_l－ｒ_S）と境界面内向き法線ベクトルｎ（ｒ_l）との成す角をθ_lとする。座標ｒ_lの点における法線方向の音圧勾配ベクトルは、以下の式で表される。

Furthermore, the unit vector inward in the boundary normal direction (boundary surface inward normal vector) at the point of coordinate r _l = (x _l , y _l , z _l ) ^T on the boundary surface of area C1 is defined as n(r _l ). Also, the angle between the vector (r _l -r _S ) connecting the point of coordinate r _S to the point of coordinate r _l and the boundary surface inward normal vector n(r _l ) is defined as θ _l . The sound pressure gradient vector in the normal direction at the point of coordinate r _l is expressed by the following formula.

したがって、キルヒホッフ－ヘルムホルツ積分方程式によれば、（２）の再現音場において離散的に配置されたスピーカアレイ１０５を、各スピーカ要素の座標ｒ_lに応じて以下の式の駆動信号ｄ（ｒ_l）にて駆動させることにより、座標ｒ_Sの点に仮想音源１０４の点音源が配置された音場を、領域Ｃ２内部に再現することができる。

Therefore, according to the Kirchhoff-Helmholtz integral equation, by driving the speaker array 105, which is discretely arranged in the reproduced sound field of (2), with a drive signal d(r _l ) of the following equation according to the coordinate r _l of each speaker element, it is possible to reproduce a sound field in which a point sound source of the virtual sound source 104 is arranged at the point of coordinate r _S within the area C2.

ここで、ΔＳ_lは、領域Ｃ２の境界面を、スピーカ位置を中心とする面積要素に分割したときのｌ番目の要素の面積（面積要素）であり、スピーカ要素が密に配置されているほど小さい値をとり、疎に配置されているほど大きい値をとる。 Here, ΔS _l is the area (area element) of the lth element when the boundary surface of area C2 is divided into area elements centered on the speaker position, and the more densely the speaker elements are arranged, the smaller the value, and the more sparsely the speaker elements are arranged, the larger the value.

尚、ｘｙｚ空間において、ｒ_Sは、仮想音源１０４の座標（仮想音源座標）であり、ｒ_lは、スピーカアレイ１０５を構成する第ｌ番目のスピーカ要素の座標（スピーカ座標）である。 In the xyz space, r _S is the coordinate of the virtual sound source 104 (virtual sound source coordinates), and r _l is the coordinate of the l-th speaker element that constitutes the speaker array 105 (speaker coordinates).

また、ｗ_C（θ_l）は、以下の式にて定義される関数である。

Furthermore, w _C (θ _l ) is a function defined by the following equation.

〔オブジェクトベース音響レンダリング装置〕
次に、実施例１，２，３のオブジェクトベース音響レンダリング装置について説明する。 [Object-based audio rendering device]
Next, the object-based audio rendering apparatuses according to the first, second and third embodiments will be described.

実施例１では、ＷＦＳの音場再現手法により決定されるスピーカの駆動信号ｄ（ｒ_l）を示す前記式（８）において、仮想音源座標ｒ_Sの位置に配置された仮想音源１０４とスピーカ座標ｒ_lの位置に配置されたスピーカとの間の距離による時間遅延を表しているｅ^-jk||rl-rs||の項を無視することで、厳密な波面の再現と引き換えにはなるが、システムを簡略化する。 In the first embodiment, in the above equation (8) showing the speaker drive signal d(r _l ) determined by the WFS sound field reproduction method, the term e -jk||r _{l -rs||, which represents the time delay due to the distance between the virtual sound source 104 located at the virtual sound source coordinate r S} and the speaker located at the ^speaker coordinate r _l, is ignored, thereby simplifying the system at the expense of accurate wavefront reproduction.

また、実施例１では、前記式（８）のｊｋ||ｒ_l－ｒ_S||＋１の項は、仮想音源１０４とスピーカとの間の距離により変化するハイパス特性を持ち、この特性を、当該距離に依存する低次のハイパスフィルタｈ（ｒ_l，ｒ_S）（ＨＰＦ）にて近似する。 In addition, in the first embodiment, the term jk∥r _l -r _S ∥+1 in the above equation (8) has a high-pass characteristic that changes depending on the distance between the virtual sound source 104 and the speaker, and this characteristic is approximated by a low-order high-pass filter h(r _l , r _S ) (HPF) that depends on the distance.

これにより、実施例１では、ＷＦＳの構成を近似により単純化することで、システムの規模を拡大することなく、ＷＦＳに基づき音源の距離の遠近を表現する駆動信号を生成することができる。 As a result, in Example 1, by simplifying the WFS configuration through approximation, it is possible to generate a drive signal that expresses the distance of a sound source based on the WFS without expanding the scale of the system.

また、実施例２，３では、実施例１の構成に対してＶＢＡＰの構成を組み込むことで、実施例１の効果に加え、スピーカが疎に配置された場合に駆動信号が小さくなる問題を解決する。 In addition, in Examples 2 and 3, by incorporating a VBAP configuration into the configuration of Example 1, in addition to the effects of Example 1, the problem of the drive signal becoming small when the speakers are sparsely placed is resolved.

〔実施例１〕
まず、実施例１のオブジェクトベース音響レンダリング装置について説明する。前述のとおり、実施例１は、スピーカの駆動信号ｄ（ｒ_l）を生成する前記式（８）において、時間遅延を表すｅ^-jk||rl-rs||の項を無視し、ｊｋ||ｒ_l－ｒ_S||＋１の項をＨＰＦとして扱い、仮想音源１０４とスピーカとの間の距離により変化するハイパス特性を、当該距離に依存する低次のハイパスフィルタｈ（ｒ_l，ｒ_S）（ＨＰＦ）にて近似する。 Example 1
First, an object-based acoustic rendering device according to Example 1 will be described. As described above, in Example 1, in the above-mentioned formula (8) for generating the speaker drive signal d(r _l ), the term e ^{-jk||r l -rs||} representing a time delay is ignored, and the term jk||r _l -r _S ||+1 is treated as an HPF, and the high-pass characteristics that change depending on the distance between the virtual sound source 104 and the speaker are approximated by a low-order high-pass filter h(r _l , r _S ) (HPF) that depends on the distance.

図１は、実施例１のオブジェクトベース音響レンダリング装置の構成例を示すブロック図であり、図２は、実施例１のオブジェクトベース音響レンダリング装置の処理例を示す疑似コードである。 Figure 1 is a block diagram showing an example of the configuration of an object-based acoustic rendering device of the first embodiment, and Figure 2 is pseudocode showing an example of the processing of the object-based acoustic rendering device of the first embodiment.

このオブジェクトベース音響レンダリング装置１は、信号増幅部９－１～９－Ｌ、ＨＰＦ（ハイパスフィルタ）１０－１～１０－Ｌ、減衰付加部１１－１～１１－Ｌ、面積要素付加部１２－１～１２－Ｌ、角度重み付加部１３－１～１３－Ｌ及び角度算出部２０を備えている。Ｌはスピーカ１０６－１～１０６－Ｌの個数であり、系統の数でもある。 This object-based acoustic rendering device 1 includes signal amplifiers 9-1 to 9-L, HPFs (high pass filters) 10-1 to 10-L, attenuation adding units 11-1 to 11-L, area element adding units 12-1 to 12-L, angle weighting adding units 13-1 to 13-L, and an angle calculation unit 20. L is the number of speakers 106-1 to 106-L, and is also the number of systems.

以下、オブジェクトベース音響レンダリング装置１において、第ｌ（エル）番目のスピーカ１０６－ｌ（ｌ番目の系統）に対応する構成部を、それぞれ信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ及び角度重み付加部１３－ｌとする。つまり、オブジェクトベース音響レンダリング装置１は、Ｌ系統の信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ及び角度重み付加部１３－ｌを備え、さらに角度算出部２０を備えている。ｌ＝１，・・・，Ｌである。 Hereinafter, in the object-based acoustic rendering device 1, the components corresponding to the lth speaker 106-l (lth system) are respectively the signal amplifier 9-l, HPF 10-l, attenuation adding unit 11-l, surface area element adding unit 12-l, and angle weighting adding unit 13-l. In other words, the object-based acoustic rendering device 1 comprises L system signal amplifiers 9-l, HPF 10-l, attenuation adding unit 11-l, surface area element adding unit 12-l, and angle weighting adding unit 13-l, and further comprises an angle calculation unit 20. l = 1, ..., L.

図２を参照してオブジェクトベース音響レンダリング装置１の全体処理について説明する。オブジェクトベース音響レンダリング装置１は、音源信号Ｓを入力すると共に、仮想音源座標ｒ_Sを入力する（ステップＳ２０１）。 The overall processing of the object-based sound rendering apparatus 1 will be described with reference to Fig. 2. The object-based sound rendering apparatus 1 receives a sound source signal S and virtual sound source coordinates _rS (step S201).

オブジェクトベース音響レンダリング装置１は、スピーカ１０６－１～１０６－Ｌのそれぞれについて（スピーカ１０６－ｌの系統について）、スピーカ１０６－ｌのスピーカ座標ｒ_l、スピーカ１０６－ｌの面積要素ΔＳ_l、及びスピーカ１０６－ｌの位置での境界面内向き法線ベクトルｎ（ｒ_l）を入力する（ステップＳ２０２）。 The object-based acoustic rendering device 1 inputs, for each of the speakers 106-1 to 106-L (for the speaker 106-l system), the speaker coordinates r _l of the speaker 106-l, the area element ΔS _l of the speaker 106-l, and the inward normal vector n(r _l ) of the boundary surface at the position of the speaker 106-l (step S202).

ここで、スピーカ座標ｒ_l、面積要素ΔＳ_l及び境界面内向き法線ベクトルｎ（ｒ_l）における下付けの“ｌ”（エル）は、スピーカ１０６－１～１０６－Ｌの番号（系統の番号）である。 Here, the subscript "l" in the speaker coordinate r _l , the surface area element ΔS _l and the boundary surface inward normal vector n(r _l ) is the number (system number) of the speakers 106-1 to 106-L.

また、音源信号Ｓは、例えば放送局から放送される信号であり、仮想音源座標ｒ_Sは、例えば放送局から放送される音響メタデータに含まれるデータである。スピーカ座標ｒ_l、面積要素ΔＳ_l及び境界面内向き法線ベクトルｎ（ｒ_l）は、例えば当該オブジェクトベース音響レンダリング装置１を操作するユーザにより予め設定されるスピーカレイアウト情報に含まれるデータである。 The sound source signal S is, for example, a signal broadcast from a broadcast station, and the virtual sound source coordinates r _S are, for example, data included in the audio metadata broadcast from the broadcast station. The speaker coordinates r _l , the area element ΔS _l and the boundary surface inward normal vector n(r _l ) are, for example, data included in speaker layout information preset by a user who operates the object-based audio rendering device 1.

この場合、オブジェクトベース音響レンダリング装置１は、放送局から送信された音源信号Ｓ及び仮想音源座標ｒ_Sを受信し、ユーザにより予め設定されたスピーカ座標ｒ_l、面積要素ΔＳ_l及び境界面内向き法線ベクトルｎ（ｒ_l）を入力する。 In this case, the object-based acoustic rendering device 1 receives a sound source signal S and virtual sound source coordinates r _S transmitted from a broadcasting station, and inputs speaker coordinates r _l , area element ΔS _{l ,} and boundary surface inward normal vector n(r _l ) preset by the user.

オブジェクトベース音響レンダリング装置１は、スピーカ１０６－ｌについて、ステップＳ２０１にて入力した仮想音源座標ｒ_S、及びステップＳ２０２にて入力したスピーカ座標ｒ_lに基づき、所定の次数の係数を有するＨＰＦ１０－ｌのＨＰＦｈ（ｒ_l，ｒ_S）を設定する（ステップＳ２０３）。 The object-based acoustic rendering device 1 sets HPFh(r l , r _{S )} of the HPF 10-l having coefficients of a predetermined order for the speaker 106-l based on the virtual sound source coordinates r _S input in step S201 and the speaker coordinates r _l input in step _S202 (step S203).

オブジェクトベース音響レンダリング装置１は、スピーカ１０６－ｌについて、ステップＳ２０１にて入力した仮想音源座標ｒ_S、及びステップＳ２０２にて入力したスピーカ座標ｒ_lに基づき、仮想音源１０４とスピーカ１０６－ｌとの間の距離に応じた減衰付加部１１－ｌの減衰係数１／||ｒ_l－ｒ_S||²を設定する（ステップＳ２０４）。 The object-based acoustic rendering device 1 sets the attenuation coefficient 1/||r l -r S ∥ 2 of the attenuation adding unit 11-l for the _speaker 106-l based on the virtual sound source coordinates r _S input in step S201 and the speaker coordinates _r _l input in step ^S202 , in accordance with the distance between the virtual sound source 104 and the speaker 106-l (step S204).

つまり、オブジェクトベース音響レンダリング装置１は、仮想音源１０４位置からスピーカ１０６－ｌ位置までのベクトルの絶対値||ｒ_l－ｒ_S||を求め、当該絶対値を２乗した結果||ｒ_l－ｒ_S||²の逆数を減衰係数１／||ｒ_l－ｒ_S||²として設定する。 In other words, the object-based acoustic rendering device 1 calculates the absolute value ||r _l -r _S || of the vector from the position of the virtual sound source 104 to the position of the speaker 106-l, and sets the inverse of the result, ||r _l -r _S || ² , of squaring the absolute value as the attenuation coefficient 1/||r _l -r _S || ² .

オブジェクトベース音響レンダリング装置１は、スピーカ１０６－ｌについて、ステップＳ２０１にて入力した仮想音源座標ｒ_S、並びにステップＳ２０２にて入力したスピーカ座標ｒ_l及び境界面内向き法線ベクトルｎ（ｒ_l）に基づき、仮想音源１０４位置からスピーカ１０６－ｌ位置を結んだベクトル（ｒ_l－ｒ_S）とスピーカ１０６－ｌ位置での境界面内向き法線ベクトルｎ（ｒ_l）との成す角θ_lを算出する（ステップＳ２０５）。この成す角θ_lは、角度算出部２０により算出される。 The object-based acoustic rendering device 1 calculates an angle θ _l between a vector (r _l -r S ) connecting the position of the virtual sound source 104 to the position of the speaker 106-l and the boundary surface inward normal vector n(r _l ) at the position of the speaker 106-l, based on the virtual sound source coordinate r _S input in step S201, _and the speaker _coordinate r l and boundary surface inward normal vector n(r _l ) input in step S202 (step S205). This angle θ _l is calculated by the angle calculation unit 20.

オブジェクトベース音響レンダリング装置１は、スピーカ１０６－ｌについて、ステップＳ２０５にて算出した成す角θ_lの絶対値｜θ_l｜が０°よりも大きく、かつ９０°よりも小さいか否かを判定し、前記式（９）にて角度重みｗ_C（θ_l）を設定する（ステップＳ２０６）。 The object-based acoustic rendering device 1 determines whether the absolute value |θ _l | of the angle θ _l calculated for the speaker 106-l in step S205 is greater than 0° and less than 90°, and sets the angle weight w _C (θ _l ) in accordance with equation (9) (step S206).

具体的には、オブジェクトベース音響レンダリング装置１は、成す角｜θ_l｜が０°よりも大きく、かつ９０°よりも小さいと判定した場合（前記式（９）ではｃｏｓθ_l＞０の場合）、角度重みｗ_C（θ_l）＝ｃｏｓθ_lを設定する。一方、オブジェクトベース音響レンダリング装置１は、成す角｜θ_l｜が０°以下であるか、または９０°以上であると判定した場合（前記式（９）ではｃｏｓθ_l＞０以外の場合）、角度重みｗ_C（θ_l）＝０を設定する。この角度重みｗ_C（θ_l）は、角度重み付加部１３－ｌにて用いられる。 Specifically, if the object-based sound rendering device 1 determines that the formed angle |θ _l | is greater than 0° and less than 90° (if cos θ _l > 0 in the above formula (9)), it sets the angle weight w _C (θ _l ) = cos θ _l . On the other hand, if the object-based sound rendering device 1 determines that the formed angle |θ _l | is less than 0° or greater than 90° (if other than cos θ _l > 0 in the above formula (9)), it sets the angle weight w _C (θ _l ) = 0. This angle weight w _C (θ _l ) is used by angle weight adding section 13-l.

角度重みｗ_C（θ_l）は、０から１までの範囲の値をとり、成す角｜θ_l｜が０°＜｜θ_l｜＜９０°の範囲において、成す角｜θ_l｜が０°に近いほど１に近い値をとり、９０°に近いほど０に近い値をとり、成す角｜θ_l｜が｜θ_l｜≦０°または９０°≦｜θ_l｜の範囲において、０の値をとる。 The angle weight _wC ( _θl ) takes a value in the range from 0 to 1. When the angle | _θl | is in the range of 0°<| _θl |<90°, the closer the angle | _θl | is to 0°, the closer the value is to 1. When the angle |θl| is closer to 90°, the closer the value is to 0. When the angle | _θl | is in the range of | _θl |≦0° or 90°≦| _θl |, the angle weight takes a value of 0.

オブジェクトベース音響レンダリング装置１は、スピーカ１０６－ｌについて、定数ｕ（例えばｕ＝１／２π、系統毎に異なる値となる場合もあり得る）、ステップＳ２０１にて入力した音源信号Ｓ及び仮想音源座標ｒ_S、ステップＳ２０２にて入力したスピーカ座標ｒ_l及び面積要素ΔＳ_l、ステップＳ２０３にて設定したＨＰＦｈ（ｒ_l，ｒ_S）、ステップＳ２０４にて設定した減衰係数１／||ｒ_l－ｒ_S||²及びステップＳ２０６にて設定した角度重みｗ_C（θ_l）を用いて、以下の式にて駆動信号ｄ（ｒ_l）を算出し出力する（ステップＳ２０７）。

＊は畳み込みを表す。この駆動信号ｄ（ｒ_l）は、信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ及び角度重み付加部１３－ｌにより算出される。 The object-based acoustic rendering device 1 calculates and outputs a drive signal d(r l ) for the speaker 106-l using the constant u (for example, u = 1/2π, which may be a different value for each system), the sound source signal S and virtual sound source coordinates r _S input in step S201, the speaker coordinates r _l and area element ΔS _l input in step S202, the HPFh(r _l , r _S ) set in step S203, the attenuation coefficient 1/||r _l _-r _S || ² set in step S204, and the angle weight w _C (θ _l ) set in step S206, using the following equation (step S207).

The symbol * denotes convolution. This drive signal d(r _l ) is calculated by a signal amplifier 9-l, a HPF 10-l, an attenuation adding section 11-l, an area element adding section 12-l and an angle weighting adding section 13-l.

次に、図１を参照してオブジェクトベース音響レンダリング装置１の構成部の処理について説明する。信号増幅部９－ｌ（９－１～９－Ｌ）は、音源信号Ｓを入力し、音源信号Ｓに対し定数ｕ（例えば１／２π）を乗算することで音源信号Ｓを増幅し、増幅した信号をＨＰＦ１０－ｌに出力する。 Next, the processing of the components of the object-based acoustic rendering device 1 will be described with reference to Fig. 1. The signal amplifier 9-l (9-1 to 9-L) inputs the sound source signal S, amplifies the sound source signal S by multiplying it by a constant u (e.g., 1/2π), and outputs the amplified signal to the HPF 10-l.

ＨＰＦ１０－ｌ（１０－１～１０－Ｌ）は、仮想音源座標ｒ_S及びスピーカ座標ｒ_lを入力する。そして、ＨＰＦ１０－ｌは、仮想音源座標ｒ_S及びスピーカ座標ｒ_lに基づいて、ステップＳ２０３のとおり、所定の次数の係数を有するＨＰＦｈ（ｒ_l，ｒ_S）を設定する。そして、ＨＰＦ１０－ｌは、信号増幅部９－ｌから増幅された信号を入力し、当該信号に対し、ＨＰＦｈ（ｒ_l，ｒ_S）を用いてフィルタ処理を施し、フィルタ処理を施した信号を減衰付加部１１－ｌに出力する。 The HPF 10-l (10-1 to 10-L) receives the virtual sound source coordinates r _S and the speaker coordinates r _L. Based on the virtual sound source coordinates r _S and the speaker coordinates r _L , the HPF 10-l sets an HPFh (r _L , r _S ) having a predetermined order of coefficients as in step S203. The HPF 10-l receives the amplified signal from the signal amplifier 9-l, filters the signal using the HPFh (r _L , r _S ), and outputs the filtered signal to the attenuation adder 11-l.

減衰付加部１１－ｌ（１１－１～１１－Ｌ）は、仮想音源座標ｒ_S及びスピーカ座標ｒ_lを入力し、仮想音源座標ｒ_S及びスピーカ座標ｒ_lに基づいて、ステップＳ２０４のとおり、減衰係数１／||ｒ_l－ｒ_S||²を設定する。そして、減衰付加部１１－ｌは、ＨＰＦ１０－ｌからフィルタ処理が施された信号を入力し、当該信号に対し減衰係数１／||ｒ_l－ｒ_S||²を乗算することで、仮想音源１０４とスピーカ１０６－ｌとの間の距離に応じて信号を減衰させる。減衰付加部１１－ｌは、減衰させた信号（減衰を付加した信号）を面積要素付加部１２－ｌに出力する。 The attenuation adding unit 11-l (11-1 to 11-L) receives the virtual sound source coordinates r _S and the speaker coordinates r _L , and sets the attenuation coefficient 1/||r _L -r _S ^∥2 based on the virtual sound source coordinates r _S and the speaker coordinates r _L as in step S204. The attenuation adding unit 11-l then receives the filtered signal from the HPF 10-l, and multiplies the signal by the attenuation coefficient 1/||r _L -r _S ^∥2 to attenuate the signal in accordance with the distance between the virtual sound source 104 and the speaker 106-l. The attenuation adding unit 11-l outputs the attenuated signal (the signal with attenuation added) to the surface element adding unit 12-l.

面積要素付加部１２－ｌ（１２－１～１２－Ｌ）は、面積要素ΔＳ_lを入力する。面積要素ΔＳ_lは、スピーカ１０６－ｌが密に配置されている場合、小さい値となり、スピーカ１０６－ｌが疎に配置されている場合、大きい値となる。 The area element adding unit 12-l (12-1 to 12-L) inputs an area element ΔS _l . The area element ΔS _l takes a small value when the speakers 106-l are densely arranged, and takes a large value when the speakers 106-l are sparsely arranged.

そして、面積要素付加部１２－ｌは、減衰付加部１１－ｌから減衰が付加された信号を入力し、当該信号に対し面積要素ΔＳ_lを乗算することで、スピーカ１０６－ｌの離散度合いに応じて面積要素を付加した信号を生成する。面積要素付加部１２－ｌは、面積要素を付加した信号を角度重み付加部１３－ｌに出力する。 The area element adding unit 12-l receives the signal to which attenuation has been added from the attenuation adding unit 11-l and multiplies the signal by an area element ΔS _l to generate a signal to which an area element has been added according to the degree of discreteness of the speaker 106-l. The area element adding unit 12-l outputs the signal to which the area element has been added to the angle weighting unit 13-l.

角度算出部２０は、仮想音源座標ｒ_S、スピーカ座標ｒ₁～ｒ_L及び境界面内向き法線ベクトルｎ（ｒ₁）～ｎ（ｒ_L）を入力する。そして、角度算出部２０は、ステップＳ２０５のとおり、スピーカ１０６－１～１０６－Ｌのそれぞれ（スピーカ１０６－ｌ）について、仮想音源１０４位置からスピーカ１０６－ｌ位置を結んだベクトル（ｒ_l－ｒ_S）を求め、当該ベクトル（ｒ_l－ｒ_S）とスピーカ１０６－ｌ位置での境界面内向き法線ベクトルｎ（ｒ_l）との成す角θ_lを算出する。角度算出部２０は、成す角θ_l（θ₁～θ_L）を、対応する角度重み付加部１３－ｌ（１３－１～１３－Ｌ）に出力する。 The angle calculation unit 20 inputs the virtual sound source coordinates r _S , the speaker coordinates r ₁ to r _L and the boundary surface inward normal vectors n (r ₁ ) to n (r _L ). Then, as in step S205, the angle calculation unit 20 obtains a vector (r _l -r S ₎ connecting the virtual sound source 104 position to the speaker 106-l position for each of the speakers 106-1 to 106-L (speaker 106-l), and calculates the angle θ _l between the vector (r _l -r _S ) and the boundary surface inward normal vector n (r _l ) at the speaker 106-l position. The angle calculation unit 20 outputs the formed angle θ _l (θ ₁ to θ _L ) to the corresponding angle weighting unit 13-l (13-1 to 13-L).

図３は、角度算出部２０の構成例を示すブロック図である。この角度算出部２０は、算出部３０－１～３０－Ｌを備えている。算出部３０－１は、スピーカ１０６－１についての成す角θ₁を算出する構成部であり、仮想音源座標ｒ_S、並びに第ｌ＝１番目のスピーカ座標ｒ₁及び境界面内向き法線ベクトルｎ（ｒ₁）を入力する。そして、算出部３０－１は、仮想音源１０４位置からスピーカ１０６－ｌ位置を結んだベクトル（ｒ₁－ｒ_S）を求め、当該ベクトル（ｒ₁－ｒ_S）とスピーカ１０６－１位置での境界面内向き法線ベクトルｎ（ｒ₁）との成す角θ₁を算出する。算出部３０－１は、成す角θ₁を角度重み付加部１３－１に出力する。 3 is a block diagram showing an example of the configuration of the angle calculation unit 20. The angle calculation unit 20 includes calculation units 30-1 to 30-L. The calculation unit 30-1 is a component that calculates the angle θ ₁ for the speaker 106-1, and inputs the virtual sound source coordinate r _S , the l=1st speaker coordinate r ₁ , and the boundary surface inward normal vector n (r ₁ ). The calculation unit 30-1 then obtains a vector (r ₁ -r _S ) that connects the virtual sound source 104 position to the speaker 106-l position, and calculates the angle θ ₁ between the vector (r ₁ -r _S ) and the boundary surface inward normal vector n (r ₁ ) at the speaker 106-1 position. The calculation unit 30-1 outputs the angle θ ₁ to the angle weighting unit 13-1.

同様に、算出部３０－Ｌは、スピーカ１０６－Ｌについての成す角θ_Lを算出する構成部であり、仮想音源座標ｒ_S、並びに第ｌ＝Ｌ番目のスピーカ座標ｒ_L及び境界面内向き法線ベクトルｎ（ｒ_L）を入力する。そして、算出部３０－Ｌは、仮想音源１０４位置からスピーカ１０６－ｌ位置を結んだベクトル（ｒ_L－ｒ_S）を求め、当該ベクトル（ｒ_L－ｒ_S）とスピーカ１０６－Ｌ位置での境界面内向き法線ベクトルｎ（ｒ_L）との成す角θ_Lを算出する。算出部３０－Ｌは、成す角θ_Lを角度重み付加部１３－Ｌに出力する。 Similarly, the calculation unit 30-L is a component that calculates the angle θ _L for the speaker 106-L, and receives as input the virtual sound source coordinates r _S , the l=Lth speaker coordinates r _L and the boundary surface inward normal vector n(r _L ). The calculation unit 30-L then obtains a vector (r _L -r _S ) connecting the position of the virtual sound source 104 to the position of the speaker 106-l, and calculates the angle θ _L between the vector (r _L -r _S ) and the boundary surface inward normal vector n(r _L ) at the speaker 106-L position. The calculation unit 30-L outputs the angle θ _L to the angle weighting unit 13-L.

算出部３０－２～３０－（Ｌ－１）についても、同様の処理が行われ、仮想音源１０４位置からスピーカ１０６－２～１０６－（Ｌ－１）位置を結んだベクトル（ｒ₂～ｒ_L-1－ｒ_S）とスピーカ１０６－２～１０６－（Ｌ－１）位置での境界面内向き法線ベクトルｎ（ｒ₂）～ｎ（ｒ_L-1）との成す角θ₂～θ_L-1がそれぞれ算出される。 Similar processing is performed for calculation units 30-2 to 30-(L-1), and angles θ 2 to θ L-1 between the vector (r ₂ to r _L-1 -r _S ) connecting the position of the virtual sound source 104 to the positions of the speakers 106-2 to 106-(L-1) and the inward normal vectors n(r ₂ ) to n(r _L-1 ) of the boundary surface at the positions of the speakers _106-2 to 106-( _L-1 ) are calculated, respectively.

図１に戻って、角度重み付加部１３－ｌ（１３－１～１３－Ｌ）は、角度算出部２０から成す角θ_l（θ₁～θ_L）を入力し、成す角θ_lに基づいて、ステップＳ２０６のとおり、前記式（９）にて角度重みｗ_C（θ_l）を設定する。そして、角度重み付加部１３－ｌは、面積要素付加部１２－ｌから面積要素が付加された信号を入力し、当該信号に対し角度重みｗ_C（θ_l）を乗算することで、成す角θ_lを反映した信号を生成する。 1, angle weighting unit 13-l (13-1 to 13-L) receives angle θ _l (θ ₁ to θ _L ) from angle calculation unit 20, and sets angle weight w _C (θ _l ) using equation (9) based on angle θ _l as in step S206. Then, angle weighting unit 13-l receives a signal to which an area element has been added from area element adding unit 12-l, and multiplies the signal by angle weight w _C (θ _l ) to generate a signal reflecting angle θ _l .

成す角θ_lを反映した信号は、成す角θ_lが０°に近いほど、面積要素が付加された信号に近くなる。一方、成す角θ_lを反映した信号は、成す角θ_lが９０°に近いほど、０の信号に近くなる。 The signal reflecting the angle _θl approaches a signal to which an area element has been added as the angle _θl approaches 0°. On the other hand, the signal reflecting the angle _θl approaches a signal of 0 as the angle _θl approaches 90°.

角度重み付加部１３－ｌは、成す角θ_lを反映した信号を駆動信号ｄ（ｒ_l）としてスピーカ１０６－ｌへ出力する。 The angle weighting unit 13-l outputs a signal reflecting the formed angle θ _l as a drive signal d(r _l ) to the speaker 106-l.

尚、図１及び図２では、１つの音響オブジェクトの音源信号Ｓに対するオブジェクトベース音響のレンダリングの構成及び処理を示しているが、実際は、複数の音響オブジェクトの音源信号Ｓに対して処理が行われる。この場合、音響オブジェクト毎に生成された駆動信号ｄ（ｒ_l）がスピーカ１０６－ｌ毎に加算され、加算結果の駆動信号がスピーカ１０６－ｌへ出力される。後述する図５，６（実施例２）及び図８，９（実施例３）についても同様である。 1 and 2 show the configuration and processing of object-based audio rendering for the sound source signal S of one audio object, but in reality, processing is performed for the sound source signals S of a plurality of audio objects. In this case, the drive signals d(r _l ) generated for each audio object are added for each speaker 106-l, and the drive signal resulting from the addition is output to the speaker 106-l. The same applies to Figs. 5 and 6 (Example 2) and Figs. 8 and 9 (Example 3) described later.

以上のように、実施例１のオブジェクトベース音響レンダリング装置１によれば、スピーカ１０６－ｌの駆動信号ｄ（ｒ_l）を生成する前記式（８）において、時間遅延を表すｅ^-jk||rl-rs||の項を無視し、ｊｋ||ｒ_l－ｒ_S||＋１の項を、仮想音源１０４とスピーカ１０６－ｌとの間の距離に依存する低次のハイパスフィルタｈ（ｒ_l，ｒ_S）（ＨＰＦ）として近似するようにした。 As described above, according to the object-based acoustic rendering device 1 of the first embodiment, in the above equation (8) for generating the drive signal d(r _l ) for the speaker 106-l, the term e ^{-jk||r l -rs||} representing a time delay is ignored, and the term jk||r _l -r _S || + 1 is approximated as a low-order high-pass filter h(r _l , r _S ) (HPF) that depends on the distance between the virtual sound source 104 and the speaker 106-l.

具体的には、信号増幅部９－ｌは、音源信号Ｓに対し定数ｕ（系統毎に異なる値となる場合もあり得る）を乗算することで、音源信号Ｓを増幅する。ＨＰＦ１０－ｌは、仮想音源座標ｒ_S及びスピーカ座標ｒ_lに基づいて、所定の次数の係数を有するＨＰＦｈ（ｒ_l，ｒ_S）を設定する。そして、ＨＰＦ１０－ｌは、信号増幅部９－ｌにより増幅された信号に対し、ＨＰＦｈ（ｒ_l，ｒ_S）を用いてフィルタ処理を施す。 Specifically, the signal amplifier 9-l amplifies the sound source signal S by multiplying the sound source signal S by a constant u (which may have a different value for each system). The HPF 10-l sets an HPFh(r _l , r _S ) having a coefficient of a predetermined order based on the virtual sound source coordinates r _S and the speaker coordinates r _l . The HPF 10-l then performs filtering on the signal amplified by the signal amplifier 9-l using the HPFh(r _l , r _S ).

減衰付加部１１－ｌは、仮想音源座標ｒ_S及びスピーカ座標ｒ_lに基づいて減衰係数１／||ｒ_l－ｒ_S||²を設定する。そして、減衰付加部１１－ｌは、フィルタ処理が施された信号に対し減衰係数１／||ｒ_l－ｒ_S||²を乗算することで、仮想音源１０４とスピーカ１０６－ｌとの間の距離に応じて減衰した信号を生成する。 The attenuation adding unit 11-l sets an attenuation coefficient 1/∥r _l -r _S ∥ ² based on the virtual sound source coordinates r _S and the speaker coordinates r _l . Then, the attenuation adding unit 11-l multiplies the filtered signal by the attenuation coefficient 1/∥r _l -r _S ∥ ² to generate a signal attenuated according to the distance between the virtual sound source 104 and the speaker 106-l.

面積要素付加部１２－ｌは、減衰した信号に対しスピーカ１０６－ｌの面積要素ΔＳ_lを乗算することで、面積要素を付加した信号を生成する。 The area element adding unit 12-l multiplies the attenuated signal by an area element ΔS _l of the speaker 106-l to generate a signal to which an area element has been added.

角度算出部２０は、仮想音源１０４位置からスピーカ１０６－ｌ位置を結んだベクトル（ｒ_l－ｒ_S）とスピーカ１０６－ｌ位置での境界面内向き法線ベクトルｎ（ｒ_l）との成す角θ_lを算出する。 The angle calculation unit 20 calculates the angle θ l between the vector (r _l -r _s ) connecting the position of the virtual sound source ₁₀₄ to the position of the speaker 106-l and the inward normal vector n (r _l ) of the boundary surface at the position of the speaker 106-l.

角度重み付加部１３－ｌは、成す角θ_lに基づいて、前記式（９）にて角度重みｗ_C（θ_l）を設定する。そして、角度重み付加部１３－ｌは、面積要素が付加された信号に対し角度重みｗ_C（θ_l）を乗算することで、成す角θ_lを反映した信号を駆動信号ｄ（ｒ_l）としてスピーカ１０６－ｌへ出力する。 The angle weighting unit 13-l sets the angle weight w _C (θ _l ) in accordance with the equation (9) based on the angle θ _l . Then, the angle weighting unit 13-l multiplies the signal to which the area element has been added by the angle weight w _C (θ _l ) to output a signal reflecting the angle θ _l to the speaker 106-l as a drive signal d(r _l ).

これにより、前記式（８）における時間遅延を表すｅ^-jk||rl-rs||の項を無視するようにしたから、システムを簡略化することができる。また、ｊｋ||ｒ_l－ｒ_S||＋１の項を近似したＨＰＦｈ（ｒ_l，ｒ_S）のＨＰＦ１０－ｌを用いることで、システムを簡略化することができる。 This allows the system to be simplified because the term e ^-jk||rl-rs|| , which indicates the time delay in equation (8), is ignored. Also, the system can be simplified by using the HPF 10-l of HPFh(r _l , r _S ), which approximates the term jk||r _l -r _S ||+1.

したがって、ＷＦＳの構成を近似により単純化することで、システムの規模を拡大することなく、ＷＦＳに基づき仮想音源１０４の距離の遠近を表現する駆動信号ｄ（ｒ_l）を生成することができる。 Therefore, by simplifying the configuration of the WFS through approximation, it is possible to generate a driving signal d(r _l ) that expresses the distance of the virtual sound source 104 based on the WFS without increasing the scale of the system.

（ＨＰＦ１０－ｌの他の例）
尚、図１に示したオブジェクトベース音響レンダリング装置１のＨＰＦ１０－ｌ（１０－１～１０－Ｌ）は、前記式（８）においてｊｋ||ｒ_l－ｒ_S||＋１の項を近似したＨＰＦｈ（ｒ_l，ｒ_S）を用いて、信号増幅部９－ｌ（９－１～９－Ｌ）により増幅された信号に対してフィルタ処理を施すようにした。 (Another example of HPF10-1)
The HPF 10-l (10-1 to 10-L) of the object-based acoustic rendering device 1 shown in FIG. 1 performs filtering on the signal amplified by the signal amplifier 9-l (9-1 to 9-L) using HPFh (r _l , r _S ) which approximates the term jk∥r _l -r _S ∥+1 in the above equation (8).

例えば、前記式（８）においてｊｋ||ｒ_l－ｒ_S||＋１の項は、以下の式で表すハイパスフィルタに置き換えて使用する。

For example, the term jk∥r _l -r _s ∥+1 in the above equation (8) is replaced with a high-pass filter expressed by the following equation.

ｘ［ｔ］及びｙ［ｔ］は、時刻ｔにおける当該ＨＰＦ１０－ｌの入力と出力である。ＨＰＦ係数ｈ_n（ｒ_l，ｒ_S）は、以下の式で表されるものとする。ｎ＝０，１，・・・，Ｎである。

αは、フィルタの特性を決定するためのパラメータであり、正の実数である。Ｎはフィルタ次数である。これらのパラメータα及びフィルタ次数Ｎの値は、ユーザが任意に決定するものとする。 x[t] and y[t] are the input and output of the HPF 10-l at time t. The HPF coefficients h _n (r _l , r _s ) are expressed by the following equation, where n=0, 1, . . . , N.

Here, α is a parameter for determining the characteristics of the filter and is a positive real number, and N is the filter order. The values of the parameter α and the filter order N are determined arbitrarily by the user.

つまり、ＨＰＦ１０－ｌは、仮想音源座標ｒ_S及びスピーカ座標ｒ_lを入力すると共に、ユーザにより予め設定されたパラメータα及びフィルタ次数Ｎを入力する。そして、ＨＰＦ１０－ｌは、仮想音源座標ｒ_S、スピーカ座標ｒ_l、パラメータα及びフィルタ次数Ｎに基づいて、前記式（１２）を係数として有するＨＰＦ係数を設定する。 That is, the HPF 10-l receives the virtual sound source coordinates r _S and the speaker coordinates r _l , as well as the parameter α and the filter order N preset by the user. Then, the HPF 10-l sets HPF coefficients having the above-mentioned formula (12) as coefficients, based on the virtual sound source coordinates r _S , the speaker coordinates r _l , the parameter α, and the filter order N.

そして、ＨＰＦ１０－ｌは、信号増幅部９－ｌ（９－１～９－Ｌ）により増幅された信号を入力し、当該信号に対し、前記式（１２）のＨＰＦ係数ｈ_n（ｒ_l，ｒ_S）を用いて、当該ＨＰＦ１０－ｌの入出力特性が前記式（１１）となるようにフィルタ処理を施す。つまり、ＨＰＦ１０－ｌは、前記式（１１）の入出力特性となるようなＨＰＦにより構成される。 The HPF 10-l receives the signal amplified by the signal amplifier 9-l (9-1 to 9-L) and performs filtering on the signal using the HPF coefficient h _n (r _l , r _S ) of the above formula (12) so that the input/output characteristics of the HPF 10-l satisfy the above formula (11). In other words, the HPF 10-l is configured with an HPF that has the input/output characteristics of the above formula (11).

これにより、オブジェクトベース音響レンダリング装置１全体として、システムの規模が拡大することなく、簡略化することができる。 This allows the object-based acoustic rendering device 1 as a whole to be simplified without increasing the system scale.

（減衰付加部１１－ｌの他の例）
また、図１に示したオブジェクトベース音響レンダリング装置１の減衰付加部１１－ｌ（１１－１～１１－Ｌ）は、仮想音源座標ｒ_S及びスピーカ座標ｒ_lに基づいて、減衰係数１／||ｒ_l－ｒ_S||²を設定し、フィルタ処理が施された信号に対して減衰係数１／||ｒ_l－ｒ_S||²を乗算することで、仮想音源１０４とスピーカ１０６－ｌとの間の距離に応じて減衰した信号を生成するようにした。 (Another Example of the Attenuation Addition Unit 11-l)
In addition, the attenuation adding unit 11-l (11-1 to 11-L) of the object-based acoustic rendering device 1 shown in Figure 1 sets an attenuation coefficient 1/||r _l -r _S || ² based on the virtual sound source coordinates r _S and the speaker coordinates r _l , and multiplies the filtered signal by the attenuation coefficient 1/||r _l -r _S || ² to generate a signal that is attenuated in accordance with the distance between the virtual sound source 104 and the speaker 106-l.

しかしながら、減衰係数１／||ｒ_l－ｒ_S||²は、仮想音源座標ｒ_Sとスピーカ座標ｒ_lとが一致する場合、無限大に発散してしまい、結果として信号を再生することができなくなってしまう。そこで、減衰付加部１１－ｌは、減衰係数１／||ｒ_l－ｒ_S||²を近似した以下の式で表す減衰係数ｇ（ｒ_l，ｒ_S）を用いる。

However, when the virtual sound source coordinate r _S and the speaker coordinate r _l coincide with each other, the attenuation coefficient 1/||r _l -r _S || ² diverges to infinity, and as a result, the signal cannot be reproduced. Therefore, the attenuation adding unit 11-l uses the attenuation coefficient g(r _l , r _S ) expressed by the following formula that approximates the attenuation coefficient 1/||r _l -r _S || ² :

β₁は、信号振幅を調整するための正の実数であり、β₂は、減衰係数ｇ（ｒ_l，ｒ_S）の発散を抑えるための正の実数である。これらのパラメータβ₁，β₂の値は、ユーザが任意に決定するものとする。特に、パラメータβ₂の値は、減衰係数ｇ（ｒ_l，ｒ_S）の発散を抑えるように、微小な値がユーザにより設定される。 _β1 is a positive real number for adjusting the signal amplitude, and _β2 is a positive real number for suppressing the divergence of the attenuation coefficient g(r _l , r _S ). The values of these parameters _β1 and _β2 are determined arbitrarily by the user. In particular, the value of the parameter _β2 is set by the user to a small value so as to suppress the divergence of the attenuation coefficient g(r _l , r _S ).

つまり、減衰付加部１１－ｌは、仮想音源１０４位置からスピーカ１０６－ｌ位置までのベクトルの絶対値||ｒ_l－ｒ_S||を求め、当該絶対値を２乗した結果にパラメータβ₂を加算して加算結果（||ｒ_l－ｒ_S||²＋β₂）を求め、パラメータβ₁を加算結果（||ｒ_l－ｒ_S||²＋β₂）で除算することで、減衰係数ｇ（ｒ_l，ｒ_S）を求める。 In other words, the attenuation adding unit 11-l finds the absolute value ||r _l -r _S || of the vector from the position of the virtual sound source 104 to the position of the speaker 106-l, squares the absolute value, adds the parameter β ₂ to the result to find the addition result (||r _l -r _S || ² + β ₂ ), and divides the parameter β ₁ by the addition result (||r _l -r _S || ² + β ₂ ) to find the attenuation coefficient g(r _l , r _S ).

そして、減衰付加部１１－ｌは、フィルタ処理が施された信号に対してｇ（ｒ_l，ｒ_S）を乗算することで、仮想音源１０４とスピーカ１０６－ｌとの間の距離に応じて減衰した信号を生成する。 Then, the attenuation adding unit 11-l multiplies the filtered signal by g(r _l , r _s ) to generate a signal attenuated according to the distance between the virtual sound source 104 and the speaker 106-l.

パラメータβ₂の値が小さいほど、減衰係数ｇ（ｒ_l，ｒ_S）の振る舞いは減衰係数１／||ｒ_l－ｒ_S||²に近づく。一方で、仮想音源座標ｒ_Sがスピーカ座標ｒ_lに近づくほど、信号振幅が極端に増大する。 As the value of parameter _β2 becomes smaller, the behavior of the attenuation coefficient g(r _l , r _S ) approaches the attenuation coefficient 1/||r _l -r _S || ^2. On the other hand, as the virtual sound source coordinate r _S approaches the speaker coordinate r _l , the signal amplitude increases drastically.

これにより、信号の発散を抑制することができ、ユーザは、自らが設定するパラメータβ₁，β₂の値に応じて、音響オブジェクトの位置が変化するときの信号振幅の変動を調整することができる。 This makes it possible to suppress signal divergence, and the user can adjust the fluctuation in signal amplitude when the position of the sound object changes according to the values of parameters β ₁ and β ₂ that the user sets.

また、ユーザがパラメータβ₁，β₂の値をβ₁＝β₂に設定した場合、仮想音源座標ｒ_Sとスピーカ座標ｒ_lとが一致するときに（ｒ_S＝ｒ_l）、減衰係数ｇ（ｒ_l，ｒ_S）＝１となる。 Furthermore, if the user sets the values of the parameters _β1 and _β2 to _β1 = _β2 , when the virtual sound source coordinate _rS and the speaker coordinate _r1 coincide ( _rS = _r1 ), the attenuation coefficient g( _r1 , _r1 ) = 1.

このため、ユーザは、信号を不要に増大または減衰させることのないように、これらのパラメータβ₁，β₂の値を調整の目安とすることができる。 Therefore, the user can use the values of these parameters β ₁ and β ₂ as a guide for adjustment so as not to unnecessarily increase or attenuate the signal.

また、ユーザがパラメータβ₁，β₂の値をβ₁＝β₂／ΔＳ_lに設定した場合、仮想音源座標ｒ_Sとスピーカ座標ｒ_lとが一致するときに（ｒ_S＝ｒ_l）、減衰係数ｇ（ｒ_l，ｒ_S）ΔＳ_l＝１、さらにはｈ（ｒ_l，ｒ_S）ｇ（ｒ_l，ｒ_S）ΔＳ_lｗ_C（θ_l）＝１となる。ｈ（ｒ_l，ｒ_S）＝１及びｗ_C（θ_l）＝１だからである。 Furthermore, if the user sets the values of parameters _β1 and _β2 to _β1 = _β2 / _ΔS1 , when the virtual sound source coordinate _rS and the speaker coordinate _r1 coincide ( _rS = _r1 ), the attenuation coefficient g( _r1 , _rS ) _ΔS1 = 1, and furthermore h( _r1 , rS ₎ g( _r1 , _rS ) ΔS1 _wC ( _θl ) = _1. This is because h( _r1 , _rS ) = 1 and _wC ( _θl ) = 1.

このため、仮想音源座標ｒ_Sとスピーカ座標ｒ_lとが一致するときには、角度重み付加部１３－ｌからスピーカ１０６－ｌへ出力される駆動信号ｄ（ｒ_l）は、信号増幅部９－ｌにより増幅された信号と同じになる。つまり、スピーカ座標ｒ_lに位置するスピーカ１０６－ｌから、当該オブジェクトベース音響レンダリング装置１が入力した音源信号Ｓに対して増幅された信号が再生されることとなる。 Therefore, when the virtual sound source coordinate r _S and the speaker coordinate r _l coincide with each other, the drive signal d(r _l ) output from the angle weighting unit 13-l to the speaker 106-l becomes the same as the signal amplified by the signal amplifier 9-l. In other words, the speaker 106-l located at the speaker coordinate r _l reproduces an amplified signal of the sound source signal S input to the object-based acoustic rendering device 1.

このように、減衰付加部１１－ｌが前記式（１３）で表される減衰係数ｇ（ｒ_l，ｒ_S）を用いることで、仮想音源座標ｒ_Sとスピーカ座標ｒ_lとが一致した場合であっても、信号が無限大に発散することがない。このため、信号を再生できなくなるという不具合を解消することができる。 In this way, by using the attenuation coefficient g(r _l , r _S ) expressed by the above formula (13) in the attenuation adding unit 11-l, even if the virtual sound source coordinate r _S and the speaker coordinate r _l match, the signal does not diverge to infinity. This makes it possible to eliminate the problem of being unable to reproduce the signal.

〔実施例２〕
次に、実施例２のオブジェクトベース音響レンダリング装置について説明する。前述のとおり、実施例２は、実施例１の構成に対してＶＢＡＰの構成を組み込むことで、実施例１の効果に加え、スピーカ１０６－ｌが疎に配置された場合に駆動信号ｄ（ｒ_l）が小さくなる問題を解決する。より詳細には、実施例２は、実施例１に示した単純化したＷＦＳの一部の構成とＶＢＡＰの構成とを、角度重みに応じてシームレスに切り替えることで、駆動信号ｄ（ｒ_l）を生成する。 Example 2
Next, an object-based acoustic rendering device according to a second embodiment will be described. As described above, the second embodiment incorporates the configuration of VBAP into the configuration of the first embodiment, thereby achieving the effects of the first embodiment and solving the problem that the drive signal d(r _l ) becomes small when the speakers 106-l are sparsely arranged. More specifically, the second embodiment generates the drive signal d(r _l ) by seamlessly switching between the configuration of a part of the simplified WFS shown in the first embodiment and the configuration of VBAP according to the angle weight.

一般に、ＷＦＳの音場再現手法を用いる場合、多数のスピーカ１０６－ｌを密に配置することを想定する。しかし、家庭用のオーディオにおいては、多くのスピーカ１０６－ｌを密に配置することは困難であり、スピーカ１０６－ｌを疎に配置するのが通常である。 In general, when using the WFS sound field reproduction method, it is assumed that many speakers 106-l are arranged closely together. However, in home audio, it is difficult to arrange many speakers 106-l closely together, and it is common to arrange the speakers 106-l sparsely.

図４は、スピーカ１０６－ｌが疎に配置されている場合を説明する図である。図４に示すように、再現音場において、スピーカ１０６－ｌが疎に配置されており、仮想音源１０４の位置（仮想音源座標ｒ_S）が音場を再現したい領域の境界面に近接した場合を想定する。 4 is a diagram for explaining a case where the speakers 106-l are sparsely arranged. As shown in FIG. 4, it is assumed that the speakers 106-l are sparsely arranged in a reproduced sound field, and the position of the virtual sound source 104 (virtual sound source coordinates r _S ) is close to the boundary surface of the area where the sound field is to be reproduced.

この場合、全てのスピーカ１０６－ｌ（１０６－１～１０６－Ｌ）において、仮想音源１０４位置からスピーカ１０６－ｌ位置を結んだベクトル（ｒ_l－ｒ_S）とスピーカ１０６－l位置での境界面内向き法線ベクトルｎ（ｒ_l）との成す角θ_lは、９０°に近い値または９０°を超える値となってしまう。 In this case, in all speakers 106-l (106-1 to 106-L), the angle θ l between the vector (r _l -r _S ) connecting the position of the virtual sound source 104 to the position of the speaker 106-l and the inward normal vector n (r _l ) of the boundary surface at the position of the speaker 106- _l becomes a value close to or exceeding 90°.

このため、角度重み付加部１３－ｌにて用いる角度重みｗ_C（θ_l）は、前記式（９）から０に近い値または０となる。そして、全てのスピーカ１０６－ｌに対する駆動信号ｄ（ｒ_l）の値が小さい値または０になってしまい、仮想音源１０４が境界面に近接しているにも関わらず、再生信号が小さくなるという問題が生じる。 For this reason, the angle weight w _C (θ _l ) used in the angle weighting unit 13-l becomes a value close to 0 or 0 according to the above formula (9). Then, the values of the drive signals d(r _l ) for all the speakers 106-l become small or 0, which causes a problem that the reproduced signal becomes small even though the virtual sound source 104 is close to the boundary surface.

この問題を解決するため、実施例２では、図１に示した実施例１のオブジェクトベース音響レンダリング装置１において、ＷＦＳの信号分配則に対し、成す角θ_lに応じて連続的にＶＢＡＰの信号分配則にシームレスに切り替わるように、ＶＢＡＰの構成を追加する。 In order to solve this problem, in the second embodiment, a VBAP configuration is added to the object-based acoustic rendering device 1 of the first embodiment shown in FIG. 1 so that the signal distribution law of the WFS is continuously and seamlessly switched to the VBAP signal distribution law in accordance with the angle _θl .

ここで、図１に示した単純化したＷＦＳである実施例１のオブジェクトベース音響レンダリング装置１において、ＨＰＦ１０－ｌのＨＰＦｈ（ｒ_l，ｒ_S）は、周波数の補正に関する項である。また、減衰付加部１１－ｌの減衰係数１／||ｒ_l－ｒ_S||²を、仮想音源１０４の位置による距離減衰に関する項、面積要素付加部１２－ｌの面積要素ΔＳ_lを、スピーカ１０６－ｌを離散化することによる補正に関する項であると解釈する。さらに、角度重み付加部１３－ｌの角度重みｗ_C（θ_l）を、各スピーカ１０６－ｌに対する信号の分配側に関する項であると解釈する。 Here, in the object-based acoustic rendering device 1 of the first embodiment, which is the simplified WFS shown in Fig. 1, the HPFh(r _l , r _S ) of the HPF 10-l is a term related to frequency correction. Also, the attenuation coefficient 1/||r _l -r _S || ² of the attenuation adding unit 11-l is interpreted as a term related to distance attenuation due to the position of the virtual sound source 104, and the area element ΔS _l of the area element adding unit 12-l is interpreted as a term related to correction by discretizing the speaker 106-l. Furthermore, the angle weight w _C (θ _l ) of the angle weight adding unit 13-l is interpreted as a term related to the distribution side of the signal to each speaker 106-l.

実施例２では、ＷＦＳにおける角度重み付加部１３－ｌの角度重みｗ_C（θ_l）による各スピーカ１０６－ｌに対する信号の分配則に関する項に、ＶＢＡＰにより決定される各スピーカ１０６－ｌに対する信号の分配則に関する項を並列に接続する。 In the second embodiment, a term relating to the distribution law of signals for each speaker 106-l determined by VBAP is connected in parallel to a term relating to the distribution law of signals for each speaker 106- _l based on the angle weight w _C (θ l ) of the angle weighting unit 13-l in WFS.

図５は、実施例２のオブジェクトベース音響レンダリング装置の構成例を示すブロック図であり、図６は、実施例２のオブジェクトベース音響レンダリング装置の処理例を示す疑似コードである。 Figure 5 is a block diagram showing an example of the configuration of an object-based acoustic rendering device of the second embodiment, and Figure 6 is pseudocode showing an example of the processing of the object-based acoustic rendering device of the second embodiment.

このオブジェクトベース音響レンダリング装置２は、信号増幅部９－１～９－Ｌ、ＨＰＦ１０－１～１０－Ｌ、減衰付加部１１－１～１１－Ｌ、面積要素付加部１２－１～１２－Ｌ、角度重み付加部１３－１～１３－Ｌ、ＶＢＡＰ係数乗算部１４－１～１４－Ｌ、角度重み付加部１５－１～１５－Ｌ、加算部１６－１～１６－Ｌ、角度算出部２０及びＶＢＡＰ係数算出部２１を備えている。 This object-based acoustic rendering device 2 includes signal amplifiers 9-1 to 9-L, HPFs 10-1 to 10-L, attenuation adding units 11-1 to 11-L, surface area element adding units 12-1 to 12-L, angle weighting adding units 13-1 to 13-L, VBAP coefficient multipliers 14-1 to 14-L, angle weighting adding units 15-1 to 15-L, adders 16-1 to 16-L, an angle calculation unit 20, and a VBAP coefficient calculation unit 21.

以下、オブジェクトベース音響レンダリング装置２において、第ｌ（エル）番目のスピーカ１０６－ｌ（ｌ番目の系統）に対応する構成部を、信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ、角度重み付加部１３－ｌ、ＶＢＡＰ係数乗算部１４－ｌ、角度重み付加部１５－ｌ及び加算部１６－ｌとする。つまり、オブジェクトベース音響レンダリング装置２は、Ｌ系統分の信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ、角度重み付加部１３－ｌ、ＶＢＡＰ係数乗算部１４－ｌ、角度重み付加部１５－ｌ及び加算部１６－ｌを備え、さらに、角度算出部２０及びＶＢＡＰ係数算出部２１を備えている。 Hereinafter, in the object-based acoustic rendering device 2, the components corresponding to the lth speaker 106-l (lth system) are the signal amplifier 9-l, HPF 10-l, attenuation adding unit 11-l, area element adding unit 12-l, angle weight adding unit 13-l, VBAP coefficient multiplier 14-l, angle weight adding unit 15-l, and adder 16-l. In other words, the object-based acoustic rendering device 2 comprises signal amplifiers 9-l, HPF 10-l, attenuation adding unit 11-l, area element adding unit 12-l, angle weight adding unit 13-l, VBAP coefficient multiplier 14-l, angle weight adding unit 15-l, and adder 16-l for L systems, and further comprises an angle calculation unit 20 and a VBAP coefficient calculation unit 21.

信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ及び角度重み付加部１３－ｌは、ＷＦＳの構成部である。ＶＢＡＰ係数乗算部１４－ｌ及び角度重み付加部１５－ｌは、ＶＢＡＰの構成部であり、ＷＦＳの角度重み付加部１３－ｌに並列に接続されている。 The signal amplifier 9-l, HPF 10-l, attenuation adding unit 11-l, surface area element adding unit 12-l, and angle weight adding unit 13-l are components of the WFS. The VBAP coefficient multiplier 14-l and angle weight adding unit 15-l are components of the VBAP, and are connected in parallel to the angle weight adding unit 13-l of the WFS.

図１に示した実施例１のオブジェクトベース音響レンダリング装置１とこの実施例２のオブジェクトベース音響レンダリング装置２とを比較すると、両オブジェクトベース音響レンダリング装置１，２は、信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ、角度重み付加部１３－ｌ及び角度算出部２０を備えている点で共通する。 Comparing the object-based acoustic rendering device 1 of the first embodiment shown in FIG. 1 with the object-based acoustic rendering device 2 of the second embodiment, both object-based acoustic rendering devices 1 and 2 have in common the fact that they are equipped with a signal amplifier 9-l, HPF 10-l, attenuation adding unit 11-l, surface element adding unit 12-l, angle weighting adding unit 13-l, and angle calculation unit 20.

一方、オブジェクトベース音響レンダリング装置２は、ＶＢＡＰ係数乗算部１４－ｌ、角度重み付加部１５－ｌ及び加算部１６－ｌ、並びにＶＢＡＰ係数算出部２１を備えている点で、これらの構成部を備えていないオブジェクトベース音響レンダリング装置１と相違する。 On the other hand, the object-based acoustic rendering device 2 differs from the object-based acoustic rendering device 1, which does not have these components, in that it has a VBAP coefficient multiplication unit 14-l, an angle weighting unit 15-l, an addition unit 16-l, and a VBAP coefficient calculation unit 21.

図６を参照してオブジェクトベース音響レンダリング装置２の全体処理について説明する。オブジェクトベース音響レンダリング装置２は、図２のステップＳ２０１と同様に、音源信号Ｓを入力すると共に、仮想音源１０４の仮想音源座標ｒ_Sを入力する（ステップＳ６０１）。 The overall processing of the object-based acoustic rendering apparatus 2 will be described with reference to Fig. 6. As in step S201 of Fig. 2, the object-based acoustic rendering apparatus 2 inputs the sound source signal S and the virtual sound source coordinates _rS of the virtual sound source 104 (step S601).

オブジェクトベース音響レンダリング装置２は、図２のステップＳ２０２と同様に、スピーカ１０６－ｌについて、スピーカ座標ｒ_l、面積要素ΔＳ_l及び境界面内向き法線ベクトルｎ（ｒ_l）を入力する（ステップＳ６０２）。 As in step S202 in FIG. 2, the object-based acoustic rendering apparatus 2 inputs the speaker coordinates r _l , the area element ΔS _l and the boundary surface inward normal vector n(r _l ) for the speaker 106-l (step S602).

オブジェクトベース音響レンダリング装置２は、図２のステップＳ２０３と同様に、スピーカ１０６－ｌについて、仮想音源座標ｒ_S及びスピーカ座標ｒ_lに基づき、ＨＰＦｈ（ｒ_l，ｒ_S）を設定する（ステップＳ６０３）。 The object-based acoustic rendering apparatus 2 sets HPFh(r _l , r _S ) for the speaker 106-l based on the virtual sound source coordinates r _S and the speaker coordinates r _l (step S603), similarly to step S203 in FIG.

オブジェクトベース音響レンダリング装置２は、図２のステップＳ２０４と同様に、スピーカ１０６－ｌについて、仮想音源座標ｒ_S及びスピーカ座標ｒ_lに基づき、減衰係数１／||ｒ_l－ｒ_S||²を設定する（ステップＳ６０４）。 The object-based acoustic rendering apparatus 2 sets an attenuation coefficient 1/∥r _l -r _S ∥ ² for the speaker 106-l based on the virtual sound source coordinates r _S and the speaker coordinates r _l (step S604), similarly to step S204 in FIG.

オブジェクトベース音響レンダリング装置２は、図２のステップＳ２０５と同様に、スピーカ１０６－ｌについて、仮想音源座標ｒ_S、スピーカ座標ｒ_l及び境界面内向き法線ベクトルｎ（ｒ_l）に基づき、成す角θ_lを算出する（ステップＳ６０５）。 Similar to step S205 in FIG. 2, the object-based acoustic rendering apparatus 2 calculates an angle θ l for the speaker 106- _l based on the virtual sound source coordinates r _s , the speaker coordinates r _l and the boundary surface inward normal vector n(r _l ) (step S605).

オブジェクトベース音響レンダリング装置２は、スピーカ１０６－ｌについて、成す角θ_lの絶対値｜θ_l｜が０°よりも大きく、かつ９０°よりも小さいか否かを判定し、前記式（９）にてＷＦＳの角度重みｗ_C（θ_l）、及び以下に示す式（１４）にてＶＢＡＰの角度重みｗ_S（θ_l）を設定する（ステップＳ６０６）。

The object-based acoustic rendering device 2 determines whether the absolute value |θ _l | of the angle θ _l formed by the speaker 106-l is greater than 0° and less than 90°, and sets the angle weight w _C (θ _l ) of the WFS using the above equation (9) and the angle weight w _S (θ _l ) of the VBAP using the following equation (14) (step S606).

具体的には、オブジェクトベース音響レンダリング装置２は、成す角｜θ_l｜が０°よりも大きく、かつ９０°よりも小さいと判定した場合（前記式（９）（１４）ではｃｏｓθ_l＞０の場合）、角度重みｗ_C（θ_l）＝ｃｏｓθ_l及び角度重みｗ_S（θ_l）＝｜１－ｃｏｓθ_l｜を設定する。一方、オブジェクトベース音響レンダリング装置２は、成す角｜θ_l｜が０°以下であるか、または９０°以上であると判定した場合（前記式（９）（１４）ではｃｏｓθ_l＞０以外の場合）、角度重みｗ_C（θ_l）＝０及び角度重みｗ_S（θ_l）＝０を設定する。この角度重みｗ_C（θ_l）は角度重み付加部１３－ｌにより用いられ、角度重みｗ_S（θ_l）は、角度重み付加部１５－ｌにて用いられる。 Specifically, when the object-based sound rendering device 2 determines that the formed angle |θ _l | is greater than 0° and less than 90° (when cos θ _l > 0 in the above formulas (9) and (14)), it sets the angle weight w _C (θ _l ) = cos θ _l and the angle weight w _S (θ _l ) = |1 - cos θ _l |. On the other hand, when the object-based sound rendering device 2 determines that the formed angle |θ _l | is less than or equal to 0° or greater than or equal to 90° (when other than cos θ _l > 0 in the above formulas (9) and (14)), it sets the angle weight w _C (θ _l ) = 0 and the angle weight w _S (θ _l ) = 0. This angle weight w _C (θ _l ) is used by the angle weight adding unit 13-l, and the angle weight w _S (θ _l ) is used by the angle weight adding unit 15-l.

角度重みｗ_C（θ_l）は、０から１までの範囲の値をとり、成す角｜θ_l｜が０°＜｜θ_l｜＜９０°の範囲において、成す角｜θ_l｜が０°に近いほど１（ｃｏｓθ_l）に近い値をとり、９０°に近いほど０に近い値をとり、成す角｜θ_l｜が｜θ_l｜≦０°または９０°≦｜θ_l｜の範囲において、０の値をとる。 The angle weight _wC ( _θl ) takes a value in the range from 0 to 1. When the angle | _θl | is in the range of 0°<| _θl |<90°, the closer the angle | _θl | is to 0°, the closer it is to 1 (cos _θl ), and the closer it is to 90°, the closer it is to 0. When the angle | _θl | is in the range of | _θl |≦0° or 90°≦| _θl |, the angle weight wC(θl) takes a value of 0.

角度重みｗ_S（θ_l）は、成す角｜θ_l｜が０°＜｜θ_l｜＜９０°の範囲において、成す角｜θ_l｜が０°に近いほど０に近い値をとり、９０°に近いほど１（｜１－ｃｏｓθ_l｜）に近い値をとり、成す角｜θ_l｜が｜θ_l｜≦０°または９０°≦｜θ_l｜の範囲において、０の値をとる。 The angle weight w _S (θ _l ) takes a value closer to 0 as the angle |θ _l | is closer to 0° when the angle |θ _l | is in the range of 0°<|θ l |<90°, and takes a value closer to 1 (|1-cos θ _l |) as the angle |θ _l | is closer to 90°, and takes a value of 0 when the angle |θ _l | is in the range of |θ _l |≦0° or 90°≦|θ _l |.

オブジェクトベース音響レンダリング装置２は、所定の受音点（聴取者の位置）から仮想音源座標ｒ_Sへの仮想音源方向の単位ベクトル及び全てのスピーカ１０６－ｌにおける所定の受音点からスピーカ座標ｒ_lへのスピーカ方向の単位ベクトルに基づいて、仮想音源１０４を内部に含む３つのスピーカをＶＢＡＰ対象の３つのスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３として特定する。 The object-based acoustic rendering device 2 identifies three speakers that include the virtual sound source 104 therein as three VBAP target speakers 106- _n1 , 106-n2, 106-n3 based on the unit vector of the virtual sound source direction from a specific sound receiving point (listener's position) to the virtual sound source coordinate r S and the unit vector of the speaker direction from a specific sound receiving point in all speakers 106-l to the speaker coordinate r _l .

そして、オブジェクトベース音響レンダリング装置２は、ＶＢＡＰ対象の３つのスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３におけるスピーカ方向の単位ベクトルｒ_n1，ｒ_n2，ｒ_n3を取得する（ステップＳ６０７）。単位ベクトルｒ_n1，ｒ_n2，ｒ_n3は、所定の受音点及びスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３の座標（スピーカ座標）に基づいて、それぞれ取得することができる。ｎ１，ｎ２，ｎ３は、ＶＢＡＰ対象の３つのスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３のインデックスである。 The object-based acoustic rendering device 2 then obtains unit vectors r _n1 , r _n2 , and r _{n3 in the speaker directions of the three speakers 106-n1, 106-n2, and 106-n3 targeted for VBAP (step S607). The unit vectors r n1 , r n2 , and r n3} _can _be _obtained based on the coordinates (speaker coordinates) of the predetermined sound receiving point and the speakers 106-n1, 106-n2, and 106-n3, respectively. n1, n2, and n3 are indexes of the three speakers 106-n1, 106-n2, and 106-n3 targeted for VBAP.

オブジェクトベース音響レンダリング装置２は、仮想音源座標ｒ_Sへの仮想音源方向の単位ベクトル及びスピーカ方向の単位ベクトルｒ_n1，ｒ_n2，ｒ_n3に基づいて、以下の式にてスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３に対応する係数ｇ_l＝ｇ_n1，ｇ_n2，ｇ_n3を算出する（ステップＳ６０８）。

前記式（１５）のｒ_nSは、仮想音源座標ｒ_Sへの仮想音源方向の単位ベクトルである。 The object-based acoustic rendering device 2 calculates coefficients g _l = g _n1 , g _n2 , g _n3 corresponding to speakers 106-n1, 106-n2, 106-n3 using the following formula based on the unit vector of the virtual sound source direction to the virtual sound source coordinate r _S and the unit vectors r _n1 , r _n2 , r _n3 of the speaker directions (step S608).

In the above equation (15), r _nS is a unit vector of the virtual sound source direction to the virtual sound source coordinate r _S .

また、オブジェクトベース音響レンダリング装置２は、スピーカ１０６－１～１０６－Ｌのうちスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３以外のスピーカ１０６－ｌの係数ｇ_l＝０を設定する。これらの係数ｇ_l＝ｇ_n1，ｇ_n2，ｇ_n3，０は、ＶＢＡＰ係数算出部２１により算出及び設定される。 Furthermore, the object-based acoustic rendering device 2 sets the coefficient g _l of the speaker 106-l other than the speakers 106-n1, 106-n2, and 106-n3 among the speakers 106-1 to 106-L to 0. These coefficients g _l =g _n1 , g _n2 , g _n3 , and 0 are calculated and set by the VBAP coefficient calculation unit 21.

オブジェクトベース音響レンダリング装置２は、インデックスがｌ＝ｎ１，ｎ２，ｎ３のスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３について、定数ｕ（例えばｕ＝１／２π）、ステップＳ６０１にて入力した音源信号Ｓ及び仮想音源座標ｒ_S、ステップＳ６０２にて入力したスピーカ座標ｒ_l及び面積要素ΔＳ_l、ステップＳ６０３にて設定したＨＰＦｈ（ｒ_l，ｒ_S）、ステップＳ６０４にて設定した減衰係数１／||ｒ_l－ｒ_S||²及びステップＳ６０６にて設定した角度重みｗ_C（θ_l）及び角度重みｗ_S（θ_l）に基づき、以下の式にて駆動信号ｄ（ｒ_l）を算出し出力する（ステップＳ６０９）。

For the speakers 106-n1, 106-n2, and 106-n3 with indexes l=n1, n2, and n3, the object-based acoustic rendering device 2 calculates and outputs a drive signal d(r _{l ) using the following equation based on a constant u (for example, u=1/2π), the sound source signal S and virtual sound source coordinates r S input in step S601, the speaker coordinates r l} _and area element ΔS _l input in step S602, the HPFh(r _l , r _S ) set in step S603, the attenuation coefficient 1/||r _l -r _S || ² set in step S604, and the angle weights w _C (θ _l ) _and w _S (θ _l ) set in step S606 (step S609).

この式において、係数ｇ_l＝ｇ_n1，ｇ_n2，ｇ_n3である。また、＊は畳み込みを表す。駆動信号ｄ（ｒ_l）は、信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ、角度重み付加部１３－ｌ、ＶＢＡＰ係数乗算部１４－ｌ、角度重み付加部１５－ｌ及び加算部１６－ｌにより算出される。 In this formula, the coefficients g _l = g _n1 , g _n2 , g _n3 . * indicates convolution. The drive signal d(r _l ) is calculated by a signal amplifier 9-l, HPF 10-l, attenuation adding unit 11-l, surface area element adding unit 12-l, angle weighting adding unit 13-l, VBAP coefficient multiplier 14-l, angle weighting adding unit 15-l, and adder 16-l.

一方、オブジェクトベース音響レンダリング装置２は、インデックスがｌ＝ｎ１，ｎ２，ｎ３以外について、すなわちスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３以外のスピーカ１０６－ｌについて、定数ｕ、音源信号Ｓ、仮想音源座標ｒ_S、スピーカ座標ｒ_l及び面積要素ΔＳ_l、ＨＰＦｈ（ｒ_l，ｒ_S）、減衰係数１／||ｒ_l－ｒ_S||²及び角度重みｗ_C（θ_l）に基づき、以下の式にて駆動信号ｄ（ｒ_l）を算出し出力する（ステップＳ６１０）。

この式（１７）は、前記式（１０）と同じである。 On the other hand, for indexes other than l=n1, n2, n3, i.e., for speakers 106-l other than speakers 106-n1, 106-n2, 106-n3, the object-based acoustic rendering device 2 calculates and outputs a drive signal d(r l ) using the following formula based on the constant u, sound source signal S, virtual sound source coordinates r _S , speaker coordinates r _l and area element ΔS _l , HPFh(r _l , r _S ), attenuation coefficient 1/||r _l _-r _S || ² and angle weighting w _C (θ _l ) (step S610).

This formula (17) is the same as the above formula (10).

この駆動信号ｄ（ｒ_l）は、信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ及び角度重み付加部１３－ｌにより算出される。 This drive signal d(r _l ) is calculated by a signal amplifier 9-l, HPF 10-l, attenuation adding section 11-l, surface area element adding section 12-l and angle weighting section 13-l.

次に、図５を参照してオブジェクトベース音響レンダリング装置２の構成部の処理について説明する。信号増幅部９－ｌ（９－１～９－Ｌ）、ＨＰＦ１０－ｌ（１０－１～１０－Ｌ）、減衰付加部１１－ｌ（１１－１～１１－Ｌ）、面積要素付加部１２－ｌ（１２－１～１２－Ｌ）、角度重み付加部１３－ｌ（１３－１～１３－Ｌ）及び角度算出部２０は、図１に示した構成部と同じであるため、説明を省略する。 Next, the processing of the components of the object-based acoustic rendering device 2 will be described with reference to Figure 5. The signal amplifier 9-l (9-1 to 9-L), HPF 10-l (10-1 to 10-L), attenuation adding unit 11-l (11-1 to 11-L), area element adding unit 12-l (12-1 to 12-L), angle weight adding unit 13-l (13-1 to 13-L) and angle calculation unit 20 are the same as the components shown in Figure 1, so their description will be omitted.

ここで、角度重み付加部１３－ｌは、成す角θ_lを反映したＷＦＳの駆動信号を第１駆動信号として、対応する加算部１６－ｌに出力する。角度算出部２０は、成す角θ_lを、対応する角度重み付加部１３－ｌ及び角度重み付加部１５－ｌに出力する。 Here, the angle weighting unit 13-l outputs the WFS drive signal reflecting the formed angle θ _l as a first drive signal to the corresponding adding unit 16-l. The angle calculation unit 20 outputs the formed angle θ _l to the corresponding angle weighting unit 13-l and angle weighting unit 15-l.

ＶＢＡＰ係数算出部２１は、仮想音源座標ｒ_S及びスピーカ座標ｒ₁～ｒ_Lを入力する。そして、ＶＢＡＰ係数算出部２１は、ステップＳ６０７，Ｓ６０８のとおり、所定の受音点から仮想音源座標ｒ_Sへの仮想音源方向の単位ベクトル及び全てのスピーカ１０６－１～１０６－Ｌにおける所定の受音点からスピーカ座標ｒ₁～ｒ_Lへのスピーカ方向の単位ベクトルに基づいて、仮想音源１０４を内部に含むＶＢＡＰ対象の３つのスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３に対応する係数ｇ_n1，ｇ_n2，ｇ_n3を算出する。 The VBAP coefficient calculation unit 21 inputs the virtual sound source coordinate r _S and the speaker coordinates r ₁ to r _L. Then, as in steps S607 and S608, the VBAP coefficient calculation unit 21 calculates coefficients g n1 , g _n2 , and g n3 corresponding to the three speakers 106- _n1 , 106-n2, and 106-n3 that are VBAP targets and include the virtual sound source 104 therein, based on the unit vector of the virtual sound source direction from a predetermined sound receiving point to the virtual sound source coordinate r _S and the unit vectors of the speaker directions from predetermined sound receiving points in all the speakers 106-1 to 106-L to the speaker coordinates r ₁ _to r _L.

ＶＢＡＰ係数算出部２１は、係数ｇ_l＝ｇ_n1，ｇ_n2，ｇ_n3を、スピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３に対応するＶＢＡＰ係数乗算部１４－ｎ１，１４－ｎ２，１４－ｎ３にそれぞれ出力する。また、ＶＢＡＰ係数算出部２１は、係数ｇ_l＝０を、スピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３以外のスピーカ１０６－ｌに対応するＶＢＡＰ係数乗算部１４－ｌにそれぞれ出力する。 The VBAP coefficient calculation unit 21 outputs the coefficients g _l =g _n1 , g _n2 , g _n3 to the VBAP coefficient multiplication units 14-n1, 14-n2, 14-n3 corresponding to the speakers 106-n1, 106-n2, 106-n3, respectively. Also, the VBAP coefficient calculation unit 21 outputs the coefficient g _l =0 to the VBAP coefficient multiplication unit 14-l corresponding to the speaker 106-l other than the speakers 106-n1, 106-n2, 106-n3.

図７は、ＶＢＡＰ係数算出部２１の構成例を示すブロック図である。このＶＢＡＰ係数算出部２１は、スピーカ決定部３１及び算出部３２を備えている。スピーカ決定部３１は、仮想音源座標ｒ_S及び全てのスピーカ１０６－１～１０６－Ｌのスピーカ座標ｒ₁～ｒ_Lを入力する。そして、スピーカ決定部３１は、仮想音源座標ｒ_S及びスピーカ座標ｒ₁～ｒ_Lに基づいて、仮想音源１０４を内部に含むＶＢＡＰ対象の３つのスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３を特定する。 7 is a block diagram showing an example of the configuration of the VBAP coefficient calculation unit 21. This VBAP coefficient calculation unit 21 includes a speaker determination unit 31 and a calculation unit 32. The speaker determination unit 31 inputs the virtual sound source coordinate r _S and the speaker coordinates r ₁ to r _L of all the speakers 106-1 to 106-L. Then, the speaker determination unit 31 specifies three speakers 106-n1, 106-n2, and 106-n3 that are VBAP targets and include the virtual sound source 104 therein, based on the virtual sound source coordinates r _S and the speaker coordinates r ₁ to r _L.

スピーカ決定部３１は、ＶＢＡＰ対象の３つのスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３におけるスピーカ方向の単位ベクトルｒ_n1，ｒ_n2，ｒ_n3を取得し、仮想音源座標ｒ_S及びスピーカ方向の単位ベクトルｒ_n1，ｒ_n2，ｒ_n3を算出部３２に出力する。 The speaker determination unit 31 acquires unit vectors r _n1 , r _n2 , r _n3 of the speaker directions for the three speakers 106-n1, 106-n2, 106-n3 that are VBAP targets, and outputs the virtual sound source coordinates r _S and the unit vectors r _n1 , r _n2 , r _n3 of the speaker directions to the calculation unit 32.

算出部３２は、スピーカ決定部３１から仮想音源座標ｒ_S及びスピーカ方向の単位ベクトルｒ_n1，ｒ_n2，ｒ_n3を入力する。そして、算出部３２は、仮想音源座標ｒ_Sへの仮想音源方向の単位ベクトル及びスピーカ方向の単位ベクトルｒ_n1，ｒ_n2，ｒ_n3に基づいて、前記式（１５）にてスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３に対応する係数ｇ_n1，ｇ_n2，ｇ_n3を算出する。そして、算出部３２は、係数ｇ_n1，ｇ_n2，ｇ_n3を、対応するＶＢＡＰ係数乗算部１４－ｎ１，１４－ｎ２，１４－ｎ３に出力する。 The calculation unit 32 inputs the virtual sound source coordinate r _S and the unit vectors r _n1 , r _n2 , r _n3 of the speaker direction from the speaker determination unit _31. Then, the calculation unit 32 calculates the coefficients g n1 , g _n2 , g n3 corresponding to the speakers 106-n1, 106-n2, 106- _n3 using the above formula (15) based on the unit vector of the virtual sound source direction to the virtual sound source coordinate r _S and the unit vectors r _n1 , r _n2 , r _n3 of the speaker direction. Then, the calculation unit 32 outputs the coefficients g _n1 , g _n2 , g _n3 to the corresponding VBAP coefficient multiplication units 14-n1, 14-n2, 14-n3.

また、算出部３２は、スピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３以外のスピーカ１０６－ｌに対応する係数ｇ_l＝０を設定し、対応するＶＢＡＰ係数乗算部１４－ｌに出力する。 Furthermore, the calculation unit 32 sets the coefficient g _l =0 corresponding to the speaker 106-l other than the speakers 106-n1, 106-n2, and 106-n3, and outputs it to the corresponding VBAP coefficient multiplication unit 14-l.

図５に戻って、ＶＢＡＰ係数乗算部１４－ｌ（１４－１～１４－Ｌ）は、ＶＢＡＰ係数算出部２１から係数ｇ_l（ｇ_n1，ｇ_n2，ｇ_n3，０のうちのいずれかの値）を入力する。そして、ＶＢＡＰ係数乗算部１４－ｌは、面積要素付加部１２－ｌから面積要素が付加された信号を入力し、当該信号に対し係数ｇ_lを乗算することで、係数ｇ_lを反映した信号を生成する。 5, VBAP coefficient multiplication unit 14-l (14-1 to 14-L) receives coefficient g _l (any value of g _n1 , g _n2 , g _n3 , or 0) from VBAP coefficient calculation unit 21. Then, VBAP coefficient multiplication unit 14-l receives a signal to which an area element has been added from area element addition unit 12-l, and multiplies the signal by coefficient g _l to generate a signal that reflects coefficient g _l .

ＶＢＡＰ係数乗算部１４－ｌは、係数ｇ_lを反映した信号を角度重み付加部１５－ｌに出力する。 The VBAP coefficient multiplier 14-l outputs a signal reflecting the coefficient g _l to the angle weighting unit 15-l.

角度重み付加部１５－ｌ（１５－１～１５－Ｌ）は、角度算出部２０から成す角θ_l（θ₁～θ_L）を入力し、成す角θ_lに基づいて、ステップＳ６０６のとおり、前記式（１４）にて角度重みｗ_S（θ_l）を設定する。そして、角度重み付加部１５－ｌは、ＶＢＡＰ係数乗算部１４－ｌから係数ｇ_lを反映した信号を入力し、当該信号に対し角度重みｗ_S（θ_l）を乗算することで、成す角θ_lを反映した信号を生成する。 Angle weighting unit 15-l (15-1 to 15-L) receives angle θ _l (θ ₁ to θ _L ) from angle calculation unit 20, and sets angle weight w _S (θ _l ) in equation (14) based on the angle θ _l as in step S606. Then, angle weighting unit 15-l receives a signal reflecting coefficient g _l from VBAP coefficient multiplication unit 14-l, and multiplies the signal by angle weight w _S (θ _l ) to generate a signal reflecting angle θ _l .

仮想音源１０４を内部に含む３つスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３に対応する角度重み付加部１５－ｎ１，１５－ｎ２，１５－ｎ３において、成す角θ_lを反映した信号は、成す角θ_lが９０°に近いほど、係数ｇ_lを反映した信号に近くなる。一方、成す角θ_lを反映した音源信号Ｓは、成す角θ_lが０°に近いほど、０に近い信号となる。また、仮想音源１０４を内部に含む３つスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３以外のスピーカ１０６－ｌに対応する角度重み付加部１５－ｌにおいて、成す角θ_lを反映した信号は０である。 In the angle weighting units 15-n1, 15-n2, and 15-n3 corresponding to the three speakers 106-n1, 106-n2, and 106-n3 including the virtual sound source 104, the signal reflecting the angle θ _l becomes closer to the signal reflecting the coefficient g _l as the angle θ _l becomes closer to 90°. On the other hand, the sound source signal S reflecting the angle θ _l becomes closer to 0 as the angle θ _l becomes closer to 0°. Also, in the angle weighting units 15-l corresponding to the speakers 106-l other than the three speakers 106-n1, 106-n2, and 106-n3 including the virtual sound source 104, the signal reflecting the angle θ _l is 0.

角度重み付加部１５－ｌは、成す角θ_lを反映した信号を第２駆動信号として加算部１６－ｌに出力する。 The angle weighting section 15-l outputs a signal reflecting the formed angle θ _l to the adding section 16-l as a second drive signal.

加算部１６－ｌ（１６－１～１６－Ｌ）は、角度重み付加部１３－ｌから成す角θ_lを反映した第１駆動信号を入力すると共に、角度重み付加部１５－ｌから成す角θ_lを反映した第２駆動信号を入力する。そして、加算部１６－ｌは、成す角θ_lを反映した第１駆動信号及び成す角θ_lを反映した第２駆動信号を加算し、加算結果を駆動信号ｄ（ｒ_l）としてスピーカ１０６－ｌへ出力する。 The adder 16-l (16-1 to 16-L) receives the first drive signal reflecting the angle θ l from the angle weighting unit 13-l, and receives the second drive signal reflecting the angle θ _{l from the angle weighting unit 15-l. The adder 16-l then adds the first drive signal reflecting the angle θ l} _and the second drive signal reflecting the angle θ _l , and outputs the addition result as a drive signal d( _r _l ) to the speaker 106-l.

以上のように、実施例２のオブジェクトベース音響レンダリング装置２によれば、実施例１のＷＦＳの構成に対してＶＢＡＰの構成を組み込み、ＷＦＳの一部の構成とＶＢＡＰの構成とを、角度重みに応じてシームレスに切り替えることで、駆動信号ｄ（ｒ_l）を生成するようにした。 As described above, according to the object-based acoustic rendering device 2 of the second embodiment, the VBAP configuration is incorporated into the WFS configuration of the first embodiment, and a drive signal d(r _l ) is generated by seamlessly switching between a part of the WFS configuration and the VBAP configuration according to the angle weighting.

具体的には、ＶＢＡＰ係数算出部２１は、所定の受音点から仮想音源座標ｒ_Sへの仮想音源方向の単位ベクトル及びスピーカ座標ｒ₁～ｒ_Lへのスピーカ方向の単位ベクトルに基づいて、仮想音源１０４を内部に含む３つのスピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３に対応する係数ｇ_n1，ｇ_n2，ｇ_n3を算出する。 Specifically, the VBAP coefficient calculation unit 21 calculates coefficients g _n1 , g n2 , and g _{n3 corresponding to the three speakers 106-n1, 106-n2, and 106-n3 that include the virtual sound source 104 therein, based on the unit vector of the virtual sound source direction from a specified sound receiving point to the virtual sound source coordinate r S} _and the unit vectors of the speaker directions to the speaker coordinates r ₁ to _r _L.

ＶＢＡＰ係数乗算部１４－ｌは、ＶＢＡＰ係数算出部２１から係数ｇ_l（ｇ_n1，ｇ_n2，ｇ_n3，０のうちのいずれかの値）を入力する。そして、ＶＢＡＰ係数乗算部１４－ｌは、信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ及び面積要素付加部１２－ｌにより得られた信号に係数ｇ_lを乗算することで、係数ｇ_lを反映した信号を生成する。 The VBAP coefficient multiplication unit 14-l inputs a coefficient g _l (any value of g _n1 , g _n2 , g _n3 , or 0) from the VBAP coefficient calculation unit 21. Then, the VBAP coefficient multiplication unit 14-l multiplies the signal obtained by the signal amplification unit 9-l, HPF 10-l, attenuation addition unit 11-l, and area element addition unit 12-l by the coefficient g _l to generate a signal reflecting the coefficient g _l .

角度重み付加部１５－ｌは、成す角θ_lに基づいて、前記式（１４）にて角度重みｗ_S（θ_l）を設定する。そして、角度重み付加部１５－ｌは、係数ｇ_lを反映した信号に角度重みｗ_S（θ_l）を乗算することで、成す角θ_lを反映した第２駆動信号を生成する。 The angle weighting unit 15-l sets the angle weight w _S (θ _l ) in the above formula (14) based on the formed angle θ _l . Then, the angle weighting unit 15-l generates a second drive signal that reflects the formed angle θ _l by multiplying the signal that reflects the coefficient g _l by the angle weight w _S (θ _l ).

加算部１６－ｌは、信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ及び角度重み付加部１３－ｌからなるＷＦＳの処理により得られた、成す角θ_lを反映した第１駆動信号、及び、成す角θ_lを反映した第２駆動信号を加算し、加算結果を駆動信号ｄ（ｒ_l）としてスピーカ１０６－ｌへ出力する。 The adder 16-l adds together a first drive signal reflecting the angle θ l and a second drive signal reflecting the angle θ _l obtained by processing the WFS consisting of the signal amplifier 9-l, the HPF 10-l, the attenuation adding unit 11-l, the surface element adding unit 12- _l and the angle weighting adding unit 13-l, and outputs the addition result as a drive signal d(r _l ) to the speaker 106-l.

このように、角度重みｗ_S（θ_l）を用いる角度重み付加部１５－ｌは、係数ｇ_lを用いるＶＢＡＰ係数乗算部１４－ｌに対して直列に接続され、ＶＢＡＰのＶＢＡＰ係数乗算部１４－ｌ及び角度重み付加部１５－ｌは、ＷＦＳの角度重み付加部１３－ｌに並列に接続される。そして、角度重み付加部１３－ｌに用いる角度重みｗ_C（θ_l）と角度重み付加部１５－ｌに用いる角度重みｗ_S（θ_l）とは、０～１の範囲においてトレードオフの関係にある。 In this way, the angle weighting unit 15-l using the angle weight w _S (θ _l ) is connected in series to the VBAP coefficient multiplier 14-l using the coefficient g _l , and the VBAP coefficient multiplier 14-l and angle weighting unit 15-l of the VBAP are connected in parallel to the angle weighting unit 13-l of the WFS. The angle weight w _C (θ _l ) used in the angle weighting unit 13-l and the angle weight w _S (θ _l ) used in the angle weighting unit 15-l are in a trade-off relationship in the range of 0 to 1.

このため、仮想音源１０４の位置（仮想音源座標ｒ_S）がスピーカ１０６－ｌの境界面から離れており、成す角θ_lが０°に近づいている場合には、角度重みｗ_C（θ_l）は１に近い大きい値となるが、角度重みｗ_S（θ_l）は０に近い小さい値となる。この場合、駆動信号ｄ（ｒ_l）は、ＷＦＳによる分配側が支配的となる。 For this reason, when the position of the virtual sound source 104 (virtual sound source coordinate r _S ) is away from the boundary surface of the speaker 106-l and the angle θ _l formed therewith is close to 0°, the angle weight w _C (θ _l ) becomes a large value close to 1, but the angle weight w _S (θ _l ) becomes a small value close to 0. In this case, the distribution side by the WFS becomes dominant in the drive signal d(r _l ).

一方、仮想音源１０４の位置（仮想音源座標ｒ_S）がスピーカ１０６－ｌの境界面に近接し、成す角θ_lが９０°に近づいている場合には、角度重みｗ_C（θ_l）は０に近い小さい値となるが、角度重みｗ_S（θ_l）は１に近い大きい値となる。この場合、駆動信号ｄ（ｒ_l）は、ＶＢＡＰによる分配側が支配的となる。 On the other hand, when the position of the virtual sound source 104 (virtual sound source coordinate r _S ) is close to the boundary surface of the speaker 106-l and the formed angle θ _l is close to 90°, the angle weight w _C (θ _l ) is a small value close to 0, but the angle weight w _S (θ _l ) is a large value close to 1. In this case, the distribution side by VBAP becomes dominant in the drive signal d (r _l ).

つまり、スピーカ１０６－ｌに対する仮想音源１０４の位置に応じて、ＷＦＳによる分配側とＶＢＡＰによる分配側とが、連続的にシームレスに切り替わるようになる。 In other words, depending on the position of the virtual sound source 104 relative to the speaker 106-l, the distribution side based on WFS and the distribution side based on VBAP are switched continuously and seamlessly.

したがって、実施例２のオブジェクトベース音響レンダリング装置２では、実施例１と同様の効果を奏することができる。また、スピーカ１０６－ｌが疎に配列された環境において、仮想音源１０４の位置（仮想音源座標ｒ_S）がスピーカ１０６－ｌの境界面に近接し、成す角θ_l が大きくなるにつれてＷＦＳの角度重みは小さく、ＶＢＡＰの角度重みが大きくなり、連続的にレンダリング則が切り替わる。このため、ＷＦＳの角度重みが小さくなることに起因し、再生信号が小さくなる場合でもＶＢＡＰの角度重みでそれを補填することで、再生信号が極端に小さくなるという問題を解決することができる。 Therefore, the object-based sound rendering device 2 of the second embodiment can achieve the same effect as that of the first embodiment. In an environment in which the speakers 106-l are sparsely arranged, as the position of the virtual sound source 104 (virtual sound source coordinate r _S ) approaches the boundary surface of the speakers 106-l and the angle θ _l formed increases, the angle weight of the WFS decreases and the angle weight of the VBAP increases, and the rendering rule is switched continuously. Therefore, even if the playback signal becomes small due to the decrease in the angle weight of the WFS, the problem of the playback signal becoming extremely small can be solved by compensating for it with the angle weight of the VBAP.

尚、実施例２のオブジェクトベース音響レンダリング装置２のＨＰＦ１０－ｌ（１０－１～１０－Ｌ）は、図１に示した実施例１と同様に、ｊｋ||ｒ_l－ｒ_S||＋１を、前記式（１２）のＨＰＦ係数ｈ_n（ｒ_l，ｒ_S）を用いた前記式（１１）のハイパスフィルタに置き換えて使用するようにしてもよい。 The HPF 10-l (10-1 to 10-L) of the object-based acoustic rendering device 2 of the second embodiment may be used by replacing jk∥r _l -r _S ∥+1 with the high-pass filter of the above formula (11) using the HPF coefficient h _n (r _l , r _S ) of the above formula (12), as in the first embodiment shown in FIG.

また、実施例２のオブジェクトベース音響レンダリング装置２の減衰付加部１１－ｌ（１１－１～１１－Ｌ）は、図１に示した実施例１と同様に、減衰係数１／||ｒ_l－ｒ_S||²の代わりに、前記式（１３）の減衰係数ｇ（ｒ_l，ｒ_S）を用いるようにしてもよい。つまり、減衰付加部１１－ｌは、減衰係数ｇ（ｒ_l，ｒ_S）を用いて、仮想音源１０４とスピーカ１０６－ｌとの間の距離に応じて減衰した信号を生成する。 Also, the attenuation adding unit 11-l (11-1 to 11-L) of the object-based acoustic rendering device 2 of the second embodiment may use the attenuation coefficient g(r _l , r _S ) of the above formula (13) instead of the attenuation coefficient 1/∥r _l -r _S ∥ ² , as in the first embodiment shown in Fig. 1. In other words, the attenuation adding unit 11-l uses the attenuation coefficient g(r _l , r _S ) to generate a signal attenuated according to the distance between the virtual sound source 104 and the speaker 106-l.

〔実施例３〕
次に、実施例３について説明する。前述のとおり、実施例３は、実施例２と同様に、実施例１の構成に対してＶＢＡＰの構成を組み込むことで、実施例１の効果に加え、スピーカ１０６－ｌが疎に配置された場合に駆動信号ｄ（ｒ_l）が小さくなる問題を解決する。より詳細には、実施例３は、実施例１に示した単純化したＷＦＳの全体の構成とＶＢＡＰの構成とを、角度重みに応じてシームレスに切り替えることで、駆動信号ｄ（ｒ_l）を生成する。 Example 3
Next, a third embodiment will be described. As described above, in the third embodiment, similar to the second embodiment, the VBAP configuration is incorporated into the configuration of the first embodiment, thereby achieving the effects of the first embodiment and solving the problem that the drive signal d(r _l ) becomes small when the speakers 106-l are sparsely arranged. More specifically, the third embodiment generates the drive signal d(r _l ) by seamlessly switching between the entire simplified WFS configuration shown in the first embodiment and the VBAP configuration according to the angle weighting.

図８は、実施例３のオブジェクトベース音響レンダリング装置の構成例を示すブロック図であり、図９は、実施例３のオブジェクトベース音響レンダリング装置の処理例を示す疑似コードである。 Figure 8 is a block diagram showing an example of the configuration of an object-based acoustic rendering device of the third embodiment, and Figure 9 is pseudocode showing an example of the processing of the object-based acoustic rendering device of the third embodiment.

このオブジェクトベース音響レンダリング装置３は、信号増幅部９－１～９－Ｌ、ＨＰＦ１０－１～１０－Ｌ、減衰付加部１１－１～１１－Ｌ、面積要素付加部１２－１～１２－Ｌ、角度重み付加部１３－１～１３－Ｌ、ＶＢＡＰ係数乗算部１４－１～１４－Ｌ、角度重み付加部１５－１～１５－Ｌ、加算部１６－１～１６－Ｌ、角度算出部２０及びＶＢＡＰ係数算出部２１を備えている。 This object-based acoustic rendering device 3 includes signal amplifiers 9-1 to 9-L, HPFs 10-1 to 10-L, attenuation adding units 11-1 to 11-L, surface area element adding units 12-1 to 12-L, angle weighting adding units 13-1 to 13-L, VBAP coefficient multipliers 14-1 to 14-L, angle weighting adding units 15-1 to 15-L, adders 16-1 to 16-L, an angle calculation unit 20, and a VBAP coefficient calculation unit 21.

信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ及び角度重み付加部１３－ｌは、ＷＦＳの構成部であり、ＶＢＡＰ係数乗算部１４－ｌ及び角度重み付加部１５－ｌは、ＶＢＡＰの構成部である。ＷＦＳの構成部とＶＢＡＰの構成部とは、当該オブジェクトベース音響レンダリング装置３全体として、並列に接続されている。 The signal amplifier 9-l, HPF 10-l, attenuation adding unit 11-l, surface element adding unit 12-l and angle weight adding unit 13-l are components of the WFS, and the VBAP coefficient multiplier 14-l and angle weight adding unit 15-l are components of the VBAP. The components of the WFS and the components of the VBAP are connected in parallel as the object-based acoustic rendering device 3 as a whole.

図２に示した実施例２のオブジェクトベース音響レンダリング装置２とこの実施例３のオブジェクトベース音響レンダリング装置３とを比較すると、両オブジェクトベース音響レンダリング装置２，３は、同一の構成部を備えている点で共通する。 When comparing the object-based acoustic rendering device 2 of the second embodiment shown in FIG. 2 with the object-based acoustic rendering device 3 of the third embodiment, both object-based acoustic rendering devices 2 and 3 have the same components in common.

一方、オブジェクトベース音響レンダリング装置３は、ＶＢＡＰの構成部であるＶＢＡＰ係数乗算部１４－ｌ及び角度重み付加部１５－ｌが、ＷＦＳの全体の構成部と並列に接続されている点で、ＷＦＳの一部の角度重み付加部１３－ｌのみと並列に接続されているオブジェクトベース音響レンダリング装置２と相違する。 On the other hand, the object-based acoustic rendering device 3 differs from the object-based acoustic rendering device 2 in that the VBAP coefficient multiplication unit 14-l and the angle weighting unit 15-l, which are components of the VBAP, are connected in parallel to the entire components of the WFS, whereas the object-based acoustic rendering device 3 is connected in parallel to only a part of the WFS, the angle weighting unit 13-l.

図９を参照してオブジェクトベース音響レンダリング装置３の全体処理について説明する。図９に示すステップＳ９０１～Ｓ９０８の処理は、図６に示した実施例２のオブジェクトベース音響レンダリング装置２によるステップＳ６０１～Ｓ６０８の処理と同じであるため、ここでは説明を省略する。 The overall processing of the object-based acoustic rendering device 3 will be described with reference to FIG. 9. The processing of steps S901 to S908 shown in FIG. 9 is the same as the processing of steps S601 to S608 by the object-based acoustic rendering device 2 of the second embodiment shown in FIG. 6, so a description thereof will be omitted here.

オブジェクトベース音響レンダリング装置３は、ステップＳ９０７にて取得したスピーカ座標ｒ_lのインデックスがｌ＝ｎ１，ｎ２，ｎ３の場合、スピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３について、定数ｕ（例えばｕ＝１／２π）、ステップＳ９０１にて入力した音源信号Ｓ及び仮想音源座標ｒ_S、ステップＳ９０２にて入力したスピーカ座標ｒ_l及び面積要素ΔＳ_l、ステップＳ９０３にて設定したＨＰＦｈ（ｒ_l，ｒ_S）、ステップＳ９０４にて設定した減衰係数１／||ｒ_l－ｒ_S||²及びステップＳ９０６にて設定した角度重みｗ_C（θ_l）及び角度重みｗ_S（θ_l）に基づき、以下の式にて駆動信号ｄ（ｒ_l）を算出し出力する（ステップＳ９０９）。

When the index of the speaker coordinate r _l obtained in step S907 is l = n1, n2, n3, the object-based acoustic rendering device 3 calculates and outputs a drive signal d(r l ) for the speakers 106-n1, 106-n2, 106-n3 using the following equation based on the constant u (e.g., u = 1/2π), the sound source signal S and virtual sound source coordinate r _S input in step S901, the speaker coordinate r _l and area element ΔS _l input in step S902, the HPFh(r _l , r _S ) set in step S903, the attenuation coefficient 1/|| _{r l} _-r _S || ² set in step S904, and the angle weight w _C (θ _l ) and angle weight w _S (θ _l ) set in step S906 (step S909).

一方、オブジェクトベース音響レンダリング装置３は、インデックスがｌ＝ｎ１，ｎ２，ｎ３以外の場合、スピーカ１０６－ｎ１，１０６－ｎ２，１０６－ｎ３以外のスピーカ１０６－ｌについて、定数ｕ、音源信号Ｓ、仮想音源座標ｒ_S、スピーカ座標ｒ_l及び面積要素ΔＳ_l、ＨＰＦｈ（ｒ_l，ｒ_S）、減衰係数１／||ｒ_l－ｒ_S||²及び角度重みｗ_C（θ_l）に基づき、以下の式にて駆動信号ｄ（ｒ_l）を算出し出力する（ステップＳ９１０）。

この式（１９）は、前記式（１０）及び前記式（１７）と同じである。 On the other hand, when the index is other than l=n1, n2, n3, the object-based acoustic rendering device 3 calculates and outputs a drive signal d(r l ) for each speaker 106-l other than speakers 106-n1, 106-n2, 106-n3 based on the constant u, sound source signal S, virtual sound source coordinate r _S , speaker coordinate r _l and area element ΔS _l , HPFh(r _l , r _S ), attenuation coefficient 1/||r _l -r _S || ² and angle weight w _C ( _θ _l ) using the following formula (step S910).

This formula (19) is the same as the above formulas (10) and (17).

尚、前記式（１８）の演算において、右辺の第１項であるＷＦＳの構成による演算結果と、第２項であるＶＢＡＰの構成による演算結果とは、フィルタリングにより遅延が生じる。このため、角度重み付加部１３－ｌ，１５－ｌにおいて、ＶＢＡＰ側にも位相を調整するための遅延を加える必要があるが、ここではその表記は省略する。 In the calculation of equation (18), a delay occurs due to filtering between the calculation result of the WFS configuration, which is the first term on the right side, and the calculation result of the VBAP configuration, which is the second term. For this reason, it is necessary to add a delay to the VBAP side in order to adjust the phase in the angle weighting units 13-l and 15-l, but this will not be shown here.

また、前記式（１８）及び前記式（１９）において、畳み込み＊は、ＦＦＴ（Fast Fourier Transform：高速フーリエ変換）を利用し、周波数領域での積を逆変換して求めるようにしてもよい。前記式（１６）及び前記式（１７）についても同様である。 In addition, in the formulas (18) and (19), the convolution * may be calculated by inversely transforming the product in the frequency domain using FFT (Fast Fourier Transform). The same applies to the formulas (16) and (17).

図８を参照して、オブジェクトベース音響レンダリング装置３に備えたそれぞれの構成部の処理は、図５に示した実施例２のオブジェクトベース音響レンダリング装置２に備えたそれぞれの構成部の処理と同じであるため、構成部の処理の説明については省略する。 Referring to FIG. 8, the processing of each component of the object-based acoustic rendering device 3 is the same as the processing of each component of the object-based acoustic rendering device 2 of the second embodiment shown in FIG. 5, so the description of the processing of the components is omitted.

ＶＢＡＰ係数乗算部１４－ｌ（１４－１～１４－Ｌ）の入力データは、オブジェクトベース音響レンダリング装置２，３間において異なる。図５に示したオブジェクトベース音響レンダリング装置２のＶＢＡＰ係数乗算部１４－ｌは、面積要素付加部１２－ｌから面積要素が付加された信号を入力する。これに対し、図８に示すオブジェクトベース音響レンダリング装置３のＶＢＡＰ係数乗算部１４－ｌは、音源信号Ｓを入力する。 The input data of the VBAP coefficient multiplier 14-l (14-1 to 14-L) differs between the object-based acoustic rendering devices 2 and 3. The VBAP coefficient multiplier 14-l of the object-based acoustic rendering device 2 shown in FIG. 5 inputs a signal to which an area element has been added from the area element adding unit 12-l. In contrast, the VBAP coefficient multiplier 14-l of the object-based acoustic rendering device 3 shown in FIG. 8 inputs the sound source signal S.

具体的には、図８に示すオブジェクトベース音響レンダリング装置３のＶＢＡＰ係数乗算部１４－ｌは、ＶＢＡＰ係数算出部２１から係数ｇ_l（ｇ_n1，ｇ_n2，ｇ_n3，０のうちのいずれかの値）を入力する。そして、ＶＢＡＰ係数乗算部１４－ｌは、音源信号Ｓを入力し、当該音源信号Ｓに係数ｇ_lを乗算することで、係数ｇ_lを反映した信号を生成する。 8 receives a coefficient g _l (any value of g _n1 , g _n2 , g _n3 , or 0) from a VBAP coefficient calculation unit 21. The VBAP coefficient multiplication unit 14-l receives a sound source signal S and multiplies the sound source signal S by the coefficient g _l to generate a signal reflecting the coefficient g _l .

以上のように、実施例３のオブジェクトベース音響レンダリング装置３によれば、実施例１のＷＦＳの構成に対してＶＢＡＰの構成を組み込み、ＷＦＳの全体の構成とＶＢＡＰの構成とを、角度重みに応じてシームレスに切り替えることで、駆動信号ｄ（ｒ_l）を生成する。 As described above, according to the object-based acoustic rendering device 3 of Example 3, the VBAP configuration is incorporated into the WFS configuration of Example 1, and the drive signal d(r _l ) is generated by seamlessly switching between the overall WFS configuration and the VBAP configuration according to the angle weighting.

具体的には、ＶＢＡＰ係数乗算部１４－ｌは、ＶＢＡＰ係数算出部２１から係数ｇ_l（ｇ_n1，ｇ_n2，ｇ_n3，０のうちのいずれかの値）を入力する。そして、ＶＢＡＰ係数乗算部１４－ｌは、音源信号Ｓに係数ｇ_lを乗算することで、係数ｇ_lを反映した信号を生成する。 Specifically, the VBAP coefficient multiplication unit 14-l inputs a coefficient g _l (any value of g _n1 , g _n2 , g _n3 , or 0) from the VBAP coefficient calculation unit 21. Then, the VBAP coefficient multiplication unit 14-l multiplies the sound source signal S by the coefficient g _l to generate a signal that reflects the coefficient g _l .

このように、角度重みｗ_S（θ_l）を用いる角度重み付加部１５－ｌは、係数ｇ_lを用いるＶＢＡＰ係数乗算部１４－ｌに対して直列に接続される。また、ＶＢＡＰのＶＢＡＰ係数乗算部１４－ｌ及び角度重み付加部１５－ｌは、ＷＦＳの信号増幅部９－ｌ、ＨＰＦ１０－ｌ、減衰付加部１１－ｌ、面積要素付加部１２－ｌ及び角度重み付加部１３－ｌに並列に接続される。そして、角度重み付加部１３－ｌに用いる角度重みｗ_C（θ_l）と角度重み付加部１５－ｌに用いる角度重みｗ_S（θ_l）とは、０～１の範囲においてトレードオフの関係にある。 In this way, the angle weighting unit 15-l using the angle weight w _S (θ _l ) is connected in series to the VBAP coefficient multiplier 14-l using the coefficient g _l . The VBAP coefficient multiplier 14-l and angle weighting unit 15-l of the VBAP are connected in parallel to the signal amplifier 9-l, HPF 10-l, attenuation unit 11-l, surface element adding unit 12-l and angle weighting unit 13-l of the WFS. The angle weight w _C (θ _l ) used in the angle weighting unit 13-l and the angle weight w _S (θ _l ) used in the angle weighting unit 15-l are in a trade-off relationship in the range of 0 to 1.

このため、実施例２と同様に、θ_lが０°に近づいたときには、駆動信号ｄ（ｒ_l）は、ＷＦＳによる分配側が支配的となり、θ_lが９０°に近づいたときには、駆動信号ｄ（ｒ_l）は、ＶＢＡＰによる分配側が支配的となる。 For this reason, similarly to the second embodiment, when θ _l approaches 0°, the distribution side by WFS becomes dominant in the drive signal d(r _l ), and when θ _l approaches 90°, the distribution side by VBAP becomes dominant in the drive signal d(r _l ).

つまり、スピーカ１０６－ｌに対する仮想音源１０４の位置に応じて、連続的にＷＦＳとＶＢＡＰとの分配側が、シームレスに切り替わるようになる。 In other words, the distribution side between WFS and VBAP is switched continuously and seamlessly depending on the position of the virtual sound source 104 relative to the speaker 106-l.

したがって、実施例３のオブジェクトベース音響レンダリング装置３では、実施例１と同様の効果を奏することができる。また、実施例２と同様に、スピーカ１０６－ｌが疎に配列された環境において、ＷＦＳによるレンダリングを行っている際に、仮想音源１０４が境界面に近接して配置されると、ＶＢＡＰによるレンダリングに連続的にシームレスに切り替わる。このため、再生信号が小さくなるという問題を解決することができる。 Therefore, the object-based acoustic rendering device 3 of the third embodiment can achieve the same effect as that of the first embodiment. Also, as in the second embodiment, when rendering by WFS is being performed in an environment in which the speakers 106-l are sparsely arranged, if the virtual sound source 104 is placed close to a boundary surface, rendering by VBAP is continuously and seamlessly switched over. This solves the problem of the playback signal becoming smaller.

尚、実施例３のオブジェクトベース音響レンダリング装置３のＨＰＦ１０－ｌ（１０－１～１０－Ｌ）は、図１に示した実施例１及び図５に示した実施例２と同様に、ｊｋ||ｒ_l－ｒ_S||＋１を、前記式（１２）のＨＰＦ係数ｈ_n（ｒ_l，ｒ_S）を用いた前記式（１１）のハイパスフィルタに置き換えて使用するようにしてもよい。 The HPF 10-l (10-1 to 10-L) of the object-based acoustic rendering device 3 of the third embodiment may be configured to use a high-pass filter of the above formula (11) using the HPF coefficient h _n (r _l , r _S ) of the above formula (12), replacing jk∥r _l -r _S ∥+1, as in the first embodiment shown in FIG. 1 and the second embodiment shown in FIG.

また、実施例３のオブジェクトベース音響レンダリング装置３の減衰付加部１１－ｌ（１１－１～１１－Ｌ）は、図１に示した実施例１及び図５に示した実施例２と同様に、減衰係数１／||ｒ_l－ｒ_S||²の代わりに、前記式（１３）の減衰係数ｇ（ｒ_l，ｒ_S）を用いるようにしてもよい。つまり、減衰付加部１１－ｌは、減衰係数ｇ（ｒ_l，ｒ_S）を用いて、仮想音源１０４とスピーカ１０６－ｌとの間の距離に応じて減衰した信号を生成する。 Furthermore, the attenuation adding unit 11-l (11-1 to 11-L) of the object-based acoustic rendering device 3 of the third embodiment may use the attenuation coefficient g(r l , r S ) of the above formula (13) instead of the attenuation coefficient 1/∥r _l -r _S ∥ ² , as in the _first embodiment shown in Fig. 1 and the second embodiment shown in Fig. 5. In other words, the attenuation adding unit 11-l uses the attenuation coefficient g(r _l , r _S ₎ to generate a signal attenuated according to the distance between the virtual sound source 104 and the speaker 106-l.

ただし、仮想音源１０４がスピーカ１０６－ｌに近接して配置された場合、単純化したＷＦＳの構成（ＨＰＦ１０－ｌ～角度重み付加部１３－ｌ）と、ＶＢＡＰの構成（ＶＢＡＰ係数乗算部１４－ｌ及び角度重み付加部１５－ｌ）とでは、信号振幅が大きく異なる場合がある。このため、信号振幅が極端に異ならないように、前記式（１３）の減衰係数ｇ（ｒ_l，ｒ_S）を算出するために用いるパラメータβ1，β2は、適切に調整される必要がある。 However, when the virtual sound source 104 is placed close to the speaker 106-l, the signal amplitude may be significantly different between the simplified WFS configuration (HPF 10-l to angle weighting unit 13-l) and the VBAP configuration (VBAP coefficient multiplier 14-l and angle weighting unit 15-l). Therefore, the parameters β1 and β2 used to calculate the attenuation coefficient g(r _l , r _S ) in the above formula (13) need to be appropriately adjusted so that the signal amplitude does not differ significantly.

以上、実施例１，２，３を挙げて本発明を説明したが、本発明は前記実施例１，２，３に限定されるものではなく、その技術思想を逸脱しない範囲で種々変形可能である。 The present invention has been described above using Examples 1, 2, and 3, but the present invention is not limited to the above Examples 1, 2, and 3, and various modifications are possible without departing from the technical concept thereof.

尚、本発明の実施例１，２，３によるオブジェクトベース音響レンダリング装置１，２，３のハードウェア構成としては、通常のコンピュータを使用することができる。オブジェクトベース音響レンダリング装置１，２，３は、ＣＰＵ、ＲＡＭ等の揮発性の記憶媒体、ＲＯＭ等の不揮発性の記憶媒体、及びインターフェース等を備えたコンピュータによって構成される。 In addition, a normal computer can be used as the hardware configuration of the object-based acoustic rendering devices 1, 2, and 3 according to the first, second, and third embodiments of the present invention. The object-based acoustic rendering devices 1, 2, and 3 are configured by a computer equipped with a CPU, a volatile storage medium such as a RAM, a non-volatile storage medium such as a ROM, and an interface, etc.

オブジェクトベース音響レンダリング装置１に備えたＨＰＦ１０－１～１０－Ｌ、減衰付加部１１－１～１１－Ｌ、面積要素付加部１２－１～１２－Ｌ、角度重み付加部１３－１～１３－Ｌ及び角度算出部２０の各機能は、これらの機能を記述したプログラムをＣＰＵに実行させることによりそれぞれ実現される。 The functions of the HPFs 10-1 to 10-L, attenuation adding units 11-1 to 11-L, surface area element adding units 12-1 to 12-L, angle weighting adding units 13-1 to 13-L, and angle calculation unit 20 provided in the object-based acoustic rendering device 1 are each realized by having the CPU execute a program that describes these functions.

また、オブジェクトベース音響レンダリング装置２，３に備えたＨＰＦ１０－１～１０－Ｌ、減衰付加部１１－１～１１－Ｌ、面積要素付加部１２－１～１２－Ｌ、角度重み付加部１３－１～１３－Ｌ、ＶＢＡＰ係数乗算部１４－１～１４－Ｌ、角度重み付加部１５－１～１５－Ｌ、加算部１６－１～１６－Ｌ、角度算出部２０及びＶＢＡＰ係数算出部２１の各機能は、これらの機能を記述したプログラムをＣＰＵに実行させることによりそれぞれ実現される。 Furthermore, the functions of the HPFs 10-1 to 10-L, attenuation adding units 11-1 to 11-L, surface area element adding units 12-1 to 12-L, angle weighting adding units 13-1 to 13-L, VBAP coefficient multipliers 14-1 to 14-L, angle weighting adding units 15-1 to 15-L, adders 16-1 to 16-L, angle calculation unit 20, and VBAP coefficient calculation unit 21 provided in the object-based acoustic rendering devices 2 and 3 are each realized by having the CPU execute a program that describes these functions.

これらのプログラムは、前記記憶媒体に格納されており、ＣＰＵに読み出されて実行される。また、これらのプログラムは、磁気ディスク（フロッピー（登録商標）ディスク、ハードディスク等）、光ディスク（ＣＤ－ＲＯＭ、ＤＶＤ等）、半導体メモリ等の記憶媒体に格納して頒布することもでき、ネットワークを介して送受信することもできる。 These programs are stored in the storage medium and are read and executed by the CPU. In addition, these programs can be distributed by storing them on storage media such as magnetic disks (floppy disks, hard disks, etc.), optical disks (CD-ROMs, DVDs, etc.), and semiconductor memories, and can also be transmitted and received via a network.

１，２，３オブジェクトベース音響レンダリング装置
９信号増幅部
１０ＨＰＦ（ハイパスフィルタ）
１１減衰付加部
１２面積要素付加部
１３，１５角度重み付加部
１４ＶＢＡＰ係数乗算部
１６加算部
２０角度算出部
２１ＶＢＡＰ係数算出部
３０，３２算出部
３１スピーカ決定部
１００受音点
１０１，１０４仮想音源
１０２，１０６スピーカ
１０３音源
１０５スピーカアレイ
ｒ_nS 仮想音源方向の単位ベクトル
ｒ_n1，ｒ_n2，ｒ_n3 スピーカ方向の単位ベクトル
ｇ_l，ｇ_n1，ｇ_n2，ｇ_n3 係数
ｒ_S 仮想音源座標
ｒ_l スピーカ座標
Ｃ１，Ｃ２領域
Ｓ音源信号
Ｌスピーカの個数
ｌスピーカの番号（系統の番号）
ΔＳ_l スピーカの面積要素
ｎ（ｒ_l）スピーカの境界面内向き法線ベクトル
ｋ波数
θ_l 成す角（仮想音源１０４位置からスピーカ１０６－ｌ位置を結んだベクトル（ｒ_l－ｒ_S）と境界面内向き法線ベクトルｎ（ｒ_l）との成す角）
ｈ（ｒ_l，ｒ_S）ＨＰＦ
ｗ_C（θ_l），ｗ_S（θ_l）角度重み
１／||ｒ_l－ｒ_S||²，ｇ（ｒ_l，ｒ_S）減衰係数
ｄ（ｒ_l）駆動信号 1, 2, 3 Object-based acoustic rendering device 9 Signal amplifier 10 HPF (high pass filter)
REFERENCE SIGNS LIST 11 attenuation adding section 12 area element adding section 13, 15 angle weight adding section 14 VBAP coefficient multiplying section 16 addition section 20 angle calculation section 21 VBAP coefficient calculation section 30, 32 calculation section 31 speaker determination section 100 sound receiving point 101, 104 virtual sound source 102, 106 speaker 103 sound source 105 speaker array r _nS unit vector r _n1 , r _n2 , r _n3 in virtual sound source direction unit vector g _l , g _n1 , g _n2 , g _n3 in speaker direction coefficient r _S virtual sound source coordinate r _l speaker coordinates C1, C2 area S sound source signal L number of speakers l speaker number (system number)
Angle formed by ΔS _l, surface area element n (r _l ) of speaker, inward normal vector k of speaker's boundary surface, and wave number θ _l (angle formed by vector (r _l -r _S ) connecting the position of virtual sound source 104 to the position of speaker 106-l and inward normal vector n (r _l ) of the boundary surface)
h(r _l , r _s ) HPF
w _C (θ _l ), w _S (θ _l ) Angular weight 1/||r _l -r _S || ² , g(r _l , r _S ) Damping coefficient d(r _l ) Drive signal

Claims

1. An object-based audio rendering device that generates drive signals for a plurality of speakers arranged in a listening environment by performing rendering on a sound source object arranged at a position described in audio metadata,
Let L be the number of the plurality of speakers, l (=1, ..., L) be the number of the speakers, the lth speaker be the lth speaker, the coordinates of the virtual sound source from which a sound source signal is output be virtual sound source coordinates r _S , the coordinates of the lth speaker be speaker coordinates r _l , the inward normal vector of the boundary surface at the position of the lth speaker be n(r _l ), the area element centered on the position of the lth speaker be ΔS _l , and the drive signal for the lth speaker be d(r _l ),
an angle calculation unit that calculates an angle θ l between a vector from the position of the virtual sound source coordinate r _S to the position of the speaker coordinate r _l of the first speaker and the boundary surface inward normal vector n(r _l ₎ of the first speaker, for each of the plurality of speakers;
a signal amplifier, a high pass filter, an attenuation adding unit, an area element adding unit, and a first angle weighting unit, each of which corresponds to one of the plurality of speakers;
The signal amplifier unit includes:
amplifying the sound source signal by a constant;
The HPF is
setting an HPFh(r _l , r _S ) based on the virtual sound source coordinates r _S and the speaker coordinates r _l of the first speaker, and performing a filtering process on the signal amplified by the signal amplifier using the HPFh(r _l , r _S );
The attenuation adding unit is
an absolute value of a vector from the position of the virtual sound source coordinate r _S to the position of the speaker coordinate r _l of the first speaker is obtained, the reciprocal of the result of squaring the absolute value is set as an attenuation coefficient 1/||r _l -r _S || ² , and the signal that has been subjected to the filtering process is multiplied by the attenuation coefficient 1/||r _l -r _S || ² ;
The surface area element adding unit
Multiplying the signal multiplied by the attenuation adding unit by the area element ΔS _l ;
The first angle weighting unit
an angle weight wC(θl) is set based on the angle _θl of the first speaker calculated by the angle calculation unit, the angle weight wC(θl) taking a value closer to 1 as the absolute value of the angle _θl approaches 0°, the value closer to 0 as the absolute value of the angle _θl approaches 90°, and the value 0 when the absolute value of the angle _θl is less than 0° or greater than 90°, in the range from 0 to 1; and the drive signal d( _rl ) for the first speaker is generated by multiplying the signal multiplied by the area element addition unit by _{the angle weight wC} ₍ _θl ₎ .

2. The object-based audio rendering apparatus of claim 1,
Furthermore, a VBAP coefficient calculation unit that identifies three speakers including the virtual sound source as three speakers that are targets of VBAP (Vector Based Amplitude Panning ₎ based on a unit vector of a virtual sound source direction toward the virtual sound source coordinate _rS and a unit vector of a speaker direction toward the speaker coordinate _r1 for all speakers, obtains unit vectors _rn1 , _rn2 , _rn3 of the three speakers that are targets of VBAP, calculates coefficients _g1 = _gn1 , _gn2 , _gn3 corresponding to the three speakers that are targets of VBAP based on the unit vector of the virtual sound source direction toward the virtual sound source coordinate _rS and the unit vectors rn1, _rn2 , _rn3 of the three speakers that are targets of VBAP, sets coefficient _g1 = 0 corresponding to speakers other than the three speakers that are targets of VBAP among all the speakers, and outputs the coefficient _g1 ;
a VBAP coefficient multiplier, a second angle weighting unit, and an adder, each corresponding to one of the plurality of speakers;
The VBAP coefficient multiplication unit
multiplying the signal multiplied by the area element adding unit by the coefficient _g output by the VBAP coefficient calculation unit;
The second angle weighting unit is
based on the angle _θl of the first speaker calculated by the angle calculation unit, an angle weight ws(θl) is set, which takes a value closer to 0 as the absolute value of the angle _θl approaches 0°, takes a value closer to 1 as the absolute value of the angle _θl approaches 90°, and takes a value of 0 when the absolute value of the angle _θl is less than 0° or greater than 90°, in a range from ₀ to 1; and generates a second drive signal by multiplying the signal multiplied by the VBAP coefficient multiplication unit by the angle weight _ws ( _θl ) _;
The first angle weighting unit
an angle weight _wC ( _θl ) is set, and a first drive signal is generated by multiplying the signal multiplied by the area element adding unit by the angle weight _wC ( _θl ), and the adder unit generates the drive signal d( _rl ) for the first speaker by adding the first drive signal and the second drive signal.

3. The object-based audio rendering apparatus of claim 2,
The VBAP coefficient multiplication unit
2. An object-based sound rendering apparatus comprising: an input of the sound source signal instead of the signal multiplied by the area element adding unit; and a multiplication unit for multiplying the sound source signal by the coefficient g _l .

An object-based acoustic rendering apparatus according to any one of claims 1 to 3,
The HPF is
With a preset parameter as α, a preset filter order as N, and n=0, 1, . . . , N, the following equation is used:

Set the HPF coefficients h _n (r _l , r _S ) in
Let x[t] and y[t] be the input and output of the HPF at time t, and the input/output characteristics are expressed by the following equation:

and filtering the signal amplified by the signal amplifier using the HPF coefficients h _n (r ₁ , r _s ) so that:

An object-based acoustic rendering apparatus according to any one of claims 1 to 4,
The attenuation adding unit is
With preset parameters as β ₁ and β ₂ , the following equation is used:

and multiplying the filtered _signal by the attenuation coefficient g(r _l , r _S ₎ .

A program for causing a computer to function as an object-based acoustic rendering device according to any one of claims 1 to 5.