JP2982766B2

JP2982766B2 - Sound source direction estimation method and apparatus

Info

Publication number: JP2982766B2
Application number: JP9302786A
Authority: JP
Inventors: 一郎辻
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1997-11-05
Filing date: 1997-11-05
Publication date: 1999-11-29
Anticipated expiration: 2017-11-05
Also published as: JPH11142499A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a sound source direction estimation method and apparatus, and more particularly to a sound source direction estimation method and apparatus that estimate the direction of a sound source from a cross-correlation function calculated using signals received by a plurality of microphones.

[0002]

[Prior Art] FIG. 3 is a block diagram of a conventional sound source direction estimating apparatus (hereinafter, the first conventional example) described in The Journal of the Acoustical Society of Japan, vol. 13, no. 4, pp. 241-252, July 1992.

[0003] In FIG. 3, signals input from a sound source 115 to microphones 101 and 102 are amplified by amplifiers 103 and 104 and converted into discrete signals by A/D converters 105 and 106. Pre-emphasis processing units 107 and 108 emphasize the high-frequency components of the discrete signals, and autoregressive coefficient calculation units 109 and 110 use the result to obtain autoregressive filter coefficients. Inverse filter processing units 111 and 112 convolve these filter coefficients with the discrete signals, and their outputs are whitened signals. Cross-correlation function calculation unit 113 obtains the cross-correlation function of the whitened signals, and direction detection unit 114 estimates, as the sound source direction, the direction corresponding to the time difference that maximizes the cross-correlation function.

[0004] FIG. 4 is a block diagram of a sound source direction estimating apparatus (hereinafter, the second conventional example) described in IEICE Technical Report, DSP96-77, SP96-52, pp. 23-29, September 1996.

[0005] In FIG. 4, signals input from a sound source 128 to microphones 116 and 117 are amplified by amplifiers 118 and 119 and converted into discrete signals by A/D converters 120 and 121. Zero-crossing time series generation units 122 and 123 convert each discrete signal into a zero-crossing time series whose value is 1 where the signal level reaches 0 with a positive slope, -1 where it reaches 0 with a negative slope, and 0 everywhere else.
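For illustration only (this sketch is not part of the patent text), the zero-crossing time series of the second conventional example could be written in Python as follows. The function name is hypothetical, and assigning each crossing to the first sample on the new side of zero is an assumption, since the text does not say how a crossing is located between samples.

```python
def zero_crossing_series(samples):
    """Return 1 at upward zero crossings, -1 at downward ones, 0 elsewhere.

    Assumption: a crossing is attributed to the first sample on the new
    side of zero; the source text does not specify this detail.
    """
    out = [0] * len(samples)
    for n in range(1, len(samples)):
        prev, cur = samples[n - 1], samples[n]
        if prev < 0 <= cur:
            out[n] = 1    # level reached zero with a positive slope
        elif prev > 0 >= cur:
            out[n] = -1   # level reached zero with a negative slope
    return out


series = zero_crossing_series([-2, -1, 1, 2, 1, -1, -2, 1])
```

Most entries of the result are 0, which is the thinning of samples that paragraph [0013] identifies as the cause of the reduced resolution.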

[0006] Cross-correlation function calculation unit 124 obtains the cross-correlation function of these zero-crossing time series, and normalized power calculation unit 125 normalizes its power. Time average calculation unit 126 averages this normalized power over time, and direction detection unit 127 estimates, as the sound source direction, the direction corresponding to the time difference that gives the maximum of the averaged value.

[0007]

[Problems to Be Solved by the Invention] However, the prior art described above has the following drawbacks.

[0008] The first problem is that the cross-correlation calculation in the first conventional example requires high arithmetic precision, so the hardware scale is large.

[0009] The reason is that the cross-correlation function is calculated directly from the multi-bit discrete signals quantized by A/D converters 105 and 106.

[0010] The second problem is that the first conventional example is vulnerable to noise.

[0011] The reason is that direction detection unit 114 estimates the sound source direction from the cross-correlation function calculated by cross-correlation function calculation unit 113 without any time averaging.

[0012] The third problem is that the second conventional example estimates the sound source direction with low accuracy.

[0013] The reason is that the zero-crossing time series generated by zero-crossing time series generation units 122 and 123 is nonzero only at the zero-crossing points of the input signal, which is equivalent to thinning out the input signal samples. In other words, the resolution is reduced.

[0014] The present invention has been made in view of the above circumstances, in order to eliminate the drawbacks inherent in the conventional techniques. Accordingly, an object of the present invention is to provide a novel sound source direction estimation method and apparatus that require only low arithmetic precision and a small hardware scale.

[0015] Another object of the present invention is to provide a novel sound source direction estimation method and apparatus having excellent noise immunity.

[0016] Still another object of the present invention is to provide a novel sound source direction estimation method and apparatus capable of estimating the sound source direction with high accuracy.

[0017]

[Means for Solving the Problems] To achieve the above objects, the sound source direction estimating apparatus according to the present invention comprises: a first code time sequence generation unit (7 in FIG. 1) that receives a first audio signal and generates a code time sequence, that is, a sequence whose value is "1" when the signal level is positive, "-1" when it is negative, and "0" when it is 0; a second code time sequence generation unit (8 in FIG. 1) that receives a second audio signal and generates a code time sequence in the same way; a first cross-correlation function calculation unit (9 in FIG. 1) that calculates a correlation using the outputs of the first and second code time sequence generation units; a first normalized power calculation unit (10 in FIG. 1) that normalizes the output of the first cross-correlation function calculation unit; a first time average calculation unit (11 in FIG. 1) that averages the output of the first normalized power calculation unit; and a first direction detection unit (12 in FIG. 1) that estimates the sound source direction using the output of the first time average calculation unit.

[0018]

[Operation] In the present invention, the input signal is whitened and then converted into a code time sequence that takes one of three values, "+1", "-1" or "0". This lowers the arithmetic precision required and allows the hardware scale to be reduced.

[0019] In the present invention, the normalized power of the cross-correlation function is averaged over time before the sound source direction is estimated, so that noise does not cause the direction estimate to go wrong.

[0020] Furthermore, the present invention estimates the direction from a code time sequence rather than a zero-crossing time series. The same number of signal samples as in the input signal can therefore be used, and the estimation accuracy does not deteriorate through thinning.

[0021]

[Embodiments of the Invention] Preferred embodiments of the present invention will now be described in detail with reference to the drawings.

[0022] [Description of Configuration] FIG. 1 is a block diagram showing a first embodiment of the present invention.

[0023] Referring to FIG. 1, microphones 1 and 2 acquire, as input signals, the signals X₀(t) and Y₀(t) arriving from sound source 13, whose direction is to be estimated. These input signals are amplified by amplifiers 3 and 4, respectively, and quantized by A/D converters 5 and 6 into discrete signals. Code time sequence generation units 7 and 8 convert each discrete signal into a code time sequence whose value is "1" when the signal level is positive, "-1" when it is negative, and "0" when it is 0. Cross-correlation function calculation unit 9 obtains the cross-correlation function of these code time sequences.

[0024] The cross-correlation function obtained by cross-correlation function calculation unit 9 is converted into a normalized power by normalized power calculation unit 10 and averaged over time by time average calculation unit 11. Direction detection unit 12 estimates, as the sound source direction, the direction corresponding to the time difference that maximizes the time-averaged normalized power.

[0025] [Description of Operation] Next, the operation of the present invention will be described with reference to FIG. 1.

[0026] In FIG. 1, let X₀(t) be the signal input from sound source 13 to microphone 1 and Y₀(t) the signal input from sound source 13 to microphone 2. Input signal X₀(t) is amplified by amplifier 3, quantized by A/D converter 5, and converted into the discrete signal X₁(n). Input signal Y₀(t) is amplified by amplifier 4, quantized by A/D converter 6, and converted into the discrete signal Y₁(n).

[0027] Next, code time sequence generation unit 7 converts the discrete signal X₁(n) into the code time sequence x(n), whose value is "1" when the signal level is positive, "-1" when it is negative, and "0" when it is 0. The code time sequence x(n) for the discrete signal X₁(n) is defined by Equation 1.

[0028] [Equation 1]

[0029] x(n) = 1 if X₁(n) > 0; x(n) = 0 if X₁(n) = 0; x(n) = -1 if X₁(n) < 0
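As an illustrative sketch only (not part of the patent text), the mapping that paragraph [0027] describes for Equation 1 fits in one line of Python. The function name is hypothetical.

```python
def code_time_sequence(samples):
    """Map each sample to 1 if positive, -1 if negative, 0 if zero."""
    # (s > 0) and (s < 0) are booleans; their difference is the sign as an int.
    return [(s > 0) - (s < 0) for s in samples]


x = code_time_sequence([3, -2, 0, 0.5, -7])
```

Every input sample yields one output sample, so the sequence keeps the length of the discrete signal while needing only three levels per sample.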

[0030] The discrete signal Y₁(n) is converted into a code time sequence by code time sequence generation unit 8 in the same way as X₁(n). The code time sequence y(n) for the discrete signal Y₁(n) is defined in accordance with Equation 1. Cross-correlation function calculation unit 9 obtains the cross-correlation function R_xy(k) of the code time sequences x(n) and y(n) by Equation 2.

[0031] [Equation 2]

[0032]
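The body of Equation 2 does not appear in this text. Purely as a hedged sketch, a cross-correlation of two code time sequences could be computed as below, assuming the unnormalized form R_xy(k) = Σ x(n)·y(n+k) over non-negative lags. Both the form of the sum and the function name are assumptions, not the patent's stated formula.

```python
def cross_correlation(x, y, max_lag):
    """R_xy(k) = sum over n of x(n) * y(n + k), for k = 0 .. max_lag.

    Assumed form: Equation 2 itself is not reproduced in this text.
    With inputs limited to -1, 0 and 1, every product is also -1, 0 or 1,
    so the sum can be kept in a small integer counter.
    """
    n_samples = min(len(x), len(y))
    result = []
    for k in range(max_lag + 1):
        acc = 0
        for n in range(n_samples - k):
            acc += x[n] * y[n + k]
        result.append(acc)
    return result


x = [1, -1, 1, 1, -1, -1, 1, -1]
y = [0, 0] + x[:-2]              # y is x delayed by two samples
r = cross_correlation(x, y, 3)   # the peak is expected at lag 2
```

Because the operands are three-valued, no multi-bit multiplier is needed, which is the hardware saving that paragraph [0018] points to.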

[0033] In principle, the incidence direction of the sound wave (the sound source direction) θ (rad) can be obtained from Equation 3.

[0034] [Equation 3] θ = sin⁻¹(τc/M)
Here τ (s) is the difference between the time at which signal X₀(t) reaches microphone 1 and the time at which signal Y₀(t) reaches microphone 2, M (m) is the spacing between the two microphones, and c (m/s) is the speed of sound. In practice, because the correlation is calculated from the code time sequences x(n) and y(n) sampled at frequency fs (Hz), the time difference between the signals received at the two microphones is expressed as the corresponding number of samples k, where k is an integer. The time difference τ (s) actually obtained is therefore given by Equation 4.

[0035] [Equation 4] τ = k/fs
Substituting Equation 4 into Equation 3 gives Equation 5.

[0036] [Equation 5] θ = sin⁻¹((c/M) × (k/fs))
The angle θ in Equation 5 is expressed in radians. To express it in degrees, Equation 6, obtained by multiplying Equation 5 by 180/π, must be used.

[0037] [Equation 6] θ = (180/π) sin⁻¹((c/M) × (k/fs))
The time difference τ (s), on the other hand, is largest when the sound source lies directly to the side (90 degrees), on the extension of the straight line through the two microphones. The number of samples k_max corresponding to this maximum can therefore be obtained from Equation 7.
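As a small worked illustration (not part of the patent text), Equation 6 can be evaluated in Python as follows. The function name, the default speed of sound of 340 m/s, and the example sampling frequency and microphone spacing are all assumptions chosen for the example.

```python
import math


def lag_to_angle_deg(k, fs, mic_spacing, speed_of_sound=340.0):
    """Equation 6: theta = (180/pi) * asin((c/M) * (k/fs)), in degrees."""
    ratio = (speed_of_sound / mic_spacing) * (k / fs)
    if abs(ratio) > 1.0:
        # The lag is longer than sound needs to travel between the microphones.
        raise ValueError("lag exceeds the physical maximum M/c")
    return math.degrees(math.asin(ratio))


# Hypothetical setup: 16 kHz sampling, microphones 0.17 m apart.
angle = lag_to_angle_deg(4, 16000, 0.17)   # (c/M)*(k/fs) is about 0.5
```

Since k is an integer, only a finite set of angles can be returned, and the steps between them widen as the lag approaches its maximum.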

[0038] [Equation 7] k_max ≤ fs × (M/c)
That is, when discrete signals are used as input, the largest sound source angle that can be detected is generally smaller than 90 degrees. When the direction of arrival is at or beyond the angle corresponding to k_max, the sound source direction cannot be obtained directly from Equation 6. The cross-correlation function is therefore calculated up to k_max+1 and k_max+2, and if the peak lies at k_max+1, the direction of arrival is judged to be 90 degrees.
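To make the bound of Equation 7 concrete, the following sketch (an illustration, not part of the patent text) computes k_max and the largest angle that Equation 6 can return directly. The function name, the 340 m/s speed of sound, and the 16 kHz / 0.2 m setup are assumptions.

```python
import math


def max_lag_samples(fs, mic_spacing, speed_of_sound=340.0):
    """Largest integer lag k_max satisfying k_max <= fs * M / c (Equation 7)."""
    return math.floor(fs * mic_spacing / speed_of_sound)


fs, spacing = 16000, 0.2   # hypothetical: 16 kHz sampling, 20 cm spacing
k_max = max_lag_samples(fs, spacing)

# Largest angle Equation 6 can yield directly, reached at k = k_max.
largest_angle = math.degrees(math.asin((340.0 / spacing) * (k_max / fs)))

# Lags at which the cross-correlation is evaluated: 0 .. k_max + 2.
lags = list(range(k_max + 3))
```

With these example values fs × M / c is about 9.4, so k_max is 9 and the largest directly resolvable angle is roughly 73 degrees, which is why the two extra lags are needed to recognize a source at 90 degrees.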

[0039] Next, normalized power calculation unit 10 obtains the normalized power p(k) from the cross-correlation function R_xy(k) using Equation 8.

[0040] [Equation 8]

[0041]

[0042] The power p(k) obtained from Equation 8 is defined for the code time sequences x(n), y(n) (n = 0, 1, 2, ..., N-1), but N cannot be made large because speech signals are non-stationary. Therefore, with L denoting the total length of the speech data, the data are divided into F frames of N samples each, with consecutive frames overlapping by u samples, and the average of the F normalized powers P_f(k) (f = 0, 1, ..., F-1), one for each frame, is used in place of p(k). Time average calculation unit 11 obtains the average C_p(k) (k = 0, 1, 2, ..., k_max, k_max+1, k_max+2) of P_f(k) by Equation 9.

[0043] [Equation 9]

[0044]

[0045] P_f(k) denotes the power of the f-th frame and is obtained from x(n), y(n) (n = f(N-u), f(N-u)+1, f(N-u)+2, ..., f(N-u)+N-1) using Equations 7 and 8. Here F is the total number of frames (analysis intervals) used in the averaging, and N-u is the number of samples of the code time sequences x(n), y(n) newly input in each frame.
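The framing and averaging described above can be sketched in Python as follows (an illustration only, not part of the patent text). Frame f starts at sample f(N-u), and C_p(k) is taken as the plain mean of P_f(k) over the F frames, as the text states. The per-frame power used here, R_xy(k)² divided by the product of the two frame energies, is an assumed stand-in because the body of Equation 8 does not appear in this text. All function names are hypothetical.

```python
def frame_starts(total_len, frame_len, overlap):
    """Start indices f * (N - u) of every full frame of N samples."""
    hop = frame_len - overlap
    if hop <= 0:
        raise ValueError("overlap must be smaller than the frame length")
    return list(range(0, total_len - frame_len + 1, hop))


def normalized_power(x, y, max_lag):
    """Assumed stand-in for Equation 8: R_xy(k)^2 / (sum x^2 * sum y^2)."""
    energy = sum(v * v for v in x) * sum(v * v for v in y)
    powers = []
    for k in range(max_lag + 1):
        r = sum(x[n] * y[n + k] for n in range(len(x) - k))
        powers.append((r * r) / energy if energy else 0.0)
    return powers


def time_averaged_power(x, y, frame_len, overlap, max_lag):
    """C_p(k): mean over the F frames of the per-frame power P_f(k)."""
    starts = frame_starts(min(len(x), len(y)), frame_len, overlap)
    if not starts:
        raise ValueError("input is shorter than one frame")
    totals = [0.0] * (max_lag + 1)
    for s in starts:
        p_f = normalized_power(x[s:s + frame_len], y[s:s + frame_len], max_lag)
        totals = [t + p for t, p in zip(totals, p_f)]
    return [t / len(starts) for t in totals]


x = [1, -1, 1, 1, -1, -1, 1, -1, 1, 1, -1, 1]
c_p = time_averaged_power(x, x, frame_len=8, overlap=4, max_lag=3)
```

Keeping each frame short respects the non-stationarity of speech, while the average across frames suppresses peaks that noise produces in only a single frame.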

[0046] Finally, direction detection unit 12 finds the lag k1 that maximizes the time average C_p(k) and obtains the corresponding estimated angle θ, in degrees, from Equation 10.

[0047] [Equation 10]

[0048]
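The body of Equation 10 does not appear in this text. As a hedged sketch only, the direction detection step could be written as below, assuming Equation 10 is Equation 6 evaluated at the peak lag k1 and applying the rule from paragraph [0038] that a peak at k_max+1 means 90 degrees. The text does not say what to do when the peak falls at k_max+2, so the sketch returns None there. The function name and the 340 m/s speed of sound are assumptions.

```python
import math


def detect_direction_deg(c_p, k_max, fs, mic_spacing, speed_of_sound=340.0):
    """Pick the lag k1 with the largest C_p(k) and convert it to degrees."""
    k1 = max(range(len(c_p)), key=lambda k: c_p[k])
    if k1 <= k_max:
        # Clamp against floating-point overshoot just above 1.0.
        ratio = min(1.0, (speed_of_sound / mic_spacing) * (k1 / fs))
        return math.degrees(math.asin(ratio))
    if k1 == k_max + 1:
        return 90.0   # rule stated in paragraph [0038]
    return None       # peak at k_max + 2: not specified in the text


# Hypothetical setup: fs = 16 kHz, M = 0.2 m, hence k_max = 9 and lags 0..11.
c_p = [0.1] * 12
c_p[4] = 0.9
angle = detect_direction_deg(c_p, 9, 16000, 0.2)
```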

[0049]

[Example] FIG. 2 is a block diagram showing a second embodiment of the present invention.

[0050] Referring to FIG. 2, the second embodiment differs from the first embodiment shown in FIG. 1 in that the signals supplied from A/D converters 18 and 19 to code time sequence generation units 24 and 25 are first whitened by an inverse filter that uses autoregressive coefficients. For this purpose, autoregressive coefficient calculation units 20 and 21 and inverse filter processing units 22 and 23 are provided. The rest of the configuration and operation is the same as in FIG. 1, so only these differences are described.

[0051] Autoregressive coefficient calculation units 20 and 21 obtain the filter coefficients for the inverse filter from the discrete signals X₁(n) and Y₁(n). Inverse filter processing units 22 and 23 convolve these filter coefficients with the corresponding discrete signals to generate whitened signals.

[0052] Next, autoregressive coefficient calculation unit 20 obtains the autoregressive coefficients α_k (k = 1, 2, ..., Q) from the discrete signal X₁(n). The autoregressive coefficients can be obtained from the autocorrelation function of the input signal given by Equation 11, using what is known as the Levinson-Durbin method. The Levinson-Durbin method is described in detail in Proceedings of the IEEE, vol. 63, no. 4, pp. 561-580, April 1975, so only an outline is given here.

[0053] The autoregressive coefficients α_k^(Q) are obtained using the autocorrelation function R_xx(k) (k = 0, 1, 2, ..., Q-1) given by Equation 11. Here Q is the order of the autoregressive coefficients, σ_q² is the variance, and C_q is the reflection coefficient.

[0054] [Equation 11]

[0055]

[0056] The initial values are set as shown in Equation 12.

[0057] [Equation 12]
σ₀² = R_xx(0)
α₀^(0) = 1
α₁^(1) = C₁ = -R_xx(1)/σ₀²
σ₁² = σ₀²(1 - C₁²)
q = 1
Next, the variable q is updated by the recurrence in Equation 13, and the desired Q-th order autoregressive coefficients α_k^(Q) are obtained step by step.

[0058] [Equation 13]

[0059]
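The body of Equation 13 does not appear in this text. The sketch below (an illustration, not part of the patent text) is the textbook Levinson-Durbin recursion written with the sign convention of Equation 12, so that its first step reproduces C₁ = -R_xx(1)/σ₀². Whether it matches Equation 13 term by term is an assumption. The function name is hypothetical, and the sketch takes autocorrelation values for lags 0 through Q as input.

```python
def levinson_durbin(r, order):
    """Return ([1, a_1, ..., a_Q], residual variance) from r[0..Q].

    Assumed recursion, consistent with the initial values of Equation 12:
      C_{q+1}       = -(sum_{k=0..q} a_k * r[q+1-k]) / sigma_q^2
      a_k (new)     = a_k + C_{q+1} * a_{q+1-k}   for k = 1..q
      a_{q+1} (new) = C_{q+1}
      sigma_{q+1}^2 = sigma_q^2 * (1 - C_{q+1}^2)
    """
    if r[0] <= 0:
        raise ValueError("r[0] must be positive")
    a = [1.0]              # alpha_0 = 1
    sigma2 = float(r[0])   # sigma_0^2 = R_xx(0)
    for q in range(order):
        acc = sum(a[k] * r[q + 1 - k] for k in range(q + 1))
        c = -acc / sigma2  # reflection coefficient C_{q+1}
        a = [1.0] + [a[k] + c * a[q + 1 - k] for k in range(1, q + 1)] + [c]
        sigma2 *= (1.0 - c * c)
    return a, sigma2


# Autocorrelation of a first-order process with correlation 0.5 per lag.
coeffs, variance = levinson_durbin([1.0, 0.5, 0.25], 2)
```

For this input the second-order coefficient comes out as zero, which is expected because a first-order model already explains the autocorrelation.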

[0060] Using the same calculation as autoregressive coefficient calculation unit 20, autoregressive coefficient calculation unit 21 obtains the autoregressive coefficients β_k (k = 1, 2, ..., Q) for the discrete signal Y₁(n). Inverse filter processing unit 22 convolves the autoregressive coefficients α_k (k = 1, 2, ..., Q) obtained by autoregressive coefficient calculation unit 20 with the discrete signal X₁(n) according to Equation 14 to obtain the whitened signal e_X(n).

[0061] [Equation 14]

[0062]
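The body of Equation 14 does not appear in this text. As a hedged sketch, the inverse (whitening) filter could be written as the convolution e(n) = Σ_{k=0..Q} α_k·X₁(n-k) with α₀ = 1, which agrees with the initial value α₀ = 1 in Equation 12. The exact form, the treatment of samples before the start of the signal as zero, and the function name are assumptions.

```python
def inverse_filter(samples, coeffs):
    """e(n) = sum_{k=0..Q} coeffs[k] * samples[n - k], with coeffs[0] == 1."""
    out = []
    for n in range(len(samples)):
        acc = 0.0
        for k, a_k in enumerate(coeffs):
            if n - k >= 0:   # samples before the start are taken as zero
                acc += a_k * samples[n - k]
        out.append(acc)
    return out


# Signal built as s(n) = 0.5 * s(n-1) + e(n) from the excitation e = [1, 0, 0, 2, 0].
signal = [1.0, 0.5, 0.25, 2.125, 1.0625]
residual = inverse_filter(signal, [1.0, -0.5])
```

In this example the filter recovers the excitation exactly. The residual is what would then be reduced to a code time sequence by taking the sign of each sample.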

[0063] Likewise, inverse filter processing unit 23 convolves the autoregressive coefficients β_k (k = 1, 2, ..., Q) obtained by autoregressive coefficient calculation unit 21 with the discrete signal Y₁(n), using the same calculation as Equation 14, to obtain the whitened signal e_Y(n). Code time sequence generation units 24 and 25 convert the whitened signals e_X(n) and e_Y(n) into the code time sequences x(n) and y(n), after which the sound source direction can be estimated by the same procedure as in FIG. 1.

[0064]

[Effects of the Invention] The present invention is configured and operates as described above, and provides the following effects.

[0065] The first effect is that the arithmetic precision required to calculate the cross-correlation function is lowered, so the hardware scale can be reduced.

[0066] The reason is that the signal is converted into a code time sequence whose value is "1" when the signal level is positive, "-1" when it is negative, and "0" when it is 0.

[0067] The second effect is excellent noise immunity.

[0068] The reason is that the power of the cross-correlation function is averaged over time.

[0069] The third effect is that the direction of the sound source can be estimated with high accuracy.

[0070] The reason is that the input signal is whitened.

[Brief description of the drawings]

[FIG. 1] A block diagram showing the first embodiment of the present invention.

[FIG. 2] A block diagram showing the second embodiment of the present invention.

[FIG. 3] A block diagram showing a first example of a conventional sound source direction estimating apparatus.

[FIG. 4] A block diagram showing a second example of a conventional sound source direction estimating apparatus.

[Explanation of symbols]

1, 2, 14, 15: microphones
3, 4, 16, 17: amplifiers
5, 6, 18, 19: A/D converters
7, 8, 24, 25: code time sequence generation units
9, 26: cross-correlation function calculation units
10, 27: normalized power calculation units
11, 28: time average calculation units
12, 29: direction detection units
13, 30: sound sources
20, 21: autoregressive coefficient calculation units
22, 23: inverse filter processing units

Continuation of front page
(58) Fields searched (Int. Cl.⁶, DB name): G01S 3/80 - 3/86; G01S 5/18 - 5/30; G01S 7/52 - 7/64; G01S 15/00 - 15/96; G06F 15/336

Claims

(57) [Claims]

1. A sound source direction estimation method characterized by: extracting the polarities of a plurality of audio signals acquired by a plurality of microphones to generate a plurality of code time sequences in one-to-one correspondence with the audio signals; calculating a cross-correlation of the plurality of code time sequences; calculating a normalized power of the cross-correlation; calculating a time average of the normalized power; and estimating a sound source direction from the time average.

2. The sound source direction estimation method according to claim 1, wherein the code time sequence is represented as "1" when the signal level is positive, "-1" when the signal level is negative, and "0" when the signal level is 0.

3. A sound source direction estimation method characterized by: calculating a plurality of autoregressive coefficients in one-to-one correspondence with a plurality of audio signals acquired by a plurality of microphones; inverse-filtering the plurality of audio signals using the plurality of autoregressive coefficients as coefficients; extracting the polarities of the outputs of the inverse filtering to generate a plurality of code time sequences; calculating a cross-correlation of the plurality of code time sequences; calculating a normalized power of the cross-correlation; calculating a time average of the normalized power; and estimating a sound source direction from the time average.

4. A sound source direction estimating apparatus comprising: a plurality of code time sequence generation units, in one-to-one correspondence with a plurality of audio signals acquired by a plurality of microphones, that extract the polarities of the audio signals; a cross-correlation function calculation unit that receives all outputs of the plurality of code time sequence generation units and calculates a cross-correlation; a normalized power calculation unit that receives the output of the cross-correlation function calculation unit and calculates a normalized power; a time average calculation unit that receives the output of the normalized power calculation unit and calculates the time average of the power; and a direction detection unit that estimates a sound source direction from the output of the time average calculation unit.

5. A sound source direction estimating apparatus comprising: a plurality of autoregressive coefficient calculation units, in one-to-one correspondence with a plurality of audio signals acquired by a plurality of microphones, that calculate autoregressive coefficients of the audio signals; a plurality of inverse filter processing units that inverse-filter the plurality of audio signals using the outputs of the autoregressive coefficient calculation units as coefficients; a plurality of code time sequence generation units that extract the polarities of the outputs of the plurality of inverse filter processing units; a cross-correlation function calculation unit that receives all outputs of the plurality of code time sequence generation units and calculates a cross-correlation; a normalized power calculation unit that receives the output of the cross-correlation function calculation unit and calculates a normalized power; a time average calculation unit that receives the output of the normalized power calculation unit and calculates a time average; and a direction detection unit that estimates a sound source direction from the output of the time average calculation unit.