JP5654955B2

JP5654955B2 - Direct sound extraction device and reverberation sound extraction device

Info

Publication number: JP5654955B2
Application number: JP2011147021A
Authority: JP
Inventors: 橋本　武志; 武志橋本; 哲生渡邉; 笛木　俊宏; 俊宏笛木
Original assignee: Clarion Co Ltd
Current assignee: Faurecia Clarion Electronics Co Ltd
Priority date: 2011-07-01
Filing date: 2011-07-01
Publication date: 2015-01-14
Anticipated expiration: 2031-07-01
Also published as: CN103503066B; JP2013015606A; EP2690623A1; WO2013005550A1; EP2690623B1; CN103503066A; EP2690623A4; US9241214B2; US20140044273A1

Description

本発明は直接音抽出装置および残響音抽出装置に関し、より詳細には、残響音を含んだ入力信号から、直接音を抽出することが可能な直接音抽出装置と、残響音を抽出することが可能な残響音抽出装置とに関する。 The present invention relates to a direct sound extraction device and a reverberation sound extraction device. More specifically, the present invention relates to a direct sound extraction device capable of extracting a direct sound from an input signal including the reverberation sound, and to extract the reverberation sound. The present invention relates to a reverberant sound extraction device.

ホール等の残響が発生しやすい環境で演奏された音楽やスピーチなどを収録した場合には、収録された音響信号に直接音だけでなく残響音が畳み込まれて記録されることが多い。このため、残響音が畳み込まれた音響信号を他の音響環境で再生する場合には、直接音の明瞭度が低下し、再生時に非常に聞きづらくなってしまうおそれがあった。 When music or speech performed in an environment where reverberation is likely to occur such as a hall is recorded, not only direct sound but also reverberation sound is often recorded and recorded in the recorded acoustic signal. For this reason, when an acoustic signal in which reverberant sound is convoluted is reproduced in another acoustic environment, the intelligibility of the direct sound is lowered, and it may be very difficult to hear during reproduction.

また、残響音が畳み込まれたスピーチ音声を用いて、音声認識等を行った場合には、残響音による明瞭度の低下により、スピーチ内容の認識率の低下が生じてしまうという問題があった。 In addition, when speech recognition or the like is performed using speech speech in which reverberant sound is convoluted, there is a problem in that the recognition rate of speech content is reduced due to a decrease in intelligibility due to reverberant sound. .

このように残響音が畳み込まれた音響信号において、残響音を低減する技術が従来より知られている（例えば、特許文献１参照）。この技術を用いることにより、残響音の低減を図って直接音の明瞭化を実現することが可能である。 In the acoustic signal in which the reverberant sound is convoluted as described above, a technique for reducing the reverberant sound has been conventionally known (see, for example, Patent Document 1). By using this technology, it is possible to reduce the reverberant sound and to clarify the sound directly.

特開２０１０−７４５３１号公報JP 2010-74531 A

しかしながら、特許文献１に示した方法では、入力信号において残響音を低減するために、擬似白色化処理、マルチステップ線形予測処理、後部残響予測処理等の多様な信号処理を行う必要があり、多大な処理負担が必要とされている。このため、実際の残響音の低減には、マイクロプロセッサやデジタルシグナルプロセッサ等の高性能なデバイスが必要とされ、コスト等の観点から、特許文献１による方法をそのまま用いることが容易ではないという問題があった。 However, in the method shown in Patent Document 1, it is necessary to perform various signal processing such as pseudo whitening processing, multi-step linear prediction processing, and rear reverberation prediction processing in order to reduce reverberation in the input signal. Processing burden is required. For this reason, in order to reduce actual reverberation, a high-performance device such as a microprocessor or a digital signal processor is required, and from the viewpoint of cost and the like, it is not easy to use the method according to Patent Document 1 as it is. was there.

本発明は、上記問題に鑑みてなされたものであり、残響音が含まれる音響信号において、簡単に直接音または残響音を抽出することが可能な直接音抽出装置および残響音抽出装置を提供することを課題とする。 The present invention has been made in view of the above problems, and provides a direct sound extraction device and a reverberation sound extraction device that can easily extract a direct sound or a reverberation sound from an acoustic signal including a reverberation sound. This is the issue.

上記課題を解決するために、本発明に係る直接音抽出装置は、直接音に残響音が含まれる入力信号に対してフーリエ変換処理を施すフーリエ変換手段と、該フーリエ変換手段によりフーリエ変換処理された前記入力信号の実数と虚数の周波数スペクトルに基づいて、前記入力信号を第１振幅スペクトル信号と位相スペクトル信号とに変換するスペクトル変換手段と、前記第１振幅スペクトル信号に対して予め設定された正規化カットオフ周波数を用いて周波数毎にローパスフィルタリング処理を行うローパスフィルタ手段と、該ローパスフィルタ手段によりローパスフィルタリング処理された第２振幅スペクトル信号のマイナス側の振幅を制限してゼロにする第１リミッタ手段と、該第１リミッタ手段によりマイナス側の振幅の制限が行われた前記第２振幅スペクトル信号を前記第１振幅スペクトル信号から減算することにより第３振幅スペクトル信号を求める第１減算手段と、該第１減算手段により求められた第３振幅スペクトル信号のマイナス側の振幅を制限してゼロにする第２リミッタ手段と、前記位相スペクトル信号と、前記第２リミッタ手段によりマイナス側の振幅の制限が行われた前記第３振幅スペクトル信号とに基づいて、実数と虚数の周波数スペクトルからなる信号を求める逆スペクトル変換手段と、該逆スペクトル変換手段により求められた信号を逆フーリエ変換処理することにより、前記入力信号から前記直接音を抽出した直接音信号を生成する逆フーリエ変換手段とを備えることを特徴とする。 In order to solve the above problems, a direct sound extraction apparatus according to the present invention includes a Fourier transform unit that performs a Fourier transform process on an input signal including a reverberant sound in the direct sound, and a Fourier transform process performed by the Fourier transform unit. Further, spectrum conversion means for converting the input signal into a first amplitude spectrum signal and a phase spectrum signal based on the real and imaginary frequency spectra of the input signal, and preset for the first amplitude spectrum signal Low-pass filter means for performing low-pass filtering processing for each frequency using the normalized cut-off frequency, and first amplitude to limit the negative amplitude of the second amplitude spectrum signal low-pass filtered by the low-pass filter means to zero. The limiter means and the first limiter means limit the amplitude on the minus side. First subtraction means for obtaining a third amplitude spectrum signal by subtracting the second amplitude spectrum signal from the first amplitude spectrum signal, and a negative amplitude of the third amplitude spectrum signal obtained by the first subtraction means Of the real and imaginary numbers based on the second limiter means for limiting the value to zero, the phase spectrum signal, and the third amplitude spectrum signal whose amplitude on the negative side is limited by the second limiter means. Inverse spectrum transforming means for obtaining a signal composed of a frequency spectrum and inverse Fourier transform for generating a direct sound signal obtained by extracting the direct sound from the input signal by subjecting the signal obtained by the inverse spectrum transforming means to inverse Fourier transform processing. Conversion means.

本発明に係る直接音抽出装置は、直接音に残響音が含まれる入力信号をフーリエ変換し、スペクトル変換手段により求められた第１振幅スペクトル信号に対して、予め設定された正規化カットオフ周波数を用いてローパスフィルタリング処理を行うことにより、スペクトル毎に積分された信号（積分信号：第２振幅スペクトル信号）を求める。このようにして積分された信号は、入力信号における時間変化において定常成分をなすスペクトル信号、つまり、残響音信号に該当するものである。 The direct sound extraction apparatus according to the present invention performs a Fourier transform on an input signal including a reverberation sound in the direct sound, and sets a normalized cutoff frequency that is set in advance for the first amplitude spectrum signal obtained by the spectrum conversion means. Is used to obtain a signal integrated for each spectrum (integrated signal: second amplitude spectrum signal). The signal integrated in this way corresponds to a spectrum signal that forms a stationary component in a time change in the input signal, that is, a reverberation signal.

従って、第１減算手段において、第２振幅スペクトル信号を第１振幅スペクトル信号から減算することにより求められる第３振幅スペクトル信号は、入力信号から残響音を減算した信号であり、この処理によって、直接音信号に該当する信号を求めることが可能となる。 Accordingly, the third amplitude spectrum signal obtained by subtracting the second amplitude spectrum signal from the first amplitude spectrum signal in the first subtraction means is a signal obtained by subtracting the reverberation sound from the input signal. A signal corresponding to the sound signal can be obtained.

このため、逆スペクトル変換手段および逆フーリエ変換手段により生成される信号は、入力信号から直接音を抽出した信号となり、直接音に残響音が含まれる入力信号から、簡単に直接音を抽出することが可能となる。 For this reason, the signal generated by the inverse spectrum transforming means and the inverse Fourier transforming means is a signal obtained by extracting the direct sound from the input signal, and the direct sound can be easily extracted from the input signal including the reverberant sound in the direct sound. Is possible.

さらに、正規化カットオフ周波数を調整することにより、入力信号における直接音の抽出時間の調整を行うことが可能である。正規化カットオフ周波数の値を小さくすると、入力信号における直接音の抽出時間が長くなり、非定常音だけでなく定常音を含んだ形で直接音の抽出を行うことが可能となる。定常音を含んだ直接音の抽出を行うことにより、全く定常音を含まない直接音に比べて、音色や聞き易さなどを直接音に付加することが可能となり、聴取者が直接音を聴取した場合において違和感のない音として認識することが可能となる。 Furthermore, it is possible to adjust the extraction time of the direct sound in the input signal by adjusting the normalized cutoff frequency. When the value of the normalized cut-off frequency is reduced, the time for extracting the direct sound from the input signal becomes longer, and the direct sound can be extracted in a form including not only a non-stationary sound but also a stationary sound. By extracting direct sounds that include stationary sounds, it is possible to add timbre, ease of hearing, etc. to the direct sounds, compared to direct sounds that do not include any stationary sounds. In this case, it is possible to recognize the sound as having no sense of incongruity.

また、上述した直接音抽出装置は、前記第２リミッタ手段によりマイナス側の振幅の制限が行われた前記第３振幅スペクトル信号の振幅に対する増幅または減衰を周波数毎に行うことにより、前記第３振幅スペクトル信号に対する重み付けを行う第１ゲイン手段を備え、前記逆スペクトル変換手段が、前記位相スペクトル信号と、前記第１ゲイン手段により重み付けが行われた前記第３振幅スペクトル信号とに基づいて、実数と虚数の周波数スペクトルからなる信号を求めるものであってもよい。 Further, the direct sound extraction device described above performs amplification or attenuation for the amplitude of the third amplitude spectrum signal, on which the amplitude on the negative side is limited by the second limiter unit, for each frequency, thereby the third amplitude. First gain means for weighting the spectrum signal, wherein the inverse spectrum conversion means is based on the phase spectrum signal and the third amplitude spectrum signal weighted by the first gain means, You may obtain | require the signal which consists of an imaginary number frequency spectrum.

上記直接音抽出装置では、第１ゲイン手段が、第２リミッタ手段によりマイナス側の振幅の制限が行われた第３振幅スペクトル信号の振幅に対する増幅または減衰を周波数毎に行うことにより、第３振幅スペクトル信号に対する重み付けを行う。このようにして振幅の増幅または減衰による重み付けを周波数毎に行うことによって、入力信号から直接音を抽出する場合において、残響音が少し含まれた状態の直接音を求めることができる。 In the above direct sound extraction apparatus, the first gain means performs amplification or attenuation for the amplitude of the third amplitude spectrum signal whose amplitude on the negative side is limited by the second limiter means for each frequency. Weighting is performed on the spectrum signal. By performing weighting by amplitude amplification or attenuation in this way for each frequency, when a direct sound is extracted from an input signal, it is possible to obtain a direct sound with a little reverberation sound included.

例えば、直接音に多くの残響音が含まれる（畳み込まれる）場合には、残響音により直接音の聞き取りが困難になる場合もあるが、直接音にわずかに残響音が含まれる場合には、聴取者が聴取した直接音に残響音に伴う音色などが付加されて、聞き取りやすい音になる場合もある。このように、重み付けの調整を行うことにより、抽出される直接音に含まれる残響音の調整を行うことができ、音色などの音質の調整を行うことが可能となる。 For example, if the direct sound contains a lot of reverberant sound (convolved), it may be difficult to hear the direct sound due to the reverberant sound, but if the direct sound contains a slight amount of reverberant sound, In some cases, the direct sound heard by the listener is added with a timbre associated with the reverberant sound to make it easy to hear. As described above, by adjusting the weighting, it is possible to adjust the reverberation sound included in the extracted direct sound, and it is possible to adjust the tone quality such as the timbre.

さらに、本発明に係る残響音抽出装置は、直接音に残響音が含まれる入力信号に対してフーリエ変換処理を施すフーリエ変換手段と、該フーリエ変換手段によりフーリエ変換処理された前記入力信号の実数と虚数の周波数スペクトルに基づいて、前記入力信号を第１振幅スペクトル信号と位相スペクトル信号とに変換するスペクトル変換手段と、前記第１振幅スペクトル信号に対して予め設定された正規化カットオフ周波数を用いて周波数毎にハイパスフィルタリング処理を行うハイパスフィルタ手段と、該ハイパスフィルタ手段によりハイパスフィルタリング処理された第４振幅スペクトル信号のマイナス側の振幅を制限してゼロにする第３リミッタ手段と、該第３リミッタ手段によりマイナス側の振幅の制限が行われた前記第４振幅スペクトル信号を前記第１振幅スペクトル信号から減算することにより第５振幅スペクトル信号を求める第２減算手段と、該第２減算手段により求められた第５振幅スペクトル信号のマイナス側の振幅を制限してゼロにする第４リミッタ手段と、前記位相スペクトル信号と、前記第４リミッタ手段によりマイナス側の振幅の制限が行われた前記第５振幅スペクトル信号とに基づいて、実数と虚数の周波数スペクトルからなる信号を求める逆スペクトル変換手段と、該逆スペクトル変換手段により求められた信号を逆フーリエ変換処理することにより、前記入力信号から前記残響音を抽出した残響音信号を生成する逆フーリエ変換手段とを備えることを特徴とする。 Furthermore, the reverberant sound extraction apparatus according to the present invention includes a Fourier transform unit that performs a Fourier transform process on an input signal that includes a reverberant sound in a direct sound, and a real number of the input signal that has undergone a Fourier transform process by the Fourier transform unit. And a spectrum conversion means for converting the input signal into a first amplitude spectrum signal and a phase spectrum signal based on the imaginary frequency spectrum, and a normalized cutoff frequency preset for the first amplitude spectrum signal. A high-pass filter means for performing high-pass filtering processing for each frequency, a third limiter means for limiting the negative amplitude of the fourth amplitude spectrum signal subjected to high-pass filtering by the high-pass filter means to zero, and the third limiter means The fourth amplitude spectrum in which the amplitude on the negative side is limited by the 3 limiter means. A second subtracting means for obtaining a fifth amplitude spectrum signal by subtracting a torque signal from the first amplitude spectrum signal, and limiting a negative amplitude of the fifth amplitude spectrum signal obtained by the second subtracting means. Based on the fourth limiter means for zeroing, the phase spectrum signal, and the fifth amplitude spectrum signal whose amplitude on the negative side is limited by the fourth limiter means, it is composed of frequency spectra of real and imaginary numbers. An inverse spectrum transforming means for obtaining a signal, and an inverse Fourier transforming means for generating a reverberation sound signal obtained by extracting the reverberation sound from the input signal by performing an inverse Fourier transform process on the signal obtained by the inverse spectrum transforming means. It is characterized by providing.

本発明に係る残響音抽出装置は、直接音に残響音が含まれる入力信号をフーリエ変換し、スペクトル変換手段により求められた第１振幅スペクトル信号に対して、予め設定された正規化カットオフ周波数を用いてハイパスフィルタリング処理を行うことにより、スペクトル毎に微分された信号（微分信号：第４振幅スペクトル信号）を求める。このようにして微分された信号は、入力信号における時間変化において非定常成分をなすスペクトル信号、つまり、直接音信号に該当するものである。 The reverberant sound extraction apparatus according to the present invention performs a Fourier transform on an input signal including a reverberant sound in a direct sound, and sets a normalized cutoff frequency that is set in advance for the first amplitude spectrum signal obtained by the spectrum converting means. Is used to obtain a signal differentiated for each spectrum (differentiated signal: fourth amplitude spectrum signal). The signal thus differentiated corresponds to a spectral signal that forms an unsteady component in the time change in the input signal, that is, a direct sound signal.

従って、第２減算手段において、第４振幅スペクトル信号を第１振幅スペクトル信号から減算することにより求められる第５振幅スペクトル信号は、入力信号から直接音を減算した信号であり、この処理によって残響音信号に該当する信号を求めることが可能となる。 Therefore, the fifth amplitude spectrum signal obtained by subtracting the fourth amplitude spectrum signal from the first amplitude spectrum signal in the second subtracting means is a signal obtained by subtracting the direct sound from the input signal. A signal corresponding to the signal can be obtained.

このため、逆スペクトル変換手段および逆フーリエ変換手段により生成される信号は、入力信号から残響音を抽出した信号となり、直接音に残響音が含まれる入力信号から、簡単に残響音を抽出することが可能となる。 For this reason, the signal generated by the inverse spectrum transform unit and the inverse Fourier transform unit is a signal obtained by extracting the reverberation sound from the input signal, and the reverberation sound can be easily extracted from the input signal in which the reverberation sound is included in the direct sound. Is possible.

さらに、正規化カットオフ周波数を調整することにより、入力信号における残響音の抽出時間の調整を行うことが可能である。正規化カットオフ周波数の値を大きくすると、入力信号における残響音の抽出時間が長くなり、定常音だけでなく非定常音を含んだ形で残響音の抽出を行うことが可能となる。また、正規化カットオフ周波数の値を小さくすると、入力信号における残響音の抽出時間が短くなり、非定常音があまり含まれない残響音の抽出を行うことが可能となる。 Furthermore, by adjusting the normalized cutoff frequency, it is possible to adjust the reverberation sound extraction time in the input signal. When the value of the normalized cut-off frequency is increased, the extraction time of the reverberant sound in the input signal becomes longer, and it becomes possible to extract the reverberant sound including not only the stationary sound but also the unsteady sound. Also, if the value of the normalized cut-off frequency is reduced, the reverberation sound extraction time in the input signal is shortened, and reverberation sound that does not contain much non-stationary sound can be extracted.

さらに、上述した残響音抽出装置は、前記第４リミッタ手段によりマイナス側の振幅の制限が行われた前記第５振幅スペクトル信号の振幅に対する増幅または減衰を周波数毎に行うことにより、前記第５振幅スペクトル信号に対する重み付けを行う第２ゲイン手段を備え、前記逆スペクトル変換手段が、前記位相スペクトル信号と、前記第２ゲイン手段により重み付けが行われた前記第５振幅スペクトル信号とに基づいて、実数と虚数の周波数スペクトルからなる信号を求めるものであってもよい。 Furthermore, the reverberation sound extraction apparatus described above performs amplification or attenuation for the amplitude of the fifth amplitude spectrum signal, on which the amplitude on the negative side is limited by the fourth limiter unit, for each frequency, thereby the fifth amplitude. Second gain means for weighting the spectrum signal, wherein the inverse spectrum conversion means is based on the phase spectrum signal and the fifth amplitude spectrum signal weighted by the second gain means, You may obtain | require the signal which consists of an imaginary number frequency spectrum.

上記残響音抽出装置では、第２ゲイン手段が、第４リミッタ手段によりマイナス側の振幅の制限が行われた第５振幅スペクトル信号の振幅に対する増幅または減衰を周波数毎に行うことにより、第５振幅スペクトル信号に対する重み付けを行う。このようにして振幅の増幅または減衰による重み付けを周波数毎に行うことによって、入力信号から残響音を抽出する場合において、直接音が少し含まれた状態の残響音を求めることができる。このように、重み付けの調整を行うことにより、抽出される残響音に含まれる直接音の調整を行うことができ、音色などの音質の調整を行うことが可能となる。 In the above reverberant sound extraction device, the second gain means performs amplification or attenuation for the amplitude of the fifth amplitude spectrum signal whose amplitude on the negative side is limited by the fourth limiter means for each frequency, thereby obtaining the fifth amplitude. Weighting is performed on the spectrum signal. By performing weighting by amplitude amplification or attenuation for each frequency in this way, when reverberation is extracted from an input signal, it is possible to obtain a reverberation that contains a small amount of direct sound. Thus, by adjusting the weighting, the direct sound included in the extracted reverberation sound can be adjusted, and the tone quality such as the timbre can be adjusted.

本発明に係る直接音抽出装置によれば、直接音に残響音が含まれる入力信号から、簡単に直接音を抽出することができる。また、本発明に係る残響音抽出装置によれば、直接音に残響音が含まれる入力信号から、簡単に残響音を抽出することが可能となる。 According to the direct sound extraction device of the present invention, it is possible to easily extract a direct sound from an input signal in which the direct sound includes a reverberant sound. Moreover, according to the reverberation sound extraction apparatus according to the present invention, it is possible to easily extract a reverberation sound from an input signal in which the direct sound includes the reverberation sound.

実施の形態に係る音響処理装置の概略構成を示したブロック図である。It is the block diagram which showed schematic structure of the sound processing apparatus which concerns on embodiment. 実施の形態に係るＦＦＴ部において、入力信号に対して短時間フーリエ変換処理を行う場合のフーリエ変換長とオーバーラップ長を模式的に示した図である。It is the figure which showed typically the Fourier-transform length and overlap length in the case of performing a short-time Fourier-transform process with respect to an input signal in the FFT part which concerns on embodiment. 実施の形態に係る周波数スペクトル領域フィルタリング部の概略構成を示したブロック図である。It is the block diagram which showed schematic structure of the frequency spectrum domain filtering part which concerns on embodiment. （ａ）は、実施の形態に係るＬＰＦ部における振幅スペクトル毎のフィルタ係数の一例を示したものであり、（ｂ）は、ＨＰＦ部における振幅スペクトル毎のフィルタ係数の一例を示している。(A) shows an example of the filter coefficient for each amplitude spectrum in the LPF unit according to the embodiment, and (b) shows an example of the filter coefficient for each amplitude spectrum in the HPF unit. （ａ）は、実施の形態に係る第１ゲイン部の増幅および減衰の重み付け量の周波数変化の一例を示した図であり、（ｂ）は、第２ゲイン部の重み付け量の周波数変化の一例を示した図である。(A) is the figure which showed an example of the frequency change of the weighting amount of the amplification of the 1st gain part which concerns on embodiment, and (b) is an example of the frequency change of the weighting amount of the 2nd gain part. FIG. 実施の形態に係る周波数スペクトル領域フィルタリング部に入力される入力信号の振幅と、積分信号Ｌｆａ１の振幅と、微分信号Ｌｆａ２の振幅と、直接音信号Ｌｆｄの振幅と、残響音信号Ｌｆｒの振幅との時間変化を例示した第１の図である。The amplitude of the input signal input to the frequency spectrum domain filtering unit according to the embodiment, the amplitude of the integrated signal Lfa1, the amplitude of the differential signal Lfa2, the amplitude of the direct sound signal Lfd, and the amplitude of the reverberant sound signal Lfr It is the 1st figure which illustrated time change. 実施の形態に係る周波数スペクトル領域フィルタリング部に入力される入力信号の振幅と、積分信号Ｌｆａ１の振幅と、微分信号Ｌｆａ２の振幅と、直接音信号Ｌｆｄの振幅と、残響音信号Ｌｆｒの振幅との時間変化を例示した第２の図である。The amplitude of the input signal input to the frequency spectrum domain filtering unit according to the embodiment, the amplitude of the integrated signal Lfa1, the amplitude of the differential signal Lfa2, the amplitude of the direct sound signal Lfd, and the amplitude of the reverberant sound signal Lfr It is the 2nd figure which illustrated time change. 実施の形態に係る周波数スペクトル領域フィルタリング部に入力される入力信号の振幅と、積分信号Ｌｆａ１の振幅と、微分信号Ｌｆａ２の振幅と、直接音信号Ｌｆｄの振幅と、残響音信号Ｌｆｒの振幅との時間変化を例示した第３の図である。The amplitude of the input signal input to the frequency spectrum domain filtering unit according to the embodiment, the amplitude of the integrated signal Lfa1, the amplitude of the differential signal Lfa2, the amplitude of the direct sound signal Lfd, and the amplitude of the reverberant sound signal Lfr It is the 3rd figure which illustrated time change. 実施の形態に係る周波数スペクトル領域フィルタリング部に入力される入力信号の振幅と、積分信号Ｌｆａ１の振幅と、微分信号Ｌｆａ２の振幅と、直接音信号Ｌｆｄの振幅と、残響音信号Ｌｆｒの振幅との時間変化を例示した第４の図である。The amplitude of the input signal input to the frequency spectrum domain filtering unit according to the embodiment, the amplitude of the integrated signal Lfa1, the amplitude of the differential signal Lfa2, the amplitude of the direct sound signal Lfd, and the amplitude of the reverberant sound signal Lfr It is the 4th figure which illustrated time change. 実施の形態に係る残響音抽出装置における入力信号の振幅と、残響音抽出装置において抽出された直接音信号の振幅と、残響音信号の振幅との時間変化を例示した第１の図である。It is the 1st figure which illustrated the time change of the amplitude of the input signal in the reverberation sound extraction device concerning an embodiment, the amplitude of the direct sound signal extracted in the reverberation sound extraction device, and the amplitude of the reverberation sound signal. 実施の形態に係る残響音抽出装置における入力信号の振幅と、残響音抽出装置において抽出された直接音信号の振幅と、残響音信号の振幅との時間変化を例示した第２の図である。It is the 2nd figure which illustrated the time change of the amplitude of the input signal in the reverberation sound extraction device concerning an embodiment, the amplitude of the direct sound signal extracted in the reverberation sound extraction device, and the amplitude of the reverberation sound signal. 実施の形態に係る残響音抽出装置における入力信号の振幅と、残響音抽出装置において抽出された直接音信号の振幅と、残響音信号の振幅との時間変化を例示した第３の図である。It is the 3rd figure which illustrated the time change of the amplitude of the input signal in the reverberation sound extraction device concerning an embodiment, the amplitude of the direct sound signal extracted in the reverberation sound extraction device, and the amplitude of the reverberation sound signal. 実施の形態に係る残響音抽出装置における入力信号の振幅と、残響音抽出装置において抽出された直接音信号の振幅と、残響音信号の振幅との時間変化を例示した第４の図である。It is the 4th figure which illustrated time change of the amplitude of the input signal in the reverberation sound extraction device concerning an embodiment, the amplitude of the direct sound signal extracted in the reverberation sound extraction device, and the amplitude of the reverberation sound signal. 実施の形態に係る残響音抽出装置における入力信号の振幅と、残響音抽出装置において抽出された直接音信号の振幅と、残響音信号の振幅との時間変化を例示した第５の図である。It is the 5th figure which illustrated time change of the amplitude of the input signal in the reverberation sound extraction device concerning an embodiment, the amplitude of the direct sound signal extracted in the reverberation sound extraction device, and the amplitude of the reverberation sound signal. （ａ）は、図１４に示した直接音信号の波形が正規化カットオフ周波数の値の調整状態に応じて変化する様子と、入力信号とを模式的に示した図であり、（ｂ）は、図１４に示した残響音信号の波形が正規化カットオフ周波数の値の調整状態に応じて変化する様子と、入力信号とを模式的に示した図である。(A) is the figure which showed typically a mode that the waveform of the direct sound signal shown in FIG. 14 changed according to the adjustment state of the value of a normalization cut-off frequency, and an input signal, (b) FIG. 15 is a diagram schematically showing a state in which the waveform of the reverberation signal shown in FIG. 14 changes according to the adjustment state of the value of the normalized cutoff frequency, and the input signal.

以下、本発明に係る直接音抽出装置および残響音抽出装置の一例である音響処理装置を示し、添付図面を参照して詳細に説明を行う。 Hereinafter, an acoustic processing apparatus which is an example of a direct sound extraction apparatus and a reverberation sound extraction apparatus according to the present invention will be described and described in detail with reference to the accompanying drawings.

なお、音声や楽器等の直接音に残響音が畳み込まれると、周波数スペクトルにおいて、非定常な音声や楽器等の信号に対して、残響時間に応じた定常性のある信号が付加される。本実施の形態に係る音響処理装置は、入力信号から非定常信号を抽出・分離することにより直接音の抽出を行い、入力信号から定常信号を抽出・分離することにより残響音を抽出するものである。 Note that when a reverberant sound is convoluted with a direct sound such as a voice or a musical instrument, a signal having a steadiness according to the reverberation time is added to a signal such as an unsteady voice or a musical instrument in the frequency spectrum. The sound processing apparatus according to the present embodiment extracts direct sound by extracting and separating an unsteady signal from an input signal, and extracts reverberation sound by extracting and separating a steady signal from the input signal. is there.

図１は、音響処理装置の概略構成を示したブロック図である。音響処理装置１は、図１に示すように、ＦＦＴ部（フーリエ変換手段、スペクトル変換手段）３と、周波数スペクトル領域フィルタリング部４と、ＩＦＦＴ部（逆フーリエ変換手段、逆スペクトル変換手段）５ａ，５ｂとを有している。 FIG. 1 is a block diagram showing a schematic configuration of the sound processing apparatus. As shown in FIG. 1, the sound processing apparatus 1 includes an FFT unit (Fourier transform unit, spectrum transform unit) 3, a frequency spectrum domain filtering unit 4, an IFFT unit (inverse Fourier transform unit, inverse spectrum transform unit) 5a, 5b.

ＦＦＴ部３には、図示を省略した音源部から、直接音（例えばスピーチなどの音声）に残響音（例えば、スピーチにおける反射音）が畳み込まれた（含まれた）２チャンネル（ＬチャンネルとＲチャンネル）の入力信号Ｌ，Ｒとが入力される。ＦＦＴ部３は、残響音が畳み込まれた２チャンネルの入力信号Ｌ，Ｒのそれぞれに対して窓関数により重み付けを行う役割を有している。 The FFT unit 3 has two channels (L channel and L channel) in which a reverberant sound (for example, a reflected sound in speech) is convoluted with a direct sound (for example, speech such as speech) from a sound source unit (not shown). R channel) input signals L and R are input. The FFT unit 3 has a role of weighting each of the two-channel input signals L and R in which reverberation sound is convoluted with a window function.

そして、ＦＦＴ部３は、窓関数による重み付けを行った後に、それぞれの入力信号Ｌ，Ｒに対して、短時間フーリエ変換処理を施すことにより、それぞれの入力信号Ｌ，Ｒを時間領域から周波数領域に変換し、実数と虚数の周波数スペクトルを求める。図２は、ＦＦＴ部３において入力信号Ｌ（あるいは入力信号Ｒ）に対して短時間フーリエ変換処理を行う場合のフーリエ変換長とオーバーラップ長を模式的に示した図である。ここで、ＦＦＴ部３は、入力信号に対してフーリエ変換処理を施すため、本発明に係るフーリエ変換手段に該当することになる。 The FFT unit 3 weights the input signals L and R after performing weighting using a window function, thereby performing a short-time Fourier transform process on the input signals L and R, thereby changing the input signals L and R from the time domain to the frequency domain. To obtain the real and imaginary frequency spectra. FIG. 2 is a diagram schematically showing the Fourier transform length and the overlap length when the FFT unit 3 performs short-time Fourier transform processing on the input signal L (or input signal R). Here, since the FFT unit 3 performs a Fourier transform process on the input signal, it corresponds to a Fourier transform unit according to the present invention.

さらに、ＦＦＴ部３は、周波数領域変換により求められた２チャンネルの周波数スペクトルを、それぞれ振幅スペクトル信号Ｌｆａ，Ｒｆａ（第１振幅スペクトル信号）と位相スペクトル信号Ｌｆｐ，Ｒｆｐに変換する。そして、ＦＦＴ部３は、変換された２チャンネルの振幅スペクトル信号Ｌｆａ，Ｒｆａを、周波数スペクトル領域フィルタリング部４に出力する。また、ＦＦＴ部３は、２チャンネルの位相スペクトル信号Ｌｆｐ,Ｒｆｐを、ＩＦＦＴ部５ａとＩＦＦＴ部５ｂとにそれぞれ出力する。ここで、ＦＦＴ部３は、入力信号を、振幅スペクトル信号Ｌｆａ，Ｒｆａと位相スペクトル信号Ｌｆｐ，Ｒｆｐとに変換するため、本発明に係るスペクトル変換手段に該当することになる。 Further, the FFT unit 3 converts the frequency spectra of the two channels obtained by the frequency domain conversion into amplitude spectrum signals Lfa and Rfa (first amplitude spectrum signal) and phase spectrum signals Lfp and Rfp, respectively. The FFT unit 3 then outputs the converted two-channel amplitude spectrum signals Lfa and Rfa to the frequency spectrum domain filtering unit 4. The FFT unit 3 also outputs two-channel phase spectrum signals Lfp and Rfp to the IFFT unit 5a and the IFFT unit 5b, respectively. Here, since the FFT unit 3 converts the input signal into the amplitude spectrum signals Lfa and Rfa and the phase spectrum signals Lfp and Rfp, it corresponds to the spectrum conversion means according to the present invention.

図３は、周波数スペクトル領域フィルタリング部４の概略構成を示したブロック図である。周波数スペクトル領域フィルタリング部４は、スペクトル毎の簡易なフィルタリング処理により、非定常信号と定常信号の抽出を行う役割を有している。なお、周波数スペクトル領域フィルタリング部４における処理では、振幅スペクトル信号Ｌｆａ，Ｒｆａに対してのみフィルタリング処理を行い、位相スペクトル信号Ｌｆｐ，Ｒｆｐに関してはフィルタリング処理を行わない。 FIG. 3 is a block diagram showing a schematic configuration of the frequency spectrum domain filtering unit 4. The frequency spectrum domain filtering unit 4 has a role of extracting a non-stationary signal and a stationary signal by a simple filtering process for each spectrum. In the process in the frequency spectrum domain filtering unit 4, the filtering process is performed only on the amplitude spectrum signals Lfa and Rfa, and the filtering process is not performed on the phase spectrum signals Lfp and Rfp.

周波数スペクトル領域フィルタリング部４は、図３に示すように、ＬＰＦ部（ローパスフィルタ手段）１０と、ＨＰＦ部（ハイパスフィルタ手段）１１と、第１リミッタ部（第１リミッタ手段）１２と、第２リミッタ部（第２リミッタ手段）１３と、第３リミッタ部（第３リミッタ手段）１４と、第４リミッタ部（第４リミッタ手段）１５と、第１ゲイン部（第１ゲイン手段）１６と、第２ゲイン部（第２ゲイン手段）１７と、第１減算部（第１減算手段）１８と、第２減算部（第２減算手段）１９とを有している。図３は、振幅スペクトル信号Ｌｆａに対して処理を行う機能部（ＬＰＦ部１０、ＨＰＦ部１１、リミッタ部１２〜１５、ゲイン部１６,１７、減算部１８,１９）のみが示されており、振幅スペクトル信号Ｒｆａに対して処理を行う機能部の図示を省略している。しかしながら、振幅スペクトル信号Ｒｆａに対して処理を行う機能部においても、同様の機能部が設けられており、同様のフィルタリング処理が行われる。 As shown in FIG. 3, the frequency spectrum domain filtering unit 4 includes an LPF unit (low-pass filter unit) 10, an HPF unit (high-pass filter unit) 11, a first limiter unit (first limiter unit) 12, and a second A limiter unit (second limiter unit) 13, a third limiter unit (third limiter unit) 14, a fourth limiter unit (fourth limiter unit) 15, a first gain unit (first gain unit) 16, A second gain unit (second gain unit) 17, a first subtraction unit (first subtraction unit) 18, and a second subtraction unit (second subtraction unit) 19 are provided. FIG. 3 shows only functional units (LPF unit 10, HPF unit 11, limiter units 12 to 15, gain units 16 and 17, subtracting units 18 and 19) that process the amplitude spectrum signal Lfa. A functional unit that performs processing on the amplitude spectrum signal Rfa is not shown. However, a similar functional unit is provided in the functional unit that performs processing on the amplitude spectrum signal Rfa, and a similar filtering process is performed.

ＬＰＦ部１０は、ＦＦＴ部３より入力された振幅スペクトル信号Ｌｆａに対して、所定の正規化カットオフ周波数に基づいて、スペクトル毎（周波数毎）にローパスフィルタリング処理を行う役割を有している。第１リミッタ部１２は、ＬＰＦ部１０によりローパスフィルタリング処理された振幅スペクトル信号（第２振幅スペクトル信号）のマイナス側の振幅を制限して、０にする役割を有している。そして、第１ゲイン部１６は、マイナス側の振幅制限が行われた振幅スペクトル信号の振幅を増幅もしくは減衰する役割を有している。このようにして、ＬＰＦ部１０において振幅スペクトル信号Ｌｆａに対してローパスフィルタリング処理を行うことにより、スペクトル毎に積分された信号（積分信号：第２振幅スペクトル信号）Ｌｆａ１が生成されることになる。 The LPF unit 10 has a role of performing low-pass filtering processing for each spectrum (for each frequency) on the amplitude spectrum signal Lfa input from the FFT unit 3 based on a predetermined normalized cutoff frequency. The first limiter unit 12 has a role of limiting the amplitude on the negative side of the amplitude spectrum signal (second amplitude spectrum signal) subjected to low-pass filtering by the LPF unit 10 to zero. The first gain unit 16 has a role of amplifying or attenuating the amplitude of the amplitude spectrum signal on which the minus side amplitude restriction is performed. In this way, by performing the low-pass filtering process on the amplitude spectrum signal Lfa in the LPF unit 10, a signal (integrated signal: second amplitude spectrum signal) Lfa1 integrated for each spectrum is generated.

そして、第１減算部１８は、ＦＦＴ部３より入力された振幅スペクトル信号Ｌｆａから、積分信号Ｌｆａ１の減算を行い、時間変動のある非定常のスペクトル信号（第３振幅スペクトル信号）を求める。その後、第２リミッタ部１３は、第１減算部１８において求められたスペクトル信号（第３振幅スペクトル信号）のマイナス側の振幅を制限して０にする。第２リミッタ部１３において振幅の制限が行われた信号は、直接音信号ＬｆｄとしてＩＦＦＴ部５ａへ出力される。 Then, the first subtraction unit 18 subtracts the integration signal Lfa1 from the amplitude spectrum signal Lfa input from the FFT unit 3 to obtain a non-stationary spectrum signal (third amplitude spectrum signal) with time variation. Thereafter, the second limiter unit 13 limits the minus-side amplitude of the spectrum signal (third amplitude spectrum signal) obtained by the first subtraction unit 18 to zero. The signal whose amplitude is limited in the second limiter unit 13 is output to the IFFT unit 5a as a direct sound signal Lfd.

ＨＰＦ部１１は、ＦＦＴ部３より入力された振幅スペクトル信号Ｌｆａに対して、所定の正規化カットオフ周波数に基づいて、スペクトル毎（周波数毎）にハイパスフィルタリング処理を行う役割を有している。第３リミッタ部１４は、ＨＰＦ部１１によりハイパスフィルタリング処理された振幅スペクトル信号（第４振幅スペクトル信号）のマイナス側の振幅を制限して、０にする役割を有している。そして、第２ゲイン部１７は、マイナス側の振幅制限が行われた振幅スペクトル信号の振幅を増幅もしくは減衰する役割を有している。このようにして、ＨＰＦ部１１において振幅スペクトル信号Ｌｆａに対してハイパスフィルタリング処理を行うことにより、スペクトル毎に微分された信号（微分信号：第４振幅スペクトル信号）Ｌｆａ２が生成されることになる。 The HPF unit 11 has a role of performing high-pass filtering processing for each spectrum (for each frequency) on the amplitude spectrum signal Lfa input from the FFT unit 3 based on a predetermined normalized cutoff frequency. The third limiter unit 14 has a role of limiting the amplitude on the negative side of the amplitude spectrum signal (fourth amplitude spectrum signal) subjected to the high-pass filtering processing by the HPF unit 11 to zero. The second gain unit 17 has a role of amplifying or attenuating the amplitude of the amplitude spectrum signal on which the minus side amplitude restriction is performed. In this way, by performing high-pass filtering on the amplitude spectrum signal Lfa in the HPF unit 11, a signal differentiated for each spectrum (differentiated signal: fourth amplitude spectrum signal) Lfa2 is generated.

そして、第２減算部１９は、ＦＦＴ部３より入力された振幅スペクトル信号Ｌｆａから、微分信号Ｌｆａ２の減算を行い、時間変動の少ない定常のスペクトル信号（第５振幅スペクトル信号）を求める。その後、第４リミッタ部１５は、第２減算部１９において求められたスペクトル信号（第５振幅スペクトル信号）のマイナス側の振幅を制限して０にする。第４リミッタ部１５において振幅の制限が行われた信号は、残響音信号ＬｆｒとしてＩＦＦＴ部５ｂへ出力される。 Then, the second subtractor 19 subtracts the differential signal Lfa2 from the amplitude spectrum signal Lfa input from the FFT unit 3 to obtain a steady spectrum signal (fifth amplitude spectrum signal) with little time variation. Thereafter, the fourth limiter unit 15 limits the minus-side amplitude of the spectrum signal (fifth amplitude spectrum signal) obtained by the second subtraction unit 19 to zero. The signal whose amplitude is limited in the fourth limiter unit 15 is output to the IFFT unit 5b as a reverberation sound signal Lfr.

なお、ＬＰＦ部１０における振幅スペクトル毎のローパスフィルタの正規化カットオフ周波数および、ＨＰＦ部１１における振幅スペクトル毎のハイパスフィルタの正規化カットオフ周波数は、直接音と残響音の分割時間を調節するもの（直接音の抽出時間を調節し、また、残響音の抽出時間を調節するもの）である。また、第１ゲイン部１６および第２ゲイン部１７において、増幅および減衰の重み付け量を変化させることにより、直接音と残響音との混合比率を調節する（直接音に含まれる残響音の割合を調節し、また、残響音に含まれる直接音の割合を調節する）ことが可能となる。 The normalization cutoff frequency of the low pass filter for each amplitude spectrum in the LPF unit 10 and the normalization cutoff frequency of the high pass filter for each amplitude spectrum in the HPF unit 11 adjust the division time of the direct sound and the reverberation sound. (Adjusting the extraction time of the direct sound and adjusting the extraction time of the reverberation sound). Further, the first gain unit 16 and the second gain unit 17 adjust the mixing ratio of the direct sound and the reverberant sound by changing the weighting amount of the amplification and attenuation (the ratio of the reverberant sound included in the direct sound is changed). And the ratio of the direct sound included in the reverberant sound can be adjusted.

図４（ａ）は、本実施の形態に係るＬＰＦ部１０における振幅スペクトル毎のフィルタ係数の一例を示したものであり、図４（ｂ）は、ＨＰＦ部１１における振幅スペクトル毎のフィルタ係数の一例を示している。図４（ａ）（ｂ）に示すＬＰＦ部１０およびＨＰＦ部１１は、それぞれ１次のバタワースフィルタであり、図４に示すように、正規化カットオフ周波数を、０．０００００１、０．０００００２、０．０００００４、・・・・・、０．０６５５に変化させたものである。カットオフ周波数の値が低いほど、直接音の抽出時間と残響音の抽出時間とが長くなる。なお、本実施の形態に係る周波数スペクトル領域フィルタリング部４では、ＬＰＦ部１０およびＨＰＦ部１１のカットオフ周波数を、振幅スペクトルに共通に設定しているが、振幅スペクトル毎に独立に設定する構成とすることも可能である。 4A shows an example of the filter coefficient for each amplitude spectrum in the LPF unit 10 according to the present embodiment, and FIG. 4B shows the filter coefficient for each amplitude spectrum in the HPF unit 11. An example is shown. Each of the LPF unit 10 and the HPF unit 11 shown in FIGS. 4A and 4B is a first-order Butterworth filter. As shown in FIG. 4, the normalized cutoff frequencies are 0.000001, 0.000002, 0.000004,..., 0.0655. The lower the cutoff frequency value, the longer the direct sound extraction time and the reverberation sound extraction time. In the frequency spectrum domain filtering unit 4 according to the present embodiment, the cut-off frequencies of the LPF unit 10 and the HPF unit 11 are set in common for the amplitude spectrum, but are configured to be set independently for each amplitude spectrum. It is also possible to do.

図５（ａ）は、本実施の形態に係る第１ゲイン部１６の増幅および減衰の重み付け量の周波数変化の一例を示した図であり、図５（ｂ）は、第２ゲイン部１７の重み付け量の周波数変化の一例を示した図である。図５（ａ）（ｂ）に示すように、本実施の形態に係る第１ゲイン部１６および第２ゲイン部１７では、ゲイン（信号レベル）が小さいほど混合量は大きくなる。また、図５（ａ）（ｂ）に示すように、直接音側である第１ゲイン部１６は、５００Ｈｚ以下の振幅スペクトルにおいて、直接音と残響音の分離はほとんど行わない。 FIG. 5A is a diagram showing an example of a frequency change in the weighting amount of amplification and attenuation of the first gain unit 16 according to the present embodiment, and FIG. It is the figure which showed an example of the frequency change of weighting amount. As shown in FIGS. 5A and 5B, in the first gain unit 16 and the second gain unit 17 according to the present embodiment, the amount of mixing increases as the gain (signal level) decreases. Further, as shown in FIGS. 5A and 5B, the first gain unit 16 on the direct sound side hardly separates the direct sound and the reverberant sound in the amplitude spectrum of 500 Hz or less.

図６〜図９は、周波数スペクトル領域フィルタリング部４の各部の動作例を示したものであり、周波数スペクトル領域フィルタリング部４に入力される入力信号（振幅スペクトル信号Ｌｆａ）の振幅と、積分信号Ｌｆａ１の振幅と、微分信号Ｌｆａ２の振幅と、直接音信号Ｌｆｄの振幅と、残響音信号Ｌｆｒの振幅との時間変化を例示した図である。図６〜図９の波形は、いずれも１ｋＨｚ付近の振幅スペクトルの時間変化を観測したものである。 6 to 9 show an example of the operation of each part of the frequency spectrum domain filtering unit 4. The amplitude of the input signal (amplitude spectrum signal Lfa) input to the frequency spectrum domain filtering unit 4 and the integrated signal Lfa1 are shown. FIG. 6 is a diagram exemplifying time variation of the amplitude of the differential signal Lfa2, the amplitude of the direct sound signal Lfd, and the amplitude of the reverberation sound signal Lfr. Each of the waveforms in FIGS. 6 to 9 is obtained by observing the time change of the amplitude spectrum near 1 kHz.

なお、図６〜図９に示す動作例における入力信号のサンプリングレートは、４４．１ｋＨｚであり、ＦＦＴ部３のフーリエ変換長は４，０９６ｓａｍｐｌｅ（サンプル）、オーバーラップ長はフーリエ変換長の１５／１６倍の３，８４０ｓａｍｐｌｅ、フーリエ変換の窓関数はブラックマンである。また、図６〜図８に示す入力信号は、再生時間が１ｓｅｃ（秒）の１ｋＨｚの正弦波であり、図９に示す入力信号は音楽である。 6 to 9, the sampling rate of the input signal is 44.1 kHz, the Fourier transform length of the FFT unit 3 is 4,096 sample (sample), and the overlap length is 15 / of the Fourier transform length. The window function of 16 times 3,840 sample and Fourier transform is Blackman. 6 to 8 is a 1 kHz sine wave with a reproduction time of 1 sec (seconds), and the input signal shown in FIG. 9 is music.

また、図８、図９には、第１ゲイン部１６および第２ゲイン部１７において、図５（ａ）および図５（ｂ）に示したスペクトル毎（周波数毎）の重み付けを行った場合が示されており、図６、図７には、第１ゲイン部１６および第２ゲイン部１７において重み付けを行わずに、全ての振幅スペクトルに対するゲイン（信号レベル）を０ｄＢに設定した場合が示されている。 8 and 9, the first gain unit 16 and the second gain unit 17 perform weighting for each spectrum (for each frequency) shown in FIGS. 5 (a) and 5 (b). FIGS. 6 and 7 show the case where the gain (signal level) for all amplitude spectra is set to 0 dB without weighting in the first gain unit 16 and the second gain unit 17. ing.

まず、図６（ａ）に示した直接音側の信号では、ＬＰＦ部１０のローパスフィルタリング処理により、矩形状を呈する入力信号が積分処理される。このため、矩形状の入力信号の立ち上がり部分が抽出されて、振幅の立ち上がりが緩やかな積分信号Ｌｆａ１が生成される。その後、第１減算部１８において、入力信号から積分信号Ｌｆａ１の減算が行われて、入力信号の矩形形状から積分信号Ｌｆａ１の緩やかな立ち上がり部分の振幅が減算されるため、矩形状の信号の立ち上がり部分、すなわち非定常成分が、直接音信号Ｌｆｄとして抽出されることになる。 First, in the direct sound side signal shown in FIG. 6A, the rectangular input signal is integrated by the low pass filtering process of the LPF unit 10. For this reason, the rising portion of the rectangular input signal is extracted, and the integrated signal Lfa1 having a gentle rise in amplitude is generated. Thereafter, the first subtraction unit 18 subtracts the integration signal Lfa1 from the input signal, and the amplitude of the gentle rising portion of the integration signal Lfa1 is subtracted from the rectangular shape of the input signal. The portion, that is, the unsteady component is extracted as the direct sound signal Lfd.

なお、第１減算部１８における減算処理によって、直接音信号Ｌｆｄの振幅がマイナスになるが、第２リミッタ部１３で振幅を制限して０にしているため、図６（ａ）に示されるように、直接音信号Ｌｆｄはマイナスの値にはなっていない。 Although the amplitude of the direct sound signal Lfd becomes negative due to the subtraction processing in the first subtracting unit 18, the amplitude is limited to 0 by the second limiter unit 13, so that as shown in FIG. In addition, the direct sound signal Lfd is not a negative value.

つぎに、図６（ｂ）に示す残響音側の信号では、ＨＰＦ部１１のハイパスフィルタリング処理により、矩形状を呈する入力信号が微分処理される。このため、矩形状の入力信号の鋭い立ち上がり部分と、その後の緩やかな減衰部分からなる微分信号Ｌｆａ２が生成される。その後、第２減算部１９において、入力信号から微分信号Ｌｆａ２の減算が行われて、入力信号の矩形形状から微分信号Ｌｆａ２の鋭い立ち上がり部分等の振幅が減算されるため、矩形状の信号の立ち上がり以外の部分、すなわち定常成分が残響音信号Ｌｆｒとして抽出されることになる。 Next, in the reverberation sound side signal shown in FIG. 6B, the rectangular input signal is differentiated by the high-pass filtering process of the HPF unit 11. Therefore, a differential signal Lfa2 composed of a sharp rising portion of the rectangular input signal and a subsequent gentle attenuation portion is generated. Thereafter, the second subtraction unit 19 subtracts the differential signal Lfa2 from the input signal and subtracts the amplitude of the sharp rising portion of the differential signal Lfa2 from the rectangular shape of the input signal. The other part, that is, the stationary component is extracted as the reverberation sound signal Lfr.

なお、第２減算部１９における減算処理においても、残響音信号Ｌｆｒの振幅がマイナスになるが、第４リミッタ部１５で振幅を制限して０にしているため、図６（ｂ）に示されるように、残響音信号Ｌｆｒはマイナスの値にはなっていない。 In the subtraction processing in the second subtraction unit 19, the amplitude of the reverberation sound signal Lfr is negative, but the amplitude is limited to 0 by the fourth limiter unit 15, so that it is shown in FIG. Thus, the reverberation signal Lfr is not a negative value.

図７は、図６に示した状態において、ＨＰＦ部１１およびＬＰＦ部１０の正規化カットオフ周波数を変更した図である。具体的に、図７（ｂ）に示したＨＰＦ部１１の正規化カットオフ周波数は、図６に示したＨＰＦ部１１の正規化カットオフ周波数０．００８２に対して低い値である０．００４１に設定されている。また、図７（ａ）に示したＬＰＦ部１０の正規化カットオフ周波数は、図６に示したＬＰＦ部の正規化カットオフ周波数０．００８２に対して高い値である０．０１６４に設定されている。 FIG. 7 is a diagram in which the normalized cutoff frequencies of the HPF unit 11 and the LPF unit 10 are changed in the state shown in FIG. Specifically, the normalized cutoff frequency of the HPF unit 11 shown in FIG. 7B is 0.0041, which is lower than the normalized cutoff frequency 0.0082 of the HPF unit 11 shown in FIG. Is set to Also, the normalized cutoff frequency of the LPF unit 10 shown in FIG. 7A is set to 0.0164, which is a higher value than the normalized cutoff frequency 0.0082 of the LPF unit shown in FIG. ing.

図６、図７に示すように、正規化カットオフ周波数を低くすると、フィルタの応答が遅くなり、信号の立ち上がりの応答が長くなる。また、正規化カットオフ周波数を高くすると、フィルタの応答が速くなり、信号の立ち上がりの応答が短くなる。このようにして、カットオフ周波数を調整することにより、直接音と残響音の分割時間を調節（直接音の抽出時間の調節、また、残響音の抽出時間の調節）を行うことか可能となる。 As shown in FIGS. 6 and 7, when the normalized cut-off frequency is lowered, the response of the filter becomes slow and the response of the rising edge of the signal becomes long. Further, when the normalized cutoff frequency is increased, the response of the filter becomes faster and the response of the rising edge of the signal becomes shorter. In this way, by adjusting the cut-off frequency, it is possible to adjust the division time of the direct sound and the reverberation sound (adjustment of the extraction time of the direct sound and the extraction time of the reverberation sound). .

図８は、図６に示した状態において、第１ゲイン部１６および第２ゲイン部１７におけるスペクトル毎の重み付け量を設定した場合を示した図である。重み付け量の設定により、直接音や残響音では、重み付け量に応じたオフセット（振幅のかさ上げ）が生じる。このため、図８（ａ）に示す直接音信号には、オフセットに伴う残響音が付加（図８（ａ）に示す高さＬ１の振幅のかさ上げ）され、図８（ｂ）に示す残響音信号には、オフセットに伴う直接音が付加（図８（ｂ）に示す高さＬ１の振幅のかさ上げ）される。このように、重み付け量の設定に応じて生ずるオフセットにより、直接音と残響音との混合比率を調節する（直接音に含まれる残響音の割合を調節し、また、残響音に含まれる直接音の割合を調節する）ことが可能となる。 FIG. 8 is a diagram showing a case where the weighting amount for each spectrum in the first gain unit 16 and the second gain unit 17 is set in the state shown in FIG. By setting the weighting amount, an offset (amplification of the amplitude) corresponding to the weighting amount occurs in the direct sound and the reverberant sound. For this reason, a reverberant sound accompanying an offset is added to the direct sound signal shown in FIG. 8 (a) (the amplitude of the height L1 shown in FIG. 8 (a) is increased), and the reverberation shown in FIG. 8 (b). A direct sound accompanying an offset is added to the sound signal (the amplitude of the height L1 shown in FIG. 8B is increased). In this way, the mixing ratio of the direct sound and the reverberation sound is adjusted by the offset generated according to the setting of the weighting amount (the ratio of the reverberation sound included in the direct sound is adjusted, and the direct sound included in the reverberation sound is adjusted). Can be adjusted).

図９は、図８に示した状態において、入力信号を音楽信号として時間と共に減衰する１ｋＨｚ付近の成分を抽出して示した図である。直接音側の信号においては、図９（ａ）に示すように、振幅の大きい前半部分において直接音の信号が抽出されており、残響音側の信号においては、図９（ｂ）に示すように、入力信号の振幅が減衰している後半部分において残響音の信号が抽出されていることがわかる。 FIG. 9 is a diagram in which components in the vicinity of 1 kHz that attenuate with time are extracted from the input signal as a music signal in the state shown in FIG. In the direct sound side signal, as shown in FIG. 9A, the direct sound signal is extracted in the first half portion having a large amplitude, and in the reverberant sound side signal, as shown in FIG. 9B. Further, it can be seen that a reverberant signal is extracted in the latter half of the case where the amplitude of the input signal is attenuated.

ＩＦＦＴ部５ａは、周波数スペクトル領域フィルタリング部４においてフィルタリング処理された直接音からなる振幅スペクトル信号（直接音信号Ｌｆｄ，Ｒｆｄ）と、ＦＦＴ部３より取得した位相スペクトル信号Ｌｆｐ，Ｒｆｐとに基づいて実数と虚数の周波数スペクトルに変換した後、窓関数により重み付けを行う処理を行う。そして、ＩＦＦＴ部５ａは、重み付け処理された信号に対して短時間逆フーリエ変換処理とオーバーラップ加算とを行うことにより、周波数領域から時間領域に信号を変換して、直接音からなる直接音信号Ｌｄ、Ｒｄを生成する。 The IFFT unit 5a is a real number based on the amplitude spectrum signal (direct sound signal Lfd, Rfd) made of the direct sound filtered by the frequency spectrum domain filtering unit 4 and the phase spectrum signal Lfp, Rfp acquired from the FFT unit 3. Are converted into an imaginary frequency spectrum, and then weighted by a window function. Then, the IFFT unit 5a performs a short-time inverse Fourier transform process and an overlap addition on the weighted signal, thereby converting the signal from the frequency domain to the time domain to generate a direct sound signal composed of a direct sound. Ld and Rd are generated.

また、ＩＦＦＴ部５ｂも同様に、周波数スペクトル領域フィルタリング部４においてフィルタリング処理された残響音からなる振幅スペクトル信号（残響音信号Ｌｆｒ，Ｒｆｒ）と、ＦＦＴ部３より取得した位相スペクトル信号Ｌｆｐ，Ｒｆｐとに基づいて実数と虚数の周波数スペクトルに変換した後、窓関数により重み付けを行う処理を行う。そして、ＩＦＦＴ部５ｂは、重み付け処理された信号に対して短時間逆フーリエ変換処理とオーバーラップ加算とを行うことにより、周波数領域から時間領域に信号を変換して、残響音からなる残響音信号Ｌｒ、Ｒｒを生成する。 Similarly, the IFFT unit 5b also uses amplitude spectrum signals (reverberation sound signals Lfr, Rfr) that are filtered by the frequency spectrum domain filtering unit 4 and phase spectrum signals Lfp, Rfp acquired from the FFT unit 3. Is converted to a real and imaginary frequency spectrum, and then weighted by a window function. Then, the IFFT unit 5b performs a short-time inverse Fourier transform process and an overlap addition on the weighted signal, thereby converting the signal from the frequency domain to the time domain to generate a reverberant sound signal composed of a reverberant sound. Lr and Rr are generated.

なお、ＩＦＦＴ部５ａ,５ｂは、振幅スペクトル信号と位相スペクトル信号とに基づいて実数と虚数の周波数スペクトルに変換する処理を行うため、本発明に係る逆スペクトル変換手段に該当する。さらに、ＩＦＦＴ部５ａ,５ｂは、重み付け処理された信号に対して短時間逆フーリエ変換処理を行うため、本発明に係る逆フーリエ変換手段に該当する。 The IFFT units 5a and 5b correspond to the inverse spectrum conversion means according to the present invention because they perform processing for conversion into a real number and an imaginary number frequency spectrum based on the amplitude spectrum signal and the phase spectrum signal. Furthermore, since the IFFT units 5a and 5b perform short-time inverse Fourier transform processing on the weighted signal, they correspond to the inverse Fourier transform means according to the present invention.

図１０〜図１４は、音響処理装置１に対する入力信号の振幅と、音響処理装置１において抽出（生成）された直接音信号と残響音信号との振幅の時間変化を例示した図である。図１０と図１１は、入力信号として再生時間が１ｓｅｃの１ｋＨｚの正弦波が入力される場合を示し、図１２と図１３とは、入力信号として音楽が入力される場合を示し、図１４は、ホール（残響音の発生しやすい環境）におけるインパルス応答が入力信号として入力される場合を示している。 10 to 14 are diagrams exemplifying the time change of the amplitude of the input signal to the sound processing apparatus 1 and the amplitude of the direct sound signal and the reverberation sound signal extracted (generated) by the sound processing apparatus 1. 10 and 11 show a case where a 1 kHz sine wave having a playback time of 1 sec is inputted as an input signal, and FIGS. 12 and 13 show a case where music is inputted as an input signal, and FIG. The impulse response in the hall (environment where reverberation is likely to occur) is input as an input signal.

また、図１０〜図１４において、ＨＰＦ部１１およびＬＰＦ部１０における正規化カットオフ周波数は、全て０．００８２であり、図１０、図１２、図１４は、スペクトル毎の重み付け処理がなしの場合、図１１と図１３とは、スペクトル毎（周波数毎）の重み付け処理がある場合を示している。 10 to 14, the normalized cut-off frequencies in the HPF unit 11 and the LPF unit 10 are all 0.0082, and FIGS. 10, 12, and 14 show cases in which weighting processing for each spectrum is not performed. 11 and 13 show a case where there is a weighting process for each spectrum (for each frequency).

また、図１０〜図１４におけるＩＦＦＴ部５の逆フーリエ変換長は４，０９６ｓａｍｐｌｅ、オーバーラップ長はフーリエ変換長の１５／１６倍である３，８４０ｓａｍｐｌｅ、逆フーリエ変換の窓関数はブラックマンとなっており、ＦＦＴ部３においても同様となっている。 10 to 14, the inverse Fourier transform length of the IFFT unit 5 is 4,096 sample, the overlap length is 3,840 sample which is 15/16 times the Fourier transform length, and the window function of the inverse Fourier transform is Blackman. The same applies to the FFT unit 3.

図１０および図１１においては、矩形をなす入力信号の振幅の時間変化に対して、非定常成分である直接音信号と、定常成分である残響音信号とがそれぞれ抽出された状態が示されている。また、図１０に示す直接音信号および残響音信号に対して、図１１に示す直接音信号および残響音信号は、スペクトル毎の重み付け処理により振幅の値がオフセットされている。このため、オフセットされた部分（図１１において、直接音信号および残響音信号の振幅がＬ２の高さだけかさ上げされた部分）において、直接音と残響音とが混合した部分が含まれることになり、第１ゲイン部１６および第２ゲイン部１７による重み付け処理に応じて、直接音と残響音との混合比率を調節することが可能となる。 FIGS. 10 and 11 show a state in which a direct sound signal that is an unsteady component and a reverberation sound signal that is a stationary component are extracted with respect to a temporal change in the amplitude of a rectangular input signal. Yes. Further, with respect to the direct sound signal and the reverberation sound signal shown in FIG. 10, the amplitude values of the direct sound signal and the reverberation sound signal shown in FIG. 11 are offset by the weighting process for each spectrum. For this reason, in the offset portion (in FIG. 11, the portion where the amplitudes of the direct sound signal and the reverberation sound signal are raised by the height of L2), a portion where the direct sound and the reverberation sound are mixed is included. Thus, the mixing ratio of the direct sound and the reverberant sound can be adjusted according to the weighting process by the first gain unit 16 and the second gain unit 17.

図１２および図１３においては、音楽（入力信号）の波形に対して、直接音と残響音とをそれぞれ抽出した波形を確認することができる。また、分離された直接音と残響音とをそれぞれ別々に聴取した場合にも、音楽の直接音と残響音とをそれぞれ確認することができ、直接音と残響音との抽出（あるいは分離）を聴覚的に認識することが可能となる。 In FIG. 12 and FIG. 13, it is possible to confirm a waveform obtained by extracting a direct sound and a reverberation sound from the music (input signal) waveform. In addition, even when the separated direct sound and reverberation sound are listened to separately, the direct sound and reverberation sound of music can be confirmed respectively, and the extraction (or separation) of the direct sound and reverberation sound can be performed. Auditory recognition is possible.

また、図１３では、スペクトル毎の重み付けの設定を行っているため、直接音において残響音が一部付加され、また残響音においては直接音が一部付加された波形（図１３における直接音信号および残響音信号の振幅の高さが、図１２に比べて高くなっている）が確認でき、直接音と残響音の混合比率をスペクトル毎の重み付け設定により調節することが可能であることが確認できる。また、図１３に示した直接音と残響音とをそれぞれ聴取した場合においても、混合比率に応じて直接音と残響音とが混合された出力音を確認することができる。 In FIG. 13, since the weighting is set for each spectrum, a waveform in which a part of the reverberation sound is added to the direct sound and a part of the direct sound is added to the reverberation sound (the direct sound signal in FIG. 13). And the amplitude of the reverberant signal is higher than that of FIG. 12), and it is confirmed that the mixing ratio of the direct sound and the reverberant sound can be adjusted by the weight setting for each spectrum. it can. Further, even when the direct sound and the reverberation sound shown in FIG. 13 are respectively listened to, the output sound in which the direct sound and the reverberation sound are mixed can be confirmed according to the mixing ratio.

図１４においては、ホールにおけるインパルス応答が入力信号として入力されている。インパルス応答であるため、非常に短い信号を入力したときの出力あって、短い時間で振幅が収束する特性を有するが、ホールという残響音の発生しやすい環境でのインパルス応答であるため、直接音に加えて多分に残響音が含まれることになる。 In FIG. 14, the impulse response in the hall is input as an input signal. Since it is an impulse response, it has an output when an extremely short signal is input and the amplitude converges in a short time. In addition to this, a reverberant sound is probably included.

図１４においては、入力信号の振幅の収束に比較して、より短い時間で振幅の収束が行われた直接音と、直接音の振幅の収束よりも長い時間だけ振幅が維持されている残響音とを確認することができる。図１４においては、ＨＰＦ部１１とＬＰＦ部１０の正規化カットオフ周波数が０．００８２に設定されているが、この正規化カットオフ周波数の値を調整することによって、直接音の抽出時間と残響音の抽出時間とを調節することが可能である。 In FIG. 14, compared to the convergence of the amplitude of the input signal, the direct sound in which the amplitude is converged in a shorter time and the reverberant sound in which the amplitude is maintained for a longer time than the convergence of the amplitude of the direct sound. And can be confirmed. In FIG. 14, the normalized cutoff frequency of the HPF unit 11 and the LPF unit 10 is set to 0.0082. By adjusting the value of the normalized cutoff frequency, the direct sound extraction time and reverberation are set. It is possible to adjust the sound extraction time.

図１５（ａ）は、図１４に示した直接音の波形が正規化カットオフ周波数の値の調整状態に応じて変化する様子と、入力信号とを模式的に示した図である。図１５（ａ）に示すように、正規化カットオフ周波数の値が大きくなると、インパルス応答の振幅の収束までの時間が短くなり、正規化カットオフ周波数の値が小さくなると、インパルス応答の振幅の収束までの時間が長くなって、入力信号の振幅の収束状態に近い波形形状を示すことになる。 FIG. 15A is a diagram schematically showing a state in which the waveform of the direct sound shown in FIG. 14 changes according to the adjustment state of the value of the normalized cutoff frequency and the input signal. As shown in FIG. 15A, when the value of the normalized cutoff frequency increases, the time until the amplitude of the impulse response converges decreases, and when the value of the normalized cutoff frequency decreases, the amplitude of the impulse response decreases. The time until convergence becomes longer, and a waveform shape close to the convergence state of the amplitude of the input signal is shown.

このように、正規化カットオフ周波数の値を調整することにより、入力信号における直接音の抽出時間を変更することが可能となる。従って、正規化カットオフ周波数の値を小さくすると、入力信号における直接音の抽出時間が長くなり、非定常音だけでなく定常音を含んだ形で直接音の抽出を行うことが可能となる。例えば、図１４に示す程度に、定常音を含んだ直接音の抽出を行うことにより、全く定常音を含まない直接音に比べて、音色や聞き易さなどを直接音に付加することが可能となり、聴取者が直接音を聴取した場合において違和感のない音として認識することが可能となる。 In this way, by adjusting the value of the normalized cutoff frequency, it is possible to change the direct sound extraction time in the input signal. Therefore, if the value of the normalized cut-off frequency is reduced, the time for extracting the direct sound from the input signal becomes longer, and it becomes possible to extract the direct sound in a form including not only the unsteady sound but also the stationary sound. For example, by extracting a direct sound including a stationary sound as shown in FIG. 14, it is possible to add a timbre, ease of hearing, etc. to the direct sound as compared to a direct sound that does not include a stationary sound at all. Thus, when the listener listens directly to the sound, it can be recognized as a sound without any sense of incongruity.

一方で、図１５（ｂ）は、図１４に示した残響音の波形が正規化カットオフ周波数の値の調整状態に応じて変化する様子と、入力信号とを模式的に示した図である。図１５（ｂ）に示すように、正規化カットオフ周波数の値が大きくなると、残響音の振幅の増大が早い時間から生じるようになり、残響音の振幅の増大が早い段階から大きく立ち上がる傾向がある。一方で、正規化カットオフ周波数の値が小さくなると、残響音の振幅の増大（立ち上がり部分）が緩やかになる。 On the other hand, FIG. 15 (b) is a diagram schematically showing how the reverberant waveform shown in FIG. 14 changes according to the adjustment state of the value of the normalized cutoff frequency and the input signal. . As shown in FIG. 15 (b), when the value of the normalized cut-off frequency increases, the amplitude of the reverberant sound increases from an early time, and the increase in the amplitude of the reverberant sound tends to rise greatly from an early stage. is there. On the other hand, when the value of the normalized cut-off frequency becomes small, the increase (rise part) of the amplitude of the reverberant sound becomes moderate.

このため、正規化カットオフ周波数の値を調節することにより、入力信号における直接音の抽出時間を変更することが可能となり、正規化カットオフ周波数の値を小さくすると、残響音信号に含まれる直接音の影響を少なくすることが可能となり、正規化カットオフ周波数の値を大きくすると、直接音が少し含まれた残響音信号の抽出が可能となる。 For this reason, it is possible to change the extraction time of the direct sound in the input signal by adjusting the value of the normalized cutoff frequency. When the value of the normalized cutoff frequency is decreased, the direct sound included in the reverberant sound signal is changed. It becomes possible to reduce the influence of sound, and if the value of the normalized cut-off frequency is increased, it is possible to extract a reverberant sound signal containing a little direct sound.

以上、本発明に係る直接音抽出装置および残響音抽出装置について、図面を用いて詳細に説明を行ったが、本発明に係る直接音抽出装置および残響音抽出装置は、上述した実施の形態に示される構成には限定されない。当業者であれば、特許請求の範囲に記載された範疇内において、各種の変更例または修正例に想到し得ることは明らかであり、それらについても当然に本発明の技術的範囲に属するものと了解される。 Although the direct sound extraction device and the reverberation sound extraction device according to the present invention have been described in detail with reference to the drawings, the direct sound extraction device and the reverberation sound extraction device according to the present invention are described in the above-described embodiments. It is not limited to the configuration shown. It will be apparent to those skilled in the art that various changes and modifications can be made within the scope of the claims, and these are naturally within the technical scope of the present invention. Understood.

また、本発明に係る直接音抽出装置および残響音抽出装置を利用することにより、様々な音響環境を構築することも可能である。例えば、直接音に残響音が含まれた入力信号から、直接音抽出装置により直接音信号を抽出し、聴取者に近い位置に設置されるスピーカから出力することにより、入力信号をそのままスピーカより出力する場合に比べて、ボーカル音などを鮮明にして聞き取り易くすることが可能となる。また、残響音抽出装置により入力信号から残響音信号を抽出し、聴取者から遠い位置に設置されるスピーカから出力することにより、残響音を効果的に出力させることが可能となる。 Various acoustic environments can also be constructed by using the direct sound extraction device and the reverberation sound extraction device according to the present invention. For example, a direct sound extraction device extracts a direct sound signal from an input signal that includes a reverberant sound in the direct sound, and outputs it from a speaker installed at a position close to the listener so that the input signal is directly output from the speaker. Compared to the case, the vocal sound and the like can be made clearer and easier to hear. Moreover, a reverberant sound can be effectively output by extracting a reverberant sound signal from an input signal by the reverberant sound extracting device and outputting it from a speaker installed at a position far from the listener.

１ …音響処理装置（直接音抽出装置および残響音抽出装置）
３ …ＦＦＴ部（フーリエ変換手段、スペクトル変換手段）
４ …周波数スペクトル領域フィルタリング部
５ａ、５ｂ …ＩＦＦＴ部（逆フーリエ変換手段、逆スペクトル変換手段）
１０ …ＬＰＦ部（ローパスフィルタ手段）
１１ …ＨＰＦ部（ハイパスフィルタ手段）
１２ …第１リミッタ部（第１リミッタ手段）
１３ …第２リミッタ部（第２リミッタ手段）
１４ …第３リミッタ部（第３リミッタ手段）
１５ …第４リミッタ部（第４リミッタ手段）
１６ …第１ゲイン部（第１ゲイン手段）
１７ …第２ゲイン部（第２ゲイン手段）
１８ …第１減算部（第１減算手段）
１９ …第２減算部（第２減算手段）
Ｌ、Ｒ …入力信号
Ｌｆａ、Ｒｆａ …振幅スペクトル信号
Ｌｆｐ、Ｒｆｐ …位相スペクトル信号
Ｌｆａ１ …積分信号
Ｌｆａ２ …微分信号
Ｌｆｄ、Ｌｄ、Ｒｆｄ、Ｒｄ …直接音信号
Ｌｆｒ、Ｌｒ、Ｒｆｒ、Ｒｒ …残響音信号 1 ... Sound processing device (direct sound extraction device and reverberation sound extraction device)
3 ... FFT section (Fourier transform means, spectrum transform means)
4 ... frequency spectrum domain filtering units 5a, 5b ... IFFT units (inverse Fourier transform means, inverse spectrum transform means)
10 ... LPF section (low-pass filter means)
11 ... HPF section (high-pass filter means)
12 ... 1st limiter part (1st limiter means)
13 ... 2nd limiter part (2nd limiter means)
14 ... Third limiter section (third limiter means)
15 ... Fourth limiter section (fourth limiter means)
16 ... 1st gain part (1st gain means)
17 ... 2nd gain part (2nd gain means)
18 ... 1st subtraction part (1st subtraction means)
19 ... 2nd subtraction part (2nd subtraction means)
L, R ... input signals Lfa, Rfa ... amplitude spectrum signals Lfp, Rfp ... phase spectrum signal Lfa1 ... integration signal Lfa2 ... differential signals Lfd, Ld, Rfd, Rd ... direct sound signals Lfr, Lr, Rfr, Rr ... reverberation sound signals

Claims

Fourier transform means for performing a Fourier transform process on an input signal in which reverberant sound is included in the direct sound;
Spectrum conversion means for converting the input signal into a first amplitude spectrum signal and a phase spectrum signal based on the real and imaginary frequency spectra of the input signal Fourier-transformed by the Fourier transform means;
Low-pass filter means for performing a low-pass filtering process for each frequency using a preset normalization cutoff frequency for the first amplitude spectrum signal;
First limiter means for limiting the negative amplitude of the second amplitude spectrum signal low-pass filtered by the low-pass filter means to zero,
First subtraction means for obtaining a third amplitude spectrum signal by subtracting from the first amplitude spectrum signal the second amplitude spectrum signal in which the negative-side amplitude is limited by the first limiter;
Second limiter means for limiting the minus-side amplitude of the third amplitude spectrum signal obtained by the first subtraction means to zero, and
Inverse spectrum conversion means for obtaining a signal composed of a real number and an imaginary number frequency spectrum based on the phase spectrum signal and the third amplitude spectrum signal on which the amplitude of the negative side is limited by the second limiter means;
Direct sound extraction comprising: an inverse Fourier transform means for generating a direct sound signal obtained by extracting the direct sound from the input signal by subjecting the signal obtained by the inverse spectrum transform means to an inverse Fourier transform process. apparatus.

First gain means for weighting the third amplitude spectrum signal by amplifying or attenuating the amplitude of the third amplitude spectrum signal whose amplitude on the negative side is limited by the second limiter means for each frequency. With
The inverse spectrum converting means obtains a signal composed of a real number and an imaginary number frequency spectrum based on the phase spectrum signal and the third amplitude spectrum signal weighted by the first gain means. The direct sound extraction device according to claim 1.

Fourier transform means for performing a Fourier transform process on an input signal in which reverberant sound is included in the direct sound;
Spectrum conversion means for converting the input signal into a first amplitude spectrum signal and a phase spectrum signal based on the real and imaginary frequency spectra of the input signal Fourier-transformed by the Fourier transform means;
High-pass filter means for performing high-pass filtering processing for each frequency using a normalization cutoff frequency set in advance for the first amplitude spectrum signal;
Third limiter means for limiting the negative amplitude of the fourth amplitude spectrum signal subjected to high-pass filtering by the high-pass filter means to zero,
Second subtracting means for obtaining a fifth amplitude spectrum signal by subtracting the fourth amplitude spectrum signal, the amplitude of which has been negatively limited by the third limiter means, from the first amplitude spectrum signal;
Fourth limiter means for limiting the minus-side amplitude of the fifth amplitude spectrum signal obtained by the second subtracting means to zero, and
Inverse spectrum conversion means for obtaining a signal composed of a real number and an imaginary number frequency spectrum based on the phase spectrum signal and the fifth amplitude spectrum signal in which the negative amplitude is limited by the fourth limiter means;
Reverberation sound extraction, comprising: inverse Fourier transform means for generating a reverberation sound signal obtained by extracting the reverberation sound from the input signal by performing an inverse Fourier transform process on the signal obtained by the inverse spectrum conversion means apparatus.

Second gain means for weighting the fifth amplitude spectrum signal by amplifying or attenuating the amplitude of the fifth amplitude spectrum signal, the amplitude of which is negatively limited by the fourth limiter means, for each frequency. With
The inverse spectrum conversion unit obtains a signal composed of a real number and an imaginary number frequency spectrum based on the phase spectrum signal and the fifth amplitude spectrum signal weighted by the second gain unit. The reverberation sound extraction device according to claim 3.