JPH0650438B2

JPH0650438B2 - Speech coder

Info

Publication number: JPH0650438B2
Application number: JP61103641A
Authority: JP
Inventors: 尚己松村; 実赤松
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1986-05-06
Filing date: 1986-05-06
Publication date: 1994-06-29
Anticipated expiration: 2009-06-29
Also published as: JPS62260200A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、音声の帯域圧縮、音声蓄積等に使用される音
声符号化器に関する。The present invention relates to a speech coder used for speech band compression, speech accumulation, and the like.

〔Overview〕

本発明は、予測残差信号を複数のパルス例で符号化する
マルチパルス駆動形の音声符号化器において、現在分析中のフレームへの直前フレームからの影響信号
を生成し、重み付け信号からこの影響信号を差し引い
て、インパルス応答関数との相互相関関数を求めること
により、直前フレームからの影響信号による品質低下をなくした
ものである。The present invention, in a multi-pulse drive type speech coder that encodes a prediction residual signal with a plurality of pulse examples, generates an influence signal from a frame immediately before a frame currently being analyzed, and uses this influence signal from a weighted signal. By subtracting the signal and obtaining the cross-correlation function with the impulse response function, the quality deterioration due to the influence signal from the immediately preceding frame is eliminated.

[Conventional technology]

現在音声の帯域圧縮技術および音声蓄積技術は近年のデ
ータネットワークの発達、多様化にともない、回線コス
トの低減およびネットワークの効率化を目的として、低
ビットレート化の要求が高い。With the recent development and diversification of data networks, voice band compression technology and voice storage technology are required to have a low bit rate for the purpose of reducing line cost and network efficiency.

音声の16bit/s 付近の符号化法としては、従来よりＡＤ
−ＰＣＭ、ＡＰＣ、ＡＴＣ等が提案されているが（例え
ば、落合、荒関、「音声符号化技術」、情報処理学会
誌、Vol.24、No.8 PP.976 −893 、1983）、最近予測残
差信号を複数のパルス列で符号化するマルチパルス駆動
形音声符号化方式が発表され、その品質／ビットレート
比から有望視されている（例えば、荒関、「音源の生成
モデルを用いた最近の音声符号化技術」、電子通信学
会、情報理論研究会資料、ＩＴ84-31 、1984、小野、荒
関、「マルチパルス駆動形音声符号化法の検討」、電子
通信学会、通信方式研究会資料、CS82-161、1982）。Conventionally, AD has been used as a coding method in the vicinity of 16 bit / s of voice.
-PCM, APC, ATC, etc. have been proposed (for example, Ochiai, Araseki, "Speech Coding Technology", Journal of Information Processing Society of Japan, Vol.24, No.8 PP.976-893, 1983), but recently predicted A multi-pulse drive type speech encoding method for encoding a residual signal with a plurality of pulse trains has been announced, and it is promising because of its quality / bit rate ratio (for example, Araseki, “Recently using a source generation model Speech Encoding Technology ", IEICE, Information Theory Workshop Material, IT84-31, 1984, Ono, Araseki," Study on Multi-pulse Driven Speech Encoding Method ", IEICE, Communication Method Workshop Material, CS82 -161, 1982).

この方式は８〜16kbit/sの音声符号化に適し、上記の音
声帯域圧縮および音声蓄積の分野での要求に一致する。This method is suitable for voice coding of 8 to 16 kbit / s, and meets the above-mentioned requirements in the field of voice band compression and voice storage.

[Problems to be solved by the invention]

しかし、従来のマルチパルス駆動形符号化法で、実際に
符号化器を構成する場合に問題となるのは、隣接する音
声分析フレームの境界の影響である。本来音声信号は連
続するものであるから、ある音声分析フレームに注目し
た場合、その音声分析フレームには前の音声分析フレー
ムとの相関性が高い。マルチパルス駆動形音声符号化法
の場合、前の音声分析フレームの最後にパルスが存在し
た場合、そのパルスのインパルス応答のほとんどは現在
の音声分析フレームに存在する。従って、現在の音声分
析フレームだけに注目してパルス抽出を行うと、その中
には前の音声分析フレームのパルスの成分も入っている
ことになり、重複したパルスが異音となり、再生音質を
劣化させてしまう欠点がある。However, in the conventional multi-pulse drive type encoding method, a problem when actually constructing the encoder is the influence of the boundary between the adjacent speech analysis frames. Since a voice signal is originally continuous, when a certain voice analysis frame is focused, the voice analysis frame has a high correlation with the previous voice analysis frame. In the case of multi-pulse driven speech coding, if a pulse was present at the end of the previous speech analysis frame, most of the impulse response of that pulse is in the current speech analysis frame. Therefore, if pulse extraction is performed by focusing only on the current voice analysis frame, it means that the pulse component of the previous voice analysis frame is also included in it, and the duplicated pulse becomes an abnormal sound, and the reproduced sound quality is improved. There is a drawback that it deteriorates.

本発明の目的は、上記の欠点を除去することにより、よ
り高品質のマルチパルス駆動形の音声符号化器を提供す
ることにある。It is an object of the present invention to provide a higher quality multi-pulse driven speech coder by eliminating the above drawbacks.

[Means for solving problems]

本発明は、入力音声信号の分析フレーム内のスペクトル
係数およびインパルス応答関数を求めるスペクトラム分
析器と、上記入力音声信号に上記スペクトル係数により
重み付けを行い重み付け信号を生成する重み付けろ波器
と、上記インパルス応答関数と上記重み付け信号との相
互相関関数を求める相互相関器と、上記インパルス応答
関数の自己相関関数を求める自己相関器と、上記相互相
関関数と上記自己相関関数とからパルス列を抽出し出力
するパルス抽出回路とを含む音声符号化器において、上記パルス列および上記スペクトル係数から現在分析中
の分析フレームの直前の分析フレームからの影響信号を
生成する重み付け信号補正回路と、上記重み付け信号か
ら上記影響信号を差し引いた補正重み付け信号を生成し
上記相互相関器へ入力する減算器とを含むことを特徴と
する。The present invention provides a spectrum analyzer for obtaining a spectral coefficient and an impulse response function in an analysis frame of an input speech signal, a weighting filter for weighting the input speech signal with the spectral coefficient to generate a weighted signal, and the impulse. A cross-correlator that obtains a cross-correlation function between the response function and the weighted signal, an auto-correlator that obtains an auto-correlation function of the impulse response function, and a pulse train is extracted and output from the cross-correlation function and the auto-correlation function. In a speech coder including a pulse extraction circuit, a weighting signal correction circuit for generating an influence signal from an analysis frame immediately before an analysis frame currently being analyzed from the pulse train and the spectrum coefficient, and the influence signal from the weighted signal. Generate a corrected weighting signal from which And a subtractor for inputting the input power.

[Action]

本発明は、重み付け補正回路によりパルス抽出回路から
出力される現在分析中のフレームの直前フレームのパル
ス列のうちで、現在分析中のフレームに影響を与えるパ
ルスから影響信号を生成し、減算器により、重み付けろ
波器から出力される重み付け信号からこの影響信号を差
し引いた補正重み付け信号を生成し、相互相関器に入力
し、インパルス応答関数との相互相関関数を求める。The present invention, in the pulse train of the frame immediately before the frame currently being analyzed, which is output from the pulse extraction circuit by the weighting correction circuit, generates an influence signal from the pulse that affects the frame currently being analyzed, and by the subtracter, A corrected weighting signal is generated by subtracting this influence signal from the weighting signal output from the weighting filter, is input to the cross-correlator, and the cross-correlation function with the impulse response function is obtained.

従って、直前フレームの影響信号による異音の発生はな
くなり、品質を向上させることができる。Therefore, abnormal noise is not generated due to the influence signal of the immediately preceding frame, and the quality can be improved.

〔Example〕

以下、本発明の実施例について図面を参照して説明す
る。Hereinafter, embodiments of the present invention will be described with reference to the drawings.

第１図は本発明の一実施例の構成を示すブロック図であ
る。本実施例は、入力音声信号の分析フレーム内のスペ
クトル係数およびインパルス応答関数を求めるスペクト
ラム分析器２と、上記入力音声信号に上記スペクトル係
数により重み付けを行い重み付け信号を生成する重み付
けろ波器３と、上記インパルス応答関数と上記重み付け
信号との相互相関関数を求める相互相関器７と、上記イ
ンパルス応答関数の自己相関関数を求める自己相関器６
と、上記相互相関関数と上記自己相関関数とからパルス
列を抽出し出力するパルス抽出回路８と、上記パルス列
および上記スペクトル係数から現在分析中の分析フレー
ムの直前の分析フレームからの影響信号を生成する重み
付け信号補正回路４と、上記重み付け信号から上記影響
信号を差し引いた補正重み付け信号を生成し相互相関器
７へ入力する減算器５とを含んでいる。なお、同図にお
いて、１は入力端子、９はパルス出力端子、10はスペク
トル係数出力端子である。FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention. In this embodiment, a spectrum analyzer 2 for obtaining a spectral coefficient and an impulse response function in an analysis frame of an input speech signal, and a weighting filter 3 for weighting the input speech signal with the spectral coefficient to generate a weighted signal. , A cross-correlator 7 for obtaining a cross-correlation function of the impulse response function and the weighted signal, and an auto-correlator 6 for obtaining an auto-correlation function of the impulse response function.
And a pulse extraction circuit 8 that extracts and outputs a pulse train from the cross-correlation function and the autocorrelation function, and generates an influence signal from the analysis frame immediately before the analysis frame currently being analyzed from the pulse train and the spectral coefficient. It includes a weighting signal correction circuit 4 and a subtractor 5 for generating a correction weighting signal by subtracting the influence signal from the weighting signal and inputting it to the cross-correlator 7. In the figure, 1 is an input terminal, 9 is a pulse output terminal, and 10 is a spectrum coefficient output terminal.

本発明の特徴は、第１図において、重み付け信号補正器
４および減算器５を設けたことにある。The feature of the present invention resides in that the weighting signal corrector 4 and the subtractor 5 are provided in FIG.

次に、本実施例の動作について、第２図に示す各部の波
形図を参照して説明する。いま、入力端子１に第２図
(a)に示す音声波形を有する入力音声信号11が入力され
たとする。そして同図に示すように、分析長をＮサンプ
ル（例えば、20ms）、パルス抽出フレーム長をＭサンプ
ル（例えば、３ms）、零時刻を同図に示す位置として、
現在の分析フレームをｌ時刻、直前の分析フレームを
（ｌ−１）時刻、直後の分析フレームを（ｌ＋１）フレ
ーム時刻とする。入力音声信号11はスペクトラム分析器
２重み付けろ波器３へ入力される。スペクトルラム分析
器２では、分析フレーム内のスペクトル係数12と、第２
図(b)に示すインパルス応答関数13が求められる。そし
てスペトル係数12は重み付けろ波器３、重み付け信号補
正器４およびスペクトル係数出力端子10へ出力され、イ
ンパルス応答関数13は自己相関器６および相互相関器７
へ出力される。ここで、第２図(b)に示すインパルス応
答関数13は、ｌ時刻のフレームから分析されたスペクト
ル係数（例えば、ＰＡＲＣＯＲ係数）によるＭサンプル
長のインパルス応答関数である。重み付けろ波器３で
は、入力された入力音声信号11に対して、スペクトル係
数12により、（Ｎ＋Ｍ）サンプルの重み付けを行い第２
図(c)に示す波形の重み付け信号14を出力する。重み付
け信号補正器４では、（ｌ−１）時刻の分析によるパル
スの影響を除去するために、（ｌ−１）時刻で分析され
た第２図(d)に示すパルス抽出器８から帰還された直前
の分析フレームのパルス列19a と、スペクトル分析器２
から入力されたｌ時刻のスペクトル係数12とから、第２
図(e)に示す影響信号15をＭサンプル算出し、減算器５
へ出力する。減算器５では重み付け信号14から影響信号
15を差し引いた、第２図(f)に示す補正重み付け信号16
を生成し相互相関器７へ出力する。相互相関器７では、
第２図(g)に示すように、この補正重み付け信号16とイ
ンパルス応答関数13との相互相関関数17を（Ｎ＋Ｍ）サ
ンプル求めパルス抽出回路８へ出力する。自己相関器で
は、第２図(h)に示すように、インパルス応答関数13の
自己相関関数18をＭサンプル求めパルス抽出回路８へ出
力する。パルス抽出回路８では、相互相関関数17の最大
値に等しい大きさに自己相関関数18を拡大、縮小、反転
させて、相互相関関数17から差し引き、その位置に等倍
のパルスを立てるという操作を繰り返すことにより、第
２図(i)に示すパルス列19を生成し、パルス出力端子９
へ出力する。Next, the operation of this embodiment will be described with reference to the waveform chart of each part shown in FIG. Now, input terminal 1 is shown in FIG.
It is assumed that the input voice signal 11 having the voice waveform shown in (a) is input. As shown in the figure, the analysis length is N samples (for example, 20 ms), the pulse extraction frame length is M samples (for example, 3 ms), and the zero time is the position shown in the figure.
The current analysis frame is at time l, the analysis frame immediately before is at time (l-1), and the analysis frame immediately after is at time (l + 1) frame. The input audio signal 11 is input to the spectrum analyzer 2 and the weighting filter 3. In the spectrum analyzer 2, the spectrum coefficient 12 in the analysis frame and the second
The impulse response function 13 shown in FIG. 6 (b) is obtained. The spectrum coefficient 12 is output to the weighting filter 3, the weighting signal corrector 4, and the spectrum coefficient output terminal 10, and the impulse response function 13 is used for the autocorrelator 6 and the crosscorrelator 7.
Is output to. Here, the impulse response function 13 shown in FIG. 2 (b) is an impulse response function of M sample length based on the spectral coefficient (for example, PARCOR coefficient) analyzed from the frame at time l. The weighting filter 3 weights (N + M) samples of the input input audio signal 11 with the spectral coefficient 12 and outputs the second weighted signal.
The weighting signal 14 having the waveform shown in FIG. In the weighting signal corrector 4, in order to remove the influence of the pulse due to the analysis of the (l-1) time, the signal is fed back from the pulse extractor 8 shown in FIG. 2 (d) analyzed at the (l-1) time. The pulse train 19a of the analysis frame immediately before and the spectrum analyzer 2
From the spectrum coefficient 12 at time l input from
M samples of the influence signal 15 shown in FIG.
Output to. In the subtractor 5, from the weighting signal 14 to the influence signal
Corrected weighting signal 16 shown in FIG. 2 (f) with 15 subtracted
Is generated and output to the cross-correlator 7. In the cross correlator 7,
As shown in FIG. 2 (g), the cross-correlation function 17 of the correction weighting signal 16 and the impulse response function 13 is output to the (N + M) sample determination pulse extraction circuit 8. The autocorrelator outputs the autocorrelation function 18 of the impulse response function 13 to the M sample determination pulse extraction circuit 8 as shown in FIG. In the pulse extraction circuit 8, the operation of enlarging, reducing, and inverting the autocorrelation function 18 to a size equal to the maximum value of the crosscorrelation function 17, subtracting from the crosscorrelation function 17, and setting a pulse of equal size at that position is performed. By repeating the above, the pulse train 19 shown in FIG.
Output to.

〔The invention's effect〕

以上説明したように本発明によれば、スペクトラム分析
器と重み付けろ波器と相互相関器と自己相関器とパルス
抽出器から構成されるマルチパルス駆動形の音声符号化
器に、重み付け信号補正器と減算器とを付加することに
より、音声分析フレームの境界に生じる異音を重み付け
信号の補正により除去し、より高い品質のマルチパルス
駆動形の音声符号化器を提供することができる。As described above, according to the present invention, a multi-pulse drive type speech coder including a spectrum analyzer, a weighting filter, a cross-correlator, an auto-correlator and a pulse extractor, and a weighted signal corrector By adding a subtraction unit and a subtractor, it is possible to provide a higher quality multi-pulse drive type speech encoder by removing abnormal noise generated at the boundary of the speech analysis frame by correcting the weighting signal.

[Brief description of drawings]

第１図は本発明の一実施例の構成を示すブロック図。第２図はその各部における信号の動作波形図。１……入力端子、２……スペクトラム分析器、３……重
み付けろ波器、４……重み付け信号補正器、５……減算
器、６……自己相関器、７……相互相関器、８……パル
ス抽出器、９……パルス出力端子、10……スペクトル係
数出力端子、11……入力音声信号、12……スペクトル係
数、13……インパルス応答関数、14……重み付け信号、
15……影響信号、16……補正重み付け信号、17……相互
相関関数、18……自己相関関数、19……伝送されるパル
ス列、19a ……直前の分析フレームのパルス列。FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention. FIG. 2 is an operation waveform diagram of signals in the respective parts. 1 ... Input terminal, 2 ... Spectrum analyzer, 3 ... Weighting filter, 4 ... Weighting signal corrector, 5 ... Subtractor, 6 ... Autocorrelator, 7 ... Cross-correlator, 8 ...... Pulse extractor, 9 …… Pulse output terminal, 10 …… Spectrum coefficient output terminal, 11 …… Input voice signal, 12 …… Spectrum coefficient, 13 …… Impact response function, 14 …… Weighting signal,
15 …… Influence signal, 16 …… Correction weighting signal, 17 …… Cross-correlation function, 18 …… Autocorrelation function, 19 …… Transmitted pulse train, 19a …… Pulse train of the previous analysis frame.

Claims

[Claims]

1. A spectrum analyzer (2) for obtaining a spectral coefficient and an impulse response function in an analysis frame of an input speech signal, and a weighting filter for weighting the input speech signal with the spectral coefficient to generate a weighted signal. (3) and a cross-correlator (7) for obtaining a cross-correlation function between the impulse response function and the weighted signal
And a speech encoder including an autocorrelator (6) for obtaining an autocorrelation function of the impulse response function, and a pulse extraction circuit (8) for extracting and outputting a pulse train from the crosscorrelation function and the autocorrelation function. In the above, a weighting signal correction circuit (4) for generating an influence signal from an analysis frame immediately before an analysis frame currently being analyzed from the pulse train and the spectral coefficient, and a correction weighting signal obtained by subtracting the influence signal from the weighted signal And a subtracter (5) for generating and inputting to the cross-correlator.