JPH0632032B2

JPH0632032B2 - Speech band signal coding method and apparatus

Info

Publication number: JPH0632032B2
Application number: JP59042307A
Authority: JP
Inventors: 一範小澤
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1984-03-06
Filing date: 1984-03-06
Publication date: 1994-04-27
Anticipated expiration: 2009-04-27
Also published as: JPS60186899A

Description

【発明の詳細な説明】＜産業上の利用分野＞本発明は音声帯域信号（音声信号，データモデム信号
等）の低ビットレイト波形符号化方式、特に伝送情報量
を16kビット／秒以下とするような符号化方式と装置に
関する。DETAILED DESCRIPTION OF THE INVENTION <Industrial Field of Application> The present invention sets a low bit rate waveform coding method for a voice band signal (voice signal, data modem signal, etc.), and particularly sets a transmission information amount to 16 kbit / sec or less. Such an encoding method and device.

＜従来技術とその問題点＞音声信号を16kビット／秒程度以下の伝送情報量で符号
化するための方式として、最近マルチパルス駆動形音声
符号化方式が提案されている。これは、駆動音源信号系
列を表わす複数個のパルス系列（マルチパルス）を、短
時間毎に、符号器側でＡ−ｂ−Ｓ（ANALYSIS-BY-SYNTHE
SIS）の手法を用いて遂次的に求め、このパルス系列を
符号化伝送する方式である。本発明はこの方式に関係す
るものである。この方式の詳細については、ビー・エス
・アタール（B.S.ATAL）氏らによるアイ・シー・エー・
エス・ピー（I.C.A.S.S.P.）の予稿集、1982年614〜617
頁に掲載の「ア．ニュー．モデル．オブ．エル．ピー．
シー．エクサイティション．フォー．プロデューシン
グ．ナチュラル．サウンディング．スピーチ．アット．
ロウ．ビット．レイツ」（“A NEW MODEL OF LPC EXCIT
ATION FOR PRODUCING NATURAL-SOUNDING SPEECH AT LOW
BIT RATES”）と題した論文（文献１）に説明されてい
るので、ここでは簡単に説明を行なうにとどめる。<Prior Art and Problems Thereof> A multi-pulse drive type audio encoding system has recently been proposed as a system for encoding an audio signal with a transmission information amount of about 16 kbit / sec or less. This is a system in which a plurality of pulse sequences (multi-pulses) representing a driving sound source signal sequence are transmitted at a short time by the encoder side at A-B-S (ANALYSIS-BY-SYNTHE).
This is a method of coding and transmitting this pulse sequence sequentially using the method of SIS). The present invention relates to this system. For more details on this method, see ICA by BSATAL et al.
Proceedings of ICSP, 1982 614-617
"A. New Model of L.P.
C. Exhibition. Four. Producing. natural. Sounding. speech. at.
Row. bit. Rates "(" A NEW MODEL OF LPC EXCIT
ATION FOR PRODUCING NATURAL-SOUNDING SPEECH AT LOW
Since it is explained in the paper (reference 1) entitled "BIT RATES"), only a brief explanation will be given here.

第１図は、前記文献１、に記載された従来方式における
符号器側の処理を示すブロック図である。図において、
100は符号器入力端子を示し、Ａ／Ｄ変換された音声信
号系列x(n)が入力される。110はバッファメモリ回路で
あり、音声信号系列を１フレーム（例えば8KHZサンプリ
ングの場合でフレーム長を10ｍsecとすると８０サンプ
ル）分、蓄積する。バッファメモリ回路１１０の出力値
は減算器１２０と、Ｋパラメータ計算回路１８０とに出
力される。但し、文献１、によれＫパラメータのかわり
にレフレクション・エコフィシエンツ（REFLECTION COE
FFICIENTS）と記載されているが、これなＫパラメータ
と同一のパラメータである。Ｋパラメータ計算回路１８
０は、バッファメモリ回路１１０の出力値を用い、共分
散法に従って、フレーム毎の音声信号スペクトルを表わ
すＫパラメータK_iを１６次分（１≦ｉ≦１６）求め、こ
れらを合成フィルタ回路１３０へ出力する。１４０は、
音源パルス発生回路であり、１フレーム内にあらかじめ
定められた個数のパルス系列を発生させる。ここでは、
このパルス系列をd(n)と記する。音源パルス発生回路１
４０によって発生された音源パルス系列の一例を第２図
に示す。第２図で横軸は離散的な時刻を、縦軸は振幅を
それぞれに示す。ここでは、１フレーム内に８個のパル
スを発生させる場合について示してある。音源パルス発
生回路１４０によって発生されたパルス系列d(n)は、合
成フィルタ１３０を駆動する。合成フィルタ１３０は、
d(n)を入力し、音声信号x(n)に対応する再生信号を求め、これを減算器１２０へ出力する。ここで、合成
フィルタ１３０は、ＫパラメータK_iを入力し、これら予
測パラメータai（１≦ｉ≦１６）へ変換し、aiを用いて
再生信号x(n)を計算する。FIG. 1 is a block diagram showing the processing on the encoder side in the conventional method described in Document 1 above. In the figure,
Reference numeral 100 denotes an encoder input terminal to which the A / D converted audio signal sequence x (n) is input. Reference numeral 110 denotes a buffer memory circuit, which stores the audio signal sequence for one frame (for example, 80 samples when the frame length is 10 msec in the case of 8 KHZ sampling). The output value of the buffer memory circuit 110 is output to the subtractor 120 and the K parameter calculation circuit 180. However, according to Ref. 1, instead of the K parameter, the Reflection COE
FFICIENTS) is the same parameter as the K parameter. K parameter calculation circuit 18
0 uses the output value of the buffer memory circuit 110, calculates the K parameter K _i representing the audio signal spectrum for each frame for the 16th order (1 ≦ i ≦ 16) according to the covariance method, and outputs them to the synthesis filter circuit 130. Output. 140 is
A sound source pulse generation circuit that generates a predetermined number of pulse sequences in one frame. here,
This pulse sequence will be referred to as d (n). Source pulse generation circuit 1
An example of the sound source pulse sequence generated by the 40 is shown in FIG. In FIG. 2, the horizontal axis represents discrete time and the vertical axis represents amplitude. Here, the case where eight pulses are generated in one frame is shown. The pulse sequence d (n) generated by the sound source pulse generation circuit 140 drives the synthesis filter 130. The synthesis filter 130 is
Input d (n), and play signal corresponding to audio signal x (n) And outputs it to the subtractor 120. Here, the synthesis filter 130 inputs the K parameter K _i , converts the K parameter K _i into these prediction parameters ai (1 ≦ i ≦ 16), and calculates the reproduction signal x (n) using ai.

は、d(n)とaiを用いて下式のように表わすことができ
る。 Can be expressed as follows using d (n) and ai.

上式でＰは合成フィルタの次数を示し、ここではＰ＝１
６としている。減算器１２０は、原信号と再生信号x(n)と差e(n)を計算し、重み付け回路１９０
へ出力する。１９０は、e(n)を入力し、重み付け関数w
(n)を用い、次式に従って重み付け誤差e_w(n)を計算す
る。 In the above equation, P indicates the order of the synthesis filter, where P = 1.
6 is set. The subtractor 120 outputs the original signal And the reproduction signal x (n) and the difference e (n) are calculated, and the weighting circuit 190
Output to. 190 inputs e (n) and weighting function w
Using (n), the weighting error e _w (n) is calculated according to the following equation.

e_w(n)＝w(n)＊e(n) −(2) 上式で、記号“＊”はたたみこみ積分を表やす。また、
重み付け関数w(n)は、周波数軸上で重み付けを行なうも
のであり、そのＺ変換値をW(Z)とすると、合成フィルタ
の予測パラメータa_iを用いて、次式により表わされる。e _w (n) = w (n) * e (n)-(2) In the above equation, the symbol "*" represents convolution integral. Also,
The weighting function w (n) is used to perform weighting on the frequency axis, and when its Z-transformed value is W (Z), it is expressed by the following equation using the prediction parameter a _i of the synthesis filter.

上式でｒは０≦ｒ≦１の定数であり、W(Z)の周波数特性
を決定する。つまり、ｒ＝１とすると、W(Z)＝１とな
り、W(Z)の周波数特性は平担となる。一方、ｒ＝０とす
ると、W(Z)は合成フィルタの周波数特性の逆特性とな
る。従って、ｒの値によってW(Z)の特性を変えることが
できる。また、(3)式に示したようにW(Z)の特性を合成
フィルタの周波数特性に依存させて決めているのは、聴
感的なマスク効果を利用しているためである。つまり、
入力音声信号のスペクトルのパワが大きな箇所では（例
えばフォルマント周波数の近傍）、再生信号のスペクト
ルとの誤差が少々大きくても、その誤差は耳につきにく
いという聴感的な性質による。第３図に、あるフレーム
における入力音声信号のスペクトルと、W(Z)の周波数特
性の一例とを示した。ここではｒ＝0.8とした。図にお
いて、横軸は周波数（最大４ＫHz）を、縦軸は対数振幅
（最大６０dB）をそれぞれ示す。また、上部の曲線は音
声信号のスペクトルを、下部の曲線は重み付け関数の周
波数特性を表わしている。 In the above equation, r is a constant of 0 ≦ r ≦ 1, and determines the frequency characteristic of W (Z). That is, when r = 1, W (Z) = 1 and the frequency characteristic of W (Z) is flat. On the other hand, when r = 0, W (Z) has the inverse characteristic of the frequency characteristic of the synthesis filter. Therefore, the characteristic of W (Z) can be changed by the value of r. Also, the reason why the W (Z) characteristic is determined depending on the frequency characteristic of the synthesizing filter as shown in the equation (3) is that the perceptual masking effect is used. That is,
At a place where the power of the spectrum of the input audio signal is large (for example, near the formant frequency), even if the error with the spectrum of the reproduction signal is a little large, the error is hard to hear, which is due to the auditory property. FIG. 3 shows the spectrum of the input audio signal in a certain frame and an example of the frequency characteristic of W (Z). Here, r = 0.8. In the figure, the horizontal axis represents frequency (up to 4 KHz) and the vertical axis represents logarithmic amplitude (up to 60 dB). The upper curve represents the spectrum of the audio signal, and the lower curve represents the frequency characteristic of the weighting function.

第１図へ戻って、重み付け誤差e_w(n)は、誤差最小化回
路１５０へフィードバックされる。誤差最小化回路１５
０は、e_w(n)の値を１フレーム分記憶し、これらを用い
て次式に従い、重み付けられた誤差電力εを計算する。Returning to FIG. 1, the weighting error e _w (n) is fed back to the error minimization circuit 150. Error minimization circuit 15
For 0, the value of e _w (n) for one frame is stored, and using these, the weighted error power ε is calculated according to the following equation.

ここでＮは誤差電力を計算するサンプルを示す。文献
１、の方式では、この時間長を５ｍsecとしており、こ
れは８ＫHzサンプリングの場合にはＮ＝40に相当する。
次に、誤差最小化回路１５０は、前記(4)式で計算した
誤差電力εを小さくするように音源パルスの振幅及び位
置を求め、この振幅情報と位置情報とを音源パルス発生
回路１４０に出力する。音源パルス発生回路１４０はこ
の情報に基づいて音源パルス系列を発生させる。 Here, N represents a sample for calculating the error power. In the method of Reference 1, this time length is set to 5 msec, which corresponds to N = 40 in the case of 8 KHz sampling.
Next, the error minimization circuit 150 obtains the amplitude and position of the sound source pulse so as to reduce the error power ε calculated by the equation (4), and outputs this amplitude information and position information to the sound source pulse generation circuit 140. To do. The sound source pulse generation circuit 140 generates a sound source pulse sequence based on this information.

合成フィルタ回路１３０は、この音源パルス系列を駆動
源として再生信号を求める。減算器１２０では、原信号と先に計算した再
生信号との誤差e(n)から上記のようにして求まった再生
信号を減算して、これを新たな誤差e(n)とする。重み付け回
路１９０はe(n)を入力して重み付け誤差e_w(n)を計算
し、これを誤差最小化回路１５０へフィードバックす
る。誤差最小化回路１５０は、再び誤差電力を計算し、
この誤差電力を小さくするように音源パルス系列の振幅
と位置とを調整する。こうして音源パルス系列の発生か
ら誤差最小化による音源パルス系列の調整までの一連の
処理は、音源パルス系列フレーム内のパルス数があらか
じめ定められた数に達するまでくり返され、音源パルス
系列が決定される。The synthesis filter circuit 130 uses this sound source pulse sequence as a driving source Ask for. In the subtractor 120, the reproduction signal obtained as described above from the error e (n) between the original signal and the reproduction signal previously calculated Is subtracted to obtain a new error e (n). The weighting circuit 190 inputs e (n), calculates a weighting error e _w (n), and feeds it back to the error minimizing circuit 150. The error minimization circuit 150 calculates the error power again,
The amplitude and position of the sound source pulse sequence are adjusted so as to reduce this error power. In this way, a series of processes from generation of the sound source pulse sequence to adjustment of the sound source pulse sequence by error minimization is repeated until the number of pulses in the sound source pulse sequence frame reaches a predetermined number, and the sound source pulse sequence is determined. It

以上で従来方式の説明を終了する。This is the end of the description of the conventional method.

この方式の場合に、伝送すべき情報は、合成フィルタの
ＫパラメータK_i（１≦ｉ≦１６）と、音源パルス系列の
パルス位置及び振幅であり、１フレーム内になるパルス
の数によって任意の伝送レイトを実現できる。さらに、
伝送レイトを１６Ｋbps〜１０Ｋbpsとする領域に対して
は、良好な再生音質が得られ有効な方式の一つと考えら
れる。In the case of this method, the information to be transmitted is the K parameter K _i (1 ≦ i ≦ 16) of the synthesizing filter, the pulse position and the amplitude of the sound source pulse sequence, and is arbitrary depending on the number of pulses in one frame. A transmission rate can be realized. further,
It is considered to be one of the effective methods in which a good reproduction sound quality is obtained for a region where the transmission rate is 16 Kbps to 10 Kbps.

しかしながら、この従来方式は、演算量が非常に多いと
いう欠点がある。これは音源パルス系列におけるパルス
の位置と振幅を計算する際に、そのパルスに基づいて再
生した信号と原信号との誤差及び誤差電力を計算し、そ
れらをフィードバックさせて誤差電力を小さくするよう
にパルス位置と振幅とを調整していることに起因してい
る。更には、これらパルスの発生から誤差電力をフィー
ドバックさせてパルス振幅と位置とを調整するまでの処
理を、パルスの数があらかじめ定められた値に達するま
でくり返すことに起因している。However, this conventional method has a drawback that the amount of calculation is very large. This is to calculate the error and error power between the reproduced signal and the original signal based on the pulse when calculating the position and amplitude of the pulse in the sound source pulse sequence, and feed them back to reduce the error power. This is because the pulse position and amplitude are adjusted. Furthermore, it is caused by repeating the processes from the generation of these pulses to the adjustment of the pulse amplitude and the position by feeding back the error power until the number of pulses reaches a predetermined value.

また、16kビット／秒以下の伝送ビットレイトの場合、
音声信号の無声部分では従来方式によれば音源パルス数
が十分に多くはできないので、このような箇所では良好
な特性が得られなかった。In the case of a transmission bit rate of 16 kbit / sec or less,
In the unvoiced part of the voice signal, the number of sound source pulses cannot be increased sufficiently according to the conventional method, so that good characteristics cannot be obtained in such a part.

最近の動向として、16kビット／秒程度の伝送ビットレ
イトで2400ビット／秒程度の音声帯域データモデム信号
を良好に伝送したいという要請が非常に強い。音声帯域
データモデム信号に対しては、従来方式によれば、パル
ス数が十分に多くはないので良好な特性を得ることが困
難であった。As a recent trend, there is a strong demand for good transmission of a voice band data modem signal of about 2400 bits / sec at a transmission bit rate of about 16 kbits / sec. According to the conventional method, it is difficult to obtain good characteristics for a voice band data modem signal because the number of pulses is not sufficiently large.

＜発明の目的＞本発明の目的は、16kビット／秒、あるいは16kビット／
秒以下の伝送ビットレイトで音声信号に対しては勿論の
こと、2400ビット／秒程度の音声帯域データモデム信号
に対しても比較的少ない演算量で良好な特性が得られる
音声帯域信号符号化方式とその装置を提供することにあ
る。<Object of the Invention> The object of the present invention is 16 kbit / sec, or 16 kbit / sec.
Voice band signal coding method that can obtain good characteristics with a relatively small amount of calculation not only for voice signals with a transmission bit rate of less than a second but also for voice band data modem signals of about 2400 bits / sec. And to provide the device.

＜発明の構成＞本発明によれば、送信側では、離散的な音声帯域信号系
列を入力し短時間スペクトル包絡を表すスペクトルパラ
メータ系列を抽出し、前記音声帯域信号系列と前記スペ
クトルパラメータ系列をもとに前記音声帯域信号系列を
良好に表し得るパルス系列を探索し、前記スペクトルパ
ラメータ系列抽出結果または前記パルス系列探索結果を
もとに送出パルス系列の個数をきめる判別符号を作り、
前記判別符号に従い前記送出パルス系列と前記スペクト
ルパラメータ系列とを符号化し前記判別符号と組み合わ
せて出力し、受信側では、前記組み合わされた符号か
ら、前記判別符号を分離し、前記判別符号に従って前記
スペクトルパラメータ系列を表す符号と前記送出パルス
系列を表す符号とを分離し復号し、前記復号されたスペ
クトルパラメータ系列と前記復号されたパルス系列とを
用い前記音声帯域信号系列を再生するようにしたことを
特徴とする音声帯域信号化方法が得られる。<Configuration of Invention> According to the present invention, on the transmission side, a discrete voice band signal sequence is input, a spectrum parameter sequence representing a short-time spectrum envelope is extracted, and the voice band signal sequence and the spectrum parameter sequence are also extracted. To search for a pulse sequence that can satisfactorily represent the voice band signal sequence, and to create a discrimination code that determines the number of transmitted pulse sequences based on the spectrum parameter sequence extraction result or the pulse sequence search result,
The transmission pulse sequence and the spectrum parameter sequence are encoded according to the discrimination code and output in combination with the discrimination code, and on the receiving side, the discrimination code is separated from the combined code, and the spectrum is determined according to the discrimination code. A code representing a parameter sequence and a code representing the transmission pulse sequence are separated and decoded, and the voice band signal sequence is reproduced using the decoded spectrum parameter sequence and the decoded pulse sequence. A featured voice band signaling method is obtained.

また、本発明によれば、離散的な音声帯域信号系列を入
力し、前記音声帯域信号系列から短時間スペクトル包絡
を表すスペクトルパラメータ系列を抽出するパラメータ
計算回路と、前記音声帯域信号系列と前記スペクトルパ
ラメータ系列をもとに前記音声帯域信号系列を良好に表
し得るパルス系列を探索するパルス系列探索回路と、前
記スペクトルパラメータ系列抽出結果または前記パルス
系列探索結果をもとに送出パルス系列の個数を決める判
別符号を作る判別回路と、前記判別符号に従って前記送
出パルス系列と前記スペクトルパラメータ系列を符号化
し前記判別符号と組み合わせて出力する手段とを有する
ことを特徴とする音声帯域信号系列符号化装置が得られ
る。Further, according to the present invention, a parameter calculation circuit that inputs a discrete voice band signal sequence and extracts a spectrum parameter sequence representing a short-time spectrum envelope from the voice band signal sequence, the voice band signal sequence, and the spectrum A pulse sequence search circuit that searches for a pulse sequence that can satisfactorily represent the voice band signal sequence based on a parameter sequence, and determines the number of transmission pulse sequences based on the spectrum parameter sequence extraction result or the pulse sequence search result. A voice band signal sequence encoding device comprising: a discriminating circuit for producing a discriminating code; and means for encoding the transmission pulse sequence and the spectrum parameter sequence in accordance with the discriminating code and outputting in combination with the discriminating code. To be

さらに本発明によれば、送信側から離散的な音声帯域信
号系列より短時間スペクトル包絡を表すスペクトルパラ
メータ系列を抽出し、前記音声帯域信号系列と前記スペ
クトルパラメータ系列をもとに前記音声帯域信号系列を
良好に表し得るパルス系列を探索し、前記スペクトルパ
ラメータ系列抽出結果または前記パルス系列探索結果を
もとに送出パルス系列の個数をきめる判別符号を作り、
前記判別符号に従い前記送出パルス系列と前記スペクト
ルパラメータ系列とを符号化し前記判別符号と組み合わ
せて出力された符号が入力され、前記組み合わされた符
号系列から前記判別符号を分離しさらに前記判別符号に
従ってスペクトルパラメータ系列を表す符号とパルス系
列を表す符号とを分離し復号する手段と、前記復号され
たパルス系列を用いて駆動パルス系列を発生するパルス
系列発生回路と、前記復号されたスペクトルパラメータ
系列と前記駆動パルス系列とを用いて音声帯域信号系列
を再生し出力する合成フィルタ回路とを有することを特
徴とする音声帯域信号復号化装置が得られる。Further, according to the present invention, a spectrum parameter sequence representing a short-time spectrum envelope is extracted from a discrete voice band signal sequence from the transmitting side, and the voice band signal sequence is based on the voice band signal sequence and the spectrum parameter sequence. Is searched for a pulse sequence that can be expressed well, and a discrimination code that determines the number of transmitted pulse sequences based on the spectral parameter sequence extraction result or the pulse sequence search result is created,
A code output by combining the transmission pulse sequence and the spectrum parameter sequence according to the discrimination code and combining with the discrimination code is input, and the discrimination code is separated from the combined code sequence, and further the spectrum is determined according to the discrimination code. Means for separating and decoding the code representing the parameter sequence and the code representing the pulse sequence, a pulse sequence generation circuit for generating a drive pulse sequence using the decoded pulse sequence, the decoded spectrum parameter sequence and the A voice band signal decoding device is provided which has a synthesis filter circuit for reproducing and outputting a voice band signal sequence using a drive pulse sequence.

＜実施例＞本発明による音声符号化方式の構成を図面を用いて詳細
に説明する。第４図(a)は、本発明による音声符号化方
式の符号器側の一実施例を示すブロック図であり、第４
図(b)は復号器側の一実施例を示すブロック図である。
第４図(a)において、音声信号系列x(n)は、入力端子１
９５から入力され、あらかじめ定められたサンプル数だ
け区切られてバッファメモリ回路３４０に蓄積される。
次にＫパラメータ計算回路２８０は、バッファメモリ回
路３４０に蓄積されている音声信号のうち、あらかじめ
定められたサンプル数を入力し、入力音声信号のスペク
トル包絡を表わすＬＰＣパラメータを計算する。ＬＰＣ
パラメータとしては種々あるが以下ではＫパラメータを
用いるものとして説明を進める。尚、Ｋパラメータはパ
ーコール係数と同一のパラメータである。Ｋパラメータ
の計算法としては代表的な方法として自己相関法と、共
分散法がよく知られている。ここでは自己相関法による
Ｋパラメータの計算法を、ジョン・マクホウル（JOHN M
AKHOUL）氏らによるアイ・イー・イー・イートランザク
ションズオンエー・エス・エス・ピー（IEEE TRANS
ACTIONS ON A.S.S.P.）誌1975年６月号．309〜321頁に
掲載の「クォンタイゼイションプロパティズオブ
トランスミッションパラメーターズインリニア、
プリディクティブシステム」（“QUANTIZATION PROPE
RTIES OF TRANS MISSION PARAMETERS IN LINEAR PREDIC
TIVE SYSTEMS”）と題した論文（文献２）等に説明され
ている方法を引用して以下に示す。<Embodiment> The configuration of the audio encoding system according to the present invention will be described in detail with reference to the drawings. FIG. 4 (a) is a block diagram showing an embodiment of the encoder side of the audio encoding system according to the present invention.
FIG. 6B is a block diagram showing an embodiment of the decoder side.
In FIG. 4 (a), the audio signal sequence x (n) is input terminal 1
The data is input from 95, divided into a predetermined number of samples, and stored in the buffer memory circuit 340.
Next, the K parameter calculation circuit 280 inputs a predetermined number of samples of the audio signal stored in the buffer memory circuit 340, and calculates the LPC parameter representing the spectral envelope of the input audio signal. LPC
There are various parameters, but in the following description, the K parameter is used. The K parameter is the same parameter as the Percoll coefficient. The autocorrelation method and the covariance method are well known as typical methods for calculating the K parameter. Here, the calculation method of the K parameter by the autocorrelation method is described by JOHN M.
AKHOUL) et al. IE TRANSACTIONS ON AS TRANS
ACTIONS ON ASSP) June 1975 issue. See Quantization Properties of Pages 309-321.
Transmission Parameters Linear,
Predictive system "(" QUANTIZATION PROPE
RTIES OF TRANS MISSION PARAMETERS IN LINEAR PREDIC
The method described in the paper (Reference 2) entitled "TIVE SYSTEMS") is cited below.

E₀＝R(o) (5a) ａ_ｉ ^（ｉ）＝k_i (5c) ▲ａ_j ⁽ⁱ⁾▼＝▲ａ_j ^(i-1)▼＋▲ｋ_ia_i-j ^(i-1)▼，（１≦
ｊ≦ｉ−１） (5d) E_i＝（１−k_i ²）・E_i-1 (5e) a_j＝a_j ^(p)，（１≦ｊ≦ｐ） (5f) 式(5a)から式(5f)はｊ＝１，２，……ｐとして再帰的に
解くことができる。式において、k_iはｉ次目のＫパラメ
ータ値を示す。またR(i)は入力音声に対する遅れ時間ｉ
の自己相関々数を示す。Ｐは予測分析次数を示す。▲ａ
_j ^(p)▼は分析次数ｄの場合のｊ番目の線形予測係数を示
す。ここで式(5e)のE_iの値は次数ｉの予測における予測
誤差電力を示している。従って計算の各段階で次数ｉの
予測の予測誤差電力を監視することができる。E_iを用い
て正規化予測誤差は次式のように表わせる。E ₀ = R (o) (5a) a _i ⁽ⁱ⁾ = k _i (5c) ▲ a _j ⁽ⁱ⁾ ▼ = ▲ a _j ^(i-1) ▼ + ▲ k _i a _ij ^(i-1) ▼, (1 ≤
j ≦ i−1) (5d) E _i = (1-k _i ² ) · E _i-1 (5e) a _j = a _j ^(p) , (1 ≦ j ≦ p) (5f) Formula (5a) Therefore, equation (5f) can be recursively solved with j = 1, 2, ... P. In the equation, k _i indicates the i-th order K parameter value. R (i) is the delay time i with respect to the input voice.
The autocorrelation number of is shown. P indicates the prediction analysis order. ▲ a
_j ^(p) ▼ indicates the j-th linear prediction coefficient in the case of the analysis order d. Here, the value of E _i in equation (5e) represents the prediction error power in the prediction of order i. Therefore, the prediction error power of the prediction of order i can be monitored at each stage of the calculation. The normalized prediction error can be expressed as follows using E _i .

V_i＝E_i／R(o) (6) ｉ＝ｐの場合には(5e)式を用いてと表わせる。ここで１／Vpは予測利得ともよばれる。従
って(7)式を用いればｐ次予測分析の場合の正規化予測
誤差を知ることができる。以上で自己相関法によるＫパ
ラメータ計算法の説明を終える。V _i = E _i / R (o) (6) When i = p, use equation (5e) Can be expressed as Here, 1 / Vp is also called a prediction gain. Therefore, if the equation (7) is used, the normalized prediction error in the p-th order prediction analysis can be known. This is the end of the description of the K parameter calculation method based on the autocorrelation method.

第４図(a)に戻って、Ｋパラメータ計算回路２８０は、
式(5a)から式(5e)に従ってあらかじめ定められた次数M₁
（例えばM₁＝４）のＫパラメータK_i（１≦ｉ≦M₁）を計
算する。また(7)式に従ってＭ_１次の正規化予測誤差V_M1
を計算する。次に求まった正規化予測誤差V_M1をあらか
じめ定められたしきい値と比較して、V_M1がしきい値よ
りも小さければ入力信号は一例として有声と判別する。
一方、V_M1がしきい値よりも大きければ入力音声は無声
と判別する。このようにしたのは、音声信号の場合、有
声部では相関が大きいために予測し易く正規化予測誤差
はかなり小さな値となる。一方、音声信号の無声部およ
びデータモデム信号は相関が小さいために予測しにく
く、正規化予測誤差はあまり小さくはならないことにも
とずいている。ただし、ここでは説明の簡便さのため
に、有声と無声の２種類に分類したが、特に有声と無声
に分類する必要はなく、また、分類は２種類以上でもよ
い。Ｋパラメータ計算回路２８０は正規化予測誤差V_Pを
用いた有声／無声判別結果を１ビット情報ｄとしてＫパ
ラメータ符号化回路２００とインパルス応答計算回路２
１０とパルス計算回路３９０と合成フィルタ回路４００
と重み付け回路４１０と符号化回路４７０とマルチプレ
クサ４５０とへ出力する。更にＫパラメータ計算回路２
８０は、判別結果が無声であった場合にはM₁次まで求め
たＫパラメータ値K_i（１≦ｉ≦M₁，例えばM₁＝４）をＫ
パラメータ符号化回路２００へ出力する。この場合、信
号の相関が小さいのでM₁は４次程度以上としても予測利
得の向上はごくわずかである。一方、判別結果が有声で
あった場合には音声信号のスペクトル包絡をより精密に
表わすために更にM₂次（M₂≧M₁，例えばM₂＝１２）まで
のＫパラメータ値K_i（１≦ｉ≦M₂）を引き続き計算し、
K_i（１≦ｉ≦M₂）をＫパラメータ符号化回路２００へ出
力する。Returning to FIG. 4 (a), the K parameter calculation circuit 280
Predetermined order M ₁ according to equation (5a) to equation (5e)
The K parameter K _i (1 ≦ i ≦ M ₁ ) of (for example, M ₁ = 4) is calculated. Also, according to Eq. (7), the M ₁ -order normalized prediction error V _M1
To calculate. Next, the obtained normalized prediction error V _M1 is compared with a predetermined threshold value, and if V _M1 is smaller than the threshold value, the input signal is determined to be voiced as an example.
On the other hand, if V _M1 is larger than the threshold value, the input voice is determined to be unvoiced. The reason for this is that in the case of a voice signal, the correlation is large in the voiced part, so that it is easy to predict and the normalized prediction error is a considerably small value. On the other hand, the unvoiced part of the voice signal and the data modem signal are difficult to predict because the correlation is small, and the reason is that the normalized prediction error is not so small. However, for simplification of the description, the voiced and unvoiced voices are classified into two types, but it is not necessary to specifically classify into voiced voices and unvoiced voices, and two or more types may be used. The K parameter calculation circuit 280 sets the K parameter coding circuit 200 and the impulse response calculation circuit 2 as the voiced / unvoiced discrimination result using the normalized prediction error V _P as 1-bit information d.
10, pulse calculation circuit 390, synthesis filter circuit 400
To the weighting circuit 410, the encoding circuit 470, and the multiplexer 450. Furthermore, K parameter calculation circuit 2
80, K parameter value _{K i (1 ≦ i ≦ M} 1, for example, M ₁ = 4) obtained up to the _primary M when discrimination result is silent with K
It is output to the parameter encoding circuit 200. In this case, since the correlation of the signals is small, the improvement of the prediction gain is very small even if M ₁ is set to the fourth order or more. On the other hand, when the discrimination result is voiced, in order to more accurately represent the spectral envelope of the speech signal, K parameter values K _i (1) up to M ₂ _nd order (M ₂ ≧ M ₁ , for example M ₂ = 12) ≦ i ≦ M ₂ ) is continuously calculated,
The K _i (1 ≦ i ≦ M ₂ ) is output to the K parameter encoding circuit 200.

Ｋパラメータ符号化回路２００は、Ｋパラメータ計算回
路２８０から有声／無声判別情報ｄとＫパラメータ値K_i
とを入力する。Ｋパラメータ符号化回路２００は有声に
対する最適な量子化特性と無声に対する最適な量子化特
性の２種の量子化特性をもっており、判別情報ｄに従っ
てこの特性を切り換え、入力したＫパラメータK_iを符号
化し、符号l_kiをマルチプレクサ４５０へ出力する。ま
たＫパラメータ符号化回路２００は、l_kiを復号化して
得たＫパラメータ復号値K_iを用い前述の(5c),(5d),(5f)
式を用いて予測係数値a′_iに変換する。この際に有声／
無声判別情報ｄを用いて次数ｐをM₁またはM₂に切り換え
ておく。Ｋパラメータ符号化回路２００は、予測係数値
a′K_iをインパルス応答計算回路２１０と重み付け回路
４１０と合成フィルタ回路４００とへ出力する。The K parameter encoding circuit 200 receives the voiced / unvoiced discrimination information d and the K parameter value K _i from the K parameter calculation circuit 280.
Enter and. The K parameter coding circuit 200 has two kinds of quantization characteristics, that is, an optimum quantization characteristic for voiced voice and an optimum quantization characteristic for unvoiced voice. These characteristics are switched according to the discrimination information d, and the input K parameter _Ki is encoded. , Code l _ki is output to the multiplexer 450. In addition, the K parameter encoding circuit 200 uses the K parameter decoded value K _i obtained by decoding l _ki, as described in (5c), (5d), and (5f) above.
The prediction coefficient value a ′ _i is converted using the formula. Voice /
The order p is switched to M ₁ or M ₂ using the unvoiced discrimination information d. The K parameter encoding circuit 200 uses the prediction coefficient value
The a′K _i is output to the impulse response calculation circuit 210, the weighting circuit 410, and the synthesis filter circuit 400.

次にインパルス応答計算回路２１０は、Ｋパラメータ計
算回路２８０から有声／無声判別情報ｄとＫパラメータ
符号化回路２００から予測係数値a′_iを入力し、次式で
示される重み付けされた合成フィルタの伝達関数を表わ
すインパルス応答h_w(n)を、あらかじめ定められたサン
プル数だけ計算する。Next, the impulse response calculation circuit 210 inputs the voiced / unvoiced discrimination information d from the K parameter calculation circuit 280 and the prediction coefficient value a ′ _i from the K parameter encoding circuit 200, and outputs the weighted synthesis filter of the following equation. The impulse response h _w (n) representing the transfer function is calculated by a predetermined number of samples.

ここでＰは予測計数値a′_iの次数を示す。Ｐは有声／無
声判別情報ｄに従って切り換えられ、有声の場合はＰは
M₂（例えば１２）次にセットされ、無声の場合はＰはM₁
（例えば４）次にセットされる。また、W(Z)は前記(3)
式で示した重み付け関数のＺ変換表現である。但し次数
Ｐは、有声／無声情報ｄに従いM₂またはM₁に切り換えら
れる。インパルス応答計算回路２１０はインパルス応答
h_w(n)を自己相関々数計算回路３６０と相互相関々数計
算回路３５０とへ出力する。 Where P represents the order of the prediction count a _'i. P is switched according to the voiced / unvoiced discrimination information d. In the case of voiced, P is
M ₂ (eg 12) Set next, P is M ₁ if unvoiced
(Eg 4) is set next. Also, W (Z) is the above (3)
It is a Z-transform expression of the weighting function shown by a formula. However, the order P is switched to M ₂ or M ₁ according to the voiced / unvoiced information d. The impulse response calculation circuit 210 is an impulse response
It outputs h _w (n) to the autocorrelation coefficient calculation circuit 360 and the cross-correlation coefficient calculation circuit 350.

次に自己相関々数計算回路３６０は、インパルス応答計
算回路２１０からインパルス応答h_w(n)を入力し、次式
に従って自己相関々数R_hh(・)をあらかじめ定められた遅
れ時間τだけ計算する。Next, the autocorrelation coefficient calculation circuit 360 inputs the impulse response h _w (n) from the impulse response calculation circuit 210 and calculates the autocorrelation coefficient R _hh (.) According to the following equation for a predetermined delay time τ. To do.

自己相関々数R_hh（τ）はパルス計算回路３９０へ出力
される。 The autocorrelation factor R _hh (τ) is output to the pulse calculation circuit 390.

次に減算器２８５は、バッファメモリ回路３４０に蓄積
された音声信号x(n)を入力し、x(n)から合成フィルタ回
路４００の出力系列を１フレームサンプル分減算し、減
算結果e(n)を重み付け回路４１０へ出力する。Next, the subtractor 285 inputs the audio signal x (n) accumulated in the buffer memory circuit 340, subtracts the output sequence of the synthesis filter circuit 400 by one frame sample from x (n), and subtracts the result e (n ) Is output to the weighting circuit 410.

次に重み付け回路４１０は、減算器２８５から減算結果
e(n)を入力し、またＫパラメータ符号化回路２００から
予測係数値a′_iを入力し、Ｋパラメータ計算回路２８０
から有声／無声判別情報ｄを入力し、e(n)に対して重み
付けを施しe_w(n)を出力する。ここでe_w(n)はＺ変換表現
で次式のように書ける。Next, the weighting circuit 410 outputs the subtraction result from the subtractor 285.
e (n) is input, the prediction coefficient value a ′ _i is input from the K parameter encoding circuit 200, and the K parameter calculation circuit 280 is input.
The voiced / unvoiced discrimination information d is input from the above, weighting is applied to e (n), and e _w (n) is output. Here, e _w (n) is a Z-transform expression and can be written as

E_w(Z)＝E(2)・W(Z) (10) ここでE_w(Z)，E(Z)はそれぞれe_w(n)のＺ変換値，e(n)の
Ｚ変換値を示す。またW(Z)は前記(3)式で示される重み
付け関数のＺ変換値を示す。但しW(Z)の次数ｐは有声／
無声情報ｄに従いM₂またはM₁に切り換えられる。重み付
け回路４１０は、求めたe_w(n)を相互相関々数計算回路
３５０へ出力する。E _w (Z) ＝ E (2) ・ W (Z) (10) where E _w (Z) and E (Z) are the Z conversion value of e _w (n) and the Z conversion value of e (n), respectively. Indicates. W (Z) represents the Z-transformed value of the weighting function represented by the above equation (3). However, the degree p of W (Z) is voiced /
It is switched to M ₂ or M ₁ according to the unvoiced information d. The weighting circuit 410 outputs the obtained e _w (n) to the cross correlation coefficient calculation circuit 350.

次に相互相関々数計算回路３５０は、重み付け回路４１
０からe_w(n)を入力し、またインパルス応答計算回路２
１０からインパルス応答h_w(n)を入力し、次式に従って
相互相関々数_hx(n)をあらかじめ定められたサンプル
数だけ計算する。Next, the cross correlation coefficient calculation circuit 350 uses the weighting circuit 41.
Input e _w (n) from 0, and impulse response calculation circuit 2
The impulse response h _w (n) is input from 10 and the number of cross-correlation _parameters h _x (n) is calculated by a predetermined number of samples according to the following equation.

相互相関々数_hx(・)はパルス計算回路３９０へ出力さ
れる。 The cross correlation number _hx ( _.multidot. ) Is output to the pulse calculation circuit 390.

次にパルス計算回路３９０は、相互相関々数計算回路３
５０から相互相関々数_hx(・)を入力し、自己相関々数
計算回路３６０から自己相関々数R_hh(・)を入力し、Ｋパ
ラメータ計算回路２８０から有声／無声判別情報ｄを入
力する。ここでパルス計算回路３９０は、有声／無声判
別情報ｄに従って、１フレーム内に求められるパルス数
を切り換える。つまり有声の場合にはL₁個のパルスを求
め、無声の場合にはL₂個のパルスを求める。但し、L₁＜
L₂とする。無声の場合に、有声の場合と比較してパルス
数を増やす必要があるのは、前述したように無声の場合
は有声の場合に比べ予測利得が少ないためである。ここ
でパルス数は伝送ビットレイトに応じて決定されなくて
はならない。例えば、伝送ビットレイトを16kビット／
秒とすると、後述する量子化回路における量子化ビット
配分に従えば、有声の場合にL₁＝32，無声の場合にL₂＝
50個程度となる。Next, the pulse calculation circuit 390 uses the cross correlation coefficient calculation circuit 3
The cross-correlation _{coefficient hx} (.) _Is input from 50, the auto-correlation coefficient R _hh (.) _Is input from the auto-correlation coefficient calculation circuit 360, and the voiced / unvoiced discrimination information d is input from the K-parameter calculation circuit 280. . Here, the pulse calculation circuit 390 switches the number of pulses required in one frame according to the voiced / unvoiced discrimination information d. That prompted the L ₁ pulses in the case of voiced, in the case of silent determine the L ₂ pulses. However, L ₁ <
Set to L ₂ . The reason why it is necessary to increase the number of pulses in the unvoiced case as compared with the voiced case is that the unvoiced case has a smaller prediction gain than the voiced case as described above. Here, the number of pulses must be determined according to the transmission bit rate. For example, the transmission bit rate is 16 kbit /
In terms of seconds, according to the quantization bit allocation in the quantization circuit described later, L ₁ = 32 for voiced and L ₂ = unvoiced.
It will be about 50 pieces.

パルス計算回路３９０では、入力信号と合成信号との重
み付け誤差電力を最小化するパルス系列を、次式に従っ
て１パルスずつ順次計算する。The pulse calculation circuit 390 sequentially calculates a pulse sequence that minimizes the weighted error power between the input signal and the combined signal, pulse by pulse, according to the following equation.

ここでg_iはフレーム内のｉ番目にたつパルスの振幅を示
す。m_iはｉ番目のパルスのフレーム内のサンプル位置を
示す。またＬは１フレーム内に求めるパルス数を示し、
この値は前述のように有声／無声判別情報に従ってL
₁（有声の場合），またはL₂（無声の場合）に切り換え
られる。パルスの位置m_iはg_iの絶対値最大値をとるフレ
ーム内位置から求まる。 Here, g _i represents the amplitude of the i-th pulse in the frame. m _i indicates the sample position within the frame of the i-th pulse. L represents the number of pulses to be obtained in one frame,
This value is L according to the voiced / unvoiced discrimination information as described above.
Switchable to ₁ (if voiced) or L ₂ (if unvoiced). The position m _i of the pulse is obtained from the position in the frame where the maximum absolute value of g _i is taken.

次に、(12)に従ってパルスを１つずつ求める過程を、図
面を用いて説明する。第５図(a)は相互相関々数計算回
路３５０で計算され、パルス計算回路３９０へ出力され
た１フレーム分の相互相関々数を示す。図において横軸
は１フレーム内のサンプル時刻を示す。フレーム長は１
６０としている。縦軸は振幅である。第５図(b)は(12)
式に従って求めた第１番目のパルスg₁を示す図である。
第５図(c)は第５図(b)で求めたパルスの影響を差し引い
た後の図である。第５図(d)は第２番目のパルスg₂を求
めた図である。第５図(e)は第２番目のパルスg₂の影響
を差し引いた後の図である。第５図(d)から(e)の処理を
くり返してL₁またはL₂個のパルスを求める。Next, the process of obtaining the pulses one by one according to (12) will be described with reference to the drawings. FIG. 5A shows the cross-correlation count for one frame which is calculated by the cross-correlation count calculation circuit 350 and output to the pulse calculation circuit 390. In the figure, the horizontal axis represents the sample time within one frame. Frame length is 1
It is set to 60. The vertical axis is the amplitude. Figure 5 (b) is (12)
It is a figure which shows the _1st pulse g1 calculated | required according to a formula.
FIG. 5 (c) is a diagram after subtracting the influence of the pulse obtained in FIG. 5 (b). FIG. 5 (d) is a diagram in which the _second pulse g ₂ is obtained. FIG. 5 (e) is a diagram after subtracting the influence of the _second pulse g ₂ . The processes of FIG. 5 (d) to (e) are repeated to obtain L ₁ or L ₂ pulses.

第４図(a)に戻って、パルス計算回路３９０は(12)式に
従って求めたパルス系列を符号化回路４７０へ出力す
る。Returning to FIG. 4A, the pulse calculation circuit 390 outputs the pulse sequence obtained according to the equation (12) to the encoding circuit 470.

次に符号化回路４７０は、パルス計算回路３９０からパ
ルス系列を入力し、Ｋパラメータ計算回路２８０から有
声／無声判別情報ｄを入力する。符号化回路４７０は、
有声／無声判別情報ｄに従い、有声、無声の場合に対し
て量子化ビット数及び量子化特性を切り換える。量子化
特性を切り換えるのは、有声と無声の場合ではパルス振
幅の頻度分布が異なるので、各々の分布に対し最適な量
子化を施すためである。符号化回路４７０は、入力した
パルスの振幅，位置を符号化し、マルチプレクサ４５０
へ出力する。また、パルスの振幅，位置の復号値g′_i,
m′_iをパルス発生回路４２０へ出力する。ここでパルス
系列の符号化法は種々考えられる。一つは、パルス系列
の振幅，位置を別々に符号化する方法であり、また一つ
は振幅，位置を一緒に符号化する方法である。Next, the encoding circuit 470 inputs the pulse sequence from the pulse calculation circuit 390 and the voiced / unvoiced discrimination information d from the K parameter calculation circuit 280. The encoding circuit 470 is
According to the voiced / unvoiced discrimination information d, the number of quantization bits and the quantization characteristic are switched for voiced and unvoiced. The reason why the quantization characteristics are switched is that the frequency distributions of the pulse amplitudes are different between voiced and unvoiced voices, so that optimum quantization is applied to each distribution. The encoding circuit 470 encodes the amplitude and position of the input pulse, and the multiplexer 450
Output to. Also, the amplitude of the pulse and the decoded value g ′ _{i of} the position,
m ′ _i is output to the pulse generation circuit 420. Here, various pulse sequence encoding methods can be considered. One is a method of separately encoding the amplitude and the position of the pulse sequence, and the other is a method of encoding the amplitude and the position together.

前者の方法について一例を説明する。まず、パルス系列
の振幅の符号化法としては、フレーム内のパルス系列の
振幅の最大値を正規化計数として、この値を用いて各パ
ルスの振幅を正規化した後に、量子化，符号化する方法
が考えられる。量子化特性については、有声，無声，各
々の場合の振幅分布に応じた最適な特性を用いる。ま
た、各パルスの振幅を直交関係にある他のパラメータに
変換した後に量子化，符号化を施してもよい。また、パ
ルス振幅毎にビット割り当てを変えてもよい。次に、パ
ルス位置の符号化についても種々の方法が考えられる。
例えば、ファクシミリ信号符号化等でよく知られている
ランレングス符号等を用いてもよい。これは符号“０”
または“１”の続く長さをあらかじめ定められた符号系
列を用いて表わすものである。また、正規化係数の符号
化には、従来よく知られている対数圧縮符号化等を用い
ることができる。An example of the former method will be described. First, as the encoding method of the amplitude of the pulse sequence, the maximum value of the amplitude of the pulse sequence in the frame is used as a normalization count, and the amplitude of each pulse is normalized using this value, and then quantized and encoded. A method can be considered. As the quantization characteristic, the optimum characteristic according to the amplitude distribution in each case of voiced and unvoiced is used. Further, the amplitude of each pulse may be quantized and encoded after being converted into another parameter having an orthogonal relationship. Also, bit allocation may be changed for each pulse amplitude. Next, various methods can be considered for encoding the pulse position.
For example, a run length code or the like which is well known in facsimile signal coding or the like may be used. This is the code "0"
Alternatively, the length following "1" is represented by using a predetermined code sequence. Further, conventionally well-known logarithmic compression encoding or the like can be used for encoding the normalization coefficient.

次に有声，無声の各場合に対する量子化ビット配分の一
例を以下に示す。伝送ビットレイトは16kビット／秒と
する。もし判別情報ｄが有声であった場合には、パルス
振幅の量子化ビット数は５ビット，パルス位置のビット
数は３ビットとする。一方、判別情報が無声であった場
合には、パルス振幅の量子化ビット数は４ビット、パル
ス位置のビット数は２ビットとする。このビット配分に
従えば、伝送ビットレイトを16kビット／秒とした場合
に、前述のように、有声に対するパルス数は32，無声に
対するパルス数は50程度となる。An example of quantized bit allocation for voiced and unvoiced cases is shown below. The transmission bit rate is 16 kbit / sec. If the discrimination information d is voiced, the quantization bit number of the pulse amplitude is 5 bits and the bit number of the pulse position is 3 bits. On the other hand, when the discrimination information is unvoiced, the quantization bit number of the pulse amplitude is 4 bits and the bit number of the pulse position is 2 bits. According to this bit allocation, when the transmission bit rate is 16 kbit / sec, the number of pulses for voiced voice is 32 and the number of pulses for unvoiced voice is about 50, as described above.

尚、パルス系列の符号化に関しては、ここで説明した符
号化方法方式に限らず、衆知の最良の方法を用いること
ができることは勿論である。Regarding the encoding of the pulse sequence, it is needless to say that the best known method can be used without being limited to the encoding method described here.

第４図(a)に戻って、パルス発生回路４２０は、パルス
系列復号値g′_i，m′_iを用いてm′_iの位置に振幅g′_iを
もつ駆動パルス系列を発生させる。パルス発生回路４２
０は、駆動パルス系列を合成フィルタ回路４００へ出力
する。Returning to 4 (a), the pulse generating circuit 420, the pulse sequence decoded value g _'i, m' with _i m 'to the position of the _i amplitude g' generates a drive pulse sequence with _i. Pulse generation circuit 42
0 outputs the drive pulse sequence to the synthesis filter circuit 400.

合成フィルタ回路４００は、パルス発生回路４２０から
駆動パルス系列を入力し、Ｋパラメータ計算回路２８０
から有声／無声判別情報ｄを入力し、Ｋパラメータ符号
化回路２００から予測係数復号値a′_iを入力する。合成
フィルタ回路４００は、入力した駆動パルス系列と予測
係数復号値a′_iとを用いて１フレーム分の応答信号系列を次式に従って計算する。The synthesis filter circuit 400 inputs the drive pulse sequence from the pulse generation circuit 420, and receives the K parameter calculation circuit 280.
The voiced / unvoiced discrimination information d is input from, and the prediction coefficient decoded value a ′ _i is input from the K parameter encoding circuit 200. The synthesis filter circuit 400 uses the input drive pulse sequence and decoded prediction coefficient a ′ _i to generate a response signal sequence for one frame. Is calculated according to the following formula.

ここでの値は２フレーム分（１≦ｎ≦２Ｎ）計算される。d(n)
は駆動信号を表わし、１≦ｎ≦Ｎではパルス発生回路４
２０から入力した駆動パルス系列を用いる。またN+1≦
ｎ≦2Nでは全て０の系列を用いる。次数ｐは判別情報ｄ
に従って切り換え、有声の場合はM₂（例えば12）次，無
声の場合はM₁（例えば４）次とする。(13)で求めたのうち、２フレーム目のの値が減算器２８５へ出力される。 here The value of is calculated for two frames (1 ≦ n ≦ 2N). d (n)
Represents a drive signal, and when 1 ≦ n ≦ N, the pulse generation circuit 4
The drive pulse sequence input from 20 is used. Also N + 1 ≤
When n ≦ 2N, a sequence of all 0 is used. The order p is the discrimination information d
In the case of voiced, M ₂ (for example, 12th) order is selected, and in the case of unvoiced, M ₁ (for example, 4th) order. Found in (13) Of the second frame Is output to the subtractor 285.

次にマルチプレクサ４５０は、符号化回路４７０の出力
符号とＫパラメータ符号化回路２００の出力符号とＫパ
ラメータ符号化回路２８０からの判別情報を表わす１ビ
ット符号とを入力し、これらを組み合わせて送信側出力
端子４８０から通信路へ出力する。以上で本発明による
音声符号化方式の符号器側の説明を終える。Next, the multiplexer 450 inputs the output code of the encoding circuit 470, the output code of the K parameter encoding circuit 200, and the 1-bit code representing the discrimination information from the K parameter encoding circuit 280, and combines them to the transmitting side. Output from the output terminal 480 to the communication path. This is the end of the description of the encoder side of the speech encoding system according to the present invention.

次に本発明による音声符号化方式の復号器側について第
４図(b)を参照して説明する。デマルチプレクサ５００
は、復号器側入力端子４９０から組み合わされた符号を
入力する。デマルチプレクサ５００は入力した符号のう
ち、Ｋパラメータを表わす符号とパルス系列を表わす符
号と有声／無声判別情報を表わす１ビット符号とを分離
し、Ｋパラメータを表わす符号をＫパラメータ復号回路
５２０へ出力し、パルス系列を表わす符号をパルス系列
復号回路５３０へ出力し、有声／無声判別情報を表わす
１ビット符号をＫパラメータ復号回路５２０とパルス系
列復号回路５３０と合成フィルタ回路５５０とへ出力す
る。Next, the decoder side of the voice encoding system according to the present invention will be described with reference to FIG. 4 (b). Demultiplexer 500
Inputs the combined code from the decoder side input terminal 490. Of the input codes, the demultiplexer 500 separates the code representing the K parameter, the code representing the pulse sequence, and the 1-bit code representing the voiced / unvoiced discrimination information, and outputs the code representing the K parameter to the K parameter decoding circuit 520. Then, the code representing the pulse sequence is output to pulse sequence decoding circuit 530, and the 1-bit code representing the voiced / unvoiced discrimination information is output to K parameter decoding circuit 520, pulse sequence decoding circuit 530, and synthesis filter circuit 550.

次にパルス系列復号回路５３０は、有声／無声判別情報
を表わす符号とパルス系列を表わす符号とを入力し、有
声／無声判別情報を表わす符号に従って、有声の場合に
はL₁（例えば32）個のパルス系列を復号化する。一方、
無声の場合にはL₂（例えば50）個のパルス系列を復号化
する。復号化されたパルス系列の振幅，位置情報はパル
ス発生回路５４０へ出力される。パルス発生回路５４０
は、復号化された振幅，位置情報を入力し駆動パルス系
列を発生させ、合成フィルタ回路５５０へ出力する。Next, the pulse sequence decoding circuit 530 inputs the code representing the voiced / unvoiced discrimination information and the code representing the pulse sequence, and according to the code representing the voiced / unvoiced discrimination information, L ₁ (for example, 32) Decode the pulse sequence of. on the other hand,
When unvoiced, L ₂ (eg 50) pulse sequences are decoded. The amplitude and position information of the decoded pulse sequence are output to the pulse generation circuit 540. Pulse generation circuit 540
Receives the decoded amplitude and position information, generates a drive pulse sequence, and outputs the drive pulse sequence to the synthesis filter circuit 550.

次にＫパラメータ復号回路５２０は、有声／無声判別情
報を表わす符号とＫパラメータを表わす符号とを入力
し、有声／無声判別情報を表わす符号に従って、有声の
場合にはM₂（例えば12）次のＫパラメータを復号化す
る。一方、無声の場合にはM₁（例えば４）次のＫパラメ
ータを復号化する。復号化され求めたパラメータ値K_iは
合成フィルタ回路５５０へ出力される。Next, the K parameter decoding circuit 520 inputs a code representing the voiced / unvoiced discrimination information and a code representing the K parameter, and in accordance with the code representing the voiced / unvoiced discrimination information, in the case of voiced M ₂ (for example, 12) th order Decode the K parameters of On the other hand, in the case of being unvoiced, the M ₁ (for example, 4) th order K parameter is decoded. The decoded and obtained parameter value K _i is output to the synthesis filter circuit 550.

次に合成フィルタ回路５５０は、有声／無声判別情報を
表わす符号と駆動パルス系列と、Ｋパラメータ復号値K_i
とを入力する。Ｋパラメータ復号値K_iは前述の(5c),(5
d),(5f)式を用いて予測係数値a′_iに変換される。この
際に有声／無声判別情報を表わす符号に従って次数ｐを
M₁またはM₂に切り換えておく。合成フィルタ回路５５０
は次式に従って合成信号を１フレーム分計算し、受信側出力端子５６０から出力
する。Next, the synthesis filter circuit 550 outputs a code representing the voiced / unvoiced discrimination information, the driving pulse sequence, and the K parameter decoded value K _i.
Enter and. The K parameter decoded value K _i is the same as (5c), (5
Prediction coefficient value a ′ _i is converted using d) and (5f). At this time, the order p is determined according to the code representing the voiced / unvoiced discrimination information.
Switch to M ₁ or M ₂ . Synthesis filter circuit 550
Is the combined signal according to Is calculated for one frame and is output from the reception side output terminal 560.

ここでd(n)は駆動パルス系列を示す。また次数ｐは有声
／無声判別情報を表わす符号に従ってM₁またはM₂に切り
換えられる。以上で本発明による復号器側の説明を終え
る。 Here, d (n) represents a drive pulse sequence. The order p is switched to M ₁ or M ₂ according to a code representing voiced / unvoiced discrimination information. This is the end of the description on the decoder side according to the present invention.

本実施例の構成によれば、パルス系列を前述の(12)式に
従い求めているので、文献１の従来方式のように、音源
パルスで合成フィルタを駆動して再生信号を求め、原信
号との２乗誤差をフードバックしてパルスを調整すると
いう径路がなく、またその処理をくり返す必要もないの
で、演算量を大幅に低減できる。但し、パルス計算アル
ゴリズムを実施例にて説明した方法に限定するものでは
なく、演算量の増加を許せば、文献１に例を示すような
Ａ−ｂ−Ｓ的手法によるパルス計算アルゴリスムを用い
てもよい。According to the configuration of the present embodiment, since the pulse sequence is obtained according to the above-mentioned equation (12), the reproduction signal is obtained by driving the synthesis filter with the sound source pulse as in the conventional method of Document 1 to obtain the original signal. Since there is no path for adjusting the pulse by hooding back the squared error of 1 and there is no need to repeat the processing, the amount of calculation can be greatly reduced. However, the pulse calculation algorithm is not limited to the method described in the embodiment, and if the calculation amount is allowed to increase, a pulse calculation algorithm based on the A-B-S method as shown in Reference 1 is used. Good.

尚、(12)式に示したパルス計算法においては、パルスを
１つずつ順番に計算していた。この方法においては次の
パルスを計算する際にこれより過去に求まった複数個の
パルスの振幅を再調整するようにしてもよい。このよう
にすることによってパルス間の距離が短く、パルスが互
いに独立でない場合に特性が向上する。また音源パルス
を求める方法としては、より最適なパルス系列を計算す
る方法のような他の良好なパルス系列計算法を用いても
よい。In the pulse calculation method shown in the equation (12), pulses were calculated one by one. In this method, the amplitudes of a plurality of pulses obtained in the past may be readjusted when the next pulse is calculated. By doing this, the characteristics are improved when the distance between the pulses is short and the pulses are not independent of each other. As a method for obtaining the sound source pulse, another good pulse sequence calculation method such as a method for calculating a more optimal pulse sequence may be used.

また本実施例においては、符号器側で正規化予測誤差を
前述の(7)式に従い計算し、この値に応じて有声／無声
判別情報をつくっていたが、有声／無声判別情報のつく
り方としては次に示すようにしてもよい。今、伝送ビッ
トレイトを１６ｋビット／秒とする。パルス計算回路３
９０では無声と判断された場合の個数L₁（例えば50）個
のパルスを求め、符号化回路４７０では例えば各パルス
の振幅に対し４ビットの量子化を施し、各パルス位置を
２ビットの符号で表わす。各パルスの振幅，位置を復号
化し、次式に従って誤差電力E₁を計算する。Further, in the present embodiment, the normalized prediction error is calculated on the encoder side in accordance with the above equation (7), and the voiced / unvoiced discrimination information is created according to this value. May be as follows. Now, assume that the transmission bit rate is 16 kbit / sec. Pulse calculation circuit 3
At 90, the number L ₁ (for example, 50) of pulses when it is determined to be unvoiced is obtained, and at the encoding circuit 470, for example, 4-bit quantization is performed on the amplitude of each pulse, and each pulse position is a 2-bit code. Express with. The amplitude and position of each pulse are decoded, and the error power E ₁ is calculated according to the following equation.

ここでR_ee(o)は重み付け回路４１０の出力値e_w(n)のＮ
サンプル分の電力を示す。Ｌはパルスの個数（この場合
はL₁）、g′_iはｉ番目のパルスの復号されたパルス振
幅，m′_iはｉ番目のパルスの復号された位置、_hx(・)
は相互相関々数を示す。さらにL₁個のパルスのうち振幅
の大きな方から順に有声と判断された場合の個数L₂（例
えば32）個のパルスを選び、符号化回路４７０において
各パルス振幅に対し５ビット量子化を施し、各パルス位
置を３ビット符号で表わし復号化する。復号値を用いて
前述の(15)式に従って誤差電力E₂を計算する。但し、(1
5)式のＬはL₂としなくてはならない。次にE₁とE₂とを比
較し、E₁の方が小さければ無声と判断し、判別符号を無
声を示す符号にセットし、パルス数をL₁個とする。一
方、E₂の方が小さければ有声と判断し、判別符号を有声
を示す符号にセットし、パルス数をL₂個とする。このよ
うな構成とすることによって、量子化効果も含めたオー
バーオールの特性による有声／無声判別を行なうことが
できるので、特性がさらに向上する。 Here, R _ee (o) is N of the output value e _w (n) of the weighting circuit 410.
The power of the sample is shown. L is the number of pulses (in this case L _1), g _'i is the i th decoded pulse amplitude of the pulse, m' _i is the i th decoded position of the pulse, _hx (·)
Indicates the number of cross correlations. Further, among the L ₁ pulses, the number L ₂ (for example, 32) pulses when the voices are judged to be voiced in order from the largest amplitude are selected, and the encoding circuit 470 performs 5-bit quantization on each pulse amplitude. , Each pulse position is represented by a 3-bit code and decoded. Using the decoded value, the error power E ₂ is calculated according to the above equation (15). However, (1
L in equation (5) must be L ₂ . Next, E ₁ and E ₂ are compared, and if E ₁ is smaller, it is determined to be unvoiced, the discrimination code is set to a code indicating unvoiced, and the number of pulses is L ₁ . On the other hand, if E ₂ is smaller, it is determined to be voiced, the discrimination code is set to a code indicating voiced, and the number of pulses is L ₂ . With such a configuration, it is possible to perform voiced / unvoiced discrimination based on the overall characteristic including the quantization effect, so that the characteristic is further improved.

また本実施例においては、有声／無声判別情報を用い
て、符号器側ではＫパラメータ符号化回路２００，符号
化回路４７０の量子化特性，量子化ビット配分を切り換
え、復号器側ではＫパラメータ復号回路５２０，パルス
復号回路の復号特性を切り換えていた。装置構成をより
簡略化するために、量子化特性，量子化ビット配分，復
号特性は有声，無声で切り換えずに同じ特性としてもよ
い。Further, in the present embodiment, using the voiced / unvoiced discrimination information, the encoder side switches the K parameter encoding circuit 200, the quantization characteristic of the encoding circuit 470, and the quantization bit distribution, and the decoder side performs the K parameter decoding. The decoding characteristics of the circuit 520 and the pulse decoding circuit were switched. In order to further simplify the device configuration, the quantization characteristic, the quantization bit allocation, and the decoding characteristic may be voiced or unvoiced and may be the same characteristic without switching.

また本実施例においては、有声／無声判別情報を用い
て、符号器側ではＫパラメータ計算回路２８０でＫパラ
メータの次数を切り換えていた。一方、復号器側ではＫ
パラメータ復号回路５２０，合成フィルタ回路５５０の
次数を切り換えていたが、この次数に関する切り換え操
作はなくてもよい。Further, in this embodiment, the order of the K parameter is switched by the K parameter calculation circuit 280 on the encoder side using the voiced / unvoiced discrimination information. On the other hand, K on the decoder side
Although the orders of the parameter decoding circuit 520 and the synthesis filter circuit 550 are switched, the switching operation regarding this order may not be performed.

また本実施例においては、合成フィルタ回路５５０の次
数を、有声／無声判別情報を入力して切り換えていた
が、有声／無声判別情報を用いた切り換え操作はなくて
もよい。これはＫパラメータ復号回路５２０から入力す
るＫパラメータ復号値の次数が有声／無声判別情報に応
じてすでに切り換えられているためである。Further, in the present embodiment, the order of the synthesis filter circuit 550 is switched by inputting the voiced / unvoiced discrimination information, but the switching operation using the voiced / unvoiced discrimination information may be omitted. This is because the order of the K parameter decoded value input from the K parameter decoding circuit 520 has already been switched according to the voiced / unvoiced discrimination information.

また本実施例においては、パルス計算回路３９０におい
て有声／無声判別情報を用いてフレーム内に求めるパル
ス数Ｌを切り換えていたが、パルス計算回路３９０で求
めるパルス数は有声，無声とも同じ数としL₁（例えば5
0）個計算しておき、マルチプレクサ４５０においてパ
ルス系列を表わす符号を伝送する際に、有声／無声判別
情報を用いて伝送するパルス数を切り換えてもよい。こ
のような構成とした場合、パルス数の少ない方に切り換
えて伝送する際には例えばパルス振幅の大きなものから
L₂（例えば32）個選び出して伝送すればよい。Further, in the present embodiment, the pulse calculation circuit 390 uses the voiced / unvoiced discrimination information to switch the number of pulses L to be obtained in the frame, but the number of pulses calculated by the pulse calculation circuit 390 is the same for both voiced and unvoiced L. ₁ (eg 5
0) pieces may be calculated and the number of pulses to be transmitted may be switched using voiced / unvoiced discrimination information when transmitting a code representing a pulse sequence in the multiplexer 450. With such a configuration, when switching to the one with a smaller number of pulses and transmitting, for example, from the one with a large pulse amplitude
L ₂ (for example, 32) pieces may be selected and transmitted.

また本実施例においては、パルス数を切り換える種類を
L₁個またはL₂個の種類としたが、３種類以上のパルス数
に切り換えるようにしてもよい。但しこのようにした場
合には、符号器側で有声／無声判別を行なうためのしき
い値を２種類以上用意することと復号器側に伝送する判
別符号のビット数を増やす必要がある。In addition, in this embodiment, the type of switching the pulse number is
Although the number of pulses is L ₁ or L ₂ , the number of pulses may be switched to three or more. However, in this case, it is necessary to prepare two or more thresholds for the voiced / unvoiced discrimination on the encoder side and increase the number of bits of the discrimination code transmitted to the decoder side.

本実施例の構成においては、短時間スペクトル構造を表
わすインパルス応答系列の自己相関々数を計算する際
に、インパルス応答計算回路２１０によってＫパラメー
タ復号値を用いてインパルス応答を計算した後に、この
インパルス応答を用いて自己相関々数計算回路３６０に
て自己相関々数を計算していた。ディジタル信号処理の
分野でよく知られているように、インパルス応答の自己
相関々数はパワスペクトルと対応関係にある。従ってま
ずＫパラメータ復号値を用いてパワスペクトルを求め、
その後にこの対応関係を用いて自己相関々数を計算する
ような構成としてもよい。一方、音声信号と短時間スペ
クトル包絡を表わすインパルス応答との相関々数を計算
する際に、本実施例の構成では重み付け回路４１０の出
力値e_w(n)とＫパラメータ復号器K_iを用いてインパルス
応答計算回路２１０にて計算したインパルス応答h_w(n)
を用いて相互相関々数_hx(・)を計算していた。よく知
られているように、相互相関々数はクロス・パワスペク
トルと対応関係にある。従ってまずe_w(n)とK_iとを用い
てクロス・パワスペクトルを求め、その後に相互相関々
数を計算するような構成としてもよい。尚、パワスペク
トルと自己相関々数との対応関係，クロス・パワスペク
トルと相互相関々数との対応関係については、エー・ブ
イ・オッペンハイム（A.V.OPPENHEIM）氏らによる「デ
ィジタル信号処理」（“DIGITAL SIGNAL PROCESSING"）
と題した単行本（文献3）の第８章にて詳細に説明され
ているので、ここでは説明を省略する。In the configuration of the present embodiment, when calculating the autocorrelation coefficient of the impulse response sequence representing the short-time spectrum structure, the impulse response calculation circuit 210 calculates the impulse response using the K parameter decoded value, and then the impulse response is calculated. The autocorrelation coefficient calculation circuit 360 calculates the autocorrelation coefficient using the response. As is well known in the field of digital signal processing, the autocorrelation number of the impulse response corresponds to the power spectrum. Therefore, first, the power spectrum is obtained using the K parameter decoded value,
After that, the correspondence relationship may be used to calculate the autocorrelation number. On the other hand, when calculating the correlation coefficient between the speech signal and the impulse response representing the short-time spectrum envelope, the output value e _w (n) of the weighting circuit 410 and the K parameter decoder K _i are used in the configuration of this embodiment. Impulse response h _w (n) calculated by the impulse response calculation circuit 210
_Was used to calculate the cross-correlation number _hx (•). As is well known, the cross correlation number corresponds to the cross power spectrum. Therefore, the configuration may be such that the cross-power spectrum is first obtained using e _w (n) and K _i, and then the number of cross-correlation is calculated. Regarding the correspondence between the power spectrum and the autocorrelation number and the correspondence between the cross power spectrum and the crosscorrelation number, "Digital Signal Processing" by AVOPPENHEIM et al. SIGNAL PROCESSING ")
Since it has been described in detail in Chapter 8 of the book entitled (Reference 3), its explanation is omitted here.

本実施例においては、１フレーム内のパルス系列の符号
化は、パルス系列が全て求まった後に、第４図(a)の符
号化回路４７０によって符号化を施したが、符号化をパ
ルス系列の計算に含めて、パルスを１つ計算する毎に、
符号化を行ない、次のパルスを計算するという構成にし
てもよい。このような構成をとることによって、符号化
の歪をも含めた誤差を最小とするようなパルス系列が求
まるので、更に品質を向上させることができる。In the present embodiment, the encoding of the pulse sequence within one frame is performed by the encoding circuit 470 of FIG. 4 (a) after all the pulse sequences have been obtained. Included in the calculation, every time one pulse is calculated,
The encoding may be performed and the next pulse may be calculated. By adopting such a configuration, a pulse sequence that minimizes an error including coding distortion can be obtained, so that the quality can be further improved.

本実施例によれば、フレーム境界での波形の不連続に起
因したフレーム境界近傍での再生信号がほとんどない。
これは、符号器側において、現フレームのパルス系列を
計算する際に、１フレーム過去の駆動音源パルス系列に
よって合成フィルタを駆動してて得られた応答信号系列
を、現フレームにまで伸ばして求め、これを入力音声信
号系から減算した結果に対して現フレームのパルス系列
を計算するという構成にしたことに起因している。ま
た、本実施例ではフレーム長を一定とした場合について
説明したが、フレーム長を時間的に変化させる可変長フ
レームとしてもよい。また、１フレーム内にたてる音源
パルスの個数は一定でなくてもよい。例えばS/Nを一定
とするように各フレームのパルス系列の個数を変化させ
るようにしてもよい。According to this embodiment, there is almost no reproduced signal near the frame boundary due to the discontinuity of the waveform at the frame boundary.
This is because when the encoder side calculates the pulse sequence of the current frame, the response signal sequence obtained by driving the synthesis filter by the drive excitation pulse sequence of one frame past is extended to the current frame and obtained. This is due to the fact that the pulse sequence of the current frame is calculated for the result of subtracting this from the input audio signal system. Further, although the case where the frame length is constant has been described in the present embodiment, a variable length frame in which the frame length is temporally changed may be used. Further, the number of sound source pulses generated in one frame may not be constant. For example, the number of pulse sequences in each frame may be changed so that the S / N is constant.

また、本実施例においては、短時間音声信号系列のスペ
クトル包絡を表わすパラメータとしてはＫパラメータを
用いたが、これはよく知られている他のパラメータ（例
えばＬＳＰパラメータ等）を用いてもよい。更に前述の
(8)式，(10)式において重み付け関数W(Z)はなくてもよ
い。Further, in this embodiment, the K parameter is used as the parameter representing the spectrum envelope of the short-time speech signal sequence, but other well-known parameters (for example, LSP parameter etc.) may be used. Further above
The weighting function W (Z) may be omitted in Eqs. (8) and (10).

また、本実施例においては、フレーム境界での再生波形
の不連続に起因する品質劣化を防ぐために、現フレーム
より１フレーム過去の駆動音源パルスに由来した応答信
号系列を計算し、現フレームの入力音声からこの応答信
号を減算した後に、パルス系列を計算したが、第６図に
示すように、パルス系列の計算に用いるデータとして、
パルスを伝送するフレームのデータとそれよりも過去の
データとを含むような構成にしてもよい。図６で、N_Tは
パルスを伝送するフレームを示し、Ｎは音源パルスを計
算するフレームを示す。このような構成とすることによ
って、１フレーム過去の駆動音源パルスに由来した応答
信号系列を計算する必要がなくなる。Further, in the present embodiment, in order to prevent the quality deterioration due to the discontinuity of the reproduced waveform at the frame boundary, the response signal sequence derived from the driving sound source pulse one frame before the current frame is calculated, and the current frame is input. After subtracting this response signal from the voice, the pulse sequence was calculated. As shown in FIG. 6, the data used to calculate the pulse sequence was:
It may be configured so as to include data of a frame transmitting a pulse and data in the past. In FIG. 6, N _T indicates a frame for transmitting a pulse, and N indicates a frame for calculating a sound source pulse. With such a configuration, it is not necessary to calculate the response signal sequence derived from the driving sound source pulse of one frame past.

＜発明の効果＞以上説明したように本発明によれば、常に良好な品質の
再生信号を提供できるように、フレームあたりのパルス
数を変化させているので、伝送ビットレイトが16kビッ
ト／秒程度でパルス数が十分でない場合には良好な特性
を得ることが困難であった音声信号の子音部の特性を改
善することができるだけでなく、やはり良好な特性を得
ることが困難であった2400ビット／秒程度の音声帯域デ
ータモデム信号も良好に伝送できるという効果がある。<Advantages of the Invention> As described above, according to the present invention, the number of pulses per frame is changed so that a reproduction signal of good quality can be provided at all times, so that the transmission bit rate is about 16 kbit / sec. It was difficult to obtain good characteristics when the number of pulses was not enough in 2. Not only could the characteristics of the consonant part of the audio signal be improved, but it was also difficult to obtain good characteristics 2400 bits There is an effect that a voice band data modem signal of about 1 / second can be satisfactorily transmitted.

[Brief description of drawings]

第１図は従来方式の構成を示すブロック図、第２図は音
源パルス系列の一例を示す図、第３図は入力音声信号系
列の周波数特性と第１図に記載の重み付け回路の周波数
特性の一例を示す図、第４図(a),(b)は本発明による音
声符号化方式の一実施例を示すブロック図、第５図(a)
〜(e)はパルス探索過程の一例を示す図、第６図はパル
ス伝送フレームと音源パルス計算フレームとの位置関係
を説明するための図である。図において、110,340……バッファメモリ回路、120,285
……減算回路、130,400,550……合成フィルタ回路、14
0,420,540……パルス発生回路、150……誤差最小化回
路、180,280……Ｋパラメータ計算回路、190,410……重
み付け回路、200……Ｋパラメータ符号化回路、210……
インパルス応答計算回路、350……相互相関関数計算回
路、360……自己相関関数計算回路、390……パルス計算
回路、470……符号化回路、450……マルチプレクサ、50
0……デマルチプレクサ、520……Ｋパラメータ復号回
路、530……パルス復号回路をそれぞれ示す。FIG. 1 is a block diagram showing a configuration of a conventional system, FIG. 2 is a diagram showing an example of a sound source pulse sequence, and FIG. 3 is a frequency characteristic of an input audio signal sequence and a frequency characteristic of a weighting circuit shown in FIG. FIG. 4 (a) and FIG. 4 (b) are block diagrams showing an embodiment of a voice coding system according to the present invention, and FIG. 5 (a).
(E) is a figure which shows an example of a pulse search process, FIG. 6 is a figure for demonstrating the positional relationship between a pulse transmission frame and a sound source pulse calculation frame. In the figure, 110,340 ... buffer memory circuit, 120,285
...... Subtraction circuit, 130,400,550 …… Synthesis filter circuit, 14
0,420,540 …… Pulse generation circuit, 150 …… Error minimization circuit, 180,280 …… K parameter calculation circuit, 190,410 …… Weighting circuit, 200 …… K parameter coding circuit, 210 ……
Impulse response calculation circuit, 350 ... Cross-correlation function calculation circuit, 360 ... Autocorrelation function calculation circuit, 390 ... Pulse calculation circuit, 470 ... Encoding circuit, 450 ... Multiplexer, 50
0 ... Demultiplexer, 520 ... K parameter decoding circuit, 530 ... Pulse decoding circuit, respectively.

Claims

[Claims]

1. A transmitter side inputs a discrete voice band signal sequence, extracts a spectrum parameter sequence representing a short-time spectrum envelope, and outputs the voice band signal based on the voice band signal sequence and the spectrum parameter sequence. A pulse sequence that can represent a sequence satisfactorily is searched for, a discrimination code that determines the number of transmission pulse sequences based on the spectral parameter sequence extraction result or the pulse sequence search result is created, and the transmission pulse sequence and the The spectrum parameter sequence is encoded and output in combination with the discrimination code, the receiving side separates the discrimination code from the combined code, and the code representing the spectrum parameter sequence according to the discrimination code and the transmission pulse sequence. Is separated and decoded, and the decoded spectrum parameter sequence and Voice band signaling method characterized by using a serial decoded pulse sequence and to reproduce the voice band signal sequence.

2. A parameter calculation circuit for inputting a discrete voice band signal sequence and extracting a spectrum parameter sequence representing a short-time spectrum envelope from the voice band signal sequence, the voice band signal sequence and the spectrum parameter sequence. Based on a pulse sequence search circuit that searches for a pulse sequence that can satisfactorily represent the voice band signal sequence, and a discrimination code that determines the number of transmission pulse sequences based on the spectrum parameter sequence extraction result or the pulse sequence search result. A voice band signal sequence encoding apparatus comprising: a discriminating circuit to be produced, and means for encoding the transmission pulse sequence and the spectrum parameter sequence in accordance with the discriminating code and outputting in combination with the discriminating code.

3. A spectrum parameter sequence representing a short-time spectrum envelope is extracted from a discrete voice band signal sequence from the transmitting side, and the voice band signal sequence is made good based on the voice band signal sequence and the spectrum parameter sequence. A pulse sequence that can be represented by, and create a discrimination code that determines the number of transmission pulse sequences based on the spectrum parameter sequence extraction result or the pulse sequence search result, and according to the discrimination code, the transmission pulse sequence and the spectrum parameter sequence And a code output by combining with the discrimination code is input, the discrimination code is separated from the combined code sequence, and a code representing a spectrum parameter sequence and a code representing a pulse sequence are further generated according to the discrimination code. A means for separating and decoding, and a drive pattern using the decoded pulse sequence. A voice band signal including a pulse sequence generation circuit for generating a voice sequence, and a synthesis filter circuit for reproducing and outputting a voice band signal sequence using the decoded spectrum parameter sequence and the driving pulse sequence. Decoding device.