JP3159012B2

JP3159012B2 - Audio signal encoding device and decoding device

Info

Publication number: JP3159012B2
Application number: JP30190995A
Authority: JP
Inventors: 兼史杵築; 俊弘丸山
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 1995-10-26
Filing date: 1995-10-26
Publication date: 2001-04-23
Anticipated expiration: 2015-10-26
Also published as: JPH09130260A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は音響信号の符号化装
置及び復号化装置に係り、ＰＣＭ(Pulse Code Modulati
on)方式で変調された音響信号をＭＤＣＴ(Modified Dis
crete Cosine Transform)方式で直交変換すると共に振
幅成分と位相成分に分離してベクトル量子化する高能率
符号化技術と、その符号化データの復号化技術に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio signal encoding and decoding apparatus, and more particularly to a PCM (Pulse Code Modulati).
on) signal is converted to an MDCT (Modified Dis
The present invention relates to a high-efficiency encoding technique that performs orthogonal transformation by a crete cosine transform (Crete Cosine Transform) method and separates an amplitude component and a phase component into vector quantization, and a decoding technology of the encoded data.

【０００２】[0002]

【従来の技術】従来から、音響信号の高能率符号化に関
しては、ＭＰＥＧ(Moving Picture Experts Group)-Aud
io等に代表されるように、時間領域のサンプル信号を周
波数領域に写像し、周波数領域でのエネルギの偏在に係
るビット割当てをサンプル毎に人間の心理聴覚特性を利
用して量子化歪を制御することによって行われている。
このような符号化方式は所謂スカラー量子化の手法に基
づくものであるが、平均１(bit/sample)を下回るような
低ビットレートでの符号化では極端に量子化歪が大きく
なる。2. Description of the Related Art Conventionally, high efficiency coding of audio signals has been performed by MPEG (Moving Picture Experts Group) -Aud.
As represented by io, etc., the sample signal in the time domain is mapped to the frequency domain, and the bit allocation related to the uneven distribution of energy in the frequency domain is controlled for the quantization distortion by using the human psychoacoustic characteristics of each sample. Is done by doing.
Such an encoding method is based on a so-called scalar quantization method. However, in encoding at a low bit rate below an average of 1 (bit / sample), quantization distortion becomes extremely large.

【０００３】そこで、いくつかのサンプルをまとめて量
子化するベクトル量子化の導入が考えられる。ここに、
「ベクトル量子化」とは、従来から動画像符号化方式に適
用されてきたものであり、被写体を表現する各種のベク
トルデータのパターンをインデックスデータと対応付け
てコードブックに格納しておき、動画像データの伝送に
おいてそのインデックスデータのみを伝送するものであ
る。そして、そのベクトル量子化を音響信号に適用する
方式としては、信号波形そのものをベクトル量子化する
方式や、線形予測残差信号をベクトル量子化する方式
や、直交変換係数をベクトル量子化する方式が試されて
いる。しかし、信号波形そのものや線形予測残差信号に
ベクトル量子化を適用する方式は、変化が激しく時系列
的相関が弱いという特性を有した広帯域(オーディオ帯
域)音響信号には適していない。一方、直交変換係数を
ベクトル量子化する方式については周波数領域でのエネ
ルギの偏在性を利用でき、前記の広帯域音響信号の高能
率符号化に適している。Therefore, it is conceivable to introduce vector quantization in which several samples are quantized collectively. here,
"Vector quantization" has been conventionally applied to a moving picture coding method. A pattern of various types of vector data representing a subject is stored in a code book in association with index data, and In transmitting image data, only the index data is transmitted. As a method of applying the vector quantization to an audio signal, a method of vector-quantizing a signal waveform itself, a method of vector-quantizing a linear prediction residual signal, and a method of vector-quantizing an orthogonal transform coefficient are used. Have been tried. However, the method of applying vector quantization to the signal waveform itself or the linear prediction residual signal is not suitable for a wideband (audio band) audio signal having a characteristic that changes greatly and time-series correlation is weak. On the other hand, the method of vector quantizing the orthogonal transform coefficients can utilize the uneven distribution of energy in the frequency domain, and is suitable for the above-described high-efficiency encoding of a wideband acoustic signal.

【０００４】従って、直交変換とベクトル量子化を組み
合わせた音響信号の符号化方式については「オーディオ
信号の変換符号化への変換係数のＶＱの導入」(熊田他,
日本音響学会講演論文集,93/10,p.307)等の技術が開示
されている。図８はその符号化装置の構成例を示す。同
図においては、先ず、入力されるＰＣＭデータに対して
ＭＤＣＴ器51でＭＤＣＴ方式による直交変換がなされ、
そのＭＤＣＴ係数を周波数方向に沿って４次元ベクトル
化器52で４個の係数データ毎に４次元ベクトルデータに
変換する。次に、正規化演算器53では、各ベクトルデー
タのｒｍｓ(二乗平均平方根)値を求め、そのｒｍｓ値を
対数量子化すると共に、その対数量子化されたｒｍｓ値
に応じて各ベクトルデータに量子化ビットを割り当て、
各ベクトルデータをｒｍｓ値で正規化する。一方、コー
ドブック54に対しては、音響信号の学習サンプルに前記
と同様のＭＤＣＴ直交変換とＭＤＣＴ係数の４次元ベク
トル化と正規化処理を施したベクトルデータに対応付け
てインデックスデータが登録されている。そこで、正規
化されたベクトルデータを検索キーとしてベクトル量子
化器55がコードブック54を検索し、対応したインデック
スデータが読出される。また、割り当てられたビット数
に余分があれば、それと４次元ベクトル化器52から得ら
れる元のＤＣＴ係数データを減算器56へ入力させること
で量子化誤差を求め、更にスカラー量子化器57でその量
子化誤差をスカラー量子化する。そして、マルチプレク
サ58で前記のｒｍｓ値とインデックスデータと量子化誤
差データを多重化したビットストリームを構成して伝送
路へ出力させる。[0004] Therefore, as for an audio signal coding method combining orthogonal transform and vector quantization, "Introduction of VQ of transform coefficient to transform coding of audio signal" (Kumada et al.,
Techniques such as the Acoustical Society of Japan, 93/10, p.307) are disclosed. FIG. 8 shows a configuration example of the encoding apparatus. In the figure, first, orthogonal transformation by the MDCT method is performed by the MDCT unit 51 on the input PCM data.
The MDCT coefficients are converted into four-dimensional vector data for each of the four coefficient data by a four-dimensional vectorizer 52 along the frequency direction. Next, the normalization calculator 53 obtains an rms (root mean square) value of each vector data, log-quantizes the rms value, and quantizes the rms value into each vector data according to the log-quantized rms value. All the bits
Each vector data is normalized by the rms value. On the other hand, in the codebook 54, index data is registered in association with vector data obtained by performing the same MDCT orthogonal transform, MDCT coefficient four-dimensional vectorization and normalization processing on the learning sample of the audio signal as described above. I have. Therefore, the vector quantizer 55 searches the code book 54 using the normalized vector data as a search key, and the corresponding index data is read. If there is an excess in the number of allocated bits, a quantization error is obtained by inputting the number of bits and the original DCT coefficient data obtained from the four-dimensional vectorizer 52 to a subtractor 56. The quantization error is scalar-quantized. The multiplexer 58 forms a multiplexed bit stream of the rms value, the index data, and the quantization error data, and outputs the multiplexed bit stream to the transmission path.

【０００５】[0005]

【発明が解決しようとする課題】ところで、上記の論文
で報告されている内容によると、図８の符号化装置での
ＳＮ比は、９０〜２０８ｋbpsの範囲において従来のス
カラー量子化方式と比較して約２〜３dB程度の改善が見
られて良好な結果が得られているが、９０ｋbpsでは一
般的なスカラー量子化方式の場合との差が小さくなって
きており、その傾向から見て更に低いビットレートへの
適用は困難になると予測される。According to the contents reported in the above-mentioned paper, the S / N ratio of the encoding device shown in FIG. 8 is in the range of 90 to 208 kbps compared with the conventional scalar quantization system. About 2 to 3 dB, and good results have been obtained. However, at 90 kbps, the difference from the general scalar quantization method has become smaller, and the tendency is even lower. Application to bit rates is expected to be difficult.

【０００６】そして、その原因は次のように考察され
る。先ず、本質的にベクトル量子化はデータの統計的分
布の偏りを利用する手法である。上記の符号化装置で
は、コードブック54に図５に示すようなＭＤＣＴ係数ｘ
(n)を周波数方向について４個単位で分割して４次元ベ
クトルデータに変換した後にｒｍｓ値で正規化したデー
タを格納させているが、ＭＤＣＴ係数ｘ(n)は信号の位
相の変化によって隣合った係数の符号や大きさが大きく
変化してしまい、そのためにベクトル量子化による効率
に限界が生じている。即ち、上記の符号化装置の方式を
９０ｋbpsより低いビットレートに適用するためには、
より統計的分布に偏りがあるような性質のベクトルを用
いる必要がある。[0006] The cause is considered as follows. First, essentially, vector quantization is a technique that utilizes the bias of the statistical distribution of data. In the above encoding device, the MDCT coefficient x as shown in FIG.
(n) is divided into four units in the frequency direction, converted into four-dimensional vector data, and then stored with data normalized by the rms value. However, the MDCT coefficient x (n) is adjacent due to a change in the phase of the signal. Signs and magnitudes of the combined coefficients greatly change, which limits the efficiency of vector quantization. That is, in order to apply the method of the above encoding apparatus to a bit rate lower than 90 kbps,
It is necessary to use a vector having such a property that the statistical distribution is more biased.

【０００７】一方、図５のＭＤＣＴ係数ｘ(n)から振幅
成分データと位相成分データを求めるとそれぞれ図６及
び図７のようになり、特に振幅成分データの統計的分布
の偏りが大きいことが理解される。そこで、本発明は、
ＭＤＣＴ係数ｘ(n)そのものをベクトル量子化するので
はなく、ＭＤＣＴ係数ｘ(n)から求められる振幅成分デ
ータと位相成分データをベクトル量子化するという特殊
な手法を採用し、全体としてより高能率な符号化を可能
にする符号化装置とそれに対応した復号化装置を提供す
ることを目的として創作された。On the other hand, when the amplitude component data and the phase component data are obtained from the MDCT coefficient x (n) in FIG. 5, the results are as shown in FIGS. 6 and 7, respectively. In particular, the bias of the statistical distribution of the amplitude component data is large. Understood. Therefore, the present invention
Instead of vector quantizing the MDCT coefficient x (n) itself, a special method of vector quantizing the amplitude component data and the phase component data obtained from the MDCT coefficient x (n) is adopted, and the overall efficiency is improved. It has been created for the purpose of providing an encoding device that enables efficient encoding and a decoding device corresponding thereto.

【０００８】[0008]

【課題を解決するための手段】第１の発明は、ＰＣＭ方
式で変調された音響信号をＭＤＣＴ方式で直交変換し、
その変換によって得られるＭＤＣＴ係数を符号化する音
響信号の符号化装置において、前記ＭＤＣＴ係数を振幅
成分データと位相成分データに分離するデータ分離手段
と、前記データ分離手段が分離した各データをベクトル
データへ変換するベクトル化手段と、音響信号の学習サ
ンプルから得られるＭＤＣＴ係数に対して前記のデータ
分離手段とベクトル化手段によるデータ処理と実質的に
同一のデータ処理を施して求められた振幅成分と位相成
分に係る各ベクトルデータに対応させて相互に異なるイ
ンデックスデータを格納せしめたコードブックと、前記
ベクトル化手段が求めた振幅成分と位相成分に係る各ベ
クトルデータを用いて前記コードブックを検索し、その
各ベクトルデータと同一乃至近似した前記コードブック
のベクトルデータに対応付けられている振幅成分と位相
成分に係るインデックスデータを読出すベクトル量子化
手段を具備し、ベクトル量子化手段が読出したインデッ
クスデータを伝送路へ出力することを特徴とした音響信
号の符号化装置に係る。According to a first aspect of the present invention, an acoustic signal modulated by a PCM method is orthogonally transformed by an MDCT method,
In an audio signal encoding apparatus for encoding MDCT coefficients obtained by the conversion, data separating means for separating the MDCT coefficients into amplitude component data and phase component data, and each data separated by the data separating means as vector data. Vectorization means for converting the MDCT coefficients obtained from the training sample of the acoustic signal into amplitude components obtained by performing substantially the same data processing as the data processing by the data separation means and the vectorization means. A code book in which mutually different index data is stored in correspondence with each vector data related to the phase component, and the code book is searched using each vector data related to the amplitude component and the phase component obtained by the vectorization unit. , The vector data of the code book which is the same as or approximate to the respective vector data Encoding a sound signal, comprising: vector quantization means for reading index data relating to the associated amplitude component and phase component; and outputting the index data read by the vector quantization means to a transmission path. Related to the device.

【０００９】この発明では、データ分離手段によってＭ
ＤＣＴ係数を振幅成分データと位相成分データに分離
し、分離された各データをベクトル化手段でベクトルデ
ータへ変換する。コードブックは、音響信号の学習サン
プルに対してこの発明に係るＭＤＣＴ・データ分離・ベク
トル化と実質的に同一の処理を施して得られた振幅成分
と位相成分に係る各ベクトルデータに対応させて相互に
異なるインデックスデータがテーブル化されている。従
って、ベクトル量子化手段がベクトル化手段で変換され
た振幅成分と位相成分に係る各ベクトルデータを検索キ
ーとしてコードブックを参照し、対応するインデックス
データを読出して伝送するようにすれば伝送効率の大幅
な向上が図れ、また伝送路上のストレージメディアの容
量節減も図れる。According to the present invention, M
The DCT coefficient is separated into amplitude component data and phase component data, and each separated data is converted into vector data by vectorization means. The codebook corresponds to each vector data relating to the amplitude component and the phase component obtained by performing substantially the same processing as the MDCT / data separation / vectorization according to the present invention on the learning sample of the acoustic signal. Different index data are tabulated. Therefore, if the vector quantization means refers to the codebook as a search key using each vector data related to the amplitude component and the phase component converted by the vectorization means, and reads and transmits the corresponding index data, the transmission efficiency can be improved. A significant improvement can be achieved, and the capacity of the storage medium on the transmission path can be reduced.

【００１０】ところで、上記のように図５と図６の比較
から明らかなように、振幅成分データａ(m)の分布状態
の統計的な偏りはＭＤＣＴ係数ｘ(n)のそれよりも遥か
に大きい。従って、コードブックに格納させておく振幅
成分に係るベクトルデータは、大きな変換次元数のもの
で且つ少ないデータ個数分だけ用意しておけば足りる。
一方、位相成分データθ(m)については、図７に示すよ
うにその統計的分布の偏りが小さいため、振幅成分デー
タの場合に比べて小さい変換次元数のベクトルデータを
数多く用意しておかねばならない。しかし、同一のメモ
リ容量のコードブックであっても、前記のとおり振幅成
分に係るベクトルデータのデータ量が少なくなる分を位
相成分に係るベクトルデータに割り当てることができ
る。従って、結果的にＭＤＣＴ係数ｘ(n)をそのままベ
クトル量子化してコードブックに格納する場合と比較し
て、ビット量の有効な配分が可能になり、コードブック
の小型化とサーチ時の演算量の低減を実現できる。As is apparent from the comparison between FIG. 5 and FIG. 6, the statistical deviation of the distribution state of the amplitude component data a (m) is far greater than that of the MDCT coefficient x (n). large. Therefore, it is sufficient that the vector data relating to the amplitude component stored in the code book has a large number of conversion dimensions and a small number of data.
On the other hand, for the phase component data θ (m), since the bias of the statistical distribution is small as shown in FIG. 7, it is necessary to prepare a large number of vector data having a smaller conversion dimension than the amplitude component data. No. However, even if the codebooks have the same memory capacity, as described above, a smaller amount of vector data relating to the amplitude component can be allocated to the vector data relating to the phase component. Therefore, as compared with the case where the MDCT coefficient x (n) is vector-quantized as it is and stored in the codebook, the effective allocation of the bit amount becomes possible. Can be reduced.

【００１１】第２の発明は、第１の発明の符号化装置に
対応した復号化装置であって、その符号化装置が具備し
たコードブックの格納データと同一データが格納せしめ
られたコードブックと、入力された振幅成分と位相成分
に係るインデックスデータを用いて前記コードブックを
検索し、その各インデックスデータに合致した振幅成分
と位相成分に係るベクトルデータを読出すベクトル逆量
子化手段と、前記ベクトル逆量子化手段が読出した各ベ
クトルデータを合成してＭＤＣＴ係数に係る振幅成分デ
ータと位相成分データを作成するベクトル合成手段と、
前記ベクトル合成手段が求めた振幅成分データと位相成
分データを合成してＭＤＣＴ係数を求めるデータ合成手
段と、前記データ合成手段が求めたＭＤＣＴ係数に基づ
いてＰＣＭ方式で変調された音響信号を再生する逆ＭＤ
ＣＴ手段を具備したことを特徴とする音響信号の復号化
装置に係る。According to a second aspect of the present invention, there is provided a decoding apparatus corresponding to the encoding apparatus according to the first aspect of the present invention, wherein the codebook stores the same data as data stored in a codebook provided in the encoding apparatus. Vector inverse quantization means for searching the code book using the input index data related to the amplitude component and the phase component, and reading out vector data related to the amplitude component and the phase component corresponding to each index data; Vector synthesizing means for synthesizing each vector data read by the vector dequantizing means to create amplitude component data and phase component data relating to MDCT coefficients;
A data synthesizing unit for synthesizing the amplitude component data and the phase component data obtained by the vector synthesizing unit to obtain an MDCT coefficient; and reproducing a sound signal modulated by the PCM method based on the MDCT coefficient obtained by the data synthesizing unit. Reverse MD
The present invention relates to an audio signal decoding device including CT means.

【００１２】この復号化装置のコードブックが、第１の
発明のコードブックの格納データと同一のデータ内容を
有していなければならないことは当然であり、インデッ
クスデータを検索キーとして検索される点が異なるだけ
である。また、ベクトル逆量子化手段が振幅成分と位相
成分に係るベクトルデータを読出した後のベクトル合成
手段とデータ合成手段と逆ＭＤＣＴ手段による一連のデ
ータ処理は、第１の発明によるＭＤＣＴ方式で直交変換
してデータ分離手段とベクトル化手段が実行する処理の
逆処理に相当する。従って、上記の特質を有したコード
ブックを符号化装置と復号化装置が有していることによ
り、この復号化装置は低いビットレートでもＳＮ比の高
い音響信号の再生が可能になる。It is natural that the codebook of this decoding device must have the same data content as the data stored in the codebook of the first invention, and the index book is searched using the index data as a search key. Only differ. A series of data processing by the vector synthesizing unit, the data synthesizing unit, and the inverse MDCT unit after the vector dequantizing unit reads out the vector data related to the amplitude component and the phase component is performed by the orthogonal transform by the MDCT method according to the first invention. This corresponds to the reverse processing of the processing executed by the data separating means and the vectorizing means. Therefore, since the encoding apparatus and the decoding apparatus have the codebook having the above characteristics, the decoding apparatus can reproduce an audio signal having a high SN ratio even at a low bit rate.

【００１３】[0013]

【発明の実施の形態】以下、本発明の「音響信号の符号
化装置及び復号化装置」の実施形態を図１から図４を用
いて詳細に説明する。先ず、図１は符号化装置の機能ブ
ロック回路図を示す。同図において、1は入力されるＰ
ＣＭ信号に対してＭＤＣＴ方式で直交変換を施すＭＤＣ
Ｔ器、2はＭＤＣＴ器1が求めたＭＤＣＴ係数から振幅成
分データａ(m)と位相成分データθ(m)を求めてそれらを
分離出力する振幅・位相分離器、3は振幅成分データａ
(m)と位相成分データθ(m)を帯域毎に分割し、両データ
ａ(m),θ(m)を各帯域毎に変換次元数を異ならせてベク
トルデータａv(m),θv(m)へ変換する帯域分割・ベクトル
化器、4は振幅成分に係るベクトルデータａv(m)を係数
ａ(k)で正規化する正規化演算器、5は振幅成分用のコー
ドブック、6は位相成分用のコードブック、7は正規化さ
れた振幅成分に係るベクトルデータａvn(m)を検索キー
として振幅成分用のコードブック5を検索してインデッ
クスデータＩＤaを読出すベクトル量子化器、8は位相成
分に係るベクトルデータθv(m)を検索キーとして位相成
分用のコードブック5を検索してインデックスデータＩ
Ｄpを読出すベクトル量子化器、9は正規化に用いた係数
ａ(k)とインデックスデータＩＤa,ＩＤpを多重化したビ
ット列を構成して伝送路へ出力させるマルチプレクサで
ある。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram showing an embodiment of an audio signal encoding and decoding apparatus according to the present invention; FIG. First, FIG. 1 shows a functional block circuit diagram of an encoding device. In the figure, 1 is the input P
MDC that performs orthogonal transform on CM signal by MDCT method
T unit, 2 is an amplitude / phase separator for obtaining amplitude component data a (m) and phase component data θ (m) from the MDCT coefficients obtained by MDCT unit 1 and separating and outputting them. 3 is amplitude component data a.
(m) and the phase component data θ (m) are divided for each band, and the two data a (m) and θ (m) are changed in the number of conversion dimensions for each band to obtain vector data av (m) and θv ( m), 4 is a normalization calculator that normalizes vector data av (m) related to the amplitude component by a coefficient a (k), 5 is a codebook for the amplitude component, and 6 is a codebook for the amplitude component. A codebook for phase component, 7 is a vector quantizer that searches the codebook 5 for amplitude component using the vector data avn (m) relating to the normalized amplitude component as a search key and reads out index data IDa, 8 Searches the codebook 5 for the phase component using the vector data θv (m) related to the phase component as a search key,
A vector quantizer 9 for reading out Dp, and a multiplexer 9 for forming a bit string obtained by multiplexing the coefficient a (k) used for normalization and the index data IDa and IDp and outputting the bit stream to the transmission line.

【００１４】次に、この符号化装置の動作を各機能ブロ
ック要素に係る更に詳細な説明を加えながら説明する。
入力されたＰＣＭ信号はＭＤＣＴ器1によって１フレー
ムずつ切り出され、各フレームに対して直交変換処理が
施されて周波数軸方向についてのＭＤＣＴ係数ｘ(n)が
求められる。振幅・位相分離器2では、求められたＭＤＣ
Ｔ係数ｘ(n)は振幅情報と位相情報を併せ持っているの
で、次の数式１及び数式２を適用して振幅成分データａ
(m)と位相成分データθ(m)を演算し、振幅成分と位相成
分を分離する。Next, the operation of the encoding apparatus will be described with a more detailed description of each functional block element.
The input PCM signal is cut out frame by frame by the MDCT unit 1, and an orthogonal transformation process is performed on each frame to obtain an MDCT coefficient x (n) in the frequency axis direction. In the amplitude / phase separator 2, the obtained MDC
Since the T coefficient x (n) has both amplitude information and phase information, the following equation 1 and equation 2 are applied to the amplitude component data a
(m) and the phase component data θ (m) are calculated to separate the amplitude component and the phase component.

【数１】(Equation 1)

【数２】但し、各数式において、ｎ,ｍはスペクトル番
号で、ｎ＝０,２,４,６,・・・,Ｎ−１、ｍ＝０,１,２,３,
・・・,(Ｎ−１)／２であり、Ｎは１フレーム内のスペクト
ルの数である。Where n and m are spectrum numbers, and n = 0, 2, 4, 6,..., N−1, and m = 0, 1, 2, 3,.
.., (N-1) / 2, where N is the number of spectra in one frame.

【００１５】そして、求められた振幅成分データａ(m)
と位相成分データθ(m)は帯域分割・ベクトル化器3へ入
力される。この帯域分割・ベクトル化器3では、先ず、音
響信号の周波数帯域を例えば０〜２.２５ｋＨzと２.２
５〜４.５ｋＨzと４.５〜９.０ｋＨzと９.０〜１６.０
ｋＨzの４帯域に分割し、振幅成分データａ(m)と位相成
分データθ(m)をそれらの帯域毎に分割する。次に、分
割された各データａ(m),θ(m)を各帯域毎に設定された
変換次元数でベクトルデータａv(m),θv(m)へ変換す
る。即ち、各帯域毎にサンプリング数Ｓを設定し、それ
らサンプリングデータ群毎に変換次元数Ｓのベクトルデ
ータを作成する。Then, the obtained amplitude component data a (m)
And the phase component data θ (m) are input to the band division / vectorization unit 3. In the band splitter / vectorizer 3, first, the frequency band of the audio signal is set to, for example, 0 to 2.25 kHz and 2.2.
5 to 4.5 kHz, 4.5 to 9.0 kHz and 9.0 to 16.0
The frequency band is divided into four bands of kHz and the amplitude component data a (m) and the phase component data θ (m) are divided for each of these bands. Next, each of the divided data a (m) and θ (m) is converted into vector data av (m) and θv (m) using the conversion dimension set for each band. That is, the sampling number S is set for each band, and vector data of the conversion dimension number S is created for each of the sampling data groups.

【００１６】この場合、例えば、前記の帯域分割であれ
ば、０〜２.２５ｋＨzではａv(m)の次元数を６、θv(m)
の次元数を２とし、２.２５〜４.５ｋＨzではａv(m)の
次元数を１２、θv(m)の次元数を４とし、４.５〜９.０
ｋＨzではａv(m)の次元数を２４、θv(m)の次元数を６
とし、９.０〜１６.０ｋＨではａv(m)の次元数を３２、
θv(m)の次元数を２０とするというように、周波数の低
い帯域における変換次元数を周波数の高い帯域における
変換次元数より小さくする。これは、人間の聴覚心理に
おいて周波数が高くなるほど周波数弁別能力が悪くなる
ことを考慮したものであり、音響信号であることの特質
を利用してより有効なデータ量の割付けを行うためであ
る。In this case, for example, in the case of the above-mentioned band division, at 0 to 2.25 kHz, the dimension number of av (m) is 6, θv (m)
, The dimension number of av (m) is 12, and the dimension number of θv (m) is 4 in the range of 2.25 to 4.5 kHz, and 4.5 to 9.0.
In kHz, the number of dimensions of av (m) is 24, and the number of dimensions of θv (m) is 6.
In 9.0 to 16.0 kHz, the dimension number of av (m) is 32,
The conversion dimension in a low frequency band is made smaller than the conversion dimension in a high frequency band such that the dimension of θv (m) is set to 20. This is in consideration of the fact that the higher the frequency is in human auditory psychology, the worse the frequency discrimination ability is, and the more effective data amount is allocated using the characteristic of being an acoustic signal.

【００１７】また、位相成分に係るベクトルデータθv
(m)を作成するに際しては、位相成分データθ(m)の絶対
値ではなく次元内の最初の値と後続データ値の差をベク
トルデータとする。これは、図７に示したように、一般
に位相成分データθ(m)の統計的分布の偏りが小さいた
め、絶対値によるよりも差分データをベクトル化した方
がベクトルデータθv(m)のデータ量が全体として小さく
なるからである。Further, vector data θv relating to the phase component
When creating (m), the difference between the first value in the dimension and the subsequent data value is used as vector data instead of the absolute value of the phase component data θ (m). This is because, as shown in FIG. 7, since the bias of the statistical distribution of the phase component data θ (m) is generally small, it is better to vectorize the difference data than to use the absolute value to obtain the vector data θv (m). This is because the amount becomes smaller as a whole.

【００１８】次に、帯域分割・ベクトル化器3は、作成し
た振幅成分に係るベクトルデータａv(m)を帯域識別デー
タＢと共に正規化演算器4へ転送し、また位相成分に係
るベクトルデータθv(m)と帯域識別データＢをベクトル
量子化器8へ転送する。正規化演算器4では、転送された
ベクトルデータａv(m)を対数化し、そのベクトルデータ
ａv(m)の内の最大パワー値を正規化係数ａ(k)として正
規化されたベクトルデータａvn(m)を求め、それを帯域
識別データＢと共にベクトル量子化器7へ転送する。こ
こで、対数化・正規化処理を行うのは、振幅成分に係る
ベクトルデータａv(m)についてもデータ量の削減を図っ
ておくためである。Next, the band dividing / vectoring unit 3 transfers the created vector data av (m) relating to the amplitude component together with the band identification data B to the normalization computing unit 4, and also generates the vector data θv relating to the phase component. (m) and the band identification data B are transferred to the vector quantizer 8. In the normalization arithmetic unit 4, the transferred vector data av (m) is logarithmized, and the maximum power value in the vector data av (m) is normalized as a normalization coefficient a (k). m) and transfers it to the vector quantizer 7 together with the band identification data B. Here, the logarithmization / normalization processing is performed to reduce the data amount of the vector data av (m) related to the amplitude component.

【００１９】このようにして、ベクトル量子化器7には
帯域識別データＢと正規化された振幅成分に係るベクト
ルデータａvn(m)が、ベクトル量子化器8には帯域識別デ
ータＢと位相成分に係るベクトルデータθv(m)に入力さ
れるが、各ベクトル量子化器7,8はそれぞれ入力データ
[Ｂとａvn(m)],[Ｂとθv(m)]を検索キーとして対応した
コードブック5,6を検索する。Thus, the vector quantizer 7 receives the band identification data B and the vector data avn (m) relating to the normalized amplitude component, and the vector quantizer 8 receives the band identification data B and the phase component. Are input to the vector data θv (m) according to
Using [B and avn (m)] and [B and θv (m)] as search keys, the corresponding codebooks 5 and 6 are searched.

【００２０】そして、この段階で各コードブック5,6の
内容を図２及び図３を用いて説明しておく。両方のコー
ドブック5,6には、音響信号の学習サンプルに対して前
記の《ＭＤＣＴ器1→振幅・位相分離器2→帯域分割・ベク
トル化器3→(振幅成分については正規化演算器4)》の処
理経路と実質的に同一の処理を施して得られるベクトル
データａvn(m),θv(m)のデータパターンが相互に異なる
インデックスデータＩＤa,ＩＤpを対応付けて記憶せし
められている。即ち、音響信号の学習サンプルを用いた
ＬＢＧアルゴリズム等の既知の方法で前記の各データａ
vn(m),θv(m)の全ゆるデータパターンを登録させてあ
る。但し、本実施形態では上記のように帯域分割・ベク
トル化器3の処理段階で帯域分割すると共に、各帯域毎
にベクトルデータへの変換次元数を異ならせている。従
って、振幅成分用のコードブック5についてみれば、図
１及び図２の(Ｂ)〜(Ｅ)に示すように、コードブック5
の中を帯域別コードブック[A.C.B;1],[A.C.B;2],[A.C.
B;3],[A.C.B;4]に４分割し、それぞれに学習サンプルを
処理・正規化して得られた０〜２.２５ｋＨzと２.２５〜
４.５ｋＨzと４.５〜９.０ｋＨzと９.０〜１６.０ｋＨz
におけるベクトルデータａvn(m)のデータパターンを登
録してある。尚、図２の(Ｂ)〜(Ｅ)では理解し易くする
ために図解的パターンで示してある。そして、本実施形
態の前記実例に対応したベクトル化方式にしたがって、
各ベクトルデータのパターンの変換次元数を０〜２.２
５ｋＨzでは６に、２.２５〜４.５ｋＨzでは１２に、
４.５〜９.０ｋＨzでは２４に、９.０〜１６.０ｋＨzで
は３２にとってあり、各帯域について１０２４個のデー
タパターンが設けてある。それらのパターンを、図２の
(Ａ)に示されるＭＤＣＴ方式での直交変換後の周波数-
振幅対数パワーのグラフに対応付けると、微小周波数帯
域Δｆ内におけるベクトルデータａvn(m)のパターンを
示していることになり、高い周波数帯域になるほど微小
周波数帯域Δｆを大きくとって変換次元数を大きくした
データパターンになっている。At this stage, the contents of the codebooks 5 and 6 will be described with reference to FIGS. Both codebooks 5 and 6 contain the above-mentioned “MDCT unit 1 → amplitude / phase separator 2 → band division / vectorizer 3 → (normalization calculator 4 ) >>, the data patterns of vector data avn (m) and θv (m) obtained by performing substantially the same processing are stored in association with mutually different index data IDa and IDp. That is, each of the data a is obtained by a known method such as an LBG algorithm using a learning sample of an acoustic signal.
All data patterns of vn (m) and θv (m) are registered. However, in the present embodiment, band division is performed in the processing stage of the band division / vectorization unit 3 as described above, and the number of dimensions to be converted into vector data is changed for each band. Therefore, regarding the codebook 5 for the amplitude component, as shown in FIGS.
Inside the codebook [ACB; 1], [ACB; 2], [AC
B; 3] and [ACB; 4], each of which is obtained by processing and normalizing a learning sample, from 0 to 2.25 kHz and 2.25 to
4.5 kHz, 4.5 to 9.0 kHz, and 9.0 to 16.0 kHz
The data pattern of the vector data avn (m) is registered. 2 (B) to 2 (E) are shown in an illustrative pattern for easy understanding. Then, according to the vectorization method corresponding to the example of the present embodiment,
The number of conversion dimensions of each vector data pattern is 0 to 2.2
To 5 at 5 kHz, to 12 at 2.25 to 4.5 kHz
In the case of 4.5 to 9.0 kHz, it is set to 24, and in the case of 9.0 to 16.0 kHz, it is set to 32, and 1024 data patterns are provided for each band. These patterns are shown in FIG.
Frequency after orthogonal transform by MDCT method shown in (A)-
When associated with the graph of the amplitude logarithmic power, the pattern of the vector data avn (m) in the minute frequency band Δf is shown. The higher the frequency band, the larger the minute frequency band Δf and the larger the number of conversion dimensions. It is a data pattern.

【００２１】また、位相成分用のコードブック6につい
ても、図１及び図３の(Ｂ)〜(Ｅ)に示すように、前記の
場合と同様にその中を帯域別コードブック[P.C.B;1],
[P.C.B;2],[P.C.B;3],[P.C.B;4]に４分割し、それぞれ
に学習サンプルをベクトル化処理して得られた０〜２.
２５ｋＨzと２.２５〜４.５ｋＨzと４.５〜９.０ｋＨz
と９.０〜１６.０ｋＨzにおけるベクトルデータθv(m)
のデータパターンを登録してある。また、本実施形態の
ベクトル化方式に対応させて、高い周波数帯域の帯域別
コードブックに対して低い周波数帯域の帯域別コードブ
ックよりも変換次元数の大きいベクトルデータθv(m)の
データパターンを登録させるようにしていることも同様
であり、０〜２.２５ｋＨzでは２に、２.２５〜４.５ｋ
Ｈzでは４に、４.５〜９.０ｋＨzでは６に、９.０〜１
６.０ｋＨzでは２０にとってある。ところで、位相成分
用のコードブック6内におけるベクトルデータθv(m)の
データパターンの変換次元数は全帯域を通じて振幅成分
用のコードブック5の場合と比較して小さくなっている
が、その代わりに各帯域別コードブック[P.C.B;1],[P.
C.B;2],[P.C.B;3],[P.C.B;4]におけるベクトルデータθ
v(m)の総数が２０４８個で、振幅成分用のコードブック
5の場合の２倍になっている。これは、図３の(Ａ)に示
されるＭＤＣＴ方式での直交変換後の周波数-位相値の
グラフから明らかなように、位相の統計的分布の偏りが
小さいため、より多くのパターンで対応させなければ位
相の変化態様を適切に表現できず、ベクトル量子化段階
でのエラー率が高くなるからである。As shown in FIGS. 1 and 3 (B)-(E), the codebook 6 for the phase component also includes a codebook [PCB; ],
[PCB; 2], [PCB; 3], and [PCB; 4] are divided into four parts.
25 kHz, 2.25 to 4.5 kHz and 4.5 to 9.0 kHz
And vector data θv (m) at 9.0 to 16.0 kHz
Has been registered. Further, corresponding to the vectorization method of the present embodiment, the data pattern of the vector data θv (m) having a larger conversion dimension than the band-based codebook of the lower frequency band for the band-based codebook of the higher frequency band is used. The same is true for the registration. For 0 to 2.25 kHz, the value is changed to 2, 2.25 to 4.5 kHz.
At 4 Hz, at 4.5-9.0 kHz, 6 and 9.0-1
At 6.0 kHz, it is 20. By the way, the conversion dimension of the data pattern of the vector data θv (m) in the codebook 6 for the phase component is smaller than that in the codebook 5 for the amplitude component throughout the entire band. Codebook for each band [PCB; 1], [P.
Vector data θ in CB; 2], [PCB; 3], [PCB; 4]
Codebook for amplitude component with 2048 total v (m)
It is twice as large as in the case of 5. This is because the bias of the statistical distribution of the phase is small, as is clear from the graph of the frequency-phase value after the orthogonal transform by the MDCT method shown in FIG. Otherwise, the phase change mode cannot be appropriately expressed, and the error rate in the vector quantization stage increases.

【００２２】図１に戻って、各ベクトル量子化器7,8は
以上に説明した各コードブック5,6をそれぞれ検索する
が、その際には先ず帯域識別データＢを用いて各帯域別
コードブック[A.C.B;1]〜[A.C.B;4]と[P.C.B;1]〜[P.C.
B;4]の何れを検索対象とするかを選択する。そして、各
ベクトル量子化器7,8は、それぞれ前記帯域に含まれて
いた振幅成分に係るベクトルデータａvn(m)と位相成分
に係るベクトルデータθv(m)を検索キーとして、帯域識
別データＢで選択した帯域別コードブック[A.C.B;i]と
[P.C.B;i]を検索し、各ベクトルデータａvn(m)とθv(m)
と同一又は最も近似した登録データに対応付けられてい
るインデックスデータＩＤaとＩＤpを読出す。Returning to FIG. 1, the vector quantizers 7 and 8 respectively search the codebooks 5 and 6 described above. Books [ACB; 1] ~ [ACB; 4] and [PCB; 1] ~ [PC
B; 4] is selected. Each of the vector quantizers 7 and 8 uses the vector data avn (m) related to the amplitude component and the vector data θv (m) related to the phase component included in the band as search keys, and Codebook [ACB; i] by band selected in
[PCB; i] is searched and each vector data avn (m) and θv (m)
The index data IDa and IDp associated with the same or the closest registered data are read out.

【００２３】その後、正規化演算器4が用いた正規化係
数ａ(k)と各ベクトル量子化器7,8で読出したインデック
スデータＩＤa,ＩＤpがマルチプレクサ9へ転送され、マ
ルチプレクサ9がそれらを多重化したビット列を伝送路
へ出力する。本実施形態の符号化装置では、上記のよう
にＭＤＣＴ係数Ｘ(n)を振幅成分データａ(m)と位相成分
データθ(m)に分離した後の帯域分割・ベクトル化器3に
よる帯域毎のベクトル化の変換次元数の割当てや位相成
分データθ(m)の差分によるベクトル化、及び振幅成分
に係るベクトルデータａv(m)の正規化等により、小さい
データサイズでのコードブック5,6を構成させることが
できる。また、そのコードブック5,6においては、振幅
成分に係るベクトルデータａvn(m)の変換次元数が位相
成分に係るベクトルデータθv(m)の変換次元数より大き
くなっており、データ伝送の観点から見ると、統計的分
布の偏りが大きいために比較的粗いパターンでの変換で
足りる振幅情報側のデータ量を少なくし、逆に統計的分
布の偏りが小さく細かいパターンでの変換が必要になる
位相情報側のデータ量を多くすることができ、合理的で
高効率な符号化が実現できる。Thereafter, the normalization coefficient a (k) used by the normalization operation unit 4 and the index data IDa and IDp read by the vector quantizers 7 and 8 are transferred to the multiplexer 9, and the multiplexer 9 multiplexes them. And outputs the converted bit string to the transmission path. In the encoding device according to the present embodiment, the band division / vectorization unit 3 separates the MDCT coefficient X (n) into the amplitude component data a (m) and the phase component data θ (m) as described above. Codebooks 5 and 6 with a small data size by allocating the number of conversion dimensions of vectorization, vectorizing by the difference of phase component data θ (m), and normalizing vector data av (m) related to the amplitude component. Can be configured. In the codebooks 5 and 6, the conversion dimension of the vector data avn (m) related to the amplitude component is larger than the conversion dimension of the vector data θv (m) related to the phase component. From the viewpoint, it is necessary to reduce the amount of data on the amplitude information side that is sufficient for conversion with a relatively coarse pattern because the bias of the statistical distribution is large, and conversely, it is necessary to convert with a fine pattern where the bias of the statistical distribution is small The data amount on the phase information side can be increased, and rational and highly efficient encoding can be realized.

【００２４】次に、図４は上記の符号化装置に対応した
復号化装置の機能ブロック回路図を示す。同図におい
て、11は伝送路から受信した正規化係数ａ(k)とインデ
ックスデータＩＤa,ＩＤpを個別に分離するデマルチプ
レクサ、12は振幅成分用のコードブック、13は位相成分
用のコードブック、14はインデックスデータＩＤaを検
索キーとして振幅成分用のコードブック12を検索して正
規化された振幅成分に係るベクトルデータａvn(m)と帯
域識別データＢを読出すベクトル逆量子化器、15はイン
デックスデータＩＤpを検索キーとして位相成分用のコ
ードブック13を検索して位相成分に係るベクトルデータ
θv(m)と帯域識別データＢを読出すベクトル逆量子化
器、16は正規化係数ａ(k)を用いてベクトルデータａvn
(m)を逆正規化する逆正規化演算器、17は逆正規化され
た振幅成分に係るベクトルデータａv(m)と位相成分に係
るベクトルデータθv(m)と帯域識別データＢを用いて各
ベクトルデータａv(m),θv(m)を対応した帯域に配置さ
せると共に、そのベクトルデータａv(m),θv(m)を合成
して振幅成分データａ(m)と位相成分データθv(m)を求
める帯域合成・ベクトル合成器、18は両成分データａ
(m),θ(m)を用いてＭＤＣＴ係数ｘ(n)を求める振幅・位
相合成器、19はＭＤＣＴ係数ｘ(n)に対して逆ＭＤＣＴ
処理を施してＰＣＭ信号を再生させるＩＭＤＣＴ(Inver
s ＭＤＣＴ)器である。Next, FIG. 4 shows a functional block circuit diagram of a decoding device corresponding to the above-mentioned coding device. In the figure, reference numeral 11 denotes a demultiplexer for individually separating a normalization coefficient a (k) and index data IDa and IDp received from a transmission path, 12 a codebook for an amplitude component, 13 a codebook for a phase component, Reference numeral 14 denotes a vector dequantizer for searching the amplitude component codebook 12 using the index data IDa as a search key and reading out the vector data avn (m) and the band identification data B relating to the normalized amplitude component. A vector inverse quantizer that searches the phase component codebook 13 using the index data IDp as a search key and reads out the vector data θv (m) and the band identification data B relating to the phase component, and 16 is a normalization coefficient a (k ) Using vector data avn
(m) is inversely normalized by using an inverse normalization calculator 17 using vector data av (m) related to the inversely normalized amplitude component, vector data θv (m) related to the phase component, and band identification data B. Each of the vector data av (m) and θv (m) is arranged in the corresponding band, and the vector data av (m) and θv (m) are synthesized to combine the amplitude component data a (m) and the phase component data θv ( m) is a band combining / vector combining unit, and 18 is both component data a
(m), an amplitude / phase synthesizer that obtains an MDCT coefficient x (n) using θ (m). 19 is an inverse MDCT for the MDCT coefficient x (n).
IMDCT (Inver
s MDCT) device.

【００２５】この復号化装置の各機能ブロックは、デー
タの変換や処理の流れに関してみると上記の符号化装置
の各機能ブロックに逆の関係で対応しており、それぞれ
が逆の変換や処理を実行するものであることから、ここ
ではその動作に関する詳細な説明を省略するが、復号化
装置として次のような特徴を有している。Each functional block of the decoding device corresponds to each functional block of the above-described encoding device in an inverse relationship with respect to the flow of data conversion and processing, and performs each of the inverse conversion and processing. Since the operation is executed, a detailed description of the operation is omitted here, but the decoding device has the following features.

【００２６】先ず、コードブック12,13がそれぞれ符号
化装置側のコードブック5,6と同一のデータ内容を有し
ており、ベクトル逆量子化器14,15が、伝送路側からデ
マルチプレクサ11を介して受信したインデックスデータ
ＩＤa,ＩＤpを検索キーとして、各コードブック12,13内
でその検索キーに対応付けられたベクトルデータａvn
(m),θv(m)を読出す。この場合に、伝送路側からは帯域
識別データＢが伝送されてこないが、各インデックスデ
ータＩＤa,ＩＤpは個々に異なるコードで与えられてい
るため、コードブック12,13の検索でヒットしたインデ
ックスデータＩＤa,ＩＤpのアドレスに基づいて帯域識
別データＢを検出することができる。First, the codebooks 12 and 13 have the same data contents as the codebooks 5 and 6 on the encoder side, and the vector dequantizers 14 and 15 transmit the demultiplexer 11 from the transmission path side. Using the index data IDa and IDp received via this as search keys, the vector data avn associated with the search keys in each of the codebooks 12 and 13 is used.
(m) and θv (m) are read. In this case, although the band identification data B is not transmitted from the transmission path side, since each index data IDa, IDp is given with a different code individually, the index data IDa hit in the search of the code books 12, 13 is obtained. , Band identification data B can be detected based on the address of IDp.

【００２７】また、振幅・位相合成器18は、帯域合成・ベ
クトル合成器18が作成した振幅成分データａ(m)と位相
成分データθ(m)を用いて、次の数式３及び数式４に示
される簡単な三角関数によりＭＤＣＴ係数ｘ(n),ｘ(n+
1)を求める。The amplitude / phase combiner 18 uses the amplitude component data a (m) and the phase component data θ (m) created by the band combiner / vector combiner 18 to obtain the following formulas 3 and 4. The MDCT coefficients x (n) and x (n +
Ask for 1).

【数３】(Equation 3)

【数４】但し、ｎ,ｍはスペクトル番号で、ｎ＝０,２,
４,６,・・・,Ｎ−１、ｍ＝０,１,２,３,・・・,(Ｎ−１)／２
であり、Ｎは１フレーム内のスペクトルの数である。Where n and m are spectrum numbers, and n = 0, 2,
4, 6, ..., N-1, m = 0, 1, 2, 3, ..., (N-1) / 2
And N is the number of spectra in one frame.

【００２８】そして、本実施形態の符号化装置と復号化
装置は、ベクトル量子化方式によって高い伝送効率やス
トレージメディアの容量節減を実現するという基本的効
果を有しているが、コードブック5,6,14,15に対して振
幅成分と位相成分に係る各ベクトルデータａvn(m),θv
(m)を割付ける方式において変換次元数やデータ個数に
ついて工夫を施し、またベクトル化に際してデータ量を
抑制するための手段を適用したことにより、音響データ
を更に効率的且つ的確に伝送し、復号化装置の出力で高
品質な音声を再生させることが可能になる。尚、本実施
形態では符号化装置と復号化装置にそれぞれ正規化演算
器4と逆正規化演算器16を具備させて振幅成分に係るベ
クトルデータａv(m)のデータ量の抑制を図っているが、
それらの機能ブロックがなくても、コードブック5,14の
サイズが少々大きくなるだけで前記の効果は得られる。The encoding apparatus and the decoding apparatus according to the present embodiment have a basic effect of realizing high transmission efficiency and saving of storage medium capacity by the vector quantization method. Vector data avn (m), θv related to amplitude and phase components for 6, 14, and 15
By devising the number of conversion dimensions and the number of data in the method of assigning (m), and applying means to reduce the amount of data during vectorization, audio data can be transmitted more efficiently and accurately, and decoded. It is possible to reproduce high-quality sound at the output of the conversion device. In the present embodiment, the encoding device and the decoding device are provided with a normalization operation unit 4 and a denormalization operation unit 16, respectively, to reduce the data amount of the vector data av (m) related to the amplitude component. But,
Even without these functional blocks, the above effects can be obtained only by slightly increasing the size of the codebooks 5, 14.

【００２９】[0029]

【発明の効果】本発明の「音響信号の符号化装置及び復
号化装置」は、以上の構成を有していることにより、次
のような効果を奏する。請求項１の発明は、ＰＣＭ信号
をＭＤＣＴ方式で直交変換し、その変換によって得られ
るＭＤＣＴ係数を符号化すると共に、ベクトル量子化方
式を適用して高能率な圧縮と伝送効率の向上を図る符号
化装置において、ＭＤＣＴ係数を振幅成分データと位相
成分データに分離してベクトル化した各ベクトルデータ
にインデックスデータを対応させたコードブックを設け
たことにより、位相成分データよりも振幅成分データの
統計的分布の偏りが大きいという音響信号の特性を利用
してより高能率な符号化を可能にする。請求項２の発明
は、データ分離手段でＭＤＣＴ係数から振幅成分データ
と位相成分データを求める際の効率的な演算を可能にす
る。請求項３の発明は、ベクトル化手段でのデータ変換
方式として、変換次元数を高い周波数帯域で大きくし、
聴覚心理特性を考慮した伝送データ量の低減化とコード
ブックの小サイズ化を実現する。請求項４の発明は、位
相成分に係るベクトルデータのデータ量を小さく抑制
し、コードブックの小サイズ化を実現し、また同一容量
のコードブックでより多くのベクトル量子化のためのデ
ータを具備させることを可能にする。請求項５及び請求
項６の発明は、統計的分布の偏りが小さい位相成分デー
タのベクトル化変換次元数を、統計的分布の偏りが大き
い振幅成分データのベクトル化変換次元数より小さく
し、位相成分に係るベクトルデータの個数を多くして伝
送データ量の増大やコードブックの容量の増大を招くこ
となく効率的且つ合理的な符号化を可能にする。請求項
７の発明は、振幅成分に係るベクトルデータを正規化し
てコードブックの小サイズ化を実現し、また同一容量の
コードブックでより多くのベクトル量子化のためのデー
タを具備させることを可能にする。請求項８の発明は、
請求項１乃至請求項６の発明の符号化装置に対応した復
号化装置を実現する。請求項９の発明は、請求項７の発
明に係る符号化装置に対応した復号化装置を実現する。
請求項１０の発明は、請求項８又は請求項９の発明の復
号化装置において、データ合成手段が振幅成分データと
位相データからＭＤＣＴ係数を求める際の効率的な演算
を可能にする。According to the "audio signal encoding apparatus and decoding apparatus" of the present invention, the following effects can be obtained by having the above configuration. A first aspect of the present invention is a code for orthogonally transforming a PCM signal by an MDCT method, encoding an MDCT coefficient obtained by the transform, and applying a vector quantization method to achieve highly efficient compression and improvement of transmission efficiency. A code book in which the MDCT coefficients are separated into amplitude component data and phase component data and the vector data is made to correspond to the index data by the vectorization device. It is possible to perform more efficient encoding by utilizing the characteristic of the acoustic signal that the distribution has a large bias. According to the second aspect of the present invention, it is possible to perform an efficient calculation when obtaining the amplitude component data and the phase component data from the MDCT coefficients by the data separating means. According to a third aspect of the present invention, as the data conversion method in the vectorization means, the number of conversion dimensions is increased in a high frequency band,
A reduction in the amount of transmitted data and a reduction in the size of the codebook are realized in consideration of psychoacoustic characteristics. The invention according to claim 4 suppresses the data amount of the vector data related to the phase component to a small value, realizes a small codebook, and includes more data for vector quantization in a codebook having the same capacity. Make it possible. The invention according to claims 5 and 6 is characterized in that the vectorization conversion dimension of phase component data having a small statistical distribution bias is smaller than the vectorization conversion dimension of amplitude component data having a large statistical distribution bias, and By increasing the number of vector data related to components, efficient and rational encoding can be performed without increasing the amount of transmission data and the codebook capacity. According to the seventh aspect of the present invention, it is possible to reduce the size of the codebook by normalizing the vector data related to the amplitude component, and to provide a codebook having the same capacity with more data for vector quantization. To The invention of claim 8 is
A decoding device corresponding to the encoding device according to the first to sixth aspects of the present invention is realized. According to a ninth aspect of the present invention, a decoding apparatus corresponding to the encoding apparatus according to the seventh aspect of the present invention is realized.
According to a tenth aspect of the present invention, in the decoding apparatus according to the eighth or ninth aspect, the data synthesizing unit enables efficient computation when obtaining the MDCT coefficient from the amplitude component data and the phase data.

[Brief description of the drawings]

【図１】本発明の実施形態に係る音響信号の符号化装置
の機能ブロック回路図である。FIG. 1 is a functional block circuit diagram of an audio signal encoding device according to an embodiment of the present invention.

【図２】振幅成分用のコードブックの登録内容に関する
説明図である。FIG. 2 is an explanatory diagram regarding registered contents of a code book for an amplitude component.

【図３】位相成分用のコードブックの登録内容に関する
説明図である。FIG. 3 is an explanatory diagram relating to registered contents of a code book for a phase component.

【図４】実施形態に係る復号化装置の機能ブロック回路
図である。FIG. 4 is a functional block circuit diagram of the decoding device according to the embodiment.

【図５】ＭＤＣＴ係数の変化態様の実例を示すグラフで
ある。FIG. 5 is a graph showing an actual example of a variation mode of an MDCT coefficient.

【図６】図５のＭＤＣＴ係数に係る振幅成分データの変
化態様を示すグラフである。FIG. 6 is a graph showing how the amplitude component data relating to the MDCT coefficient in FIG. 5 changes.

【図７】図５のＭＤＣＴ係数に係る位相成分データの変
化態様を示すグラフである。FIG. 7 is a graph showing a change aspect of phase component data related to the MDCT coefficient in FIG. 5;

【図８】ベクトル量子化方式を採用した従来の符号化装
置の機能ブロック回路図である。FIG. 8 is a functional block circuit diagram of a conventional encoding device employing a vector quantization method.

[Explanation of symbols]

1…ＭＤＣＴ器、2…振幅・位相分離器(データ分離手
段)、3…帯域分割・ベクトル化器(ベクトル化手段)、4…
正規化演算器(正規化手段)、5,12…振幅成分用のコード
ブック、6,13…位相成分用のコードブック、7,8…ベク
トル量子化器(ベクトル量子化手段)、9…マルチプレク
サ、11…デマルチプレクサ、14,15…ベクトル逆量子化
器(ベクトル逆量子化手段)、16…逆正規化演算器(逆正
規化手段)、17…帯域合成・ベクトル合成器(ベクトル合
成手段)、18…振幅・位相合成器(データ合成手段)、19…
ＩＭＤＣＴ器(逆ＭＤＣＴ手段)。1 ... MDCT unit, 2 ... amplitude / phase separator (data separation unit), 3 ... band splitting / vectorization unit (vectorization unit), 4 ...
Normalization arithmetic unit (normalization means), 5,12 codebook for amplitude component, 6,13 codebook for phase component, 7,8 vector quantizer (vector quantization means), 9 multiplexer , 11 ... demultiplexer, 14, 15 ... vector inverse quantizer (vector inverse quantizer), 16 ... inverse normalizer (inverse normalizer), 17 ... band synthesis / vector synthesizer (vector synthesizer) , 18 ... amplitude / phase synthesizer (data synthesis means), 19 ...
IMDCT unit (inverse MDCT means).

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開昭59−94797（ＪＰ，Ａ) 特開平３−121624（ＪＰ，Ａ) 特開平３−240319（ＪＰ，Ａ) 特開平５−145427（ＪＰ，Ａ) 特開平６−291674（ＪＰ，Ａ) 特開平７−248799（ＪＰ，Ａ) 特開平７−261800（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) H03M 7/30 ──────────────────────────────────────────────────続き Continuation of front page (56) References JP-A-59-94797 (JP, A) JP-A-3-121624 (JP, A) JP-A-3-240319 (JP, A) JP-A-5-947 145427 (JP, A) JP-A-6-291674 (JP, A) JP-A-7-248799 (JP, A) JP-A-7-261800 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) H03M 7/30

Claims

(57) [Claims]

An audio signal modulated by a PCM (Pulse Code Modulation) method is converted to an MDCT (Modified Discrete Cos
In an audio signal encoding apparatus for performing orthogonal transformation by an ine transform) method and encoding MDCT coefficients obtained by the transformation, a data separation unit for separating the MDCT coefficients into amplitude component data and phase component data; Vectoring means for converting each data separated by the means into vector data; and data processing substantially the same as the data processing by the data separating means and the vectorizing means for MDCT coefficients obtained from a learning sample of an acoustic signal. And a code book in which different index data are stored in correspondence with the respective vector data relating to the amplitude component and the phase component obtained by performing the above-described processing, and the respective vector data relating to the amplitude component and the phase component determined by the vectorizing means. Is searched using the codebook, and the same or similar Vector quantization means for reading out index data relating to the amplitude component and the phase component associated with the vector data of the textbook, and outputting the index data read out by the vector quantization means to a transmission path. An audio signal encoding device.

2. The data separating means separates amplitude component data a (m) and phase component data θ (m) using MDCT coefficients x (n) and x (n + 1) adjacent in time series. 2. The audio signal encoding apparatus according to claim 1, wherein the audio signal is obtained by the following equations (1) and (2). (Equation 1) (Equation 2) (However, n and m are spectrum numbers, and n = 0, 2, 4, 6,...
.., N−1, m = 0, 1, 2, 3,..., (N−1) / 2, and N is the number of spectra in one frame. )

3. The method of converting amplitude component data and phase component data into vector data by the vectorizing means, wherein the number of conversion dimensions in a low frequency band is smaller than the number of conversion dimensions in a high frequency band. The audio signal encoding apparatus according to claim 1 or 2.

4. When the vectorization means converts phase component data into vector data, the difference between the first data value and the subsequent data value of the phase component data group to be converted is defined as vector data. The audio signal encoding apparatus according to claim 1, 2 or 3.

5. The code book according to claim 1, wherein the number of vector data related to the phase component of the codebook is larger than the number of vector data related to the amplitude component.
Audio signal encoding device.

6. The number of conversion dimensions of phase component data into vector data by the vectorization means is made smaller than the number of conversion dimensions of amplitude component data into vector data, and the number of vector data related to the phase component of the codebook is reduced by the amplitude component. 6. The audio signal encoding apparatus according to claim 1, wherein the number of vector data is larger than the number of vector data.

7. A vectorization means for converting the amplitude component data into logarithm and converting it into vector data, and using a maximum power value in the vector data as a normalization coefficient to normalize vector data relating to other amplitude components. And a codebook is obtained by performing substantially the same data processing as the data processing by the data separating means, the vectorizing means, and the normalizing means on the MDCT coefficients obtained from the learning sample of the audio signal. The index data different from each other is stored in correspondence with the amplitude component data thus obtained, and the normalization coefficient of the normalization means is output to the transmission path together with the index data read by the vector quantization means. Claim 2, Claim 3, Claim 4,
The audio signal encoding device according to claim 5 or 6.

8. A decoding device to which index data output to a transmission path by an audio signal encoding device according to claim 1, 2, 3, 4, 5, or 6 is input. In the codebook, the same data as the storage data of the codebook provided in the encoding device is stored, and the codebook is searched using the input index data related to the amplitude component and the phase component. Vector inverse quantization means for reading vector data relating to the amplitude component and phase component matching the respective index data; and amplitude component data relating to the MDCT coefficient by combining the vector data read by the vector inverse quantization means. Vector synthesizing means for generating phase component data; and synthesizing the amplitude component data and the phase component data obtained by the vector synthesizing means to obtain MDCT coefficients. An audio signal decoding apparatus, comprising: a data synthesizing means to be obtained; and an inverse MDCT means for reproducing an audio signal modulated by the PCM method based on the MDCT coefficient obtained by the data synthesizing means.

9. A decoding apparatus to which index data and a normalization coefficient output to a transmission path by the audio signal encoding apparatus according to claim 7 are input, and wherein codebook storage data provided in the encoding apparatus is provided. Using the codebook storing the same data as the above and the input index data relating to the amplitude component and the phase component, the codebook is searched, and the vector data relating to the amplitude component and the phase component matching the respective index data are retrieved. Vector inverse quantization means for reading out the vector component, inverse normalization means for inversely normalizing the vector data relating to the amplitude component read by the vector inverse quantization means using the normalization coefficient, and the inverse normalization means The normalized vector data and the vector data related to the phase component read by the vector inverse quantization means are combined to generate the amplitude component data related to the MDCT coefficient. Vector synthesizing means for generating data and phase component data, data synthesizing means for synthesizing the amplitude component data and phase component data obtained by the vector synthesizing means to obtain MDCT coefficients, and MDCT coefficients obtained by the data synthesizing means. A decoding apparatus for an audio signal, comprising: an inverse MDCT means for reproducing an audio signal modulated by the PCM method on the basis of the above.

10. The data synthesizing means outputs amplitude component data a
10. The method according to claim 8, wherein x (n) and x (n + 1), which are MDCT coefficients, are obtained by the following Expressions 3 and 4, respectively, using (m) and the phase component data b (m). Sound signal decoding device. (Equation 3) (Equation 4) (However, n and m are spectrum numbers, and n = 0, 2, 4, 6,...
.., N−1, m = 0, 1, 2, 3,..., (N−1) / 2, and N is the number of spectra in one frame. )