JP4567412B2

JP4567412B2 - Audio playback device and audio playback method

Info

Publication number: JP4567412B2
Application number: JP2004309597A
Authority: JP
Inventors: 紀之高島; 政一秋保; 美紀長谷山
Original assignee: Alpine Electronics Inc
Current assignee: Alpine Electronics Inc
Priority date: 2004-10-25
Filing date: 2004-10-25
Publication date: 2010-10-20
Anticipated expiration: 2024-10-25
Also published as: JP2006119524A

Description

本発明は、音声再生機および音声再生方法に係り、特に、圧縮音声を再生するのに好適な音声再生機および音声再生方法に関する。 The present invention relates to an audio reproducing device and an audio reproducing method, and more particularly to an audio reproducing device and an audio reproducing method suitable for reproducing compressed audio.

従来から、音声再生機においては、ＭＰ３やＷＭＡ等のいわゆる圧縮アルゴリズムを用いることによって、アナログの音声信号に対する圧縮・符号化を行ってデジタルの音声信号である圧縮音声を得るようになっていた。 Conventionally, in an audio player, a compressed audio, which is a digital audio signal, is obtained by compressing and encoding an analog audio signal by using a so-called compression algorithm such as MP3 or WMA.

そして、圧縮・符号化によって得られた圧縮音声は、所望の再生サンプリング周波数の下で伸長・復号化されて再生されるようになっていた。 The compressed audio obtained by compression / encoding is expanded and decoded under a desired reproduction sampling frequency and reproduced.

ここで、図８は、一般的な非圧縮音声（原曲）のデジタル符号化フォーマットを、図９は、当該非圧縮音声のスペクトラムをそれぞれ示したものであり、これら図８、図９に示すように、デジタル化された音声データの記録可能な周波数の上限は、自信号のサンプリング周波数の約１／２であることが知られている。 Here, FIG. 8 shows a digital encoding format of a general uncompressed sound (original music), and FIG. 9 shows a spectrum of the uncompressed sound, and these are shown in FIGS. Thus, it is known that the upper limit of the recordable frequency of the digitized audio data is about ½ of the sampling frequency of the own signal.

例えば、ＣＤの場合は、符号化の際のサンプリング周波数が４４．１ｋＨｚであるのに対し、実際にＣＤに記録可能な音楽の周波数の上限は２２．０５ｋＨｚとなる。 For example, in the case of a CD, the sampling frequency at the time of encoding is 44.1 kHz, whereas the upper limit of the frequency of music that can be actually recorded on a CD is 22.05 kHz.

特開２００３−３３３６９８号公報JP 2003-333698 A

しかしながら、ＭＰ３等の圧縮音声においては、特に低ビットレートで符号化された場合には、符号化の際のサンプリング周波数は、図８の場合と同一であっても、図１０、図１１に示すように、周波数帯域の上限がカットされてしまうことがあった。 However, for compressed audio such as MP3, particularly when encoded at a low bit rate, the sampling frequency at the time of encoding is the same as in FIG. As described above, the upper limit of the frequency band may be cut.

この結果、圧縮音声は、例えば４４．１ｋＨｚの再生サンプリング周波数の下で再生される場合においても、実際には、サンプリング周波数２２．０５ｋＨｚや３２ｋＨｚ程度の周波数帯域しか含まれていなかった。 As a result, even when the compressed audio is reproduced under a reproduction sampling frequency of 44.1 kHz, for example, only the frequency band of about sampling frequency 22.05 kHz or 32 kHz is actually included.

すなわち、従来は、ＭＰ３等による符号化によって高音域の信号が失われてしまう結果、原音に近い音声を再現することができないといった問題が生じていた。 In other words, conventionally, as a result of loss of a high-frequency signal due to encoding by MP3 or the like, there has been a problem that it is impossible to reproduce sound close to the original sound.

そこで、本発明は、このような問題に鑑みなされたものであり、圧縮によって失われた音域を復元することができ、原音に近い音声を再現することができる音声再生機および音声再生方法を提供することを目的とするものである。 Therefore, the present invention has been made in view of such a problem, and provides an audio playback device and an audio playback method capable of restoring a sound range lost by compression and reproducing sound close to the original sound. It is intended to do.

前述した目的を達成するため、本発明に係る音声再生機の特徴は、所定の圧縮フォーマットにしたがって圧縮・符号化された圧縮音声を、所望の再生サンプリング周波数の下で伸長・復号化して再生する音声再生機において、前記圧縮音声の周波数帯域の上限周波数が、前記再生サンプリング周波数の下での再生に適する所定の判定周波数に満たないか否かを判定する判定装置と、この判定装置の判定結果に基づき、周波数帯域の上限周波数が前記判定周波数に満たない圧縮音声に対して低域通過フィルタ処理後にダウンサンプリング処理を施すダウンサンプリング処理装置と、このダウンサンプリング処理装置によって前記ダウンサンプリング処理が施された圧縮音声に対して、前記圧縮の際に失われた周波数帯域を補間する補間処理をともなうアップサンプリング処理を施すアップサンプリング処理装置とを備えた点にある。 In order to achieve the above-mentioned object, the audio player according to the present invention is characterized in that compressed audio that has been compressed and encoded according to a predetermined compression format is expanded and decoded under a desired reproduction sampling frequency and reproduced. In the audio player, a determination device that determines whether or not an upper limit frequency of the frequency band of the compressed sound is less than a predetermined determination frequency suitable for reproduction under the reproduction sampling frequency, and a determination result of the determination device based on a down-sampling processing apparatus for performing the down-sampling process after the low-pass filtering, the down-sampling processing by the down-sampling processing unit is subjected to compressed audio that is an upper limit frequency of the frequency band less than the determination frequency Interpolation processing for interpolating the frequency band lost during the compression is applied to the compressed audio. It lies in having an up-sampling processing apparatus for performing the Nau upsampling process.

そして、このような構成によれば、周波数帯域の上限周波数が判定周波数に満たない圧縮音声に対してダウンサンプリング処理装置によって低域通過フィルタ処理後にダウンサンプリング処理を施した後、さらに、アップサンプリング処理装置によって補間処理をともなうアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができ、この圧縮音声を再生サンプリング周波数の下で伸長・復号化して再生することが可能となる。 According to such a configuration, after the downsampling processing is performed after the low-pass filter processing by the downsampling processing device on the compressed audio whose upper limit frequency of the frequency band is less than the determination frequency, the upsampling processing is further performed. By performing upsampling processing with interpolation processing by the device, it is possible to obtain compressed audio in which the frequency band lost during compression is interpolated, and decompressing and decoding this compressed audio under the reproduction sampling frequency It can be played back.

また、本発明に係る音声再生機の特徴は、判定周波数が、再生サンプリング周波数の１／２の周波数である点にある。 In addition, the audio player according to the present invention is characterized in that the determination frequency is a half of the reproduction sampling frequency.

そして、このような構成によれば、周波数帯域の上限周波数が再生サンプリング周波数の１／２に満たない圧縮音声に対してダウンサンプリング処理装置によって低域通過フィルタ処理後にダウンサンプリング処理を施した後、さらに、アップサンプリング処理装置によって補間処理をともなうアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができ、この圧縮音声を再生サンプリング周波数の下で伸長・復号化して再生することが可能となる。 According to such a configuration, after the downsampling processing is performed after the low-pass filter processing by the downsampling processing device for the compressed audio whose upper limit frequency of the frequency band is less than 1/2 of the reproduction sampling frequency, Furthermore, by performing an upsampling process accompanied by an interpolation process by an upsampling processing device, it is possible to obtain a compressed sound in which the frequency band lost during the compression is interpolated. It is possible to reproduce by decompressing and decoding.

さらに、本発明に係る音声再生機の特徴は、判定装置は、圧縮音声のビットレートが所定の条件を満足するか否かを判定するように形成され、ダウンサンプリング処置装置は、周波数帯域の上限周波数が判定周波数に満たない圧縮音声であって、前記ビットレートが前記所定の条件を満足する圧縮音声に対して、低域通過フィルタ処理後のダウンサンプリング処理を施すように形成されている点にある。 Further, the sound reproducing apparatus according to the present invention is characterized in that the determination device is formed so as to determine whether or not the bit rate of the compressed sound satisfies a predetermined condition, and the downsampling treatment device has an upper limit of the frequency band. It is configured to perform a downsampling process after a low-pass filter process on a compressed sound whose frequency is less than the determination frequency and the bit rate satisfies the predetermined condition. is there.

そして、このような構成によれば、圧縮音声の周波数帯域の上限周波数と、当該圧縮音声のビットレートとに応じて、低域通過フィルタ処理後のダウンサンプリング処理およびその後の補間処理をともなうアップサンプリング処理を施すか否かを選択することが可能となる。 According to such a configuration, the upsampling with the downsampling process after the low-pass filter process and the subsequent interpolation process is performed according to the upper limit frequency of the frequency band of the compressed sound and the bit rate of the compressed sound. It is possible to select whether or not to perform processing.

さらにまた、本発明に係る音声再生機の特徴は、判定装置が、再生サンプリング周波数に応じて異なる所定の条件を満足するビットレートのデータを、当該ビットレートに対応する再生サンプリング周波数のデータと互いに対応関係をもたせた状態で格納したテーブルを備え、圧縮音声が前記所定の条件を満足するか否かを前記テーブルを参照して判定するように形成されている点にある。 Still further, the sound reproducing apparatus according to the present invention is characterized in that the determination device allows the bit rate data satisfying a predetermined condition that differs depending on the reproduction sampling frequency to be mutually shared with the reproduction sampling frequency data corresponding to the bit rate. A table stored in a state of correspondence is provided, and the table is configured to determine whether or not the compressed audio satisfies the predetermined condition with reference to the table.

そして、このような構成によれば、判定装置により、ビットレートが所定の条件を満足するか否かをテーブルを参照することによって簡易かつ迅速に判定することが可能となる。 According to such a configuration, the determination device can easily and quickly determine whether or not the bit rate satisfies a predetermined condition by referring to the table.

また、本発明に係る音声再生機の特徴は、アップサンプリング処理装置が、補間処理としてフラクタル補間処理を行う点にある。 Further, the audio player according to the present invention is characterized in that the upsampling processing apparatus performs fractal interpolation processing as interpolation processing.

そして、このような構成によれば、フラクタル補間処理を施すことによって、圧縮の際に失われた周波数帯域をさらに適切に補間することが可能となる。 And according to such a structure, it becomes possible to interpolate the frequency band lost in the compression more appropriately by performing the fractal interpolation process.

さらに、本発明に係る音声再生方法の特徴は、所定の圧縮フォーマットにしたがって圧縮・符号化された圧縮音声を、所望の再生サンプリング周波数の下で伸長・復号化して再生する音声再生方法において、前記圧縮音声の周波数帯域の上限周波数が、前記再生サンプリング周波数の下での再生に適する所定の判定周波数に満たない場合には、当該圧縮音声に対して低域通過フィルタ処理後にダウンサンプリング処理を施し、次いで、前記ダウンサンプリング処理を施した圧縮音声に対して、前記圧縮の際に失われた周波数帯域を補間する補間処理をともなうアップサンプリング処理を施し、これらのダウンサンプリング処理および補間処理をともなうアップサンプリング処理を施した圧縮音声を前記再生サンプリング周波数の下で伸長・復号化して再生する点にある。 Further, the audio reproduction method according to the present invention is characterized in that in the audio reproduction method for reproducing the compressed audio compressed and encoded according to a predetermined compression format under a desired reproduction sampling frequency, the audio is reproduced. When the upper limit frequency of the frequency band of the compressed audio is less than a predetermined determination frequency suitable for reproduction under the reproduction sampling frequency, the compressed audio is subjected to down-sampling processing after low-pass filter processing , Next, the compressed audio subjected to the downsampling process is subjected to an upsampling process with an interpolation process for interpolating a frequency band lost during the compression, and the upsampling with the downsampling process and the interpolation process is performed. The processed compressed audio is decompressed and decompressed under the playback sampling frequency. It turned into and lies in the fact that to play.

そして、このような方法によれば、周波数帯域の上限周波数が判定周波数に満たない圧縮音声に対する低域通過フィルタ処理後のダウンサンプリング処理および補間処理をともなうアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができ、この圧縮音声を再生サンプリング周波数の下で伸長・復号化して再生することが可能となる。 According to such a method, by performing the upsampling process with the downsampling process and the interpolation process after the low-pass filter process for the compressed sound whose upper limit frequency of the frequency band is less than the determination frequency, Therefore, it is possible to obtain a compressed sound obtained by interpolating the lost frequency band, and to decompress and decode the compressed sound at a reproduction sampling frequency.

さらにまた、本発明に係る音声再生方法の特徴は、判定周波数を、再生サンプリング周波数の１／２の周波数とする点にある。 Furthermore, the sound reproduction method according to the present invention is characterized in that the determination frequency is set to a half of the reproduction sampling frequency.

そして、このような方法によれば、周波数帯域の上限周波数が再生サンプリング周波数の１／２に満たない圧縮音声に対する低域通過フィルタ処理後のダウンサンプリング処理および補間処理をともなうアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができ、この圧縮音声を再生サンプリング周波数の下で伸長・復号化して再生することが可能となる。 According to such a method, the upsampling process with the downsampling process and the interpolation process after the low-pass filter process is performed on the compressed sound whose upper limit frequency of the frequency band is less than 1/2 of the reproduction sampling frequency. Thus, it is possible to obtain compressed audio in which the frequency band lost during compression is interpolated, and it is possible to reproduce the compressed audio by decompressing and decoding it under the reproduction sampling frequency.

また、本発明に係る音声再生方法の特徴は、圧縮音声のビットレートが所定の条件を満足するか否かを判定し、周波数帯域の上限周波数が判定周波数に満たない圧縮音声であって、前記ビットレートが前記所定の条件を満足する圧縮音声に対して、低域通過フィルタ処理後のダウンサンプリング処理を施す点にある。 Further, the audio reproduction method according to the present invention is characterized in that it is determined whether or not the bit rate of the compressed audio satisfies a predetermined condition, and the compressed audio whose upper limit frequency of the frequency band does not satisfy the determination frequency, The downsampling process after the low-pass filter process is performed on the compressed sound whose bit rate satisfies the predetermined condition.

そして、このような方法によれば、圧縮音声の周波数帯域の上限周波数と、当該圧縮音声のビットレートとに応じて、低域通過フィルタ処理後のダウンサンプリング処理およびその後の補間処理をともなうアップサンプリング処理を施すか否かを選択することが可能となる。 According to such a method, according to the upper limit frequency of the frequency band of the compressed audio and the bit rate of the compressed audio, the upsampling with the downsampling process after the low-pass filter process and the subsequent interpolation process is performed. It is possible to select whether or not to perform processing.

さらに、本発明に係る音声再生方法の特徴は、前記再生サンプリング周波数に応じて異なる前記所定の条件を満足するビットレートのデータを、当該ビットレートに対応する再生サンプリング周波数のデータと互いに対応関係をもたせた状態で格納したテーブルを用意し、前記圧縮音声が前記所定の条件を満足するか否かを前記テーブルを参照して判定する点にある。 Furthermore, the audio reproduction method according to the present invention is characterized in that data of a bit rate satisfying the predetermined condition that differs depending on the reproduction sampling frequency is correlated with the data of the reproduction sampling frequency corresponding to the bit rate. A table stored in a laid state is prepared, and it is determined with reference to the table whether or not the compressed sound satisfies the predetermined condition.

そして、このような方法によれば、ビットレートが所定の条件を満足するか否かをテーブルを参照することによって簡易かつ迅速に判定することが可能となる。 According to such a method, it is possible to easily and quickly determine whether or not the bit rate satisfies a predetermined condition by referring to the table.

さらにまた、本発明に係る音声再生方法の特徴は、補間処理としてフラクタル補間処理を施す点にある。 Furthermore, the sound reproduction method according to the present invention is characterized in that fractal interpolation processing is performed as interpolation processing.

そして、このような方法によれば、フラクタル補間処理を施すことによって、圧縮の際に失われた周波数帯域をさらに適切に補間することが可能となる。 And according to such a method, it becomes possible to interpolate the frequency band lost at the time of compression more appropriately by performing fractal interpolation processing.

本発明に係る音声再生機によれば、周波数帯域の上限周波数が判定周波数に満たない圧縮音声に対してダウンサンプリング処理装置によって低域通過フィルタ処理後にダウンサンプリング処理を施した後、さらに、アップサンプリング処理装置によって補間処理をともなうアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができ、この圧縮音声を再生サンプリング周波数の下で伸長・復号化して再生することができる結果、圧縮によって失われた音域を復元することができ、原音に近い音声を再現することができる。 According to the sound reproducing device of the present invention, after the downsampling processing is performed after the low-pass filter processing by the downsampling processing device on the compressed sound whose upper limit frequency of the frequency band is less than the determination frequency, the upsampling is further performed. By performing upsampling processing with interpolation processing by the processing device, it is possible to obtain compressed audio in which the frequency band lost during compression is interpolated, and decompressing and decoding this compressed audio under the reproduction sampling frequency As a result, the sound range lost by the compression can be restored, and the sound close to the original sound can be reproduced.

また、本発明に係る音声再生機によれば、周波数帯域の上限周波数が再生サンプリング周波数の１／２に満たない圧縮音声に対してダウンサンプリング処理装置によって低域通過フィルタ処理後にダウンサンプリング処理を施した後、さらに、アップサンプリング処理装置によって補間処理をともなうアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができ、この圧縮音声を再生サンプリング周波数の下で伸長・復号化して再生することができる結果、圧縮によって失われた音域を復元することができ、原音に近い音声を再現することができる。 In addition, according to the sound reproducing device of the present invention, the downsampling processing is performed after the low-pass filter processing by the downsampling processing device for the compressed sound whose upper limit frequency of the frequency band is less than 1/2 of the reproduction sampling frequency. After that, by performing an upsampling process accompanied by an interpolation process by the upsampling processing device, a compressed sound in which the frequency band lost during the compression is interpolated can be obtained. As a result, the range lost by compression can be restored, and the sound close to the original sound can be reproduced.

さらに、本発明に係る音声再生機によれば、圧縮音声の周波数帯域の上限周波数と、当該圧縮音声のビットレートとに応じて、低域通過フィルタ処理後のダウンサンプリング処理およびその後の補間処理をともなうアップサンプリング処理を施すか否かを選択することができる結果、圧縮によって失われた音域をさらに良好に復元することができ、より原音に近い音声を再現することができる。 Furthermore, according to the audio player according to the present invention, the downsampling process after the low-pass filter process and the subsequent interpolation process are performed according to the upper limit frequency of the frequency band of the compressed audio and the bit rate of the compressed audio. As a result of being able to select whether or not to perform the upsampling process, the sound range lost by the compression can be restored more satisfactorily, and the sound closer to the original sound can be reproduced.

さらにまた、本発明に係る音声再生機によれば、判定装置により、ビットレートが所定の条件を満足するか否かをテーブルを参照することによって簡易かつ迅速に判定することができる結果、圧縮によって失われた音域をさらに安価にかつ効率的に復元することができる。 Furthermore, according to the sound reproducing device of the present invention, the determination device can easily and quickly determine whether or not the bit rate satisfies the predetermined condition by referring to the table. The lost sound range can be restored more inexpensively and efficiently.

また、本発明に係る音声再生機によれば、フラクタル補間処理を施すことによって、圧縮の際に失われた周波数帯域をさらに適切に補間することができる結果、圧縮によって失われた音域をさらに高精度に復元することができ、より原音に近い音声を再現することができる。 Further, according to the sound reproducing device of the present invention, by performing the fractal interpolation process, the frequency band lost during the compression can be more appropriately interpolated, and as a result, the sound range lost by the compression can be further increased. The sound can be restored to accuracy, and the sound closer to the original sound can be reproduced.

さらに、本発明に係る音声再生方法によれば、周波数帯域の上限周波数が判定周波数に満たない圧縮音声に対する低域通過フィルタ処理後のダウンサンプリング処理および補間処理をともなうアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができ、この圧縮音声を再生サンプリング周波数の下で伸長・復号化して再生することができる結果、圧縮によって失われた音域を復元することができ、原音に近い音声を再現することができる。 Furthermore, according to the sound reproduction method according to the present invention, by performing the upsampling process with the downsampling process and the interpolation process after the low-pass filter process for the compressed sound whose upper limit frequency of the frequency band is less than the determination frequency, It is possible to obtain compressed audio in which the frequency band lost during compression is interpolated, and this compressed audio can be decompressed and decoded under the reproduction sampling frequency. The sound can be restored and the sound close to the original sound can be reproduced.

さらにまた、本発明に係る音声再生方法によれば、周波数帯域の上限周波数が再生サンプリング周波数の１／２に満たない圧縮音声に対する低域通過フィルタ処理後のダウンサンプリング処理および補間処理をともなうアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができ、この圧縮音声を再生サンプリング周波数の下で伸長・復号化して再生することができる結果、圧縮によって失われた音域を復元することができ、原音に近い音声を再現することができる。 Furthermore, according to the audio reproduction method of the present invention, the upsampling with the downsampling process and the interpolation process after the low-pass filter process for the compressed audio whose upper limit frequency of the frequency band is less than 1/2 of the reproduction sampling frequency. By applying the processing, it is possible to obtain compressed audio in which the frequency band lost during compression is interpolated, and this compressed audio can be decompressed and decoded under the reproduction sampling frequency, resulting in compression. Can restore the lost sound range and reproduce the sound close to the original sound.

また、本発明に係る音声再生方法によれば、圧縮音声の周波数帯域の上限周波数と、当該圧縮音声のビットレートとに応じて、低域通過フィルタ処理後のダウンサンプリング処理およびその後の補間処理をともなうアップサンプリング処理を施すか否かを選択することができる結果、圧縮によって失われた音域をさらに良好に復元することができ、より原音に近い音声を再現することができる。 Further, according to the audio reproduction method of the present invention, the downsampling process after the low-pass filter process and the subsequent interpolation process are performed according to the upper limit frequency of the frequency band of the compressed audio and the bit rate of the compressed audio. As a result of being able to select whether or not to perform the upsampling process, the sound range lost by the compression can be restored more satisfactorily, and the sound closer to the original sound can be reproduced.

さらに、本発明に係る音声再生方法によれば、ビットレートが所定の条件を満足するか否かをテーブルを参照することによって簡易かつ迅速に判定することができる結果、圧縮によって失われた音域をさらに安価にかつ効率的に復元することができる。 Furthermore, according to the audio reproduction method of the present invention, it is possible to easily and quickly determine whether or not the bit rate satisfies a predetermined condition by referring to the table. Further, it can be restored at low cost and efficiently.

さらにまた、本発明に係る音声再生方法によれば、フラクタル補間処理を施すことによって、圧縮の際に失われた周波数帯域をさらに適切に補間することができる結果、圧縮によって失われた音域をさらに高精度に復元することができ、より原音に近い音声を再現することができる。 Furthermore, according to the audio reproduction method of the present invention, the frequency band lost during the compression can be more appropriately interpolated by performing the fractal interpolation process. It can be restored with high accuracy, and sound closer to the original sound can be reproduced.

以下、本発明に係る音声再生機の実施形態について、図１乃至図７を参照して説明する。 Hereinafter, embodiments of an audio playback device according to the present invention will be described with reference to FIGS.

図１に示すように、本実施形態における音声再生機１は、判定装置としての圧縮音声判定部２を有しており、この圧縮音声判定部２には、ＣＤ、放送、通信などを通じて楽音等の圧縮音声が入力されるようになっている。 As shown in FIG. 1, the audio player 1 in this embodiment has a compressed audio determination unit 2 as a determination device, and the compressed audio determination unit 2 includes music, etc. through CD, broadcast, communication, and the like. Compressed audio is input.

そして、圧縮音声判定部２は、入力された圧縮音声の圧縮フォーマットに基づいて、圧縮音声の周波数帯域の上限周波数が、所定の再生サンプリング周波数の下での再生に適する判定周波数としての再生サンプリング周波数の１／２の周波数に満たないか否かを判定するようになっている。 Based on the compression format of the input compressed audio, the compressed audio determination unit 2 has a reproduction sampling frequency as a determination frequency suitable for reproduction with the upper limit frequency of the frequency band of the compressed audio being reproduced under a predetermined reproduction sampling frequency. It is determined whether or not the frequency is less than half the frequency.

さらに、圧縮音声判定部２は、入力された圧縮音声のビットレート（ｂｐｓ）に基づいて、当該ビットレートの値が、再生サンプリング周波数に応じて異なる所定の条件を満足するか否かを判定するようになっている。 Further, the compressed audio determination unit 2 determines whether or not the value of the bit rate satisfies a predetermined condition that differs depending on the reproduction sampling frequency, based on the bit rate (bps) of the input compressed audio. It is like that.

なお、前記所定の条件は、前記再生サンプリング波数との関係において好適な値を選択すればよい。例えば、前記再生サンプリング周波数が４４．１ｋＨｚの場合には、前記所定の条件は、９６ｋｂｐｓ以下であることとしてもよい。 For the predetermined condition, a suitable value may be selected in relation to the reproduction sampling wave number. For example, when the reproduction sampling frequency is 44.1 kHz, the predetermined condition may be 96 kbps or less.

また、圧縮音声判定部２に、以下の表１に示すようなテーブルを用意し、このテーブル内に、前記再生サンプリング周波数に応じて異なる前記所定の条件を満足する複数のビットレートのデータを、各ビットレートに対応する再生サンプリング周波数のデータと互いに対応関係をもたせた状態で格納しておくようにしてもよい。 Further, a table as shown in Table 1 below is prepared in the compressed sound determination unit 2, and data of a plurality of bit rates satisfying the predetermined condition that differs depending on the reproduction sampling frequency are included in this table. The reproduction sampling frequency data corresponding to each bit rate may be stored in correspondence with each other.

例えば、表１においては、再生サンプリング周波数が４４．１ｋＨｚの場合における所定の条件を満足するビットレートは、表１の左欄に示すように、３２ｋｂｐｓ、６４ｋｂｐｓおよび９６ｋｂｐｓとなっている。圧縮音声がこれらのビットレートの値をとる場合には、表１の右欄に示すように、圧縮音声に対して後述するＦＩＦアップサンプリング処理（表１右欄におけるＦＩＦ処理）が施されるようになっている。なお、この場合、デジタル的には、表１の右欄に示すように、ＦＩＦ処理を制御するための信号の値が１となる。一方、表１において、１２８ｋｂｐｓ、１６０ｋｂｐｓ等のビットレートは、再生サンプリング周波数が４４．１ｋＨｚの下での所定の条件を満足しないものとなり、この場合には、前記信号の値は０となり、ＦＩＦアップサンプリング処理は施されないことになる。

For example, in Table 1, as shown in the left column of Table 1, the bit rates that satisfy the predetermined condition when the reproduction sampling frequency is 44.1 kHz are 32 kbps, 64 kbps, and 96 kbps. When the compressed audio takes these bit rate values, as shown in the right column of Table 1, the IF upsampling processing (FIF processing in the right column of Table 1) described later is performed on the compressed audio. It has become. In this case, digitally, as shown in the right column of Table 1, the value of the signal for controlling the IF processing is 1. On the other hand, in Table 1, bit rates such as 128 kbps and 160 kbps do not satisfy a predetermined condition when the reproduction sampling frequency is 44.1 kHz. In this case, the value of the signal is 0, and the FIFO is increased. Sampling processing is not performed.

また、表１においては、再生サンプリング周波数が４８．０ｋＨｚの場合における所定の条件を満足するビットレートは、表１の左欄に示すように、３２ｋｂｐｓ、６４ｋｂｐｓおよび９６ｋｂｐｓとなっている。圧縮音声がこれらのビットレートの値をとる場合には、表１の右欄に示すように、前記信号の値が１となり、圧縮音声に対してＦＩＦアップサンプリング処理（表１右欄におけるＦＩＦ処理）が施されるようになっている。一方、表１において、１２８ｋｂｐｓ、１６０ｋｂｐｓ等のビットレートは、再生サンプリング周波数が４８．０ｋＨｚの下での所定の条件を満足しないものとなり、この場合には、前記信号の値が０となり、ＦＩＦアップサンプリング処理は施されないことになる。 In Table 1, as shown in the left column of Table 1, the bit rates that satisfy the predetermined condition when the reproduction sampling frequency is 48.0 kHz are 32 kbps, 64 kbps, and 96 kbps. When the compressed audio takes these bit rate values, as shown in the right column of Table 1, the value of the signal becomes 1, and the IF upsampling processing (FIF processing in the right column of Table 1) is applied to the compressed audio. ) Is given. On the other hand, in Table 1, bit rates such as 128 kbps and 160 kbps do not satisfy the predetermined condition when the reproduction sampling frequency is 48.0 kHz. In this case, the value of the signal is 0, and the FIFO is increased. Sampling processing is not performed.

このようにすれば、入力された圧縮音声のビットレートが前記所定の条件を満足するか否かをテーブルを参照することによって簡易かつ迅速に判定することが可能となる。 In this way, it is possible to easily and quickly determine whether or not the bit rate of the input compressed audio satisfies the predetermined condition by referring to the table.

圧縮音声判定部２の出力側には、ダウンサンプリング処理装置としてのダウンサンプリング処理部３が接続されている。 A downsampling processing unit 3 as a downsampling processing device is connected to the output side of the compressed sound determination unit 2.

このダウンサンプリング処理部３には、圧縮音声判定部２から、図２に示すような周波数帯域の上限周波数が再生サンプリング周波数Ｆｓの１／２に満たないフォーマットの圧縮音声であって、かつ、ビットレートが前記所定の条件を満足する圧縮音声であって、ダウンサンプリング処理を要する圧縮音声（以下、「該当圧縮音声」と称する）が入力されるようになっている。 The downsampling processing unit 3 receives from the compressed audio determination unit 2 compressed audio having a format in which the upper limit frequency of the frequency band as shown in FIG. Compressed sound whose rate satisfies the predetermined condition and which requires downsampling processing (hereinafter referred to as “corresponding compressed sound”) is input.

そして、ダウンサンプリング処理部３は、入力された該当圧縮音声に対して、低域通過フィルタ処理およびその後のダウンサンプリング処理（以下、単にダウンサンプリング処理と略称する）を施し、このダウンサンプリング処理を施した該当圧縮音声を出力するようになっている。 The down-sampling processing unit 3 performs low-pass filter processing and subsequent down-sampling processing (hereinafter simply referred to as “down-sampling processing”) on the input compressed audio, and performs this down-sampling processing. The corresponding compressed audio is output.

なお、ダウンサンプリング処理は、例えば、図３に示すようにダウンサンプリング処理後のサンプリング周波数Ｆｓ’が再生サンプリング周波数Ｆｓの半分１／２Ｆｓになるようにしてもよい。 In the downsampling process, for example, as shown in FIG. 3, the sampling frequency Fs ′ after the downsampling process may be half Fs ′ of the reproduction sampling frequency Fs.

ダウンサンプリング処理部３の出力側には、アップサンプリング処理装置としてのＦＩＦ（Fractal Interpolation Functions）アップサンプリング処理部４が接続されている。 An FIF (Fractal Interpolation Functions) upsampling processing unit 4 as an upsampling processing device is connected to the output side of the downsampling processing unit 3.

このＦＩＦアップサンプリング処理部４には、ダウンサンプリング処理部３から出力されたダウンサンプリング処理後の該当圧縮音声が入力されるようになっている。 The FIF upsampling processing unit 4 is input with the corresponding compressed audio output from the downsampling processing unit 3 after downsampling processing.

そして、ＦＩＦアップサンプリング処理部４は、入力されたダウンサンプリング処理後の該当圧縮音声に対して、図４、図５に示すようなフラクタル補間処理をともなうアップサンプリング処理（以下、「ＦＩＦアップサンプリング処理」と称する）を施すことによって圧縮の際に失われた周波数帯域を補間するようになっている。 Then, the FIFO upsampling processing unit 4 performs upsampling processing (hereinafter referred to as “FIF upsampling processing”) with the fractal interpolation processing as shown in FIGS. 4 and 5 on the input compressed speech after the downsampling processing. The frequency band lost at the time of compression is interpolated.

フラクタル補間処理は、圧縮音声を例えば図５に示すような波形を有する図形としてとらえ、この圧縮音声の図形が、自己相似性をもつ図形すなわちある単一の線分を適宜縮小、拡大あるいは回転させたものをつなぎ合わせることによって構成されている図形とみなし、当該線分を援用することによって失われた周波数帯域を補間する処理である。このようなフラクタル補間処理は、原音に近い音声を高精度に再現するのに極めて好適な手法である。 In the fractal interpolation processing, the compressed speech is regarded as a figure having a waveform as shown in FIG. 5, for example, and the figure of the compressed voice is reduced, enlarged or rotated as appropriate by a self-similar figure, that is, a single line segment. This is a process of interpolating the frequency band lost by using the line segment, assuming that the figure is configured by connecting the objects together. Such a fractal interpolation process is a very suitable technique for reproducing a sound close to the original sound with high accuracy.

したがって、本実施形態においては、圧縮音声に対してダウンサンプリング処理部３によってダウンサンプリング処理を施した後、さらに、ＦＩＦアップサンプリング処理部４によってＦＩＦアップサンプリング処理を施すことによって、圧縮の際に失われた周波数帯域を適切に補間することができる。 Therefore, in the present embodiment, after downsampling processing is performed on the compressed audio by the downsampling processing unit 3, and further, the IF upsampling processing unit 4 performs the IF upsampling processing, so that it is lost during compression. The interpolated frequency band can be appropriately interpolated.

なお、ＦＩＦアップサンプリング処理部４には、圧縮音声判定部２から直ちに該当圧縮音声以外のフォーマットの圧縮音声が入力されるようになっている。この圧縮音声としては、例えば、再生サンプリング周波数が４４．１ｋＨｚであるのに対して、２２．０５ｋＨｚのサンプリング周波数で圧縮された圧縮音声のように、周波数帯域の上限周波数が再生サンプリング周波数Ｆｓの１／２に満たないフォーマットの圧縮音声であり、かつ、ダウンサンプリング処理を要しない圧縮音声が該当する。 The FIF upsampling processing unit 4 is immediately supplied with compressed audio in a format other than the corresponding compressed audio from the compressed audio determining unit 2. As this compressed sound, for example, while the reproduction sampling frequency is 44.1 kHz, the upper limit frequency of the frequency band is 1 of the reproduction sampling frequency Fs as in the compressed sound compressed at the sampling frequency of 22.05 kHz. Compressed audio that has a format less than / 2 and does not require downsampling processing.

そして、ＦＩＦアップサンプリング処理部４は、圧縮音声判定部２から直に入力された圧縮音声に対しても、ＦＩＦアップサンプリング処理を施し、このＦＩＦアップサンプリング処理を施した圧縮音声を出力するようになっている。 Then, the IF upsampling processing unit 4 also performs the IF upsampling process on the compressed sound input directly from the compressed sound determination unit 2 and outputs the compressed sound subjected to the IF upsampling process. It has become.

ＦＩＦアップサンプリング処理部４の出力側には、イコライザ（ＥＱ）やＴＣＲ等からなるポストプロセシング部５が接続されており、このポストプロセシング部５には、ＦＩＦアップサンプリング処理部４から出力されたＦＩＦアップサンプリング処理後の圧縮音声が入力されるようになっている。 A post-processing unit 5 made up of an equalizer (EQ), TCR, or the like is connected to the output side of the FIF up-sampling processing unit 4. The compressed audio after the upsampling process is input.

そして、ポストプロセシング部５は、入力された圧縮音声に対して、音質や発音タイミング等を調整するポストプロセシング処理（後処理）を施した後に出力するようになっている。 The post-processing unit 5 performs post-processing (post-processing) for adjusting the sound quality, the sound generation timing, etc., on the input compressed sound, and outputs the result.

ポストプロセシング部５から出力された圧縮音声は、Ｄ／Ａ等を介して伸長・復号化されてスピーカ（図示せず）から音声出力されるようになっている。 The compressed sound output from the post-processing unit 5 is decompressed and decoded via D / A or the like and output from a speaker (not shown).

なお、ポストプロセシング部５には、圧縮音声判定部２から直ちに圧縮音声が入力される場合がある。この場合の圧縮音声としては、例えば、圧縮の際に音域がほとんどカットされなかったロスレス圧縮オーディオ等の周波数帯域の上限周波数が再生サンプリング周波数Ｆｓの１／２に達しているとみなすことができる圧縮音声が該当する。 In some cases, the compressed speech is immediately input from the compressed speech determination unit 2 to the post-processing unit 5. As the compressed sound in this case, for example, compression that can be considered that the upper limit frequency of the frequency band of lossless compressed audio or the like whose sound range is hardly cut during compression has reached 1/2 of the reproduction sampling frequency Fs. Applicable to audio.

次に、本発明に係る音声再生方法の実施形態について、図６および図７を参照して説明する。 Next, an embodiment of a sound reproduction method according to the present invention will be described with reference to FIGS.

本実施形態における音声再生方法は、前述した音声再生機１を一手段として用いることによって実行することができる。 The sound reproducing method in the present embodiment can be executed by using the sound reproducing device 1 described above as one means.

すなわち、まず、図６のステップ１（ＳＴ１）において、圧縮音声判定部２は、ＣＤ、放送、通信等を通じて圧縮音声を取得する。 That is, first, in step 1 (ST1) of FIG. 6, the compressed sound determination unit 2 acquires compressed sound through CD, broadcasting, communication, or the like.

次いで、ステップ２（ＳＴ２）において、圧縮音声判定部２により、ステップ１（ＳＴ１）において取得した圧縮音声の周波数帯域の上限周波数が再生サンプリング周波数Ｆｓの１／２に満たないか否かを判定する。 Next, in step 2 (ST2), the compressed audio determination unit 2 determines whether or not the upper limit frequency of the frequency band of the compressed audio acquired in step 1 (ST1) is less than 1/2 of the reproduction sampling frequency Fs. .

さらに、ステップ２（ＳＴ２）において、圧縮音声判定部２により、ステップ１（ＳＴ１）において取得した圧縮音声のビットレートが、再生サンプリング周波数との関係において所定の条件を満足するか否かを判定する。 Further, in step 2 (ST2), the compressed audio determination unit 2 determines whether or not the bit rate of the compressed audio acquired in step 1 (ST1) satisfies a predetermined condition in relation to the reproduction sampling frequency. .

そして、ステップ２（ＳＴ２）において、圧縮音声の周波数帯域の上限周波数が再生サンプリング周波数Ｆｓの１／２に満たない場合であって、ビットレートが前記所定の条件を満足する場合には、ステップ３（ＳＴ３）に進み、そうでない場合にはステップ６（ＳＴ６）に進む。 In step 2 (ST2), when the upper limit frequency of the frequency band of the compressed audio is less than ½ of the reproduction sampling frequency Fs and the bit rate satisfies the predetermined condition, step 3 Proceed to (ST3), otherwise proceed to step 6 (ST6).

ステップ３（ＳＴ３）においては、圧縮音声判定部２により、圧縮音声の周波数帯域の上限周波数が再生サンプリング周波数Ｆｓの１／２に満たない圧縮音声で、かつ、ビットレートが前記所定の条件を満足する圧縮音声が、ダウンサンプリング処理を要する圧縮音声であるか否かを判定する。 In step 3 (ST3), the compressed audio determination unit 2 causes the compressed audio whose frequency band upper limit frequency is less than 1/2 of the reproduction sampling frequency Fs, and the bit rate satisfies the predetermined condition. It is determined whether or not the compressed sound to be compressed is a compressed sound that requires downsampling processing.

そして、ステップ３（ＳＴ３）において、圧縮音声が、ダウンサンプリング処理を要する圧縮音声（該当圧縮音声）である場合にはステップ４（ＳＴ４）に進み、ダウンサンプリング処理を要しない圧縮音声である場合にはステップ５（ＳＴ５）に進む。 In step 3 (ST3), if the compressed audio is compressed audio that requires downsampling processing (corresponding compressed audio), the process proceeds to step 4 (ST4), and if the compressed audio is compressed audio that does not require downsampling processing. Advances to step 5 (ST5).

ステップ４（ＳＴ４）においては、ダウンサンプリング処理部３により、圧縮音声に対してダウンサンプリング処理を施した後にステップ５（ＳＴ５）に進む。 In step 4 (ST4), the downsampling processing unit 3 applies a downsampling process to the compressed sound, and then the process proceeds to step 5 (ST5).

次いで、ステップ５（ＳＴ５）においては、ＦＩＦアップサンプリング処理部４によって、圧縮音声に対してＦＩＦアップサンプリング処理を施す。 Next, in step 5 (ST5), the FIFO upsampling processing unit 4 performs an IF upsampling process on the compressed audio.

これによって、図７に示すように、圧縮の際に失われた周波数帯域が適切に補間された圧縮音声が得られる。なお、ＦＩＦアップサンプリング処理によって補間される周波数帯域には、高音域は勿論のこと、大きな信号の近傍で削除されてしまった微小信号に対応する音域も含まれている。さらに、図７のように、原曲にはない高音域を生成することもできる。 As a result, as shown in FIG. 7, a compressed sound in which the frequency band lost during the compression is appropriately interpolated can be obtained. Note that the frequency band interpolated by the FIF upsampling process includes not only a high sound range but also a sound range corresponding to a minute signal that has been deleted in the vicinity of a large signal. Furthermore, as shown in FIG. 7, it is also possible to generate a high frequency range that is not found in the original music.

最後に、ステップ６（ＳＴ６）においては、圧縮音声を伸長・復号化して再生する。 Finally, in step 6 (ST6), the compressed sound is decompressed and decoded and reproduced.

以上述べたように、本実施形態によれば、圧縮音声に対してＦＩＦアップサンプリング処理を施すことによって圧縮の際に失われた周波数帯域が補間された圧縮音声を得ることができる結果、圧縮によって失われた音域を高精度に復元することができ、圧縮前の原音に近い高音質の音声を再現することができる。 As described above, according to the present embodiment, it is possible to obtain a compressed sound obtained by interpolating the frequency band lost in the compression by performing the IF upsampling process on the compressed sound. The lost sound range can be restored with high accuracy, and high-quality sound close to the original sound before compression can be reproduced.

なお、本発明は、前述した実施の形態に限定されるものではなく、必要に応じて種々の変更が可能である。 In addition, this invention is not limited to embodiment mentioned above, A various change is possible as needed.

例えば、フラクタル補間処理以外の手法によって、圧縮の際に失われた周波数帯域を補間する補間処理を施すようにしてもよい。 For example, interpolation processing for interpolating a frequency band lost during compression may be performed by a method other than fractal interpolation processing.

また、前記ＦＩＦアップサンプリング処理をＣＤ等の非圧縮ソースに応用し、原曲にはない高音域を生成することによって、ＤＶＤオーディオ並の高音質化を実現することも可能である。 Further, by applying the above-described FIF upsampling processing to an uncompressed source such as a CD and generating a high sound range that is not included in the original music, it is possible to achieve high sound quality equivalent to that of DVD audio.

本発明に係る音声再生機の実施形態を示すブロック図1 is a block diagram showing an embodiment of an audio player according to the present invention. 本発明に係る音声再生機の実施形態において、該当圧縮音声のデジタル符号化フォーマットを示す説明図Explanatory drawing which shows the digital encoding format of applicable compression audio | voice in embodiment of the audio | voice reproducing device based on this invention. 本発明に係る音声再生機の実施形態においてダウンサンプリング後の該当圧縮音声のデジタル符号化フォーマットを示す説明図Explanatory drawing which shows the digital encoding format of the corresponding compression audio | voice after downsampling in embodiment of the audio | voice reproducing apparatus based on this invention. 本発明に係る音声再生機の実施形態においてＦＩＦアップサンプリング処理を施した該当圧縮音声のデジタル符号化フォーマットを示す説明図Explanatory drawing which shows the digital encoding format of the applicable compression audio | voice which performed the IF upsampling process in embodiment of the audio | voice reproducing device based on this invention 本発明に係る音声再生機の実施形態において、フラクタル補間処理を示す概念図The conceptual diagram which shows the fractal interpolation process in embodiment of the audio | voice player based on this invention 本発明に係る音声再生方法の実施形態を示すフローチャートThe flowchart which shows embodiment of the audio | voice reproduction | regeneration method based on this invention 本発明に係る音声再生方法の実施形態において、圧縮の際に失われた周波数帯域が補間された圧縮音声のスペクトラムを示す図The figure which shows the spectrum of the compression audio | voice in which the frequency band lost at the time of compression was interpolated in embodiment of the audio | voice reproduction method which concerns on this invention 一般的な非圧縮音声のデジタル符号化フォーマットを示す説明図Explanatory drawing showing a digital encoding format of general uncompressed audio 一般的な非圧縮音声のスペクトラムを示す図Diagram showing a general uncompressed audio spectrum 周波数帯域の上限がカットされた圧縮音声のデジタル符号化フォーマットを示す説明図Explanatory drawing which shows the digital encoding format of the compression audio | voice from which the upper limit of the frequency band was cut 周波数帯域の上限がカットされた圧縮音声のスペクトラムを示す図Diagram showing the spectrum of compressed audio with the upper limit of the frequency band cut

Explanation of symbols

１音声再生機
２圧縮音声判定部
３ダウンサンプリング処理部
４ＦＩＦアップサンプリング処理部 DESCRIPTION OF SYMBOLS 1 Audio | voice player 2 Compressed sound determination part 3 Downsampling process part 4 FIF upsampling process part

Claims

In an audio reproducing apparatus that reproduces compressed audio that has been compressed and encoded according to a predetermined compression format by decompressing and decoding it under a desired reproduction sampling frequency.
A determination device for determining whether an upper limit frequency of the frequency band of the compressed audio is less than a predetermined determination frequency suitable for reproduction under the reproduction sampling frequency;
Based on the determination result of this determination device, a downsampling processing device that performs downsampling processing after low-pass filter processing for compressed speech whose upper limit frequency of the frequency band is less than the determination frequency;
The compressed audio, wherein the down-sampling process is performed by the down-sampling processing unit, and a up-sampling processing unit that performs up-sampling processing with the interpolation process of interpolating a frequency band that may have been lost during the compression An audio player characterized by that.

The sound reproducing apparatus according to claim 1, wherein the determination frequency is a half of the reproduction sampling frequency.

The determination device is configured to determine whether or not a bit rate of the compressed audio satisfies a predetermined condition,
The down-sampling treatment apparatus performs the down-sampling process on compressed audio whose upper limit frequency of the frequency band is less than the determination frequency and whose bit rate satisfies the predetermined condition. The sound reproducing device according to claim 1 or 2, wherein the sound reproducing device is formed as follows.

The determination apparatus includes a table storing bit rate data satisfying the predetermined condition that differs according to the reproduction sampling frequency in a state of having a corresponding relationship with the reproduction sampling frequency data corresponding to the bit rate. The audio player according to claim 3, further comprising: determining whether or not the compressed audio satisfies the predetermined condition with reference to the table.

5. The sound reproducing device according to claim 1, wherein the upsampling processing device performs a fractal interpolation process as the interpolation process. 6.

In an audio reproduction method for reproducing compressed audio that has been compressed and encoded in accordance with a predetermined compression format, decompressed and decoded under a desired reproduction sampling frequency,
When the upper limit frequency of the frequency band of the compressed audio does not reach a predetermined determination frequency suitable for reproduction under the reproduction sampling frequency, downsampling processing is performed on the compressed audio after low-pass filter processing. ,
Next, the compressed audio subjected to the downsampling process is subjected to an upsampling process with an interpolation process for interpolating a frequency band lost during the compression,
An audio reproduction method characterized in that the compressed audio subjected to the upsampling process accompanied with the downsampling process and the interpolation process is decompressed and decoded under the reproduction sampling frequency.

The audio reproduction method according to claim 6, wherein the determination frequency is a half of the reproduction sampling frequency.

Determining whether the bit rate of the compressed audio satisfies a predetermined condition;
7. The downsampling process is performed on compressed audio in which the upper limit frequency of the frequency band is less than the determination frequency and the bit rate satisfies the predetermined condition. Or the audio | voice reproduction method of Claim 7.

Preparing a table storing bit rate data satisfying the predetermined condition depending on the reproduction sampling frequency in a state of having a corresponding relationship with the reproduction sampling frequency data corresponding to the bit rate; 9. The audio reproduction method according to claim 8, wherein whether or not the audio satisfies the predetermined condition is determined with reference to the table.

10. The audio reproduction method according to claim 6, wherein fractal interpolation processing is performed as the interpolation processing.