JPS5850357B2 - Speech analysis and synthesis device - Google Patents
Speech analysis and synthesis deviceInfo
- Publication number
- JPS5850357B2 JPS5850357B2 JP53037495A JP3749578A JPS5850357B2 JP S5850357 B2 JPS5850357 B2 JP S5850357B2 JP 53037495 A JP53037495 A JP 53037495A JP 3749578 A JP3749578 A JP 3749578A JP S5850357 B2 JPS5850357 B2 JP S5850357B2
- Authority
- JP
- Japan
- Prior art keywords
- audio
- signal
- average power
- noise
- short
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 27
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 27
- 230000005236 sound signal Effects 0.000 claims abstract description 28
- 238000001228 spectrum Methods 0.000 claims abstract description 22
- 230000005284 excitation Effects 0.000 claims description 13
- 230000002194 synthesizing effect Effects 0.000 claims description 2
- 238000005259 measurement Methods 0.000 abstract description 4
- 239000002131 composite material Substances 0.000 abstract 2
- 230000005540 biological transmission Effects 0.000 description 15
- 238000010586 diagram Methods 0.000 description 4
- 230000008054 signal transmission Effects 0.000 description 3
- 238000000034 method Methods 0.000 description 2
- 238000010183 spectrum analysis Methods 0.000 description 2
- 101100389815 Caenorhabditis elegans eva-1 gene Proteins 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
【発明の詳細な説明】
本発明は自己相関係数、相互相関係数の相関係数、又は
共分散係数等の相関係数と等価な係数(以下、「相関係
数」と総称する)を直接的に用いて音声信号のスペクト
ラム情報を分析し、前記分析データを用いて音声信号を
合成するピッチ励振型の音声分析合成装置、例えば自己
相関ボコーダ、相互相関ボコーダ等のボコーダ装置及び
相関係数から変換される係数を用いて音声信号のスペク
トラム情報を分析し、前記分析データを用いて音声信号
を合成するピッチ励振型の音声分析合成装置、例えば線
形予測型ボコーダ装置1こ関し、殊1こ高周囲雑音環境
下1こおける音声などの分析合成1こ実施して好適な合
成音声を得るための音声分析合成装置1こ係るものであ
る。DETAILED DESCRIPTION OF THE INVENTION The present invention uses coefficients equivalent to correlation coefficients such as autocorrelation coefficients, cross-correlation coefficients, or covariance coefficients (hereinafter collectively referred to as "correlation coefficients"). A pitch-excitation type speech analysis and synthesis device that directly analyzes spectrum information of a speech signal and synthesizes a speech signal using the analysis data, such as a vocoder device such as an autocorrelation vocoder or a cross-correlation vocoder, and a correlation coefficient. Regarding a pitch excitation type voice analysis and synthesis device, for example, a linear predictive vocoder device 1, which analyzes spectrum information of a voice signal using coefficients converted from , and synthesizes a voice signal using the analysis data, there is one particular feature. This invention relates to a speech analysis and synthesis apparatus for analyzing and synthesizing speech and the like in a high ambient noise environment to obtain a suitable synthesized speech.
従来、この種の音声分析合成装置]こおいては分析側か
ら合成側へ被分析波形の情報を伝達するために、前記相
関係数を直接的1こ用いて分析されたスペクトラム情報
、又は相関係数から変換される係数を用いて分析された
スペクトラム情報の外1こ、ピッチ周期情報、有声無声
判別情報、短時間平均電力情報等を伝達していた。[Conventionally, this type of speech analysis/synthesis device] In order to transmit information of the analyzed waveform from the analysis side to the synthesis side, spectrum information analyzed using the correlation coefficient directly or correlation coefficient is used. In addition to spectrum information analyzed using coefficients converted from relational coefficients, pitch period information, voiced/unvoiced discrimination information, short-term average power information, etc. were transmitted.
周知のよう1こ、相関係数を用いて分析された雑音の混
入した音声信号の周波数スペクトラムは、雑音成分を抑
圧し、あるいは平担化し音声信号成分の極太周波数成分
(例えばホルマント周波数)を強調する性質がある。As is well known, the frequency spectrum of a noisy audio signal analyzed using a correlation coefficient can be analyzed by suppressing or flattening the noise component and emphasizing the extremely thick frequency components (for example, formant frequencies) of the audio signal component. There is a tendency to
例エバ第1 図+こおいて、スペクトラムエンベロープ
101を持つ音声信号に、スペクトラムエンベロープ1
02を持つ雑音が混入した信号のスペクトラムエンベロ
ープは本来は103のよう1こ表現されるが、相関係数
を用いて例えば全極型モデルで表現された場合1こは特
fこ104のよう1こ、より音声そのもののスペクトラ
ムに近い形1こ変形される。Example Eva 1 In Figure 1, an audio signal with a spectrum envelope 101 has a spectrum envelope 1
The spectrum envelope of a signal mixed with noise having a value of 02 is originally expressed as 103, but if it is expressed using a correlation coefficient, for example, with an all-pole model, 1 is specially expressed as 104. This is transformed into a shape that more closely resembles the spectrum of the voice itself.
前記スペクトラムの変形1こよる効果の一つ1こ、例え
ば、短時間平均電力のほぼ等しい雑音と音声信号との混
合信号を分析した場合、分析結果から表現されるホルマ
ントの帯域幅が、音声そのもののホルマントの帯域幅と
、殆んど違わない場合も存在する事がある。One of the effects of the spectrum modification 1 is that, for example, when analyzing a mixed signal of a speech signal and noise with approximately equal short-term average power, the formant bandwidth expressed from the analysis result is similar to that of the speech itself. There may be cases where the band width of the formant is almost the same as that of the formant.
通常、この種の音声分析合成装置lこおける合成音声の
振幅は、はぼホルマントの帯域幅と、合成合成を生成す
るためのフィルタEこ加えられる励振音源の振幅等1こ
よって決定される。Normally, the amplitude of the synthesized speech in this type of speech analysis and synthesis apparatus is determined by the bandwidth of the formant, the amplitude of the excitation sound source added to the filter E for generating the synthesis, etc.
従来、この種の音声分析合成装置1こおいては、例えば
入力音声信号レベルが、周囲雑音を無視し得る程度1こ
大きい場合1こは、その合成音声の振幅再現性は良好で
ある。Conventionally, in this type of speech analysis and synthesis apparatus 1, when the input speech signal level is one level higher than ambient noise, for example, the amplitude reproducibility of the synthesized speech is good.
しかし入力音声信号レベルが周囲雑音レベルと同程度、
あるいは周囲雑音レベル以下の場合(こ、合成音声を生
成するためのフィルタ(こ加えられる励振音源の振幅は
、周囲雑音電力(こ強く影響されるのにもかかわらず、
ホルマントの帯域内が入力音声信号そのもののホルマン
トの帯域内(こ近いため、合成音声の振幅は異状1こ増
大し振幅再現性は著しく劣化する。However, if the input audio signal level is about the same as the ambient noise level,
Alternatively, if the amplitude of the excitation sound source added to the filter for generating synthesized speech is below the ambient noise level (although it is strongly influenced by the ambient noise power),
Since the formant band is close to the formant band of the input audio signal itself, the amplitude of the synthesized speech increases by an abnormality and the amplitude reproducibility deteriorates significantly.
一般1こ、周囲雑音レベルは短時間(例えば数10m5
EC〜数5EC)ではそれほど変化しないが、音声信号
レベルは短時間内1こ激しく変化することが多い。Generally speaking, the ambient noise level is limited for a short period of time (for example, several tens of meters5)
EC to several 5 EC), it does not change much, but the audio signal level often changes drastically within a short period of time.
例えば有声音定常部と有声音語尾とではレベルが大きく
異なる。For example, the level differs greatly between a voiced sound stationary part and a voiced sound ending.
そのため、この種の音声分析合成装置1こおいては、例
えば比較的1こレベルの大きな音声音定常部よりも、レ
ベルの小いさく4声音語尾付近が相対的1こ強調され、
聴覚的1こ著しく自然性を損なうという欠点を有し′C
いた。Therefore, in this type of speech analysis and synthesis device 1, for example, a low-level four-tone sound near the end is relatively emphasized by 1, rather than a relatively large voice sound stationary part of a 1-level.
It has the disadvantage of significantly impairing the naturalness of the auditory sense.
there was.
本発明の目的は、入力信号から計測される相関係数を用
いて音声の分析合成を行なう音声分析合成装置1こおい
て、周囲雑音の短時間平均電力と、周囲雑音の混入した
音声信号の短時間平均電力とを計測し、前記周囲雑音の
混入した音声信号の短時間平均電力から、周囲雑音の短
時間平均電力を差引き、前記差引かれた電力を用いて励
振音源の振幅を決定すること1こより、上記欠点を除去
し、良好な合成音声を再現し得る装置を提供すること1
こある。An object of the present invention is to provide a speech analysis and synthesis device 1 that analyzes and synthesizes speech using a correlation coefficient measured from an input signal, and uses the short-term average power of ambient noise and the output of a speech signal mixed with ambient noise. measuring the short-time average power, subtracting the short-time average power of the ambient noise from the short-term average power of the audio signal mixed with the ambient noise, and determining the amplitude of the excitation sound source using the subtracted power. Therefore, it is an object of the present invention to provide a device capable of reproducing good synthesized speech by eliminating the above-mentioned drawbacks.
There it is.
次fこ本発明の第1の実施例を図面を参照して説明する
。Next, a first embodiment of the present invention will be described with reference to the drawings.
第2図は本発明の第1の実施例を説明するためのブ爾ツ
ク図である。FIG. 2 is a book diagram for explaining the first embodiment of the present invention.
第2図1こおいて一点鎖線201で囲んだ部分は本発明
の構成を示す。In FIG. 2, a portion surrounded by a dashed line 201 shows the structure of the present invention.
点線202で囲んだ部分は本発明1こよる音声分析合成
装置の分析側を、点線203で囲んだ部分は同様に合成
側を示す。The part surrounded by a dotted line 202 shows the analysis side of the speech analysis and synthesis apparatus according to the first invention, and the part surrounded by a dotted line 203 similarly shows the synthesis side.
音声信号源205と雑音源206とからの音波信号は音
響信号−電気信号変換器A20γへ入力される。The sound wave signals from the audio signal source 205 and the noise source 206 are input to the acoustic signal-to-electrical signal converter A20γ.
音響信号−電気信号変換器A20γは音波信号を電気的
信号1こ変換し、雑音の混入した音声信号として、相関
係数スペクトラム情報分析器208と有声無声判別器2
09とピッチ抽出器210と短時間平均電力計測器A2
11とへ変換された電気的信号を出力する。The acoustic signal-to-electrical signal converter A20γ converts the sound wave signal into one electrical signal, and converts it into a noise-containing audio signal, which is sent to the correlation coefficient spectrum information analyzer 208 and the voiced/unvoiced discriminator 2.
09, pitch extractor 210, and short-time average power meter A2
11 and outputs the converted electrical signal.
相関係数型スペクトラム情報分析器208は雑音の混入
した音声信号を相関係数を用いて直接的、又11間接的
1こスペクトラム分析し、分析結果を符号化器212へ
出力する。The correlation coefficient type spectrum information analyzer 208 performs direct or indirect spectrum analysis on the noise-containing audio signal using correlation coefficients, and outputs the analysis result to the encoder 212.
有声無声判別器209は雑音の混入した音声信号の有声
無声判定を行ない、判定結果を符号化器212へ出力す
る。The voiced/unvoiced discriminator 209 determines whether the voice signal containing noise is voiced or not, and outputs the determination result to the encoder 212 .
ピッチ抽出器210は雑音の混入した音声信号から自己
相関法等のピンチ抽出手法1こよりピンチ周期の抽出を
行ない、ピッチ周期情報を符号化器212へ出力する。A pitch extractor 210 extracts a pinch period from a noise-containing audio signal using a pinch extraction method 1 such as an autocorrelation method, and outputs pitch period information to an encoder 212.
短時間平均電力計測器A212は雑音の混入した音声信
号の短時間平均電力を計測し、計測結果を音声電力計測
器213へ出力する。The short-time average power measuring device A212 measures the short-term average power of the audio signal mixed with noise, and outputs the measurement result to the audio power measuring device 213.
雑音源206からの音波信号は音響信号−電気信号変換
器B214へ入力される。The acoustic signal from the noise source 206 is input to the acoustic signal-to-electrical signal converter B214.
音響信号−電気信号変換器B214は音波信号を電気的
信号1こ変換し、雑音信号として、短時間平均電力計測
器B215へ変換された電気的信号を出力する。The acoustic signal-to-electrical signal converter B214 converts the sonic signal into one electric signal, and outputs the converted electric signal as a noise signal to the short-time average power meter B215.
短時間平均電力計測器B215は雑音信号の短時間平均
電力を計測し、計測結果を音声電力計測器213へ出力
する。The short-time average power measuring device B215 measures the short-time average power of the noise signal and outputs the measurement result to the audio power measuring device 213.
音声電力計測器213は短時間平均電力計測器A211
から供給される雑音の混入した音声信号の短時間平均電
力と、短時間平均電力計測器B215から供給される雑
音信号の短時間平均電力との差の電力を計測し、計測結
果を音声電力として符号化器212へ出力する。The audio power measuring device 213 is a short-time average power measuring device A211.
The difference between the short-time average power of the noise-containing audio signal supplied from the B215 and the short-time average power of the noise signal supplied from the short-time average power meter B215 is measured, and the measurement result is used as the audio power. Output to encoder 212.
符号化器212G1相関係数型スペクトラム情報分析器
208から供給される分析結果と、有声無声判別器20
9から供給される判定結果と、ピッチ抽出器210から
供給されるピッチ周期情報と、音声電力計測器213か
ら供給される音声電力とを各々量子化し、多重化し、多
重化された信号を伝送路204を介して復号化器216
へ出力する。The analysis result supplied from the encoder 212G1 correlation coefficient type spectrum information analyzer 208 and the voiced/unvoiced discriminator 20
9, the pitch period information supplied from the pitch extractor 210, and the audio power supplied from the audio power measuring device 213 are each quantized and multiplexed, and the multiplexed signals are sent to the transmission path. decoder 216 via 204
Output to.
復号化器216は多重化された信号を分離、復号し、復
号スペクトラム情報をスペクトラム情報伝送路211へ
、復号有声無声判別情報を有声無声判別情報伝送路21
8へ、復号ヒ”ノチ周期情報をピッチ伝送路219へ、
復号音声電力情報を電力情報伝送路220へ出力する。The decoder 216 separates and decodes the multiplexed signal, sends the decoded spectrum information to the spectrum information transmission line 211, and sends the decoded voiced/unvoiced discrimination information to the voiced/unvoiced discrimination information transmission line 21.
8, decoded high-notch cycle information to pitch transmission line 219,
The decoded audio power information is output to the power information transmission path 220.
音源信号発生器221は有声無声判別情報伝送路218
から供給される復号有声無声判別情報と、ピッチ伝送路
219から供給される復号ピッチ周期情報と、電力情報
伝送路220から供給される復号音声電力情報とから例
えばダツドレーのボコーダとして知られるチャンネルボ
コーダの励振信号発生手法1こより励振信号を発生し、
励振信号を励振信号伝送路222へ出力する。The sound source signal generator 221 is connected to the voiced/unvoiced discrimination information transmission line 218
From the decoded voiced/unvoiced discrimination information supplied from the decoded voiced/unvoiced discrimination information, the decoded pitch cycle information supplied from the pitch transmission line 219, and the decoded voice power information supplied from the power information transmission line 220, a channel vocoder known as a Datsudray vocoder, for example, is used. Excitation signal generation method 1 Generate an excitation signal from
The excitation signal is output to the excitation signal transmission path 222.
音声合成フィルタ223はスペクトラム情報伝送路21
7を介して供給される復号スペクトラム情報Eこよりフ
ィルタ係数を決定し、励振信号伝送路222を介して供
給される励振信号1こより励振され、その出力は合成音
声信号として波形出力端子224へ出力される。The speech synthesis filter 223 is connected to the spectrum information transmission path 21
The filter coefficients are determined from the decoded spectrum information E supplied via the excitation signal transmission line 222, and the output is outputted as a synthesized audio signal to the waveform output terminal 224. Ru.
次に本発明の第2の実施例を図面を参照して説明する。Next, a second embodiment of the present invention will be described with reference to the drawings.
第3図は本発明の第2の実施例を説明するためのブロッ
ク図である。FIG. 3 is a block diagram for explaining a second embodiment of the present invention.
第3図は本発明1こおける分析側の構成の一部を示し、
一点鎖線301で囲んだ部分は本発明の構成を示す。FIG. 3 shows a part of the configuration of the analysis side in the first aspect of the present invention,
A portion surrounded by a dashed line 301 shows the configuration of the present invention.
第3図はいわゆるブレストーク型音声通信装置Iこ本発
明を適用した場合の実施例を示す。FIG. 3 shows an embodiment in which the present invention is applied to a so-called breath-talk type voice communication device I.
音声信号源302が音声を発生するときは、制御信号入
力端子304には必らず送話信号が入力される。When the audio signal source 302 generates audio, a transmitting signal is always input to the control signal input terminal 304.
また制御信号入力端子304へ送話中断信号が入力され
るときには音声信号源302は音声を発生しない。Furthermore, when the speech interruption signal is input to the control signal input terminal 304, the audio signal source 302 does not generate audio.
制御信号入力端子304へ送話中断信号が入力されてい
る場合1こは、雑音源303からの音波信号は音響信号
−電気信号変換器305へ入力される。When the speech interrupt signal is input to the control signal input terminal 304, the sound wave signal from the noise source 303 is input to the acoustic signal-electrical signal converter 305.
音響信号−電気信号変換器305は音波信号を電気的信
号1こ変換し短時間平均電力計測器306へ出力する。The acoustic signal-to-electrical signal converter 305 converts the sonic signal into an electrical signal and outputs it to the short-time average power measuring device 306 .
短時間平均電力計測器306は雑音の短時間平均電力を
計測し、記憶器301へ出力する。The short-time average power measuring device 306 measures the short-time average power of noise and outputs it to the memory 301.
記憶器30γは制御信号入力端子304を介して送話中
断信号が入力される場合fこは前記短時間平均電力を記
憶する。The storage device 30γ stores the short-time average power when the speech interruption signal is inputted through the control signal input terminal 304.
制御信号入力端子304へ送話信号が入力されている場
合には、音声信号源302と雑音源303とからの音波
信号が音響信号−電気信号変換器305へ入力される。When a transmission signal is input to the control signal input terminal 304, the sound wave signals from the audio signal source 302 and the noise source 303 are input to the acoustic signal-electrical signal converter 305.
音響信号−電気信号変換器305は音波信号を電気的信
号(こ変換し短時間平均電力計測器306へ出力する。The acoustic signal-to-electrical signal converter 305 converts the acoustic signal into an electrical signal and outputs it to the short-time average power meter 306.
短時間平均電力計測器306は雑音の混入した音声信号
の短時間平均電力を計測し、音声電力計測器308へ出
力する。The short-time average power measuring device 306 measures the short-term average power of the audio signal mixed with noise and outputs it to the audio power measuring device 308 .
記憶器307は制御信号入力端子304を介して送話信
号が入力される場合1こはあらかじめ記憶された雑音の
短時間平均電力を音声電力計測器308へ出力する。When a transmitting signal is input through the control signal input terminal 304, the memory 307 outputs the short-term average power of the noise stored in advance to the voice power meter 308.
音声電力計測器308は短時間平均電力計測器306か
ら供給される雑音の混入した音声信号の短時間平均電力
から記憶器307から供給される雑音の短時間平均電力
に定数aを掛けたものを引き、音声電力信号を発生し、
音声電力信号出力端子309へ出力する。The audio power meter 308 calculates the short-term average power of the noise-containing audio signal supplied from the short-term average power meter 306 multiplied by the short-term average power of the noise supplied from the memory 307 by a constant a. and generate an audio power signal,
It is output to the audio power signal output terminal 309.
音声電力信号出力端子309は第1の実施例における第
2図fこ示す音声電力計測器、213の出力端に対応す
る。The audio power signal output terminal 309 corresponds to the output terminal of the audio power measuring device 213 shown in FIG. 2f in the first embodiment.
なお上記定数aは雑音レベルの短時間変動率を考慮して
決定される。Note that the constant a is determined in consideration of the short-term fluctuation rate of the noise level.
本発明1こよる音声分析合成装置は、周囲雑音の短時間
平均電力と、周囲雑音が混入した音声信号の短時間平均
電力とを計測し、前記二種類の短時間平均電力の差を計
測することにより、音声信号本来の短時間平均電力を求
め、これ1こより励振音源の振幅を決定する。The speech analysis and synthesis device according to the first aspect of the present invention measures the short-term average power of ambient noise and the short-term average power of an audio signal mixed with ambient noise, and measures the difference between the two types of short-term average power. By doing this, the short-term average power of the audio signal is obtained, and from this, the amplitude of the excitation sound source is determined.
従って、分析ホルマントの帯域中があまり周囲雑音の影
響を受けない性質を有する相関係数を用いて音声信号の
スペクトラム情報を分析する音声分析合成装置の合成音
声の振幅再現性を改善するという効果がある。Therefore, it is possible to improve the amplitude reproducibility of synthesized speech by a speech analysis and synthesis device that analyzes spectrum information of speech signals using a correlation coefficient whose analysis formant band is not affected by ambient noise. be.
第1図は相関係数を用いたスペクトラム分析の特性を説
明するための波形図、第2図及び第3図は本発明の実施
例を示すブロック図である。
202・・・・・・分析側、203・・・・・・合成側
、204・・・・・・伝送路、205,302・・・・
・・音声信号源、206.303・・・・・・雑音源、
207,214゜305・・・・・・音響信号−電気信
号変換器、208・・・・・・相関係数型スペクトラム
情報分析器、209・・・・・・有声無声判別器、21
0・・・・・・ピッチ抽出器、211.215,306
・・・・・・短時間平均電力計測器、212・・・・・
・符号化器、213,308・・・・・・音声電力計測
器、216・・・・・・復号化器、217・・・・・・
スペクトラム情報伝送路、218・・・・・・有声無声
判別情報伝送路、219・・・・・・ピッチ伝送路、2
20・・・・・・電力情報伝送路、221・・・・・・
音源信号発生器、222・・・・・・励振信号伝送路、
223・・・・・・音声合成フィルタ、30γ・・・・
・・記憶器。FIG. 1 is a waveform diagram for explaining characteristics of spectrum analysis using correlation coefficients, and FIGS. 2 and 3 are block diagrams showing embodiments of the present invention. 202... Analysis side, 203... Synthesis side, 204... Transmission path, 205, 302...
...Audio signal source, 206.303...Noise source,
207,214゜305...Acoustic signal-electrical signal converter, 208...Correlation coefficient type spectrum information analyzer, 209...Voiced/unvoiced discriminator, 21
0...Pitch extractor, 211.215,306
...Short-time average power meter, 212...
- Encoder, 213, 308...Audio power measuring device, 216...Decoder, 217...
Spectrum information transmission line, 218... Voiced/unvoiced discrimination information transmission line, 219... Pitch transmission line, 2
20...Power information transmission line, 221...
Sound source signal generator, 222...excitation signal transmission line,
223...Speech synthesis filter, 30γ...
...Memory device.
Claims (1)
される周波数スペクトラムおよび音源情報を示すパラメ
ータを計測し、このパラメータ値1こより諸係数および
励振入力が定まる合成フィルタ1こよって音声を合成す
る音声分析合成装置1こおいて、前記入力信号の短時間
平均電力を計測する第1の計測手段と、前記周囲雑音成
分の短時間平均電力を計測する第2の計測手段と、前記
第1の計測手段の出力と第2の計測手段の出力の差を出
力する第3の計測手段とを備え、この第3の計測手段の
出力を前記合成フィルタの励振音源情報としたことを特
徴とする音声分析合成装置。 2、特許請求の範囲第1項記載の音声分析合成装置1こ
おいて、前記第1の計測手段と第2の計測手段が、外部
からの制御信号1こより前記入力信号と前記周囲雑音成
分の短時間平均電力を各々時間的1こ別個Eこ計測する
計測手段から戊り、前記第3の計測手段が、前記計測手
段で得られた前記入力信号の短時間平均電力と前記周囲
雑音成分の短時間平均電力との差を計測する手段である
ことを特徴とする音声分析合成装置。[Claims] 1. A synthesis filter 1 that measures parameters indicating frequency spectrum and sound source information measured from an input signal consisting of an audio signal including ambient noise, and determines various coefficients and excitation input from the parameter values 1. In the speech analysis and synthesis device 1 for synthesizing speech, a first measuring means for measuring the short-term average power of the input signal, a second measuring means for measuring the short-term average power of the ambient noise component, and a third measuring means for outputting a difference between the output of the first measuring means and the output of the second measuring means, and the output of the third measuring means is used as excitation sound source information of the synthesis filter. Features of speech analysis and synthesis equipment. 2. In the speech analysis and synthesis device 1 according to claim 1, the first measuring means and the second measuring means measure the input signal and the ambient noise component from an external control signal 1. The third measuring means measures the short-term average power of the input signal obtained by the measuring means and the ambient noise component. A speech analysis and synthesis device characterized in that it is a means for measuring a difference from a short-time average power.
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP53037495A JPS5850357B2 (en) | 1978-03-30 | 1978-03-30 | Speech analysis and synthesis device |
| CA324,405A CA1123955A (en) | 1978-03-30 | 1979-03-29 | Speech analysis and synthesis apparatus |
| US06/236,428 US4360708A (en) | 1978-03-30 | 1981-02-20 | Speech processor having speech analyzer and synthesizer |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP53037495A JPS5850357B2 (en) | 1978-03-30 | 1978-03-30 | Speech analysis and synthesis device |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JPS54133003A JPS54133003A (en) | 1979-10-16 |
| JPS5850357B2 true JPS5850357B2 (en) | 1983-11-10 |
Family
ID=12499098
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP53037495A Expired JPS5850357B2 (en) | 1978-03-30 | 1978-03-30 | Speech analysis and synthesis device |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JPS5850357B2 (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2017183430A1 (en) * | 2016-04-22 | 2017-10-26 | 株式会社オートネットワーク技術研究所 | Conduction path with noise filter |
| DE112017002164B4 (en) | 2016-04-25 | 2023-02-09 | Autonetworks Technologies, Ltd. | Line path with noise filter |
-
1978
- 1978-03-30 JP JP53037495A patent/JPS5850357B2/en not_active Expired
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2017183430A1 (en) * | 2016-04-22 | 2017-10-26 | 株式会社オートネットワーク技術研究所 | Conduction path with noise filter |
| DE112017002110B4 (en) | 2016-04-22 | 2022-12-15 | Autonetworks Technologies, Ltd. | Line path with noise filter |
| DE112017002164B4 (en) | 2016-04-25 | 2023-02-09 | Autonetworks Technologies, Ltd. | Line path with noise filter |
Also Published As
| Publication number | Publication date |
|---|---|
| JPS54133003A (en) | 1979-10-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Wang et al. | An objective measure for predicting subjective quality of speech coders | |
| JP4740260B2 (en) | Method and apparatus for artificially expanding the bandwidth of an audio signal | |
| Harma et al. | A comparison of warped and conventional linear predictive coding | |
| Rabiner | Digital‐Formant Synthesizer for Speech‐Synthesis Studies | |
| JP4302978B2 (en) | Pseudo high-bandwidth signal estimation system for speech codec | |
| KR20010014352A (en) | Method and apparatus for speech enhancement in a speech communication system | |
| JP2903533B2 (en) | Audio coding method | |
| Steeneken et al. | Basics of the STI measuring method | |
| JP4654621B2 (en) | Voice processing apparatus and program | |
| JPS5850357B2 (en) | Speech analysis and synthesis device | |
| JPH08160993A (en) | Sound analysis-synthesizer | |
| JPS58211797A (en) | Band split type vocoder | |
| KR930011736B1 (en) | Pitch Control Method of Waveform Coding and Hybrid Coding by Pitch Half Method of Speech Signal | |
| Heute | Speech-transmission quality: aspects and assessment for wideband vs. narrowband signals | |
| JPS5941597B2 (en) | Speech analysis and synthesis device | |
| JPS60260100A (en) | Voice synthesizer | |
| JPH0235998B2 (en) | ||
| Alku | COMPARISON OF WIDEBAND EARPIECE INTEGRATIONS IN MOBILE PHONE | |
| Hyrkäs | Laajakaistaisten matkapuhelinkuulokkeiden integroinnin vertailu | |
| Park | Signal Enhancement of a Variable Rate Vocoder with a Hybrid domain SNR Estimator | |
| GB2266213A (en) | Digital signal coding | |
| JPS58211795A (en) | Linear forecast type voice analyzer/synthesizer | |
| JPS61236600A (en) | Pattern matching vocoder | |
| Safee et al. | Speech Processing in Digital Signal Processing | |
| Bush | Time-frequency resolution in speech analysis and synthesis |