Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
JPH0145640B2 - - Google Patents
[go: Go Back, main page]

JPH0145640B2 - - Google Patents

Info

Publication number
JPH0145640B2
JPH0145640B2 JP56139575A JP13957581A JPH0145640B2 JP H0145640 B2 JPH0145640 B2 JP H0145640B2 JP 56139575 A JP56139575 A JP 56139575A JP 13957581 A JP13957581 A JP 13957581A JP H0145640 B2 JPH0145640 B2 JP H0145640B2
Authority
JP
Japan
Prior art keywords
speech
notch filter
memory
signal
speech segment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP56139575A
Other languages
Japanese (ja)
Other versions
JPS5842096A (en
Inventor
Seiichi Kotani
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Denso Ten Ltd
Original Assignee
Denso Ten Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Denso Ten Ltd filed Critical Denso Ten Ltd
Priority to JP56139575A priority Critical patent/JPS5842096A/en
Publication of JPS5842096A publication Critical patent/JPS5842096A/en
Publication of JPH0145640B2 publication Critical patent/JPH0145640B2/ja
Granted legal-status Critical Current

Links

Landscapes

  • Noise Elimination (AREA)

Description

【発明の詳細な説明】[Detailed description of the invention]

本発明は、音声合成システムの出力信号などの
音声信号の雑音抑圧方式に関する。 音声特にア、イ、ウなどの母音の波形は第1図
に示すようにある特定の周波数領域本例では1K
Hz,2KHz,3.6KHz近傍にエネルギが集中してい
る。これは声道の共振によるものでホルマントと
呼ばれ、これらの第1、第2、第3ホルマント
F1,F2,F3の間のデイツプd1,d2は声道の反共
振によるものでアンチホルマントと呼ばれる。こ
のようにホルマント、アンチホルマントを持つ信
号では、ホルマント部の信号対雑音比S/Nは充
分大きいがアンチホルマント部のS/Nは小さく
(これは雑音レベルを鎖線の如くとつてみるとd1
では余り差がなく、d2では雑音レベル以下になつ
ていることから容易に理解されよう)、綜合的に
S/Nが劣化する。この対策としてアンチホルマ
ント部をバンドエリミネートする(斜線部はこの
除去部分を示す)ノツチフイルタを音声合成シス
テムのデコーダの出力端に挿入する方法が考えら
れている(例えばT.S.Lamba etal:Intelligible
Voice Communication……,Radio and
Electronic Engineer,Vol.48,No.4)。この場合
は雑音成分だけでなく当該帯域の信号成分も失な
われるが、原信号のホルマント成分は保存されて
いるので音声品質の低下は少なく雑音特性を改善
できる(雑音がないように聞える)。 しかしながら音声の種類例えば母音の種類によ
りホルマント周波数は次表に示されるように大幅
に変化するから、ある種の音声に対して有効なノ
ツチフイルタも他の種類の音声に対してはかえつ
て品質を劣化することになる。
The present invention relates to a noise suppression method for a speech signal such as an output signal of a speech synthesis system. As shown in Figure 1, the waveforms of voices, especially vowels such as A, I, and U, are in a specific frequency range of 1K in this example.
Energy is concentrated near Hz, 2KHz, and 3.6KHz. This is due to vocal tract resonance and is called formant, and these first, second, and third formants
The dips d 1 and d 2 between F 1 , F 2 , and F 3 are due to antiresonance of the vocal tract and are called antiformants. In this way, in a signal with formant and antiformant, the signal-to-noise ratio S/N of the formant part is sufficiently large, but the S/N of the antiformant part is small .
(This can be easily understood from the fact that there is not much difference at d2 , and it is below the noise level at d2), and the S/N deteriorates overall. As a countermeasure to this problem, a method has been considered in which a notch filter that band-eliminates the antiformant part (the shaded part indicates the removed part) is inserted into the output terminal of the decoder of the speech synthesis system (for example, TSLamba etal: Intelligible
Voice Communication……,Radio and
Electronic Engineer, Vol.48, No.4). In this case, not only the noise component but also the signal component in the relevant band is lost, but the formant component of the original signal is preserved, so there is little deterioration in voice quality and the noise characteristics can be improved (it sounds like there is no noise). However, the formant frequency changes significantly depending on the type of voice, such as the type of vowel, as shown in the table below, so a notch filter that is effective for some types of voice will actually degrade the quality for other types of voice. I will do it.

【表】【table】

【表】 上記の欠点は前記文献記載のシステムに準じた
システムで色々な音声をメモリへ書込み、それを
読出してデコードするとき、ノツチフイルタの中
心周波数と帯域を変化した場合に聴感上で確認さ
れた。本発明はこのような障害を除くべく、中心
周波数と帯域を自動的に変更できるノツチフイル
タを設け、それぞれの音声に対応させて該中心周
波数と帯域を変更させて、音声の忠実度を保ちな
がら雑音特性を改善しようとするものである。次
に実施例を参照しながらこれを詳細に説明する。 第2図は波形符号化型の音声合成システムに本
発明を適用した例を示す。10は記憶部で、
ROM(読取り専用メモリ)で構成され、音声素
片を波形符号化して記憶する部分Aと該音声素片
に対して使用するノツチフイルタの特性制御情報
を記憶する部分Bとを有する。なお部分A,Bは
図面では一方はメモリの前半部、他方は後半部と
分離しておくように示しているが、これは説明上
で、実際のメモリシステム上の配置は適宜変更で
きる。12はマイクロプロセツサで構成される制
御部で、記憶部10の読出しアドレス信号などを
出力する。14はデコーダ部で、記憶部10の読
出しデジタル出力をデコードして音声アナログ信
号を出力する。16はノツチフイルタ部で、帯域
除去するその帯域の中心周波数および幅が前記制
御情報により変更される。例えば記憶部の部分A
に記憶されている音声素片群のうち第1図に示さ
れるような周波数スペクトル特性を持つ母音を読
出すとすると制御部12は、該母音の音声信号を
サンプリングしΔ変調等して得たデジタルデータ
群を記憶している部分Aのアドレス群を逐次出力
し、その読出し出力がデコーダ部14でデコード
されてアナログ音声信号となり、それがノツチフ
イルタ部16を通つて出力され、図示しない増幅
器を介してスピーカを駆動するが、上記制御部が
出力するアドレス群の先頭のもので部分Bに格納
されている該母音に関するノツチフイルタ部16
の特性制御情報、即ち該フイルタの中心周波数お
よび帯域を本例では第1図の斜線部の中央および
幅にする制御信号も読出され、ノツチフイルタ部
16を上記のように特性変更する。メモリに書込
む情報はデジタル情報であるから、フイルタ制御
信号がアナログ信号ならDA変換を行なう。なお
このノツチフイルタとしてはデジタルフイルタが
好ましい。またノツチフイルタは複数個設け、音
声素片の第1、第2……アンチホルマント部を除
去するようにしてもよく、この場合は上記特性制
御情報を複数個用意する。 一般の音声では無声期間50%、母音区間49%、
子音期間1%、というデータも文献に発表されて
おり、ノツチフイルタの特性制御は、音声を音
節、音韻あるいは母音、子音等と区別するとき、
音韻または母音単位で行なうのが効果的である。
またこのフイルタ部にはホルマント部の伸長機能
を持たせることも有効である。なお音声信号をΔ
変調等して記憶部分Aに音声素片群を書込むと
き、該音声信号をノツチフイルタに通してアンチ
ホルマント部除去を行なうことも考えられ、この
場合は再生に際してノツチフイルタ部16を通す
必要はないが、本発明方式はアンチホルマント部
除去を行なわずに音声素片群を書込まれているい
わば汎用音声メモリを用いての音声合成出力等に
有効である。 以上説明したように本発明によればメモリに記
憶されたア,イ……,カ,キ……などの音声素片
の群を用いて音質の良好な、雑音特性の優れた有
意音声を出力することができ、甚だ有効である。
[Table] The above drawbacks were audibly confirmed when the center frequency and band of the notch filter were changed when various sounds were written to memory and read out and decoded using a system similar to the system described in the above literature. . In order to eliminate such disturbances, the present invention provides a notch filter that can automatically change the center frequency and band, and changes the center frequency and band according to each voice, thereby eliminating noise while maintaining the fidelity of the voice. This is an attempt to improve the characteristics. Next, this will be explained in detail with reference to examples. FIG. 2 shows an example in which the present invention is applied to a waveform encoding type speech synthesis system. 10 is the storage section,
It is composed of a ROM (read-only memory) and has a part A for storing waveform encoded speech segments and a part B for storing characteristic control information of a notch filter used for the speech segments. In the drawings, parts A and B are shown so that one is separated from the first half of the memory and the other is separated from the latter half of the memory, but this is for illustrative purposes only, and the actual arrangement on the memory system can be changed as appropriate. Reference numeral 12 denotes a control section composed of a microprocessor, which outputs a read address signal for the storage section 10 and the like. A decoder section 14 decodes the read digital output of the storage section 10 and outputs an audio analog signal. 16 is a notch filter section, and the center frequency and width of the band to be removed are changed by the control information. For example, part A of the storage section
If a vowel having the frequency spectrum characteristics as shown in FIG. 1 is to be read out from a group of speech segments stored in The address group of the part A that stores the digital data group is sequentially outputted, and the read output is decoded by the decoder section 14 to become an analog audio signal, which is outputted through the notch filter section 16 and then sent through an amplifier (not shown). The notch filter unit 16 regarding the vowel stored in part B, which is the first address of the address group output by the control unit, drives the speaker.
Characteristic control information, ie, a control signal for setting the center frequency and band of the filter to the center and width of the hatched area in FIG. 1 in this example, is also read out, and the characteristics of the notch filter section 16 are changed as described above. Since the information written to the memory is digital information, if the filter control signal is an analog signal, DA conversion is performed. Note that a digital filter is preferable as this notch filter. Further, a plurality of notch filters may be provided to remove the first, second, . In general speech, the silent period is 50%, the vowel interval is 49%,
There is also data published in the literature that the consonant period is 1%, and the characteristic control of the notch filter is useful when distinguishing speech from syllables, phonemes, vowels, consonants, etc.
It is effective to do this on a phoneme or vowel basis.
It is also effective to provide this filter section with a function of expanding the formant section. Note that the audio signal is
When writing a group of speech segments into the storage section A by modulation, etc., it is also possible to pass the speech signal through a notch filter to remove the antiformant part. The method of the present invention is effective for speech synthesis output using a so-called general-purpose speech memory in which a group of speech segments is written without performing antiformant part removal. As explained above, according to the present invention, meaningful speech with good sound quality and excellent noise characteristics is output using a group of speech segments such as A, I..., Ka, Ki... stored in the memory. It can be done and is very effective.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は音声素片の周波数スペクトル密度特性
を示すグラフ、第2図は本発明の説明用ブロツク
図である。 図面で10はメモリ、14はデコード部、16
はノツチフイルタである。
FIG. 1 is a graph showing the frequency spectrum density characteristics of a speech segment, and FIG. 2 is a block diagram for explaining the present invention. In the drawing, 10 is a memory, 14 is a decoding section, 16
is a notsuchi filter.

Claims (1)

【特許請求の範囲】[Claims] 1 メモリに記憶させた多数の音声素片を選択的
に読み出し、デコードして音声信号を出力するシ
ステムの雑音抑圧方式において、各音声素片のア
ンチホルマント部除去用ノツチフイルタを設けて
その特性制御情報を各音声素片に対応させてメモ
リに記憶させておき、メモリからの各音声素片読
出しに際しては当該音声素片に対する特性制御情
報をも読出してそれを前記ノツチフイルタに加
え、こうして特性変更されたノツチフイルタに音
声素片読出し出力のデコード出力を通すことを特
徴とする音声信号の雑音抑圧方式。
1. In a noise suppression method for a system that selectively reads out a large number of speech segments stored in a memory, decodes them, and outputs a speech signal, a notch filter for removing the antiformant part of each speech segment is provided, and its characteristic control information is provided. is stored in memory in correspondence with each speech segment, and when each speech segment is read out from the memory, characteristic control information for the speech segment is also read out and added to the notch filter. A method for suppressing noise in a speech signal, characterized in that a decoded output of a speech segment readout output is passed through a notch filter.
JP56139575A 1981-09-04 1981-09-04 Noise depression system for voice signal Granted JPS5842096A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP56139575A JPS5842096A (en) 1981-09-04 1981-09-04 Noise depression system for voice signal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP56139575A JPS5842096A (en) 1981-09-04 1981-09-04 Noise depression system for voice signal

Publications (2)

Publication Number Publication Date
JPS5842096A JPS5842096A (en) 1983-03-11
JPH0145640B2 true JPH0145640B2 (en) 1989-10-04

Family

ID=15248451

Family Applications (1)

Application Number Title Priority Date Filing Date
JP56139575A Granted JPS5842096A (en) 1981-09-04 1981-09-04 Noise depression system for voice signal

Country Status (1)

Country Link
JP (1) JPS5842096A (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2056110C (en) * 1991-03-27 1997-02-04 Arnold I. Klayman Public address intelligibility system

Also Published As

Publication number Publication date
JPS5842096A (en) 1983-03-11

Similar Documents

Publication Publication Date Title
JP4132109B2 (en) Speech signal reproduction method and device, speech decoding method and device, and speech synthesis method and device
US5752223A (en) Code-excited linear predictive coder and decoder with conversion filter for converting stochastic and impulsive excitation signals
US5251261A (en) Device for the digital recording and reproduction of speech signals
US6240388B1 (en) Audio data decoding device and audio data coding/decoding system
US4703505A (en) Speech data encoding scheme
JPH0145640B2 (en)
JP4271588B2 (en) Encoding method and encoding apparatus for digital data
JP2734028B2 (en) Audio recording device
JPH05303399A (en) Audio time axis companding device
JP4826580B2 (en) Audio signal reproduction method and apparatus
JPS5880699A (en) Voice synthesizing system
JP2860991B2 (en) Audio storage and playback device
JPS5821798A (en) Voice reproducer
JPS6295595A (en) Voice response method
JP3949346B2 (en) Speech synthesis method and apparatus
JP4817949B2 (en) In-vehicle machine
JPS61252598A (en) Voice word editing system
JP4159927B2 (en) Digital audio decoder
JPH01197793A (en) Speech synthesizer
JPS5965897A (en) Residual signal encoding method
JPS58158695A (en) Voice modification method
JPH0449956B2 (en)
JPS5915300A (en) Audio recording and playback device
JPH0312320B2 (en)
JPS61118798A (en) Voice synthesizer