Zhang et al., 2015 - Google Patents

Gain factor linear prediction based decision-directed method for the a priori SNR estimation

Zhang et al., 2015

Document ID: 13548897396199697684
Author: Zhang W; Ou S; Shen S; Gao Y
Publication year: 2015
Publication venue: 2015 8th International Congress on Image and Signal Processing (CISP)

External Links

Cited by

Snippet

The performance of a noisy speech enhancement algorithm depends mainly on the accuracy of the a priori signal-to-noise ratio (SNR) estimate. The decision-directed (DD) algorithm for estimating the a priori SNR has received lots of attention due to its good …

Continue reading at ieeexplore.ieee.org (other versions)

238000004088 simulation 0 abstract description 6

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Interconnection arrangements not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for suppressing echoes or otherwise conditioning for one or other direction of traffic

Similar Documents

Publication	Publication Date	Title
US10049678B2 (en)	2018-08-14	System and method for suppressing transient noise in a multichannel system
Suhadi et al.	2010	A data-driven approach to a priori SNR estimation
Krueger et al.	2010	Model-based feature enhancement for reverberant speech recognition
Ma et al.	2004	Perceptual Kalman filtering for speech enhancement in colored noise
US9875748B2 (en)	2018-01-23	Audio signal noise attenuation
US7885810B1 (en)	2011-02-08	Acoustic signal enhancement method and apparatus
Martín-Doñas et al.	2017	Dual-channel DNN-based speech enhancement for smartphones
Wang et al.	2015	Deep neural network based supervised speech segregation generalizes to novel noises through large-scale training
Blouet et al.	2008	Evaluation of several strategies for single sensor speech/music separation.
Chen et al.	2009	Study of the noise-reduction problem in the Karhunen–Loève expansion domain
Hendriks et al.	2006	Adaptive time segmentation for improved speech enhancement
Bavkar et al.	2013	PCA based single channel speech enhancement method for highly noisy environment
Zhang et al.	2015	Gain factor linear prediction based decision-directed method for the a priori SNR estimation
Rao et al.	2025	Low-complexity neural speech dereverberation with adaptive target control
Tupitsin et al.	2016	Two-step noise reduction based on soft mask for robust speaker identification
Chazan et al.	2018	LCMV beamformer with DNN-based multichannel concurrent speakers detector
Sun et al.	2014	A variable momentum factor algorithm for a priori SNR estimation in speech enhancement
Hepsiba et al.	2022	Computational intelligence for speech enhancement using deep neural network
Shen et al.	2016	A priori SNR estimator based on a convex combination of two DD approaches for speech enhancement
Lee et al.	2016	Multi-stage speech enhancement for automatic speech recognition
Ji et al.	2015	A priori SAP estimator based on the magnitude square coherence for dual-channel microphone system
Dahlan et al.	2019	Unbiased noise estimator for Q-spectral subtraction based speech enhancement
Wu et al.	2021	Convolutional Recurrent Neural Network With Attention Gates For Real-time Single-channel Speech Enhancement
Prodeus et al.	2016	Objective estimation of the quality of radical noise suppression algorithms
Taghia et al.	2014	A negentropy based adaptive line enhancer for single-channel noise reduction at low SNR conditions