Zhang et al., 2015 - Google Patents
Gain factor linear prediction based decision-directed method for the a priori SNR estimationZhang et al., 2015
- Document ID
- 13548897396199697684
- Author
- Zhang W
- Ou S
- Shen S
- Gao Y
- Publication year
- Publication venue
- 2015 8th International Congress on Image and Signal Processing (CISP)
External Links
Snippet
The performance of a noisy speech enhancement algorithm depends mainly on the accuracy of the a priori signal-to-noise ratio (SNR) estimate. The decision-directed (DD) algorithm for estimating the a priori SNR has received lots of attention due to its good …
- 238000004088 simulation 0 abstract description 6
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Interconnection arrangements not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for suppressing echoes or otherwise conditioning for one or other direction of traffic
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10049678B2 (en) | System and method for suppressing transient noise in a multichannel system | |
| Suhadi et al. | A data-driven approach to a priori SNR estimation | |
| Krueger et al. | Model-based feature enhancement for reverberant speech recognition | |
| Ma et al. | Perceptual Kalman filtering for speech enhancement in colored noise | |
| US9875748B2 (en) | Audio signal noise attenuation | |
| US7885810B1 (en) | Acoustic signal enhancement method and apparatus | |
| Martín-Doñas et al. | Dual-channel DNN-based speech enhancement for smartphones | |
| Wang et al. | Deep neural network based supervised speech segregation generalizes to novel noises through large-scale training | |
| Blouet et al. | Evaluation of several strategies for single sensor speech/music separation. | |
| Chen et al. | Study of the noise-reduction problem in the Karhunen–Loève expansion domain | |
| Hendriks et al. | Adaptive time segmentation for improved speech enhancement | |
| Bavkar et al. | PCA based single channel speech enhancement method for highly noisy environment | |
| Zhang et al. | Gain factor linear prediction based decision-directed method for the a priori SNR estimation | |
| Rao et al. | Low-complexity neural speech dereverberation with adaptive target control | |
| Tupitsin et al. | Two-step noise reduction based on soft mask for robust speaker identification | |
| Chazan et al. | LCMV beamformer with DNN-based multichannel concurrent speakers detector | |
| Sun et al. | A variable momentum factor algorithm for a priori SNR estimation in speech enhancement | |
| Hepsiba et al. | Computational intelligence for speech enhancement using deep neural network | |
| Shen et al. | A priori SNR estimator based on a convex combination of two DD approaches for speech enhancement | |
| Lee et al. | Multi-stage speech enhancement for automatic speech recognition | |
| Ji et al. | A priori SAP estimator based on the magnitude square coherence for dual-channel microphone system | |
| Dahlan et al. | Unbiased noise estimator for Q-spectral subtraction based speech enhancement | |
| Wu et al. | Convolutional Recurrent Neural Network With Attention Gates For Real-time Single-channel Speech Enhancement | |
| Prodeus et al. | Objective estimation of the quality of radical noise suppression algorithms | |
| Taghia et al. | A negentropy based adaptive line enhancer for single-channel noise reduction at low SNR conditions |