Voice Activity Detection with Decision Trees in Noisy Environments

Da Li Hu; Liang Zhong Yi; Zheng Pei; Bing Luo

doi:10.4028/www.scientific.net/AMM.128-129.749

Abstract:

An improved scheme based on the double-threshold method is proposed for robust endpoint detection in noisy environments. First, the distribution of the zero crossing rate (ZCR) over the preprocessed signal is taken into account, and the speech signal is divided into parts so that decision trees can select appropriate thresholds on the basis of that distribution. Then the double-threshold method, which weights energy and ZCR differently depending on the situation, is applied in each case to determine whether the input segment is speech or non-speech. Simulation results indicate that the proposed method with decision trees yields more accurate results than the traditional double-threshold method.
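
For orientation, the traditional double-threshold baseline that the abstract compares against can be sketched roughly as follows. This is a simplified illustration, not the authors' method: it omits the decision-tree threshold selection entirely, and the function names, frame length, hop size, and threshold values are all assumptions chosen for the example.

```python
import math


def frame_signal(samples, frame_len=256, hop=128):
    """Split a sequence of samples into overlapping frames (sizes are assumed)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Sum of squared samples in one frame."""
    return sum(x * x for x in frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def double_threshold_vad(samples, energy_high, energy_low, zcr_thresh,
                         frame_len=256, hop=128):
    """Label each frame True (speech) or False (non-speech).

    Simplified double-threshold rule: frames whose energy exceeds
    energy_high seed speech segments; each segment is then extended
    outward while the neighbouring frame's energy exceeds energy_low
    or its ZCR exceeds zcr_thresh.
    """
    frames = frame_signal(samples, frame_len, hop)
    energy = [short_time_energy(f) for f in frames]
    zcr = [zero_crossing_rate(f) for f in frames]
    labels = [e > energy_high for e in energy]

    def extendable(i):
        return energy[i] > energy_low or zcr[i] > zcr_thresh

    # Extend each segment forward in time.
    for i in range(1, len(labels)):
        if labels[i - 1] and not labels[i] and extendable(i):
            labels[i] = True
    # Extend each segment backward in time.
    for i in range(len(labels) - 2, -1, -1):
        if labels[i + 1] and not labels[i] and extendable(i):
            labels[i] = True
    return labels


# Synthetic input: silence, a 440 Hz tone at an assumed 8 kHz sample rate, silence.
tone = [math.sin(2 * math.pi * 440 * n / 8000) for n in range(1024)]
signal = [0.0] * 1024 + tone + [0.0] * 1024
labels = double_threshold_vad(signal, energy_high=100.0, energy_low=10.0,
                              zcr_thresh=0.5)
```

In this sketch the thresholds are fixed by hand. The abstract's contribution, as described, is to choose them per signal region from the ZCR distribution using decision trees, which is not reproduced here.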

Info:

Periodical: Applied Mechanics and Materials (Volumes 128-129)
Pages: 749-752
DOI: https://doi.org/10.4028/www.scientific.net/AMM.128-129.749

Online since: October 2011
Authors: Da Li Hu, Liang Zhong Yi, Zheng Pei, Bing Luo
Keywords: Density, Distribution, Threshold, Voice Activity Detection (VAD), Zero Crossing Rate (ZCR)
