Speaker Localization Based on Phase Detection of Vowel Fundamental Frequencies

Article Preview

Abstract:

Research on human-robot interaction has recently been getting an increasing amount of attention. In the research field of human-robot interaction, speech signal processing in particular is the source of much interest. In this paper, we present experiment of speaker localization using a microphone array and an ITD (Interaural Time Difference) method which finds the sound source by phase shift of two signals. Band pass filters are designed to get vowel fundamental frequencies. Phase sensitive detectors are applied to measure the phase differences of voice signal of different microphones. All circuits are constructed by FPAA (field programmable analog array). The proposed system can output speaker location in real-time with low power.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 433-440)

Pages:

6490-6496

Citation:

Online since:

January 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] J. Sohn, N. S. Kim, and W. Sung, A Statistical model-based voice activity detection, presented at the IEEE Signal Processing Letters, vol. 6, no. 1, pp.1-3, (1999).

Google Scholar

[2] T. Nishiura, T. Yamada, S. Nakamura, and K. Shikano, Localization of multiple sound sources based on a CSP analysis with a microphone array, presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1053-1056, (2000).

DOI: 10.1109/icassp.2000.859144

Google Scholar

[3] C. H. Knapp and G. C. Carter, The generalized correlation method for estimation of time delay, presented at the IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, no. 4, pp.320-327, (1976).

DOI: 10.1109/tassp.1976.1162830

Google Scholar

[4] M. Brandstein and D. Ward. Microphone Arrays: Signal Processing Techniques and Applications. Springer. 1st edition. pp.157-180, (2001).

DOI: 10.1007/978-3-662-04619-7

Google Scholar

[5] M.S. Brandstein and H. Silverman, A robust method for speech signal time-delay estimation in reverberant rooms, presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.375-378, Munich, Germany, April (1997).

DOI: 10.1109/icassp.1997.599651

Google Scholar

[6] G. C. Carter, Time delay estimation for passive sonar signal processing, IEEE Trans. Acoust., Speech, Signal Process., vol. 29, no. 3, pp.463-470, Jun. (1981).

DOI: 10.1109/tassp.1981.1163560

Google Scholar

[7] H. Silverman, Some analysis of microphone arrays for speech data acquisition, IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 35, no. 12, pp.1699-1712, (1987).

DOI: 10.1109/tassp.1987.1165098

Google Scholar

[8] AN231E04 Datasheet Rev 1. 1 Dynamically Reconfigurable dpASP, www. anadigm. com.

Google Scholar

[9] Jinshan GAO, Shijie WANG, Phase Sensitive Detection Based on FPAA, The International Conference on Electrical Engineering and Automatic Control, ICEEAC2010, Zibo, Shangdong, P R of China, unpublished.

Google Scholar