Combination of SVM and Score Normalization for Person Identification Based on Audio-Visual Feature Fusion

Article Preview

Abstract:

Aimed at the problem of low accuracy rate for face recognition and speaker recognition in noisy environment, a multi-biometric model fusing face features and speech features is presented by combining Normalization and SVM theory based on the research of feature level fusion. Face features and speech features are first extracted by pulse coupled neural network and VQ-SVM respectively. Then the distance between tested people and template people is calculated after getting the fused feature on the feature level fusion. In order to reduce the computational cost and improve the recognition performance, matching distance is normalized and finally recognized by SVM. Experiment on the ORL database show that even when the signal to noise ratio is declined, recognition rate of the fused system is clearly higher than the single system under noisy environment and the purpose of identity recognition is achieved.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

236-240

Citation:

Online since:

January 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Lu-ping Ji, Zhang Yi and Li-feng Shang. An improved pulse coupled neural network for image processing [J]. Neural Comput & Applic, (2008) 17: 255-263.

DOI: 10.1007/s00521-007-0119-5

Google Scholar

[2] Kazuhiro Hotta. Local normalized linear summation kernel for fast and robust recognition [J]. Pattern Recognition, 2010, 43(1): 906-913.

DOI: 10.1016/j.patcog.2009.09.009

Google Scholar

[3] Peipei, Yin Fu-chun, Sun Chao, Wang Hua-ping, Liu An adaptive feature fusion framework for multi-class classification based on SVM[J]. Journal of System Simulation, 2007, 19(10): 2272-2275.

Google Scholar

[4] Nazmeen Bibi Boodoo, R K Subramanian. Robust Multi-biometric Recognition using Face and Ear Image [J]. International Journal of Computer Science and Information Security, 2009, 6(2): 164-169.

Google Scholar

[5] Sree Hari Krishnan Parthasarathi, Mathew Magimai. -Doss. Evaluating the robustness of privacy- sensitive audio feature for speech detection in personal audio log scenarios [R]. Switzerland: Idiap Research Institute, 2010. 2.

DOI: 10.1109/icassp.2010.5495596

Google Scholar

[6] Mingxing He, Shi-Jinn Horng, Pingzhi Fan , Ray-Shine Run, Rong-Jian Chen, Jui-Lin Lai. Performance evaluation of score level fusion in multimodal biometric systems [J]. Pattern Recognition, 2010, 43(1): 1789-1800.

DOI: 10.1016/j.patcog.2009.11.018

Google Scholar

[7] Lee, J. -S. & Park. Robust audio-visual speech recognition based on late Integration [J]. IEEE Transactions on Multimedia, 2008, 10(5): 767-779.

DOI: 10.1109/tmm.2008.922789

Google Scholar

[8] Information on http: /www. IDIAP. com.

Google Scholar