Retrieval Oriented Robust Audio Hashing

De Long Cui; Jing Long Zuo

doi:10.4028/www.scientific.net/AMR.121-122.854

Paper Titles

Modal Analysis of the Cable-Stayed Space Truss Combining ANSYS and MSC.ADAMS
p.832

Research on Robot Pose Control System Based on MMA7455
p.838

Booth Array Multiplier Based on Adiabatic Computing
p.843

The Research for Extension Software Data Structure
p.849

Retrieval Oriented Robust Audio Hashing
p.854

Open Loop Stability on Hybrid Synchronous Motor
p.860

Research on ALA+SPM Rotor Synchronous Motor Direct Torque Control
p.866

Development of Multilayer Situation Decision Support System and its Applications
p.872

Research on an Electro-Mechanical Hybrid Power Controller
p.878

HomeAdvanced Materials ResearchAdvanced Materials Research Vols. 121-122Retrieval Oriented Robust Audio Hashing

Retrieval Oriented Robust Audio Hashing

Abstract:

Aiming at content-based audio retrieval (CBAR) applications, a robust audio hashing scheme is proposed. First the audio is divided to frame by fixed length and then low-frequent and high-frequent components are obtained by three-level lifting-based wavelet transformation in every frame. Secondly the audio frame is approximately represented as a product of a base matrix and an encoding matrix, or coefficient matrix, using non-negative matrix factorization (NMF). Finally the sum of each column in the coefficient matrix is calculated, which is then quantized to produce one bit of the hash sequence. Experiment results show that the proposed scheme is robust against Mp3 compression, Real compression, filtering, amplitude compression, equalization, echo, etc. It is insensitive to small local change, and therefore is suitable for distinguishing different audios.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Advanced Materials Research (Volumes 121-122)

Pages:

854-859

DOI:

https://doi.org/10.4028/www.scientific.net/AMR.121-122.854

Citation:

Cite this paper

Online since:

June 2010

Authors:

De Long Cui, Jing Long Zuo

Keywords:

Audio Digest, Audio Hash, Audio Retrieval, Content-Based Audio Retrieval, Lifting-Based Wavelet, Non-Negative Matrix Factorization

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

References

[1] T. Kalker, J. Haitsma, J. Oostveen, in: Proc. InternationalWorkshop on Content Based Multimedia Indexing (CBMI '01), edtied by IEEE Int, (2001) in press.

Google Scholar

[2] M. K. Mihc¸ ak, R. Venkatesan, in: Proc. Information Hiding, Pittsburgh, Pa, USA, (2001) in press.

Google Scholar

[3] L. Lu, H. Jiang, H. J. Zhang, in: Proc. ACM Multimedia (MM'01), Ottawa, Canada, (2001) in press.

Google Scholar

[4] J. T. Foote, in: Multimedia Storage and Archiving Systems II, vol. 3229 of Proceedings of SPIE, Dallas, Tex, USA, (1997)p.138.

Google Scholar

[5] B. Logan, in: Proc. 1st International Symposium on Music Information Retrieval (ISMIR '00), Plymouth, Mass, USA, (2000) in press.

Google Scholar

[6] T. Zhang ,C. C. J. Kuo, in: Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP '99), Phoenix, Ariz, USA, (1999) in press.

Google Scholar

[7] J. Haitsma, T. Kalker, J. Oostveen, in: Second International Workshop on Content Based Multimedia and Indexing. Brescia, Italy: CBMI, (2001) in press.

Google Scholar