The Comparison of the Effect of Haimming Window and Blackman Window in the Time-Scaling and Pitch-Shifting Algorithms

Article Preview

Abstract:

The real-time pitch shifting process is widely used in various types of music production. The pitch shifting technology can be divided into two major types, the time domain type and the frequency domain type. Compared with the time domain method, the frequency domain method has the advantage of large shifting scale, low total cost of computing and the more flexibility of the algorithm. However, the use of Fourier Transform in frequency domain processing leads to the inevitable inherent frequency leakage effects which decrease the accuracy of the pitch shifting effect. In order to restrain the side effect of Fourier Transform, window functions are used to fall down the spectrum-aliasing. In practical processing, Haimming Window and Blackman Window are frequently used. In this paper, we compare both the effect of the two window functions in the restraint of frequency leakage and the performance and accuracy in subjective based on the traditional phase vocoder[1]. Experiment shows that Haimming Window is generally better than Blackman Window in pitch shifting process.

You have full access to the following eBook

Info:

Periodical:

Pages:

221-225

Citation:

Online since:

September 2011

Export:

Share:

Citation:

[1] J. Laroche, Time and pitch scale modification of audio signals, in Applications of Digital Signal Processing to Audio and Acoustics,M. Kahrs and K. Brandenburg, Eds. Kluwer, Norwell, MA, (1998).

DOI: 10.1007/0-306-47042-x_7

Google Scholar

[2] J.L. Flanagan and R.M. Golden, Phase vocoder, Bell Syst. Tech. J., vol. 45, p.1493–1509, Nov (1966).

DOI: 10.1002/j.1538-7305.1966.tb01706.x

Google Scholar

[3] J. B. Allen and L. R. rabiner, A unified approach to short-time Fourier analysis and synthesis, Proc. IEEE, vol. 65, no. 11, p.1558–1564, Nov. (1977).

DOI: 10.1109/proc.1977.10770

Google Scholar

[4] R. Portnoff, Time-scale modifications of speech based on short-time Fourier analysis, IEEE Trans. Acoust., Speech, Signal Processing, vol. 29, no. 3, p.374–390, (1981).

DOI: 10.1109/tassp.1981.1163581

Google Scholar

[5] M.S. Puckette, Phase-locked vocoder, in Proc. IEEE ASSPWorkshop on app. of sig. proc. to audio and acous., New Paltz, NY, (1995).

Google Scholar

[6] J. Laroche and M. Dolson, Improved phase vocoder time-scale modification of audio, to appear in May issue of IEEE trans. speech and audio proc., (1999).

DOI: 10.1109/89.759041

Google Scholar

[7] L.B. Almeida and F.M. Silva, Variable-frequency synthesis: an improved harmonic coding scheme, in Proc. IEEE Int. Conf. Acoust., Speech, Signal processing, 1984, p.27. 5. 1–27. 5. 4.

DOI: 10.1109/icassp.1984.1172489

Google Scholar

[8] R. J. McAulay and T. F. Quatieri, Speech analysis/synthesis based on a sinusoidal representation, IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, no. 4, p.744–754, Aug (1986).

DOI: 10.1109/tassp.1986.1164910

Google Scholar

[9] X. Serra and J. Smith, Spectral modeling synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition, Computer Music J., vol. 14, no. 4, p.12–24, Winter (1990).

DOI: 10.2307/3680788

Google Scholar

[10] E. B. George and M. J. T. Smith, Analysis-bysynthesis/ Overlap-add sinusoidal modeling applied to the analysis and synthesis of musical tones, J. Audio Eng. Soc., vol. 40, no. 6, p.497–516, (1992).

Google Scholar

[11] S. Tassart and P. Depalle, Analytical approximations of fractional delays: Lagrange interpolators and allpass filters, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Munich, Germany, (1997).

DOI: 10.1109/icassp.1997.599673

Google Scholar

[12] T.I. Laakso, V. Valimaki, M. Karjalainen, and U. KLaine, Splitting the unit delay [fir/all pass filters design], IEEE Signal Processing mag., vol. 13, no. 1, p.30–60, Jan (1996).

DOI: 10.1109/79.482137

Google Scholar