Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding
Abstract
The paper presents a complete framework for the hybrid representation of audio and speech signals that can be used in coding applications. The parameterization approach is based on a three-part model (sinusoids, transients and noise). The essential contributions of the paper are: (i) a precise mathematical solution to the problem of instantaneous harmonic parameter estimation that can be applied to nonstationary (amplitude- and frequency-modulated) signals. The instantaneous harmonic parameters (magnitude, frequency and phase) are calculated by narrow-band filtering of the signal. A method for synthesizing frequency-modulated filters with a closed-form impulse response is proposed. The frequency bounds of each filter can be determined while tracking the component frequencies and can be adjusted to follow modulations of the fundamental frequency; (ii) a practical technique for instantaneous harmonic analysis, together with a numerical evaluation of its performance; (iii) a new transient parameterization scheme based on matching pursuit with a frame-based, psychoacoustically optimized wavelet packet dictionary. The most relevant coefficients are chosen by maximizing the match between the auditory excitation scalograms of the original and modeled signals; (iv) an application of the resulting hybrid analysis system to speech and audio signals in order to validate the proposed methods.
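The idea in (i), reading magnitude, frequency and phase off the output of a narrow-band filter, can be illustrated with a minimal Python sketch. It is not the paper's method: it assumes a fixed (not frequency-modulated) pass band and uses a Blackman-windowed sinc filter shifted to the band center, so that only the positive-frequency part of one harmonic passes. The function name `estimate_harmonic_parameters`, its parameters and the default filter length are hypothetical choices made for this illustration.

```python
import numpy as np


def estimate_harmonic_parameters(x, fs, f_lo, f_hi, half_len=400):
    """Estimate instantaneous amplitude, frequency (Hz) and phase (rad)
    of the single component of real signal x lying in [f_lo, f_hi] Hz.

    Illustrative assumption: a stationary complex band-pass FIR filter,
    not the frequency-modulated closed-form filter from the paper.
    """
    fc = 0.5 * (f_lo + f_hi)   # band center, Hz
    b = 0.5 * (f_hi - f_lo)    # half bandwidth, Hz
    k = np.arange(-half_len, half_len + 1)

    # Windowed-sinc low-pass prototype with cutoff b, normalized to unit DC gain.
    lowpass = np.sinc(2.0 * b * k / fs) * np.blackman(k.size)
    lowpass /= lowpass.sum()

    # Shift the prototype to fc. The factor 2 compensates for discarding the
    # negative-frequency half of the real input signal.
    h = 2.0 * lowpass * np.exp(2j * np.pi * fc * k / fs)

    # The filter is symmetric around k = 0, so mode="same" gives zero delay.
    y = np.convolve(x, h, mode="same")

    amplitude = np.abs(y)
    phase = np.unwrap(np.angle(y))
    # Instantaneous frequency is the time derivative of the phase.
    frequency = np.gradient(phase) * fs / (2.0 * np.pi)
    return amplitude, frequency, phase


# Usage: a 440 Hz tone with amplitude 0.7, analyzed in a 390-490 Hz band.
fs = 8000.0
n = np.arange(4000)
x = 0.7 * np.cos(2.0 * np.pi * 440.0 * n / fs + 0.3)
amplitude, frequency, phase = estimate_harmonic_parameters(x, fs, 390.0, 490.0)
```

The first and last `half_len` output samples are affected by the signal boundaries and should be ignored. Following a modulated harmonic would require moving the band limits over time, which is the part the paper addresses with its frequency-modulated filters.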
