Decoding ambisonic signals to irregular quad loudspeaker configuration based on hybrid ANN and modified tabu search
  • Authors: P. W. M. Tsang (1) eewmtsan@cityu.edu.hk
    K. W. K. Cheung (1)
    A. C. S. Leung (1)
  • Keywords: Ambisonic; Artificial neural network; Modified tabu search; Heuristic genetic algorithm
  • Journal: Neural Computing & Applications
  • Publication year: 2011
  • Publication date: October 2011
  • Volume: 20
  • Issue: 7
  • Pages: 983-991
  • Author affiliation: 1. Department of Electronic Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong SAR
  • Journal category: Computer Science
  • Journal subject: Simulation and Modeling
  • Publisher: Springer London
  • ISSN: 1433-3058
Abstract
Past research has proven that a first-order B-format ambisonic signal can be used to partially reconstruct the original sound field through a collection of arbitrarily positioned loudspeakers. This is achieved by setting the gain of each loudspeaker to the weighted sum of the three components of the B-format signal. The weighting factors (a.k.a. the decoding parameters) have been successfully deduced with the Modified Tabu Search (MTS), and later with the Heuristic Genetic Algorithm (HGA), which provides higher precision and stability. Despite the favorable outcome, both methods involve a large number of iterations, and the computation time is lengthy. In this paper, we propose a scheme to overcome this problem based on the integration of Neural Network Estimation (NNE) and the MTS. Compared with HGA, the new approach is about two orders of magnitude faster while attaining similar precision in determining the decoding parameters.
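
The weighted-sum decoding described in the abstract can be illustrated with a minimal Python sketch. It assumes the three B-format components are the horizontal W, X and Y signals with the traditional 1/√2 scaling on W. The loudspeaker angles and the cardioid-style weights below are hypothetical placeholders chosen for illustration; they are not the optimized decoding parameters that MTS, HGA or the proposed NNE/MTS scheme would produce, and all function names are invented for this example.

```python
import math

def encode_b_format(sample, azimuth_rad):
    """Encode a mono sample at a given azimuth into horizontal B-format (W, X, Y).

    Assumes the traditional convention in which W is scaled by 1/sqrt(2).
    """
    w = sample / math.sqrt(2.0)          # omnidirectional component
    x = sample * math.cos(azimuth_rad)   # front-back component
    y = sample * math.sin(azimuth_rad)   # left-right component
    return w, x, y

def placeholder_params(speaker_azimuths_rad):
    """Return one hypothetical (a, b, c) weight triple per loudspeaker.

    These cardioid-style weights are placeholders, NOT optimized
    decoding parameters for an irregular layout.
    """
    return [(math.sqrt(2.0) / 2.0, 0.5 * math.cos(t), 0.5 * math.sin(t))
            for t in speaker_azimuths_rad]

def decode(b_format, params):
    """Compute each loudspeaker gain as the weighted sum a*W + b*X + c*Y."""
    w, x, y = b_format
    return [a * w + b * x + c * y for a, b, c in params]

# Hypothetical irregular quad layout (azimuths in degrees, 0 = front).
SPEAKERS_DEG = [30.0, 110.0, -110.0, -30.0]
PARAMS = placeholder_params([math.radians(d) for d in SPEAKERS_DEG])

# Decode a unit-amplitude source located at the first loudspeaker's azimuth.
feeds = decode(encode_b_format(1.0, math.radians(30.0)), PARAMS)
```

In this sketch the search problem addressed by the paper corresponds to choosing the twelve numbers in `PARAMS` (three weights for each of four loudspeakers); the placeholder weights simply give each loudspeaker a cardioid pickup pattern aimed along its own direction.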
