Decoding ambisonic signals to irregular quad loudspeaker configuration based on hybrid ANN and modified tabu search

Authors: P. W. M. Tsang (1) eewmtsan@cityu.edu.hk
K. W. K. Cheung (1)
A. C. S. Leung (1)
Keywords: Ambisonic; Artificial neural network; Modified tabu search; Heuristic genetic algorithm
Journal: Neural Computing & Applications
Publication year: 2011
Publication date: October 2011
Volume: 20
Issue: 7
Pages: 983-991
Full-text size: 2.0 MB
Author affiliation: 1. Department of Electronic Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong SAR
Journal category: Computer Science
Journal subject: Simulation and Modeling
Publisher: Springer London
ISSN: 1433-3058

Abstract

Past research has shown that a first-order B-format ambisonic signal can be used to partially reconstruct the original sound field through a set of arbitrarily positioned loudspeakers. This is achieved by setting the gain of each loudspeaker to the weighted sum of the three components of the B-format signal. The weighting factors (the decoding parameters) have been successfully derived with the Modified Tabu Search (MTS), and later with the Heuristic Genetic Algorithm (HGA), which provides higher precision and stability. Despite these favorable outcomes, both methods require a large number of iterations and lengthy computation. In this paper, we propose a scheme that overcomes this problem by integrating Neural Network Estimation (NNE) with the MTS. Compared to HGA, the new approach is about two orders of magnitude faster while attaining similar precision in determining the decoding parameters.
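
The weighted-sum decoding described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' decoder: it assumes the three B-format components are the horizontal set W, X, Y with the traditional 1/sqrt(2) weighting on W, and the function names, the loudspeaker angles, and the decoding parameters are all hypothetical placeholders rather than values produced by MTS, HGA, or NNE.

```python
import math

def encode_b_format(sample, azimuth_rad):
    """Encode a mono sample arriving from a horizontal azimuth into (W, X, Y).

    Assumes the traditional first-order convention with W scaled by 1/sqrt(2).
    """
    w = sample / math.sqrt(2.0)          # omnidirectional component
    x = sample * math.cos(azimuth_rad)   # front-back component
    y = sample * math.sin(azimuth_rad)   # left-right component
    return w, x, y

def decode(w, x, y, params):
    """Return one gain per loudspeaker as g_i = a_i*W + b_i*X + c_i*Y.

    `params` is a list of (a_i, b_i, c_i) tuples, one per loudspeaker.
    """
    return [a * w + b * x + c * y for (a, b, c) in params]

# Hypothetical irregular quad layout (degrees), for illustration only.
speaker_azimuths = [math.radians(d) for d in (30.0, 110.0, -110.0, -30.0)]

# Placeholder decoding parameters: a simple projection onto each loudspeaker
# direction. These are NOT optimized values; a search such as MTS or HGA
# would replace them with parameters tuned for the actual layout.
params = [(1.0, math.cos(phi), math.sin(phi)) for phi in speaker_azimuths]

# Decode a unit-amplitude source located at the first loudspeaker's azimuth.
w, x, y = encode_b_format(1.0, math.radians(30.0))
gains = decode(w, x, y, params)
```

With these placeholder parameters each gain reduces to 1/sqrt(2) + cos(theta - phi_i), so the loudspeaker closest to the source direction receives the largest gain. Finding the (a_i, b_i, c_i) values that give good localization on an irregular layout is the optimization problem the paper addresses.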
