Efficient Implementation of 2-D FCT with Reduced Memory Access for Programmable DSPs
详细信息    查看全文
  • 作者:Xiangyang Liu (1)
    Hua Bao (2)

    1. Department of Computer Science
    ; Anhui Normal University ; Wuhu ; China
    2. Mobile and Wireless Group
    ; Broadcom Corporation ; Matawan ; NJ ; USA
  • 关键词:2 ; D fast cosine transform ; Memory access ; DSP
  • 刊名:The Journal of VLSI Signal Processing
  • 出版年:2015
  • 出版时间:August 2015
  • 年:2015
  • 卷:80
  • 期:2
  • 页码:153-161
  • 全文大小:577 KB
  • 参考文献:1. Ahmed, N (1974) Discrete cosine transfom. IEEE Transactions on Computers C-23: pp. 90-93 CrossRef
    2. Pennebaker, W, Mitchell, J (1993) JPEG still image data compression standard. Kluwer Academic Publishers, Boston
    3. Mitchell, J (1997) MPEG video compression standard. Kluwer Academic Publishers, Boston CrossRef
    4. Oizumi, M (2006) Preprocessing method for DCT-based image-compression. IEEE Transactions on Consumer Electronics 52: pp. 1021-1026 CrossRef
    5. Wang, C (2003) A DCT-based MPEG-2 transparent scrambling algorithm. IEEE Transactions on Consumer Electronics 49: pp. 1208-1213 CrossRef
    6. Rao, K, Yip, P (1990) Discrete cosine transform: algorithms, advantages, applications vol. 4. Academic, London CrossRef
    7. M. Wezelenburg, "General radix 2 DCT and DST algorithms," in / Proceedings of the International Conference on ECCTD'97, Budapest, Hungary, 1997, pp. 789鈥?94.
    8. Chan, Y-H, Siu, W-C (1993) Mixed-radix discrete cosine transform. IEEE Transactions on Signal Processing 41: pp. 3157-3161 CrossRef
    9. Tatsaki, A (1995) Prime-factor DCT algorithms. IEEE Transactions on Signal Processing 43: pp. 772-776 CrossRef
    10. Wu, Z (2009) An improved scaled DCT architecture. IEEE Transactions on Consumer Electronics 55: pp. 685-689 CrossRef
    11. W. Yuan, / et al., "Matrix Factorization for Fast DCT Algorithms," in / 2006 I.E. International Conference on Acoustics, / Speech and Signal Processing, 2006, pp. 948鈥?51.
    12. Hsu, H, Liu, C (2008) Fast radix-q and mixed-radix algorithms for type-IV DCT,. IEEE Signal Processing Letters 15: pp. 910-913 CrossRef
    13. Park, J, Roy, K (2008) A low complexity reconfigurable DCT Architecture to trade off image quality for power consumption. Journal of Signal Processing Systems 53: pp. 399-410 CrossRef
    14. A. Prasoon and K. Rajan, "4脳 4 2-D DCT for H. 264/AVC," in / International Conference on Advances in Computing, / Communication and Control (ICAC3), 2009, pp. 573鈥?77.
    15. Makhoul, J (1980) A fast cosine transform in one and two dimensions. IEEE Transactions on Acoustics, Speech and Signal Processing 28: pp. 27-34 CrossRef
    16. Chan, S, Ho, K (1991) A new two-dimensional fast cosine transform algorithm. IEEE Transactions on Signal Processing 39: pp. 481-485 CrossRef
    17. Feig, E, Winograd, S (1992) On the multiplicative complexity of discrete cosine transforms. IEEE Transactions on Information Theory 38: pp. 1387-1391 CrossRef
    18. Christopoulos, C (1995) The vector-radix fast cosine transform: Pruning and complexity analysis. Signal Processing 43: pp. 197-205 CrossRef
    19. S. An and C. Wang, "A recursive algorithm for 2-D DCT," in / International Symposium on Signal, / Systems and Electronics, 2007, pp. 335鈥?38.
    20. M. Tian, / et al., "A new fast algorithm for 8脳 8 2-D DCT and its VLSI implementation," in / IEEE International Workshop on VLSI Design and Video Technology, 2005, pp. 179鈥?82.
    21. Ramaswamy, SV, Miller, GD (1993) Multiprocessor DSP architectures that implement the FCT based JPEG still picture image compression algorithm with arithmetic coding. IEEE Transactions on Consumer Electronics 39: pp. 1-5 CrossRef
    22. X. Liu, "Memory access reduction method for efficient implementation of fast cosine transform Pruning on DSP," in / Acoustics Speech and Signal Processing (ICASSP), / 2010 I.E. International Conference on, 2010, pp. 1490鈥?493.
    23. Skodras, AN, Constantinides, AG (1991) Efficient input-reordering algorithms for fast DCT. Electronics Letters 27: pp. 1973-1975 CrossRef
  • 刊物类别:Engineering
  • 刊物主题:Electrical Engineering
    Circuits and Systems
    Computer Imaging, Vision, Pattern Recognition and Graphics
    Computer Systems Organization and Communication Networks
    Signal,Image and Speech Processing
    Mathematics of Computing
  • 出版者:Springer New York
  • ISSN:1939-8115
文摘
In this paper, we present a novel memory access reduction scheme (MARS) for two-dimension fast cosine transform (2-D FCT). It targets programmable DSPs with high memory-access latency. It reduces the number of memory accesses by: 1) reducing the number of weighting factors and 2) combining butterflies in vector-radix 2-D FCT pruning diagram from two stages to one stage with an efficient structure. Hardware platform based on general purpose processor is used to verify the effectiveness of the proposed method for vector-radix 2-D FCT pruning implementation. Experimental results validate the benefits of the proposed method with reduced memory access, less clock cycle and fewer memory space compared with the conventional implementation.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700