Complex shading efficiently for ray tracing on GPU
  • Authors: Xin Yang (1)
    Duan-qing Xu (2)
    Lei Zhao (2)
    Bing Yang (3)

    1. College of Computer Science, Dalian University of Technology, Dalian, China
    2. College of Computer Science, Zhejiang University, Hangzhou, China
    3. School of Computer Science, Hangzhou Dianzi University, Hangzhou, China
  • Keywords: Shading; GPU; Ray tracing
  • Journal: Multimedia Tools and Applications
  • Year of publication: 2015
  • Publication date: February 2015
  • Volume: 74
  • Issue: 3
  • Pages: 1091-1106
  • Full-text size: 667 KB
  • Journal category: Computer Science
  • Journal subjects: Multimedia Information Systems
    Computer Communication Networks
    Data Structures, Cryptology and Information Theory
    Special Purpose and Application-Based Systems
  • Publisher: Springer Netherlands
  • ISSN: 1573-7721
Abstract
Complex shading is often associated with long shaders and heavy data access. To obtain good performance on current-generation GPU hardware, algorithms are needed that manage data, schedule threads more efficiently, and organize memory access across the GPU memory hierarchy. In this paper, we propose an approach that accelerates rendering with complex shaders by analyzing and sorting shading jobs according to their complexity and potential memory access. We show that by sorting these shading jobs at three levels of the memory hierarchy and reorganizing thread blocks according to complexity, all shading jobs are scheduled in order, which significantly improves cache utilization and GPU hardware utilization, especially where performance suffers from heavy branching. All sorting is done on the CPU, which has ample control logic, and can be processed very efficiently compared with the expensive compaction operation on the GPU. Our experiments with this hierarchy demonstrate improvements over SIMD packet tracing with compaction on the GPU.
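
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of sorting shading jobs on the CPU before they are grouped into warps. Everything in it is an assumption made for illustration: the `ShadingJob` fields, the `cost` estimate, the use of `texture_id` as a stand-in for memory locality, and the single composite sort key (the paper's three-level sort over the memory hierarchy is not reproduced here).

```python
from dataclasses import dataclass

WARP_SIZE = 32  # threads that run in lockstep on NVIDIA GPUs


@dataclass(frozen=True)
class ShadingJob:
    """Hypothetical record for one hit point waiting to be shaded."""
    ray_id: int
    shader_id: int   # which shader (code path) the hit point needs
    cost: int        # assumed complexity estimate for that shader
    texture_id: int  # assumed proxy for the memory region it will read


def sort_jobs(jobs):
    """CPU-side sort: expensive shaders first, then by shader, then by
    texture, so neighbours run the same code and touch nearby data."""
    return sorted(jobs, key=lambda j: (-j.cost, j.shader_id, j.texture_id))


def pack_warps(jobs, warp_size=WARP_SIZE):
    """Cut the job list into consecutive warp-sized groups."""
    return [jobs[i:i + warp_size] for i in range(0, len(jobs), warp_size)]


def divergent_warps(warps):
    """Count warps that would have to execute more than one shader."""
    return sum(1 for w in warps if len({j.shader_id for j in w}) > 1)


# Illustrative workload: 128 hits that cycle through four shaders.
SHADER_COST = [10, 40, 20, 80]  # made-up per-shader cost estimates
jobs = [
    ShadingJob(ray_id=i, shader_id=i % 4, cost=SHADER_COST[i % 4],
               texture_id=(i // 4) % 8)
    for i in range(128)
]

unsorted_divergence = divergent_warps(pack_warps(jobs))
sorted_divergence = divergent_warps(pack_warps(sort_jobs(jobs)))
```

In this toy workload every warp mixes all four shaders before sorting and none does afterwards, which is the effect on branching that the abstract attributes to ordered scheduling. Real workloads rarely divide this evenly, so some warps at shader boundaries would normally remain mixed.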
