Scientific visualization, 3D computer animation, and virtual reality are three major research directions in computer graphics today, and their core technology is realistic 3D graphics. 3D real-time rendering is now widely applied in fields including the military, aerospace, aviation, medicine, geology, exploration, 3D games, and computer-aided design (CAD).
     The CPU (Central Processing Unit) is a general-purpose processor that must balance program execution, data-parallel computation, and generality. Most of its transistors are therefore devoted to large caches and complex control logic, and arithmetic units account for only a small share. Multi-core CPUs have appeared in recent years, but they have not improved transistor utilization. Meanwhile, driven by the game industry, GPU (Graphics Processing Unit) performance has been roughly doubling every year. To produce more realistic graphics, GPUs support increasingly complex operations. The single-precision floating-point throughput of a mainstream GPU is about ten times that of a contemporary CPU, and its external memory bandwidth is about five times as high. A GPU-based computing architecture also compares favorably with the CPU in cost and power consumption. The arrival of CUDA has in turn overcome many of the difficulties that traditional GPU-based ray tracing algorithms encountered.
     This dissertation analyzes GPU-based optimization of 3D real-time rendering in detail and proposes effective solutions and suggestions. The main research covers: GPU pipeline optimization and parallel computing performance optimization methods for 3D real-time rendering; traditional shadow algorithms; and the strengths and weaknesses of rasterization. The main work and contributions of this dissertation are as follows:
     Firstly, the technical architecture of the GPU and its graphics rendering pipeline are introduced, including super-pipelining, parallel computing, vertex processors, and fragment processors. Based on the differences between the GPU and the CPU, the principles behind GPU acceleration are analyzed, and the general-purpose computing model of CUDA, currently a mature framework, and its characteristics are discussed.
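As a rough illustration of the CUDA computing model mentioned above, the following sketch emulates a one-dimensional kernel launch sequentially in Python. It is not taken from the dissertation: the names `launch` and `saxpy_kernel` are hypothetical, and a real GPU would run the threads in parallel rather than in a loop. The only CUDA fact relied on is the usual global-index arithmetic, `blockIdx.x * blockDim.x + threadIdx.x`.

```python
def launch(kernel, grid_dim: int, block_dim: int, *args) -> None:
    """Sequentially emulate a 1-D launch: call the kernel once per (block, thread) pair.

    Hypothetical helper for illustration; a GPU executes these invocations in parallel.
    """
    for block_idx in range(grid_dim):
        for thread_idx in range(block_dim):
            kernel(block_idx, block_dim, thread_idx, *args)


def saxpy_kernel(block_idx, block_dim, thread_idx, n, a, x, y):
    # Same index arithmetic as blockIdx.x * blockDim.x + threadIdx.x in CUDA C.
    i = block_idx * block_dim + thread_idx
    if i < n:  # guard: the last block may contain threads past the end of the data
        y[i] = a * x[i] + y[i]


n = 10
x = [float(i) for i in range(n)]
y = [1.0] * n
block_dim = 4
grid_dim = (n + block_dim - 1) // block_dim  # ceil(n / block_dim) blocks
launch(saxpy_kernel, grid_dim, block_dim, n, 2.0, x, y)
print(y)
```

Each emulated thread handles exactly one element, which is the data-parallel style that the graphics pipeline and CUDA both favor.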
     Secondly, parallel computing and the rendering pipeline are studied as the two aspects of 3D real-time rendering optimization. For the rendering pipeline, the stages where bottlenecks commonly occur are analyzed, and an optimization method is put forward for each stage. In parallel computing, the optimization goal is to complete the calculation in the shortest time within the allowable error, so computational density and throughput become the key factors. Three parallel computing optimization principles are summarized for the CUDA architecture, laying a theoretical foundation for the 3D real-time rendering system.
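The abstract does not spell out the three optimization principles, so the sketch below only illustrates why computational density and throughput are linked. It uses a roofline-style bound, which is a standard estimate and not necessarily the dissertation's own analysis. The function names and the device figures are assumptions chosen for illustration.

```python
def arithmetic_intensity(flops: float, bytes_moved: float) -> float:
    """Computational density: floating-point operations per byte of memory traffic."""
    return flops / bytes_moved


def attainable_gflops(intensity: float, peak_gflops: float, bandwidth_gbs: float) -> float:
    """Roofline bound: throughput is capped by either peak compute or memory bandwidth."""
    return min(peak_gflops, intensity * bandwidth_gbs)


# Hypothetical device figures, chosen only for illustration.
PEAK_GFLOPS = 500.0
BANDWIDTH_GBS = 100.0

# SAXPY (y = a*x + y) on float32: 2 flops per element, 12 bytes moved
# (read x, read y, write y), so it is limited by memory bandwidth.
saxpy = arithmetic_intensity(2.0, 12.0)

# A kernel that reuses each loaded value many times, e.g. 64 flops per 8 bytes,
# has enough density to reach the compute peak.
dense = arithmetic_intensity(64.0, 8.0)

print(attainable_gflops(saxpy, PEAK_GFLOPS, BANDWIDTH_GBS))
print(attainable_gflops(dense, PEAK_GFLOPS, BANDWIDTH_GBS))
```

Under these assumed figures the low-density kernel is bounded by memory bandwidth and the high-density one by peak compute, which is why raising the amount of arithmetic per byte fetched is a common optimization target.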
     Thirdly, the concept and classification of shadows are introduced, the strategies of traditional rasterization-based shadow rendering are analyzed, and the deficiencies of various shadow algorithms are summarized. Rendering with ray tracing can reproduce the real world faithfully, but its drawback is that the computational cost is very high and the processing time is long. We therefore propose a GPU-based parallel PKD-Tree algorithm that partitions the 3D scene, tests rays of light for intersection with the objects in the scene, and renders real-time shadows, and we compare it with the traditional KD-Tree.
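The PKD-Tree itself is not described in this abstract, so the following is only a minimal sketch of the traditional idea it is compared against: a plain, sequential KD-Tree used to answer shadow-ray queries. Sphere primitives, the median split rule, and all names (`Sphere`, `Node`, `build`, `occluded`, `in_shadow`) are assumptions made for illustration and are not the dissertation's implementation.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Tuple

Vec = Tuple[float, float, float]


@dataclass
class Sphere:
    center: Vec
    radius: float

    def hit(self, o: Vec, d: Vec, t_max: float) -> bool:
        """True if the ray o + t*d (d normalized) meets the sphere for some t in (1e-4, t_max)."""
        oc = tuple(o[i] - self.center[i] for i in range(3))
        b = sum(oc[i] * d[i] for i in range(3))
        c = sum(v * v for v in oc) - self.radius ** 2
        disc = b * b - c
        if disc < 0.0:
            return False
        root = math.sqrt(disc)
        return any(1e-4 < t < t_max for t in (-b - root, -b + root))


@dataclass
class Node:
    axis: int = -1  # -1 marks a leaf
    split: float = 0.0
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    prims: Optional[List[Sphere]] = None


def build(prims: List[Sphere], depth: int = 0, max_depth: int = 8, leaf_size: int = 2) -> Node:
    """Split at the median center along the widest axis; straddling spheres go to both sides."""
    if len(prims) <= leaf_size or depth >= max_depth:
        return Node(prims=prims)
    axis = max(range(3), key=lambda a: max(s.center[a] for s in prims)
               - min(s.center[a] for s in prims))
    split = sorted(s.center[axis] for s in prims)[len(prims) // 2]
    left = [s for s in prims if s.center[axis] - s.radius <= split]
    right = [s for s in prims if s.center[axis] + s.radius >= split]
    if len(left) == len(prims) or len(right) == len(prims):  # split made no progress
        return Node(prims=prims)
    return Node(axis, split, build(left, depth + 1, max_depth, leaf_size),
                build(right, depth + 1, max_depth, leaf_size))


def occluded(node: Node, o: Vec, d: Vec, t0: float, t1: float, t_light: float) -> bool:
    """Any-hit traversal of the ray segment [t0, t1]; stops at the first blocker found."""
    if node.axis < 0:
        return any(s.hit(o, d, t_light) for s in node.prims)
    a = node.axis
    below = o[a] < node.split or (o[a] == node.split and d[a] <= 0.0)
    near, far = (node.left, node.right) if below else (node.right, node.left)
    if d[a] == 0.0:  # ray parallel to the split plane: it never crosses
        return occluded(near, o, d, t0, t1, t_light)
    t_split = (node.split - o[a]) / d[a]
    if t_split > t1 or t_split <= 0.0:
        return occluded(near, o, d, t0, t1, t_light)
    if t_split < t0:
        return occluded(far, o, d, t0, t1, t_light)
    return (occluded(near, o, d, t0, t_split, t_light)
            or occluded(far, o, d, t_split, t1, t_light))


def in_shadow(root: Node, point: Vec, light: Vec) -> bool:
    """Cast a shadow ray from a surface point toward a point light."""
    v = tuple(light[i] - point[i] for i in range(3))
    dist = math.sqrt(sum(c * c for c in v))
    d = tuple(c / dist for c in v)
    return occluded(root, point, d, 0.0, dist, dist)


# Hypothetical scene: six spheres above a ground plane at y = 0, one point light.
scene = [Sphere((0.0, 1.0, 0.0), 0.5), Sphere((4.0, 1.0, 4.0), 0.5),
         Sphere((-4.0, 1.0, -4.0), 0.5), Sphere((4.0, 1.0, -4.0), 0.5),
         Sphere((-4.0, 1.0, 4.0), 0.5), Sphere((0.0, 1.0, 8.0), 0.5)]
root = build(scene)
light = (0.0, 5.0, 0.0)
print(in_shadow(root, (0.0, 0.0, 0.0), light), in_shadow(root, (3.0, 0.0, 0.0), light))
```

A shadow query only needs to know whether any object blocks the light, so the traversal returns as soon as one hit is found instead of searching for the nearest intersection. A GPU version would run one such query per pixel in parallel.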
     To address the main bottleneck of 3D real-time rendering, the graphics generation rate, a GPU-based 3D real-time rendering system is implemented. The system builds on current graphics hardware and software architecture, in particular the capabilities of the graphics hardware accelerator, and uses a variety of acceleration techniques to generate images. In addition, according to the organization of the virtual scene and the required level of realism, PKD-Tree traversal methods and the handling of secondary rays are analyzed, and realistic lighting effects are then produced by recursive rendering.
     Finally, research trends in 3D real-time rendering technology are discussed.
