Network Parallel Computing Based on the Large Increment Method
Abstract
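The first part of this thesis gives a fairly comprehensive account of the framework of the Large Increment Method (LIM). The main results are: (1) starting from mechanical concepts, the derivation of LIM from the classical force method is explained in detail, and the complete formulation of LIM is given, including formulas for nonlinear elastic problems, elastoplastic problems, and the general finite element form; (2) LIM is compared with the LATIN algorithm, which resembles it in some respects, and it is shown that although LIM (based on the force method and generalized inverse matrix theory) and LATIN are similar in how the equations are grouped and in the form of the iteration, they differ fundamentally in both mechanical concept and mathematical foundation; (3) a complete mathematical proof of LIM is given, covering the uniqueness, convergence, and stability of the solution, which establishes the mathematical completeness of the method; (4) with regard to convergence, the selection of sample points for elastoplastic problems is discussed, and it is shown that, provided convergence is guaranteed, LIM allows more freedom in choosing sample points than the step-by-step incremental method.
    The second part, which is the main work of the thesis, concerns the parallelization of LIM. The main contents and results are: (1) the parallelism of LIM is analyzed in depth, identifying its inherent parallelism in time and in space at the level of mechanical concepts, as well as its parallelism in matrix-vector operations at the numerical level; (2) a distributed-memory asynchronous parallel algorithm model for LIM is designed, and a master-slave parallel program structure is proposed; (3) taking the plane stress problem as an example, two parallel schemes are designed and presented, one based on broadcast and one based on non-blocking communication; (4) the parallel environment used in this thesis is described: the hardware is a cluster system, and the software is the Linux operating system with an MPI message-passing programming environment; (5) the parallel performance of the two schemes is tested, the problems in the schemes are analyzed, and a communication-oriented program optimization is given. Runs on the MPI cluster system of the Institute of High Performance Computing, Department of Computer Science, Tsinghua University achieved a speedup close to 10 and a node resource utilization above 95%, with problem sizes reaching meshes of over a thousand elements, showing good parallel efficiency.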
The Large Increment Method (LIM) is a new method proposed in recent years by Prof. Xila Liu's group for material nonlinearity problems, based on the force method and generalized inverse matrix (GIM) theory. The complete theoretical framework and derivation of LIM are presented and discussed in detail, and the formulations of LIM are given. A comparison between LIM and LATIN (the Large Time Increment Method proposed by Ladeveze in Paris) leads to the conclusion that LIM and LATIN are fundamentally different iterative methods in both mechanical concept and mathematical background.
    A mathematical proof of LIM is also given, covering the uniqueness of the solution, the convergence of the solution, and the stability of the iteration, which shows that LIM is mathematically self-contained. Furthermore, LIM allows more freedom in the choice of sample points than the step-by-step method without breaking the stability of the iteration, which is another advantage of LIM. In short, LIM gives the force method new life in the field of computer-based calculation.
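The role a generalized inverse can play in a force-method setting is easiest to see on a small numerical case. The sketch below is not the thesis's LIM formulation; it only illustrates the standard fact that an underdetermined equilibrium system has a family of solutions that can be written with the Moore-Penrose inverse. The names `C`, `P`, and `alpha`, and the toy matrix itself, are assumptions made for illustration.

```python
import numpy as np

def general_solution(C, P, alpha):
    """Return one member of the solution family of C @ F = P.

    C     : (m, n) equilibrium matrix with m < n (illustrative name)
    P     : (m,) load vector
    alpha : (n,) arbitrary vector that selects a member of the family
    """
    C_pinv = np.linalg.pinv(C)                         # Moore-Penrose generalized inverse
    particular = C_pinv @ P                            # minimum-norm solution
    null_projector = np.eye(C.shape[1]) - C_pinv @ C   # projector onto the null space of C
    return particular + null_projector @ alpha

# Toy system: 2 equilibrium equations, 3 unknown member forces.
C = np.array([[1.0, 1.0, 0.0],
              [0.0, 1.0, 1.0]])
P = np.array([10.0, 4.0])

F0 = general_solution(C, P, np.zeros(3))
F1 = general_solution(C, P, np.array([1.0, -2.0, 3.0]))
residual0 = np.linalg.norm(C @ F0 - P)
residual1 = np.linalg.norm(C @ F1 - P)
print(F0, F1, residual0, residual1)
```

Both `F0` and `F1` satisfy equilibrium; they differ only by a self-equilibrated component in the null space of `C`. In a force-method setting, that remaining freedom is what the compatibility and constitutive conditions would have to fix.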
    Parallel computation in structural mechanics is an active research field in Western countries, while much less has been done in China. LIM has natural parallel characteristics that differ from those of the classical substructure algorithm, and these are discussed in detail. LIM can be executed in parallel in both the time domain and the space domain, and its matrix and vector calculations can also be parallelized. A master-slave parallel model based on distributed memory is given. Taking the plane stress problem as an example, two parallel schemes are given, based on broadcast and non-blocking communication respectively.
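The master-slave structure with non-blocking dispatch can be sketched in a few lines. This is a toy stand-in, not the thesis's MPI program: a thread pool takes the place of the cluster nodes, and the per-element task is an assumed one-dimensional elastic-perfectly-plastic stress update chosen only because each element can be processed independently. The constants `E_MODULUS` and `YIELD` and the function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

E_MODULUS = 200.0   # illustrative elastic modulus (assumption)
YIELD = 1.0         # illustrative yield stress (assumption)

def local_update(strains):
    """Worker task: independent per-element stress update (1D, elastic-perfectly plastic)."""
    return [max(-YIELD, min(YIELD, E_MODULUS * e)) for e in strains]

def master(strains, n_workers=4):
    """Master: split elements into chunks, submit them without waiting, gather as results arrive."""
    chunk = max(1, (len(strains) + n_workers - 1) // n_workers)
    stresses = [None] * len(strains)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        futures = {}
        for start in range(0, len(strains), chunk):
            fut = pool.submit(local_update, strains[start:start + chunk])
            futures[fut] = start                # remember where each chunk belongs
        for fut in as_completed(futures):       # results may arrive in any order
            start = futures[fut]
            result = fut.result()
            stresses[start:start + len(result)] = result
    return stresses

strains = [0.001 * (i - 5) for i in range(11)]
stresses = master(strains)
print(stresses)
```

In an MPI setting, the submit-then-gather pattern would correspond roughly to non-blocking sends and receives, whereas a broadcast-based scheme would send the same data to every worker in one collective call.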
    The parallel environment used for LIM is a cluster system running Linux and MPI, which belongs to the Department of Computer Science of Tsinghua University. The two parallel schemes are tested and evaluated in this environment. The optimized program is then given and shows good efficiency, with a speedup of approximately 8 and a compute-node occupancy of up to 95%. The ratio between computation and communication is the key for a cluster system using message passing, and it deserves particular attention if higher efficiency is to be achieved. Load balancing is another important factor in parallel computation. The parallelization of LIM can bring fresh ideas to parallel computation in structural mechanics.
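Why the computation-to-communication ratio matters can be seen from a very simple cost model. The sketch below assumes that compute time divides evenly across `p` nodes while communication time does not shrink at all; this model, the node count of 16, and the ratios tried are illustrative assumptions, not measurements from the thesis.

```python
def speedup(t_comp, t_comm, p):
    """Speedup on p nodes when compute time divides evenly and communication time is fixed."""
    t_parallel = t_comp / p + t_comm
    return t_comp / t_parallel

def efficiency(t_comp, t_comm, p):
    """Parallel efficiency: speedup per node."""
    return speedup(t_comp, t_comm, p) / p

# Vary the computation-to-communication ratio with communication time fixed at 1.
for ratio in (10, 100, 1000):
    s = speedup(t_comp=float(ratio), t_comm=1.0, p=16)
    e = efficiency(t_comp=float(ratio), t_comm=1.0, p=16)
    print(f"ratio {ratio:5d}: speedup {s:5.2f}, efficiency {e:.2f}")
```

Under this model the speedup approaches the node count only when computation dominates communication, which is consistent with the emphasis on communication-oriented optimization above.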
