Streaming Applications on Heterogeneous Platforms
详细信息    查看全文
  • 关键词:Multiple streams ; Heterogeneous platforms ; Performance
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2016
  • 出版时间:2016
  • 年:2016
  • 卷:9966
  • 期:1
  • 页码:116-129
  • 全文大小:1,825 KB
  • 参考文献:1.Adriaens, J.T., Compton, K., Kim, N.S., Schulte, M.J.: The case for GPGPU spatial multitasking. In: 2012 IEEE 18th International Symposium on High Performance Computer Architecture (HPCA), pp. 1–12. IEEE, February 2012
    2.Boyer, M., Meng, J., Kumaran, K.: Improving GPU performance prediction with data transfer modeling. In: 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Symposium Workshops and PhD Forum (IPDPSW), pp. 1097–1106. IEEE, May 2013
    3.Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., Lee, S.-H., Skadron, K.: Rodinia: a benchmark suite for heterogeneous computing. In: IEEE International Symposium on Workload Characterization, 2009. IISWC 2009, pp. 44–54. IEEE, October 2009
    4.Gómez-Luna, J., González-Linares, J.M., Benavides, J.I., Guil, N.: Performance models for asynchronous data transfers on consumer graphics processing units. J. Parallel Distrib. Comput. 72(9), 1117–1126 (2012)CrossRef
    5.Gregg, C., Hazelwood, K.: Where is the data? Why you cannot debate CPU vs. GPU performance without the answer. In: 2011 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 134–144. IEEE, April 2011
    6.Hennessy, J.L., Patterson, D.A.: Computer Architecture: A Quantitative Approach, 4th edn. Morgan Kaufmann, Burlington (2006)MATH
    7.Ino, F., Nakagawa, S., Hagihara, K.: GPU-chariot: a programming framework for stream applicationsrunning on multi-GPU systems. IEICE Trans. 96–D(12), 2604–2616 (2013)CrossRef
    8.Intel Inc. hStreams Architecture document for Intel MPSS 3.5, April 2015
    9.Liu, B., Qiu, W., Jiang, L., Gong, Z.: Software pipelining for graphic processing unit acceleration: partition, scheduling and granularity. Int. J. High Perform. Comput. Appl. 30(2), 169–185 (2015)CrossRef
    10.Meswani, M.R., Carrington, L., Unat, D., Snavely, A., Baden, S., Poole, S.: Modeling and predicting performance of high performance computing applications on hardware accelerators. Int. J. High Perform. Comput. Appl. 27(2), 89–108 (2013)CrossRef
    11.Mittal, S., Vetter, J.S.: A survey of CPU-GPU heterogeneous computing techniques. ACM Comput. Surv. 47(4), 36 (2015)CrossRef
    12.NVIDIA Inc. CUDA C Best Practices Guide Version 7.0, March 2015
    13.Owens, J.D., Houston, M., Luebke, D., Green, S., Stone, J.E., Phillips, J.C.: GPU computing. Proc. IEEE 96(5), 879–899 (2008)CrossRef
    14.Pienaar, J.A., Raghunathan, A., Chakradhar, S.: MDR: performance model driven runtime for heterogeneous parallel platforms. In: Proceedings of the International Conference on Supercomputing, ICS 2011, pp. 225–234. ACM, New York (2011)
    15.Takizawa, H., Sato, K., Kobayashi, H.: SPRAT: runtime processor selection for energy-aware computing. In: 2008 IEEE International Conference on Cluster Computing, pp. 386–393. IEEE (2008)
    16.The Khronos OpenCL Working Group. OpenCL - The open standard for parallel programming of heterogeneoussystems, January 2016. http://​www.​khronos.​org/​opencl/​
    17.Werkhoven, B.V., Maassen, J., Seinstra, F.J., Bal, H.E.: Performance models for CPU-GPU data transfers. In: 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pp. 11–20. IEEE, May 2014
    18.Wende, F., Steinke, T., Cordes, F.: Concurrent kernel execution on xeon phi within parallel heterogeneous workloads. In: Silva, F., Dutra, I., Santos Costa, V. (eds.) Euro-Par 2014. LNCS, vol. 8632, pp. 788–799. Springer, Heidelberg (2014). doi:10.​1007/​978-3-319-09873-9_​66
    19.Wende, F., Steinke, T., Cordes, F.: Multi-threaded kernel offloading to GPGPU using hyper-Q on kepler architecture. Technical report 14–19, ZIB, Takustr. 7, 14195 Berlin (2014)
    20.Yang, C., Wang, F., Du, Y., Chen, J., Liu, J., Yi, H., Lu, K.: Adaptive optimization for petascale heterogeneous CPU/GPU computing. In: 2010 IEEE International Conference on Cluster Computing (CLUSTER), pp. 19–28. IEEE (2010)
    21.Yang, C., Xue, W., Fu, H., Gan, L., Li, L., Xu, Y., Lu, Y., Sun, J., Yang, G., Zheng, W.: A peta-scalable CPU-GPU algorithm for global atmospheric simulations. In: Proceedings of the 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2013, pp. 1–12. ACM, New York (2013)
  • 作者单位:Zhaokui Li (18)
    Jianbin Fang (18)
    Tao Tang (18)
    Xuhao Chen (18)
    Canqun Yang (18)

    18. Software Institute, College of Computer, National University of Defense Technology, Changsha, China
  • 丛书名:Network and Parallel Computing
  • ISBN:978-3-319-47099-3
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
  • 卷排序:9966
文摘
Using multiple streams can improve the overall system performance by mitigating the data transfer overhead on heterogeneous systems. Currently, very few cases have been streamed to demonstrate the streaming performance impact and a systematic investigation of streaming necessity and how-to over a large number of test cases remains a gap. In this paper, we use a total of 56 benchmarks to build a statistical view of the data transfer overhead, and give an in-depth analysis of the impacting factors. Among the heterogeneous codes, we identify two types of non-streamable codes and three types of streamable codes, for which a streaming approach has been proposed. Our experimental results on the CPU-MIC platform show that, with multiple streams, we can improve the application performance by up 90 %. Our work can serve as a generic flow of using multiple streams on heterogeneous platforms.
NGLC 2004-2010.National Geological Library of China All Rights Reserved.
Add:29 Xueyuan Rd,Haidian District,Beijing,PRC. Mail Add: 8324 mailbox 100083
For exchange or info please contact us via email.