Acknowledgement
Grant : 매니코어 기반 슈퍼컴퓨터 작업 및 데이터 처리 기술 연구, 매니코어 기반 초고성능 스케일러블 OS 기초연구
Supported by : 한국과학기술정보연구원, 정보통신기술진흥센터
References
- A. Sodani, "Knights Landing (KNL): 2nd Generation Intel Xeon Phi Processor," Presented at Hot-Chips 2015, Aug. 2015.
- Intel. (2016, June 22). Intel Xeon Phi Processor Product Brief [Online]. Available: http://www.intel.co.kr/content/dam/www/public/us/en/documents/product -brief s/xeon-phi-processor-product -brief.pdf (downloaded 2016, Aug. 14)
- A. Heinecke , A. Breuer, M. Bader, and P. Dubey, "High Order Seismic Simulations on the Intel Xeon Phi Processor (Knights Landing)," High Performance Computing, Vol. 9697, pp. 343-362, Jun. 2016.
- Message Passing Interface, https://www.mpi-forum.org/.
- L. Chai, P. Lai, H.-W. Jin, and D. K. Panda, "Designing an efficient kernel-level and user-level hybrid approach for MPI intra-node communication on multi-core systems," Proc. of International Conference on Parallel Processing, pp. 222-229, Sep. 2008.
- H.-W. Jin, S. Sur, L. Chai, and D. K. Panda, "Lightweight kernel-level primitives for highperformance MPI intra-node communication over multi-core systems," Proc. of IEEE International Cluster Conference, pp. 446-451, Sep. 2007.
- D. Buntinas, B. Goglin, D. Goodell, G. Mercier, and S. Moreaud, "Cache-efficient, intranode, largemessage MPI communication with MPICH2-Nemesis," Proc. of International Conference on Parallel Processing, pp. 462-469, Sep. 2009.
- B. Goglin, M. Stephanie, "KNEM: a generic and scalable kernel-assisted intra-node MPI communication framework," Journal of Parallel and Distributed Computing, Vol. 73, No. 2, pp. 176-188, 2013. https://doi.org/10.1016/j.jpdc.2012.09.016
- L. Chai, A. Hartono, and D. K. Panda, "Designing high performance and scalable MPI intra-node communication support for clusters," Proc. of the IEEE International Conference on Cluster Computing, pp. 1-10, 2006.
- X. Wu, V. Taylor, C. Lively, and S. Sharkawi, "Performance analysis and optimization of parallel scientific applications on CMP cluster systems," Proc. of International Conference on Parallel Processing- Workshops, pp. 188-195, 2008.
- C. Zhang, X. Yuan, and A. Srinivasan, "Processor affinity and MPI performance on SMP-CMP clusters," Proc. of the IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, pp. 1-8, 2010.
- M. Si, Y. Ishikawa, and M. Tatagi, "Direct MPI Library for Intel Xeon Phi Co-Processors," Proc. of the IEEE 27th International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), pp. 816-824, 2013.
- S. Potluri, K. Hamidouche, D. Bureddy, and D. K. Panda, "MVAPICH2-MIC: A High Performance MPI Library for Xeon Phi Clusters with InfiniBand," Proc. of the Extreme Scaling Workshop, pp. 25-32, 2013.
- M. Noack, F. Wende, T. Steinke, and F. Cordes, "A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters," Proc. of the SC14: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 203-214, 2014.
- S. Neuwirth, D. Frey, and U. Bruening, "Communication Models for Distributed Intel Xeon Phi Coprocessors," Proc. of the IEEE 21st International Conference onParallel and Distributed Systems, pp. 499-506, 2015.
- S. Potluri, A. Venkatesh, D. Bureddy, K. Kandalla, and D. K. Panda, "Efficient Intra-node Communication on Intel-MIC Clusters," Proc. of the 13th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp. 128-135, 2013.
- A. Shimada, A. Hori, and Y. Ishikawa, "Eliminating Costs for Crossing Process Boundary from MPI Intra-node Communication," Proc. of the 21st European MPI Users' Group Meeting, pp. 119-120, 2014.
- K. Kandalla, A. Venkatesh, K. Hamidouche, S. Potluri, D. Bureddy, and D. K. Panda, "Designing Optimized MPI Broadcast and Allreduce for Many Integrated Core (MIC) InfiniBand Clusters," Proc. of the IEEE 21st Annual Symposium on High- Performance Interconnects, pp. 63-70, 2013.
- A. Venkatesh, S. Potluri, R. Rajachandrasekar, M. Luo, K. Hamidouche, and D. K. Panda, "High Performance Alltoall and Allgather Designs for InfiniBand MIC Clusters," Proc. of IEEE 28th International Parallel and Distributed Processing Symposium, pp. 637-646, 2014.
- MVAPICH, [Online]. Available: http://mvapich.cse.ohio-state.edu/
- memkind library, [Online]. Available: http://memkind.github.io/memkind/
- A. Kleen, "A numa api for linux," Novel Inc, 2005.