Exascale message passing interface based program deadlock detection

Deadlock detection is one of the main issues of software testing in High Performance Computing (HPC) and also inexascale computing areas in the near future. Developing and testing programs for machines which have millions of cores is not an easy task. HPC program consists of thousands (or millions)...

Full description

Saved in:
Bibliographic Details
Main Authors: Al Dhubhani, R., Eassa, F., Saeed, F.
Format: Article
Published: Institute of Advanced Engineering and Science 2016
Subjects:
Online Access:http://eprints.utm.my/id/eprint/72697/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84960158781&doi=10.11591%2fijece.v6i1.9575&partnerID=40&md5=db9a8a35d797deca78b9f5a7f3d044d2
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Deadlock detection is one of the main issues of software testing in High Performance Computing (HPC) and also inexascale computing areas in the near future. Developing and testing programs for machines which have millions of cores is not an easy task. HPC program consists of thousands (or millions) of parallel processes which need to communicate with each other in the runtime. Message Passing Interface (MPI) is a standard library which provides this communication capability and it is frequently used in the HPC. Exascale programs are expected to be developed using MPI standard library. For parallel programs, deadlock is one of the expected problems. In this paper, we discuss the deadlock detection for exascale MPI-based programs where the scalability and efficiency are critical issues. The proposed method detects and flags the processes and communication operations which are potential to cause deadlocks in a scalable and efficient manner. MPI benchmark programs were used to test the proposed method.