Publications

Improving MPI-HMMER's scalability with parallel I/O

Abstract

We present PIO-HMMER, an enhanced version of MPI-HMMER. PIO-HMMER improves on MPI-HMMER's scalability through the use of parallel I/O and a parallel file system. In addition, we describe several enhancements, including a new load balancing scheme, enhanced post-processing, improved double-buffering support, and asynchronous I/O for returning scores to the master node. Our enhancements to the core HMMER search tools, hmmsearch and hmmpfam, allow for scalability up to 256 nodes, where MPI-HMMER previously did not scale beyond 64 nodes. We show that our performance enhancements allow hmmsearch to achieve between 48x and 221x speedup using 256 nodes, depending on the size of the input HMM and the database. Further, we show that by integrating database caching with PIO-HMMER's hmmpfam tool we can achieve up to 328x speedup using only 256 nodes.

Date
May 23, 2009
Authors
John Paul Walters, Rohan Darole, Vipin Chaudhary
Conference
IEEE International Symposium on Parallel & Distributed Processing (IPDPS 2009)
Pages
1-11
Publisher
IEEE