Publications
Improving MPI-HMMER's scalability with parallel I/O
Abstract
We present PIO-HMMER, an enhanced version of MPI-HMMER. PIO-HMMER improves on MPI-HMMER's scalability through the use of parallel I/O and a parallel file system. In addition, we describe several enhancements, including a new load balancing scheme, enhanced post-processing, improved double-buffering support, and asynchronous I/O for returning scores to the master node. Our enhancements to the core HMMER search tools, hmmsearch and hmmpfam, allow for scalability up to 256 nodes where MPI-HMMER previously did not scale beyond 64 nodes. We show that our performance enhancements allow hmmsearch to achieve between 48x and 221x speedup using 256 nodes, depending on the size of the input HMM and the database. Further, we show that by integrating database caching with PIO-HMMER's hmmpfam tool we can achieve up to 328x speedup using only 256 nodes.
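To put the reported figures in context, the sketch below computes parallel efficiency, taken here as speedup divided by node count. This is a standard definition rather than something the abstract states, and the function and label names are illustrative only. The speedup values and the node count are the ones quoted in the abstract.

```python
def parallel_efficiency(speedup: float, nodes: int) -> float:
    """Return speedup per node (1.0 means ideal linear scaling)."""
    if nodes <= 0:
        raise ValueError("nodes must be a positive integer")
    return speedup / nodes


# Node count and speedups as quoted in the abstract; labels are illustrative.
NODES = 256
reported_speedups = {
    "hmmsearch (low end)": 48.0,
    "hmmsearch (high end)": 221.0,
    "hmmpfam with database caching": 328.0,
}

efficiencies = {
    label: parallel_efficiency(speedup, NODES)
    for label, speedup in reported_speedups.items()
}

for label, efficiency in efficiencies.items():
    print(f"{label}: efficiency {efficiency:.2f}")
```

Under this definition the hmmsearch results correspond to efficiencies of roughly 0.19 to 0.86, while the hmmpfam figure is about 1.28. A value above 1.0 means the speedup exceeds the node count, which would be consistent with the gain the abstract attributes to database caching.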
- Date: May 23, 2009
- Authors: John Paul Walters, Rohan Darole, Vipin Chaudhary
- Conference: Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
- Pages: 1-11
- Publisher: IEEE