Home   Publications   Awards   Research     NB Collectives       LibNBC       NBCBench       Performance         Open MPI/MVAPI         MVAPICH       CG Solver       Kernels       HPL     MPI Topologies     MPI Datatypes     Netgauge     Network Topologies     Ethernet BTL eth     ORCS     DFSSSP     Older Projects     cDAG     LogGOPSim     CoMPIler   Teaching   Miscellaneous   Full CV [pdf]   BLOG   bio
  
 
 
  
 
  
  Events
  
  
  
  
   
  
   Past Events
  
  
  
  
  
  
   
    | 
NBCBench - benchmarking Nonblocking MPI Collective Operations Performance 	
  Description
  NBCBench is a benchmark that measures overlap and asynchronous 
  progression of nonblocking collective operations implemented in 
  LibNBC. NBCBench is distributed under the BSD license.
  Download NBCBench
  
  Performance Results for different MPI Implementations
We present performance results of LibNBC for different MPI
implementations. LibNBC issues MPI_Isend() and MPI_Irecv() calls, and
the performance and possible overlap depends on the implementation in
MPI. We do also compare the collective operations implemented in LibNBC
to the MPI operations.
Results are available for the following MPI Implementations:
Please keep in mind that not all collective algorithms in LibNBC are
optimized!
  Benchmark Methodology
We used the overlap-benchmark which has been designed to assess the
maximal possible overlap and the minimal latencies. The benchmark will
be described later. Details can be found in "Accurately Measuring Collective 
Operations at Massive Scale" [1] and "Implementation and Performance 
Analysis of Non-Blocking Collective Operations for MPI" [2].
 
  References
| PMEO'08 | [1] Torsten Hoefler, Timo Schneider and Andrew Lumsdaine: |  |   | Accurately Measuring Collective Operations at Massive Scale In Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium, PMEO'08 Workshop, presented in Miami, FL, ISSN: 1530-2075, ISBN: 978-1-4244-1694-3, Apr. 2008,  Invited to a journal special issue on top picks from PMEO'08.         |  
 | SC07 | [2] Torsten Hoefler, Andrew Lumsdaine and Wolfgang Rehm: |  |   | Implementation and Performance Analysis of Non-Blocking Collective Operations for MPI In Proceedings of the 2007 International Conference on High Performance Computing, Networking, Storage and Analysis, SC07, presented in Reno, USA, IEEE Computer Society/ACM, Nov. 2007, (acceptance rate 20%, 54/268)         |   
 |