Next: Introduction
Up: thesis_paper
Previous: List of Tables
Contents
- RMI Architecture.
- A parallel matrix addition.
- Red-black iteration.
- A pipelined matrix multiplication program.
- HPJava data parallel version of the N-body.
- Data parallel version of the ``N-body'' example with MPI
communications.
- Principal classes of mpiJava
- Minimal mpiJava program (run in two processes)
- Software Layers
- PingPong Results in Shared Memory (SM) mode
- PingPong Results in Distributed Memory (DM) mode
- Send and receive operations for various array shapes.
- Communication times from ping-pong benchmark in non-shared-memory
case. The lines represent the model
defined by Equations 5.1 to 5.3 in the text, with
parameters from Table 5.1.
- Communication times from ping-pong benchmark in shared-memory
case. The lines represent the model
defined by Equations 5.1 to 5.3 in the text, with
parameters from Table 5.1.
- Improved protocol for handling arrays of primitive elements.
- Pseudocode for ArrayOutputStream and
ArrayInputStream
- Ping-pong timings with primitive array data sent separately (solid
points) in distributed memory mode, compared with the unoptimized results from Figure 5.2
(open points). Recall that the goal is to bring times for ``object-oriented''
sends of arrays down to the ``native'' send times,
most closely approximated by the triangular points.
- Ping-pong timings with primitive array data sent separately (solid
points) in shared memory mode, compared with the unoptimized results from Figure 5.3
(open points). Recall that the goal is to bring times for ``object-oriented''
sends of arrays down to the ``native'' send times,
most closely approximated by the triangular points.
- Timings allowing two-dimensional array proxies in the
object stream (solid points) in distributed memory mode,
compared with the unoptimized results from Figure 5.2
(open points).
Sends of two-dimensional Java arrays (solid circles) are now much closer to
the native bandwidth (of which the triangular points are representative).
- Timings allowing two-dimensional array proxies in the
object stream (solid points) in shared memory mode,
compared with the unoptimized results from Figure 5.3
(open points).
Sends of two-dimensional Java arrays (solid circles) are now much closer to
the native bandwidth (of which the triangular points are representative).
- Parallel Multiplicative Linear Congruential Generators (MLCG).
- The Main Procedure of Potts Model Simulation using Metropolis
Algorithm: One starts with an initial configuration of spins and
repeats these procedures.
- Checkerboard Partition and Blocked Communication.
- The Main Procedure of Sequential Swendsen-Wang Algorithm.
- MIMD Component Labeling. The bonds are shown as the thick
lines.
- The Potts Model Monte Carlo Simulation.
- Metropolis Performance with Lattice Size
.
- Metropolis Performance with Lattice Size
.
- Metropolis Performance with Lattice Size
.
- Metropolis Performance with Lattice Size
.
- Metropolis Performance with Lattice Size
.
- Metropolis Performance with Lattice Size
.
- Swendsen-Wang Performance with Lattice Size
.
- Swendsen-Wang Performance with Lattice Size
.
- Swendsen-Wang Performance with Lattice Size
.
- Swendsen-Wang Performance with Lattice Size
.
- Swendsen-Wang Performance with Lattice Size
.
- Swendsen-Wang Performance with Lattice Size
.
- Speedup of Metropolis by using mpiJava as compared with serial Java.
- Speedup of Swendsen-Wang by using mpiJava as compared with serial Java.
Bryan Carpenter
2004-06-09