An Empirical Study of the Performance of Scalable Locality on a Distributed Shared Memory System

TRICERATOPS

TriCollege Digital Repository

An Empirical Study of the Performance of Scalable Locality on a Distributed Shared Memory System

View Dublin Core Metadata

Title: An Empirical Study of the Performance of Scalable Locality on a Distributed Shared Memory System
Author: Douglas, Tim
Advisor: Wonnacott, David G.
Department: Haverford College. Dept. of Computer Science
Type: Thesis (B.S.)
Issue Date: 2011
Abstract: We run an empirical study to determine the performance attainable using scalable locality on a distributed shared memory system. We utilize the PLUTO automatic parallelizer and locality optimizer on a one-dimensional Jacobi stencil with a distributed shared memory system provided by Intel’s Cluster OpenMP on commodity hardware. Unlike earlier works which base performance concerns on tile size, we discover tile sizes are less crucial, and instead threading issues and cache interference need particular attention. On our 56 processor cluster of 14 quad-core machines, with preliminary single-level tiling we are able to realize 121.3 GFLOPS, or 63% of linear speedup. Our data suggest that adding further cores would increase performance linearly.
Subject: Distributed shared memory -- Evaluation
Terms of Use: http://creativecommons.org/licenses/by-nc/3.0/us/
Permanent URL: http://hdl.handle.net/10066/7569

Files in this item

Files Description Size Format
2011DouglasT_thesis.pdf Thesis 116.5Kb PDF
2011DouglasT_release.pdf **Archive Staff Only** 98.08Kb PDF

Citation

Douglas, Tim. "An Empirical Study of the Performance of Scalable Locality on a Distributed Shared Memory System". 2011. Available electronically from http://hdl.handle.net/10066/7569.

This item appears in the following Collection(s)

View Dublin Core Metadata

http://creativecommons.org/licenses/by-nc/3.0/us/ Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by-nc/3.0/us/