Abstract
A 16 way cache-coherent nonuniform memory access (ccNUMA) Intel system consisting of four commodity four-processor Fujitsu Teamserver SMPs connected by a Synfinity cache-coherent switch was built. Results from a performance-evaluation study confirm the success of the combined hardware/software approach for performance tuning in computation-intensive workloads. The results also show that the poor local-memory bandwidth in the commodity Intel-based systems is often the main contributor to poor scalability and performance.
Original language | English (US) |
---|---|
Pages (from-to) | 207-227 |
Number of pages | 21 |
Journal | IBM Journal of Research and Development |
Volume | 45 |
Issue number | 2 |
DOIs | |
State | Published - Mar 2001 |
Externally published | Yes |
ASJC Scopus subject areas
- General Computer Science