Show simple item record

dc.contributor.authorKarlin, Ian
Keasler, Jeff
McGraw, James R.
Sarkar, Vivek
Sharma, Kamal
dc.date.accessioned 2017-08-02T22:03:12Z
dc.date.available 2017-08-02T22:03:12Z
dc.date.issued 2013-04-25
dc.identifier.urihttps://hdl.handle.net/1911/96406
dc.description.abstract This paper describes a new approach to managing array data layouts to optimize performance for scientific codes. Prior research has shown that changing data layouts (e.g., interleaving arrays) can improve performance. However, there have been two major reasons why such optimizations are not widely used: (1) the need to select different layouts for different computing platforms, and (2) the cost of re-writing codes to use to new layouts. We describe a source-to-source translation process that allows us to generate codes with different array interleavings, based on a data layout specification. We used this process to generate 19 different data layouts for an ASC benchmark code (IRSmk) and 32 different data layouts for the DARPA UHPC challenge application (LULESH). Performance results for multicore versions of the benchmarks with different layouts show significant benefits on four computing platforms (IBM POWER7, AMD APU, Intel Sandybridge, IBM BG/Q). For IRSmk, our results show performance improvements ranging from 22.23× on IBM POWER7 to 1.10× on Intel Sandybridge. For LULESH, we see improvements ranging from 1.82× on IBM POWER7 to 1.02× on Intel Sandybridge. We also developed a new optimization algorithm to recommend a layout for an input source program and specific target machine characteristics. Our results show that the performance of this automated layout algorithm outperforms the manual layouts in one case and performs within 10% of the best architecture-specific layout in all the other cases, but one.
dc.format.extent 17 pp
dc.language.iso eng
dc.rights You are granted permission for the noncommercial reproduction, distribution, display, and performance of this technical report in any format, but this permission is only for a period of forty-five (45) days from the most recent time that you verified that this technical report is still available from the Computer Science Department of Rice University under terms that include this permission. All other rights are reserved by the author(s).
dc.title User-Specified and Automatic Data Layout Selection for Portable Performance
dc.type Technical report
dc.date.note April 25, 2013
dc.identifier.digital TR13-03
dc.type.dcmi Text
dc.identifier.citation Karlin, Ian, Keasler, Jeff, McGraw, James R., et al.. "User-Specified and Automatic Data Layout Selection for Portable Performance." (2013) https://hdl.handle.net/1911/96406.


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record