Now showing items 1-2 of 2
Binary analysis for attribution and interpretation of performance measurements on fully-optimized code
Modern scientific codes frequently employ sophisticated object-oriented design. In these codes, deep loop nests are often spread across multiple routines. To achieve high performance, such codes rely on compilers to inline ...
Developing a scalable, extensible parallel performance analysis toolkit
Modern parallel systems and applications are constantly increasing in scale and complexity, and consequently good parallel performance is impossible to achieve without the help of performance tools. However, monitoring ...