    Search

    Now showing items 1-10 of 13

    Performance Analysis and Optimization of a Hybrid Seismic Imaging Application 

    Paul, Sri Raj; Araya-Polo, Mauricio; Mellor-Crummey, John; Hohl, Detlef (Elsevier, 2016)
    Applications to process seismic data are computationally expensive and, therefore, employ scalable parallel systems to produce timely results. Here we describe our experiences of using performance analysis tools to gain insight into an MPI+OpenMP code developed by Shell that performs Reverse Time Migration on a cluster to produce models of the ...

    A Sample-Driven Call Stack Profiler 

    Fowler, Rob; Froyd, Nathan; Mellor-Crummey, John (2004-07-15)
    Call graph profiling reports measurements of resource utilization along with information about the calling context in which the resources were consumed. We present the design of a novel profiler that measures resource utilization and its associated calling context using a stack sampling technique. Our scheme has a novel combination of features and ...

    Improving Performance with Integrated Program Transformations 

    Jin, Guohua; Mellor-Crummey, John; Qasem, Apan (2004-09-09)
    Achieving a high fraction of peak performance on today’s computer systems is difficult for complex scientific applications. To do so, an application’s characteristics must be tailored to exploit the characteristics of its target architecture. Today, commercial compilers do not adequately tailor programs automatically; thus, application scientists ...

    Tools for Application-Oriented Performance Tuning 

    Fowler, Robert; Mellor-Crummey, John; Whalley, David (2001-03-14)
    Application performance tuning is a complex process that requires assembling various types of information and correlating it with source code to pinpoint the causes of performance bottlenecks. Existing performance tools don't adequately support this process in one or more dimensions. We discuss some of the critical utility and usability issues for ...

    The Platform-Aware Compilation Environment: Status and Future Directions 

    Cooper, Keith D.; Khan, Rishi; Lele, Sanjiva; Mellor-Crummey, John; Merényi, Erzsébet; (2012-06-13)
    The Platform-Aware Compilation Environment (PACE) is an ambitious attempt to construct a portable compiler that produces code capable of achieving high levels of performance on new architectures. The key strategies in PACE are the design and development of an optimizer and runtime system that are parameterized by system characteristics, the automatic ...

    Effective Communication Coalescing for Data-Parallel Applications 

    Chavarria-Miranda, Daniel; Mellor-Crummey, John (2005-07-29)
    Communication coalescing is a static optimization that can reduce both communication frequency and redundant data transfer in compiler-generated code for regular, data parallel applications. We present an algorithm for coalescing communication that arises when generating code for regular, data-parallel applications written in High-Performance Fortran ...

    Effective Performance Measurement and Analysis of Multithreaded Applications 

    Mellor-Crummey, John; Tallent, Nathan (2008-10-13)
    Understanding why the performance of a multithreaded program does not improve linearly with the number of cores in a shared-memory node populated with one or more multicore processors is a problem of growing practical importance. This paper makes three contributions to performance analysis of multithreaded programs. First, we describe how to measure ...

    Understanding Unfulfilled Memory Reuse Potential in Scientific Applications 

    Marin, Gabriel; Mellor-Crummey, John (2007-10-05)
    The potential for improving the performance of data-intensive scientific programs by enhancing data reuse in cache is substantial because CPUs are significantly faster than memory. Traditional performance tools typically collect or simulate cache miss counts or rates and attribute them at the function level. While such information identifies program ...

    The Platform-Aware Compilation Environment: Preliminary Design Document 

    Cooper, Keith D.; Mellor-Crummey, John; Merényi, Erzsébet; Sadayappan, P.; Sarkar, Vivek; (2010-09-15)
    The Platform-Aware Compilation Environment (PACE) is an ambitious attempt to construct a portable compiler that produces code capable of achieving high levels of performance on new architectures. The key strategies in PACE are the design and development of an optimizer and runtime system that are parameterized by system characteristics, the automatic ...

    Compiling Stencils in High Performance Fortran 

    Brickner, R. Gregg; Kennedy, Ken; Mellor-Crummey, John; Roth, Gerald H. (1997-11-12)
    For many Fortran90 and HPF programs performing dense matrix computations, the main computational portion of the program belongs to a class of kernels known as stencils. Stencil computations are commonly used in solving partial differential equations, image processing, and geometric modeling. The efficient handling of such stencils is critical for ...

    Home | FAQ | Contact Us | Privacy Notice | Accessibility Statement
    Managed by the Digital Scholarship Services at Fondren Library, Rice University
    Physical Address: 6100 Main Street, Houston, Texas 77005
    Mailing Address: MS-44, P.O.BOX 1892, Houston, Texas 77251-1892
    Site Map

     

    Discover

    Author
    Mellor-Crummey, John (13)
    Araya-Polo, Mauricio (2)
    Cooper, Keith D. (2)
    Kennedy, Ken (2)
    Merényi, Erzsébet (2)
    Sadayappan, P. (2)
    Sarkar, Vivek (2)
    Torczon, Linda (2)
    Whalley, David (2)
    Adve, Vikram S. (1)
    ...

    Subject
    hybrid programming models (1)
    MPI+OpenMP (1)
    performance analysis (1)
    reverse time migration (1)
    ...

    Date Issued
    2010 - 2020 (4)
    2000 - 2009 (6)
    1997 - 1999 (3)

    Has File(s)
    Yes (13)
