Description

With the advent of GPGPU, many applications are being accelerated by using CUDA programing paradigm. We are able to achieve around 10x -100x speedups by simply porting the application on

With the advent of GPGPU, many applications are being accelerated by using CUDA programing paradigm. We are able to achieve around 10x -100x speedups by simply porting the application on to the GPU and running the parallel chunk of code on its multi cored SIMT (Single instruction multiple thread) architecture. But for optimal performance it is necessary to make sure that all the GPU resources are efficiently used, and the latencies in the application are minimized.

Reuse Permissions
  • 2.07 MB application/pdf

    Download count: 0

    Details

    Contributors
    Date Created
    • 2017
    Resource Type
  • Text
  • Collections this item is in
    Note
    • Partial requirement for: M.S., Arizona State University, 2017
      Note type
      thesis
    • Includes bibliographical references (pages 91-96)
      Note type
      bibliography
    • Field of study: Engineering

    Citation and reuse

    Statement of Responsibility

    by Ameya Wadekar

    Machine-readable links