Retargeting and Respecializing GPU Workloads for Performance Portability
2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2024)
Keywords
Performance Portability, GPU Workloads, Benchmark, Programming Model, Amount Of Memory, Rodinia, Shared Memory, Parallel Threads, Automatic Translation, Usability, Flow Control, Outer Loop, Memory Load, Higher Level Of Abstraction, Access Patterns, Block Level, Synchronization Process, Domain-specific Languages, Memory Bandwidth, Parallel Loops, Compile Time, L2 Cache, Parallel Optimization, Block Dimensions, Kernel Feature, Executive Resources, Double-precision Floating-point