
Retargeting and Respecializing GPU Workloads for Performance Portability

2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2024)

Keywords
Performance Portability, GPU Workloads, Benchmark, Programming Model, Amount Of Memory, Rodinia, Shared Memory, Parallel Threads, Automatic Translation, Usability, Flow Control, Outer Loop, Memory Load, Higher Level Of Abstraction, Access Patterns, Block Level, Synchronization Process, Domain-specific Languages, Memory Bandwidth, Parallel Loops, Compile Time, L2 Cache, Parallel Optimization, Block Dimensions, Kernel Feature, Executive Resources, Double-precision Floating-point