Chrome Extension
WeChat Mini Program
Use on ChatGLM

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Krishna Teja Chitty-Venkata,Siddhisanket Raskar,Bharat Kale, Farah Ferdaus, Aditya Tanikanti, Ken Raffenetti, Valerie Taylor,Murali Emani,Venkatram Vishwanath

SC-W '24 Proceedings of the SC '24 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis(2025)

Cited 0|Views5
Key words
Large Language Models,AI Accelerators,Inference Performance Evaluation,Benchmarking
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined