Demystifying the Resilience of Large Language Model Inference: An End-to-End Perspective
Yu Sun, Zachary Coalson, Shiyang Chen, Hang Liu, Zhao Zhang, Sanghyun Hong, Bo Fang, and Lishan Yang
In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2025