Unveiling GPU Bottlenecks in Large-Batch LLM Inference” was published by researchers at Barcelona Supercomputing Center, ...