Efficient LLM Serving on Hybrid Real-time and Best-effort Requests Jan 1, 2025ยท Wan Borui , Zhao Juntao , Jiang Chenyu , Guo Chuanxiong , Wu Chuan ยท 0 min read Cite Type Journal article Publication arXiv preprint arXiv:2504.09590 Last updated on Jan 1, 2025 OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training Jan 1, 2025 โ