Cdmpp: A device-model agnostic framework for latency prediction of tensor programs

Jan 1, 2024 · Hanpeng Hu, Junwei Su, Juntao Zhao, Yanghua Peng, Yibo Zhu, Haibin Lin, Chuan Wu

Type: Conference paper
Publication: Proceedings of the Nineteenth European Conference on Computer Systems
Last updated on Jan 1, 2024