News | Chen Chen

Mar 14, 2026	One Hermes work, which remarkably accelerates LLM agent serving with a novel probabilistic demand model, is accepted by ACM TACO. Congratulations to Yifei, Zuo, Zhenghao and Weiye!
Mar 13, 2026	One paper is accepted by TMC. Congratulations to Jiayi and Zuo!
Feb 28, 2026	One paper is accepted by TPDS. Congratulations to Zuo and Jiayi!
Feb 21, 2026	Our work on efficient training job configuration is accepted by CVPR’26. Congratulations to Guanjie!
Jan 31, 2026	Two papers on efficient scheduling of LLM training jobs are accepted by EuroSys’26. Congratulations to Yuxuan and Yanbo!
Jan 26, 2026	One paper is accepted by ICLR’26.
Jan 21, 2026	The RetroInfer paper is accepted by VLDB’26.
Dec 08, 2025	One paper accepted by ToN. Congratulations to Jiayi and Zuo!
Oct 18, 2025	One paper on deep learning job scheduling is accepted by TCC.
Oct 15, 2025	The SemanticPrefetcher work is accepted by CloudCom 2025. Congratulations to Tianze!
Sep 19, 2025	The RetrievalAttention work is accepted by NeurIPS 2025. Congratulations to Di!
Sep 05, 2025	One paper on attaining efficient and also fair scheduling for model training jobs accepted by ACM TACO. Congratulations to Yifei!
Aug 19, 2025	One paper on serverless computing accepted by IEEE Transactions on Services Computing.
Jun 16, 2025	One paper accepted by VLDB.
Jun 14, 2025	Two papers respectively on LLM Agent Serving and Unified AI Caching released on Arxiv.
May 08, 2025	One paper on GPU serverless computing accepted by TACO.
Mar 27, 2025	Two papers respectively on LLM application scheduling and Federated Learning accepted by IEEE ICDCS.
Mar 22, 2025	One paper is accepted by ISCA.
Dec 21, 2024	One paper on GPU cluster scheduling is accepted by TACO.
Dec 20, 2024	One paper on GPU resource pooling is accepted by IPDPS 2025.
Dec 16, 2024	The RetrievalAttention work wins the best paper award of the 4th NeurIPS Workshop on Efficient Natural Language and Speech Processing.
Oct 10, 2024	The RetrievalAttention work is accepted by the 4th NeurIPS Workshop on Efficient Natural Language and Speech Processing (ENLSP).
Sep 16, 2024	RetrievalAttention on accelerating long-context LLM Inference is released on arXiv.
Jun 21, 2024	Our poster on LLM Inference is awarded the Best Poster Award on IWQoS’24.
Jun 11, 2024	FedCA on Federated Learning is accepted by ICPP’24.
Apr 15, 2024	PAS on Federated Learning is accepted by IWQoS’24.
Mar 21, 2024	Parrot on accelerating LLM inference via the abstraction of Semantic Variable is accepted by OSDI’24.
Dec 01, 2023	One paper on Federated Learning accepted by INFOCOM’24.
Jan 25, 2023	APF accepted to IEEE Trans. Parallel and Distributed Systems as an extended version of ICDCS ’21.
Nov 30, 2022	GIFT accepted to JSAC, special issue on Communication-Efficient Distributed Learning over Networks.
Sep 09, 2022	Proposal funded by National Natural Science Foundation of China (Youth Program).
Jun 30, 2022	Awarded CCF-Huawei Populus Grove Fund.
Jan 01, 2022	Started my career at SJTU as an associate professor.