Publications

2025

  1. TACO
    Taming Flexible Job Packing in Deep Learning Training Clusters
    Pengyu Yang, Weihao Cui, Chunyu Xue, Han Zhao, Chen Chen, Quan Chen, Jing Yang, and Minyi Guo
    ACM Transactions on Architecture and Code Optimization, 2025
  2. IPDPS
    Reducing the End-to-End Latency of DNN-based Recommendation Systems Deployed in GPU Pools
    Luan Guangqiang, Pang Pu, Chen Quan, Xu Guoyao, Zhang Chi, Zi Yanyi, Yu Yinghao, Yang Guodong, Zhang Liping, Chen Chen, and Guo Minyi
    In 39th IEEE International Parallel and Distributed Processing Symposium, 2025

2024

  1. ENLSP
    RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
    Di Liu, Meng Chen, Baotong Lu, Huiqiang Jiang, Zhenhua Han, Qianxi Zhang, Qi Chen, Chengruidong Zhang, Bailu Ding, Kai Zhang, Chen Chen, Fan Yang, Yuqing Yang, and Lili Qiu
    In the 4th NeurIPS Workshop on Efficient Natural Language and Speech Processing (Spotlight), 2024
  2. ICPP
    FedCA: Efficient Federated Learning with Client Autonomy
    Na Lv, Zhi Shen, Chen Chen, Zhifeng Jiang, Jiayi Zhang, Quan Chen, and Minyi Guo
    In Proceedings of the 53rd ACM International Conference on Parallel Processing, 2024
  3. IWQoS
    PAS: Towards Accurate and Efficient Federated Learning with Parameter-Adaptive Synchronization
    Zuo Gan, Chen Chen, Jiayi Zhang, Gaoxiong Zeng, Yifei Zhu, Jieru Zhao, Quan Chen, and Minyi Guo
    In Proceedings of the IEEE/ACM International Symposium on Quality of Service, 2024
  4. IWQoS Poster
    Towards Efficient Compound Large Language Model System Serving in the Wild
    Yifei Zhu, Botao Zhu, Chen Chen, and Xiaoyi Fan
    In Proceedings of the IEEE/ACM International Symposium on Quality of Service, 2024
  5. OSDI
    Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
    Chaofan Lin, Zhenhua Han, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, and Lili Qiu
    In Proceedings of the USENIX Symposium on Operating Systems Design and Implementation, 2024
  6. INFOCOM
    DPBalance: Efficient and Fair Privacy Budget Scheduling for Federated Learning as a Service
    Yu Liu, Zibo Wang, Yifei Zhu, and Chen Chen
    In Proceedings of the IEEE Conference on Computer Communications, 2024
  7. HPCA
    An Optimizing Framework on MLIR for Efficient FPGA-based Accelerator Generation
    Weichuang Zhang, Jieru Zhao, Guan Shen, Quan Chen, Chen Chen, and Minyi Guo
    In Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024
  8. ASPLOS
    DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration
    Zijun Li, Chuhao Xu, Quan Chen, Jieru Zhao, Chen Chen, and Minyi Guo
    In Proceedings of the ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

  1. ICCD
    STAG: Enabling Low Latency and Low Staleness of GNN-based Services with Dynamic Graphs
    Jiawen Wang, Quan Chen, Deze Zeng, Zhuo Song, Chen Chen, and Minyi Guo
    In Proceedings of the IEEE International Conference on Computer Design, 2023
  2. TCC
    Accelerating Distributed Learning in Non-Dedicated Environments
    Chen Chen, Qizhen Weng, Wei Wang, Baochun Li, and Bo Li
    IEEE Transactions on Cloud Computing, 2023
  3. ICPP
    Asfl: Adaptive Semi-asynchronous Federated Learning for Balancing Model Accuracy and Total Latency in Mobile Edge Networks
    Jieling Yu, Ruiting Zhou, Chen Chen, Bo Li, and Fang Dong
    In Proceedings of the ACM International Conference on Parallel Processing, 2023
  4. TPDS
    Synchronize Only the Immature Parameters: Communication-Efficient Federated Learning By Freezing Parameters Adaptively
    Chen Chen, Hong Xu, Wei Wang, Baochun Li, Bo Li, Li Chen, and Gong Zhang
    IEEE Transactions on Parallel and Distributed Systems, 2023
  5. JSAC
    GIFT: Towards Accurate and Efficient Federated Learning with Gradient-Instructed Frequency Tuning
    Chen Chen, Hong Xu, Baochun Li, Bo Li, Li Chen, and Gong Zhang
    IEEE Journal on Selected Areas in Communications (special issue on Communication-Efficient Distributed Learning over Networks), 2023

2022

  1. SoCC
    Characterizing and orchestrating VM reservation in geo-distributed clouds to improve the resource efficiency
    Jiuchen Shi, Kaihua Fu, Quan Chen, Changpeng Yang, Pengfei Huang, Mosong Zhou, Jieru Zhao, Chen Chen, and Minyi Guo
    In Proceedings of the ACM Symposium on Cloud Computing, 2022

2021

  1. ICDCS
    Communication-Efficient Federated Learning with Adaptive Parameter Freezing
    Chen Chen, Hong Xu, Wei Wang, Baochun Li, Bo Li, Li Chen, and Gong Zhang
    In Proceedings of the IEEE International Conference on Distributed Computing Systems, 2021
  2. IJCNN
    Two-dimensional learning rate decay: Towards accurate federated learning with non-iid data
    Kaiwei Mo, Chen Chen, Jiamin Li, Hong Xu, and Chun Jason Xue
    In Proceedings of the International Joint Conference on Neural Networks, 2021

2020

  1. SoCC
    Semi-Dynamic Load Balancing: Efficient Distributed Learning in Non-Dedicated Environments
    Chen Chen, Qizhen Weng, Wei Wang, Baochun Li, and Bo Li
    In Proceedings of the ACM Symposium on Cloud Computing, 2020
  2. SC
    Metis: Learning to schedule long-running applications in shared container clusters at scale
    Luping Wang, Qizhen Weng, Wei Wang, Chen Chen, and Bo Li
    In Proceedings of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2020

2019

  1. INFOCOM
    Round-robin synchronization: Mitigating communication bottlenecks in parameter servers
    Chen Chen, Wei Wang, and Bo Li
    In Proceedings of the IEEE Conference on Computer Communications, 2019

2018

  1. SoCC Poster
    Fast distributed deep learning via worker-adaptive batch sizing
    Chen Chen, Qizhen Weng, Wei Wang, Baochun Li, and Bo Li
    In Proceedings of the ACM symposium on cloud computing, 2018
  2. INFOCOM
    Performance-Aware Fair Scheduling: Exploiting Demand Elasticity of Data Analytics Jobs
    Chen Chen, Wei Wang, and Bo Li
    In Proceedings of the IEEE Conference on Computer Communications, 2018

2017

  1. ICDCS
    Speculative Slot Reservation: Enforcing Service Isolation for Dependent Data-Parallel Computations
    Chen Chen, Wei Wang, and Bo Li
    In Proceedings of the IEEE International Conference on Distributed Computing Systems, 2017
  2. INFOCOM
    Cluster fair queueing: Speeding up data-parallel jobs with delay guarantees
    Chen Chen, Wei Wang, Shengkai Zhang, and Bo Li
    In Proceedings of the IEEE Conference on Computer Communications, 2017

2016

  1. ICC
    Software-defined inter-domain routing revisited
    Chen Chen, Bo Li, Dong Lin, and Baochun Li
    In Proceedings of the IEEE International Conference on Communications, 2016