Multi-Dimensional ML-Pipeline Optimization in Cost-Effective Disaggregated Datacenter. Pingyi Huo, Anusha Devulapally, et al. 2025. MICRO 2025.
NetZIP: Algorithm/Hardware Co-design of In-network Lossless Compression for Distributed Large Model Training. Jinghan Huang, Hyungyo Kim, et al. 2025. MICRO 2025.
Chameleon: Adaptive Caching and Scheduling for Many-Adapter LLM Inference Environments. Nikoleta Iliakopoulou, Jovan Stojkovic, et al. 2025. MICRO 2025.