H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference Jun 21, 2025· Cong Li , Yihan Yin , Xintong Wu , Jingchen Zhu , Zhutianya Gao , Dimin Niu , Qiang Wu , Xin Si Yuan Xie , Chen Zhang , Others · 0 min read Cite Type Conference paper Publication Proceedings of the 52nd Annual International Symposium on Computer Architecture. 🏆Best Paper Award Last updated on Jun 21, 2025 Authors Yuan Xie Chair Professor Fang Professor of Engineering | Chair Professor | IEEE/ACM/AAAS Fellow ← MemTunnel: a CXL-based Rack-Scale Host Memory Pooling Architecture for Cloud Service Aug 20, 2025 TRACI: Network Acceleration of Input-Dynamic Communication for Large-Scale Deep Learning Recommendation Model Jun 21, 2025 →