Publications
2025
[Arxiv] MIRAGE: KV Cache Optimization through Parameter Remapping for Multi-tenant LLM Serving PDF
Ruihao Li*, Shagnik Pal*, Vineeth Narayan Pullu, Prasoon Sinha, Jeeho Ryoo, Lizy K. John, and Neeraja J. Yadwadkar. (*= equal contribution)
[J2] Old is Gold: Optimizing Single-threaded Applications with ExGen-Malloc PDF
Ruihao Li, Lizy K. John, and Neeraja J. Yadwadkar.
IEEE Computer Architecture Letters (CAL).
[J1] Performance Implications of Pipelining the Data Transfer in CPU-GPU Heterogeneous Systems PDF
Ruihao Li, Bagus Hanindhito, Sanjana Yadav, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Neeraja J. Yadwadkar, and Lizy K. John.
ACM Transactions on Architecture and Code Optimization (TACO).
[C11] CADOSys: Cache Aware Design Space Optimization for Spatial ML Accelerators PDF
Ruihao Li, Siyuan Ma, Krishna Kavi, Gayatri Mehta, Neeraja J. Yadwadkar, and Lizy K. John.
Great Lakes Symposium on VLSI (GLSVLSI 2025).
[W2] The Utilization Fallacy and the Real Drivers of Carbon-Efficient Inference Serving PDF
Prasoon Sinha, Dimitrios Liakopoulos, Ruihao Li, and Neeraja J. Yadwadkar.
Workshop on Sustainable Computer Systems (HotCarbon 2025)
2024
[C10] BLQ: Light-Weight Locality-Aware Runtime for Blocking-Less Queuing PDF
Qinzhe Wu, Ruihao Li, Jonathan Beard, and Lizy John.
ACM SIGPLAN 33rd International Conference on Compiler Construction (CC 2024).
2023
[C9] Performance Implications of Async Memcpy and UVM: A Tale of Two Data Transfer Modes PDF
Ruihao Li, Sanjana Yadav, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Neeraja J. Yadwadkar, and Lizy K. John.
2023 IEEE International Symposium on Workload Characterization (IISWC 2023).
[C8] HLSDataset: Open-Source Dataset for ML-Assisted FPGA Design using High Level Synthesis PDF
Zhigang Wei, Aman Arora, Ruihao Li, and Lizy K. John.
34th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP 2023).
[C7] NextGen-Malloc: Giving Memory Allocator Its Own Room in the House PDF
Ruihao Li, Qinzhe Wu, Krishna Kavi, Gayatri Mehta, Neeraja J. Yadwadkar, and Lizy K. John.
HotOS XIX: The 19th Workshop on Hot Topics in Operating Systems (HotOS 2023).
2022
[W1] Performance Impact of NVMe-Over-TCP on HDFS Workloads PDF
Nikita Sharma, Ruihao Li, Qinzhe Wu, and Lizy Kurian John.
First International Workshop on Intelligent and Adaptive Edge-Cloud Operations and Services (Intel4EC, in conjunction with UCC 2022)
[C6] SPAMeR: Speculative Push for Anticipated Message Requests in Multi-Core Systems PDF
Qinzhe Wu, Ashen Ekanayake, Ruihao Li, Jonathan Beard, and Lizy K. John.
51st International Conference on Parallel Processing (ICPP 2022).
[C5] Hardware-aware 3D Model Workload Selection and Characterization for Graphics and ML Applications PDF
Ruihao Li, Aman Arora, Sikan Li, Qinzhe Wu, and Lizy K. John.
The 23rd International Symposium on Quality Electronic Design (ISQED 2022)
2021
[C4] Wave-PIM: Accelerating Wave Simulation Using Processing-in-Memory PDF
Bagus Hanindhito*, Ruihao Li*, Dimitrios Gourounas, Arash Fathi, Karan Govil, Dimitar Trenev, Andreas Gerstlauer, and Lizy K. John. (*= equal contribution)
50th International Conference on Parallel Processing (ICPP 2021).
[C3] Performance characterization of. net benchmarks PDF
Aniket Deshmukh*, Ruihao Li*, Rathijit Sen, Robert R Henry, Monica Beckwith, Gagan Gupta (*= equal contribution)
2021 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2021).
[C2] Improving CNN Performance on FPGA Clusters through Topology Exploration PDF
Ruihao Li, Ke Liu, Xiaojun Cai, Mengying Zhao, Lizy K. John, and Zhiping Jia.
The 36th ACM/SIGAPP Symposium On Applied Computing (SAC 2021).
2020
[C1] Accelerating Force-directed Graph Layout with Processing-in-Memory Architecture PDF
Ruihao Li, Shuang Song, Qinzhe Wu, and Lizy K. John.
2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC 2020).
[P1] Maximizing CNN Throughput on FPGA Clusters (Poster)
Ruihao Li, Ke Liu, Mengying Zhao, Zhaoyan Shen, Xiaojun Cai, Zhiping Jia.
28th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA 2020).