Publications

* indicates equal contribution

2026

ICML’26 (In Submission)

HELIOS: Heterogeneous Lightweight VLA Model Serving System

Jongheon Jeong^*, Hojoon Kim^*, Rokhee Lee, Yeonhong Park, Young H. Oh, and Jae W. Lee

2026

In submission
ICML’26 (In Submission)

QUESO: Storage-Assisted Quantization Error Compensation for On-Device LLM Inference

Seong Hoon Seo, Donghyun Lee, Geonha Lee, Hojoon Kim, Yeonhong Park, and Jae W. Lee

2026

In submission
MLSys’26

AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents

Hojoon Kim, Yuheng Wu, and Thierry Tambe

2026

Acceptance Rate: 133/504 = 26.4%

2025

ICML’25 Spotlight

FlashTP: Fused, Sparsity-Aware Tensor Product for Machine Learning Interatomic Potentials

Seung Yul Lee, Hojoon Kim, Yutack Park, Dawoon Jeong, Seungwu Han, Yeonhong Park, and Jae W. Lee

2025

Spotlight (313/12,107, 2.6%)

PDF Code
OSDI’25

DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization

Yeonhong Park^*, Jake Hyun^*, Hojoon Kim, and Jae W. Lee

2025

Acceptance Rate: 52/327 = 15.9%

PDF Code