Publications
* indicates equal contribution
2026
- MLSys’26 (In Submission)
AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents (2026, in submission)
- MLSys’26 (In Submission)
QUESO: Storage-Assisted Quantization Error Compensation for On-Device LLM Inference (2026, in submission)