News
Apr 24, 2025 | I will attend ICLR in Singapore to present, Palu our awesome low-rank KV-Cache compression paper. Come find me for a coffee chat! |
---|---|
Mar 24, 2025 | Check out our latest work, xKV, which can compress DeepSeek MLA’s KV-Cache for another 3x. |
Mar 14, 2025 | I successfully defended my PhD final exam. What a journey! Cannot wait for the next ;). |
Mar 02, 2025 | The preprint of TeleRAG, our new paper on RAG inference acceleration, is out on ArXiv! |
Jan 22, 2025 | Our KV-Cache compression framework, Palu, is accepted to ICLR 2025! You’re welcome to use our code to make your LLM more efficient. See you in Singapore! |
Jan 10, 2025 | ![]() Drop me an email if you have opportunities! |
Jan 02, 2025 | I’m visiting Taiwan for three weeks. Hit me up if you want to chat about research! |
Dec 14, 2024 | I’m attending NeurIPs 2024 in vancouver! |
Oct 29, 2024 | Gave a talk on Palu KV-Cache compression at UW CSE research day. |