chienyu_cse_head_shot_0281_corp.jpg

Photo taken in UW Gates Center, Nov 2024

Chien-Yu Lin 林建宇

PhD Candidate
Email: cyulin at cs.washington.edu
Sampl lab
Paul G. Allen School of Computer Science and Engineering
University of Washington, Seattle


About me

I am a final-year Ph.D. student in the CSE department at the University of Washington. My research lies on the intersection of machine learning, systems, and computer architecture, with an emphasis on efficiency and acceleration. I am fortunate to be advised by Professor Luis Ceze and also work closely with Professor Baris Kasikci. I am also a member of the awesome Sampl research group.

Prior to UW, I obtained my Bachelor and Master degree from Department of Electronics Engineering, National Yang Ming Chiao Tung University (previously NCTU, now NYCU), advised by Prof. Bo-Cheng Lai.

Besides research, I enjoy outdoor activities such as tennis, hiking and (backcountry) skiing.

News

Mar 02, 2025 The preprint of TeleRAG, our new paper on RAG inference acceleration, is out on ArXiv!
Jan 22, 2025 Our KV-Cache compression framework, Palu, is accepted to ICLR 2025! You’re welcome to use our code to make your LLM more efficient. See you in Singapore!
Jan 10, 2025 :briefcase: I am on the job market for research positions in both academic and industry.
Drop me an email if you have opportunities!
Jan 02, 2025 I’m visiting Taiwan for three weeks. Hit me up if you want to chat about research!

Selected publications


* means equal contribution

  1. telerag.png
    TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval
    Chien-Yu Lin*, Keisuke Kamahori*, Yiyu Liu, and 11 more authors
    2025
  2. palu_concept.png
    Palu: Compressing KV-Cache with Low-Rank Projection
    Chi-Chih Chang*, Wei-Cheng Lin*Chien-Yu Lin*, and 7 more authors
    In Proceedings of International Conference on Learning Representations (ICLR), 2025
  3. atom2.png
    Atom: Low-Bit Quantization for Efficient and Accurate LLM Serving
    Yilong Zhao, Chien-Yu Lin, Kan Zhu, and 7 more authors
    In Proceedings of Machine Learning and Systems (MLSys), 2024
  4. FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline
    Chien-Yu Lin, Qichen Fu, Thomas Merth, and 2 more authors
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Jan 2024
    Oral (Top 2.6%)
  5. SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks
    Chien-Yu Lin*, Anish Prabhu*, Thomas Merth, and 4 more authors
    In Proceedings the 17th European Conference on Computer Vision (ECCV), Jan 2022
  6. Supporting compressed-sparse activations and weights on SIMD-like accelerator for sparse convolutional neural networks
    Chien-Yu Lin, and Bo-Cheng Lai
    In Proceedings of the 23rd Asia and South Pacific Design Automation Conference (ASP-DAC), Jan 2018