Lethe: Layer- and Time-Adaptive KV Cache Pruning for Reasoning-Intensive LLM Serving

Publication
CoRR
Jidong Zhai
Professor
(Tenured Professor, Doctoral Supervisor)