Jisen Li
mtilyjason
ยท
AI & ML interests
Finetuning, multimodal, quantization, agents
Recent Activity
upvoted a paper about 14 hours ago
SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving upvoted a paper about 14 hours ago
ProtocolBench: Which LLM MultiAgent Protocol to Choose? upvoted a paper about 14 hours ago
Kitty: Accurate and Efficient 2-bit KV Cache Quantization with Dynamic Channel-wise Precision Boost