Sanjiv Kumar

Hi! I work in the area of large-scale machine learning and computer vision. You can find more information about me, including a complete list of papers, at www.sanjivk.com.
Authored Publications
DistillSpec: Improving speculative decoding via knowledge distillation
Yongchao Zhou
Kaifeng Lyu
Aditya Menon
Afshin Rostamizadeh
Jean-François Kagy
Rishabh Agarwal
International Conference on Learning Representations (ICLR) (2024)
Think before you speak: Training language models with pause tokens
Sachin Goyal
Ziwei Ji
Aditya Menon
Vaishnavh Nagarajan
International Conference on Learning Representations (ICLR) (2024)
Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines
Yuchen Li
Alexandre Kirchmeyer
Aashay Mehta
Yilong Qin
Andrej Risteski
International Conference on Machine Learning (ICML) (2024) (to appear)
USTAD: Unified Single-model Training Achieving Diverse Scores for Information Retrieval
Seungyeon Kim
Manzil Zaheer
Veeru Sadhanala
Sadeep Jayasumana
Aditya Menon
Rob Fergus
International Conference on Machine Learning (ICML) (2024)
Teacher Guided Training: An Efficient Framework for Knowledge Transfer
Manzil Zaheer
Seungyeon Kim
Chong You
Himanshu Jain
Rob Fergus
International Conference on Learning Representations (ICLR) (2023)