Sanjiv Kumar

Hi! I work in the area of large-scale machine learning and computer vision. You can find more information about me, including a complete list of papers, at www.sanjivk.com.
Authored Publications
    Think before you speak: Training language models with pause tokens
    Sachin Goyal
    Ziwei Ji
    Aditya Menon
    Vaishnavh Nagarajan
    International Conference on Learning Representations (ICLR) (2024)
    Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines
    Yuchen Li
    Alexandre Kirchmeyer
    Aashay Mehta
    Yilong Qin
    Andrej Risteski
    International Conference on Machine Learning (2024) (to appear)
    USTAD: Unified Single-model Training Achieving Diverse Scores for Information Retrieval
    Seungyeon Kim
    Manzil Zaheer
    Veeru Sadhanala
    Sadeep Jayasumana
    Aditya Menon
    Rob Fergus
    International Conference on Machine Learning (ICML) (2024)
    DistillSpec: Improving speculative decoding via knowledge distillation
    Yongchao Zhou
    Kaifeng Lyu
    Aditya Menon
    Afshin Rostamizadeh
    Jean-François Kagy
    Rishabh Agarwal
    International Conference on Learning Representations (ICLR) (2024)
    Teacher Guided Training: An Efficient Framework for Knowledge Transfer
    Manzil Zaheer
    Seungyeon Kim
    Chong You
    Himanshu Jain
    Rob Fergus
    International Conference on Learning Representations (ICLR) (2023)
    SOAR: Improved Indexing for Approximate Nearest Neighbor Search
    David Simcha
    Dave Dopson
    Neural Information Processing Systems (2023)