Amir Yazdanbakhsh

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Haoran You

Yipin Guo

Yichao Fu

Wei Zhou

Huihong Shi

Xiaofan Zhang

Souvikk Kundu

Amir Yazdanbakhsh

Yingyan Lin

38th Annual Conference on Neural Information Processing Systems (NeurIPS) (2024)

Data-Driven Offline Optimization for Architecting Hardware Accelerators

Amir Yazdanbakhsh

Aviral Kumar

Kevin Swersky

Milad Hashemi

Sergey Levine

International Conference on Learning Representations 2022 (to appear)

GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation

Amir Yazdanbakhsh

Mangpo Phothilimthana

Ondrej Sykora

Thirimadura C. Yasendra Mendis

2022 IEEE International Symposium on Workload Characterization (2022) (to appear)

Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask

Amir Yazdanbakhsh

Sheng-Chun Kao

Shivani Agrawal

Suvinay Subramanian

Tushar Krishna

Utku Evci

(2022) (to appear)

Accelerating Attention through Gradient-Based Learned Runtime Pruning

Amir Yazdanbakhsh

Hadi Esmaeilzadeh

Mingu Kang

Soroush Ghodrati

Zheng Li

ISCA (2022) (to appear)

An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks

Amir Yazdanbakhsh

Berkin Akin

Kiran K Seshadri

ArXiv, https://arxiv.org/abs/2102.10423 (2021)

ReLeQ: A Reinforcement Learning Approach for Automatic Deep Quantization of Neural Networks

Ahmed Taha Elthakeb

Prannoy Pilligundla

Fatemeh Mireshghallah

Amir Yazdanbakhsh

Hadi Esmaeilzadeh

IEEE Micro (2020)

Efficient Imitation Learning with Local Trajectory Optimization

Jialin Song

Wenjie Jiang

Amir Yazdanbakhsh

Ebrahim Songhori

Anna Darling Goldie

Navdeep Jaitly

Azalia Mirhoseini

ICML 2020 Workshop on Inductive Biases, Invariances and Generalization in RL (2020)

Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation

Byung Hoon Ahn

Prannoy Pilligundla

Amir Yazdanbakhsh

Hadi Esmaeilzadeh

International Conference on Learning Representations (2020) (to appear)

Apollo: Transferable Architecture Exploration

Albin Jones

Amir Yazdanbakhsh

Berkin Akin

Christof Angermueller

James Pierce Laudon

Kevin Swersky

Milad Hashemi

Ravi Narayanaswami

Sat Chatterjee

Yanqi Zhou

ML for Systems Workshop at NeurIPS 2020

Defining the technology of today and tomorrow.

Philosophy

People

Research areas

Foundational ML & Algorithms

Computing Systems & Quantum AI

Science, AI & Society

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Amir Yazdanbakhsh

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Research areas

Foundational ML & Algorithms

Computing Systems & Quantum AI

Science, AI & Society

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Amir Yazdanbakhsh

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us