Amir Yazdanbakhsh

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Haoran You

Yipin Guo

Yichao Fu

Wei Zhou

Huihong Shi

Xiaofan Zhang

Souvikk Kundu

Amir Yazdanbakhsh

Yingyan Lin

38th Annual Conference on Neural Information Processing Systems (NeurIPS) (2024)

GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation

Amir Yazdanbakhsh

Mangpo Phothilimthana

Ondrej Sykora

Thirimadura C. Yasendra Mendis

2022 IEEE International Symposium on Workload Characterization (2022) (to appear)

Accelerating Attention through Gradient-Based Learned Runtime Pruning

Amir Yazdanbakhsh

Hadi Esmaeilzadeh

Mingu Kang

Soroush Ghodrati

Zheng Li

ISCA (2022) (to appear)

Data-Driven Offline Optimization for Architecting Hardware Accelerators

Amir Yazdanbakhsh

Aviral Kumar

Kevin Swersky

Milad Hashemi

Sergey Levine

International Conference on Learning Representations 2022 (to appear)

Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask

Amir Yazdanbakhsh

Sheng-Chun Kao

Shivani Agrawal

Suvinay Subramanian

Tushar Krishna

Utku Evci

(2022) (to appear)

An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks

Amir Yazdanbakhsh

Berkin Akin

Kiran K Seshadri

ArXiv, https://arxiv.org/abs/2102.10423 (2021)

Efficient Imitation Learning with Local Trajectory Optimization

Jialin Song

Wenjie Jiang

Amir Yazdanbakhsh

Ebrahim Songhori

Anna Darling Goldie

Navdeep Jaitly

Azalia Mirhoseini

ICML 2020 Workshop on Inductive Biases, Invariances and Generalization in RL (2020)

Apollo: Transferable Architecture Exploration

Albin Jones

Amir Yazdanbakhsh

Berkin Akin

Christof Angermueller

James Pierce Laudon

Kevin Swersky

Milad Hashemi

Ravi Narayanaswami

Sat Chatterjee

Yanqi Zhou

ML for Systems Workshop at NeurIPS 2020

Mixed-Signal Charge-Domain Acceleration of Deep Neural Networks through Interleaved Bit-Partitioned Arithmetic

Soroush Ghodrati

Hardik Sharma

Sean Kinzer

Amir Yazdanbakhsh

Jongse Park

Nam Sung Kim

Doug Burger

Hadi Esmaeilzadeh

29th International Conference on Parallel Architectures and Compilation Techniques (PACT), IEEE (2020)

ReLeQ: A Reinforcement Learning Approach for Automatic Deep Quantization of Neural Networks

Ahmed Taha Elthakeb

Prannoy Pilligundla

Fatemeh Mireshghallah

Amir Yazdanbakhsh

Hadi Esmaeilzadeh

IEEE Micro (2020)

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Amir Yazdanbakhsh

Research Areas

Join us

Amir Yazdanbakhsh

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us