Min Ma

My work focuses on research and development in automatic speech recognition, large language modeling, multimodal multilingual modeling, and related areas.
Authored Publications
Multimodal Modeling for Spoken Language Identification
Shikhar Bharadwaj
Ankur Bapna
Sriram (Sri) Ganapathy
Vera Axelrod
Sid Dalmia
Wei Han
Yu Zhang
Sandy Ritchie
Proceedings of 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024) (2024)
Label Aware Speech Representation Learning For Language Identification
Ankur Bapna
Shikhar Bharadwaj
Sriram Ganapathy
Vera Axelrod
Wei Han
Proceedings of Interspeech 2023, pp. 5351-5355
MASR: Multi-Label Aware Speech Representation
Anjali Raj
Shikhar Bharadwaj
Sriram Ganapathy
2023 Workshop on Automatic Speech Recognition and Understanding (ASRU) (2023)
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Sebastian Ruder
Mihir Sanjay Kale
Shruti Rijhwani
Jean-Michel Sarr
Cindy Wang
John Wieting
Christo Kirov
Dana L. Dickinson
Bidisha Samanta
Connie Tao
David Adelani
Vera Axelrod
Reeve Ingle
Dmitry Panteleev
Findings of the Association for Computational Linguistics: EMNLP 2023, Association for Computational Linguistics, Singapore, pp. 1856-1884
XTREME-S: Evaluating Cross-lingual Speech Representations
Ankur Bapna
Clara E. Rivera
Mihir Sanjay Kale
Sandy Ritchie
Sebastian Ruder
Simran Khanuja
Ye Jia
Yu Zhang
Proc. Interspeech 2022
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
Alexis Conneau
Simran Khanuja
Yu Zhang
Vera Axelrod
Siddharth Dalmia
Clara Rivera
Ankur Bapna
IEEE Spoken Language Technology Workshop (SLT) (2022)
Improving Streaming ASR with Non-streaming Model Distillation on Unsupervised Data
Chung-Cheng Chiu
Liangliang Cao
Ruoming Pang
Thibault Doutre
Wei Han
Yu Zhang
Zhiyun Lu
ICASSP 2021 (to appear)
Transliteration based approaches to improve code-switched speech recognition performance
Jesse Emond
Bhuvana Ramabhadran
Pedro Moreno
IEEE Spoken Language Technology Workshop (SLT) (2018), pp. 448-455