Identifying Hearing Difficulty Moments in Conversational Audio
Abstract
Individuals regularly experience Hearing Difficulty Moments in everyday conversation. Identifying these moments is particularly significant for hearing assistive technology, where timely intervention is key to real-time hearing assistance. In this article, we propose and compare machine learning approaches for the temporal detection of segments containing Hearing Difficulty Moments in conversational audio. We show that audio language models, through their multimodal reasoning capabilities, achieve state-of-the-art results on this task, significantly outperforming both a simple automatic speech recognition (ASR) hotword heuristic and a more conventional fine-tuning approach with Wav2Vec, an audio-only architecture that is state-of-the-art for ASR.
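To make the hotword baseline concrete, the following is a minimal sketch of what such a heuristic might look like. The hotword list, the padding window, and the ASR output format (word, start time, end time) are all illustrative assumptions, not the paper's actual configuration:

```python
# Hypothetical hotword heuristic for flagging Hearing Difficulty Moments.
# Assumes ASR output as a list of (word, start_s, end_s) tuples; the
# hotword set and padding below are illustrative, not the paper's values.

HOTWORDS = {"what", "pardon", "huh", "sorry", "repeat"}  # assumed list

def flag_difficulty_segments(asr_words, pad_s=1.0):
    """Return (start, end) windows around hotword occurrences."""
    segments = []
    for word, start, end in asr_words:
        if word.lower().strip("?!.,") in HOTWORDS:
            # Pad each hit so the surrounding context is included.
            segments.append((max(0.0, start - pad_s), end + pad_s))
    return merge_overlaps(segments)

def merge_overlaps(segments):
    """Merge overlapping (start, end) windows into disjoint segments."""
    merged = []
    for start, end in sorted(segments):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged
```

A heuristic of this kind only fires when a listener verbalizes their difficulty, which is one reason a learned model over the raw audio can outperform it.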