This paper discusses a method to inject text when training an ASR system without the need for up sampling the text sequence to match the length of the speech sequence.
Meet the teams driving innovation
Our teams advance the state of the art through research, systems engineering, and collaboration across Google.