Distributed Latent Variable Models of Lexical Co-occurrences

Fernando Pereira
Tenth International Workshop on Artificial Intelligence and Statistics (2005)

Abstract

Low-dimensional representations for lexical co-occurrence data have become increasingly important in alleviating the sparse data problem inherent in natural language processing tasks. This work presents a distributed latent variable model for inducing these low-dimensional representations. The model takes inspiration from both connectionist language models [1, 16] and latent variable models [13, 9]. We give results which show that the new model significantly improves both bigram and trigram models.

Research Areas

×