Discriminative Log-Linear Grammars with Latent Variables

Dan Klein
Advances in Neural Information Processing Systems 20 (NIPS), MIT Press, Cambridge, MA (2008), pp. 1153-1160

Abstract

We demonstrate that log-linear grammars with latent variables can be practically trained using discriminative methods. Central to efficient discriminative training is a hierarchical pruning procedure which allows feature expectations to be efficiently approximated in a gradient-based procedure. We compare L1 and L2 regularization and show that L1 regularization is superior, requiring fewer iterations to converge, and yielding sparser solutions. On full-scale treebank parsing experiments, the discriminative latent models outperform both the comparable generative latent models as well as the discriminative non-latent baselines.

Research Areas

×