My primary research interest is in machine learning algorithms and applications: (1) Efficient modeling. Designing neural models to efficiently learn from data; (2) Fast training. Developing algorithms to speed up deep learning training; (3) Fast inference. Developing compact neural models to meet extreme memory and latency constraints. I am also interested in developing algorithms toward true natural language understanding. For more information about me, please visit my personal homepage and Google Scholar.