Jon Orwant

Jon Orwant

Sr. Engineering Director, Operations Research and Compiler Research. Prior to Google, Dr. Orwant was director of research for France Telecom and CTO of O'Reilly Media. He received his BS, MS, and PhD from MIT.
Authored Publications
Sort By
  • Title
  • Title, descending
  • Year
  • Year, descending
    A Dataset of Syntactic-Ngrams over Time from a Very Large Corpus of English Books
    Yoav Goldberg
    Second Joint Conference on Lexical and Computational Semantics, Association for Computational Linguistics, Atlanta, Georgia, USA (2013), pp. 241-247
    Preview
    Google+ Ripples: A Native Visualization of Information Flow
    Jack Hebert
    Geoffrey Borggaard
    Alison Cichowlas
    Jonathan Feinberg
    Christopher Wren
    Proceedings of the 22nd International World Wide Web Conference (2013), pp. 1389-1398
    Preview
    Programming Perl
    Tom Christiansen
    brian d foy
    Larry Wall
    O'Reilly, 1005 Gravenstein Highway North Sebastopol, CA 95472 (2012)
    Preview
    Syntactic Annotations for the Google Books Ngram Corpus
    Yuri Lin
    Jean-Baptiste Michel
    Erez Lieberman Aiden
    William Brockman
    Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Volume 2: Demo Papers (ACL '12) (2012)
    Preview
    Quantitative Analysis of Culture Using Millions of Digitized Books
    Jean-Baptiste Michel
    Yuan Kui Shen
    Aviva Presser Aiden
    Adrian Veres
    Matthew K. Gray
    The Google Books Team
    Joseph P. Pickett
    Dale Holberg
    Dan Clancy
    Steven Pinker
    Martin A. Nowak
    Erez Lieberman Aiden
    Science (2010)
    Preview abstract We constructed a corpus of digitized texts containing about 4% of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of ‘culturomics,’ focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. Culturomics extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities. View details