Dong Xin

Dong Xin

Authored Publications
Google Publications
Other Publications
Sort By
  • Title
  • Title, descending
  • Year
  • Year, descending
    Crawling deep web entity pages
    Yeye He
    Venkatesh Ganti
    Sriram Rajaraman
    Nirav Shah
    WSDM(2013), pp. 355-364
    Preview
    Graph cube: on warehousing and OLAP multidimensional networks
    Peixiang Zhao
    Xialolei Li
    Jiawei Han
    SIGMOD - Proceedings of the 2011 International Conference on Management of Data, ACM, New York, NY
    Preview abstract We consider extending decision support facilities toward large sophisticated networks, upon which multidimensional attributes are associated with network entities, thereby forming the so-called multidimensional networks. Data warehouses and OLAP (Online Analytical Processing) technology have proven to be effective tools for decision support on relational data. However, they are not well equipped to handle the new yet important multidimensional networks. In this paper, we introduce Graph Cube, a new data warehousing model that supports OLAP queries effectively on large multidimensional networks. By taking account of both attribute aggregation and structure summarization of the networks, Graph Cube goes beyond the traditional data cube model involved solely with numeric value based group-by’s, thus resulting in a more insightful and structure-enriched aggregate network within every possible multidimensional space. Besides traditional cuboid queries, a new class of OLAP queries, crossboid, is introduced that is uniquely useful in multidimensional networks and has not been studied before. We implement Graph Cube by combining special characteristics of multidimensional networks with the existing well-studied data cube techniques. We perform extensive experimental studies on a series of real world data sets and Graph Cube is shown to be a powerful and efficient tool for decision support on large multidimensional networks. View details
    A framework for robust discovery of entity synonyms
    Kaushik Chakrabarti
    Surajit Chaudhuri
    Tao Cheng
    KDD(2012), pp. 1384-1392
    SEISA: set expansion by iterative similarity aggregation
    Yeye He
    WWW(2011), pp. 427-436
    EntityTagger: automatically tagging entities with descriptive phrases
    Kaushik Chakrabarti
    Surajit Chaudhuri
    Tao Cheng
    WWW (Companion Volume)(2011), pp. 19-20
    Fast personalized PageRank on MapReduce
    Bahman Bahmani
    Kaushik Chakrabarti
    SIGMOD Conference(2011), pp. 973-984
    Keyword++: A Framework to Improve Keyword Search Over Entity Databases
    Yeye He
    Venkatesh Ganti
    PVLDB, 3(2010), pp. 711-722
    Query portals: dynamically generating portals for entity-oriented web queries
    Sanjay Agrawal
    Kaushik Chakrabarti
    Surajit Chaudhuri
    Venkatesh Ganti
    Arnd Christian König
    SIGMOD Conference(2010), pp. 1171-1174
    Promotion Analysis in Multi-Dimensional Space
    Tianyi Wu
    Qiaozhu Mei
    Jiawei Han
    PVLDB, 2(2009), pp. 109-120
    Mining Document Collections to Facilitate Accurate Approximate Entity Matching
    Surajit Chaudhuri
    Venkatesh Ganti
    PVLDB, 2(2009), pp. 395-406
    Detecting gene clusters under evolutionary constraint in a large number of genomes
    Xu Ling
    Xin He
    Bioinformatics, 25(2009), pp. 571-577
    Top-down mining of frequent closed patterns from very high dimensional data
    Hongyan Liu
    Xiaoyu Wang
    Jun He
    Jiawei Han
    Zheng Shao
    Inf. Sci., 179(2009), pp. 899-924
    Exploiting web search to generate synonyms for entities
    Surajit Chaudhuri
    Venkatesh Ganti
    WWW(2009), pp. 151-160
    Exploiting web search engines to search structured databases
    Sanjay Agrawal
    Kaushik Chakrabarti
    Surajit Chaudhuri
    Venkatesh Ganti
    Arnd Christian König
    WWW(2009), pp. 501-510
    Efficiently Identifying Max-Gap Clusters in Pairwise Genome Comparison
    Xu Ling
    Xin He
    Jiawei Han
    Journal of Computational Biology, 15(2008), pp. 593-609
    P-Cube: Answering Preference Queries in Multi-Dimensional Space
    Jiawei Han
    ICDE(2008), pp. 1092-1100
    ARCube: supporting ranking aggregate queries in partially materialized data cubes
    Tianyi Wu
    Jiawei Han
    SIGMOD Conference(2008), pp. 79-92
    An efficient filter for approximate membership checking
    Kaushik Chakrabarti
    Surajit Chaudhuri
    Venkatesh Ganti
    SIGMOD Conference(2008), pp. 805-818
    Integrating OLAP and Ranking: The Ranking-Cube Methodology
    Jiawei Han
    ICDE Workshops(2007), pp. 253-256
    DataScope: Viewing Database Contents in Google Maps' Way
    Tianyi Wu
    Xiaolei Li
    Jiawei Han
    Jacob Lee
    Ricardo Redder
    VLDB(2007), pp. 1314-1317
    Computing Iceberg Cubes by Top-Down and Bottom-Up Integration: The StarCubing Approach
    Jiawei Han
    Xiaolei Li
    Zheng Shao
    Benjamin W. Wah
    IEEE Trans. Knowl. Data Eng., 19(2007), pp. 111-126
    Optimization of Bounds in Temporal Flexible Plans with Dynamic Controllability
    Benjamin W. Wah
    International Journal on Artificial Intelligence Tools, 16(2007), pp. 17-44
    Progressive and selective merge: computing top-k with ad-hoc ranking functions
    Jiawei Han
    Kevin Chen-Chuan Chang
    SIGMOD Conference(2007), pp. 103-114
    Frequent pattern mining: current status and future directions
    Jiawei Han
    Hong Cheng
    Xifeng Yan
    Data Min. Knowl. Discov., 15(2007), pp. 55-86
    Semantic annotation of frequent patterns
    Qiaozhu Mei
    Hong Cheng
    Jiawei Han
    ChengXiang Zhai
    TKDD, 1(2007)
    On compressing frequent patterns
    Jiawei Han
    Xifeng Yan
    Hong Cheng
    Data Knowl. Eng., 60(2007), pp. 5-29
    Ranking objects based on relationships
    Kaushik Chakrabarti
    Venkatesh Ganti
    Jiawei Han
    SIGMOD Conference(2006), pp. 371-382
    Extracting redundancy-aware top-k patterns
    Hong Cheng
    Xifeng Yan
    Jiawei Han
    KDD(2006), pp. 444-453
    Generating semantic annotations for frequent patterns with context analysis
    Qiaozhu Mei
    Hong Cheng
    Jiawei Han
    ChengXiang Zhai
    KDD(2006), pp. 337-346
    Mining Interesting Patterns from Very High Dimensional Data: A Top-Down Row Enumeration Approach
    Hongyan Liu
    Jiawei Han
    Zheng Shao
    SDM(2006)
    Towards Robust Indexing for Ranked Queries
    Chen Chen
    Jiawei Han
    VLDB(2006), pp. 235-246
    C-Cubing: Efficient Computation of Closed Cubes by Aggregation-Based Checking
    Zheng Shao
    Jiawei Han
    Hongyan Liu
    ICDE(2006), pp. 4
    Top-Down Mining of Interesting Patterns from Very High Dimensional Data
    Hongyan Liu
    Jiawei Han
    Zheng Shao
    ICDE(2006), pp. 114
    Discovering interesting patterns through user's interactive feedback
    Xuehua Shen
    Qiaozhu Mei
    Jiawei Han
    KDD(2006), pp. 773-778
    Answering Top-k Queries with Multi-Dimensional Selections: The Ranking Cube Approach
    Jiawei Han
    Hong Cheng
    Xiaolei Li
    VLDB(2006), pp. 463-475
    Mining Evolving Customer-Product Relationships in Multi-Dimensional Space
    Xiaolei Li
    Jiawei Han
    Xiaoxin Yin
    ICDE(2005), pp. 580-581
    Mining Compressed Frequent-Pattern Sets
    Jiawei Han
    Xifeng Yan
    Hong Cheng
    VLDB(2005), pp. 709-720
    Summarizing itemset patterns: a profile-based approach
    Xifeng Yan
    Hong Cheng
    Jiawei Han
    KDD(2005), pp. 314-323
    MM-Cubing: Computing Iceberg Cubes by Factorizing the Lattice Space
    Zheng Shao
    Jiawei Han
    SSDBM(2004), pp. 213-222
    Optimization of Bounds in Temporal Flexible Planning with Dynamic Controllability
    Benjamin W. Wah
    ICTAI(2004), pp. 40-48
    Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration
    Jiawei Han
    Xiaolei Li
    Benjamin W. Wah
    VLDB(2003), pp. 476-487
    Exploiting support vector machines in hidden Markov models for speaker verification
    Zhaohui Wu
    Yingchun Yang
    INTERSPEECH(2002)