Previous Project - Overview
The goal of the Avatar project is two fold: (i) to enable the discovery and extraction of structured information buried in volumes of unstructured text (such as emails, web pages, and blogs), and (ii) to exploit this information to drive the next generation of search and business intelligence applications. Ongoing research in Avatar is at the cusp of a number of disciplines ranging from search and information retrieval to machine learning, information extraction, and probabilistic databases.
The Avatar project has been superseeded by SystemT and related project within the IIIS Department. Please follow these links into our new pages:
Project Contact: Sriram Raghavan
Selected Publications
- Frederick Reiss, Sriram Raghavan, Rajasekar Krishnamurthy, Huaiyu Zhu, Shivakumar Vaithyanathan: An Algebraic Approach to Rule-Based Information Extraction. ICDE 2008: 933-942
- Yunyao Li, Rajasekar Krishnamurthy, Shivakumar Vaithyanathan, and H.V.Jagadish: Getting Work Done on the Web: Supporting Transactional Queries. SIGIR 2006.
- T.S.Jayram, Rajasekar Krishnamurthy, Sriram Raghavan, Shivakumar Vaithyanathan, and Huaiyu Zhu: Avatar Information Extraction System. IEEE Data Engineering Bulletin, May 2006.
- Douglas Burdick, Prasad Deshpande, T.S. Jayram, Raghu Ramakrishnan and Shivakumar Vaithyanathan: OLAP Over Uncertain and Imprecise Data. VLDB 2005.
- Huaiyu Zhu, Sriram Raghavan, Shivakumar Vaithyanathan, T.S.Jayram, Rajasekar Krishnamurthy, Prasad Deshpande, Rahul Gupta, Krishna P Chitrapura: Avatar: Using text-analytics to bridge the structured-unstructured divide. IBM Technical Report.

