Research
My current research interests are databases and data warehousing, with an emphysis on improving the performance and usability of databases and data warehouse systems. My PhD work at the University of Illinois at Urbana-Champaign focused on large-scale integration for deep Web data.
Selected Publications
- BIwTL: A Business Information Warehouse Toolkit and Language for Warehousing Simplification and Automation. B. He, R. Wang, Y. Chen, A. Lelescu, and J. Rhodes. Proceedings of the 2007 ACM SIGMOD Conference (SIGMOD 2007), Beijing, China, June 2007. [PDF]
- Accessing the Deep Web: A Survey. B. He, M. Patel, Z. Zhang, and K. C.-C. Chang. Communications of the ACM (CACM), 50(5): 94-101, May 2007. [PDF]
- Automatic Complex Schema Matching across Web Query Interfaces: A Correlation Mining Approach. B. He and K. C.-C. Chang. ACM Transactions on Database Systems (TODS), 31(1):346-395, March 2006. [PDF]
- Light-weight Domain-based Form Assistant: Querying Web Databases On the Fly. Z. Zhang, B. He, and K. C.-C. Chang. In Proceedings of the 31st Very Large Data Bases Conference (VLDB 2005), Trondheim, Norway, August 2005. [PDF]
- Making Holistic Schema Matching Robust: An Ensemble Approach. B. He and K. C.-C. Chang. In Proceedings of the 2005 ACM SIGKDD Conference (KDD 2005) (Full Paper), Chicago, Illinois, August 2005. [PDF]
- Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. K. C.-C. Chang, B. He, and Z. Zhang. In Proceedings of the 2nd Conference on Innovative Data Systems Research (CIDR 2005), Asilomar, California, January 2005. [PDF]
- A Holistic Paradigm for Large Scale Schema Matching. B. He and K. C.-C. Chang. SIGMOD Record, 33(4):20-25, December 2004. Invited paper. [PDF]
- Structured Databases on the Web: Observations and Implications. K. C.-C. Chang, B. He, C. Li, M. Patel, and Z. Zhang. SIGMOD Record, 33(3):61-70, September 2004. [PDF]
- Discovering Complex Matchings across Web Query Interfaces: A Correlation Mining Approach. B. He, K. C.-C. Chang, and J. Han. In Proceedings of the 2004 ACM SIGKDD Conference (KDD 2004) (Full Paper), Seattle, Washington, August 2004. [PDF]
- Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax. Z. Zhang, B. He, and K. C.-C. Chang. In Proceedings of the 2004 ACM SIGMOD Conference (SIGMOD 2004), Paris, France, June 2004. [PDF]
- Statistical Schema Matching across Web Query
Interfaces. B. He and K. C.-C. Chang. In Proceedings of the 2003
ACM SIGMOD Conference (SIGMOD 2003), San Diego, California, June 2003.
[PDF]
More papers
Professional Activities
Honors
- Outstanding Technical Achievement Award, IBM, 2008
- Runner Up of ASR Best Paper Award, IBM, 2008
- Invention Plateau Award, IBM, 2008
- Invention Achievement Awards, IBM, 2008
- Invention Achievement Awards, IBM, 2007
- Bravo! Award, IBM, 2006
- Innovation Matters for Business Insights Workbench, IBM, 2006
- Winner of ComputerWorld Horizon Award for
Business Insights Workbench, 2006
Filed Patents
- A General Data Filtering and Optimization Framework for ETL processes. R. Wang, B. He, and Y. Chen. August 2008.
- A Simplified ER Model to Access Structured Data. B. He, A. Behal, and Y. Chen. August 2008.
- Adaptive Aggregation: Improving the Performance of Grouping and Duplicate Elimination By Avoiding Unnecessary Disk Access. B. He and Y. Chen. March 2008.
- A Weighted Hasse Diagram Approach to Optimal Extraction and Transformation in an ETL Process. B. He, Y. Chen, and A. Lelescu. January 2008.
- Efficient Update Schemes for Large Volume Data Updates in Data Warehouses. B. He and Y. Chen. August 2007.
- A Declarative Language based Technique for Warehousing Simplification and Automation. B. He, Y. Chen, A. Lelescu, J. Rhodes, and R. Wang. April 2007.
- Failure Recovery and Error Correction Techniques for Data Loading in Large Information Warehouses. B. He, Y. Chen, A. Lelescu, J. Rhodes, and R. Wang. March 2007.
- Method and System for Extracting Web Query Interfaces. K. C.-C. Chang, Z. Zhang, and B. He. August 2004.