Research
My current research interests are databases, data warehousing, and Web services, with an emphysis on improving the performance and usability of business intelligence applications. My PhD work at the University of Illinois at Urbana-Champaign focused on large-scale integration for deep Web data.
Selected Publications
- Top-K Aggregation Queries Over Large Networks. X. Yan, B. He, F. Zhu, and J. Han. Proceedings of the 2010 IEEE International Conference on Data Engineering (ICDE 2010), Long Beach, California, March 2010.
- BIwTL: A Business Information Warehouse Toolkit and Language for Warehousing Simplification and Automation. B. He, R. Wang, Y. Chen, A. Lelescu, and J. Rhodes. Proceedings of the 2007 ACM SIGMOD Conference (SIGMOD 2007), Beijing, China, June 2007. [PDF]
- Accessing the Deep Web: A Survey. B. He, M. Patel, Z. Zhang, and K. C.-C. Chang. Communications of the ACM (CACM), 50(5): 94-101, May 2007. [PDF]
- Automatic Complex Schema Matching across Web Query Interfaces: A Correlation Mining Approach. B. He and K. C.-C. Chang. ACM Transactions on Database Systems (TODS), 31(1):346-395, March 2006. [PDF]
- Light-weight Domain-based Form Assistant: Querying Web Databases On the Fly. Z. Zhang, B. He, and K. C.-C. Chang. In Proceedings of the 31st Very Large Data Bases Conference (VLDB 2005), Trondheim, Norway, August 2005. [PDF]
- Making Holistic Schema Matching Robust: An Ensemble Approach. B. He and K. C.-C. Chang. In Proceedings of the 2005 ACM SIGKDD Conference (KDD 2005) (Full Paper), Chicago, Illinois, August 2005. [PDF]
- Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. K. C.-C. Chang, B. He, and Z. Zhang. In Proceedings of the 2nd Conference on Innovative Data Systems Research (CIDR 2005), Asilomar, California, January 2005. [PDF]
- A Holistic Paradigm for Large Scale Schema Matching. B. He and K. C.-C. Chang. SIGMOD Record, 33(4):20-25, December 2004. Invited paper. [PDF]
- Structured Databases on the Web: Observations and Implications. K. C.-C. Chang, B. He, C. Li, M. Patel, and Z. Zhang. SIGMOD Record, 33(3):61-70, September 2004. [PDF]
- Discovering Complex Matchings across Web Query Interfaces: A Correlation Mining Approach. B. He, K. C.-C. Chang, and J. Han. In Proceedings of the 2004 ACM SIGKDD Conference (KDD 2004) (Full Paper), Seattle, Washington, August 2004. [PDF]
- Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax. Z. Zhang, B. He, and K. C.-C. Chang. In Proceedings of the 2004 ACM SIGMOD Conference (SIGMOD 2004), Paris, France, June 2004. [PDF]
- Statistical Schema Matching across Web Query
Interfaces. B. He and K. C.-C. Chang. In Proceedings of the 2003
ACM SIGMOD Conference (SIGMOD 2003), San Diego, California, June 2003.
[PDF]
More papers
Honors
- Runner Up of Almaden Grand Challenge Competition, IBM, 2009
- Invention Plateau Award, IBM, 2009
- Invention Achievement Awards, IBM, 2009
- Outstanding Technical Achievement Award, IBM, 2008
- Runner Up of ASR Best Paper Award, IBM, 2008
- Invention Plateau Award, IBM, 2008
- Invention Achievement Awards, IBM, 2008
- Invention Achievement Awards, IBM, 2007
- Bravo! Award, IBM, 2006
- Innovation Matters for Business Insights Workbench, IBM, 2006
- Winner of ComputerWorld Horizon Award for Business Insights Workbench, 2006
Filed Patents
- Efficient Iceberg Query Evaluation using Compressed Bitmap Index. B. He and H. Hsiao. Octobor 2009.
- An Efficient Subject-Independent Document Readership Classifier. B. He, S. Spangler, and Y. Chen. Octobor 2009.
- Efficient and Scalable Data Evolution with Column Oriented Databases. B. He and H. Hsiao. August 2009.
- Concurrency Control for Multiple ETL Processes. B. He, R. Wang, and Y. Chen. August 2009.
- Efficient ETL Schemes for Versioning Data Warehouses. B. He, Y. Chen, and S. Spangler. December 2008.
- Support Multi-Value Slice and Dice in Data Warehouses. B. He and Y. Chen. November 2008.
- A General Data Filtering and Optimization Framework for ETL processes. R. Wang, B. He, and Y. Chen. August 2008.
- A Simplified ER Model to Access Structured Data. B. He, A. Behal, and Y. Chen. August 2008.
- Adaptive Aggregation: Improving the Performance of Grouping and Duplicate Elimination By Avoiding Unnecessary Disk Access. B. He and Y. Chen. March 2008.
- A Weighted Hasse Diagram Approach to Optimal Extraction and Transformation in an ETL Process. B. He, Y. Chen, and A. Lelescu. January 2008.
- Efficient Update Schemes for Large Volume Data Updates in Data Warehouses. B. He and Y. Chen. August 2007.
- A Declarative Language based Technique for Warehousing Simplification and Automation. B. He, Y. Chen, A. Lelescu, J. Rhodes, and R. Wang. April 2007.
- Failure Recovery and Error Correction Techniques for Data Loading in Large Information Warehouses. B. He, Y. Chen, A. Lelescu, J. Rhodes, and R. Wang. March 2007.
- Method and System for Extracting Web Query Interfaces. K. C.-C. Chang, Z. Zhang, and B. He. August 2004. US Patent 7552116, Granted September 2009.