|
Intelligent Data Storage
Organizations are under increasing regulatory scrutiny to properly
retain and manage their records. They are also under continuous cost
pressure, which is driving them to identify and move less critical
data from the primary repositories to lower-cost storage. With the
emergence of new data analysis techniques and the availability of
compute resources to make the analysis practical, organizations are
realizing that they may gain business advantage by rejuvenating old
data and making the data accessible. Many organizations are also
amassing large amounts of data generated by new instruments and
sensors.
Consequently, the data that needs to be archived is increasingly
diverse and voluminous. More importantly, we believe that the very
nature of archive is changing from what may be described as a data
landfill (where data is dumped just in case the data will be needed) to
a data gold mine, what we call nearline information systems, where the
focus is on properly retaining information and facilitating usage of the
information.
The Intelligent Data Storage (IDS) project is a collaborative effort
between the storage systems and computer science teams to develop
techniques and solutions for handling this new and emerging form of
archive. Some of our focus areas include data storage management based
on data semantics, indexing and mining of archive data, identification
of archival candidates, tiered storage optimizations, security and
privacy considerations for archive, industry-specific customizations
and regulatory compliance by leveraging our earlier work on
Fossilization™.
IBM Almaden Research - Storage Management and Solutions
|
Windsor Hsu and Shauchi Ong: WORM Storage is Not Enough, to appear
in IBM Systems Journal special issue on Compliance Management, 2007
Soumyadeb Mitra, Windsor Hsu and Marianne Winslett: Trustworthy Keyword Search for Regulatory-Compliant Record Retention,
Proceedings of
the International Conference on Very Large Data Bases (VLDB),
Sep. 2006 (Best Paper Award)
Lan Huang, Windsor Hsu, and Fengzhou Zheng: CIS: Content Immutable Storage for Trustworthy Record Keeping,
Proceedings of the
Conference on Mass Storage Systems and Technologies, May 2006
Windsor Hsu, Shauchi Ong, and Birgit Pfitzmann: Archiv mit Dokumentenversteinerung,
Zeitschrift fur Datenrecht und
Informationssicherheit, Vol. 5(2), 2005
Qingbo Zhu and Windsor Hsu: Fossilized Index: The Linchpin of Trustworthy Non-Alterable Electronic
Records, Proceedings of the ACM SIGMOD
Conference, June 2005
Ying Chen: Information Valuation for Information Lifecycle Management,
Proceedings of the IEEE International Conference on Autonomic
Computing, June 2005
Timothy Denehy and Windsor Hsu: Duplicate Management for Reference Data,
IBM Research Report RJ 10305, October 2003
Windsor Hsu:
Fossilization of Electronic Records, Stanford University
Computer Systems Colloquium, October 2005.
Fossilization: Compliant Reference Storage Solutions,
Innovation Matters, IBM Research, January 2005
|