IBM®
Skip to main content
    United States [change]    Terms of use
 
 
 
    Home    Products    Services & solutions    Support & downloads    My account    
IBM Research

(none)

Activity Report


  Activity Report, January - June 2002



Highlights

Information Statistics and Communication Complexity Lower Bounds
Ziv Bar-Yossef, T. S. Jayram, Ravi Kumar, and D. Sivakumar present a new method for proving strong lower bounds in communication complexity. This method is based on the notion of the conditional information cost of a general communication protocol, which measures the amount of information revealed by the transcript about the inputs. While conditional information cost is a lower bound on the communication complexity, we show that it also admits a direct sum theorem. The direct sum decomposition reduces the task to that of proving lower bounds for simple problems (such as AND of two bits). For the latter, they develop a novel technique based on Hellinger distance and its generalizations. Their paradigm leads to two main results:
(1) An improved lower bound for the multi-party set-disjointness problem in the general communication model, and a nearly optimal lower bound in the one-way communication model. Consequently, approximating the k-th frequency moment for real k > 2 in the data stream model requires at least n^{1 - 2/k} space, resolving a conjecture of Alon, Matias, and Szegedy [STOC 96].
(2) A lower bound for the L_p approximation problem in the general communication model. This solves an open problem of Saks and Sun [STOC 02]. Consequently, approximating the L_p norms for p > 2 to within a factor of n^{epsilon} in the data stream model with constant number of passes requires at least n^{1 - 4epsilon - 2/p} space.
A paper will appear in the 43rd Symposium on Foundations of Computer Science (FOCS 2002).

Algorithm to Count Distinct Elements in a Data Stream
The zeroth-frequency moment of a sequence of elements is the number of distinct elements that occur in the sequence. Counting the number of distinct elements is a fairly fundamental problem in databases. Ziv Bar-Yossef, T. S. Jayram, Ravi Kumar, D. Sivakumar, and Luca Trevisan (UC Berkeley) present three space- and time-efficient algorithms for approximating the zeroth-frequency moment of a sequence of elements in the data stream model. Their algorithms improve upon known algorithms for this problem and offer a spectrum of time/space tradeoffs. A paper will appear in the 6th Intl. Workshop on Randomization and Approximation Techniques in Computer Science (RANDOM 2002).

Faster Approximation Algorithms for the Minimum Latency Problem
Aaron Archer (Cornell University) and David Williamson have devised faster approximation algorithms for the minimum latency problem (MLP). The MLP is a version of the traveling salesman problem in which one wants to minimize the average time each node is visited, rather than the total time for visiting all nodes. All known approximation algorithms for MLP call a subroutine for the k-MST problem, which in turn calls a subroutine for the prize-collecting Steiner tree problem (PCST). The best previously known algorithms achieve an approximation factor of 10.77 with O(n^2 log n) PCST calls, or 7.18 + x with O(n^{1/x}) PCST calls. They achieve a factor of 9.28 with O(n log n) PCST calls. To achieve a similar factor, previous algorithms required at least Omega(n^4) times more PCST calls. A paper with this algorithm was submitted to the 2003 ACM-SIAM Symposium on Discrete Algorithms (SODA).

Awards and Honors
R. Fagin was selected as a "Highly Cited Researcher" by ISI. Individuals cited by ISI are the most highly cited within their field during 1981-1999, and comprise less than one-half of one percent of all publishing researchers.

R. Fagin received an IBM Outstanding Innovation Award for Aggregation Algorithms for Middleware.

R. Kumar, S. Rajagopalan, and A. Tomkins received IBM Outstanding Innovation Awards for Web Graph Structure.

D. Gibson, R. Kumar, S. Rajagopalan, and A. Tomkins received IBM Research Division Awards for HITS/CLEVER.

N. Megiddo, Invention Achievement Award, Eleventh Plateau, Jan. 2002

Publications and Conferences
M. Ajtai, The invasiveness of off-line memory checking, Proc. 34th ACM Symp. on Theory of Computing, May 2002, pp. 504-513.

M. Ajtai, T. S. Jayram, R. Kumar, and D. Sivakumar, Approximate counting of inversions in a data stream, Proc. 34th ACM Symp. on Theory of Computing, May 2002, pp. 370-379.

M. Ajtai, R. Kumar, and D. Sivakumar, Sampling short lattice vectors and the closest lattice vector problem, Proc. 17th IEEE Conf. on Computational Complexity, May 2002, pp. 53-57.

T. Asano and D. P. Williamson, Improved approximation algorithms for MAX SAT, Journal of Algorithms 42 (2002), pp. 173-202.

Z. Bar-Yossef, T. S. Jayram, R. Kumar, and D. Sivakumar, Information theory methods in communication complexity, Proc. 17th IEEE Conf. on Computational Complexity, May 2002, pp. 93-102.

Z. Bar-Yossef, R. Kumar, and D. Sivakumar, Reductions in streaming algorithms, with an application to counting triangles in graphs, Proc. 13th ACM-SIAM Symp. on Discrete Algorithms, Jan. 2002.

T. Batu, S. Dasgupta, R. Kumar, and R. Rubinfeld, The complexity of estimating the entropy, Proc. 34th ACM Symp. on Theory of Computing, May 2002, pp. 678-687; also, Proc. 17th IEEE Conf. on Computational Complexity, May 2002, p. 17.

C. Dwork and L. Stockmeyer, 2-round zero knowledge and proof auditors, Proc. 34th ACM Symp. on Theory of Computing, May 2002.

R. Fagin, Combining fuzzy information: an overview, SIGMOD Record 31, 2, (2002), pp. 109-118.

R. Fagin, A. Karlin, J. Kleinberg, P. Raghavan, S. Rajagopalan, R. Rubinfeld, M. Sudan, and A. Tomkins, Random walks with "back buttons", Annals of Applied Probability 11, 3 (2001), pp. 810-862.

C.-T. Ho and L. Stockmeyer, A new approach to fault-tolerant wormhole routing in mesh-connected parallel computers, Proc. Intl. Parallel and Distributed Processing Symp., April 2002.

J. Rao, C. Zhang, G. Lohman, and N. Megiddo, Automating physical database design in a parallel database, ACM SIGMOD/PODS, June 2002.

R. Ravi and D. P. Williamson, Erratum: An approximation algorithm for minimum-cost vertex-connectivity problems, Algorithmica 34 (2002), pp. 98-107.

L. Stockmeyer and D. S. Modha, Links between complexity theory and constrained block coding, IEEE Trans. on Information Theory, 48 (2002), pp. 59-88.

D. P. Williamson, The primal-dual method for approximation algorithms, Mathematical Programming 91 B (2002), pp. 447-478.

Patents
M. Ajtai, R.C. Burns, R. Fagin, and L.J. Stockmeyer, System and Method for Differential Compression of Data from a Plurality of Binary Sources, Issued as Patent 6374250 in US, 04/16/2002.

S. Chakrabarti, B.E. Dom, D.A. Gibson, P. Raghavan, S. Rajagopalan, S. Ravikumar, and A.S. Tomkins, Method for Interactively Creating an Information Database Including Preferred Information Elements, Such As Preferred-Authority, World Wide Web Pages, Issued as Patent 6336112 in US, 01/01/2002.

B.G. Lindsay, G.S. Manku, and S. Rajagopalan, Single Pass Space Efficient System and Method for Generating an Approximate Quantile in a Data Set Having an Unknown Size, Issued as Patent 6343288 in US, 01/29/2002.

Invited Talks

  • R. Kumar
    "Simple exponential algorithms for SVP and CVP," Contemporary Methods in Cryptography, Institute for Pure and Applied Mathematics, UCLA, Jan. 2002.


Presentations and Seminars
  • R. Fagin
    "Easier ways to win logical games,"
    Symposium on the Effectiveness of Logic in Computer Science, Saarbruecken, Germany, March 2002.
    ARC, March 2002.
  • R. Kumar
    "Approximating the number of inversions in a data stream,"
    Cornell University, Feb. 2002.
    "Rank aggregation methods for the Web,"
    NEC Research Institute, Feb. 2002.
  • T. S. Jayram
    "Algorithms for counting inversions,"
    University of Maryland, Feb. 2002.
    "An information statistics approach to data stream and communication complexity,"
    UC Berkeley, May 2002.
    Workshop on Branching Programs, McGill University, May 2002.
  • D. Williamson
    "Approximation algorithms for MAX 3-CUT and other problems via complex semidefinite programming,"
    Sixth Aussois Workshop on Combinatorial Optimization, Jan. 2002.
    Carnegie Mellon University, March 2002.


Committee Work
  • R.Fagin
    Program committee member, ACM Symposium on Principles of Database Systems (PODS), 2002.
    Program committee member, Workshop on Distributed Data and Structures, 2002.
  • R. Kumar
    Program committee member, 2nd International Workshop on Web Dynamics, 2002.
  • T. S. Jayram
    Review panel member, NSF, Washington D.C.
  • D. Williamson
    Program committee member, 9th International Conference on Integer Programming and Combinatorial Optimization (IPCO).


Editorships


External Positions


Miscellaneous
N. Megiddo taught the course Network and Integer Programming (MS&E 212) at Stanford University.

Personnel

Transitions
Ziv Bar-Yossef joined the department, July 2002. He is a recent graduate of UC Berkeley.
Jim Hafner is on a two-year leave to work with the Storage Attachment project at ARC.

Organization
David Williamson (Manager) CS Principles and Methodologies
Sally Floyd (Administrative Assistant)
Jim Hafner (On Leave)

Andrew Tomkins (Manager) Information Management Principles
Ziv Bar-Yossef
David Gibson
Ravi Kumar
Kevin McCurley
Sridhar Rajagopalan
John Tomlin (Assignee)

Ron Fagin (Manager) Foundations of Computer Science
Miki Ajtai
T. S. Jayram
Nimrod Megiddo
D. Sivakumar
Larry Stockmeyer
  About IBM  |  Privacy  |  Legal  |  Contact

    About IBMPrivacyContact