The FIND Seminar is a bi-weekly seminar series that hosts cutting-edge research talks on topics related to the broad themes of Foundations of Information, Networks and Decision Systems. Talks are about 50 minutes long with time for questions and discussion.
Location: Rhodes Hall 310 and Zoom
Time: 4:15PM ET, bi-weekly on (alternating) Thursdays
Delivery format: All talks will have a live audience in Rhodes Hall 310. Until circumstances allow otherwise, external speakers will give the talk remotely via Zoom (broadcasted in RH310). Remote audience is also welcome, but in-person participation is encouraged.
Mailing list: To subscribe to the FIND seminar mailing list, email find-seminar-l-request@cornell.edu, with “join” in the subject line and a blank email body. All talks info and reminders will be sent via the mailing list.
Upcoming Talk
Title: Differentially Private Space-Efficient Algorithms for Counting Distinct Elements in the Turnstile Model
Speaker: Rachel Cummings
Date and Time: 04/17/2025, 4:15PM ET
Location: Rhodes Hall 310 and Zoom

Abstract:
The turnstile continual release model of differential privacy captures scenarios where a privacy-preserving real-time analysis is sought for a dataset evolving through additions and deletions. In typical applications of real-time data analysis, both the length of the stream T and the size of the universe |U| from which data come can be extremely large. This motivates the study of private algorithms in the turnstile setting using space sublinear in both T and |U|. In this paper, we give the first sublinear space differentially private algorithms for the fundamental problem of counting distinct elements in the turnstile streaming model. Our algorithm achieves, on arbitrary streams, O(T^{1/3}) space and additive error, and a (1+\eta)-relative approximation for all \eta \in (0,1). Our result significantly improves upon the space requirements of the state-of-the-art algorithms for this problem, which is linear, approaching the known Omega(T^{1/4}) additive error lower bound for arbitrary streams. Moreover, when a bound W on the number of times an item appears in the stream is known, our algorithm provides O(\sqrt{W}) additive error, using O(\sqrt{W}) space. This additive error asymptotically matches that of prior work which required linear space. Our results address an open question about designing low-memory mechanisms for this problem. We complement these results with a space lower bound, which shows that any algorithm that uses similar techniques must use space Omega(T^{1/3}) on arbitrary streams.
Joint work with Alessandro Epasto, Jieming Mao, Tamalika Mukherjee, Tingting Ou, and Peilin Zhong.
Schedule for Spring 2025:
A list of previous talks can be found here.