TCS+ | A carbon-free dissemination of ideas across the globe.

March 4, 2020

TCS+ talk: Wednesday, March 11 — Thomas Steinke, IBM Research Almaden

The next TCS+ talk will take place this coming Wednesday, March 11th at 1:00 PM Eastern Time (10:00 AM Pacific Time, 18:00 Central European Time, 17:00 UTC). Thomas Steinke from IBM Research Almaden will speak about “Reasoning About Generalization via Conditional Mutual Information” (abstract below).

Please make sure you reserve a spot for your group to join us live by signing up on the online form. In particular, in view of the recent travel restrictions and coronavirus precautions, do not hesitate to reserve a seat even for a group of size one: there should be enough room for everyone, so don’t be shy!

As usual, for more information about the TCS+ online seminar series and the upcoming talks, or to suggest a possible topic or speaker, please see the website.

Abstract: We provide an information-theoretic framework for studying the generalization properties of machine learning algorithms. Our framework ties together existing approaches, including uniform convergence bounds and recent methods for adaptive data analysis. Specifically, we use Conditional Mutual Information (CMI) to quantify how well the input (i.e., the training data) can be recognized given the output (i.e., the trained model) of the learning algorithm. We show that bounds on CMI can be obtained from VC dimension, compression schemes, differential privacy, and other methods. We then show that bounded CMI implies various forms of generalization.

Based on joint work with Lydia Zakynthinou.

February 20, 2020

TCS+ talk: Wednesday, February 26 — Henry Yuen, University of Toronto

The next TCS+ talk will take place this coming Wednesday, February 26th at 1:00 PM Eastern Time (10:00 AM Pacific Time, 19:00 Central European Time, 18:00 UTC). Henry Yuen from University of Toronto will speak about “MIP* = RE” (abstract below).

Please make sure you reserve a spot for your group to join us live by signing up on the online form. As usual, for more information about the TCS+ online seminar series and the upcoming talks, or to suggest a possible topic or speaker, please see the website.

Abstract: MIP* denotes the class of problems that admit interactive proofs with quantum entangled provers. It has been an outstanding question to characterize the complexity of MIP*. Most notably, there was no known computable upper bound on this class.

We show that MIP* is equal to the class RE, the set of recursively enumerable languages. In particular, this shows that MIP* contains uncomputable problems. Through a series of known connections, this also yields a negative answer to Connes’ Embedding Problem from the theory of operator algebras. In this talk, I will explain the connection between Connes’ Embedding Problem, quantum information theory, and complexity theory. I will then give an overview of our approach, which involves reducing the Halting Problem to the problem of approximating the entangled value of nonlocal games.

Joint work with Zhengfeng Ji, Anand Natarajan, Thomas Vidick, and John Wright.

February 6, 2020

TCS+ talk: Wednesday, February 12 — Albert Atserias, Universitat Politecnica de Catalunya

The next TCS+ talk will take place this coming Wednesday, February 12th at 1:00 PM Eastern Time (10:00 AM Pacific Time, 19:00 Central European Time, 18:00 UTC). Albert Atserias from Universitat Politecnica de Catalunya will speak about “Automating Resolution is NP-Hard” (abstract below).

Abstract: We show that it is NP-hard to distinguish CNF formulas that have Resolution refutations of almost linear length from CNF formulas that do not even have weakly exponentially long ones. It follows from this that Resolution is not automatable in polynomial time unless P = NP, or in weakly exponential time unless ETH fails. The proof of this is simple enough that all its ideas can be explained in a talk. Along the way, I will try to explain the process of discovery that led us to the result. This is joint work with Moritz Müller.

February 1, 2020

TCS+: Spring approaches, talks resume!

The winter hiatus is nearly over, and the new season of TCS+ is about to start! Our first two talks will take place on Feb 12th and 26th, respectively. We’re pretty excited about them…

On February 12th, 1pm EST, Albert Atserias (Universitat Politecnica de Catalunya) will tell us all about how Automating Resolution is NP-Hard.
Then, on February 26th, Henry Yuen (University of Toronto) will speak about the recent proof that MIP*=RE.

Stay tuned for the official talk announcements. And this is only the beginning of the semester…

In the meantime, if you have suggestions, here is the link.

November 27, 2019

TCS+ talk: Wednesday, December 4 — Nihar Shah, CMU

The next TCS+ talk, and last of the Fall season, will take place this coming Wednesday, December 4th at 1:00 PM Eastern Time (10:00 AM Pacific Time, 19:00 Central European Time, 18:00 UTC). Nihar Shah from CMU will speak about “Battling Demons in Peer Review” (abstract below).

Abstract: Peer review is the backbone of scholarly research. It is, however, faced with a number of challenges (or “demons”) that cause unfairness to authors and degrade the overall quality of the process. This talk will present principled and practical approaches to battle these demons in peer review:

Subjectivity: How to ensure that all papers are judged by the same yardstick?

Mis-calibration: How to use ratings in presence of arbitrary or adversarial mis-calibration?

Bias: How to rigorously test for existence of (gender/fame/race/…) biases in peer review?

Strategic behavior: How to insulate peer review from strategic behavior of author-reviewers?

Noise: How to assign reviewers to papers to simultaneously ensure fair and accurate evaluations in the presence of review noise?

The work uses tools from social choice theory, statistics and learning theory, information theory, game theory and decision theory. No prior knowledge on these topics will be assumed.

Bio:
Nihar B. Shah is an Assistant Professor in the Machine Learning and Computer Science departments at Carnegie Mellon University (CMU). His research interests include statistics, machine learning, information theory, and game theory, with a focus on applications to learning from people. He is a recipient of the 2017 David J. Sakrison memorial prize from EECS Berkeley for a “truly outstanding and innovative PhD thesis”, the Microsoft Research PhD Fellowship 2014-16, the Berkeley Fellowship 2011-13, the IEEE Data Storage Best Paper and Best Student Paper Awards for the years 2011/2012, and the SVC Aiya Medal 2010, and has supervised the Best Student Paper at AAMAS 2019.

November 13, 2019

TCS+ talk: Wednesday, November 20 — Jason Li, CMU

The next TCS+ talk will take place this coming Wednesday, November 20th at 1:00 PM Eastern Time (10:00 AM Pacific Time, 19:00 Central European Time, 18:00 UTC). Jason Li from CMU will speak about “The Karger-Stein Algorithm is Optimal for $k$-cut” (abstract below).

Abstract: In the $k$-cut problem, we are given an edge-weighted graph and want to find the least-weight set of edges whose deletion breaks the graph into $k$ connected components. Algorithms due to Karger-Stein and Thorup showed how to find such a minimum $k$-cut in time approximately $O(n^{2k-2})$. The best lower bounds come from conjectures about the solvability of the $k$-clique problem and a reduction from $k$-clique to $k$-cut, and show that solving $k$-cut is likely to require time $\Omega(n^k)$. Our recent results have given special-purpose algorithms that solve the problem in time $n^{1.98k + O(1)}$, and ones that have better performance for special classes of graphs (e.g., for small integer weights).
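
For readers who have not seen the $k$-cut problem before, here is a minimal Python sketch of the basic random-contraction idea. It is only an illustration: it uses plain repeated contraction rather than the recursive Karger-Stein scheme the talk analyzes, and the function names, the edge-list format `(u, v, weight)`, the trial count and the seed are all assumptions made for this example. Edge weights are assumed to be positive.

```python
import random


def contract_to_k(n, edges, k, rng):
    """One run of random contraction; returns (cut_weight, labels)."""
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    components = n
    while components > k:
        # Edges whose endpoints still lie in different super-vertices.
        live = [(u, v, w) for u, v, w in edges if find(u) != find(v)]
        if not live:
            break  # the graph already falls apart into more than k pieces
        # Pick an edge with probability proportional to its weight and contract it.
        u, v, _ = rng.choices(live, weights=[w for _, _, w in live])[0]
        parent[find(u)] = find(v)
        components -= 1

    labels = [find(x) for x in range(n)]
    cut = sum(w for u, v, w in edges if labels[u] != labels[v])
    return cut, labels


def min_k_cut_estimate(n, edges, k, trials=200, seed=0):
    """Best cut found over several independent contraction runs."""
    rng = random.Random(seed)
    runs = (contract_to_k(n, edges, k, rng) for _ in range(trials))
    return min(runs, key=lambda run: run[0])


# Two triangles joined by a single bridge edge (2, 3).
example_edges = [(0, 1, 1), (1, 2, 1), (0, 2, 1),
                 (3, 4, 1), (4, 5, 1), (3, 5, 1),
                 (2, 3, 1)]
weight, labels = min_k_cut_estimate(6, example_edges, k=2)
print(weight, labels)
```

Each run returns a valid $k$-cut, but a single run finds a minimum one only with some probability, which is why the sketch keeps the best result over many trials.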

October 31, 2019

TCS+ talk: Wednesday, November 6 — Ryan O’Donnell, CMU

The next TCS+ talk will take place this coming Wednesday, November 6th at 1:00 PM Eastern Time (10:00 AM Pacific Time, 19:00 Central European Time, 18:00 UTC). Ryan O’Donnell from CMU will speak about “Explicit near-Ramanujan graphs of every degree” (abstract below).

Abstract: For every constant $d$ and $\varepsilon$, we give a deterministic $\text{poly}(n)$-time algorithm that outputs a $d$-regular graph on $\approx n$ vertices that is $\varepsilon$-near-Ramanujan; i.e., its eigenvalues are bounded in magnitude by $2\sqrt{d-1} + \varepsilon$ (excluding the single trivial eigenvalue of $d$).

Joint work with Sidhanth Mohanty (Berkeley) and Pedro Paredes (CMU).

October 15, 2019

TCS+ talk: Tuesday, October 22 — Hao Huang, Emory University

The next TCS+ talk will take place this coming Tuesday, October 22nd at 1:00 PM Eastern Time (10:00 AM Pacific Time, 19:00 Central European Time, 17:00 UTC) (note the unusual day). Hao Huang from Emory University will speak about “A proof of the Sensitivity Conjecture” (abstract below).

Abstract: In the $n$-dimensional hypercube graph, one can easily choose half of the vertices such that they induce an empty graph. However, having even just one more vertex would cause the induced subgraph to contain a vertex of degree at least $\sqrt{n}$. This result is best possible, and improves a logarithmic lower bound shown by Chung, Furedi, Graham and Seymour in 1988. In this talk we will discuss a very short algebraic proof of it.

As a direct corollary of this purely combinatorial result, the sensitivity and degree of every boolean function are polynomially related. This solves an outstanding foundational problem in theoretical computer science, the Sensitivity Conjecture of Nisan and Szegedy.
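
The combinatorial statement in the abstract can be checked by brute force for very small $n$. The sketch below is only a sanity check, not the algebraic proof from the talk. The helper names are made up for this example, vertices are encoded as integers whose bits are the coordinates, and the search is exponential, so it is only practical for $n \le 4$ or so.

```python
from itertools import combinations


def max_induced_degree(vertices, n):
    """Largest degree in the subgraph of the n-cube induced by `vertices`."""
    chosen = set(vertices)
    # Neighbours of v in the hypercube are obtained by flipping one bit.
    return max(sum((v ^ (1 << i)) in chosen for i in range(n)) for v in chosen)


def min_max_degree(n, size):
    """Smallest possible maximum degree over all vertex subsets of this size."""
    return min(max_induced_degree(subset, n)
               for subset in combinations(range(2 ** n), size))


for n in range(1, 5):
    half = 2 ** (n - 1)
    d = min_max_degree(n, half + 1)
    # The theorem says d >= sqrt(n), i.e. d * d >= n.
    print(n, min_max_degree(n, half), d, d * d >= n)
```

For each $n$ the loop prints the best achievable maximum degree with exactly half of the vertices, then with one extra vertex, followed by whether the $\sqrt{n}$ bound holds.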

September 27, 2019

TCS+ talk: Wednesday, October 2 — Shachar Lovett, UCSD

The next TCS+ talk will take place this coming Wednesday, October 2nd (next week!) at 1:00 PM Eastern Time (10:00 AM Pacific Time, 19:00 Central European Time, 17:00 UTC). Shachar Lovett from UCSD will lead us “Towards the sunflower conjecture” (abstract below).

Abstract: A sunflower with $r$ petals is a collection of $r$ sets so that the intersection of each pair is equal to the intersection of all. Erdos and Rado in 1960 proved the sunflower lemma: for any fixed $r$, any family of sets of size $w$, with at least about $w^w$ sets, must contain a sunflower. The famous sunflower conjecture is that the bound on the number of sets can be improved to $c^w$ for some constant $c$. Despite much research, the best bounds until recently were all of the order of $w^{cw}$ for some constant $c$. In this work, we improve the bounds to about $(\log w)^w$.
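
To make the definition concrete, here is a small Python sketch that tests whether a collection of sets is a sunflower and searches a family for one by brute force. The function names are made up for this illustration, and the exhaustive search has nothing to do with the techniques of the talk; it only spells out the definition.

```python
from itertools import combinations


def is_sunflower(sets):
    """True if every pairwise intersection equals the common intersection."""
    sets = [frozenset(s) for s in sets]
    if len(sets) < 2:
        return True  # nothing to compare
    kernel = frozenset.intersection(*sets)
    return all(a & b == kernel for a, b in combinations(sets, 2))


def find_sunflower(family, r):
    """Return some r sets of `family` forming a sunflower, or None."""
    for candidate in combinations(family, r):
        if is_sunflower(candidate):
            return candidate
    return None


print(is_sunflower([{1, 2}, {1, 3}, {1, 4}]))   # shared kernel {1}
print(is_sunflower([{1, 2}, {2, 3}, {1, 3}]))   # pairwise intersections differ
print(find_sunflower([{1, 2}, {2, 3}, {1, 3}, {1, 4}], 3))
```

Note that pairwise disjoint sets also count as a sunflower under this definition, with an empty kernel.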

There are two main ideas that underlie our result. The first is a structure vs pseudo-randomness paradigm, a commonly used paradigm in combinatorics. This allows us to either exploit structure in the given family of sets, or otherwise to assume that it is pseudo-random in a certain way. The second is a duality between families of sets and DNFs (Disjunctive Normal Forms). DNFs are widely studied in theoretical computer science. One of the central results about them is the switching lemma, which shows that DNFs simplify under random restriction. We show that when restricted to pseudo-random DNFs, much milder random restrictions are sufficient to simplify their structure.

Joint work with Ryan Alweiss, Kewen Wu and Jiapeng Zhang.

September 18, 2019

TCS+ talk: Wednesday, September 25 — Mark Sellke, Stanford

The next TCS+ talk will take place this coming Wednesday, September 25th at 1:00 PM Eastern Time (10:00 AM Pacific Time, 19:00 Central European Time, 17:00 UTC). Mark Sellke from Stanford will speak about “Chasing Convex Bodies” (abstract below).

Abstract: I will explain our recent understanding of the chasing convex bodies problem posed by Friedman and Linial in 1991. In this problem, an online player receives a request sequence $K_1,\dots,K_T$ of convex sets in $d$-dimensional space and moves his position online into each requested set. The player’s movement cost is the length of the resulting path. Chasing convex bodies asks if there is an online algorithm with cost competitive against the offline optimal path. This is both a challenging metrical task system and (equivalent to) a competitive analysis view on online convex optimization.

This problem was open for $d>2$ until last year but has recently been solved twice. The first solution gives a $2^{O(d)}$-competitive algorithm while the second gives a nearly optimal $\min(d,\sqrt{d\log(T)})$-competitive algorithm for $T$ requests. The latter result is based on the Steiner point, which is the exact optimal solution to a related geometric problem called Lipschitz selection and dates from 1840. In the talk, I will briefly outline the first solution and fully explain the second.

Partially based on joint works with Sébastien Bubeck, Bo’az Klartag, Yin Tat Lee, and Yuanzhi Li.
