Recent & Upcoming Talks

2024

Brian DePasquale

May 17, 2024 2:00 PM — 4:00 PM SEC LL2.224

Kim Stachenfeld - Predictive models for representation learning and simulation

In the context of deep learning, predictive models serve multiple purposes. One use is to drive representation learning, as the …

Apr 12, 2024 2:00 PM — 4:00 PM SEC LL2.224

Stefano Ermon - Score Entropy Discrete Diffusion Models

Diffusion models are at the core of many state-of-the-art generative AI systems for content such as images, videos, and audio. These …

Apr 5, 2024 2:00 PM — 4:00 PM SEC LL2.224

Andrea Montanari - Solving overparametrized systems of nonlinear equations

I will discuss the problem of solving a system of equations F(x)=0, for x a d-dimensional unit vectors and D a non-linear map from R^d …

Mar 22, 2024 2:00 PM — 4:00 PM SEC LL2.224

Tom Griffiths - Using the Tools of Cognitive Science to Understand the Behavior of Large Language Models

Large language models have been found to have surprising capabilities, even what have been called “sparks of artificial general …

Mar 15, 2024 2:00 PM — 4:00 PM SEC LL2.224

Larry Abbott - Modeling the Navigational Circuitry of the Fly

Navigation requires orienting oneself relative to landmarks in the environment, evaluating relevant sensory data, remembering goals, …

Feb 23, 2024 2:00 PM — 4:00 PM SEC LL2.224

Rajesh Rao - Active Predictive Coding: A Sensory-Motor Theory of the Neocortex and a Unifying Framework for AI

Recent neurophysiological experiments indicate that almost all cortical areas, even those traditionally labelled as primary sensory …

Feb 16, 2024 2:00 PM — 4:00 PM SEC LL2.224

Noam Brown - CICERO: Human-Level Performance in the Game of Diplomacy by Combining Language Models with Strategic Reasoning

In this talk I will describe CICERO, the first AI agent to achieve human-level performance in Diplomacy, a strategy game involving both …

Jan 26, 2024 2:00 PM — 4:00 PM SEC LL2.224

2023

Carsen Stringer - Unsupervised pretraining in biological neural networks

Representation learning in neural networks may be implemented with supervised or unsupervised algorithms, distinguished by the presence …

Nov 3, 2023 2:00 PM — 4:00 PM SEC LL2.224

Emmanuel Abbe - Logic Reasoning and Generalization on the Unseen

Transformers have become the dominant neural network architecture in deep learning. While they are state of the art in language and …

Oct 13, 2023 2:00 PM — 4:00 PM SEC LL2.224

Denny Zhou - Teach language models to reason

Over the past decades, the machine learning community has developed tons of data-driven techniques aimed at enhancing learning …

Sep 15, 2023 2:15 PM — 4:00 PM SEC LL2.224

Tom Goldstein - Dataset security issues in generative AI

Machine learning systems are built using large troves of training data that may contain private or copyrighted content. In this talk, …

Sep 8, 2023 2:15 PM — 4:00 PM SEC LL2.224

Video

Yann LeCun - Towards Machines that can Learn, Reason, and Plan.

How could machines learn as efficiently as humans and animals? How could machines learn how the world works and acquire common sense? …

May 23, 2023 2:00 PM — 3:45 PM SEC 1.413

Video

Timothy Lillicrap - Model-based reinforcement learning and the future of language models

Large language models are capable of an incredible array of tasks. Language models are pre-trained on large amounts of text data from …

May 19, 2023 2:00 PM — 3:45 PM SEC 1.413

Yejin Choi - Common Sense: the Dark Matter of Language and Intelligence

Scale appears to be the winning recipe in today’s leaderboards. And yet, extreme-scale neural models are (un)surprisingly brittle …

May 12, 2023 2:15 PM — 4:00 PM SEC 1.413

Video

Jamie Morgenstern - Shifts in Distributions and Preferences in Response to Learning

In this talk, I’ll describe some recent work outlining how distribution shifts are fundamental to working with human-centric …

Apr 21, 2023 2:00 PM — 3:45 PM SEC 1.413

Video

Uri Hasson - Deep language models as a cognitive model for natural language processing in the human brain

Naturalistic experimental paradigms in cognitive neuroscience arose from a pressure to test, in real-world contexts, the validity of …

Apr 14, 2023 2:00 PM — 3:45 PM SEC 1.413

Sebastien Bubeck - First Contact

The new wave of AI systems, ChatGPT and its more powerful successors, exhibit extraordinary capabilities across a broad swath of …

Mar 31, 2023 2:00 PM — 3:45 PM SEC 1.413

Ludwig Schmidt - A data-centric view on reliable generalization: From ImageNet to LAION-5B

Researchers have proposed many methods to make neural networks more reliable under distribution shift, yet there is still large room …

Mar 10, 2023 2:00 PM — 3:45 PM SEC 1.413

Video

Chelsea Finn - Neural networks make stuff up. What should we do about it?

When machine learning models are deployed into the world, they inevitably encounter scenarios that differ from their training data, …

Feb 24, 2023 2:00 PM — 3:45 PM SEC 1.413

Video

Roger Grosse - Studying Neural Net Generalization through Influence Functions

How can we trace surprising behaviors of machine learning models back to their training data? Influence functions aim to predict how …

Feb 10, 2023 2:00 PM — 3:45 PM SEC 1.413

Matthieu Wyart - Data structure and curse of dimensionality in deep learning

Deep learning algorithms are responsible for a technological revolution in a variety of tasks, yet understanding why they work remains …

Feb 3, 2023 2:00 PM — 3:45 PM SEC LL2.221

2022

Peter Latham - Failures, Weight Scaling, and Learning in the Brain.

The brain is a very noisy place: when a spike arrives at a pre-synaptic terminal, about half the time neurotransmitter fails to …

Dec 9, 2022 3:30 PM — 5:00 PM SEC 1.413

Been Kim - My hopes and dreams of communicating with machines and where to begin.

The emergence of machines that seem to offer the same or better capabilities than humans raised interests in many sectors who are eager …

Nov 18, 2022 3:30 PM — 5:00 PM SEC 1.413

Morgane Austern - Theory for some graph embedding methods such as node2vec, and efficient concentration inequalities.

Network data is ubiquitous, and many examples can be found in domains ranging from biology to social sciences. Learning from graph data …

Nov 11, 2022 3:30 PM — 5:00 PM SEC 1.413

Ryan Tibshirani - Advances and Challenges in Conformal Prediction

This talk gives an overview of distribution-free predictive inference, using conformal prediction. Conformal prediction essentially …

Nov 8, 2022 4:30 PM — 6:00 PM SEC 2.118

John Schulman - Language Models, Search, and Overoptimization

Language models can be dramatically improved by reward models, which predict the quality of a sample. Two approaches for combining …

Nov 4, 2022 3:30 PM — 5:00 PM SEC 1.413.

Yue Lu - Understanding the Universality Phenomenon in High-Dimensional Estimation and Learning: Some Recent Progress

Universality is a fascinating high-dimensional phenomenon. It points to the existence of universal laws that govern the macroscopic …

Oct 28, 2022 3:30 PM — 5:00 PM SEC 1.413.

Video

Michael Bronstein - Geometric deep learning: from Euclid to drug design

Geometric Deep Learning is an attempt for geometric unification of a broad class of ML problems from the perspectives of symmetry and …

Oct 21, 2022 3:30 PM — 5:00 PM SEC 1.413.

Video

Hidenori Tanaka - Physics of Neural Phenomena: Understanding Learning and Computation through Symmetry

Once described as alchemy, a quantitative science of machine learning is emerging. This talk will seek to unify the scientific …

Sep 23, 2022 3:30 PM — 5:00 PM SEC 1.413.

Fei-Fei Li - From Seeing to Doing: Understanding and Interacting with the Real World.

Visual intelligence is a cornerstone of intelligence. From passive perception to embodied interaction with the world, vision plays a …

May 27, 2022 3:00 PM — 5:00 PM

Sam Gershman -Using video games to reverse engineer human intelligence

Video games have become an attractive testbed for evaluating AI systems, by capturing some aspects of real-world complexity (rich …

May 20, 2022 3:00 PM — 5:00 PM Virtual.

Video

Na Li - Learning decentralized policies in Multiagent Systems: How to learn efficiently and what are the learned policies?

Multiagent reinforcement learning has received a growing interest with various problem settings and applications. We will first present …

May 13, 2022 3:00 PM — 5:00 PM SEC 1.413.

Nati Srebro - Understanding Deep Learning as Biased Non-Parametric Search

How and why are we succeeding in training huge non-convex deepnetworks? How can deep neural networks with billions of parameters …

Apr 11, 2022 2:00 PM — 4:00 PM SEC 1.413 CLASSROOM.

Eero Simoncelli - Photographic Image Priors in the Era of Machine Learning

Inverse problems in image processing and computer vision are often solved using prior probability densities, such as spectral or …

Apr 2, 2022 3:00 PM — 5:00 PM Virtual.

Video

Andrew Saxe - The Neural Race Reduction: Dynamics of feature learning in deep architectures

What is the relationship between task geometry, network architecture, and emergent feature learning dynamics in nonlinear deep …

Mar 25, 2022 3:00 PM — 5:00 PM SEC 1.413 CLASSROOM.

Video

Tomaso Poggio - Thoughts on Learning Theory

I will describe a personal perspective on a few key problems in learning theory at the moment. Several different architectures that …

Mar 11, 2022 3:00 PM — 5:00 PM SEC 1.413 CLASSROOM.

Video

Petros Koumoutsakos - Learning Algorithms for Complex systems: 3 easy pieces

I will present 3 learning algorithms fusing scientific computing and AI for the prediction and control of complex physical systems. The …

Mar 4, 2022 3:00 PM — 5:00 PM SEC 1.413 CLASSROOM.

Video

Aarti Singh - Local Signal Adaptivity: Feature learning in Neural networks beyond kernels

Neural networks have been shown to significantly outperform kernel methods (including neural tangent kernels) in problems such as image …

Feb 18, 2022 3:00 PM — 5:00 PM Virtual.

Video

SueYeon Chung - Structure, Function, and Learning in Distributed Neural Networks

A central goal in neuroscience is to understand how orchestrated computations in the brain arise from the properties of single neurons …

Feb 11, 2022 3:00 PM — 5:00 PM

Video

Samy Bengio - Can neural networks learn to reason?

The successes of deep learning critically rely on the ability of neural networks to output meaningful predictions on unseen data …

Feb 4, 2022 3:00 PM — 5:00 PM

Video

Max Welling - The Impact of Deep Learning on the Natural Sciences

Deep learning has significantly changed the fields of speech recognition, computer vision and natural language processing, to name a …

Jan 28, 2022 3:00 PM — 5:00 PM

Video

2021

Sho Yaida - Effective Theory of Deep Neural Networks

Large neural networks perform extremely well in practice, providing the backbone of modern machine learning. The goal of this talk is …

Dec 3, 2021 3:00 PM — 5:00 PM

Video

Lester Mackey - Kernel Thinning and Stein Thinning

This talk will introduce two new tools for summarizing a probability distribution more effectively than independent sampling or …

Nov 19, 2021 3:00 PM — 5:00 PM

Video

Jared Kaplan - Thoughts on Neural Scaling Laws

I’ll discuss empirical work on neural scaling laws, emphasizing their apparent precision, universality, and ubiquity. Along the …

Oct 22, 2021 3:00 PM — 5:00 PM

Video

Richard Baraniuk - Deep Network Spline Geometry

We study the geometry of deep learning through the lens of approximation theory via spline functions and operators. Our key result is …

Oct 15, 2021 3:00 PM — 5:00 PM Virtual.

Stefanie Jegelka - Learning in Graph Neural Networks

Graph Neural Networks (GNNs) have become a popular tool for learning representations of graph-structured inputs, with applications in …

Apr 2, 2021 1:00 PM — 3:00 PM Virtual.

Video

Surya Ganguli - Weaving together machine learning, theoretical physics, and neuroscience.

An exciting area of intellectual activity in this century may well revolve around a synthesis of machine learning, theoretical physics, …

Mar 19, 2021 1:00 PM — 3:00 PM Virtual.

Video

Percy Liang - Surprises in the Quest for Robust Machine Learning

Standard machine learning produces models that are accurate on average but degrade dramatically on when the test distribution of …

Mar 4, 2021 1:00 PM — 3:00 PM Virtual.

Video

Daniela Witten - Hypothesis testing after hierarchical clustering

As datasets continue to grow in size, in many settings the focus of data collection has shifted away from testing pre-specified …

Feb 5, 2021 1:00 PM — 3:00 PM Virtual.

2020

Ashish Vaswani - Self-attention for Vision

Deep learning seeks to discover universal models that work across all modalities and tasks. While self-attention has enhanced the …

Dec 4, 2020 11:00 AM — 1:00 PM Virtual.

Francis Bach - On the convergence of gradient descent for wide two-layer neural networks

Many supervised learning methods are naturally cast as optimization problems. For prediction models which are linear in their …

Nov 20, 2020 1:00 PM — 3:00 PM Virtual.

Video

Lenka Zdeborova - Insights on gradient-based algorithms in high-dimensional non-convex learning.

Gradient descent algorithms and their noisy variants, such as the Langevin dynamics or multi-pass SGD, are at the center of attention …

Nov 13, 2020 10:00 AM — 12:00 PM Virtual.

Video

Sanmi Koyejo - Algorithmic fairness and metric elicitation via the geometry of classifier statistics

From music recommendations to high-stakes medical treatment selection, complex decision-making tasks are increasingly automated as …

Nov 6, 2020 11:00 AM — 1:00 PM Virtual.

Video

Balaji Lakshminarayanan - Uncertainty and Out-of-Distribution Robustness in Deep Learning

Quantifying uncertainty in deep learning is a challenging and yet unsolved problem. Predictive uncertainty estimates are important to …

Oct 30, 2020 1:00 PM — 3:00 PM Virtual.

Video

Surbhi Goel - Computational Complexity of Learning Neural Networks over Gaussian Marginals

A major challenge in the theory of deep learning is to understand the computational complexity of learning basic families of neural …

Oct 23, 2020 1:00 PM — 3:00 PM Virtual.

Video

Ilya Sutskever - The unreasonable effectiveness of large generative models

GPT3 has shown that large generative models are unexpectedly powerful and capable. In this talk, I will review some of these …

Oct 16, 2020 4:30 PM — 6:00 PM Virtual.

Guy Gur-Ari - Large learning rates and the catapult effect.

Why do large learning rates often produce better results? Why do “infinitely wide” networks trained using kernel methods …

Oct 9, 2020 1:00 PM — 3:00 PM Virtual.

Video

Shai Shalev-Shwartz - Deep Learnability.

“What is learnable?” is a fundamental question in learning theory. The talk will address this question for deep learning, …

Oct 5, 2020 10:00 AM — 12:00 PM Virtual.

Video

Behnam Neyshabur - Towards Learning Convolutions from Scratch

Convolution is one of the most essential components of architectures used in computer vision. As machine learning moves towards …

Sep 17, 2020 4:00 PM — 6:00 PM Virtual.

Video

Hanie Sedghi - What is being transferred in transfer learning?

One desired capability for machines is the ability to transfer their understanding of one domain to another domain where data is …

Aug 27, 2020 4:00 PM — 6:00 PM Virtual.

Video

Maithra Raghu - Insights from Deep Representations for Machine Learning Systems and Human Collaborations

The fundamental breakthroughs in machine learning, and the rapid advancements of the underlying deep neural network models have enabled …

Aug 13, 2020 4:00 PM — 6:00 PM Virtual.

Video

Nadav Cohen - Analyzing Optimization and Generalization in Deep Learning via Dynamics of Gradient Descent

Understanding deep learning calls for addressing the questions of: (i) optimization — the effectiveness of simple gradient-based …

Aug 5, 2020 11:00 AM — 1:00 PM Virtual.

Video

Alex Dimakis - Deep Generative models and Inverse Problems.

Modern deep generative models like GANs, VAEs and invertible flows are showing amazing results on modeling high-dimensional …

Jul 9, 2020 4:00 PM — 6:00 PM Virtual.

Video

Jascha Sohl-Dickstein - Understanding infinite width neural networks

As neural networks become wider their accuracy improves, and their behavior becomes easier to analyze theoretically. I will give an …

Jun 11, 2020 4:00 PM — 6:00 PM Virtual.

Anima Anandkumar - The road to autonomy: sample-efficient learning in control systems with built-in safety and stability guarantees.

Autonomous systems require efficient learning mechanisms that are fully integrated with the control loop. We need robust learning …

May 21, 2020 1:30 PM — 3:30 PM Virtual.

Slides Video

Zico Kolter - Implicit layers, equilibrium models and why one layer is (really) all you need

A common view of deep learning is that deep networks provide a hierarchical means of processing input data, where early layers extract …

Apr 3, 2020 1:30 PM — 3:30 PM Virtual.

Video

Moritz Hardt - Performative Prediction

When predictions support decisions they may influence the outcome they aim to predict. We call such predictions performative; the …

Mar 27, 2020 1:30 PM — 3:30 PM Virtual--[Zoom](https://harvard.zoom.us/j/117799722)

Constantinos Daskalakis - Learning from Censored and Dependent Data

Abstract: Machine Learning is invaluable for extracting insights from large volumes of data. A key assumption enabling many methods, …

Mar 6, 2020 12:00 PM — 2:00 PM 24 Oxford St, Haller Hall. Cambridge, Massachusetts

Jacob Steinhardt - Science and Robustness in Deep Learning

How should we go about creating a science of deep learning? One might be tempted to focus on replicability, reproducibility, and …

Feb 28, 2020 1:30 PM — 3:30 PM 33 Oxford St, MD Room 119. Cambridge, Massachusetts

Video

Adi Shamir - A Simple Explanation for the Mysterious Existence of Adversarial Examples with Small Hamming Distance

The existence of adversarial examples in which tiny changes in the input can fool well trained neural networks has many applications …

Feb 21, 2020 1:30 PM — 3:30 PM 33 Oxford St, MD Room 119. Cambridge, Massachusetts

2019

Daniel Soudry - The Implicit Bias of Gradient Descent on Separable Data

We examine gradient descent on unregularized logistic regression problems, with homogeneous linear predictors on linearly separable …

Dec 6, 2019 1:45 PM — 4:00 PM 60 Oxford St, Room 330. Cambridge, Massachusetts

Matus Telgarsky - A margin perspective on neural networks

This talk will survey the role played by margins in optimization, generalization, and representation of neural networks. A specific …

Dec 3, 2019 1:45 PM — 4:00 PM 60 Oxford St, Room 330. Cambridge, Massachusetts

Jason Lee - Beyond Linearization in Neural Networks

Deep Learning has had phenomenal empirical successes in many domains including computer vision, natural language processing, and speech …

Nov 12, 2019 1:45 PM — 4:00 PM 60 Oxford St, Room 330. Cambridge, Massachusetts

Peter Bartlett - Benign Overfitting

Classical theory that guides the design of nonparametric prediction methods like deep neural networks involves a tradeoff between the …

Oct 31, 2019 1:30 PM — 3:00 PM Haller Hall, Geological Museum Room 102. Cambridge, Massachusetts

Misha Belkin - Some thoughts on principles of machine learning

Much recent theoretical work has concentrated on “solving deep learning”. Yet, deep learning is not a thing in itself and …

Oct 29, 2019 1:30 PM — 4:00 PM 60 Oxford St, Room 330. Cambridge, Massachusetts

PDF