Understanding Machine Learning: Theory and Empirics
We are a research group working on understanding modern machine learning, and especially deep learning. We take a scientific approach, using tools across the spectrum from theory to experiment. For example, we want to characterize the behavior of deep learning systems, understand which design choices are necessary or sufficient for these behaviors, and identify universal structures underlying machine learning methods (deep or otherwise).
We are particularly interested in the areas of generalization, interpolation, optimization, overparameterization, and relations between these objects. In the past, our group has published influential results in Double Descent, Neural Tangent Kernels, overparameterization, and frameworks for generalization.
If Deep Learning is the solution, what is the problem?
Upcoming Talks:
Faculty:
Current PhD students:
Current Postdocs:
Former Group Members:
Some Recorded Talks: