Publications

My primary publications are listed below with quick descriptions. Click on a title to get more detailed information about a paper including the abstract and selected figures. Publications can be searched or filtered here. Note that authorship is alphabetical in high-energy physics.

The background image is a visualization of an Energy Flow Network used to classify quark and gluon jets. The sizes and locations of the rings highlight the singularity structure of QCD.

The Hidden Geometry of Particle Collisions

Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler

April 2020 Journal of High Energy Physics 07 (2020) 006

We unify many concepts in collider physics, including infrared and collinear safety, observables, jet finding, pileup mitigation and more, using a geometric language based on the Energy Mover’s Distance. Along the way, we develop new techniques grounded in this geometry, including extensions of observables, new jet-finding algorithms, novel pileup mitigation based on Apollonius diagrams, and a concrete notion of “theory space.”

PDF arXiv iNSPIRE DOI

OmniFold: A Method to Simultaneously Unfold All Observables

Anders Andreassen, Patrick T. Komiske, Eric M. Metodiev, Benjamin Nachman, Jesse Thaler

November 2019 Phys. Rev. Lett. 124 (2020) 182001

We develop OmniFold, an ML-based unfolding technique that can incorporate full-phase-space information, works without binning, and can avoid choosing specific observables.

PDF arXiv iNSPIRE Datasets GitHub DOI

Cutting Multiparticle Correlators Down to Size

Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler

November 2019 Phys. Rev. D101 (2020) 036019

We show that a broad class of mathematical objects, multiparticle correlators, can be manipulated by “cutting” the vertices and edges of their graphical representation, leading to many identities, computational speedups, and surprising connections to string theory.

PDF arXiv iNSPIRE GitHub Docs DOI

Exploring the Space of Jets with CMS Open Data

Patrick T. Komiske, Radha Mastandrea, Eric M. Metodiev, Preksha Naik, Jesse Thaler

August 2019 Phys. Rev. D101 (2020) 034009

We explore the CMS 2011A Jet Primary Dataset using standard jet substructure observables as well as the Energy Mover’s Distance. Our reprocessed datasets and analysis code are made public to facilitate future Open Data studies.

PDF arXiv iNSPIRE Datasets GitHub Docs DOI

The Machine Learning Landscape of Top Taggers

Gregor Kasieczka (ed), Tilman Plehn (ed), et al.

February 2019 SciPost Physics 7 (2019) 014

A community report on a variety of ML top taggers to which we contributed a PFN, EFN, and EFP model.

PDF arXiv iNSPIRE DOI

The Metric Space of Collider Events

Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler

February 2019 Phys. Rev. Lett. 123 (2019) 041801

We develop a metric, the Energy Mover’s Distance (EMD), on the space of events that, intuitively, is the amount of “work” required to rearrange one event into another. Many techniques that require a pairwise distance between objects can now be applied to collider events, including quantifying event distortion, classification based on density estimation, and studying the space of events itself.

PDF arXiv iNSPIRE GitHub Docs DOI

Energy Flow Networks: Deep Sets for Particle Jets

Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler

October 2018 Journal of High Energy Physics 01 (2019) 121

We adapt and specialize the Deep Sets neural network architecture for use with collider events, since the particles in an event naturally form a variable-length, unordered set of objects. Our resulting Energy Flow Networks (EFNs) and Particle Flow Networks (PFNs) are incredibly powerful and simple architectures for use in collider physics.

PDF arXiv iNSPIRE Datasets GitHub Docs DOI

An operational definition of quark and gluon jets

Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler

September 2018 Journal of High Energy Physics 11 (2018) 059

We develop a precise, practical, hadron-level definition of quark and gluon jets based on topic modeling of two mixed samples of jets. This allows for data-driven extractions of separate quark- and gluon-jet cross sections, among other things.

PDF arXiv iNSPIRE DOI

Learning to classify from impure samples with high-dimensional data

Patrick T. Komiske, Eric M. Metodiev, Benjamin Nachman, Matthew D. Schwartz

January 2018 Phys. Rev. D98 (2018) 011502

We study two methods of weakly supervised training in the context of jet classification, extending them to deep neural network architectures. We find that the Classification Without Labels (CWoLa) paradigm outperforms Learning from Label Proportions (LLP).

PDF arXiv iNSPIRE DOI

Energy Flow Polynomials: A complete linear basis for jet substructure

Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler

December 2017 Journal of High Energy Physics 04 (2018) 013

We develop the Energy Flow Polynomials (EFPs), a set of IRC-safe observables that form an (over)complete basis for any IRC-safe observable. This supports the sufficiency of linear methods for tasks such as classifying different jets, and indeed we find that a linear classifier using EFPs performs surprisingly well on a variety of jet discrimination tasks.

PDF arXiv iNSPIRE GitHub Docs DOI

Pileup Mitigation with Machine Learning (PUMML)

Patrick T. Komiske, Eric M. Metodiev, Benjamin Nachman, Matthew D. Schwartz

July 2017 Journal of High Energy Physics 12 (2017) 051

We develop the PUMML framework for mitigating the contamination from extra protons colliding at the LHC using machine learning. We demonstrate that a convolutional neural network can clean up such contamination at least as well as existing methods, with improvements in robustness across a wide variety of pileup levels.

PDF arXiv iNSPIRE Dataset GitHub DOI

Deep learning in color: Towards automated quark/gluon jet discrimination

Patrick T. Komiske, Eric M. Metodiev, Matthew D. Schwartz

December 2016 Journal of High Energy Physics 01 (2017) 110

We show for the first time that deep learning is quite successful at discriminating between quark and gluon jets. We use a convolutional neural network trained on jet images and observe large improvements in classification efficiency, as well as rough insensitivity to the mismodeling of quark and gluon jets by Monte Carlo simulations.

PDF arXiv iNSPIRE DOI

Talks

Optimizing Particle Physics with Machine Learning

Seminar (Groups 41 & 89)

May 12, 2021 MIT Lincoln Laboratory (virtual)

PDF

OmniFold: Simultaneously Unfolding All Observables with Deep Learning

CMS Machine Learning Forum

Feb 10, 2021 Virtual

PDF Keynote

OmniFold: Improved Unfolding with Deep Learning

LHC Electroweak and Jets Working Group

Feb 8, 2021 Virtual

PDF Keynote Event

Simultaneously Unfolding All Observables with Deep Learning

Jefferson Lab Theory Seminar

Jan 11, 2021 Virtual

PDF Keynote

The Hidden Geometry of Particle Collisions

BSM PANDEMIC Double Feature

Dec 1, 2020 Virtual

PDF Keynote Event

Probing QCD with Energy Flow Observables

CEPC Workshop

Oct 27, 2020 Shanghai, China (virtual)

PDF Keynote Event

Machine Learning - An Essential Toolkit for Particle Physics

Snowmass Computational Frontier Workshop

Aug 10, 2020 Virtual

PDF Keynote Event

The Hidden Geometry of Particle Collisions

Particle Physics Phenomenology Series

Jun 4, 2020 Genoa, Italy (virtual)

PDF Keynote

OmniFold: Simultaneously Unfolding All Observables

ML4Jets 2020

Jan 17, 2020 NYU - New York, NY

PDF Keynote Event

Cutting Multiparticle Correlators Down to Size

BOOST 2019

Jul 24, 2019 Cambridge, MA

PDF Event

The Metric Space of Collider Events

Particle Physics Seminar
University of Chicago

May 29, 2019 Chicago, IL

PDF Keynote

Point Cloud Strategies for Boosted Objects

ML-HEP-LBL Meetup

Apr 17, 2019 Berkeley, CA

PDF Event

The (Metric) Space of Collider Events

Elementary Particle Theory Seminar
Maryland Center for Fundamental Physics

Mar 25, 2019 College Park, MD

PDF Keynote

The Metric Space of Collider Events

Deep Learning in the Natural Sciences

Mar 1, 2019 Hamburg, Germany

PDF Keynote Event

Point Cloud Strategies for Boosted Objects

CERN BSM Forum

Feb 21, 2019 CERN, Switzerland

PDF

Energy Flow and Jet Substructure

Particle Physics Lunch Talk
Harvard University

Nov 28, 2018 Cambridge, MA

PDF

Energy Flow Networks: Deep Sets for Particle Jets

Machine Learning for Jet Physics 2018

Nov 15, 2018 Fermilab - Batavia, IL

PDF Event

Point Cloud Strategies for Boosted Tops

Boosted Objects for New Physics Searches

Nov 13, 2018 Fermilab - Batavia, IL

PDF Event

Analyzing Jet Substructure via Energy Flow

(B)SM/DM/LHC/QCD/ML Journal Club
MIT Center for Theoretical Physics

Oct 12, 2018 Cambridge, MA

PDF

Energy Flow and Jet Substructure

BOOST 2018

Jul 18, 2018 Paris, France

PDF Event

(Machine) Learning Jet Physics

Lunch Talk
MIT Center for Theoretical Physics

May 18, 2018 Cambridge, MA

PDF

Jet Physics & Modern Machine Learning

Particle Physics Lunch Talk
Harvard University

Feb 7, 2018 Cambridge, MA

PDF

Energy Flow Polynomials for Jet Substructure

MIT Jet Workshop

Jan 11, 2018 Cambridge, MA

PDF

Linear Jet Tagging with the Energy Flow Basis

Machine Learning for Jet Physics 2017
Lawrence Berkeley National Laboratory

Dec 12, 2017 Berkeley, CA

PDF Event

Quark/Gluon Discrimination with Jet-Images and Deep Learning

BSM/DM/LHC Journal Club
MIT Center for Theoretical Physics

Sep 8, 2017 Cambridge, MA

PDF

Quark/Gluon Discrimination with Jet-Images and Deep Learning

BOOST 2017

Jul 18, 2017 Buffalo, NY

PDF Event

Projects

EnergyFlow

Python package for computing Energy Flow Polynomials, instantiating Energy/Particle Flow Networks, computing the Energy Mover’s Distance between events, and working with particle kinematics.

Docs GitHub PyPI

EnergyEnergyCorrelators

C++ library with a Python wrapper for computing $N$-point energy-energy correlators and related high-dimensional structures. Utilizes the Boost Histogram library for simple, efficient, and flexible binning of distributions.

GitHub PyPI

Wasserstein

A C++ library with a Python wrapper for computing $p$-Wasserstein distances, known as the Earth Mover’s Distance for $p=1$ and as the Energy Mover’s Distance in particle physics.

GitHub PyPI

MOD

The MIT Open Data project utilizes public collider data for interesting scientific endeavors in a complementary manner to the experimental collaborations. For our analysis using the CMS 2011A Jet Primary Dataset and associated simulated datasets, we re-released a number of datasets in an easy-to-use format and made our entire analysis publicly available.

GitHub Zenodo

EventGeneration

A C++ library for facilitating particle physics event generation with Pythia8 and jet clustering with FastJet3, including the association of the hard-process, parton-level, and hadron-level events. Includes a Python script for reading the resulting text files.

GitHub

Experience

PhD Candidate in Physics

Center for Theoretical Physics
Massachusetts Institute of Technology

Sep 2016 - Present Cambridge, MA

Advisor: Jesse Thaler

Machine learning neural network architecture/algorithm development for high-energy particle physics datasets
Software library development and creation of easy-to-use public datasets, including reprocessing TBs of CMS Open Data
Studied Large Hadron Collider phenomena, jet physics, quantum field theory, quantum chromodynamics

Teaching Assistant

8.09/8.309 - Advanced Classical Mechanics
Massachusetts Institute of Technology

Sep 2017 - Dec 2019 Cambridge, MA

TA for classical mechanics taught by Iain Stewart in 2017, 2018, 2019

Taught weekly sections (2018, 2019)
Held regular office hours
Helped with exam review sessions
Graded homework (2017) and exams

AM in Physics

Harvard University

Sep 2015 - May 2016 Cambridge, MA

AB in Physics and Mathematics

Harvard University

Aug 2012 - May 2016 Cambridge, MA

summa cum laude
Highest Honors in Physics
Secondary field in computer science
John Harvard Scholarship (2014 - 2015)
Derek C. Bok Award for Distinction in Teaching (2014)
Harvard College Scholarship (2013 - 2014)

PRISE Fellow

Harvard University

Jun 2015 - Aug 2015 Cambridge, MA

Computed the normal modes of an exponential block-spring system, allowing for the definition of a family of Fourier-like discrete transformations from position space to mode space; worked with Howard Georgi and Matthew Schwartz
Explored the quantum-to-classical transition through decoherence to a pointer basis; worked with Matthew Reece

Trading Intern

Jane Street Capital

Jan 2015, Jan 2016 New York, NY

Studied financial markets
Wrote bash program to analyze novel type of options trade
Practiced trading in mock simulations

Teaching Fellow

Harvard University Physics Department

Sep 2014 - Dec 2015 Cambridge, MA

TF for Honors Special Relativity (Physics 16, Fall 2014) taught by Howard Georgi and Quantum Mechanics I (Physics 143a, Fall 2015) taught by Matt Reece

Taught weekly sections
Prepared practice problems
Organized LaTeX and Mathematica review sessions

Summer Intern

Superconducting Electronics Group, Quantum Computing Collaboration
Northrop Grumman Electronic Systems

May 2014 - Aug 2014 Baltimore, MD

Wrote MATLAB program to interface with existing experimental code base to improve the fidelity of high-speed, precision microwave pulses used for qubit control via calculation of a transfer function and deconvolution methods

Calculus Course Assistant

Harvard University Mathematics Department

Sep 2013 - Dec 2013 Cambridge, MA

Ran weekly problem sessions
Worked one-on-one with students in class and the math question center
Graded homework

Summer Intern

Asymmetric Operations Department
Research and Exploratory Development Department
Johns Hopkins University Applied Physics Laboratory

May 2012 - Aug 2013 Laurel, MD

Investigated electromagnetic properties of high-impedance Sievenpiper metamaterial structures for low-profile RF antenna applications
Characterized material properties of magnetic nanoparticle polymers
Catalogued dielectric properties of explosive simulant materials for transportation security purposes

Secondary Education

Century High School

Aug 2008 - Jun 2012 Sykesville, MD

National AP Scholar
STEM Academy Award
Math and Science Content Awards
