We unify many concepts in collider physics, including infrared and collinear safety, observables, jet finding, pileup mitigation and more, using a geometric language based on the Energy Mover’s Distance. Along the way, we develop new techniques grounded in this geometry, including extensions of observables, new jet-finding algorithms, novel pileup mitigation based on Apollonius diagrams, and a concrete notion of “theory space.”
Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler
April 2020
Journal of High Energy Physics 07 (2020) 006
We develop OmniFold, an ML-based unfolding technique that can incorporate full-phase-space information, works without binning, and can avoid choosing specific observables.
Anders Andreassen, Patrick T. Komiske, Eric M. Metodiev, Benjamin Nachman, Jesse Thaler
We show that a broad class of mathematical objects, multiparticle correlators, can be manipulated by “cutting” the vertices and edges of their graphical representation, leading to many identities, computational speedups, and surprising connections to string theory.
Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler
We explore the CMS 2011A Jet Primary Dataset using standard jet substructure observables as well as the Energy Mover’s Distance. Our reprocessed datasets and analysis code are made public to facilitate future Open Data studies.
Patrick T. Komiske, Radha Mastandrea, Eric M. Metodiev, Preksha Naik, Jesse Thaler
We develop a metric, the Energy Mover’s Distance (EMD), on the space of events that, intuitively, is the amount of “work” required to rearrange one event into another. Many techniques that require a pairwise distance between objects can now be applied to collider events, including quantifying event distortion, classification based on density estimation, and studying the space of events itself.
Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler
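As a rough illustration of the "work" picture, the sketch below computes the EMD in one restricted special case: two events with the same number of particles, all carrying the same transverse momentum. Under that assumption the optimal rearrangement is a one-to-one matching, so a brute-force search over permutations is enough for small events. The `(pt, y, phi)` particle format, the function names, and the parameter `R` are illustrative choices, not the authors' implementation; the general EMD requires solving an optimal transport problem and includes an extra term for the difference in total energy.

```python
from itertools import permutations
from math import hypot, pi


def delta_r(a, b):
    """Rapidity-azimuth distance between two particles given as (pt, y, phi)."""
    dphi = abs(a[2] - b[2]) % (2 * pi)
    if dphi > pi:
        dphi = 2 * pi - dphi
    return hypot(a[1] - b[1], dphi)


def emd_equal_weights(event_a, event_b, R=1.0):
    """EMD for the special case of equal particle counts and identical pt.

    Assumption: every particle in both events has the same pt, so the total
    energies match and the optimal transport plan is a permutation.
    """
    event_a, event_b = list(event_a), list(event_b)
    n = len(event_a)
    if n != len(event_b):
        raise ValueError("this sketch needs equal particle counts")
    pt = event_a[0][0]
    if any(abs(p[0] - pt) > 1e-12 for p in event_a + event_b):
        raise ValueError("this sketch needs identical pt for all particles")
    # Try every one-to-one matching and keep the cheapest one.
    return min(
        sum(pt * delta_r(event_a[i], event_b[j]) / R for i, j in enumerate(perm))
        for perm in permutations(range(n))
    )


event_a = [(1.0, 0.0, 0.0), (1.0, 1.0, 0.0)]
event_b = [(1.0, 1.0, 0.0), (1.0, 0.0, 0.5)]
distance = emd_equal_weights(event_a, event_b)
```

The brute-force search scales factorially with the particle count, so this only makes sense for a handful of particles.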
We adapt and specialize the Deep Sets neural network architecture for use with collider events, since the particles in an event naturally form a variable length, unordered set of objects. Our resulting Energy Flow Networks (EFNs) and Particle Flow Networks (PFNs) are incredibly powerful and simple architectures for use in collider physics.
Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler
October 2018
Journal of High Energy Physics 01 (2019) 121
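The set structure can be illustrated with a toy, untrained version of the EFN form: a per-particle map is applied to each particle's direction, the results are summed with momentum-fraction weights, and a final function acts on the sum. In this sketch `per_particle_map` and `readout` are hand-written stand-ins for the learned networks, and the `(pt, y, phi)` particle format is an assumption. Because the combination is a weighted sum, the output does not depend on particle order or on the number of particles.

```python
from math import tanh


def per_particle_map(y, phi):
    """Toy stand-in for the learned per-particle network (direction only)."""
    return (1.0, y, phi, y * y + phi * phi)


def readout(latent):
    """Toy stand-in for the learned network acting on the summed latent vector."""
    weights = (0.3, -0.7, 0.2, 1.1)  # arbitrary illustrative values
    return tanh(sum(w * v for w, v in zip(weights, latent)))


def efn_output(event):
    """Evaluate readout(sum_i z_i * per_particle_map(direction_i)) for one event.

    `event` is a list of (pt, y, phi) tuples of any length and in any order.
    """
    total_pt = sum(pt for pt, _, _ in event)
    latent = [0.0] * 4
    for pt, y, phi in event:
        z = pt / total_pt  # momentum fraction used as the weight
        for k, value in enumerate(per_particle_map(y, phi)):
            latent[k] += z * value
    return readout(latent)


event = [(10.0, 0.1, 0.2), (5.0, -0.3, 0.4), (1.0, 0.5, -0.1)]
shuffled = [event[2], event[0], event[1]]
# Split the first particle into two exactly collinear halves.
split = [(5.0, 0.1, 0.2), (5.0, 0.1, 0.2), event[1], event[2]]
```

Here `event`, `shuffled`, and `split` all give the same output up to floating-point rounding, which reflects the permutation invariance and the collinear safety of the energy-weighted sum.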
We develop a precise, practical, hadron-level definition of quark and gluon jets based on topic modeling of two mixed samples of jets. This allows for data-driven extractions of separate quark- and gluon-jet cross sections, among other things.
Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler
September 2018
Journal of High Energy Physics 11 (2018) 059
We study two methods of weakly supervised training in the context of jet classification, extending them to deep neural network architectures. We find that the Classification Without Labels (CWoLa) paradigm outperforms Learning from Label Proportions (LLP).
Patrick T. Komiske, Eric M. Metodiev, Benjamin Nachman, Matthew D. Schwartz
We develop the Energy Flow Polynomials (EFPs), a set of IRC-safe observables that form an (over)complete basis for any IRC-safe observable. This supports the sufficiency of linear methods for tasks such as classifying different jets, and indeed we find that a linear classifier using EFPs performs surprisingly well on a variety of jet discrimination tasks.
Patrick T. Komiske, Eric M. Metodiev, Jesse Thaler
December 2017
Journal of High Energy Physics 04 (2018) 013
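For concreteness, an EFP is indexed by a multigraph: each vertex contributes a sum over particles weighted by the momentum fraction, and each edge contributes a pairwise angular distance raised to a power. The sketch below evaluates that sum by brute force, assuming particles are `(pt, y, phi)` tuples, momentum fractions are normalized to the total pt, and the angular measure is the rapidity-azimuth distance with exponent `beta`. The function names are illustrative and this is not the authors' optimized implementation.

```python
from itertools import product
from math import hypot, pi


def delta_r(a, b):
    """Rapidity-azimuth distance between two particles given as (pt, y, phi)."""
    dphi = abs(a[2] - b[2]) % (2 * pi)
    if dphi > pi:
        dphi = 2 * pi - dphi
    return hypot(a[1] - b[1], dphi)


def efp(event, edges, beta=1.0):
    """Brute-force EFP for the multigraph given by `edges` (pairs of vertex indices).

    Computes the sum over all particle assignments to vertices of
    prod_v z_{i_v} * prod_{(k, l) in edges} theta_{i_k i_l} ** beta.
    """
    n_vertices = 1 + max(max(edge) for edge in edges)
    total_pt = sum(p[0] for p in event)
    z = [p[0] / total_pt for p in event]
    theta = [[delta_r(a, b) for b in event] for a in event]
    value = 0.0
    for assignment in product(range(len(event)), repeat=n_vertices):
        term = 1.0
        for particle in assignment:
            term *= z[particle]
        for k, l in edges:
            term *= theta[assignment[k]][assignment[l]] ** beta
        value += term
    return value


event = [(1.0, 0.0, 0.0), (1.0, 1.0, 0.0)]
line = efp(event, [(0, 1)])                      # single edge
triangle = efp(event, [(0, 1), (1, 2), (0, 2)])  # needs three distinct directions
```

The cost of this direct sum grows as the number of particles to the power of the number of vertices, so it is only practical for small graphs and small events.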
We develop the PUMML framework for mitigating the contamination from extra protons colliding at the LHC using machine learning. We demonstrate that a convolutional neural network can clean up such contamination at least as well as existing methods, with improvements in robustness across a wide variety of pileup levels.
Patrick T. Komiske, Eric M. Metodiev, Benjamin Nachman, Matthew D. Schwartz
July 2017
Journal of High Energy Physics 12 (2017) 051
We show for the first time that deep learning is quite successful at discriminating between quark and gluon jets. We use a convolutional neural network trained on jet images and observe large improvements in classification efficiency, as well as rough insensitivity to the mismodeling of quark and gluon jets by Monte Carlo simulations.
Patrick T. Komiske, Eric M. Metodiev, Matthew D. Schwartz
December 2016
Journal of High Energy Physics 01 (2017) 110