Emma Hart

Latent Space Inference via Paired Autoencoders

Current and ongoing work with Bas Peters, Julianne Chung, and Matthias Chung. This work describes a novel data-driven latent space inference framework built on paired autoencoders to handle observational inconsistencies when solving inverse problems. Our approach uses two autoencoders, one for the parameter space and one for the observation space, connected by learned mappings between the autoencoders' latent spaces. These mappings enable a surrogate for regularized inversion and optimization in low-dimensional, informative latent spaces. Our flexible framework can work with partial, noisy, or out-of-distribution data, all while maintaining consistency with the underlying physical models. The paired autoencoders enable reconstruction of corrupted data, and then use the reconstructed data for parameter estimation, which produces more accurate reconstructions compared to paired autoencoders alone and end-to-end encoder-decoders of the same architecture, especially in scenarios with data inconsistencies. We demonstrate our approaches on two imaging examples in medical tomography and geophysical seismic-waveform inversion, but the described approaches are broadly applicable to a variety of inverse problems in scientific and engineering applications.

Download:

Preprint

A Paired Autoencoder Framework for Inverse Problems via Bayes Risk Minimization

Current and ongoing work with Julianne and Matthias Chung. In this work, we describe a new data-driven approach for inverse problems that exploits technologies from machine learning (e.g., neural networks and autoencoder networks) and dimensionality reduction (e.g., low-rank and latent representations). We consider a paired autoencoder framework, where two autoencoders are used to efficiently represent the input and target spaces separately, and optimal mappings are learned between latent spaces. Similar to end-to-end approaches, the paired approach creates a surrogate model for forward propagation and for regularized inversion, but our approach can outperform existing approaches in scenarios where training data for unsupervised learning are readily available but labeled training pairs are scarce. We focus on interpretations using Bayes risk and empirical Bayes risk minimization, and we provide various theoretical results and connections to existing works on low-rank matrix approximations. Moreover, we show that cheaply computable evaluation metrics are available through this framework and can be used to predict whether the solution for a new sample should be predicted well.

Download:

Preprint, Poster, Presentation

Elucidating the Design Choice of Probability Paths in Flow Matching for Forecasting

Work with Soon Hoe Lim, Yijin Wang, Annan Yu, Michael Mahoney, Xiaoye Sherry Li, and Ben Erichson. Flow matching has recently emerged as a powerful paradigm for generative modeling and has been extended to probabilistic time series forecasting. However, the impact of the specific choice of probability path model on forecasting performance, particularly for high-dimensional spatio-temporal dynamics, remains under-explored. In this work, we demonstrate that forecasting spatio-temporal data with flow matching is highly sensitive to the selection of the probability path model. Motivated by this insight, we propose a novel probability path model designed to improve forecasting performance. Our empirical results across various dynamical system benchmarks show that our model acheives faster convergence during training and improved predictive performance compared to existing probability path models. Importantly, our approach is efficient during inference, requiring only a few sampling steps. This makes your proposed model practical for real-world application and opens new avenues for probabailistic forecasting.

Download:

TMLR Publication, Poster

Paired Autoencoders for Likelihood-Free Estimation in Inverse Problems

Current and ongoing work with Julianne Chung, Matthias Chung, Bas Peters, and Eldad Haber. We consider the solution of nonlinear inverse problems where the forward problem is a discretization of a partial differential equation. Such problems are notoriously difficult to solve in practice and require minimizing a combination of a data-fit term and a regularization term. The main computational bottleneck of typical algorithms is the direct estimation of the data misfit. Therefore, likelihood-free approaches have become appealing alternatives. Nonetheless, difficulties in generalization and limitations in accuracy have hindered their broader utility and applicability. In this work, we use a paired autoencoder framework as a likelihood-free estimator (LFE) for inverse problems. We show that the use of such an architecture allows us to construct a solution efficiently and to overcome some known open problems when using LFEs. In particular, our framework can assess the quality of the solution and improve on it if needed. We demonstrate the viability of our approach using examples from full waveform inversion and inverse electromagnetic imaging.

Download:

Machine Learning: Science and Technology Publication

Comparison of Atlas-Based and Neural-Network-Based Semantic Segmentation for DENSE MRI Images

Work completed at NSF REU with Lars Ruthotto, Elle Buser, and Ben Huenemann. We compared two segmentation methods, one atlas-based and one neural-network-based,to see how well they can each automatically segment the brain stem and cerebellum in Displacement Encoding with Stimulated Echoes Magnetic Resonance Imaging (DENSE-MRI) data. The segmentation is a pre-requisite for estimating the average displacements in these regions, which have recently been proposed as biomarkers in the diagnosis of Chiari Malformation type I (CMI). In numerical experiments, the segmentations of both methods were similar to manual segmentations provided by trained experts. Overall, the neural-network-based method alone produced more accurate segmentations than the atlas-based method did alone, but that a combination of the two methods, in which the atlas-based method is used for the segmentation of the brain stem and the neural-network is used for the segmentation of the cerebellum, may be the most successful.

Download:

SIURO Publication, Presentation

A Three Step Reaction Model of Smoldering and Flaming Combustion

High honors senior thesis project completed as an undergraduate with Dan Schult. Smoldering combustion is characterized by the slow, low temperature, flameless burning of solid fuel and is the most persistent type of combustion. Flaming combustion, in contrast, involves a higher temperature burning of gaseous fuel and is rather limited in how long it can be sustained. Smoldering and flaming combustion are very interrelated, often occurring simultaneously in nature and seeming to feed into each other. Despite this inter-relatedness, the literatures of the two are somewhat sparsely connected. Better understanding the mechanics of transition between smoldering and flaming combustion, and how the two work to sustain each other, is an especially relevant topic of study in fire safety, engineering, ecology, and earth science contexts alike, among others. This project expands on previously developed models, complicating the combustion reaction scheme in order to be able to support both smoldering and flaming solutions in a single model.

Download:

Written Summary, Poster, Presentation