Research

Here you find an overview of my research interests and projects that I have been working on. There is a bunch of other stuff in the pipeline that I haven’t had time to write up yet, so check back later for updates! I am currently working on three main topics:

Machine Learning for Bayesian Inference

GP surrogate and Active Learning procedure

Bayesian inference is central to much of modern astrophysics, yet its application to complex models with expensive likelihoods can be computationally prohibitive. To address this, I have been working on a series of methods that use Gaussian Processes (GPs) as surrogate models to accelerate Bayesian analysis, particularly in the context of cosmological and gravitational wave data (though the approach is general and can be applied to any expensive likelihood). I have developed a software package called GPry, which implements these methods and is designed to be a user-friendly and efficient drop-in replacement for existing samplers. The project has been developed with in collabroation with Jesús Torrado, Nils Schöneberg, Christian Fidler, Riccardo Buscicchio, and Germano Nardini and has resulted in three papers:

[1] Surrogate Posterior for Bayesian Inference

In our foundational work, we introduce a method for emulating the posterior distribution using a Gaussian Processes Regressor as surrogate model. This innovation allows for direct posterior reconstruction with far fewer function evaluations than traditional MC based methods. We used active learning to sample new training points based on the regions where the GP emulation is most uncertain or informative. This method demonstrated how GPs could serve as an efficient surrogate model for Bayesian posteriors, enabling fast and accurate inference, especially in problems with expensive likelihoods. We test the methods on a variety of synthetic and real-world problems.

[2] Parallel Acquisition for Active Learning

Building on previous work, we tackle a key limitation in the active learning step: the sequential nature of acquisition function optimization. We propose a parallel approach using Monte Carlo sampling and develop the NORA algorithm, which enables the batched proposal of new sample points using a nested sampler. Our method allows for efficient batch acquisition using a ranked pool strategy that maintains near-optimal information gain while being orders of magnitude more scalable.

[3] Application to LISA Inference

Finally, we apply this framework to a concrete and highly demanding application: parameter inference for sources in the upcoming LISA gravitational wave observatory. Using GPry, we demonstrate inference speed-ups of up to two orders of magnitude compared to state-of-the-art nested sampling algorithms like nessai. This is achieved without compromising on inference accuracy, particularly in regimes where likelihood evaluations are slow (e.g., due to expensive waveform models) as is the case for supermassive black hole binaries.

GIF showing the GPry Algorithm

What’s next?

The GPry framework is still in its early stages, and there are many avenues for future work. Some potential directions include:

Speeding up the GP model: We are currently using a simple GP implementation which is based on sklearn, but which does not fully take advantage of hardware acceleration like thread parallelization and GPUs. We are currently working on implementing the core functionality of the GPR in jax, which natively supports GPU and TPU acceleration.
Interfacing: Although GPry is designed to be a drop-in replacement for existing samplers, we are currently working on interfacing it with codes that are widely used in the community.

Reconstructing Curvature Perturbations via scalar-induced GWs

Detector Sensitivity to USR phase

A central goal in modern cosmology is to probe the physics of the very early universe, especially inflation, through indirect observables. One promising avenue is the study of scalar-induced gravitational waves (SIGWs), which can carry imprints of enhanced primordial curvature perturbations. In this context, our recent work explores how upcoming gravitational wave experiments, such as LISA, can be used to reconstruct the small-scale curvature power spectrum and constrain early-universe models.

In this paper, which was produced for the LISA Cosmology Working Group, we explore how the Laser Interferometer Space Antenna (LISA) can be used to reconstruct small-scale primordial curvature perturbations by observing the scalar-induced gravitational wave (SIGW) background. These gravitational waves originate from enhanced scalar fluctuations generated during non-standard inflationary phases, such as ultra-slow-roll.

We propose and compare three different reconstruction strategies:

Agnostic (binned): A flexible method that reconstructs the curvature power spectrum in frequency bins without assuming a specific model.
Template-based: Uses physically motivated templates based on common inflationary scenarios.
Ab initio (first-principles): Directly infers inflationary parameters using theoretical models like single-field ultra-slow-roll inflation.

Our study also considers extensions such as non-standard thermal histories and primordial non-Gaussianities. The analysis is implemented in the open-source SIGWAY package, which interfaces with gravitational wave data pipelines for signal inference.

Posterior Predictives Inflaton Potential

What’s next?

We are currently working on extending the analysis to include non-standard thermal histories for both constant and changing equation of state which allows for a more realistic modeling of the early universe and its impact on the SIGW background. In addition, we are applying the same analysis to determine whether the PTA signal can be explained by a SIGW background.

Fast PTA inference

Pulsar Timing Arrays (PTAs) are a powerful tool for detecting low-frequency gravitational waves by monitoring the precise timing of millisecond pulsars. However, Bayesian inference for PTA data is notoriously expensive, often requiring millions of likelihood evaluations due to the high dimensionality of the parameter space and the complexity of the noise models involved.

To address this, I have developed discoverysamplers, a Python package that provides lightweight bridge interfaces connecting Discovery (a JAX-enabled PTA likelihood code developed by NANOGrav) to various state-of-the-art sampling backends. The package is designed to be modular and easy to use, allowing researchers to seamlessly switch between different samplers while maintaining a consistent interface.

Currently supported samplers include:

Eryn: Ensemble MCMC with parallel tempering and reversible-jump support
Nessai: Flow-based nested sampling with importance sampling
JAX-NS: Pure JAX nested sampling with native GPU support
GPry: Gaussian process emulation via the Cobaya framework (see above)

The package uses a unified prior specification format and handles all the necessary transformations between Discovery’s callable interface and each sampler’s expected API. This makes it straightforward to benchmark different sampling strategies and identify the most efficient approach for a given PTA dataset.

What’s next?

We are currently working on extending discoverysamplers to support additional backends, including blackjax for end-to-end JAX-based inference with gradient-based samplers like NUTS. The goal is to fully leverage hardware acceleration (GPUs/TPUs) to achieve significant speed-ups in PTA parameter estimation.