Research

I work on statistical learning theory with a particular focus on online learning: sequential prediction and aggregation of experts, bandit problems (stochastic, adversarial, sleeping, dueling), online convex optimisation, nonparametric regression, and applications to reinforcement learning and demand forecasting. My PhD thesis, supervised by Yannig Goude and Gilles Stoltz, focused on prediction of individual sequences.

Google Scholar · ORCID

Publications

Optimisation

A Continuized View on Nesterov Acceleration for Stochastic Gradient Descent and Randomized Gossip. Mathieu Even, Raphaël Berthier, Francis Bach, Nicolas Flammarion, Pierre Gaillard, Hadrien Hendrikx, Laurent Massoulié, Adrien Taylor. NeurIPS (Outstanding paper award, <1‰ of submitted papers), 2021.
Accelerated Gossip in Networks of Given Dimension using Jacobi Polynomial Iterations. Raphaël Berthier, Francis Bach, Pierre Gaillard. SIAM Journal on Mathematics of Data Science, Volume 2, Issue 1, 2020.

Reinforcement learning

Online Markov Decision Processes with Terminal Law Constraints. Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane. ALT, 2026.
Online Episodic Convex Reinforcement Learning. Bianca Marin Moreno, Khaled Eldowa, Margaux Brégère, Pierre Gaillard, and Nadia Oudjane. ICML, 2025.
MetaCURL: Non-stationary Concave Utility Reinforcement Learning. Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, and Nadia Oudjane. NeurIPS, 2024.
Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent. Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane. AISTATS, 2024.

Counterfactual and off-policy learning

Counterfactual Learning of Stochastic Policies with Continuous Actions. Houssam Zenati, Alberto Bietti, Matthieu Martin, Eustache Diemert, Pierre Gaillard, Julien Mairal. TMLR, 2025.
Sequential Counterfactual Risk Minimization. Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard. ICML, 2023.

Forecasting and applications

(Online) Convex Optimization for Demand-Side Management: Application to Thermostatically Controlled Loads. Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane. Journal of Optimization Theory and Applications. Volume 205, article number 43, 2025.
Online Convex Optimization for Survival Analysis: An Adaptive and Stochastic Approach. Camila Fernandez, Pierre Gaillard, Joseph de Vilmarest, and Olivier Wintenberger. Statistical Papers. Volume 66, article number 86, 2025.
Aggregation methods and comparative study in time-to-event analysis models. Camila Fernandez, Chung Shue Chen, Pierre Gaillard, and Alonso Silva. International Journal of Data Science and Analytics. 2024.
Additive models and robust aggregation for GEFCom2014 probabilistic electric load and electricity price forecasting. Pierre Gaillard, Yannig Goude, and Raphaël Nedellec. International Journal of Forecasting, Volume 32, Issue 3, pages 1038–1050, 2016.
Forecasting electricity consumption by aggregating experts; how to design a good set of experts. Pierre Gaillard, Yannig Goude. In A. Antoniadis et al. editors, Modeling and Stochastic Learning for Forecasting in High Dimensions, volume 217 of Lecture Notes in Statistics, pages 95–115. Springer, 2015.
Forecasting electricity consumption by aggregating specialized experts. Marie Devaine, Pierre Gaillard, Yannig Goude, and Gilles Stoltz. Machine Learning, Volume 90, Issue 2, pages 231–260, 2013.

PhD thesis

Contributions to online robust aggregation: work on the approximation error and on probabilistic forecasting. Applications to forecasting for energy markets. Pierre Gaillard. Université Paris-Sud 11, 2015.

Software

Opera: Online Prediction by ExpeRt Aggregation. Pierre Gaillard, Yannig Goude. R package, 2016.
Opera is an R package for predicting time series by online robust aggregation of a finite set of forecasts (machine learning methods, statistical models, physical models, human expertise…). More formally, we consider a sequence of observations y(1),…,y(t) to be predicted element by element. At each time instance t, a finite set of experts provides predictions x(k,t) of the next observation y(t). Several methods are implemented to combine these expert forecasts according to their past performance, which can be measured with several loss functions. These combining methods satisfy robust finite-time theoretical performance guarantees. On several examples from energy markets (electricity demand, electricity prices, solar and wind power time series), we demonstrate the value of this approach in terms of both forecasting performance and time series analysis.
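
As a rough illustration of the setting described above, the sketch below combines expert forecasts x(k,t) with an exponentially weighted average under the square loss, one classical combining rule. It is not the opera API: the function name `ewa_aggregate`, its arguments, and the learning rate `eta` are hypothetical and chosen only for this example.

```python
import math


def ewa_aggregate(expert_preds, observations, eta=1.0):
    """Exponentially weighted average forecaster with square loss.

    expert_preds[t][k] plays the role of x(k,t), the forecast of expert k
    for the observation y(t) = observations[t].
    Returns the aggregated forecasts and the weights used at the last round.
    (Illustrative sketch only; names and defaults are assumptions.)
    """
    n_experts = len(expert_preds[0])
    cum_loss = [0.0] * n_experts  # cumulative square loss of each expert
    weights = [1.0 / n_experts] * n_experts
    forecasts = []
    for x_t, y_t in zip(expert_preds, observations):
        # Weights depend only on past losses; shift by the minimum
        # cumulative loss so that the exponentials stay well scaled.
        best = min(cum_loss)
        raw = [math.exp(-eta * (loss - best)) for loss in cum_loss]
        total = sum(raw)
        weights = [r / total for r in raw]
        # Aggregated forecast: convex combination of the expert forecasts.
        forecasts.append(sum(w * x for w, x in zip(weights, x_t)))
        # y(t) is revealed only after forecasting; update the losses.
        cum_loss = [loss + (x - y_t) ** 2 for loss, x in zip(cum_loss, x_t)]
    return forecasts, weights


# Toy data: expert 0 is always exact, expert 1 is always off by one.
observations = [1.0, 2.0, 3.0, 4.0]
expert_preds = [[y, y + 1.0] for y in observations]
forecasts, weights = ewa_aggregate(expert_preds, observations)
```

On this toy data the first forecast uses uniform weights, and the weight of the exact expert then grows from round to round as the other expert accumulates loss.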

PhD Students and postdocs

PhD students currently supervised

Pierre Boudart, co-advised with Alessandro Rudi (Inria), 2023-...
Paul Liautaud, co-advised with Olivier Wintenberger, 2022-...

Former PhD students and postdocs

Julien Zhou, industrial PhD with Criteo, co-advised with Julyan Arbel (Inria) and Thibaud Rahier (Criteo), 2022-2026
Bianca Moreno, industrial PhD with EDF R&D, co-advised with Nadia Oudjane (EDF R&D) and Margaux Brégère (EDF R&D), 2022-2025. Now research scientist at CFM.
Camila Fernandez, industrial PhD with Nokia Bell Labs, co-advised with Olivier Wintenberger, Chung Shue Chen (Nokia Bell Labs), and Alonso Silva (Nokia Bell Labs). 2020-2024.
Houssam Zenati, industrial PhD with Criteo, co-advised with Julien Mairal and Eustache Diemert (Criteo). 2019-2023. Now postdoc at Inria Paris-Saclay.
Rémi Jézéquel, co-advised with Alessandro Rudi, 2019-2023. Now research scientist at CFM.
Rémy Degenne, postdoc, January-August 2020. Now researcher at Inria Lille.
Raphaël Berthier, co-advised with Francis Bach, 2018-2021.
Margaux Brégère, industrial PhD with EDF R&D, co-advised with Gilles Stoltz and Yannig Goude (EDF R&D), 2017-2020. Now researcher at EDF R&D.

School

Here, you can find some reports I wrote during my studies.

Prévision de la consommation électrique (à court terme) par agrégation séquentielle d'experts spécialisés, 2011. Under the supervision of Yannig Goude and Gilles Stoltz
- Slides (fr)
- Technical report
Invert sparse matrices with Gaussian belief propagation algorithm, 2010. Under the supervision of Devavrat Shah
The Lasso, or how to choose among a large set of variables with few observations, 2009. Pierre Gaillard and Anisse Ismaili. Under the supervision of Sylvain Arlot