A02

Space-time in high dimensions

A02 addresses problems of high dimensionality and dimension reduction. It concentrates on a flexible framework for vector autoregressive (panel) models and on modeling spatio-temporal extremes to analyze rare events. In the long term, the project aims to develop a framework for high-dimensional space-time data analysis that is backed by thorough statistical theory and meets practical challenges such as implementation and tuning parameter calibration.

Project Leaders

Prof. Dr. Axel Bücher
Faculty of Mathematics - Chair of Mathematical Statistics
Ruhr University Bochum

Prof. Dr. Andreas Groll
Department of Statistics - Chair of Statistical Methods for Big Data
TU Dortmund University

Prof. Dr. Johannes Lederer
Department of Mathematics - Chair of Mathematics of Data-Driven Methods
University of Hamburg

Summary

When analyzing spatio-temporal data in high dimensions, one usually pursues one of two general statistical goals: either the (dynamic behavior of the) center of each associated distribution is of greatest interest, or the focus is on extreme values, which occur rarely but can have drastic consequences. The two goals require different statistical tools, which is reflected in the project's research agenda: the focus is both on vector autoregressive (VAR) models and panel data setups to analyze typical behavior, and on spatio-temporal extremes to analyze rare events. The former is challenging because the number of parameters is often very large, in particular for VAR models with additional exogenous variables (VARX). The latter is challenging because the focus on extremes usually leads to comparatively small sample sizes.

We aim to devise new estimators that account for high dimensionality, provide feasible implementations, equip the estimators with statistical guarantees, and test them in simulations and on empirical data. A key technique in our research is regularization, which deals with high dimensionality by complementing classical objective functions with additional terms that formalize prior information about the data or setting. A now-standard type of prior information, called sparsity, is that only a small number of predictors should have a relevant effect. But many applications require more complex prior information: in VARX models, for example, exogenous predictors that are close in space can behave similarly ("fusion sparsity") or form functional groups ("group sparsity"), and the connectivities across time can also depend on the exogenous predictors ("lag selection"). For extremes, similar phenomena may occur for marginal extreme value models at different locations, and certain conditional independence relations have recently been connected to sparse graphical models for extremes. Such complex types of sparsity are called "structured sparsity" and will be the main focus of our work. The inclusion of structured sparsity will lead to more efficient estimation and more accurate prediction in the space-time applications of TRR 391 and beyond.
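To make the difference between plain sparsity and group sparsity concrete, the following minimal sketch compares the shrinkage steps that standard lasso and group-lasso penalties induce on a coefficient vector. It is an illustration under stated assumptions, not the project's implementation: the function names `soft_threshold` and `group_soft_threshold`, the example coefficients, the grouping, and the penalty level `lam` are all hypothetical.

```python
import math


def soft_threshold(coefs, lam):
    """Shrinkage step for the lasso penalty lam * sum(|b_j|).

    Each coefficient is moved toward zero by lam and set to zero
    if its absolute value does not exceed lam.
    """
    out = []
    for b in coefs:
        if b > lam:
            out.append(b - lam)
        elif b < -lam:
            out.append(b + lam)
        else:
            out.append(0.0)
    return out


def group_soft_threshold(coefs, groups, lam):
    """Shrinkage step for the group-lasso penalty lam * sum_g ||b_g||_2.

    `groups` is a list of index lists (an illustrative grouping).
    A whole group is either scaled down jointly or set to zero.
    """
    out = list(coefs)
    for idx in groups:
        norm = math.sqrt(sum(coefs[j] ** 2 for j in idx))
        scale = max(1.0 - lam / norm, 0.0) if norm > 0 else 0.0
        for j in idx:
            out[j] = scale * coefs[j]
    return out


# Hypothetical coefficients: the first two belong to one functional group,
# the last two to another.
coefs = [3.0, -0.5, 0.2, -0.3]
sparse = soft_threshold(coefs, lam=1.0)
grouped = group_soft_threshold(coefs, groups=[[0, 1], [2, 3]], lam=1.0)
```

In this toy example the lasso step removes the small coefficient in the first group on its own, whereas the group step keeps the first group together and removes the second group as a whole. This is the sense in which a structured penalty encodes prior information about which predictors belong together.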

Publications

Lederer, J., Oesting, M. (2025). Extremes in High Dimensions: Methods and Scalable Algorithms. arXiv. DOI: 10.48550/arXiv.2303.04258.

Boulin, A., Haufs, E. (2025). Extrapolating into the Extremes with Minimum Distance Estimation. arXiv. DOI: 10.48550/arXiv.2511.20466.

Grytzka, J., Bürkner, P., Groll, A. (2025). LASSO penalization in generalized linear mixed models. Proceedings of the 39th International Workshop on Statistical Modelling, Volume 1.

Mohaddes, A., Lederer, J. (2025). Cardinality Sparsity: Applications in Matrix-Matrix Multiplications and Machine Learning. Transactions on Machine Learning Research. OpenReview: https://openreview.net/forum?id=zoSRSpGu9C.

Boulin, A., Bücher, A. (2025). Structured linear factor models for tail dependence. arXiv. DOI: 10.48550/arXiv.2507.16340.

Lederer, J., Sabourin, A., Taheri, M. (2025). Adaptive tail index estimation: minimal assumptions and non-asymptotic guarantees. arXiv. DOI: 10.48550/arXiv.2505.22371.

Mohaddes, A., Iafrate, F., Lederer, J. (2025). Regularized Learning for Fractional Brownian Motion via Path Signatures. arXiv. DOI: 10.48550/arXiv.2506.16156.

Taheri, M., Lederer, J. (2025). Regularization can make diffusion models more efficient. arXiv. DOI: 10.48550/arXiv.2502.09151.

Bücher, A., Pakzad, C. (2025). The empirical copula process in high dimensions: Stute's representation and applications. To appear in Annals of Statistics. arXiv. DOI: 10.48550/arXiv.2405.05597.

Lederer, J., von Sachs, R. (2025). Simultaneous estimation of stable parameters for multiple autoregressive processes from datasets of nonuniform sizes. Journal of Time Series Analysis. DOI: 10.1111/jtsa.12806.

Boulin, A. (2024). Estimating max-stable random vectors with discrete spectral measure using model-based clustering. arXiv. DOI: 10.48550/arXiv.2402.01609.
