Upcoming events

19 April 2024 12:30 till 13:15

[NA] Alena Kopaničáková : Enhancing Training of Deep Neural Networks Using Multilevel and Domain Decomposition Strategies

The training of deep neural networks (DNNs) is traditionally accomplished using stochastic gradient descent or its variants. While these methods have demonstrated a certain degree of robustness and accuracy, their convergence speed deteriorates for large-scale, highly ill-conditioned, and stiff problems, such as those arising in scientific machine learning applications. Consequently, there is growing interest in adopting more sophisticated training strategies that not only accelerate convergence but may also enable parallelism, convergence control, and automatic selection of certain hyper-parameters.
In this talk, we propose to enhance the training of DNNs by leveraging nonlinear multilevel and domain decomposition strategies. We will discuss how to construct a multilevel hierarchy and how to decompose the parameters of the network by exploring the structure of the DNN architecture, properties of the loss function, and characteristics of the dataset. Furthermore, the dependency on a large number of hyper-parameters will be reduced by employing a trust-region globalization strategy. The effectiveness of the proposed training strategies will be demonstrated through a series of numerical experiments from the field of image classification and physics-informed neural networks.

References:
[1] A. Kopaničáková, H. Kothari, G. Karniadakis and R. Krause. Enhancing training of physics-informed neural networks using domain-decomposition based preconditioning strategies. Under review, 2023.
[2] S. Gratton, A. Kopaničáková, and Ph. Toint. Multilevel Objective-Function-Free Optimization with an Application to Neural Networks Training. SIAM Journal on Optimization (Accepted), 2023.
[3] A. Kopaničáková. On the use of hybrid coarse-level models in multilevel minimization methods. Domain Decomposition Methods in Science and Engineering XXVII (Accepted), 2023.
[4] A. Kopaničáková and R. Krause. Globally Convergent Multilevel Training of Deep Residual Networks. SIAM Journal on Scientific Computing, 2022.

23 April 2024 16:00 till 17:00

[AN] Jan Rozendaal: Local Smoothing and Decoupling

The local smoothing conjecture for the wave equation is one of the biggest open problems in harmonic analysis, with connections to other major problems in harmonic analysis and geometric analysis. Research on this conjecture has also led to advances in areas such as number theory, through the development of the theory of Fourier decoupling.
In this talk, I will explain the local smoothing conjecture and its connection to Fourier decoupling. Then I will discuss a recently developed framework of function spaces for local smoothing and decoupling.

25 April 2024 16:00 till 17:00

[PDE&A] Sharmila Gunasekaran

29 April 2024 15:45 till 16:45

[STAT/AP] Botond Szabo: tba

tba

02 May 2024 16:00 till 17:00

[PDE&A] Annika Bach

Technische Universiteit Eindhoven

03 May 2024 12:30 till 13:15

[NA] Jakob Zech: Nonparametric Distribution Learning via Neural ODEs

In this talk, we explore approximation properties and statistical aspects of Neural Ordinary Differential Equations (Neural ODEs). Neural ODEs are a recently established technique in computational statistics and machine learning that can be used to characterize complex distributions. Specifically, given a fixed set of independent and identically distributed samples from a target distribution, the goal is either to estimate the target density or to generate new samples. We first investigate the regularity properties of the velocity fields used to push forward a reference distribution to the target. This analysis allows us to deduce approximation rates achievable through neural network representations. We then derive a concentration inequality for the maximum likelihood estimator of general ODE-parametrized transport maps. By merging these findings, we are able to determine convergence rates in terms of both the network size and the number of required samples from the target distribution. Our discussion will particularly focus on target distributions within the class of positive \(C^k\) densities on the \(d\)-dimensional unit cube \([0,1]^d\).

07 May 2024 16:00 till 17:00

[AN] Chenxi Deng: TBA

13 May 2024 15:45 till 16:45

[STAT/AP] Sonja Cox: tba

tba

14 May 2024 16:00 till 17:00

[AN] Michal Wrochna: TBA

TBA

16 May 2024 16:00 till 17:00

[PDE&A] Paul Zegeling: A generalized midpoint-based BV-method for unstable PDEs (and beyond)

21 May 2024 16:00 till 17:00

[AN] Joris van Winden: TBA

TBA

27 May 2024 15:45 till 16:45

[STAT/AP] Collin Drent: Condition-Based Production for Stochastically Deteriorating Systems: Optimal Policies and Learning

Production systems used in the manufacturing industry degrade due to production and may eventually break down, resulting in high maintenance costs at scheduled maintenance moments. This degradation behavior, and hence the system's reliability, is affected by the system's production rate. While producing at a higher rate generates more revenue, the system's reliability may also decrease. Production should thus be controlled dynamically to trade off reliability and revenue accumulation in between maintenance moments. We study this dynamic trade-off for (i) systems where the relation between production and degradation is known as well as (ii) systems where this relation is not known and needs to be learned on the fly from condition data. For systems with a known production-degradation relation, we cast the decision problem as a continuous-time Markov decision process and prove that the optimal policy has intuitive monotonic properties. We also present sufficient conditions for the optimality of bang-bang policies and we characterize the structure of the optimal interval between scheduled maintenance moments. For systems with an a priori unknown production-degradation relation, we propose a Bayesian procedure to learn the unknown degradation rate under any production policy from real-time condition data. Numerical studies indicate that on average across a wide range of practical settings (i) condition-based production increases profits by 50% compared to static production, (ii) integrating condition-based production and maintenance interval selection increases profits by 21% compared to a state-of-the-art approach, and (iii) our Bayesian approach performs close to an Oracle policy that knows each system's production-degradation relation, especially in the bang-bang regime.

28 May 2024 16:00 till 17:00

[AN] Hans Maassen: TBA

TBA

30 May 2024 16:00 till 17:00

[PDE&A] Viktoria Freingruber

31 May 2024 12:30 till 13:15

[NA] Hyea Hyun Kim: Partitioned neural network approximation to partial differential equations and its training performance enhancement utilizing domain decomposition algorithms

With the success of deep learning technologies in many scientific and engineering applications, neural network approximation methods have emerged as an active research area in numerical partial differential equations. However, the new approximation methods still need further validation of their accuracy, stability, and efficiency before they can be used as alternatives to classical approximation methods. In this talk, we first introduce partitioned neural network approximation to partial differential equations, where neural network functions localized in each small subdomain are employed as a solution surrogate in order to reduce the approximation and optimization errors of the standard single large neural network approximation. The parameters in each local neural network function are then optimized to minimize the cost function corresponding to the model problem. To enhance the parameter training efficiency further, iterative algorithms for the partitioned neural network function can be developed by utilizing classical domain decomposition algorithms and their convergence theory. We finally present promising features of this new approach as a way of enhancing the neural network solution accuracy, stability, and efficiency, with some supporting numerical results.

03 June 2024 15:45 till 16:45

[STAT/AP] Eni Musta: tba

tba

04 June 2024 16:00 till 17:00

[AN] Anouk Wisse: TBA

TBA

06 June 2024 16:00 till 17:00

[PDE&A] Marco Rehmeier

TBA

11 June 2024 16:00 till 17:00

[AN] Rik Westdorp: TBA

TBA

13 June 2024 16:00 till 17:00