Deep Learning and Mean-Field Games: A Stochastic Optimal Control Perspective

Luca Di Persio; Matteo Garbelli

doi:10.3390/sym13010014

Deep Learning and Mean-Field Games: A Stochastic Optimal Control Perspective

Symmetry ◽

10.3390/sym13010014 ◽

2020 ◽

Vol 13 (1) ◽

pp. 14

Author(s):

Luca Di Persio ◽

Matteo Garbelli

Keyword(s):

Optimal Control ◽

Deep Learning ◽

Stochastic Optimal Control ◽

Mathematical Formulation ◽

Mean Field ◽

Mean Field Games ◽

Theoretical Frameworks ◽

Depth Analysis ◽

The Mean ◽

Hamilton Jacobi Bellman

We provide a rigorous mathematical formulation of Deep Learning (DL) methodologies through an in-depth analysis of the learning procedures characterizing Neural Network (NN) models within the theoretical frameworks of Stochastic Optimal Control (SOC) and Mean-Field Games (MFGs). In particular, we show how the supervised learning approach can be translated in terms of a (stochastic) mean-field optimal control problem by applying the Hamilton–Jacobi–Bellman (HJB) approach and the mean-field Pontryagin maximum principle. Our contribution sheds new light on a possible theoretical connection between mean-field problems and DL, melting heterogeneous approaches and reporting the state-of-the-art within such fields to show how the latter different perspectives can be indeed fruitfully unified.

Download Full-text

A finite difference analogue of the “mean field” equilibrium problem

Вычислительные технологии ◽

10.25743/ict.2020.25.4.004 ◽

2020 ◽

pp. 31-44

Author(s):

Виктория Сергеевна Корниенко ◽

Владимир Викторович Шайдуров ◽

Евгения Дмитриевна Карепова

Keyword(s):

Control Function ◽

Mean Field ◽

Mean Field Games ◽

Differential Problem ◽

Control Functions ◽

Economic Interaction ◽

The Mean ◽

Hamilton Jacobi Bellman ◽

The Stability ◽

The Cost

Представлен конечно-разностный аналог дифференциальной задачи, сформулированной в терминах теории “игр среднего поля” (mean field games). Задачи оптимизации такого типа формулируются как связанные системы параболических дифференциальных уравнений в частных производных типа Фоккера - Планка и Гамильтона - Якоби - Беллмана. Предложенный конечно-разностный аналог обладает основными свойствами оптимизационной дифференциальной задачи непосредственно на дискретном уровне. В итоге он может служить как приближение, сходящееся к исходной дифференциальной задаче при стремлении шагов дискретизации к нулю, так и как самостоятельная оптимизационная задача с конечным числом участников. Для предложенного аналога построен алгоритм монотонной минимизации функционала стоимости, проиллюстрированный на модельной экономической задаче In most forecasting problems, overstating or understating forecast leads to various losses. Traditionally, in the theory of “mean field games”, the functional responsible for the costs of implementing the interaction of the continuum of agents between each other is supposed to be dependent on the squared function of control of the system. Since additional external factors can influence the player’s strategy, the control function of a dynamic system is more complex. Therefore, the purpose of this article is to develop a computational algorithm applicable for more general set of control functions. As a research method, a computational experiment and proof of the stability of the constructed computational scheme are used in this study. As a result, the numerical algorithm was applied on the problem of economic interaction in the presence of alternative resources. We consider the model, in which a continuum of consumer agents consists of households deciding on heating, having a choice between the cost of installing and maintaining the thermal insulation or the additional cost of electricity. In the framework of the problem, the convergence of the method is numerically demonstrated. Conclusions. The article considers a model of the strategic interaction of continuum of agents, the interaction of which is determined by a coupled differential equations, namely, the Fokker - Planck and the Hamilton - Jacobi - Bellman one. To approximate the differential problem, difference schemes with a semi-Lagrangian approximation are used, which give a direct rule for minimizing the cost functional

Download Full-text

Stability Analysis in Mean-Field Games via an Evans Function Approach

Volume 3: Modeling and Validation; Multi-Agent and Networked Systems; Path Planning and Motion Control; Tracking Control Systems; Unmanned Aerial Vehicles (UAVs) and Application; Unmanned Ground and Aerial Vehicles; Vibration in Mechanical Systems; Vibrations and Control of Systems; Vibrations: Modeling, Analysis, and Control ◽

10.1115/dscc2018-8926 ◽

2018 ◽

Author(s):

Piyush Grover

Keyword(s):

Optimal Control ◽

Stability Analysis ◽

Mean Field ◽

Evans Function ◽

Function Approach ◽

Mean Field Games ◽

Mean Field Game ◽

Large Populations ◽

Multi Agent ◽

Hamilton Jacobi Bellman

This work is concerned with stability analysis of stationary and time-varying equilibria in a class of mean-field games that relate to multi-agent control problems of flocking and swarming. The mean-field game framework is a non-cooperative model of distributed optimal control in large populations, and characterizes the optimal control for a representative agent in Nash-equilibrium with the population. A mean-field game model is described by a coupled PDE system of forward-in-time Fokker-Planck (FP) equation for density of agents, and a backward-in-time Hamilton-Jacobi-Bellman (HJB) equation for control. The linear stability analysis of fixed points of these equations typically proceeds via numerical computation of spectrum of the linearized MFG operator. We explore the Evans function approach that provides a geometric alternative to solving the characteristic equation.

Download Full-text

The Mean Field Games

Mean Field Games and Mean Field Type Control Theory - SpringerBriefs in Mathematics ◽

10.1007/978-1-4614-8508-7_3 ◽

2013 ◽

pp. 11-14

Author(s):

Alain Bensoussan ◽

Jens Frehse ◽

Phillip Yam

Keyword(s):

Mean Field ◽

Mean Field Games ◽

The Mean

Download Full-text

Small-noise asymptotics of Hamilton–Jacobi–Bellman equations and bifurcations of stochastic optimal control problems

Communications in Nonlinear Science and Numerical Simulation ◽

10.1016/j.cnsns.2014.09.029 ◽

2015 ◽

Vol 22 (1-3) ◽

pp. 38-54 ◽

Cited By ~ 4

Author(s):

Dieter Grass ◽

Tatiana Kiseleva ◽

Florian Wagener

Keyword(s):

Optimal Control ◽

Stochastic Optimal Control ◽

Optimal Control Problems ◽

Control Problems ◽

Small Noise ◽

Bellman Equations ◽

Hamilton Jacobi Bellman ◽

Small Noise Asymptotics

Download Full-text

COMPUTATION OF MEAN FIELD EQUILIBRIA IN ECONOMICS

Mathematical Models and Methods in Applied Sciences ◽

10.1142/s0218202510004349 ◽

2010 ◽

Vol 20 (04) ◽

pp. 567-588 ◽

Cited By ~ 83

Author(s):

AIME LACHAPELLE ◽

JULIEN SALOMON ◽

GABRIEL TURINICI

Keyword(s):

Theoretical Result ◽

Optimization Problem ◽

Existence Result ◽

Mean Field ◽

Numerical Results ◽

Mean Field Games ◽

Economy Of Scale ◽

Technological Transition ◽

The Mean

Motivated by a mean field games stylized model for the choice of technologies (with externalities and economy of scale), we consider the associated optimization problem and prove an existence result. To complement the theoretical result, we introduce a monotonic algorithm to find the mean field equilibria. We close with some numerical results, including the multiplicity of equilibria describing the possibility of a technological transition.

Download Full-text

Schrödinger approach to Mean Field Games with negative coordination

SciPost Physics ◽

10.21468/scipostphys.9.4.059 ◽

2020 ◽

Vol 9 (4) ◽

Author(s):

Thibault Bonnemain ◽

Thierry Gobron ◽

Denis Ullmo

Keyword(s):

Mean Field ◽

Time Limit ◽

External Potential ◽

Mean Field Games ◽

Relative Importance ◽

Mean Field Game ◽

Non Linear ◽

The Mean ◽

The Way

Mean Field Games provide a powerful framework to analyze the dynamics of a large number of controlled agents in interaction. Here we consider such systems when the interactions between agents result in a negative coordination and analyze the behavior of the associated system of coupled PDEs using the now well established correspondence with the non linear Schrödinger equation. We focus on the long optimization time limit and on configurations such that the game we consider goes through different regimes in which the relative importance of disorder, interactions between agents and external potential vary, which makes possible to get insights on the role of the forward-backward structure of the Mean Field Game equations in relation with the way these various regimes are connected.

Download Full-text

Connection among Stochastic Hamilton–Jacobi–Bellman Equation, Path-Integral, and Koopman Operator on Nonlinear Stochastic Optimal Control

Journal of the Physical Society of Japan ◽

10.7566/jpsj.90.104802 ◽

2021 ◽

Vol 90 (10) ◽

pp. 104802

Author(s):

Jun Ohkubo

Keyword(s):

Optimal Control ◽

Path Integral ◽

Stochastic Optimal Control ◽

Bellman Equation ◽

Koopman Operator ◽

Hamilton Jacobi Bellman Equation ◽

Hamilton Jacobi Bellman

Download Full-text

Numerical Solution of the Hamilton-Jacobi-Bellman Equation in Stochastic Optimal Control with Application of Portfolio Optimization

International Conference of Computational Methods in Sciences and Engineering 2004 (ICCMSE 2004) ◽

10.1201/9780429081385-105 ◽

2019 ◽

pp. 432-435

Author(s):

Helfried Peyrl ◽

Florian Herzog ◽

Hans P. Geering

Keyword(s):

Optimal Control ◽

Numerical Solution ◽

Portfolio Optimization ◽

Stochastic Optimal Control ◽

Bellman Equation ◽

Hamilton Jacobi Bellman Equation ◽

Hamilton Jacobi Bellman

Download Full-text

Risk‐sensitive maximum principle for stochastic optimal control of mean‐field type Markov regime‐switching jump‐diffusion systems

International Journal of Robust and Nonlinear Control ◽

10.1002/rnc.5358 ◽

2021 ◽

Vol 31 (6) ◽

pp. 2141-2167

Author(s):

Jun Moon

Keyword(s):

Optimal Control ◽

Maximum Principle ◽

Stochastic Optimal Control ◽

Regime Switching ◽

Mean Field ◽

Jump Diffusion ◽

Markov Regime Switching ◽

Field Type ◽

Diffusion Systems ◽

Risk Sensitive

Download Full-text

Learning in mean field games: The fictitious play

ESAIM Control Optimisation and Calculus of Variations ◽

10.1051/cocv/2016004 ◽

2017 ◽

Vol 23 (2) ◽

pp. 569-591 ◽

Cited By ~ 20

Author(s):

Pierre Cardaliaguet ◽

Saeed Hadikhanloo

Keyword(s):

Differential Games ◽

Mean Field ◽

Mean Field Games ◽

Fictitious Play ◽

Mean Field Game ◽

Interacting Agents ◽

Learning Procedure ◽

Equilibrium Configurations ◽

The Mean

Mean Field Game systems describe equilibrium configurations in differential games with infinitely many infinitesimal interacting agents. We introduce a learning procedure (similar to the Fictitious Play) for these games and show its convergence when the Mean Field Game is potential.

Download Full-text