Post

DeepONet

“Inspired from the universal approximation theorem, neural networks with a single layer could not only approximates continuous functions, but also continuous functional or operators1

Concepts

functional: a mapping from a space of functions into a real number

operator: a mapping from a space of functions to another space of functions

Architecture

What DeepONet wants to do is to form a network to learn functions and points together, different from fully-connected neural networks that only gurantee a small approximation error, and expecting consideration on optimization error and generalization error.

Architecture of DeepONet

  • Inputs
    • input functions $u(x_{1}),u(x_{2}),…,u(x_{m})$, will be learnt by (stacked/unstacked) branch net, of which input is $x_{1}, x_{2},…, x_{m}$, also called sensors.
    • points y will be learnt by trunk net
  • Requirements
    • consistency of of sensors
  • Notice
    • y and u(x) may be not equally matched in dimensions in high dimensional problems, so two sub-networks are needed.
    • input function is represented by neural networks.

Notices

  • Input functions here were mean-zero Gaussian Random field(GRF) and Orthogonal polynomials, and could be expanded to more than one function.
  • number of sensors m were derived to be sufficient enough to achieve accuracy, which is set as 100 in the paper.

Personal Opinions

DeepOnet provides a primary method in learning PDEs and ODEs by the input function and seperate points. The input functions may be explicitly expressed by equations or implicitly expressed by networks, which indicates a nested functions that generally appears in the complex phenomena. However,

  1. the process may lack of interpretability, when explaination is needed for the causality between input and output function.
  2. Additionally, all sensors from the input function are packaged to be learned by p branch nets. What if there are multiple input functions to be learnt? Further, if we don’t know prior knowledge, how do we discern these functions from raw data, to achieve an approximator of mechanisms that learn known knowledge and may discover unknown knowledge.

Further development

  • Solved problems
    • multiple-input operators - MIONet2
    • with physics - Physics-informed DeepOnets3
    • Extrapolation - DeepONet with physics or sparse observation4
  • Unsolved problems
    • Without prior knowledge, how could we learn explainable functions with mathematical thinking from raw data?

References

  1. DeepXDE. https://deepxde.readthedocs.io/en/latest/modules/deepxde.nn.tensorflow.html. 

  2. MIONet: Learning Multiple-Input Operators via Tensor Product. (2022). https://doi.org/10.1137/22M1477751. 

  3. Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.(2021). https://doi.org/10.1126/sciadv.abi8605. 

  4. Reliable extrapolation of deep neural operators informed by physics or sparse observations. (2023). https://doi.org/10.1016/j.cma.2023.116064. 

This post is licensed under CC BY 4.0 by the author.