Single Reference Frequency Loss for Multifrequency Wavefield Representation Using Physics-Informed Neural Networks


Highlights
  1. Single reference frequency loss function for multifrequency wavefield.
  2. Mitigate the change in the spatial wavenumber over frequency by adapting the spatial scale to frequency.
  3. Dynamic frequency weighting derivation.
  4. Accuracy improvements seen in the frequency-domain wavefield prediction, time-domain snapshots, and estimated velocity, compared to vanilla PINN.

Abstract

Physics-informed neural networks (PINNs) can offer approximate multidimensional functional solutions to the Helmholtz equation that are flexible, require little memory, and place no limitations on the shape of the solution space. However, the neural network (NN) training can be costly, and the cost increases dramatically when we train for multifrequency wavefields by adding frequency as an additional input to the NN multidimensional function. In this case, the often large variation of the wavefield features (specifically wavelength) with frequency adds more complexity to the NN training. Thus, we propose a new loss function for the NN multidimensional input training that allows us to seamlessly include frequency as a dimension. We specifically utilize the linear relation between frequency and wavenumber (the wavefield space representation) to incorporate a reference frequency scaling into the loss function. As a result, the effective wavenumber of the wavefield solution as a function of frequency remains almost stationary, which reduces the learning burden on the NN function. We demonstrate the effectiveness of this modified loss function on a layered model.

Theory
When we double the frequency, the first and third terms of the Helmholtz equation (the two terms that scale with the squared frequency) quadruple. To keep the Helmholtz equation stationary, we need the second term (the Laplacian operator acting on $\delta \mathbf{U}$) to quadruple as well. To do so, we rescale the spatial axes to maintain the effective wavenumber. Thus, when we double the frequency, we rescale the spatial coordinates by half. For simplicity, we use three frequency-domain wavefields to demonstrate the process. We find that the wavefield obtained by frequency upscaling (the 8-Hz wavefield) and the wavefield obtained by spatial rescaling (the 4-Hz wavefield after spatial rescaling) share similar wavenumber content and both satisfy the Helmholtz equation, which means that a single frequency suffices here to describe two wavefields at different frequencies.
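The rescaling argument can be checked numerically on a toy case. The following is a minimal sketch, assuming a 1D plane wave in a homogeneous medium; the velocity, the choice of 4 Hz as reference, and all function names are hypothetical and chosen only for illustration. It shows that the wavenumber measured along the rescaled axis stays at the reference value, and that the 8-Hz plane wave coincides with the 4-Hz plane wave evaluated on rescaled coordinates.

```python
import cmath
import math

VELOCITY = 2.0  # km/s, hypothetical constant velocity (not from the paper)
F_REF = 4.0     # Hz, reference frequency chosen for this illustration


def wavenumber(freq, velocity=VELOCITY):
    """Spatial wavenumber k = omega / v, which is linear in frequency."""
    return 2.0 * math.pi * freq / velocity


def plane_wave(x, freq, velocity=VELOCITY):
    """1D plane wave exp(i k x) in a homogeneous medium."""
    return cmath.exp(1j * wavenumber(freq, velocity) * x)


def effective_wavenumber(freq, f_ref=F_REF, velocity=VELOCITY):
    """Wavenumber measured along the rescaled axis alpha * x."""
    alpha = freq / f_ref  # ratio of current to reference frequency
    return wavenumber(freq, velocity) / alpha


if __name__ == "__main__":
    # The raw wavenumber grows with frequency; the effective one does not.
    for freq in (2.0, 4.0, 8.0):
        print(freq, wavenumber(freq), effective_wavenumber(freq))
```

In this toy setting the equivalence is exact; in a heterogeneous model the text above only claims similar wavenumber content, so the sketch should be read as an idealized illustration.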




The new single reference frequency loss function is \begin{equation} \mathcal{L} = \frac{1}{N} \sum_{i=1}^{N}\left|\omega_{ref}^{2} \mathbf{m}^{i} \Phi(\theta, \mathbf{x}^i)+\frac{\partial^2\Phi(\theta, \mathbf{x}^i)}{\partial(\alpha\mathbf{x}^i)^2}+\omega_{ref}^{2} \delta \mathbf{m}^{i} U_{0}^i\right|_{2}^{2}, \label{srl} \end{equation} where $\alpha$ is the scaling factor, equal to the ratio of the current frequency to the reference frequency, $\omega_{ref}$ is the reference angular frequency, and $N$ is the number of training samples. In practice, we implement equation \ref{srl} using a computational graph.
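To make the role of $\alpha$ in the loss concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: central finite differences stand in for the automatic differentiation a computational graph would provide, the surrogate `phi(x, z, freq)` is any callable rather than a trained NN, and the sample keys (`x`, `z`, `freq`, `m`, `dm`, `u0`) are hypothetical names for the per-sample quantities in the equation. Differentiating with respect to $\alpha\mathbf{x}$ twice is the same as dividing the ordinary Laplacian by $\alpha^2$, which is the only place the current frequency enters.

```python
import cmath
import math


def laplacian_2d(phi, x, z, freq, h=1e-3):
    """Central-difference Laplacian of phi in x and z (stand-in for autodiff)."""
    center = phi(x, z, freq)
    d2x = (phi(x + h, z, freq) - 2.0 * center + phi(x - h, z, freq)) / (h * h)
    d2z = (phi(x, z + h, freq) - 2.0 * center + phi(x, z - h, freq)) / (h * h)
    return d2x + d2z


def single_reference_loss(phi, samples, f_ref, h=1e-3):
    """Mean squared residual with every frequency mapped to f_ref.

    Each sample is a dict with hypothetical keys: x, z, freq, m, dm, u0.
    """
    omega_ref = 2.0 * math.pi * f_ref
    total = 0.0
    for s in samples:
        alpha = s["freq"] / f_ref  # current frequency over reference frequency
        # d^2 / d(alpha * x)^2 equals (1 / alpha^2) * d^2 / dx^2.
        lap_scaled = laplacian_2d(phi, s["x"], s["z"], s["freq"], h) / alpha ** 2
        residual = (
            omega_ref ** 2 * s["m"] * phi(s["x"], s["z"], s["freq"])
            + lap_scaled
            + omega_ref ** 2 * s["dm"] * s["u0"]
        )
        total += abs(residual) ** 2
    return total / len(samples)
```

As a sanity check under these assumptions, a plane wave with the correct velocity and no perturbation (`dm = 0`) should give a loss close to zero at every frequency, while a plane wave with the wrong velocity should not.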

Results

With one more input dimension, the NN representation of the wavefield becomes harder to obtain via conventional training. In contrast, our proposed loss function provides reasonable results given the larger, four-dimensional input space, and we obtain a much more accurate amplitude and phase representation of the wavefield. We also calculated the velocity models from the PINN predictions using equation \ref{srl}. The PINN with our proposed loss function reconstructs the details of the velocity model much better than the vanilla method.

Transforming the multifrequency wavefield into time-domain records helps us understand which parts of the wavefield we managed to predict accurately in time. Using the inverse Fourier transform, we obtain time-domain snapshots of the multifrequency wavefield representation for the numerical implementation, the conventional PINN, and our proposed PINN. Fig. 7 shows the time-domain snapshot at 1.0 s transformed from 3- to 8-Hz wavefields with a frequency interval of 0.2 Hz. We observe that the time-domain snapshot obtained by the model trained with the single reference frequency loss function agrees better with the numerical result than the one obtained with the vanilla multifrequency loss function, specifically for the key reflection corresponding to the high-velocity perturbation at 2.0 km.
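The band-limited transform described above can be sketched as a discrete sum over the sampled frequencies. This is an illustrative sketch only: the sign convention exp(+i 2 pi f t), the absence of any amplitude normalization, and the names `frequency_axis` and `snapshot` are assumptions, and `wavefield(x, f)` is a hypothetical callable standing in for either the numerical solution or a PINN prediction.

```python
import cmath
import math


def frequency_axis(f_min=3.0, f_max=8.0, df=0.2):
    """Frequencies from f_min to f_max inclusive, spaced by df (in Hz)."""
    n = int(round((f_max - f_min) / df)) + 1
    return [f_min + k * df for k in range(n)]


def snapshot(wavefield, xs, t, freqs):
    """Time-domain snapshot at time t as a band-limited inverse Fourier sum.

    Returns the real part of sum_f U(x, f) * exp(+i 2 pi f t) for each x;
    the sign convention and the missing scale factor are assumptions.
    """
    return [
        sum((wavefield(x, f) * cmath.exp(2j * math.pi * f * t)).real for f in freqs)
        for x in xs
    ]
```

For example, passing a toy wavefield such as `lambda x, f: cmath.exp(-2j * math.pi * f * x / 1.5)` (a pulse travelling at a hypothetical 1.5 km/s) with `t = 1.0` should produce a snapshot whose largest value sits near x = 1.5 km.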


Citation

If you found the paper useful, please cite it via:

X. Huang and T. Alkhalifah, "Single Reference Frequency Loss for Multifrequency Wavefield Representation Using Physics-Informed Neural Networks," in IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1-5, 2022, Art no. 3007105, doi: 10.1109/LGRS.2022.3176867.
          
@article{huang2022PINNref,
  author={Huang, Xinquan and Alkhalifah, Tariq},
  journal={IEEE Geoscience and Remote Sensing Letters}, 
  title={Single Reference Frequency Loss for Multifrequency Wavefield Representation Using Physics-Informed Neural Networks}, 
  year={2022},
  volume={19},
  pages={1-5},
  doi={10.1109/LGRS.2022.3176867}}