Solving Schrodinger equations using a physically constrained neural network

Kai-Fang Pu; Han-Lin Li; Hong-Liang Lü; Long-Gang Pang

doi:10.1088/1674-1137/acc518

Chinese Physics C> 2023, Vol. 47> Issue(5) : 054104 DOI: 10.1088/1674-1137/acc518

Solving Schrodinger equations using a physically constrained neural network

1.
College of Science, Wuhan University of Science and Technology, Wuhan 430065, China
2.
HiSilicon Research Department, Huawei Technologies Co., Ltd., Shenzhen 518000, China
3.
Key Laboratory of Quark and Lepton Physics (MOE) and Institute of Particle Physics, Central China Normal University, Wuhan 430079, China

Abstract
HTML
Reference
Related

PDF

Abstract：
Deep neural networks (DNNs) and auto differentiation have been widely used in computational physics to solve variational problems. When a DNN is used to represent the wave function and solve quantum many-body problems using variational optimization, various physical constraints have to be injected into the neural network by construction to increase the data and learning efficiency. We build the unitary constraint to the variational wave function using a monotonic neural network to represent the cumulative distribution function (CDF) $F(x) = \int_{-\infty}^{x} \psi^*\psi {\rm d}x'$. Using this constrained neural network to represent the variational wave function, we solve Schrodinger equations using auto-differentiation and stochastic gradient descent (SGD) by minimizing the violation of the trial wave function $ \psi(x) $ to the Schrodinger equation. For several classical problems in quantum mechanics, we obtain their ground state wave function and energy with very low errors. The method developed in the present paper may pave a new way for solving nuclear many-body problems in the future.
- deep neural network ,
- auto differentiation ,
- variational problems ,
- the cumulative distribution function ,
- ground state wave function

References

[1]	G. V. Cybenko, Mathematics of Control, Signals and Systems 2, 303 (1989) doi: 10.1007/BF02551274
[2]	A. Boehnlein et al., Rev. Mod. Phys. 94, 031003 (2022) doi: 10.1103/RevModPhys.94.031003
[3]	D. Saad, American Scientist 92, 578 (2004)
[4]	P. Mehta, M. Bukov, C.-H. Wang et al., Physics reports 810, 1 (2019) doi: 10.1016/j.physrep.2019.03.001
[5]	E. M. Nordhagen, J. M. Kim, B. Fore et al., arXiv: 2210.00365
[6]	B. R. Barrett, P. Navrátil, and J. P. Vary, Progress in Particle and Nuclear Physics 69, 131 (2013) doi: 10.1016/j.ppnp.2012.10.003
[7]	G. Torlai, G. Mazzola, J. Carrasquilla et al., Nature Physics 14, 447 (2018) doi: 10.1038/s41567-018-0048-5
[8]	C. Adams, G. Carleo, A. Lovato et al., Phys. Rev. Lett. 127, 022502 (2021) doi: 10.1103/PhysRevLett.127.022502
[9]	D. Pfau, J. S. Spencer, A. G. D. G. Matthews, and W. M. C. Foulkes, Phys. Rev. Res. 2, 033429 (2020) doi: 10.1103/PhysRevResearch.2.033429
[10]	M. Ruggeri, S. Moroni, and M. Holzmann, Phys. Rev. Lett. 120, 205302 (2018) doi: 10.1103/PhysRevLett.120.205302
[11]	J. Han, L. Zhang, and E. Weinan, Journal of Computational Physics 399, 108929 (2019) doi: 10.1016/j.jcp.2019.108929
[12]	S. Shi, K. Zhou, J. Zhao, S. Mukherjee, and P. Zhuang, Phys. Rev. D 105, 014017 (2022) doi: 10.1103/PhysRevD.105.014017
[13]	K. Choo, A. Mezzacapo, and G. Carleo, Nature communications 11, 2368 (2020) doi: 10.1038/s41467-020-15724-9
[14]	M. Scherbela, R. Reisenhofer, L. Gerard et al., Nature Computational Science 2, 331 (2022) doi: 10.1038/s43588-022-00228-x
[15]	Y. Yang and P. Zhao, arXiv: 2211.13998
[16]	R. P. Feynman, Rev. Mod. Phys. 20, 367 (1948) doi: 10.1103/RevModPhys.20.367
[17]	S. Chen, O. Savchuk, S. Zheng et al., Phys. Rev. D 107, 056001 (2023) doi: 10.1103/PhysRevD.107.056001
[18]	Y. Che, C. Gneiting, and F. Nori, Phys. Rev. B 105, 214205 (2022) doi: 10.1103/PhysRevB.105.214205
[19]	M. Raissi, P. Perdikaris, and G. E. Karniadakis, arXiv: 1711.10561
[20]	M. Raissi, P. Perdikaris, and G. E. Karniadakis, Journal of Computational physics 378, 686 (2019) doi: 10.1016/j.jcp.2018.10.045
[21]	E. Haghighat, M. Raissi, A. Moure et al., Computer Methods in Applied Mechanics and Engineering 379, 113741 (2021) doi: 10.1016/j.cma.2021.113741
[22]	J. Hendriks, C. Jidling, A. Wills et al., arXiv: 2002.01600
[23]	J. Hermann, Z. Schätzle, and F. Nóe, Nature Chemistry 12, 891 (2020) doi: 10.1038/s41557-020-0544-y
[24]	P. Hohenberg and W. Kohn, Phys. Rev. 136, B864 (1964) doi: 10.1103/PhysRev.136.B864
[25]	W. Kohn and L. J. Sham, Phys. Rev. 140, A1133 (1965) doi: 10.1103/PhysRev.140.A1133
[26]	M. S. Badar, S. Shamsi, J. Ahmed et al., Molecular dynamics simulations: concept, methods, and applications, in Transdisciplinarity (Springer, 2022), p. 131
[27]	D. Luo, G. Carleo, B. K. Clark et al., Phys. Rev. Lett. 127, 276402 (2021) doi: 10.1103/PhysRevLett.127.276402
[28]	H. J. Rothe, Lattice gauge theories: an introduction (Singapore: World Scientific Publishing Company, 2012), p. 628
[29]	R. Abbott, M. S. Albergo, A. Botev et al., arXiv: 2208.03832
[30]	J. Keeble and A. Rios, Phys. Lett. B 809, 135743 (2020) doi: 10.1016/j.physletb.2020.135743
[31]	H. Saito, Journal of the Physical Society of Japan 87, 074002 (2018) doi: 10.7566/JPSJ.87.074002
[32]	C. Giuseppe and M. Troyer, Science 355, 602 (2017) doi: 10.1126/science.aag2302
[33]	A. Paszke, S. Gross, S. Chintala et al., Automatic differentiation in pytorch (2017)
[34]	X. Glorot and Y. Bengio, Understanding the difficulty of training deep feedforward neural networks, in Proceedings of the thirteenth international conference on artificial intelligence and statistics (JMLR Workshop and Conference Proceedings, 2010), p. 249
[35]	I. Senitzky, Phys. Rev. 124, 642 (1961) doi: 10.1103/PhysRev.124.642
[36]	M. Capak and B. Gönül, Modern Physics Letters A 31, 1650134 (2016) doi: 10.1142/S0217732316501340
[37]	R. L. Karandikar, Sadhana 31, 81 (2006) doi: 10.1007/BF02719775
[38]	M. Abadi, A. Agarwal, P. Barham et al., arXiv: 1603.04467
[39]	D. P. Kingma and J. Ba, arXiv: 1412.6980

[1]	Jialei Wei , Ao Liu , Dejiang Li , Cuihong Wen . Physical parameter regression from black hole images using a multiscale adaptive neural network. Chinese Physics C, 2025, 49(12): 125105. doi: 10.1088/1674-1137/adf542
[2]	To Chung Yiu , Haozhao Liang , Jenny Lee . Nuclear mass predictions based on a deep neural network and finite-range droplet model (2012). Chinese Physics C, 2024, 48(2): 024102. doi: 10.1088/1674-1137/ad021c
[3]	Xiangrui Gao , Yu Jia , Liuji Li , Xiaonu Xiong . Relativistic correction to gluon fragmentation function into pseudoscalar quarkonium. Chinese Physics C, 2017, 41(2): 023103. doi: 10.1088/1674-1137/41/2/023103
[4]	Masoumeh Mohamadian , Hossein Afarideh , Mitra Ghergherehchi . Optimized feed-forward neural-network algorithm trained for cyclotron-cavity modeling. Chinese Physics C, 2017, 41(1): 017003. doi: 10.1088/1674-1137/41/1/017003
[5]	Cai-Xun Zhang , Shin-Ted Lin , Jian-Ling Zhao , Xun-Zhen Yu , Li Wang , Jing-Jun Zhu , Hao-Yang Xing . Discrimination of neutrons and γ-rays in liquid scintillator based on Elman neural network. Chinese Physics C, 2016, 40(8): 086204. doi: 10.1088/1674-1137/40/8/086204
[6]	A. Mirjalili , K. Keshavarzian . Meson polarized distribution function and mass dependence ofthe nucleon parton densities. Chinese Physics C, 2014, 38(8): 083101. doi: 10.1088/1674-1137/38/8/083101
[7]	LI Zhe , TUO Xian-Guo , YANG Jian-Bo , LIU Ming-Zhe , CHENG Yi , WANG Lei . Statistical distribution based detector response function ofSi (PIN) detector for K_α and K_β X-ray. Chinese Physics C, 2013, 37(1): 018202. doi: 10.1088/1674-1137/37/1/018202
[8]	YANG Chao , WU Xiao-Bing , LIU Da-Gang . Three-dimensional particle-in-cell with Monte Carlo collision simulation of the electron energy distribution function in the multi-cusp ion source for proton therapy. Chinese Physics C, 2012, 36(10): 1013-1018. doi: 10.1088/1674-1137/36/10/018
[9]	MA Kai , WANG Jian-Hua , YUAN Yi . Wigner function for the Dirac oscillator in spinor space. Chinese Physics C, 2011, 35(1): 11-15. doi: 10.1088/1674-1137/35/1/003
[10]	HOU Zhao-Yu , GUO Peng , WU Wen-Wang . Uncertainty study of D_S^-(D^-)→γlν (l=e,μ) decays determined by wave function. Chinese Physics C, 2011, 35(7): 603-607. doi: 10.1088/1674-1137/35/7/001
[11]	WANG Zhi , CHEN Jia-Er , LU Yuan-Rong , GUO Zhi-Yu , YAN Xue-Qing , ZHU Kun , KANG Ming-Lei , FANG Jia-Xun . Study of separated function radio frequency quadrupoles accelerator. Chinese Physics C, 2010, 34(4): 502-505. doi: 10.1088/1674-1137/34/4/017
[12]	CHEN Jia-Er , ZHU Kun , GUO Zhi-Yu , LU Yuan-Rong , YAN Xue-Qing , GAO Shu-Li , WANG Zhi , KANG Ming-Lei , FANG Jia-Xun , YU Mao-Lin , LI Wei-Guo , GUO Ju-Fang . Power test of the separated function RFQ accelerator. Chinese Physics C, 2009, 33(S2): 56-59. doi: 10.1088/1674-1137/33/S2/015
[13]	CHEN Guang-Ling , TIAN Shun-Qiang , JIANG Bo-Cheng , LIU Gui-Min . A GUI tool for beta function measurement using MATLAB. Chinese Physics C, 2009, 33(4): 297-300. doi: 10.1088/1674-1137/33/4/012
[14]	WANG Si-Guang , MAO Ya-Jun , YE Hong-Xue . An artificial neural network for proton identification in HERMES data. Chinese Physics C, 2009, 33(3): 217-223. doi: 10.1088/1674-1137/33/3/011
[15]	XIANG Wen-Chang , ZHOU Dai-Cui , WAN Ren-Zhuo , YUAN Xian-Bao . Analytic expression for the proton structure function in deep inelastic scattering. Chinese Physics C, 2009, 33(2): 98-102. doi: 10.1088/1674-1137/33/2/004
[16]	ZHANG Zhao , ZHAO Wei-Qin . Green Function Iterative Solution of Ground State Wave Function for Yukawa Potential. Chinese Physics C, 2003, 27(3): 215-222.
[17]	Gao Xiaochun , Gao Jun , Fu Jian . Invariant-Related Unitary Transformation and the Evolution of the Wave Function of the Universe in Third-Quantized Cosmology. Chinese Physics C, 1996, 20(S1): 61-69.
[18]	Yu Qingchang . Contour Function Theory for Transport of Charged Particle Beams. Chinese Physics C, 1995, 19(S3): 319-323.
[19]	Wang Shan , Liu Yiming , Jiang Yuzhen , Chen Xiaofan , D. Keane , S. Y. Fung , S. Y. Chu . Azimuthal Correlation Function and the Nuclear Equation of State. Chinese Physics C, 1990, 14(S4): 361-366.
[20]	Lin Chunzhen , Wu Chongshi , Zeng Jinyan . Seniority and K-Structure of the Cranked Shell Model Wave Function (I) Even-Even Nuclei. Chinese Physics C, 1988, 12(S4): 415-424.

Access

Figures(4) / Tables(1)

Get Citation

Kai-Fang Pu, Han-Lin Li, Hong-Liang Lü and Long-Gang Pang. Solving Schrodinger equations using physically constrained neural network[J]. Chinese Physics C. doi: 10.1088/1674-1137/acc518

Kai-Fang Pu, Han-Lin Li, Hong-Liang Lü and Long-Gang Pang. Solving Schrodinger equations using physically constrained neural network[J]. Chinese Physics C. doi: 10.1088/1674-1137/acc518 shu

RIS(for EndNote,Reference Manager,ProCite)

BibTex

Txt

Milestone

Received: 2023-01-01

Article Metric

Article Views(4775)
PDF Downloads(72)
Cited by(0)

Policy on re-use

To reuse of subscription content published by CPC, the users need to request permission from CPC, unless the content was published under an Open Access license which automatically permits that type of reuse.

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

HTML

I. INTRODUCTION

The universal approximation theorem of the deep neural network (DNN) [1] makes it powerful for representing a variational function $ y = f(x, \theta) $ with trainable parameters θ. In physics, this function can be used as solution of many different partial differential equations (PDEs) $ \hat{L} f = 0 $, such as Maxwell equations in the electromagnetic field, Navier-Stokes equations in fluid dynamics, Schrodinger equations in quantum mechanics, and Einstein field equations for gravity. The traditional way to solve this problem is to use physical models. These models face great challenges in solving inverse problems with complex geometric regions and high-dimensional space. Unlike these models, the deep learning method developed in this study provides a new direction to solve these problems. As the parameters of a DNN are initialized with random numbers, the variational function $ f(x, \theta) $ violates the PDEs, and the residuals $ \delta = |\hat{L} f| $ are usually the optimization objectives that can be minimized to the desired precision. In this way, many physical problems [2] are naturally mapped into optimization problems [3] that can be solved using the modern deep learning libraries.

The main advantages of machine learning are that (1) it directly establishes the function mapping between input and output data, and (2) ordinary differential equations (ODEs) and PDEs can be transformed into variational problems that can be solved using optimization. Machine learning can be helpful in finding low-dimensional manifolds in a high-dimensional space, which is crucial for the quantum many-body problem, which suffers from the curse of dimensionality. The associated disadvantage is that it is at an early stage of development and its applicability to computational physics has not been fully tested.

With strong information encapsulation capability, deep learning has been proved to be a powerful tool in solving quantum many-body problems [4–8]. The most typical application is to use the DNN to represent the wave function of quantum many-body states for many-electron systems [9]. In subsequent developments, artificial neural network (ANN) applications were extended to prototypical spin lattice systems and quantum systems in a continuous space [10–12]. Recently, machine learning has been used to deal with ab-initio problems [13–15]. The Feynman path integral [16] is another method for solving quantum state problems. Modern generative models can represent a probability distribution with high computational efficiency. A Fourier-flow generative model has been proposed to simulate the Feynman propagator and generate paths for quantum systems [17]. Further, Ref. [18] proposed a Feynman path generator that can estimate the Euclidean propagator and the ground state wave function with high accuracy.

PDEs usually have boundary and/or initial conditions. In an early study, these initial and boundary conditions were built into the neural network by construction, and the training objective was to minimize the residual δ alone. This method uses hard constraints such that $ f(x, \theta) $ satisfies the initial and boundary conditions automatically. It is thus quite data efficient. The recent physics informed neural network [19–21] uses soft constraints where the violations to initial and boundary conditions are also added to the training objective $L = |\hat{L} f| + \beta_1 |\delta_{BC}| + \beta_2 |\delta_{IC}|$.

Some variational functions should obey physical constraints. For example, in solving the Maxwell equations, the magnetic field represented by the DNN should be divergence free. To include this constraint, the paper "Linearly constrained neural network" proposes a DNN that produces a vector field $ \vec{A}(x, y, z, \theta) $ whose curl $ \nabla \times \vec{A} $ is divergence free [22]. It is thus also possible to construct a scalar field $ \phi(x, y, z, \theta) $ whose gradients $ (\partial_x \phi, \partial_y \phi, \partial_z \phi) $ are curl free. Actually, a general method has been developed to construct neural networks with linear constraints. In solving the many-body Schrodinger equations, the many fermion wave function should be anti-symmetric. FermiNet and PauliNet use the Slater determinant to construct DNNs that are anti-symmetric. [23] In DFT [24–26] and molecular dynamics [26], the local chemical environment usually has translational or rotational symmetry that is considered using a gauge equivalent neural network [27]. In the lattice gauge field theory [28], gauge equivariant normalizing flows are employed to sample field configurations [29].

In the present work, we use a monotonic neural network to represent the cumulative distribution function $\int_{-\infty}^{x} f(x') {\rm d}x'$, whose first order derivative is the probability density $ f(x) = \psi^*(x)\psi(x) $ that gives the ground state wave function. The present paper demonstrates that a neural network with physical constraints can be used as efficient trial wave functions of Schrodinger equations. Auto-diff helps to compute the required derivatives of the trial function with respect to the input variables. In this way, optimizing the violation of the trial function to PDEs allows solving the PDEs with high accuracy. Compared to previous methods, our method does not need to calculate any numerical integrals in the whole calculation and the unitary constraint we impose on the variational wave function increases the data learning efficiency. The improved algorithm greatly reduces the amount of computation required to solve the same Schrodinger equation. These advantages make our method more suitable for dealing with many-body states, which require a huge amount of computation.

IV. CONCLUSIONS

In the present study, we used a physics-based neural network to solve Schrodinger equations numerically. We designed a monotonic neural network to represent the CDF of the ground state wave function. In this way, the wave function represented by the DNN satisfies the normalization condition by design. The variational optimization is reduced to an optimization problem by minimizing the violation of the trial wave function and trial ground state energy $ E_0 $ to Schrodinger equations. The method is used to solve Schrodinger equations with three different potentials, the harmonic oscillator, the Woods-Saxon potential, and the infinitely high potential well, all with a small relative error.

Compared to traditional variational methods in solving quantum mechanical problems, the trial wave function represented by the DNN does not have fixed function forms before training. The training objective is different from the traditional $ E_0 = \dfrac{\langle \psi | H | \psi \rangle}{\langle \psi | \psi \rangle } $, where numerical integration is required for both the numerator and denominator. In our case, the objective is to minimize the violation to the Schrodinger equation on sampled spatial coordinates. As the neural network is constrained, the trial wave function is normalized by construction. Our method is also different from the previous Schrodinger equation solver using supervised learning, where ground state energies from numerical solutions are needed to train the neural network. In another DNN Schrodinger solver [30, 31], the initial values of the network parameters greatly affect the optimization results. To avoid strong fluctuations, they provide a trial wave function whose form is close to the exact solution. The disadvantage of the previous algorithms is that they can only solve problems in which the form of the exact solution of the equation is known. Our algorithm can directly ignore the pre-training process, so we do not need to know any information of the exact solution before training. This is more universal and provides the possibility to solve problems that have never been dealt with before. In addition, we observe that our DNN can approximate the ground state wave function with fewer trainable parameters. Moreover, the physical constraints constructed in the neural network make the current method quite data efficient. Thus, we can achieve higher accuracy with less computation.

The current method can be improved in several ways. First, the CDF works for wave functions in high dimensional space as long as the n-dim spatial coordinates are flattened. Second, the spatial coordinates used for training can be sampled using the learned wave function or through active learning, to increase the training efficiency. Third, the anti-symmetric constraints of the wave function should be considered for many fermion systems. Although further efforts have to be done to improve the current method, it shows good properties in solving classical quantum mechanical problems. The next step is to solve the ground state energy and wave functions of the deuteron. It also paves a new way in solving many nucleon problems.

ACKNOWLEDGMENTS

LG Pang and KF Pu acknowledge the support provided by Huawei Technologies Co., Ltd. The contributions of Dr. Hong-Liang Lü are non-Huawei achievements. The computations were performed at the NSC3 super cluster at CCNU and High-Performance Computing Center of Wuhan University of Science and Technology.

Reference (39)

$N_{\rm unit}N_{\rm layer}$	1	2	3	4
4	0.9995717	0.9999767	0.9999705	0.9999618
8	0.9999416	0.9999797	0.9999910	0.9999932
16	0.9999861	0.9999923	0.9999936	0.9999967
32	0.9999789	0.9999909	0.9999896	0.9999922
64	0.9999744	0.9999746	0.9999903	0.9999941

Solving Schrodinger equations using a physically constrained neural network

Abstract：

References

Access

Article Metrics

Metrics

通讯作者: 陈斌, bchen63@163.com

Email This Article

Solving Schrodinger equations using a physically constrained neural network

HTML

目录