Abstract
Many-body interactions are essential for understanding non-linear optics and ultrafast spectroscopy of materials. Recent first principles approaches based on nonequilibrium Greenâs function formalisms, such as the time-dependent adiabatic GW (TD-aGW) approach, can predict nonequilibrium dynamics of excited states including electron-hole interactions. However, the high-dimensionality of the electron-hole kernel poses significant computational challenges. Here, we develop a data-driven low-rank approximation for the electron-hole kernel, leveraging localized excitonic effects in the Hilbert space of crystalline systems to achieve significant data compression through singular value decomposition (SVD). We show that the subspace of non-zero singular values remains small even as the k-grid grows, ensuring computational tractability with extremely dense k-grids. This low-rank property enables at least 95% data compression and an order-of-magnitude speedup of TD-aGW calculations. Our approach avoids intensive training processes and eliminates time-accumulated errors, seen in previous approaches, providing a general framework for high-throughput, nonequilibrium simulation of light-driven dynamics in materials.
Similar content being viewed by others
Introduction
In recent years, ultrafast spectroscopic techniques, including pump-probe transient absorption1,2, solid state high-harmonic generation (HHG)3, terahertz spectroscopy4, and time-resolved angle-resolved photoemission spectroscopy (TR-ARPES)5, have emerged as crucial techniques for studying novel non-equilibrium physics in condensed matter systems. Interpreting the complex features observed in these experiments requires a predictive understanding of the roles of electron-electron and electron-phonon interactions and other many-body effects under highly non-equilibrium and strong-field conditions. Electron-electron (and electron-hole) interactions, in particular, play a key role in the coherent femtosecond dynamics of solid state systems, leading to a wide variety of phenomena, including light-induced superconductivity, many-body enhancement of HHG, coherent and incoherent excitonic effects in ARPES, and exciton-enhanced shift currents3,6,7,8,9,10,11,12. In the last decades, there has been an intense effort to simulate these driven non-equilibrium electronic processes from first principles13,14,15,16,17,18. One highly successful approach is the time-dependent adiabatic GW (TD-aGW)8,9,13,19, which is a non-equilibrium quantum master equation approach that simulates the dynamics of the single-electron reduced density matrix and has successfully described diverse phenomena, including transient absorption19, TR-ARPES9, and HHG with exciton effects7. However, a significant challenge lies in the computational demands of these simulations, which are prohibitively slow for large systems and for parameter sweeping, which is necessary to describe different driving-field conditions in ultrafast and strong-field spectroscopic measurements.
There has been growing interest in leveraging data-driven and machine learning techniques to accelerate first-principles many-body simulations, using methods such as low-rank approximations20,21,22,23,24,25,26,27, representation learning28,29,30,31,32,33, equivariant graph neural networks34,35,36 and other deep learning approaches37,38,39,40. For non-equilibrium simulations, data-driven prediction based on extrapolation from an existing time sequence, using methods such as dynamic mode decomposition (DMD)41,42,43 or Gaussian processes44, are commonly employed. However, since these approaches are extrapolative, they fail to describe situations where the external field changes dramatically outside the initially observed time interval, which is crucial for modeling ultrafast spectroscopy. Operator learning neural networks45,46,47,48,49,50 show promise for solving differential equations under more general conditions. However, such operator learning requires extensive training datasets for different driving conditions and still suffers from time-accumulated predictive errors and a black-box process that can make it difficult to extract physical understanding45,51,52. Thus, the development of new data-driven techniques that can accelerate calculations of many-body dynamics under highly non-equilibrium conditions and are robust against both rapid changes in the external field and error accumulation over the simulation time is urgently needed.
Here, we develop a data-driven low-rank approximation for the electron-hole kernelâthe key interaction that encodes exciton effects across different many-body approachesâbased on singular value decomposition (SVD). It exploits the inherent localization of excitonic effects within valleys in reciprocal space, which gives rise to a low-rank structure for the electron-hole interaction kernel across crystalline materials. In practice, we find that the kernel can be compressed by as much as 95% in most crystalline solids, while retaining information essential for reproducing both optical absorption spectra, for excitons with near-zero momentum, and energy loss spectra for finite momentum excitons. Moreover, we apply the SVD-compressed electron-hole kernel to accelerate TD-aGW calculations, achieving an order of magnitude speedup for calculations of both linear and nonlinear spectra under different driving conditions. This speedup makes repeated calculations under different field conditions computationally tractable, allowing first principles simulations to mirror experimental conditions. In contrast with approaches based on extrapolation from early-time dynamics, our physically interpretable approach exhibits no accumulated predictive error over time and requires no datasets or training, laying the groundwork for high-throughput simulations of nonequilibrium light-driven dynamics in materials.
Results
Two-particle electron-hole kernel in non-equilibrium dynamics
In interacting many-body systems, the equation of motion of the non-equilibrium single-particle Greenâs function takes the form:
where t, \({t}^{{\prime} }\) lie on the Keldysh contour; \(G(t,{t}^{{\prime} })\) is the two-time Greenâs function; Σ is the self-energy matrix in the Keldysh formalism53; and H(t) is a mean-field Hamiltonian. There is an equivalent adjoint equation for the time-evolution over \({t}^{{\prime} }\). Following the Langreth rules, one can derive a set of self-consistent equations from Eq. (1) for the retarded, advanced, lesser, and greater two-time Greenâs functions on the real-time axis, known as the Kadanoff-Baym equations (KBE)53,54,55. The self-energy can be computed within conserving schemes, such as the GW approximation56,57,58âi.e., Σâ=âiGW, where W is the screened Coulomb potential commonly approximated by the random phase approximation (RPA)59,60.
One challenge in solving Eq. (1) is that the time evolution of G over t and \({t}^{{\prime} }\) must be performed simultaneously. In first principles calculations, a common approximation is to decouple the many-body effects in equilibrium from the changes induced by the driving field by splitting the self-energy into an equilibrium piece Σ[G0], plus a correction term δΣ[G, G0]â=âΣ[G]âââΣ[G0], where G0 is the Greenâs function in the absence of an external field. Then, G0 is evaluated within the equilibrium GW approximation56,57,58, while the correction is evaluated in the static limit of the GW approximation (i.e., the static Coulomb hole and screened exchange approximation, or static-COHSEX) which ignores retardation effects in ΣGW and thus eliminates the double time dependence13,19,56,57,58. In this approximation, the non-equilibrium dynamics of the system can be described by the equation of motion of the reduced single-particle density matrix, expressed here in a Bloch basis, \({\rho }_{nm{\bf{k}}}=\left\langle \right.n{\bf{k}}| \rho | m{\bf{k}}\left.\right\rangle\)13:
where the quasiparticle Hamiltonian HQPâ=âH0â+âΣGW[G0], and δΣGW is the non-equilibrium change in the static-COHSEX self-energy, in which time-dependent changes to the screened Coulomb interaction are typically neglected13. E(t) is an arbitrary external electric field, which couples to the system in the length gauge through the position operator r61,62. The dipole approximation is made, so there are no finite momentum transitions, and the reduced single-electron density matrix only needs one crystal momentum index k. We refer to Eq. (2) as the time-dependent adiabatic GW (TD-aGW)8,9,13.
In the linear response limit, TD-aGW can be explicitly proven to be equivalent to the GW-Bethe-Salpeter equation (GW-BSE) approach in many-body perturbation theory (Fig. 1a, b)13,63. In GW-BSE, exciton eigenstates are found by diagonalizing an effective Hamiltonian consisting of the non-interacting electron-hole pair Hamiltonian plus the BSE electron-hole interaction kernel, which is derived by taking the functional derivative \(K=\frac{\delta \Sigma }{\delta G}\) of the GW self-energy and then ignoring both the non-equilibrium time evolution of the screened Coulomb interaction W and the frequency dependence of the screened Coulomb interaction63,64,65. We note that these approximations are consistent with the approximations made to the GW self-energy in TD-aGW. Therefore, under the static-COHSEX approximation for the single-electron self-energy in TD-aGW, δΣGW is given by the matrix product of the BSE kernel K and the non-equilibrium lesser Greenâs function, i.e.,
The non-equilibrium electron dynamics are therefore completely controlled by the two-particle BSE kernel \({K}_{{n}_{1}{n}_{2}{\bf{k}}{n}_{3}{n}_{4}{{\bf{k}}}^{{\prime} }}\) (Fig. 1). When finite momentum electron-hole pairs are created, \({K}_{{n}_{1}{n}_{2}{\bf{k}}{n}_{3}{n}_{4}{{\bf{k}}}^{{\prime} }}\) has to be replaced by \({K}_{{n}_{1}{n}_{2}{\bf{k}},{n}_{3}{n}_{4}{{\bf{k}}}^{{\prime} }}^{{\bf{Q}}}\), where k and \({{\bf{k}}}^{{\prime} }\) are the momenta of the hole after and before electron-hole interaction, and Q is the center of mass momentum of the electron-hole pair. The kernel is composed of two terms (Fig. 1b)66,67,68: an exchange interaction mediated by the bare Coulomb interaction v:
and a direct interaction mediated by the screened Coulomb interaction
Here, the matrices are written in the electron-hole pair basis, where \(| n{n}^{{\prime} }{\bf{k}}{\bf{Q}}\left.\right\rangle ={| n{\bf{k}}+{\bf{Q}}\left.\right\rangle }_{{\rm{electron}}}\otimes {| {n}^{{\prime} }{\bf{k}}\left.\right\rangle }_{{\rm{hole}}}\); v(Qâ+âG) denotes the bare Coulomb interaction, where G is a reciprocal lattice vector representing how much the momentum transfer exceeds the first Brillouin zone; \({W}_{{\bf{G}}{{\bf{G}}}^{{\prime} }}({\bf{q}})\) is the screened Coulomb interaction; \({\bf{q}}={\bf{k}}-{{\bf{k}}}^{{\prime} }\); and \({M}_{n{n}^{{\prime} }}({\bf{k}},{\bf{q}},{\bf{G}}):= \langle n{\bf{k}}+{\bf{q}}| {{\rm{e}}}^{{\rm{i}}({\bf{q}}+{\bf{G}})\cdot {\bf{r}}}| {n}^{{\prime} }{\bf{k}}\rangle\). In the current work (and previous first principles implementations66,69), the electron and hole states are Kohn-Sham (KS) states70 derived from density functional theory (DFT) in a plane-wave basis.
Double solid black lines are electron or hole quasiparticle propagators. Double blue dashed lines correspond to RPA-screened Coulomb interactions, and single blue dashed lines correspond to the bare Coulomb interaction. A cross means that the two ends connected to the cross are at the same time, corresponding to the calculation of physical observables like the dipole moment. Orange wavy lines correspond to the external driving field. a Linear response of TD-aGW with (top) and without (bottom) the coupling to the external field. Note that the linear response of TD-aGW is a ladder diagram approximation which is equivalent to GW-BSE. b \({K}_{{n}_{1}{n}_{2}{\bf{k}}{n}_{3}{n}_{4}{{\bf{k}}}^{{\prime} }}^{{\bf{Q}}}\) is the irreducible BSE kernel in the electron-hole pair basis. It is composed of a screened direct term and an unscreened exchange term that correspond exactly to the ladder diagrams in (a). c Example of a nonlinear diagram of TD-aGW contributing to the nonlinear response of \({\delta}{\rho}_{c_{2}v_{1}}\) involving \({K}_{{c}_{1}{c}_{2}{\bf{k}}{c}_{1}{c}_{2}{{\bf{k}}}^{{\prime} }}\), which will never appear in the equilibrium BSE. d Schematic of the physical process in (c): the external driving first causes occupation of the conduction bands (corresponding to the blue highlighted part of the conduction band), and then an electron-hole pair between two conduction bands form and coherently transitions to another electron-hole pair due to renormalization by \({K}_{{c}_{1}{c}_{2}{\bf{k}}{c}_{1}{c}_{2}{{\bf{k}}}^{{\prime} }}\). The polarization of the latter electron-hole pair (where the electron is on c2 and the hole is on v1) is then given by (c).
We note that the equilibrium BSE is a ladder diagram approximation and can be viewed as a self-energy correction to the free electron-hole pair, and therefore only matrix elements in the form of \({K}_{cv{c}^{{\prime} }{v}^{{\prime} }}\), \({K}_{cv{v}^{{\prime} }{c}^{{\prime} }}\), \({K}_{vc{v}^{{\prime} }{c}^{{\prime} }}\) and \({K}_{vc{c}^{{\prime} }{v}^{{\prime} }}\) appear (indexing convention in Fig. 1b), corresponding to interactive renormalization of the resonant and anti-resonant parts of the free electron-hole pair propagator, where c and v run over the conduction subspace and the valence subspace of the system, respectively64,65. Nonlinear TD-aGW, on the other hand, does not have such constraints, and the whole \({K}_{{n}_{1}{n}_{2}{\bf{k}},{n}_{3}{n}_{4}{{\bf{k}}}^{{\prime} }}^{{\bf{Q}}}\) matrix contributes to the time evolution (Fig. 1c). The intuition is that in nonlinear response, the external driving field can first pump electrons to conduction bands and then create electron-hole pairs from the occupied conduction band (Fig. 1d). In non-equilibrium BSE, which can be seen as the incoherent limit of TD-aGW in a pump-probe scheme, these components also appear14.
SVD analysis of electron-hole kernel
The electron-hole kernel matrix has a size of \({N}_{{\rm{b}}}^{4}{N}_{{\bf{k}}}^{2}{N}_{{\bf{Q}}}\), where Nb, Nk and NQ denote the number of bands, the number of k-points, and the number of momenta Q of the electron-hole pairs, respectively. For a typical solid-state semiconductor, Nk is on the order of 103â106 to capture fine features of the exciton wavefunction arising from dispersive bands, while the number of bands Nb is 1â102, for features of interest near the bandedge71,72,73,74. Here, we start by only considering coupling to an external field in the dipole approximation, so NQâ=â1, but more generally, when scattering to finite momenta is allowed NQ is of the same order as Nk. The kernel matrix is therefore numerically demanding to evaluate and also poses computational challenges for downstream tasks.
In the equilibrium BSE, the exciton envelope wavefunction is often localized in a low-energy region of the Brillouin Zone (i.e., a valley) so that the BSE can be solved on a nonuniform patch in k-space73,74,75,76,77,78. However, in TD-aGW (Eq. (2)), the external field can drive intraband transitions for the excited electrons and holes, and their motion may no longer be restricted to a single clearly-defined region of k-space, determined by the exciton eigenstate (Fig. 1d). The occurrence of exciton patches nonetheless hints at a low-rank structure of the BSE kernel (SI §2), which is explored in this section.
We start by focusing on the photo-excited electron-hole kernel \({K}_{0}:= {K}_{{n}_{1}{n}_{2}{\bf{k}},{n}_{3}{n}_{4}{{\bf{k}}}^{{\prime} }}^{{\bf{Q}} = 0}\in {{\mathbb{C}}}^{{N}_{{\bf{k}}}{N}_{{\rm{b}}}^{2}\times {N}_{{\bf{k}}}{N}_{{\rm{b}}}^{2}}\) of monolayer MoS2, a prototypical 2D material known for hosting strong exciton effects72 (See Methods for more details). K0 can be separated into contributions from transitions between different bands, which we refer to as channels \({K}_{0}^{{\rm{ch}}}\in {{\mathbb{C}}}^{{N}_{{\bf{k}}}\times {N}_{{\bf{k}}}}\), where the channel, ch, is defined as (n3, n4) â (n1, n2). A detailed analysis of individual channels (see SI §2) reveals that off-diagonal channels \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\), involving transitions between different sets of bands (i.e., n1, n2ââ ân3, n4), are low-rank. However, diagonal channels \({K}_{0}^{{\rm{diag}}\,\text{-}\,{\rm{ch}}}\) between the same set of bands (i.e., n1, n2â=ân3, n4) are close to full-rank because of large diagonal (\({\bf{k}}={{\bf{k}}}^{{\prime} }\)) elements. As a result, the entire K0 has a full-rank structure, making it difficult to achieve significant compression when applying a naive SVD directly to K0.
To address this, we further decompose K0 into parts that are diagonal in k, \({K}_{0}^{{\rm{diag}}}\) (i.e., \({n}_{1},{n}_{2},{\bf{k}}={n}_{3},{n}_{4},{{\bf{k}}}^{{\prime} }\)), and off-diagonal in k, \({K}_{0}^{{\rm{off}}}\) (i.e., \({n}_{1},{n}_{2},{\bf{k}}\ne {n}_{3},{n}_{4},{{\bf{k}}}^{{\prime} }\)). \({K}_{0}^{{\rm{diag}}}\) contributes to most of the non-zero singular values but only accounts for a small portion \(1/{N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}}\) of the entire K0. The primary computational challenge lies in \({K}_{0}^{{\rm{off}}}\), which is low rank and can be effectively compressed by SVD. The final low-rank approximation is achieved by performing SVD on \({K}_{0}^{{\rm{off}}}\) and retaining only the first z largest singular values so that the kernel is approximated as
where \({\tilde{M}}_{0}\in {{\mathbb{C}}}^{z\times z}\), \({\tilde{U}}_{0}\in {{\mathbb{C}}}^{{N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}}\times z}\) and \({\tilde{V}}_{0}^{T}\in {{\mathbb{C}}}^{z\times {N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}}}\) are the truncated singular value and singular vector matrices for \({K}_{0}^{{\rm{off}}}\). This low-rank approximation can achieve a compression rate of \(z/{N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}}\), where z is numerically determined from the computational convergence of K0 reconstructed from \({\tilde{K}}_{0}\). The sorted singular values of both the entire K0 and \({K}_{0}^{{\rm{off}}}\) are shown in Fig. 2a. After an initial rapid decay, the singular values of K0 converge to a non-zero value, preventing accuracy-preserving truncation. The singular values of the \({K}_{0}^{{\rm{off}}}\), on the other hand, rapidly decay to zero, thus supporting our assumption that (6) is an efficient low-rank approximation of K0.
a Decay of sorted singular values for the electron-hole kernel K0 of monolayer MoS2. The red dots represent the singular values of the full K0, and the blue dots represent the singular values of the off-diagonal part \({K}_{0}^{{\rm{off}}}\). b Zoomed in logscale decay of the sorted singular values for each channel \({K}_{0}^{{\rm{ch}}}\). The red dots represent the diagonal channels \({K}_{0}^{{\rm{diag}}\,\text{-}\,{\rm{ch}}}\), and the blue dots represent the off-diagonal channels \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\). câe Heat maps of the minimum ratio of singular values required to reconstruct each channel with an accuracy of R2â=â0.98 for K0, the direct interaction \({K}_{0}^{{\rm{d}}}\), and the exchange interaction \({K}_{0}^{{\rm{x}}}\). The mean truncation ratios for the SVD of the \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\) are 12%, 14%, and 2% respectively. f The minimum number of z (preserved singular values) for reconstructing K0 starting from different k-grids of 12âÃâ12âÃâ1, 24âÃâ24âÃâ1, 36âÃâ36âÃâ1, and 48âÃâ48âÃâ1. The reconstruction accuracy ranges from R2â=â0.80 to 0.98, represented by different colors. The blue labels 31%, 15%, 12%, and 8% denote compression rate with reconstruction accuracy of R2â=â0.98 for different k-grids.
Next, to quantitatively analyze the rank of each individual channel in K0, we conduct channel-wise SVD on each \({K}_{0}^{{\rm{ch}}}\) over k and \({{\bf{k}}}^{{\prime} }\), then preserve the \({z}^{{\prime} }\) largest singular values:
where \({\tilde{M}}_{0}^{{\rm{ch}}}\in {{\mathbb{C}}}^{{z}^{{\prime} }\times {z}^{{\prime} }}\), \({\tilde{U}}_{0}^{{\rm{ch}}}\in {{\mathbb{C}}}^{{N}_{{\bf{k}}}\times {z}^{{\prime} }}\) and \({({\tilde{V}}_{0}^{{\rm{ch}}})}^{T}\in {{\mathbb{C}}}^{{z}^{{\prime} }\times {N}_{{\bf{k}}}}\) are the truncated singular value and singular vector matrices for each \({K}_{0}^{{\rm{ch}}}\). Figure 2b shows the decay of the singular values for each \({K}_{0}^{{\rm{ch}}}\). As previously noted, the singular values of \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\) decay much faster than the \({K}_{0}^{{\rm{diag}}\,\text{-}\,{\rm{ch}}}\). Figure 2c shows the minimum ratio of singular values \({z}^{{\prime} }/{N}_{k}\) required to reconstruct each \({K}_{0}^{{\rm{ch}}}\) with a high accuracy of R2â=â0.98 (\({R}^{2}=1-{\Sigma }_{i}{({y}_{i}-{\hat{y}}_{i})}^{2}/{\Sigma }_{i}{({y}_{i}-{\bar{y}}_{i})}^{2}\), where yi and \({\hat{y}}_{i}\) are real and predicted values). The \({K}_{0}^{{\rm{diag}}\,\text{-}\,{\rm{ch}}}\) requires a high SVD truncation ratio of 95% for reconstruction, while the mean truncation ratio for the \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\) is only 12%. Figure 2d, e respectively show the channel-wise truncation rate of the direct \({K}_{0}^{{\rm{d}}}\) and exchange \({K}_{0}^{{\rm{x}}}\) interaction only. Notably, the exchange interaction is highly compressible, requiring a truncation ratio of only 2%. The direct term alone is less compressible with a mean truncation ratio of 14% for off-diagonal channels.
Eqs. (6) and (7) can both be utilized to compress the BSE kernel and accelerate TD-aGW simulations. As the number of bands increases, the SVD compression scheme described in Eq. (6) poses challenges for both calculations, which scale as \({\mathcal{O}}({({N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}})}^{3})\), and memory. Alternatively, the time complexity of channel-wise SVD (Eq. (7)) is \({\mathcal{O}}({N}_{{\rm{b}}}^{4}{({N}_{{\bf{k}}})}^{3})\), and a high overall SVD compression rate can still be achieved as non-compressible diagonal channels only account for \(1/{N}_{{\rm{b}}}^{2}\) of K0. Moreover, channel-wise SVD is inherently well-suited for parallel implementation and scalability.
Lastly, we investigate the SVD compression for different sizes of the k-grid. Figure 2f shows the average minimum number of z required to reconstruct \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\) for varying k-grid sizes. Notably, for a given accuracy requirement, the number of z tends to converge as the size of the k-grid increases, indicating that additional k-points do not proportionally contribute new information to the K0. This key observation ensures computational feasibility even when an extremely dense k-grid is required for BSE and TD-aGW, where k-grid convergence is a major bottleneck. Moreover, since z remains independent of Nk, the compression ratio \(z/{N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}}\) will decrease with increasing Nk, significantly accelerating calculations with large k-grids. We explain this by the fact that excitonic binding has a limited interaction range (SI §5).
SVD acceleration of TD-aGW
Next, we apply the SVD-compressed kernel to accelerate TD-aGW calculations. In Eq. (2), the time-dependent Hamiltonian H(t)â=âHQPâââeE(t) â râ+âδΣ(t) is updated at each time step. The major computational bottleneck lies in evaluating the change in the COHSEX self-energy δΣ(t) (Eq. (3)), which has a complexity of \({\mathcal{O}}({N}_{{\rm{b}}}^{4}{N}_{{\bf{k}}}^{2})\) per time step. Further analysis of the time complexity is given in SI §1. We can reduce this complexity by taking advantage of the low-rank structure of \({K}_{0}^{{\rm{off}}}\). We plug the SVD truncated kernel (Eq. (6)) into the TD-aGW Hamiltonian:
Here, the diagonal part of Hamiltonian \({H}^{{\rm{diag}}}(t)={H}^{{\rm{QP}}}+{K}_{0}^{{\rm{diag}}}\delta \rho (t)-e{\bf{E}}(t)\cdot {\bf{r}}\) is time-dependent but can be evaluated in a negligible \({\mathcal{O}}({N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}})\). \({\tilde{S}}_{0}={\tilde{U}}_{0}{\tilde{M}}_{0}\) is time-independent, so the acceleration is achieved by first projecting the density matrix Ï(t) onto the low-rank subspace spanned by the truncated singular vectors \({\tilde{V}}_{0}^{T}\), then mapping it back to the full Hilbert space via \({\tilde{S}}_{0}\). The time complexity of this term is reduced to \({\mathcal{O}}({N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}}\times 2z)\), accelerating TD-aGW by a factor of \({N}_{{\rm{b}}}^{2}{N}_{{\bf{k}}}/2z\) compared to the full kernel case. We note that with a given accuracy, z is almost independent of Nk in the large Nk limit, and therefore the SVD-compression approach is particularly powerful with a dense k-grid, which is needed for convergence of the BSE kernel72.
As an initial benchmark, we validate the SVD-accelerated approach by comparing the linear optical absorption spectrum obtained from TD-aGW using full K0 to \({\tilde{K}}_{0}\) with different SVD truncation ratios. We conduct TD-aGW simulations for 150âfs with a time step of 0.125âfs, and a narrow Gaussian pulse of width 0.125âfs centered at 0.125âfs is applied to excite the system. The absorption spectrum A(Ï) is obtained by a Fourier transformation of the time-dependent polarization P(t)8. Figure 3a shows the linear absorption spectrum of monolayer MoS2 calculated with the full K0 and SVD subspace truncated to include 20%, 5% and 1% of singular values. Notably, the absorption spectrum with a 5% SVD subspace (resulting in a 10Ã speedup) can accurately reproduce the result from the full calculation with less than 0.04âeV error in the position of the first exciton peak. When the SVD truncation threshold is decreased to 1% of singular values, there is a significant blue shift of the absorption spectrum, which arises from the information loss in the direct Coulomb interaction in the kernel.
a The optical absorption spectrum of monolayer MoS2 in arbitrary units (a.u.) is calculated in TD-aGW linear response after excitation by a narrow Gaussian electric pulse. The TD-aGW simulations are conducted with the full K0 (dark) and the \({\tilde{K}}_{0}\) that includes 20% (red), 5% (orange), and 1% (blue) of singular values. The (b) second- and (c) third-order response intensity of monolayer MoS2 to incident electric field frequencies from 0.4 to 3.2âeV using full K0 and the \({\tilde{K}}_{0}\). The external field is a sine wave with a field amplitude of 2.36âÃâ107âV/cm2.
The SVD-accelerated TD-aGW method is robust to different setups and parameters of the external driving field due to the preservation of the main physical interaction. To demonstrate this, we perform a parameter sweeping of the incident field frequency Ï. Figure 3bâc shows the second harmonic generation (SHG) and the third harmonic generation (THG) intensity of monolayer MoS2 corresponding to plane-wave incident light with frequencies ranging from 0.4 to 3.2âeV with an interval of 0.04âeV. As we expect, the SVD-accelerated TD-aGW with 5% truncation is still enough to capture both SHG and THG spectrum patterns, while the 1% truncation kernel results in an obvious deviation from the ground truth, in line with the linear response results.
Next, to evaluate the performance of SVD-accelerated TD-aGW in the nonperturbative strong-field regime, we conduct simulations with the external field used in previous nonlinear optical experiments3 and calculations7: \({\bf{E(t)}}={{\bf{E}}}_{{\bf{0}}}\cdot {\sin }^{2}(\pi t/{T}_{{\rm{pulse}}})\cdot \sin (\omega t)\), where the electric field amplitude E0 is 2.36âÃâ107âV/cm2, Tpulse is 100âfs, and Ï is the incident pulse frequency. The simulation length is 100âfs with a time step of dtâ=â0.1âfs. Figure 4aâc shows the time-dependent occupation âkÏnnk(t)/Nk of the first conduction band c1 in monolayer MoS2 with excitation field energies in the gap (0.35âeV), resonant with the lowest bright exciton (1.80âeV), and above the continuum (2.90 eV). For the in-gap excitation (Fig. 4a), the maximum excited-state population fills less than 1.5% of the conduction band, while the occupation increases to 25% for excitation above the continuum (Fig. 4c). We note that the external driving field in this scenario is strong, and proliferation of carriers may lead to considerable non-equilibrium renormalization of screening19. Figure 4 serves primarily as a proof of principle that SVD compression is still numerically reliable when the population change in the density matrix is large, and we leave possible dynamic updates to the screening in the non-equilibrium self-energy in TD-aGW to future studies. For below-gap excitations, a SVD truncation ratio of 1% (50à speedup) is sufficient to accurately capture both the occupation and harmonic yield. When the SVD truncation ratio is lowered to 0.1%, a substantial deviation from the ground truth is observed, indicating that the \({K}_{0}^{{\rm{off}}}\) is not negligible even if it is much smaller than the \({K}_{0}^{{\rm{diag}}}\). However, as the excitation energy increases (Fig. 4b, c), the numerical deviation between the 0.1% SVD-accelerated TD-aGW from the ground truth becomes smaller. A possible explanation is that \({K}_{0}^{{\rm{off}}}\) is more important for in-gap excitations, where the small energy of the incident photon can drive transitions between different valence bands or conduction bands, whereas the larger driving fields are resonant with specific valence to conduction band transitions.
TD-aGW simulations in the strong-field regime with the full K0 (black) vs. SVD subspace with different singular value truncation ratios of 5% (red), 1% (orange), and 0.1% (blue). Time-dependent electronic occupation of the first conduction band c1 in MoS2 with driving field frequencies of (a) 0.35, (b) 1.80, and (c) 2.90âeV. The occupation percentage is computed as \(\frac{1}{{N}_{{\bf{k}}}}{\sum }_{{\bf{k}}}{\rho }_{{c}_{1},{c}_{1},{\bf{k}}}(t)\), where Nk is the number of k points. dâf The high-harmonic generation (HHG) spectrum for the same driving field energies as (aâc), respectively.
Importantly, compared to previous data-driven methods and operator learning approaches for solving the KBE43,44, our SVD-accelerated TD-aGW avoids the time-accumulated errors that quickly cause the prediction to diverge from the ground truth. This arises from two key factors: i) the SVD-accelerated TD-aGW leverages the linear low-rank structure of the kernel matrix to retain most of the physical couplings for non-equilibrium dynamics; and ii) our SVD compression is independent of the external driving field, whereas supervised learning methods may struggle to capture dynamic behaviors driven by different external fields that deviate significantly from the training set.
To demonstrate the effectiveness of SVD-accelerated TD-aGW in the frequency domain, we calculate the high-harmonic generation (HHG) spectrum by \(\,\text{HHG}\,(\omega )={\left\vert \omega \int{\bf{J}}(t){{\rm{e}}}^{-{\rm{i}}\omega t}{\rm{d}}t\right\vert }^{2}\)7, where the time-dependent current is calculated by \({\bf{J}}(t)={\rm{Tr}}(\rho (t){\bf{v}})\) and v is the velocity operator. Figure 4d shows the logarithmic magnitude of the HHG spectrum with an incident pulse frequency of Ïâ=â0.35âeV. The HHG calculation using 5% truncated kernel agrees well with the ground truth. The 0.1% truncation can only reproduce the intensity of the 1st-order response, but the relative strength of the high harmonics are still faithfully reproduced. As shown in Fig. 4e, f, when the incident photon frequencies are increased to 1.8 and 2.9âeV, even the low truncation ratio of 0.1% can still agree well with the ground truth for the high-harmonic generation due to the decreased contribution from the \({K}_{0}^{{\rm{off}}}\) in the high-frequency region of the incident pulse. We note, however, that while the above gap pump is an informative numerical test, it does not reflect physically realistic conditions, as the pump intensity required to produce high harmonics would burn the sample. For the parameter sweeping of the angle-dependent HHG spectrum see SI §3.
SVD compression for GW-BSE within TDA
Up to now, we have used monolayer MoS2 as a testbed for SVD compression. Here, we explore material dependence and demonstrate that the optical absorption can be reliably calculated with \({\tilde{K}}_{0}\) across extended crystalline materials. In Fig. 5a, b, we apply channel-wise SVD to the GW-BSE kernel in the Tamm-Dancoff approximation (TDA) for monolayer MoS2 and black phosphorus, which are both 2D semiconductors with exciton binding energies on the order of several hundred meVs and exciton wavefunctions confined to low-energy valleys in k-space75,76,79,80. Calculation details are given in Methods. As Fig. 5a, b illustrates, electron-hole kernel matrices of MoS2 and black phosphorus exhibit similar low-rank structure, with an average 5% SVD compression of \({K}_{0}^{{\rm{off}}}\) being sufficient to accurately reproduce the linear absorption spectra.
The absorption spectrum calculated through the equilibrium GW-BSE formalism within the Tamm-Dancoff approximation of (a) monolayer MoS2, (b) monolayer black phosphorus, and (c) a Benzene molecule. In (a, b) SVD compression is conducted over the momentum space \(| {\bf{k}}\left.\right\rangle\) of \({K}_{0}^{{\rm{ch}}}\) for crystal systems. The BSE is calculated with the full K0(black), \({K}_{0}^{{\rm{diag}}\,\text{-}\,{\rm{ch}}}\)+5%\({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\)(red), and \({K}_{0}^{{\rm{diag}}\,\text{-}\,{\rm{ch}}}\) only (blue). c SVD compression is conducted over the band-to-band transition channels \(| {n}_{1},{n}_{2}\left.\right\rangle\) of the molecule system due to lack of periodicity. The BSE is calculated with the full K0(black), \({K}_{0}^{{\rm{diag}}\,\text{-}\,{\rm{ch}}}\)+50% \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\)(red), +20% \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\)(orange), +5% \({K}_{0}^{{\rm{off}}\,\text{-}\,{\rm{ch}}}\)(blue).
We have studied electron-hole interactions in extended systems, where we perform a channel-wise SVD compression over k-points. Lastly, we consider a benzene molecule in the gas phase, which lacks periodicity but converges slowly with respect to the number of bands81. Here, without the crystal momentum k degree-of-freedom, the SVD can be only applied to the transition channels \(| {n}_{1},{n}_{2}\left.\right\rangle\). However, even with a modest compression rate of 50%, compression over bands introduces unphysical peak splitting, highlighting that while SVD compression works well over k-points, it is fundamentally unsuitable for molecular systems that lack periodicity.
Finally, we note that SVD compression can be trivially generalized to a finite momentum BSE kernel (SI §4), suggesting a route for further accelerating dynamcis calculations simulating scattering between finite momentum states.
Discussion
In summary, we reveal a low-rank structure of the electron-hole kernel, which arises due to the localization of excitonic effects in momentum space in crystalline systems. As a consequence of the low-rank structure, we show that the full kernel matrix can be compressed to at least 5% of its original size, in semiconductors with exciton binding energies on the order of a few hundreds of meV. Notably, we show that the size of the subspace of singular values kept after SVD compression, capturing key information about the electron-hole kernel, tends to converge as the k-grid size increases, ensuring computational tractability even for dense k-grids. Leveraging this general low-rank property, we apply it to accelerate non-equilibrium TD-aGW calculations, achieving a speedup of an order of magnitude in calculating the time-dependent density matrix, angle-resolved HHG in the strong-field regime, and optical absorption spectrum in the weak-field limit for monolayer MoS2. Here, we demonstrate SVD acceleration in the context of zero-momentum excitations in TD-aGW, but our formalism can be trivially generalized to finite momentum BSE calculations, where we show that 5% of SVD kernel data can generate electron loss spectroscopy comparable to the ground truth (see SI §4). Our method avoids time-accumulated prediction errors seen in other data-driven and operator learning methods for dynamic calculation of nonequilibrium many-body effects. It also requires no training data and is agnostic to the choice of crystal system and external driving field, regardless of whether the driving field lies above or below the bandgap or in the weak or strong-field regime. Our approach provides a robust, efficient, and broadly applicable framework for accelerating many-body calculations, offering new possibilities for studying nonequilibrium excitonic and electronic dynamics with high accuracy and reduced computational cost.
Methods
DFT and equilibrium G W-BSE calculations
In this work, all DFT calculations are performed using Quantum ESPRESSO69 with the Perdew-Burke-Ernzerhof (PBE) functional82, and all GW-BSE calculations are performed using BerkeleyGW66,67,83. For zero-momentum GW-BSE calculations of MoS2, the planewave cutoff for the DFT kinetic wavefunctions, dielectric matrix used in GW calculations, and kernel matrix are 95, 20, and 5 Ry respectively. 6000 bands are included in constructing the dielectric matrix. The K0 of MoS2 includes 2 conduction and 2 valence bands, which are sampled on a uniform k-grid of 36âÃâ36âÃâ1 over the first Brillouin zone. For black phosphorus, the planewave cutoffs for the DFT kinetic energy, RPA dielectric matrix, and kernel matrix are 95, 20, and 5âRy respectively. 1000 bands are included in constructing the dielectric matrix. The K0 of black phosphorus includes 4 valence and 4 conduction bands, which are sampled on a k-grid of 20âÃâ20âÃâ1. For the benzene molecule, the DFT planewave cutoff energy is 90âRy. 600 bands are included in evaluating the dielectric matrix and kernel matrix with a planewave cutoff energy of 40 and 10âRy respectively.
TD-aGW
The theory of TD-aGW has been outlined in the main text. The implementation we use is the same as that in refs. 8,9. The time-integration is performed with a time step of 0.1âfs using a fourth order Runge-Kutta method.
Long range exchange and the definition of response functions
The Gâ=â0 contribution in Eq. (4), which corresponds to the long-range, macroscopic Coulomb interaction, is often zeroed out in actual simulations so that the E field in Eq. (2) is the total macroscopic electric field (including the macroscopic electric field radiated by the electrons in the material) and not just the external electric field. This means the linear and nonlinear response from Eq. (2) are calculated with respect to the total macroscopic electric field, which is consistent with the way material-specific constitutive relations are defined in electrodynamics in media, where Dâ=âϵ0Eâ+âP(E) and E is the total electric field, with no distinction between the âexternalâ electric field and the electric field from the materialâs response being made. The linear response from Eq. (2) (which is equivalent to BSE; see the discussion in the main text) with the long-range Coulomb interaction zeroed out, therefore, directly gives the dielectric function, while the linear response from Eq. (2) with the long-range Coulomb interaction corresponds to the inverse of the dielectric function. This fact is also discussed in SI § 4.
Data availability
All the raw data for the MoS2 BSE kernel from BerkeleyGW, the TD-aGW generated data, and post-processed data used for plotting are stored on the Materials Data Facility under https://doi.org/10.18126/1f89-fc48.
References
Leone, S. R. et al. What will it take to observe processes in âreal timeâ? Nat. Photonics 8, 162â166 (2014).
Ramasesha, K., Leone, S. R. & Neumark, D. M. Real-time probing of electron dynamics using attosecond time-resolved spectroscopy. Annu. Rev. Phys. Chem. 67, 41â63 (2016).
Liu, H. et al. High-harmonic generation from an atomically thin semiconductor. Nat. Phys. 13, 262â265 (2017).
Ulbricht, R., Hendry, E., Shan, J., Heinz, T. F. & Bonn, M. Carrier dynamics in semiconductors studied with time-resolved terahertz spectroscopy. Rev. Mod. Phys. 83, 543â586 (2011).
Boschini, F., Zonno, M. & Damascelli, A. Time-resolved ARPES studies of quantum materials. Rev. Mod. Phys. 96, 015003 (2024).
Cavalleri, A. Photo-induced superconductivity. Contemp. Phys. 59, 31â46 (2018).
Chang Lee, V., Yue, L., Gaarde, M. B., Chan, Y.-h & Qiu, D. Y. Many-body enhancement of high-harmonic generation in monolayer MoS2. Nat. Commun. 15, 6228 (2024).
Chan, Y.-H., Qiu, D. Y., da Jornada, F. H. & Louie, S. G. Giant exciton-enhanced shift currents and direct current conduction with subbandgap photo excitations produced by many-electron interactions. Proc. Natl. Acad. Sci. USA 118, e1906938118 (2021).
Chan, Y.-H., Qiu, D. Y., da Jornada, F. H. & Louie, S. G. Giant self-driven exciton-floquet signatures in time-resolved photoemission spectroscopy of MoS2 from time-dependent GW approach. Proc. Natl. Acad. Sci. USA 120, e2301957120 (2023).
Man, M. K. et al. Experimental measurement of the intrinsic excitonic wave function. Sci. Adv. 7, eabg0192 (2021).
Madéo, J. et al. Directly visualizing the momentum-forbidden dark excitons and their dynamics in atomically thin semiconductors. Science 370, 1199â1204 (2020).
Lin, Y. et al. Exciton-driven renormalization of quasiparticle band structure in monolayer MoS2. Phys. Rev. B 106, L081117 (2022).
Attaccalite, C., Grüning, M. & Marini, A. Real-time approach to the optical properties of solids and nanostructures: time-dependent Bethe-Salpeter equation. Phys. Rev. B Condens. Matter Mater. Phys. 84, 245110 (2011).
Perfetto, E., Sangalli, D., Marini, A. & Stefanucci, G. Nonequilibrium Bethe-Salpeter equation for transient photoabsorption spectroscopy. Phys. Rev. B 92, 205304 (2015).
Perfetto, E., Pavlyukh, Y. & Stefanucci, G. Real-time GW: toward an ab initio description of the ultrafast carrier and exciton dynamics in two-dimensional materials. Phys. Rev. Lett. 128, 016801 (2022).
Marques, M. A. & Gross, E. K. Time-dependent density functional theory. Annu. Rev. Phys. Chem. 55, 427â455 (2004).
Reining, L., Olevano, V., Rubio, A. & Onida, G. Excitonic effects in solids described by time-dependent density-functional theory. Phys. Rev. Lett. 88, 066404 (2002).
Romaniello, P. et al. Double excitations in finite systems. J. Chem. Phys. 130, 044108 (2009).
Sangalli, D. Excitons and carriers in transient absorption and time-resolved ARPES spectroscopy: an ab initio approach. Phys. Rev. Mater. 5, 083803 (2021).
Luo, Y., Desai, D., Chang, B. K., Park, J. & Bernardi, M. Data-driven compression of electron-phonon interactions. Phys. Rev. X 14, 021023 (2024).
Lu, J. & Ying, L. Compression of the electron repulsion integral tensor in tensor hypercontraction format with cubic scaling cost. J. Computational Phys. 302, 329â335 (2015).
Shao, M. et al. Low rank approximation in G0W0 calculations. Sci. China Math. 59, 1593â1612 (2016).
Kaye, J., Chen, K. & Parcollet, O. Discrete Lehmann representation of imaginary time Greenâs functions. Phys. Rev. B 105, 235115 (2022).
Dunlap, B. I. Robust and variational fitting. Phys. Chem. Chem. Phys. 2, 2113â2116 (2000).
Pham, B. Q. & Gordon, M. S. Compressing the four-index two-electron repulsion integral matrix using the resolution-of-the-identity approximation combined with the rank factorization approximation. J. Chem. Theory Comput. 15, 2254â2264 (2019).
Del Ben, M. et al. Static subspace approximation for the evaluation of G0W0 quasiparticle energies within a sum-over-bands approach. Phys. Rev. B 99, 125128 (2019).
Baratz, A., Cohen, G. & Refaely-Abramson, S. (2024) Unsupervised learning approach to quantum wave-packet dynamics from coupled temporal-spatial correlations. Physical Review B 110, https://doi.org/10.1103/PhysRevB.110.134304
Zang, J, et al. Machine learning-based compression of quantum many body physics: PCA and autoencoder representation of the vertex function. Machine Learning: Science and Technology 5.4, 045076 (2024).
Shinaoka, H., Otsuki, J., Ohzeki, M. & Yoshimi, K. Compressing Greenâs function using intermediate representation between imaginary-time and real-frequency domains. Phys. Rev. B 96, 035147 (2017).
Zadoks, A., Marrazzo, A. & Marzari, N. Spectral operator representations. npj Comput Mater 10, 278 (2024).
Knøsgaard, N. R. & Thygesen, K. S. Representing individual electronic states for machine learning GW band structures of 2D materials. Nat. Commun. 13, 468 (2022).
Hou, B., Wu, J. & Qiu, D. Y. Unsupervised representation learning of KohnâSham states and consequences for downstream predictions of many-body effects. Nat. Commun. 15, 9481 (2024).
Luchnikov, I. A., Ryzhov, A., Stas, P.-J., Filippov, S. N. & Ouerdane, H. Variational autoencoder reconstruction of complex many-body physics. Entropy 21, 1091 (2019).
Batzner, S. et al. E (3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022).
Thomas, N. et al. Tensor field networks: rotation-and translation-equivariant neural networks for 3D point clouds. arXiv preprint. https://arxiv.org/abs/1802.08219 (2018).
Li, H. et al. Deep-learning density functional perturbation theory. Phys. Rev. Lett. 132, 096401 (2024).
Yang, L. et al. Deep learning-enhanced variational Monte Carlo method for quantum many-body physics. Phys. Rev. Res. 2, 012039 (2020).
MedvidoviÄ, M., Moreno, J.R. Neural-network quantum states for many-body physics. Eur. Phys. J. Plus 139, 631 (2024).
Noé, F., Olsson, S., Köhler, J. & Wu, H. Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning. Science 365, eaaw1147 (2019).
Carrasquilla, J. & Torlai, G. How to use neural networks to investigate quantum many-body physics. PRX Quantum 2, 040201 (2021).
Yin, J. et al. Analyzing and predicting non-equilibrium many-body dynamics via dynamic mode decomposition. J. Comput. Phys. 477, 111909 (2023).
Maliyov, I., Yin, J., Yao, J., Yang, C. & Bernardi, M. Dynamic mode decomposition of nonequilibrium electron-phonon dynamics: accelerating the first-principles real-time Boltzmann equation. npj Comput. Mater. 10, 123 (2024).
Reeves, C. C. et al. Dynamic mode decomposition for extrapolating nonequilibrium Greenâs-function dynamics. Phys. Rev. B 107, 075107 (2023).
Gu, M., Lin, Y., Lee, V. C. & Qiu, D. Y. Probabilistic forecast of nonlinear dynamical systems with uncertainty quantification. Phys. D: Nonlinear Phenom. 457, 133938 (2024).
Lu, L., Jin, P., Pang, G., Zhang, Z. & Karniadakis, G. E. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nat. Mach. Intell. 3, 218â229 (2021).
Li, Z. et al. Fourier neural operator for parametric partial differential equations. arXiv preprint. https://arxiv.org/abs/2010.08895 (2020).
Kovachki, N. et al. Neural operator: learning maps between function spaces with applications to PDEs. J. Mach. Learn. Res. 24, 1â97 (2023).
Li, Z. et al. Neural operator: Graph kernel network for partial differential equations. arXiv preprint. https://arxiv.org/abs/2003.03485 (2020).
Li, Z., Meidani, K. & Farimani, A. B. Transformer for partial differential equationsâ operator learning. arXiv preprint. https://arxiv.org/abs/2205.13671 (2022).
Zhu, Y. et al. Predicting nonequilibrium Greenâs function dynamics and photoemission spectra via nonlinear integral operator learning. Machine Learning: Science and Technology 6.1, 015027 (2025).
Brunton, S. L. & Kutz, J. N. Data-driven Science and Engineering: Machine Learning, Dynamical Systems, and Control (Cambridge University Press, 2022).
Shwartz-Ziv, R. & Tishby, N. Opening the black box of deep neural networks via information. arXiv preprint. https://arxiv.org/abs/1703.00810 (2017).
Kremp, D., Schlanges, M. & Kraeft, W.-D. Quantum Statistics of Nonideal Plasmas, Vol. 25 (Springer Science & Business Media, 2005).
Kadanoff, L. P. Quantum Statistical Mechanics (CRC Press, 2018).
Kadanoff, L., Baym, G. & Mechanics, Q. S. Greenâs function methods in equilibrium and nonequilibrium problems. Front. Phys. (1962).
Hedin, L. New method for calculating the one-particle Green's function with application to the electron-gas problem. Phys. Rev. 139, A796 (1965).
Hedin, L. & Lundqvist, S. Effects of electron-electron and electron-phonon interactions on the one-electron states of solids. in Solid State Physics, Vol. 23, 1â181 (Elsevier, 1970).
Hybertsen, M. S. & Louie, S. G. Electron correlation in semiconductors and insulators: band gaps and quasiparticle energies. Phys. Rev. B 34, 5390 (1986).
Adler, S. L. Quantum theory of the dielectric constant in real solids. Phys. Rev. 126, 413 (1962).
Wiser, N. Dielectric constant with local field effects included. Phys. Rev. 129, 62 (1963).
Aversa, C. & Sipe, J. E. Nonlinear optical susceptibilities of semiconductors: results with a length-gauge analysis. Phys. Rev. B 52, 14636 (1995).
Virk, K. S. & Sipe, J. Semiconductor optics in length gauge: a general numerical approach. Phys. Rev. B Condens. Matter Mater. Phys. 76, 035213 (2007).
Rocca, D., Ping, Y., Gebauer, R. & Galli, G. Solution of the Bethe-Salpeter equation without empty electronic states: application to the absorption spectra of bulk systems. Phys. Rev. BâCondens. Matter Mater. Phys. 85, 045116 (2012).
Rohlfing, M. & Louie, S. G. Electron-hole excitations and optical spectra from first principles. Phys. Rev. B 62, 4927 (2000).
Albrecht, S., Reining, L., Del Sole, R. & Onida, G. Ab initio calculation of excitonic effects in the optical spectra of semiconductors. Phys. Rev. Lett. 80, 4510 (1998).
Deslippe, J. et al. BerkeleyGW: a massively parallel computer package for the calculation of the quasiparticle and optical properties of materials and nanostructures. Comput. Phys. Commun. 183, 1269â1289 (2012).
Rohlfing, M. & Louie, S. G. Electron-hole excitations and optical spectra from first principles. Phys. Rev. B 62, 4927â4944 (2000).
Strinati, G. Application of the Greenâs functions method to the study of the optical properties of semiconductors. La Riv. del. Nuovo Cim. 11, 1â86 (1988).
Giannozzi, P. et al. Quantum espresso: a modular and open-source software project for quantum simulations of materials. J. Phys. Condens. Matter 21, 395502 (2009).
Kohn, W. & Sham, L. J. Self-consistent equations including exchange and correlation effects. Phys. Rev. 140, A1133 (1965).
Kammerlander, D., Botti, S., Marques, M. A. L., Marini, A. & Attaccalite, C. Speeding up the solution of the Bethe-Salpeter equation by a double-grid method and Wannier interpolation. Phys. Rev. B Condens. Matter Mater. Phys. 86, 125203 (2012).
Qiu, D. Y., Da Jornada, F. H. & Louie, S. G. Optical spectrum of MoS2: many-body effects and diversity of exciton states. Phys. Rev. Lett. 111, 216805 (2013).
Hou, B., Wang, D., Barker, B. A. & Qiu, D. Y. Exchange-driven intermixing of bulk and topological surface states by chiral excitons in Bi2Se3. Phys. Rev. Lett. 130, 216402 (2023).
Alvertis, A. M. et al. Importance of nonuniform Brillouin zone sampling for ab initio Bethe-Salpeter equation calculations of exciton binding energies in crystalline solids. Phys. Rev. B 108, 235117 (2023).
Qiu, D. Y., da Jornada, F. H. & Louie, S. G. Screening and many-body effects in two-dimensional crystals: monolayer MoS2. Phys. Rev. B 93, 235435 (2016).
Li, L. et al. Direct observation of the layer-dependent electronic structure in phosphorene. Nat. Nanotechnol. 12, 21â25 (2017).
Tempelaar, R. & Berkelbach, T. C. Many-body simulation of two-dimensional electronic spectroscopy of excitons and trions in monolayer transition metal dichalcogenides. Nat. Commun. 10, 3419 (2019).
Wu, J., Hou, B., Li, W., He, Y. & Qiu, D. Y. Quasiparticle and excitonic properties of monolayer \(1{T}^{{\prime} }\,{{\rm{WTe}}}_{2}\) within many-body perturbation theory. Phys. Rev. B 110, 075133 (2024).
Cao, T., Li, Z., Qiu, D. Y. & Louie, S. G. Gate switchable transport and optical anisotropy in 90 twisted bilayer black phosphorus. Nano Lett. 16, 5542â5546 (2016).
Qiu, D. Y., da Jornada, F. H. & Louie, S. G. Environmental screening effects in 2D materials: renormalization of the bandgap, electronic structure, and optical spectra of few-layer black phosphorus. Nano Lett. 17, 4706â4712 (2017).
Qiu, D. Y., da Jornada, F. H. & Louie, S. G. Solving the Bethe-Salpeter equation on a subspace: approximations and consequences for low-dimensional materials. Phys. Rev. B 103, 045117 (2021).
Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865â3868 (1996).
Hybertsen, M. S. & Louie, S. G. Electron correlation in semiconductors and insulators: Band gaps and quasiparticle energies. Phys. Rev. B 34, 5390â5413 (1986).
Acknowledgements
This work was primarily supported by the U.S. Department of Energy, Office of Science, Basic Energy Sciences under Early Career Award No. DE-SC0021965. Development of the BerkeleyGW code was supported by Center for Computational Study of Excited-State Phenomena in Energy Materials (C2SEPEM) at the Lawrence Berkeley National Laboratory, funded by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, Materials Sciences and Engineering Division, under Contract No. DE-C02-05CH11231. Calculations on benzene and black phosphorus were supported by the National Science Foundation Division of Chemistry under award number CHE-2412412. The calculations used resources of the National Energy Research Scientific Computing (NERSC), a DOE Office of Science User Facility operated under contract no. DE-AC02-05CH11231, under award BES-ERCAP-0031507 and BES-ERCAP-0027380; and the Texas Advanced Computing Center (TACC) at The University of Texas at Austin.
Author information
Authors and Affiliations
Contributions
B.H. and J.W. developed the code and performed calculations on monolayer MoS2. V.C.L. performed a part of the calculations on monolayer MoS2. J.G. and L.Y.L performed calculations on black phosphorous and benzene, respectively. B.H., J.W., and D.Y.Q. conceived and designed the workflow of the project and drafted the manuscript. D.Y.Q. supervised the research. All authors contributed to data analysis and editing the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the articleâs Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Hou, B., Wu, J., Chang Lee, V. et al. Data-driven low-rank approximation for the electron-hole kernel and acceleration of time-dependent GW calculations. npj Comput Mater 11, 204 (2025). https://doi.org/10.1038/s41524-025-01680-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41524-025-01680-9







