Exploring 400 Gbps/λ and beyond with AI-accelerated silicon photonic slow-light technology

Han, Changhao; Yang, Qipeng; Qin, Jun; Zhou, Yan; Zheng, Zhao; Zhang, Yunhao; Wang, Haoren; Sun, Yu; Lu, Junde; Wang, Yimeng; Ge, Zhangfeng; Wu, Yichen; Wang, Lei; He, Zhixue; Yu, Shaohua; Hu, Weiwei; Peng, Chao; Shu, Haowen; Bowers, John E.; Wang, Xingjun

doi:10.1038/s41467-025-61933-5

Article
Open access
Published: 16 July 2025

Changhao Han^1,2 na1,
Qipeng Yang¹ na1,
Jun Qin³ na1,
Yan Zhou⁴ na1 (ORCID: orcid.org/0000-0002-1743-891X),
Zhao Zheng¹ (ORCID: orcid.org/0009-0005-3856-7488),
Yunhao Zhang⁵ (ORCID: orcid.org/0009-0001-5851-750X),
Haoren Wang⁵,
Yu Sun³,
Junde Lu³,
Yimeng Wang¹,
Zhangfeng Ge⁴,
Yichen Wu¹,
Lei Wang⁵,
Zhixue He⁵,
Shaohua Yu^1,5,
Weiwei Hu¹,
Chao Peng^1,5,6 (ORCID: orcid.org/0000-0002-0200-0798),
Haowen Shu^1,6 (ORCID: orcid.org/0000-0002-5429-8661),
John E. Bowers² (ORCID: orcid.org/0000-0003-4270-8296) &
Xingjun Wang^1,4,5,6 (ORCID: orcid.org/0000-0001-8206-2544)

Nature Communications volume 16, Article number: 6547 (2025)

Abstract

Silicon photonics is a promising platform for the extensive deployment of optical interconnections, offering low-cost, large-scale production at the wafer level. However, the intrinsic efficiency-bandwidth trade-off and nonlinear distortions of pure silicon modulators limit the achievable transmission rate, which raises concerns about the prospects of silicon photonics in ultrahigh-speed scenarios. Here, we propose an artificial intelligence (AI)-accelerated silicon photonic slow-light technology to explore transmission at 400 Gbps/λ and beyond. By utilizing an artificial neural network, we achieve a data capacity of 3.2 Tbps based on an 8-channel wavelength-division-multiplexed silicon slow-light modulator chip with a thermal-insensitive structure, leading to an on-chip data-rate density of 1.6 Tb/s/mm². The demonstration of single-lane 400 Gbps PAM-4 transmission reveals the great potential of standard silicon photonic platforms for next-generation optical interfaces. Our approach significantly increases the transmission rate of silicon photonics and is expected to form a self-optimizing positive feedback loop with computing centers through AI technology.

Introduction

With the rapid growth of global data volume in recent years, high-speed optical interconnection is seen as a promising approach to revolutionize high-performance computing centers^1,2,3. Among the reported optoelectronic integration technologies, silicon photonics is an important platform with high complementary metal-oxide-semiconductor (CMOS) compatibility, which makes low-cost, large-scale production at the wafer level feasible^4,5,6. However, as the core electro-optical (EO) conversion devices, pure silicon modulators rely on a limited plasma dispersion effect⁷ and therefore typically operate at low baud rates^8,9. High-order modulation formats can make the most of the limited bandwidth of pure silicon modulators¹⁰. For 1.6 TbE optical interconnection, intensity modulation/direct detection (IM/DD) represented by four-level pulse amplitude modulation (PAM-4) is standardized in IEEE Standard 802.3dj¹¹. Compared to the complex coherent scheme, the IM/DD system, in which the signal is modulated onto the intensity of light, has been chosen as the definite technology route for short-reach optical connections in computing centers because of its ease of implementation¹². So far, however, the bandwidth-efficiency trade-off and nonlinear distortions have kept the growth of transmission rates based on silicon modulators slow. Although data rates have continued to increase as various solutions have been proposed^13,14, a gap remains between the reported rates and single-lane 400 Gbps. This transmission bottleneck has raised concerns about the viability of pure silicon modulators for ultrahigh-speed scenarios, especially when compared to heterogeneous integration routes represented by thin-film lithium niobate^{15,16,17,18,19}, plasmonic materials^20,21 and organic polymers^22,23.
For pure silicon modulators, it is challenging to increase the transmission rate significantly while simultaneously achieving a high integration density and a wide optical passband¹³. Silicon Mach-Zehnder modulators (Si-MZMs) offer a large operating wavelength range, but their excessively long modulation arms reduce integration density^{24,25,26,27,28,29,30,31,32}; silicon micro-ring modulators (Si-MRMs) achieve a compact footprint, but their narrow passbands and low thermal stability require additional feedback mechanisms^{33,34,35,36,37,38,39,40}. These issues restrict the deployment of silicon modulators and raise concerns about the future development path of silicon photonics⁴¹. To overcome these limitations, silicon slow-light modulators (Si-SLMs) have been proposed as a promising approach on silicon-on-insulator (SOI) platforms^{42,43,44,45,46}. Recent theoretical design and device-level research on Si-SLMs has led to a leap in bandwidth performance⁴⁷. The potential of high-bandwidth Si-SLM schemes therefore inspires further exploration of the speed limit of silicon photonics using high-order formats. In addition, developing compact Si-SLM systems to enhance the on-chip data-rate density of silicon platforms can reduce the wafer area budget for a given data capacity. Accordingly, taking full advantage of Si-SLMs and adopting high-order formats to construct a high-capacity transmission system is needed for 1.6 TbE, and is expected to provide key solutions for next-generation 3.2 TbE optical interfaces.

In practical application scenarios, equalization is necessary, especially for high-order modulation⁴⁸. Most transmission works adopt linear equalizers such as feed-forward equalization (FFE) and decision-feedback equalization (DFE), which effectively mitigate linear distortions in the system^49,50. However, these conventional methods struggle to compensate nonlinear distortions, including nonlinear modulation and chirp characteristics, which are particularly important in pure silicon platforms. Meanwhile, an increasing number of works utilize artificial intelligence (AI) technology to accelerate scientific discovery⁵¹. In optoelectronics, silicon photonics has promoted the deployment of optical computing for AI applications^52,53 (SiPh for AI), while AI technology can also be applied to realize the potential of silicon photonics^54,55 (AI for SiPh). Specifically, for signal processing applications, artificial neural network (ANN) equalizers have been proposed to construct a complex map with nonlinear boundaries between the input and output spaces^56,57. Compared to conventional equalizers, an ANN models equalization as a multi-level problem to mitigate nonlinear distortions and can provide effective compensation⁵⁸. More importantly, ANN equalizers are particularly favorable for optical transmission based on Si-SLMs. Although the bandwidth-efficiency trade-off of pure silicon modulation can be optimized effectively by introducing slow-light resonators, the physical principle of Si-SLMs is still the nonlinear plasma dispersion effect, and the nonlinear cosine transfer function of the device architecture also introduces signal distortions. Therefore, by using an ANN equalizer, the nonlinear distortions of Si-SLMs can be reduced.
At the manufacturing level, achieving complete uniformity in the doping process remains challenging, while the depletion-mode PN junction is the modulation basis for Si-SLMs^9,13. In addition, although the one-dimensional waveguide grating structure has improved process stability compared to two-dimensional photonic crystals^42,45, structural fluctuations of doped silicon gratings still exist due to fabrication limitations. Therefore, Si-SLMs require an adaptive technical approach at the back end to eliminate the variations caused by process deviations, thus making the deployment of large-scale, high-density Si-SLM systems possible.

In this work, we propose an AI-accelerated silicon photonic slow-light technology to explore 400 Gbps per wavelength transmission. Using a standard silicon photonic process, we fabricate an 8-channel wavelength-division-multiplexed (WDM) Si-SLM chip on an SOI platform. Benefiting from the innovative slow-light design, the intrinsic bandwidth-efficiency trade-off for pure silicon modulators is effectively mitigated. Under precise optimization, our compact Si-SLMs exhibit an ultrahigh EO bandwidth of 90 GHz and a remarkable modulation efficiency of 0.82 V·cm, while simultaneously possessing a wide optical passband of 7 nm around 1550 nm. By utilizing the ANN equalizer, we demonstrate a total data capacity of 3.2 Tbps based on our thermal-insensitive Si-SLM chip, with all bit error rates (BERs) below the hard-decision forward error correction (HD-FEC) threshold, leading to an on-chip data-rate density of 1.6 Tb/s/mm². Meanwhile, the transmission link requires neither individual resonant wavelength adjustment nor additional thermo-electric cooler (TEC) platforms, thus reducing the system budget. Notably, adopting the fundamental industry-standard IM/DD format PAM-4, we realize an unprecedented 400 Gbps optical transmission per wavelength, which is, to the best of our knowledge, the highest single-lane transmission rate achieved in standard silicon photonic platforms. Our work reveals the great potential of standard silicon photonic platforms in 3.2 TbE optical interconnections, validating the significant value of AI technology for silicon photonics.

Results

Device and chip design

The overall application architecture of the AI-accelerated slow-light technology is conceptually demonstrated in Fig. 1. Here, the AI-accelerated slow-light transceiver module provides an ultrahigh-speed solution for interconnections in computing centers (applications in photonics include neural networks, inverse design, dynamic simulation, etc.). Through collaborative integration with the ANN, the Si-SLM system can achieve a leap in throughput per lane and elevate the interface rate of computing centers, thereby leading to faster training iterations for weight programming of the ANN and enabling more immediate and efficient signal processing for the Si-SLM system. In turn, the enhanced transceiver module can further increase the interconnection rate for computing centers to achieve greater computing power. This bi-directional promotion mechanism forms a self-optimizing positive feedback loop between AI-accelerated Si-SLM systems and computing centers, continuously improving the overall architecture performance.

**Fig. 1: Artificial intelligence (AI)-accelerated silicon photonic slow-light technology.**

Based on the above design concept, we designed an ultrahigh-speed WDM Si-SLM chip for computing centers. The slow-light effect is an effective approach to enhance the interaction between light and matter, which has been confirmed in both theory and experiment^{59,60,61,62,63,64}. In particular, thanks to the CMOS compatibility of the silicon slow-light structure, Si-SLMs for ultrahigh-speed transmission can be fabricated in a standard silicon photonic process, without introducing complex manufacturing steps or heterogeneous materials, and an 8-channel Si-SLM chip was realized on a 200-mm SOI wafer (Fig. 2a). Benefiting from the compact footprint and dense distribution of Si-SLMs, the actual modulation area (bottom) occupies only 4 mm × 0.5 mm, with a pitch of 500 μm between neighboring devices. Corresponding to the transmitter area, 8-channel GeSi photodetectors (PDs) are also designed and arranged (top) for the complete transceiver. It can be seen that the adoption of Si-SLMs brings the transmitter area close to the receiver area. For the optical ports, the edge coupler array (left) is placed on the same side, and the three groups from bottom to top are modulator inputs, modulator outputs and PD inputs, respectively. Meanwhile, the direct-current (DC) pad array (right) is used to control the modulator operating point through the TiN heater of each device. For the single device, the architecture and morphology of one Si-SLM can be seen in Fig. 2b. Within a compact Mach-Zehnder interferometer (MZI) structure, the modulation arms can be shrunk to only 249 μm due to the slow-light effect, which is an order of magnitude shorter than conventional Si-MZMs, thus enhancing system integration density.
Simultaneously, to enlarge the phase accumulation, and for better photoelectric integration and low chirp, a dual-drive architecture with GSGSG-type radio-frequency (RF) electrodes is adopted here, which enables the modulator to operate in the push-pull driving mode. At the far end of the RF electrodes, on-chip termination resistors are integrated to reduce microwave reflection. Near the waveguide output port, TiN heaters are placed on the waveguides to control the operating point of the modulator (see Supplementary Section 1 for more details). In the modulation arms, the slow-light effect is generated by the coupled-resonator optical waveguide (CROW), which is a one-dimensional waveguide grating structure. The scanning electron microscope (SEM) image of the fabricated slow-light waveguide is also shown below the device photograph in Fig. 2b. Here, a complete resonator is constructed from a λ/4 phase shifter region with broader width in the middle and an equal number of Bragg gratings on both sides with a period of around 300 nm, along the direction of light propagation in the waveguide. Each group of gratings forms one side beam of a resonator, and the two beams are separated by the phase shifter from their adjacent one. Therefore, the supercell is created through the phase shifter region, which leads to a mid-gap mode embedded in the bandgap. In our reconfigurable slow-light model, a finite number of resonators are cascaded to construct the modulation arm, and the structure parameters can be selected flexibly according to specific application scenarios. Based on the grating waveguide configuration, a PN junction with a periodic structure is formed, which works in the depletion mode under reverse bias voltage. In fact, the slow-light approach is a pure silicon solution with considerable design freedom.
For example, if the design goal is to push the bandwidth to the extreme limit, so as to achieve ultra-low-cost OOK transmission, then fewer resonators are required⁴⁷. Here, to achieve a deeper modulation depth and better separation of ultrahigh-speed multi-level signals, we fully redesigned the structures, cascading more resonators to enlarge the phase accumulation and to achieve a balance between bandwidth and efficiency (Supplementary Section 2).
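The footprint quoted above can be cross-checked against the headline figures with a few lines of arithmetic. The sketch below only uses numbers stated in the text (8 channels, 400 Gbps per lane, a 4 mm × 0.5 mm modulation area); the variable names are illustrative, and the symbol rate assumes the 400 Gbps figure is the PAM-4 line rate.

```python
import math

# Figures quoted in the text
channels = 8
rate_per_lane_gbps = 400            # single-lane PAM-4 rate
area_mm2 = 4.0 * 0.5                # modulation area, 4 mm x 0.5 mm

# Aggregate capacity and on-chip data-rate density
total_gbps = channels * rate_per_lane_gbps
density_tbps_per_mm2 = total_gbps / 1000 / area_mm2

# PAM-4 carries log2(4) = 2 bits per symbol, so the symbol rate is
# half the bit rate (assuming 400 Gbps is the line rate).
bits_per_symbol = math.log2(4)
symbol_rate_gbaud = rate_per_lane_gbps / bits_per_symbol

print(total_gbps, density_tbps_per_mm2, symbol_rate_gbaud)
```

Under these assumptions the aggregate rate is 3.2 Tbps over 2 mm², consistent with the quoted density of 1.6 Tb/s/mm².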

**Fig. 2: Design and characterization of the Si-SLM chip.**

Performance characterization

To prove the feasibility of the proposed Si-SLMs for high-order signal transmission, we comprehensively characterized the device performance, including both dynamic and static parameters. First, we evaluated the high-frequency EO response of the designed device, which is a prerequisite for achieving ultrahigh-speed transmission. For Si-SLMs, an ultrahigh EO bandwidth is the most prominent advantage, dictating the achievable data rate. In particular, although the ANN equalizer adopted later will improve signal quality, the first thing to ensure is that the signal can complete the entire EO conversion and transmission process with minimal RF loss. Here, to meet the demand of high-order modulation, we control the structure parameters to maximize the EO bandwidth while ensuring a balance between bandwidth and efficiency. We experimentally characterized the small-signal properties of the Si-SLMs through S-parameter measurements to obtain the EO bandwidth, including S₂₁ transmission and S₁₁ reflection responses. With this careful regulation strategy, the modulator has an EO bandwidth of 90 GHz under a bias voltage of 3 V, as shown in Fig. 2c, which is a fairly high bandwidth value for pure silicon modulators. The device bandwidth increases with the bias voltage and already exceeds 70 GHz at only 1 V; the low bias voltage requirement facilitates co-packaging with integrated driver chips. In terms of reflection performance, the S₁₁ response is maintained below −10 dB, indicating that the reflection of the modulator is weak. Simultaneously, with the adoption of the slow-light effect, the modulation efficiency can be improved by enhancing the interaction between light and matter. Here, we apply a low-frequency small signal to the device around the quadrature bias point and obtain the modulation efficiency by fitting the phase variation at multiple wavelengths.
Figure 2d shows the modulation efficiency measured at different wavelengths. The modulation efficiency increases slightly closer to the edge of the passband, mainly due to the relatively higher group index in the band-edge region. In the passband around 1550 nm, an averaged modulation efficiency of 0.82 V·cm is obtained, which is also a remarkable value for pure silicon modulation based on the plasma dispersion effect. Therefore, by adopting the slow-light approach on SOI platforms, we mitigate the intrinsic bandwidth-efficiency trade-off of silicon and improve both the EO bandwidth and the modulation efficiency of pure silicon modulators simultaneously. For the optical loss of the modulator, the unoptimized coupling loss between a pair of edge couplers and the fibers is 10 dB (5 dB for each coupler), and the insertion loss of the Si-SLMs is measured to be 10.5 dB (9.1 dB from the modulation arms, the remainder from directional couplers and routing waveguides). The optical loss can be further reduced by improving the manufacturing process or introducing a transition structure between the conventional waveguide and the slow-light waveguide. Moreover, by embedding the CROW into the modulation arms, the advantages of the MZI can be leveraged to maintain a sufficient static extinction ratio (ER) while reducing the device footprint. Through the standard thermal tuning mechanism using TiN heaters, the static ER of the compact modulator is measured to be 36 dB, provided by the resonator-assisted MZI architecture. Next, to characterize the linearity, which determines the multi-level separation capability in high-order transmission, we further examined the third-order intermodulation (IMD3) spurious-free dynamic range (SFDR).
For our Si-SLMs, the measured IMD3 SFDR is 91.82 dB·Hz^{2/3}, shown in Fig. 2e, which is similar to conventional silicon modulators, mainly due to the nonlinear limitation of the plasma dispersion effect of silicon, together with the nonlinear cosine transfer function of the MZI architecture. As a pure silicon device, the Si-SLM is still fundamentally based on nonlinear silicon modulation, and mitigating the resulting nonlinear distortions is a challenge, especially for ultrahigh-speed transmission, which is exactly what the ANN model excels at.
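The loss figures quoted above can be combined into a simple fiber-to-fiber budget. The sketch below is only bookkeeping on the stated values (5 dB per edge coupler, 10.5 dB device insertion loss, of which 9.1 dB is from the modulation arms); the helper name `db_to_power_fraction` is illustrative and not from the paper.

```python
# Loss values quoted in the text (all in dB)
coupler_loss_db = 5.0          # per edge coupler, unoptimized
insertion_loss_db = 10.5       # whole Si-SLM
arm_loss_db = 9.1              # contribution of the slow-light modulation arms

# Remainder attributed to directional couplers and routing waveguides
other_loss_db = insertion_loss_db - arm_loss_db

# Total fiber-to-fiber loss: two edge couplers plus the device itself
fiber_to_fiber_db = 2 * coupler_loss_db + insertion_loss_db


def db_to_power_fraction(loss_db):
    """Convert a loss in dB to the fraction of optical power that remains."""
    return 10 ** (-loss_db / 10)


remaining = db_to_power_fraction(fiber_to_fiber_db)
print(f"{other_loss_db:.1f} dB other, {fiber_to_fiber_db:.1f} dB total, "
      f"{100 * remaining:.2f}% of the input power remains")
```

On these numbers the couplers and routing account for about 1.4 dB, and slightly under 1% of the input power reaches the output fiber, which is why the text points to the edge couplers and the slow-light transition as the places to recover loss.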

Afterwards, we experimentally characterized the performance of the modulator array on the Si-SLM chip. For WDM modulators, channel crosstalk is a significant performance indicator of the overall transmission quality. In terms of the layout of the modulator chip, a dense arrangement of devices is necessary to increase the rate density. Here, the pitch of the modulator array is 500 μm (Fig. 2a), resulting in a minimum distance of only 20 μm between modulators. Nevertheless, Si-SLMs are favorable for dense array designs, because their ultra-compact footprint shortens the RF links (to approximately one tenth of those of Si-MZMs), which reduces the crosstalk between parallel signals in the ultrahigh-frequency region. To quantify the crosstalk, we tested the EO response of parallel modulators by applying a small RF signal to one channel (Ch4) and simultaneously receiving the transmitted signal on the other channels (Ch1, Ch2, Ch3, Ch5, Ch6, Ch7 and Ch8). Figure 2f shows the experimental crosstalk result, in which the curve of Ch4 is its own EO bandwidth curve, and the response curves of the other channels show that the measured crosstalk is around −30 dB at 60 GHz and −20 dB at 90 GHz. In the ultra-compact and ultrahigh-density chip layout, the crosstalk of the WDM Si-SLMs remains at a low level and the impact on parallel signal transmission is small due to the ultra-short RF links. Furthermore, we tested the spectral performance of all channels on the same WDM Si-SLM chip. In practice, the optical passband of the modulator reflects its optical bandwidth, and a wide passband is essential for multi-wavelength applications with high thermal robustness.
Theoretically, for the photonic bandgap of the designed structure, the topological mid-gap mode is generated in the bandgap between the antisymmetric and symmetric transverse electric bands^65,66,67,68, and this is the mode adopted for practical communication applications. Thus, the supercell band opens multiple windows, including one large passband around 1550 nm. By cascading more resonant cavities, the passband can be kept wide enough while ensuring sufficient efficiency. Here, under careful regulation, the fabricated Si-SLM possesses a flat passband (mid-gap mode) of 7 nm around 1550 nm, with an out-of-band rejection ratio of 50 dB, shown in Fig. 2g. Meanwhile, the high passband conformity between devices demonstrates that the fabricated Si-SLMs have relatively favorable process consistency in general. However, the spectral results show that a wavelength shift still occurs due to unavoidable fabrication deviations. Although the manufacturing uniformity of Si-SLMs has been improved by introducing a one-dimensional CROW compared to two-dimensional photonic crystals, performance fluctuations between doped silicon grating structures still exist. Therefore, for large-scale Si-SLM systems, AI equalization at the back end is necessary to minimize device performance fluctuations caused by process errors. On the other hand, a slight wavelength shift can extend the available passband to 10 nm on one chip, and the passband resources have not yet been fully utilized. Here, for our designed WDM Si-SLM system, the wavelengths of the 8 channels range from 1548 nm to 1555 nm, with a spacing of 1 nm (Fig. 2g). By increasing the wavelength operating range (e.g. 10 nm), a larger wavelength interval (e.g. 1.25 nm) can be adopted between channels.
It is worth noting that, thanks to the wide passband and high uniformity of Si-SLMs, compared with Si-MRMs, there is no need to design separately to obtain individual operating wavelengths for different channels, nor to find the precise wavelength operating point between different resonance periods. Moreover, the compact Si-SLMs possess high thermal robustness, which can reduce the requirements for additional TEC operating platforms, thus saving the system budget.

In short, optimizing the bandwidth-efficiency trade-off while maintaining a sufficiently wide passband through Si-SLMs is the device-level guarantee for achieving ultrahigh-speed transmission per lane. Based on the designed Si-SLMs, the compact WDM chip with low crosstalk and flat passbands can realize ultra-high integration density and lead to a remarkable total capacity.

DNN transmission

At the system level, based on our carefully designed high-bandwidth Si-SLMs, we build an 8-channel WDM Si-SLM transmission system, which is shown in Fig. 3a. On each channel, the Si-SLM encodes the carrier into the PAM-4 signal format at different symbol rates. At the receiving side, the signal can be partly coupled to an on-chip GeSi PD on the same chip, whereas the remaining part is sent into a commercial PD. The transmitted ultrahigh-speed signals are recorded by a real-time oscilloscope and then processed with AI equalizers. At the algorithm level, leveraging AI technology to improve the transmission capacity, we adopt ANN equalizers to overcome the limited nonlinear compensation of conventional equalizers such as FFE and DFE. Although the bandwidth-efficiency trade-off has been mitigated, the Si-SLMs still introduce nonlinear distortions during signal transmission due to the nonlinear modulation of the pure silicon material and the nonlinear cosine transfer function of the MZI architecture. At the system level, the primary electrical nonlinear sources originate from RF components due to the inherent nonlinear behavior of charge transport at the transistor junction, which is predominantly observed in the transistor driver (DRV) and transimpedance amplifier (TIA). In addition, the use of erbium-doped fiber amplifiers (EDFAs) could introduce nonlinear Kerr impairments to the optical link. Therefore, the resulting nonlinear signal distortions become the bottleneck for improving the signal quality, especially for multi-level formats at ultrahigh speeds. ANN equalizers, which specialize in compensating nonlinear distortions, are therefore well suited to the Si-SLM system.
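To illustrate why a cosine-type transfer function distorts multi-level signals, the following minimal sketch maps four equally spaced PAM-4 drive levels through an idealized MZI biased at quadrature. It is not the authors' device model: it assumes a lossless MZI with normalized output 0.5·(1 + sin Δφ), a phase shift proportional to the drive level, and an arbitrarily chosen peak phase; the plasma-dispersion and RF nonlinearities mentioned above are ignored.

```python
import math


def mzi_transmission(delta_phi):
    """Normalized output power of an ideal MZI biased at quadrature."""
    return 0.5 * (1.0 + math.sin(delta_phi))


def pam4_optical_levels(peak_phase):
    """Optical levels produced by equally spaced drive levels (-3, -1, 1, 3)."""
    drive_levels = [-3, -1, 1, 3]
    return [mzi_transmission(peak_phase * d / 3) for d in drive_levels]


# Hypothetical peak phase swing (90% of pi/2), chosen only for illustration
levels = pam4_optical_levels(peak_phase=0.9 * math.pi / 2)
gaps = [upper - lower for lower, upper in zip(levels, levels[1:])]

print("optical levels:", [round(v, 3) for v in levels])
print("eye openings  :", [round(g, 3) for g in gaps])
```

In this idealized picture the two outer eyes are always smaller than the middle eye, which is the kind of level-dependent compression a linear FFE or DFE cannot undo.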

To minimize computing resources, we first develop a relatively simple ANN model and then extend the network to a deep neural network (DNN) equalizer with only two hidden layers to enhance the system performance. In the DNN, to classify the PAM-4 signal into four categories⁵⁸, an activation function with four saturation regions is implemented through the equation $f(x)=2\eta_2/(1+e^{-\eta_1(x-2\alpha)})-\eta_2+2\alpha$, where α equals −1, 0 and 1 when x ≤ −1, −1 < x ≤ 1 and x > 1, respectively, and $\eta_2=(1+e^{-\eta_1})/(1-e^{-\eta_1})$. The constructed f(x) has four saturation regions close to the PAM-4 amplitudes (−3, −1, 1, 3), which makes it suitable for PAM-4 equalization. Similarly, it can be extended to an 8-level sigmoid function to equalize modulated signals such as PAM-8. Details can be found in Supplementary Section 3. Figure 3b shows the structure of the tap-delay DNN equalizer with two hidden layers. The activation function in each neuron determines the nonlinear mapping characteristic across the network. The weight optimization process uses gradient descent coupled with the error back-propagation algorithm to iteratively update the network parameters.
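The piecewise activation defined above can be written down directly. The sketch below follows the stated formula for f(x), α and η₂; the value of η₁ is not given in this section, so the default `eta1=4.0` is a hypothetical choice used only to show the shape.

```python
import math


def pam4_activation(x, eta1=4.0):
    """Multi-level sigmoid f(x) from the text; eta1 is an assumed steepness."""
    # alpha selects which of the three shifted sigmoids is active
    if x <= -1:
        alpha = -1
    elif x <= 1:
        alpha = 0
    else:
        alpha = 1
    eta2 = (1 + math.exp(-eta1)) / (1 - math.exp(-eta1))
    return 2 * eta2 / (1 + math.exp(-eta1 * (x - 2 * alpha))) - eta2 + 2 * alpha


# Sample the function across the PAM-4 amplitude range
for x in (-6, -3, -1, 0, 1, 3, 6):
    print(f"f({x:+d}) = {pam4_activation(x):+.4f}")
```

With this definition the branches join continuously at x = ±1, where f(±1) = ±1, and the output flattens around ±1 and near ±(2 + η₂) ≈ ±3 for large η₁, which matches the four saturation regions described in the text.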

Since 224 Gbps PAM-4 is the standard rate for 1.6 T interfaces, we focus on 224 Gbps PAM-4 signal transmission here. Thanks to the ultrahigh bandwidth and wide flat passband of Si-SLMs, we obtained clear eye diagrams of 224 Gbps PAM-4 together with 200 Gbps PAM-4 at all channels of different wavelengths (1548–1555 nm, with 1 nm spacing; Ch2, Ch4, Ch6 and Ch8 with their corresponding wavelengths are shown here), as shown in Fig. 4a, with all BERs lower than 2 × 10^−2 (transmission results for all channels are shown in Supplementary Section 3), leading to a total capacity of 1.6 Tbps. Specifically, Fig. 4b shows the BERs at different wavelengths in the passband for the 8 channels, with the PAM-4 signal speed set to 130 Gbps, 170 Gbps, 200 Gbps and 224 Gbps, and the consistency between all WDM channels is favorable. Moreover, the BER curves with increasing data rates for different channels (Ch2, Ch4, Ch6 and Ch8 are shown as examples) are illustrated in Fig. 4c, and the almost overlapping lines confirm the uniformity between channels.

Furthermore, we verify that the implemented DNN equalizer captures the channel response instead of the generation pattern of the pseudo-random bit sequence (PRBS). Different PRBS patterns are generated: Pattern 0 is used to train the DNN, and the others (Patterns 1, 2 and 3) are used for BER testing. The results for 170 Gbps, 200 Gbps and 224 Gbps are shown in Fig. 4d. The performance is similar under different signal patterns, which indicates that the DNN equalizer learns the channel information rather than the signal pattern and has a robust equalization ability for different data patterns. Regarding neuron numbers, the BER performance of the DNN initially improves for all data rates as the number of input neurons increases, as shown in Fig. 4e. However, after reaching a certain neuron count, such as 25 neurons, the BER performance ceases to improve further. Meanwhile, as the number of neurons in the hidden layers increases, there is no significant (order-of-magnitude) variation in the BER performance, as depicted in Fig. 4f. In the experiment, the input layer is set to 25 neurons and each of the two hidden layers is set to 30 neurons.
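The layer sizes given above (25 input neurons, two hidden layers of 30 neurons) allow a rough estimate of the equalizer's size, and the tap-delay input can be pictured as a sliding window over the received samples. The sketch below is an illustration under stated assumptions, not the authors' implementation: it assumes one sample per symbol, a window centered on the current symbol with zero padding at the edges, and a single output neuron.

```python
def tap_delay_windows(samples, taps=25):
    """Build one centered window of `taps` samples per received symbol.

    Zero padding at both ends is an assumption made for this sketch.
    """
    half = taps // 2
    padded = [0.0] * half + list(samples) + [0.0] * half
    return [padded[i:i + taps] for i in range(len(samples))]


def mlp_parameter_count(layer_sizes):
    """Count weights and biases of a fully connected network."""
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))


# Placeholder "received" samples, just to exercise the windowing
received = [float(i) for i in range(100)]
windows = tap_delay_windows(received, taps=25)

# 25 inputs, two hidden layers of 30 neurons, one output (assumed)
n_params = mlp_parameter_count([25, 30, 30, 1])
print(len(windows), len(windows[0]), n_params)
```

Under these assumptions the network has fewer than two thousand trainable parameters, which gives a sense of scale for the claim that the two-hidden-layer DNN is a lightweight equalizer.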

As a relatively simple AI equalizer, our designed DNN equalizer with only two hidden layers takes up less computing resources, but still achieves high-quality transmission for 1.6 TbE interface with 224 Gbps per lane. As the simplest high-order modulation format, PAM-4 is highly feasible for data center scenarios. Therefore, it is crucial to realize optical transmission of 224 Gbps PAM-4 per channel based on a Si-SLM chip. Furthermore, based on 224 Gbps PAM-4 transmission per lane, a 3.2 TbE interface can be realized by scaling out the channels. More importantly, the high-quality PAM-4 results of DNN equalizer which takes less resources demonstrate the potential of ANN solutions and its adaptability to slow-light systems, which encourages us to continue developing more efficient ANN equalizers.

GRU transmission

By adopting the simple DNN equalizer, which requires few computing resources, we achieve a total transmission capacity of 1.6 Tbps with 224 Gbps PAM-4 per lane based on our WDM Si-SLM chip. This remarkable result demonstrates the high compatibility between the AI-accelerated technology and Si-SLM systems. Therefore, to explore the potential of AI-accelerated slow-light systems, we develop a more efficient ANN equalizer to push silicon photonics transmission to the next stage.

Recurrent neural networks (RNNs) are widely used for their proficiency in modeling temporal dynamics and processing sequential data. In principle, bi-directional RNNs (bi-RNNs) can efficiently handle not only inter-symbol interference among preceding and succeeding symbols caused by chromatic dispersion, but also the nonlinear impairments caused by devices and fiber transmission links^69,70,71. Additionally, compared to unidirectional RNNs, bi-RNNs model the dependence on both past and future states. A gated recurrent unit (GRU) contains two gates: a reset gate and an update gate^70,72. The GRU is a less complex variant of the long short-term memory (LSTM)⁷⁰, which is an advanced type of RNN that demonstrates robust capabilities for capturing and modeling long-term dependencies⁵⁷. The detailed structure of a GRU unit is shown in Fig. 5a. The bidirectional GRU (bi-GRU) model comprises two unidirectional GRU layers operating in opposite directions. By combining forward and backward GRU processing, the model incorporates information from both the future and the past to influence its current states. Figure 5b shows the architecture of the implemented bi-GRU network for nonlinear equalization in our Si-SLM transmission system. The first layer is the input layer, where the current symbol x_i is enclosed with its k preceding and k succeeding symbols. This sequence serves as the input to the bi-GRU network. The subsequent layer is the GRU layer, which consists of two GRU links. The input sequence is first passed through the first GRU layer. Subsequently, the sequence is reversed and processed through the second GRU layer. This approach allows us to process both preceding and subsequent data simultaneously, enabling information from both the past and future to influence the current states. The outputs of the bi-GRU layer are fully connected to a linear layer. As a result, the output layer produces the predicted class for the current symbol.
Consequently, the predicted class y_i for the i-th symbol x_i is determined. Some reports employ bi-GRU to compensate for fiber nonlinear impairments during long-distance transmission in coherent systems^70,73, but there are still few evaluations of bi-GRU performance in short-reach IM/DD systems, especially for systems that employ silicon devices at data rates around 400 Gbps in a high-density silicon chip. In this work, we employ the bi-GRU network to improve the transmission performance. Building on bi-GRU, we further propose and implement a new three-layer GRU equalizer (T-biGRU), whose output is determined by the states of three GRU layers. The first GRU layer processes the data in the forward direction, starting from the beginning of the sequence. The second GRU layer processes the data in the backward direction to capture reverse temporal dependencies. The third GRU layer, like the first, processes the data in the forward direction from the beginning of the sequence (see Methods). The hidden state encapsulates the flow of symbolic information across recurrent time steps, ensuring continuity and context throughout the sequence. The bi-GRU model relies on the states of two GRU layers, whereas the T-biGRU model uses the states of three GRU layers. By integrating forward, backward and repeated forward GRU processing, the T-biGRU model extracts both global and local features of the sequence more comprehensively, thereby further enhancing equalization performance.
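
As a rough illustration of the input layer described above, the following Python sketch groups each received symbol with its k preceding and k succeeding symbols and produces the forward and reversed views that the two GRU layers would consume. The function names, the zero padding at the sequence edges and the sample values are assumptions made for illustration; the source does not state how edge symbols are handled.

```python
def symbol_windows(samples, k, pad_value=0.0):
    """Return one window of length 2k+1 per symbol:
    k preceding samples, the current sample, k succeeding samples.
    Edge handling by constant padding is an assumption of this sketch."""
    padded = [pad_value] * k + list(samples) + [pad_value] * k
    return [padded[i:i + 2 * k + 1] for i in range(len(samples))]


def forward_and_reversed(window):
    """The two views of a window: as received (forward GRU layer)
    and time-reversed (backward GRU layer)."""
    return list(window), list(reversed(window))


# Hypothetical received samples, for illustration only.
rx = [0.1, 0.9, -0.3, 1.1, -1.0]
windows = symbol_windows(rx, k=2)
fwd, bwd = forward_and_reversed(windows[2])
print(fwd)  # window centered on rx[2]
print(bwd)  # same window, reversed
```

The center element of each window is the symbol being classified, so the network output for window i corresponds to the predicted class y_i.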

**Fig. 5: Bidirectional gated recurrent unit (bi-GRU) transmission results of PAM-4 signal.**

Next, we evaluated practical PAM-4 transmission for the Si-SLM chip using bi-GRU and T-biGRU, as depicted in Figs. 5–9. Here, our goal is to significantly increase the single-lane data rate of Si-SLMs through the AI approach. However, the spectral broadening of IM/DD signals cannot be ignored at ultrahigh data rates. Therefore, in the practical GRU WDM transmission experiment, we select on-chip channels Ch1, Ch3, Ch5 and Ch7 (odd channels) as one WDM transmission path and Ch2, Ch4, Ch6 and Ch8 (even channels) as the other, which reduces modulation crosstalk by increasing the spectral separation between channels. After each group of on-chip signals is combined through WDM, the two paths are transmitted in parallel to reach a total capacity of 3.2 Tbps, similar to the solution adopted by commercial optical module manufacturers⁷⁴. With bi-GRU equalization, we obtain high-quality eye diagrams of PAM-4 signals up to 400 Gbps, corresponding to an aggregate data rate of 3.2 Tbps over 8 parallel channels. Figure 5c summarizes the optical eye diagrams of PAM-4 signals at gradually increasing rates for different channels: the horizontal axis shows the channels (Ch2, Ch4, Ch6 and Ch8 are shown as examples) in the WDM system together with their corresponding wavelengths, and the vertical axis shows the PAM-4 transmission rates (280 Gbps, 320 Gbps, 360 Gbps and 400 Gbps). All the eye diagrams are clearly open and all BERs are below the HD-FEC threshold, even at 400 Gbps. Taking 280 Gbps PAM-4 as an example, which is already a fairly high rate for pure silicon modulators, our solution reduces the BERs to the order of 10^−5. More importantly, even when the rate increases to 400 Gbps, the BERs remain below the HD-FEC threshold, and the performance consistency between channels shows no downward trend. 
To the best of our knowledge, this is the highest single-lane IM/DD transmission rate reported for silicon modulators. Notably, we achieve it using only PAM-4, the industry-standard format for computing centers. In addition, the wide optical passbands of Si-SLMs provide the prerequisite for multi-wavelength communication. The BERs at different wavelengths of all channels are well below the HD-FEC threshold across the whole passband, as shown in Fig. 5d, and the passband consistency is favorable, which is critical for multi-wavelength applications. By leveraging single-lane 400 Gbps PAM-4 transmission, a total data capacity of 3.2 Tbps is achieved on the 8-channel Si-SLM chip with a compact modulation area of only 4 mm × 0.5 mm, leading to an on-chip data-rate density of 1.6 Tb/s/mm². With the improved data-rate density, more modulator arrays can be integrated on a single wafer for the same tape-out budget, thereby significantly increasing the total transmission capacity of one chip or reducing the average cost for a given target capacity. The BER curves of different channels as a function of data rate are shown in Fig. 5e, and the nearly overlapping curves illustrate the performance uniformity between channels. In particular, the BER curves vary smoothly, and no abrupt rise in BER is observed as the rate increases. The PAM-4 constellation at 400 Gbps is shown in the inset, where the four levels are clearly separated. For completeness, all the PAM-4 eye diagrams, constellations and corresponding BERs for all channels and data rates are summarized in Supplementary Section 4.
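
The capacity and density figures quoted above follow from simple arithmetic, which the short Python check below reproduces. The variable names are illustrative; the numbers (8 channels, 400 Gbps per lane, 4 mm by 0.5 mm modulation area) are taken from the text.

```python
# Values quoted in the text.
channels = 8
rate_per_lane_gbps = 400
length_mm, width_mm = 4.0, 0.5

total_tbps = channels * rate_per_lane_gbps / 1000   # aggregate capacity
area_mm2 = length_mm * width_mm                     # modulation area
density_tbps_per_mm2 = total_tbps / area_mm2        # on-chip data-rate density

print(total_tbps, area_mm2, density_tbps_per_mm2)
```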

Moreover, as in the DNN case, the system performance of bi-GRU at data rates of 360 Gbps, 380 Gbps and 400 Gbps is analyzed under different PRBS patterns, as shown in Fig. 6a. The performance is similar across signal patterns, which indicates that the implemented bi-GRU equalizer captures the channel response rather than the PRBS pattern. As shown in Fig. 6b, the BER generally improves with more GRU units in the hidden layer. From 340 Gbps to 400 Gbps, adjusting the number of GRU units remains effective in reducing the BER, with no obvious decline in this effectiveness, which reflects the applicability of the GRU solution to higher rates. In the experiment, 200 GRU units are chosen for the equalizer. Comprehensive discussions of the AI equalizer network configurations are provided in Supplementary Section 5. The robustness of the AI network equalizers is also analyzed, and improvement methods are given in Supplementary Section 7.

**Fig. 6: bi-GRU transmission analysis.**

The PAM-4 results based on the AI-accelerated slow-light technology demonstrate the great potential of silicon modulators for ultrahigh-speed applications. The ability of pure silicon modulators to achieve such high rates and maintain low BERs with PAM-4 signals, the simplest high-order format that complies with IEEE standards, is encouraging for the deployment of silicon photonics. Moreover, to verify the scalability of our AI-accelerated slow-light system to higher-order signals, we applied high-speed PAM-8 signals to the Si-SLM chip. Even for PAM-8 signals, which have more levels than PAM-4, the bi-GRU equalizer still effectively mitigates the nonlinear distortions in multi-level signals. The PAM-8 eye diagrams at 1550 nm (Ch3) from 240 Gbps to 390 Gbps are shown in Fig. 7a, with all BERs below the HD-FEC threshold. The constellations corresponding to each data rate are also shown, with favorable separation between the 8 levels. For WDM systems, similar to the PAM-4 results, the BERs for all wavelengths in the 8 channels remain below the HD-FEC threshold in Fig. 7b (all the PAM-8 results are presented in Supplementary Section 4). Figure 7c shows the BER curves as a function of data rate, and the consistency between channels is good. Compared with the PAM-4 format, when the data rate is around 240 Gbps, the BERs of PAM-8 are slightly lower than those of PAM-4. This is because the baud rate of PAM-8 (80 Gbaud) is fairly low, while the PAM-4 baud rate already reaches 120 Gbaud. At relatively low baud rates, especially below 100 Gbaud, the PAM-8 signal is more advantageous. However, as the data rate continues to rise, the BERs of PAM-8 increase faster than those of PAM-4, and the two are basically the same when the data rate reaches around 400 Gbps. 
The reason is that, as the data rate rises, PAM-8, with its larger number of signal levels, requires progressively more nonlinear compensation than PAM-4 in the ultrahigh-speed region under the same linearity. Nevertheless, adopting advanced formats with more levels is an effective way to increase the total rate under a limited baud rate, and our approach can also adapt to such higher-order formats.
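
The baud rates quoted in this comparison follow from the number of bits carried per PAM symbol, log2 of the number of levels. The Python sketch below reproduces the 240 Gbps figures from the text and applies the same relation to the highest rates reported; the function name is illustrative, and any coding overhead is ignored.

```python
import math


def baud_rate_gbaud(bit_rate_gbps, levels):
    """Symbol rate of a PAM-N signal: bit rate divided by bits per symbol."""
    return bit_rate_gbps / math.log2(levels)


pam4_240 = baud_rate_gbaud(240, 4)   # PAM-4 carries 2 bits per symbol
pam8_240 = baud_rate_gbaud(240, 8)   # PAM-8 carries 3 bits per symbol
pam4_400 = baud_rate_gbaud(400, 4)
pam8_390 = baud_rate_gbaud(390, 8)

print(pam4_240, pam8_240, pam4_400, pam8_390)
```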

**Fig. 7: bi-GRU transmission results of PAM-8 signal.**

The above experimental results are for the back-to-back (B2B) scenario. The transmission penalty over 100 m, 200 m and 300 m of standard single-mode fiber (SSMF) is then experimentally assessed for both PAM-4 and PAM-8 signals with the bi-GRU equalizer. Since the consistency of all channels has been shown to be favorable, the BER penalty is illustrated for one channel (Ch3) at 1550 nm. The transmission results for different SSMF distances are shown in Fig. 8, in which Fig. 8a shows the BERs of PAM-4 (280 Gbps, 320 Gbps, 360 Gbps, 400 Gbps) and Fig. 8b shows the BERs of PAM-8 (300 Gbps, 330 Gbps, 360 Gbps, 390 Gbps). Owing to the power fading effect in optical fiber, the BER performance gradually degrades with increasing transmission distance relative to the B2B scenario for both PAM-4 and PAM-8. At a transmission distance of 300 m, the BERs remain below the HD-FEC threshold at 360 Gbps but exceed it at 400 Gbps.

**Fig. 8: BER performance for different transmission distances.**

Furthermore, the transmission results with the T-biGRU equalizer (Fig. 9a) are evaluated for both PAM-4 (400 Gbps in Fig. 9b) and PAM-8 (390 Gbps in Fig. 9c). The BER curves of different channels (Ch2, Ch4, Ch6, Ch8) are shown in Fig. 9d for PAM-4 (280 Gbps, 320 Gbps, 360 Gbps, 400 Gbps) and in Fig. 9e for PAM-8 (300 Gbps, 330 Gbps, 360 Gbps, 390 Gbps). The results indicate that, with the T-biGRU equalizer, the BERs of all channels at around 400 Gbps can be reduced to below 10^−3, slightly better than in the bi-GRU case. The T-biGRU results for all eye diagrams, constellations and BERs corresponding to Fig. 9 are presented in Supplementary Section 4. The performance comparison of the three AI network equalizers (DNN, bi-GRU, T-biGRU) with traditional algorithms (DFE, VNLE) is shown in Supplementary Section 6. However, the computational complexity of the T-biGRU is 1.5 times that of the bi-GRU (see Methods), leading to more multiplications and higher power consumption (Supplementary Section 8). In selecting an equalizer, it is important to balance performance, computational complexity and power consumption to achieve the desired system performance effectively.

**Fig. 9: Three-layer bidirectional gated recurrent unit (T-biGRU) transmission results.**

Discussion

To explore 400 Gbps per wavelength in silicon photonics, we comprehensively optimize the device metrics through careful design. By adopting the slow-light approach, the bandwidth-efficiency trade-off of pure silicon modulators is greatly improved, paving the way for 400 Gbps transmission per wavelength. Our compact Si-SLM presents an ultrahigh EO bandwidth of 90 GHz and a modulation efficiency of 0.82 V⋅cm, while possessing a wide optical passband of 7 nm around 1550 nm. Thanks to the high CMOS compatibility of Si-SLMs, we fabricate an 8-channel Si-SLM chip on an SOI platform using a standard silicon photonic process. No additional heterogeneous materials or complex process steps are introduced in the fabrication flow, which is a prerequisite for large-scale and low-cost production at the wafer level. Our Si-SLM chip, with its favorable basic performance, provides a foundation for transmission capacity enhancement, and wafer-level production in the standard silicon photonic process ensures the feasibility of extensive deployment of this solution.

More specifically, Table 1 summarizes the properties of representative Si-MZMs, Si-MRMs and Si-SLMs on SOI platforms reported in recent years. The comparison shows that the bandwidth, modulation efficiency and passband of our Si-SLMs are all prominent among pure silicon modulators. The bandwidth of our device exceeds that of previous modulators, the modulation efficiency is improved compared with Si-MZMs, and the optical passband is wider than that of Si-MRMs (the full width at half maximum within one resonance period). The transmission performance of our device is also ahead of other Si-SLMs. Although the transmission rate has continued to rise in recent years with the proposal and optimization of various solutions, the overall upward trend is still slow. Notably, our modulator achieves single-lane 400 Gbps transmission using the industry-standard PAM-4 format, with all BERs below the HD-FEC threshold across a wide optical passband. As the highest single-lane rate demonstrated on standard silicon photonic platforms so far, this result reveals the great high-speed transmission potential of silicon photonics.

Table 1 Comparison with representative silicon modulators


The appropriate adoption of AI technology is key to realizing the potential of our Si-SLM chips, and slow-light systems are well suited to acceleration with an ANN equalizer. While the slow-light approach maximizes the overall performance of pure silicon modulators, its physical basis is still the nonlinear modulation of doped silicon (the prerequisite for large-scale industrial application), and compensating for nonlinear distortions is exactly what the ANN model specializes in. By embedding the ANN into the signal processing workflow of Si-SLMs, the intrinsic nonlinear distortions of pure silicon devices can be effectively mitigated. At the manufacturing level, since silicon does not possess a high and consistent EO coefficient, forming a PN junction is the basis for pure silicon modulation, yet achieving complete uniformity in the doping process remains challenging. In addition, although the CMOS-compatible waveguide grating structure already improves process stability compared with photonic crystals, structural fluctuations of the doped silicon gratings still exist owing to unavoidable fabrication deviations. Therefore, through AI equalization at the back end, the variations between Si-SLMs caused by process errors can be eliminated, thereby making the deployment of large-scale transmission systems possible. Three ANN models have been developed here: the DNN, bi-GRU and T-biGRU equalizers. First, we adopt the simple DNN equalizer with only two hidden layers, which requires fewer computing resources, and achieve a total capacity of 1.6 Tbps with 224 Gbps PAM-4 per lane under the IEEE rate standard. More importantly, by adopting the bi-GRU equalizer, we achieve ultrahigh-speed transmission of 400 Gbps per lane and a total capacity of 3.2 Tbps on the Si-SLM chip, using the standard high-order format PAM-4, with all BERs below the HD-FEC threshold. 
Considering the ultra-compact footprint (4 mm × 0.5 mm), we achieved an on-chip data-rate density of 1.6 Tb/s/mm². Meanwhile, benefiting from the wide, flat passband around 1550 nm, the links require neither individual resonant wavelength adjustment nor TEC operating platforms, thus reducing the system budget. Next, we verified the feasibility of PAM-8 signals, demonstrating the applicability of our AI-accelerated slow-light solution to higher-order data transmission with more levels. Furthermore, the transmission performance with the T-biGRU equalizer is evaluated, and the BERs at around 400 Gbps can be slightly improved for both PAM-4 and PAM-8 signals.

In addition, this work illustrates the potential of AI technology in the photonics field, and the proposed AI-accelerated Si-SLM system is a good example of "AI for SiPh" together with "SiPh for AI". For optical interconnections in computing centers, the AI-accelerated Si-SLM technology provides an ultrahigh-speed deployment solution that enables a leap in the single-lane transmission rate of the interfaces. The computing power of the computing centers can therefore be improved, leading to faster training iterations of the ANN. The enhanced ANN will in turn make signal processing for Si-SLM systems more immediate, thereby further improving the interconnection rates of computing centers. In short, a self-optimizing positive feedback loop between computing centers and Si-SLM systems can be constructed through the ANN based on this bi-directional promotion mechanism, leading to continuous improvement of the whole architecture. In particular, through efficient photonic neural networks, the deep fusion of optical interconnection and optical computing systems based on this synergy can capitalize on the inherent advantages of photonics to facilitate real-time system optimization.

In summary, we have demonstrated the first pure silicon photonic system with a total capacity of 3.2 Tbps based on AI-accelerated slow-light technology on an SOI platform. Integrating AI technology with Si-SLM systems offers a promising pathway to overcome the limitations of silicon photonics in ultrahigh-speed transmission. Using the industry-standard IM/DD format PAM-4, we achieve 400 Gbps/λ optical transmission in all wavelength channels, with BERs below the HD-FEC threshold. To the best of our knowledge, this is the highest single-lane transmission rate demonstrated on standard silicon photonic platforms. With precise design, our compact, thermal-insensitive Si-SLM chip simultaneously possesses an ultrahigh bandwidth, a high modulation efficiency and a wide optical passband, leading to an on-chip data-rate density of 1.6 Tb/s/mm² without individual resonant wavelength adjustment or additional TEC operating platforms. The proposed AI-accelerated slow-light technology significantly increases the transmission rate of silicon photonics, paving the way for 400 Gbps optical transmission per lane. Our work demonstrates the great value of AI for silicon photonics and highlights the potential of standard silicon photonic platforms for next-generation optical interconnections of 3.2 TbE and beyond.

Methods

Fabrication of the devices

The silicon slow-light structure is CMOS-compatible, so the Si-SLMs can be fabricated in a standard silicon photonic process, leveraging the industrial advantages of silicon photonics. The Si-SLM chips are fabricated with a standard 90 nm photolithography process on a 200-mm SOI wafer with a silicon thickness of 220 nm at CompoundTek Pte. A 90-nm-thick rib area is partially etched in the 220-nm-thick silicon for carrier doping and metal contact. The P-type and N-type doping concentrations in the depletion-mode PN junction are both 5.0 × 10¹⁷/cm³ to ensure a sufficiently high electrical bandwidth. All feature parameters of the slow-light waveguide satisfy the requirements of the commercial silicon photonic foundry, providing a basis for large-scale and low-cost production at the wafer level.

Experimental details

In the electro-optical response test, a vector network analyzer (Keysight PNA-X Network Analyzer N5247B) with its lightwave component analyzer (Keysight LCA Optical Receiver N4372E) is used to measure the bandwidth. For the optical test, the spectra are obtained with a high-resolution optical spectrum analyzer (Yokogawa AQ6370C). For the transmission experiments, the high-speed signal is generated by an arbitrary waveform generator (Keysight M8199B); the differential signals are then amplified by a commercial SHF single-ended driver to a peak-to-peak voltage (Vpp) of 5 V and injected into the modulator, which operates in the dual-drive push-pull configuration. After the electro-optical conversion, the modulated optical signal is amplified by an Amonics pre-amp erbium-doped fiber amplifier to compensate for the loss. At the receiving end, the signals are sampled by a real-time oscilloscope (Keysight UXR-Series, 256 GSa/s) and then processed offline.

Algorithm details

DNN

The implemented lite DNN equalizer consists of an input layer with M neurons, followed by two hidden layers with N₁ and N₂ neurons, respectively. The architecture culminates in an output layer with a single neuron. The DNN is trained using the mean-square-error (MSE) criterion, employing the back-propagation (BP) algorithm. The training process is executed in two distinct steps. Initially, the forward propagation phase calculates the output based on the input feed. This process is detailed as

$$\left\{\begin{array}{l}{Y}_{1}(n)=f({W}_{1}{(n)}^{T}X(n))\quad \\ {Y}_{2}(n)=f({W}_{2}{(n)}^{T}{Y}_{1}(n))\quad \\ {y}_{out}(n)=f({W}_{3}{(n)}^{T}{Y}_{2}(n))\quad \end{array}\right.$$

(1)

where

$$X(n)={\left[\begin{array}{cccc}{x}_{1}(n)&{x}_{2}(n)&\ldots &{x}_{M}(n)\end{array}\right]}^{T}$$

(2)

$${W}_{1}(n)={\left[\begin{array}{cccc}{w}_{1,11}(n)&{w}_{1,12}(n)&\ldots &{w}_{1,1M}(n)\\ {w}_{1,21}(n)&{w}_{1,22}(n)&\ldots &{w}_{1,2M}(n)\\ \ldots &\ldots &\ldots &\ldots \\ {w}_{1,{N}_{1}1}(n)&{w}_{1,{N}_{1}2}(n)&\ldots &{w}_{1,{N}_{1}M}(n)\end{array}\right]}^{T}$$

(3)

$${W}_{2}(n)={\left[\begin{array}{cccc}{w}_{2,11}(n)&{w}_{2,12}(n)&\ldots &{w}_{2,1{N}_{1}}(n)\\ {w}_{2,21}(n)&{w}_{2,22}(n)&\ldots &{w}_{2,2{N}_{1}}(n)\\ \ldots &\ldots &\ldots &\ldots \\ {w}_{2,{N}_{2}1}(n)&{w}_{2,{N}_{2}2}(n)&\ldots &{w}_{2,{N}_{2}{N}_{1}}(n)\end{array}\right]}^{T}$$

(4)

$${W}_{3}(n)={\left[\begin{array}{cccc}{w}_{3,1}(n)&{w}_{3,2}(n)&\ldots &{w}_{3,{N}_{2}}(n)\end{array}\right]}^{T}$$

(5)

$${Y}_{1}(n)={\left[\begin{array}{cccc}{y}_{1,1}(n)&{y}_{1,2}(n)&\ldots &{y}_{1,{N}_{1}}(n)\end{array}\right]}^{T}$$

(6)

$${Y}_{2}(n)={\left[\begin{array}{cccc}{y}_{2,1}(n)&{y}_{2,2}(n)&\ldots &{y}_{2,{N}_{2}}(n)\end{array}\right]}^{T}$$

(7)

Here, f(⋅) is the activation function (Supplementary Section 3) discussed in the DNN transmission section, X(n) is the input vector, Y₁(n) and Y₂(n) are the outputs of the first and second hidden layers, respectively, y_out(n) is the output signal, and W₁, W₂ and W₃ are the weight matrices between the layers of the DNN equalizer.
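
The forward propagation of Eq. (1) can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the activation f is given in Supplementary Section 3 and is not reproduced here, so tanh is used as a placeholder; the layer sizes and weight values are hypothetical; and each weight argument holds the rows of the transposed matrix, one row per output neuron.

```python
import math


def layer(weights_t, inputs, f):
    """Compute f(W^T x). weights_t holds the rows of W^T,
    i.e. one list of input weights per output neuron."""
    return [f(sum(w * x for w, x in zip(row, inputs))) for row in weights_t]


def dnn_forward(x, w1_t, w2_t, w3, f=math.tanh):
    """Forward pass of Eq. (1): two hidden layers and a single output neuron.
    f=tanh is a placeholder, not the activation used in the paper."""
    y1 = layer(w1_t, x, f)          # first hidden layer, N1 values
    y2 = layer(w2_t, y1, f)         # second hidden layer, N2 values
    y_out = layer([w3], y2, f)[0]   # single output neuron
    return y1, y2, y_out


# Hypothetical sizes M = 3, N1 = 2, N2 = 2 with arbitrary weights.
x = [0.2, -0.5, 1.0]
w1_t = [[0.1, 0.4, -0.3], [0.7, -0.2, 0.5]]
w2_t = [[0.3, -0.6], [0.8, 0.1]]
w3 = [0.5, -0.9]
y1, y2, y_out = dnn_forward(x, w1_t, w2_t, w3)
print(y_out)
```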

The second step in the training process is back-propagation, where the calculated error term is propagated from the output layer back to the input layer. This mechanism allows for the adjustment of the network's weights to minimize the error. The square error for a single training example is defined as

$$E(X(n),info(n))={\left\vert \, {y}_{out}(n)-info(n)\right\vert }^{2}$$

(8)

where info(n) is the desired signal.

The stochastic gradient descent is utilized to optimize the cost function with respect to the weight matrices, including W₁, W₂ and W₃. The updated weight matrix for the subsequent iteration can be derived by

$${W}_{i}(n+1)={W}_{i}(n)-\eta \frac{\partial E(X(n),info(n))}{\partial {W}_{i}(n)}\qquad i=1,2,3$$

(9)

where η indicates the learning rate of the network.
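
To make the update rule of Eqs. (8) and (9) concrete, the sketch below applies one stochastic-gradient step to a deliberately simplified model: a single linear output neuron with no activation, rather than the full three-matrix DNN. The simplification, the function names and the numerical values are assumptions chosen so that the gradient can be written by hand.

```python
def squared_error(y_out, info):
    """Eq. (8): squared error for one training example."""
    return abs(y_out - info) ** 2


def linear_output(w, x):
    """Simplified model used in this sketch: y_out = w . x (no activation)."""
    return sum(wi * xi for wi, xi in zip(w, x))


def sgd_step(w, x, info, eta):
    """Eq. (9) for the simplified model: w <- w - eta * dE/dw,
    where dE/dw = 2 * (y_out - info) * x."""
    y_out = linear_output(w, x)
    grad = [2.0 * (y_out - info) * xi for xi in x]
    return [wi - eta * gi for wi, gi in zip(w, grad)]


# Hypothetical training example and learning rate.
w = [0.0, 0.0]
x = [1.0, 2.0]
info = 1.0
eta = 0.1

error_before = squared_error(linear_output(w, x), info)
w_new = sgd_step(w, x, info, eta)
error_after = squared_error(linear_output(w_new, x), info)
print(w_new, error_before, error_after)
```

In the full equalizer the same rule is applied to W₁, W₂ and W₃, with the gradients obtained by back-propagation through the activation functions.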

Consider the four-layer DNN equalizer with n_in, n_H and n_out nodes in the input, hidden and output layers, respectively. Typically, the output layer has a single node. Here, n_in can be viewed as the number of taps in a digital filter. The primary computational burden of the equalizer arises from the error propagation needed to calculate the squared-error derivative at each node of the hidden layers. Compared with a DNN that employs a sigmoid function with two saturation regions, a DNN that employs the multi-level sigmoid does not incur additional computational requirements. The computational complexity of the DNN equalizer per iteration, as counted from the program, is:

$${C}_{DNN}=8{n}_{H}^{2}+7{n}_{in}{n}_{H}+12{n}_{H}+4{n}_{in}+3{n}_{out}$$

(10)
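
Equation (10) is straightforward to evaluate numerically. The helper below is a direct transcription; the function name and the example sizes (n_in = 10, n_H = 5) are hypothetical and are not the configuration used in the experiments.

```python
def dnn_complexity(n_in, n_h, n_out=1):
    """Per-iteration complexity of the DNN equalizer, Eq. (10)."""
    return 8 * n_h ** 2 + 7 * n_in * n_h + 12 * n_h + 4 * n_in + 3 * n_out


# Hypothetical sizes, for illustration only.
example = dnn_complexity(n_in=10, n_h=5)
print(example)
```

The quadratic term in n_H dominates for wide hidden layers, so doubling the hidden-layer width roughly quadruples the cost.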

GRU

A GRU unit is composed of a reset gate r_t and an update gate z_t. The output h_t is determined by both the current input x_t and the previous state h_{t−1} under the control of these two gates. The outputs of the gates and the GRU unit are calculated as follows:

$$\begin{array}{l}{r}_{t}=\sigma ({W}_{r}{x}_{t}+{U}_{r}{h}_{t-1}+{b}_{r})\\ {z}_{t}=\sigma ({W}_{z}{x}_{t}+{U}_{z}{h}_{t-1}+{b}_{z})\\ {\tilde{h}}_{t}=tanh\left[{W}_{h}{x}_{t}+{U}_{h}({r}_{t}\odot {h}_{t-1})+{b}_{h}\right]\\ {h}_{t}=(1-{z}_{t})\odot {h}_{t-1}+{z}_{t}\odot {\tilde{h}}_{t}\end{array}$$

(11)

where W_r, U_r, W_z, U_z, W_h and U_h are the weight matrices; b_r, b_z and b_h are the combined bias vectors for the input x_t and the previous state h_{t−1}; σ is the logistic sigmoid function; tanh is the hyperbolic tangent activation function; and ⊙ denotes the Hadamard product.
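
A single GRU step following Eq. (11) can be written in plain Python as below. This is a minimal sketch rather than the authors' implementation: matrices are nested lists, the parameter dictionary keys mirror the symbols of Eq. (11), and the helper `make_params` with its constant fill value is a hypothetical convenience for building small test cases.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def matvec(m, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def add(*vecs):
    """Element-wise sum of several vectors."""
    return [sum(items) for items in zip(*vecs)]


def gru_step(x_t, h_prev, p):
    """One GRU update, Eq. (11). p maps W_*, U_*, b_* to their values."""
    r = [sigmoid(v) for v in add(matvec(p["W_r"], x_t), matvec(p["U_r"], h_prev), p["b_r"])]
    z = [sigmoid(v) for v in add(matvec(p["W_z"], x_t), matvec(p["U_z"], h_prev), p["b_z"])]
    gated = [ri * hi for ri, hi in zip(r, h_prev)]  # r_t (Hadamard) h_{t-1}
    h_cand = [math.tanh(v) for v in add(matvec(p["W_h"], x_t), matvec(p["U_h"], gated), p["b_h"])]
    # h_t = (1 - z_t) * h_{t-1} + z_t * candidate
    return [(1.0 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h_prev, h_cand)]


def make_params(n_e, n_h, value=0.0):
    """Hypothetical helper: constant-valued parameters of the right shapes."""
    p = {}
    for g in ("r", "z", "h"):
        p["W_" + g] = [[value] * n_e for _ in range(n_h)]
        p["U_" + g] = [[value] * n_h for _ in range(n_h)]
        p["b_" + g] = [value] * n_h
    return p


params = make_params(n_e=1, n_h=2)
h_next = gru_step([0.7], [1.0, -2.0], params)
print(h_next)
```

With all parameters set to zero, both gates equal 0.5 and the candidate state is zero, so the step simply halves the previous state, which gives a quick sanity check of the update equation.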

Models with a bidirectional structure are capable of learning information from both preceding and following data when processing the current data. The bi-GRU model employed in this work comprises two unidirectional GRU layers operating in opposite directions. By combining forward and backward GRU processing, the model incorporates information from both the future and the past to influence its current states. The bi-GRU model relies on the states of two GRU layers, whereas the T-biGRU model uses the states of three GRU layers. By integrating forward, backward and repeated forward GRU processing, the T-biGRU model extracts both global and local features of the sequence more comprehensively, thereby further enhancing equalization performance. The bi-GRU model can be mathematically described as follows:

$$\begin{array}{l}{\overrightarrow{{h}_{t}}}_{bi-GRU}=GR{U}_{fwd}({x}_{t},\overrightarrow{{h}_{t-1}})\\ {\overleftarrow{{h}_{t}}}_{bi-GRU}=GR{U}_{bwd}({x}_{t},\overleftarrow{{h}_{t+1}})\\ {{h}_{t}}_{bi-GRU}={\overrightarrow{{h}_{t}}}_{bi-GRU}\oplus {\overleftarrow{{h}_{t}}}_{bi-GRU}\end{array}$$

(12)

where ${\overrightarrow{{h}_{t}}}_{bi-GRU}$ and ${\overleftarrow{{h}_{t}}}_{bi-GRU}$ are the states of the forward and backward GRU layers, respectively, ${{h}_{t}}_{bi-GRU}$ is the output of bi-GRU, and ⊕ denotes the concatenation of two vectors.

For the case of T-biGRU:

$$\begin{array}{l}{\overrightarrow{{h}_{t1}}}_{T-biGRU}=GR{U}_{fwd}({x}_{t},\overrightarrow{{h}_{t-1}})\\ {\overleftarrow{{h}_{t}}}_{T-biGRU}=GR{U}_{bwd}({x}_{t},\overleftarrow{{h}_{t+1}})\\ {\overrightarrow{{h}_{t2}}}_{T-biGRU}=GR{U}_{fwd}({x}_{t},\overrightarrow{{h}_{t-1}})\\ {{h}_{t}}_{T-biGRU}={\overrightarrow{{h}_{t1}}}_{T-biGRU}\oplus {\overleftarrow{{h}_{t}}}_{T-biGRU}\oplus {\overrightarrow{{h}_{t2}}}_{T-biGRU}\end{array}$$

(13)

where ${\overrightarrow{{h}_{t1}}}_{T-biGRU}$, ${\overleftarrow{{h}_{t}}}_{T-biGRU}$ and ${\overrightarrow{{h}_{t2}}}_{T-biGRU}$ are the states of the first forward GRU, the backward GRU and the second forward GRU, respectively. ${{h}_{t}}_{T-biGRU}$ represents the output of T-biGRU.
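
The concatenation structure of Eqs. (12) and (13) can be sketched independently of the GRU internals. In the Python example below, each layer is any function `cell(x_t, h_prev)` that returns a new state; the toy leaky-accumulator cell stands in for a trained GRU layer and, like all names and values here, is an assumption made purely for illustration.

```python
def run_layer(cell, xs, h0, reverse=False):
    """Run one recurrent layer over xs. States are returned in time order,
    whether the layer scans forward (t-1 -> t) or backward (t+1 -> t)."""
    order = range(len(xs) - 1, -1, -1) if reverse else range(len(xs))
    states = [None] * len(xs)
    h = h0
    for t in order:
        h = cell(xs[t], h)
        states[t] = h
    return states


def concat_layers(xs, layers, h0):
    """Concatenate, per time step, the states of several layers.
    layers is a list of (cell, reverse) pairs."""
    per_layer = [run_layer(cell, xs, h0, rev) for cell, rev in layers]
    return [sum((states[t] for states in per_layer), []) for t in range(len(xs))]


def make_toy_cell(a):
    """Stand-in for a GRU layer: h_t = a * h_prev + x_t (one hidden unit)."""
    return lambda x_t, h_prev: [a * h_prev[0] + x_t]


xs = [1.0, 2.0, 3.0]
h0 = [0.0]
fwd1, bwd, fwd2 = make_toy_cell(0.5), make_toy_cell(0.5), make_toy_cell(0.0)

bi_out = concat_layers(xs, [(fwd1, False), (bwd, True)], h0)                 # Eq. (12)
t_out = concat_layers(xs, [(fwd1, False), (bwd, True), (fwd2, False)], h0)   # Eq. (13)
print(bi_out)
print(t_out)
```

Each output vector of the three-layer variant is 1.5 times as long as that of the two-layer variant, which mirrors the complexity ratio between T-biGRU and bi-GRU.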

The main computational complexity of one GRU layer can be described as:

$${C}_{GRU}=3{n}_{H}({n}_{E}+{n}_{H})$$

(14)

where n_E is the input size of the GRU layer and n_H is the number of GRU units used in the layer. The complexity of the bi-GRU and T-biGRU layers can be calculated as:

$$\begin{array}{l}{C}_{bi-GRU}=2\times 3{n}_{H}({n}_{E}+{n}_{H})\\ {C}_{T-biGRU}=3\times 3{n}_{H}({n}_{E}+{n}_{H})\end{array}$$

(15)
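
Equations (14) and (15) can be evaluated with a short helper, shown below as a sketch. The value n_H = 200 matches the number of GRU units reported for the experiment; the input size n_E = 1 is an assumption made for illustration, since the source does not state it here.

```python
def gru_layer_complexity(n_e, n_h):
    """Main complexity of one GRU layer, Eq. (14)."""
    return 3 * n_h * (n_e + n_h)


def stacked_complexity(n_layers, n_e, n_h):
    """Eq. (15): n_layers = 2 for bi-GRU, n_layers = 3 for T-biGRU."""
    return n_layers * gru_layer_complexity(n_e, n_h)


n_e, n_h = 1, 200  # n_e is assumed; n_h = 200 is taken from the text
c_bi = stacked_complexity(2, n_e, n_h)
c_t = stacked_complexity(3, n_e, n_h)
print(c_bi, c_t, c_t / c_bi)
```

The ratio is 1.5 for any n_E and n_H, consistent with the statement that T-biGRU is 1.5 times as complex as bi-GRU.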

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data that support the plots within this paper and other findings of this study are available in the Zenodo database [https://doi.org/10.5281/zenodo.15631151]. All other data used in this study are available from the corresponding authors upon request.

Code availability

The codes that support the findings of this study are available from the corresponding authors upon request.

References

Cheng, Q., Bahadori, M., Glick, M., Rumley, S. & Bergman, K. Recent advances in optical technologies for data centers: a review. Optica 5, 1354â€“1370 (2018).
ArticleÂ ADSÂ CASÂ Google ScholarÂ
Shu, H. et al. Microcomb-driven silicon photonic systems. Nature 605, 457â€“463 (2022).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Shu, H. et al. Microcomb technology: from principles to applications. Photonics Insights 3, R09â€“R09 (2024).
ArticleÂ Google ScholarÂ
Atabaki, A. H. et al. Integrating photonics with silicon nanoelectronics for the next generation of systems on a chip. Nature 556, 349â€“354 (2018).
ArticleÂ ADSÂ CASÂ PubMedÂ Google ScholarÂ
Siew, S. Y. et al. Review of silicon photonics technology and platform development. J. Lightwave Technol. 39, 4374â€“4389 (2021).
ArticleÂ ADSÂ CASÂ Google ScholarÂ
Bogaerts, W. & Chrostowski, L. Silicon photonics circuit design: methods, tools and challenges. Laser Photonics Rev. 12, 1â€“29 (2018).
ArticleÂ Google ScholarÂ
Soref, R. A. & Bennnett, B. R. Electrooptical effects in silicon. J. Quantum Electron. 23, 123â€“129 (1987).
ArticleÂ ADSÂ Google ScholarÂ
Reed, G. T., Mashanovich, G., Gardes, F. Y. & Thomson, D. J. Silicon optical Modulators. Nat. Photonics 4, 518â€“526 (2010).
ArticleÂ ADSÂ CASÂ Google ScholarÂ
Reed, G. T. et al. Recent breakthroughs in carrier depletion based silicon optical modulators. Nanophotonics 3, 229â€“245 (2014).
ArticleÂ CASÂ Google ScholarÂ
Shi, Y. et al. Silicon photonics for high-capacity data communications. Photonics Res. 10, A106â€“A134 (2022).
ArticleÂ CASÂ Google ScholarÂ
Stanley, S. 800G Client Optics in the Data Center. A Heavy Reading white paper produced for Cisco. https://www.cisco.com/c/dam/en/us/products/interfaces-modules/transceiver-modules/white-paper-sp-800g-client-optics-data-center.pdf (2022).
Zhou, X., Urata, R. & Liu, H. Beyond 1 Tb/s intra-data center interconnect technology: IM-DD OR coherent? J. Lightwave Technol. 38, 475â€“484 (2020).
ArticleÂ ADSÂ Google ScholarÂ
Rahim, A. et al. Taking silicon photonics modulators to a higher performance level: state-of-the-art and a review of new technologies. Adv. Photonics 3, 024003 (2021).
ArticleÂ ADSÂ CASÂ Google ScholarÂ
Zhou, X., Yi, D., Chan, D. W. U. & Tsang, H. K. Silicon photonics for high-speed communications and photonic signal processing. npj Nanophotonics 1, 1–14 (2024).
Wang, C. et al. Integrated lithium niobate electro-optic modulators operating at CMOS-compatible voltages. Nature 562, 101–104 (2018).
He, M. et al. High-performance hybrid silicon and lithium niobate Mach–Zehnder modulators for 100 Gbit s⁻¹ and beyond. Nat. Photonics 13, 359–364 (2019).
Xu, M. et al. High-performance coherent optical modulators based on thin-film lithium niobate platform. Nat. Commun. 11, 1–7 (2020).
Zhang, M., Wang, C., Kharel, P., Zhu, D. & Loncar, M. Integrated lithium niobate electro-optic modulators: when performance meets scalability. Optica 8, 652–667 (2021).
Kharel, P., Reimer, C., Luke, K., He, L. & Zhang, M. Breaking voltage-bandwidth limits in integrated lithium niobate modulators using micro-structured electrodes. Optica 8, 357–363 (2021).
Ayata, M. et al. High-speed plasmonic modulator in a single metal layer. Science 358, 630–632 (2017).
Haffner, C. et al. Low-loss plasmon-assisted electro-optic modulator. Nature 556, 483–486 (2018).
Alloatti, L. et al. 100 GHz silicon-organic hybrid modulator. Light Sci. Appl. 3, 1–4 (2014).
Lu, G. W. et al. High-temperature-resistant silicon-polymer hybrid modulator operating at up to 200 Gbit s⁻¹ for energy-efficient datacentres and harsh-environment applications. Nat. Commun. 11, 1–9 (2020).
Liu, A. et al. A high-speed silicon optical modulator based on a metal-oxide-semiconductor capacitor. Nature 427, 615–618 (2004).
Ding, R. et al. High-speed silicon modulator with slow-wave electrodes and fully independent differential drive. J. Lightwave Technol. 32, 2240–2247 (2014).
Patel, D. et al. Design, analysis, and transmission system performance of a 41 GHz silicon photonic modulator. Opt. Express 23, 14263–14287 (2015).
Li, M., Wang, L., Li, X., Xiao, X. & Yu, S. Silicon intensity Mach-Zehnder modulator for single lane 100 Gb/s applications. Photonics Res. 6, 109–116 (2018).
Li, K. et al. Electronic-photonic convergence for silicon photonics transmitters beyond 100 Gbps on-off keying. Optica 7, 1514–1516 (2020).
Zhang, H. et al. 800 Gbit/s transmission over 1 km single-mode fiber using a four-channel silicon photonic transmitter. Photonics Res. 8, 1776–1782 (2020).
Alam, M. S. et al. Net 220 Gbps/λ IM/DD Transmssion in O-Band and C-Band With Silicon Photonic Traveling-Wave MZM. J. Lightwave Technol. 39, 4270–4278 (2021).
Mohammadi, A., Zheng, Z., Zhang, X., Rusch, L. A. & Shi, W. Segmented silicon modulator with a bandwidth beyond 67 GHz for high-speed signaling. J. Lightwave Technol. 41, 5059–5066 (2023).
Li, K. et al. An integrated CMOS-silicon photonics transmitter with a 112 gigabaud transmission and picojoule per bit energy efficiency. Nat. Electron. 6, 910–921 (2023).
Xu, Q., Schmidt, B., Pradhan, S. & Lipson, M. Micrometre-scale silicon electro-optic modulator. Nature 435, 325–327 (2005).
Bogaerts, W. et al. Silicon microring resonators. Laser Photonics Rev. 6, 47–73 (2012).
Sun, J. et al. A 128 Gb/s PAM4 silicon microring modulator with integrated thermo-optic resonance tuning. J. Lightwave Technol. 37, 110–115 (2019).
Cai, H., Fu, S., Yu, Y. & Zhang, X. Lateral-zigzag PN junction enabled high-efficiency silicon micro-ring modulator working at 100Gb/s. Photonics Technol. Lett. 34, 525–528 (2022).
Chan, D. W. U. et al. C-band 67 GHz silicon photonic microring modulator for dispersion-uncompensated 100 Gbaud PAM-4. Opt. Lett. 47, 2935–2938 (2022).
Zhang, Y. et al. 240 Gb/s optical transmission based on an ultrafast silicon microring modulator. Photonics Res. 10, 1127–1133 (2022).
Chan, D. W. U. et al. Efficient 330-Gb/s PAM-8 modulation using silicon microring modulators. Opt. Lett. 48, 1036–1039 (2023).
Yuan, Y. et al. A 5 × 200 Gbps microring modulator silicon chip empowered by two-segment Z-shape junctions. Nat. Commun. 15, 1–9 (2024).
Shekhar, S. et al. Roadmapping the next generation of silicon photonics. Nat. Commun. 15, 1–15 (2024).
Hinakura, Y., Arai, H. & Baba, T. 64 Gbps Si photonic crystal slow light modulator by electro-optic phase matching. Opt. Express 27, 14321–14327 (2019).
Hinakura, Y., Akiyama, D., Ito, H. & Baba, T. Silicon photonic crystal modulators for high-speed transmission and wavelength division multiplexing. J. Sel. Top. Quantum Electron. 27, 4900108 (2021).
Kawahara, K. et al. High-speed, low-voltage, low-bit-energy silicon photonic crystal slow-light modulator with impedance-engineered distributed electrodes. Optica 11, 1212–1219 (2024).
Jafari, O., Shi, W. & LaRochelle, S. Mach-Zehnder silicon photonic modulator assisted by phase-shifted Bragg gratings. Photonics Technol. Lett. 32, 445–448 (2020).
Jafari, O., Zhalehpour, S., Shi, W. & LaRochelle, S. DAC-less PAM-4 slow-light silicon photonic modulator providing high efficiency and stability. J. Lightwave Technol. 39, 5074–5082 (2021).
Han, C. et al. Slow-light silicon modulator with 110-GHz bandwidth. Sci. Adv. 9, eadi5339 (2023).
Zhou, J., Wang, J., Zhu, L. & Zhang, Q. Silicon photonics for 100Gbaud. J. Lightwave Technol. 39, 857–867 (2021).
Zhong, K. et al. Digital signal processing for short-reach optical communications: a review of current technologies and future trends. J. Lightwave Technol. 36, 377–400 (2018).
Che, D. & Chen, X. Modulation format and digital signal processing for IM-DD optics at post-200G era. J. Lightwave Technol. 42, 588–605 (2024).
Wang, H. et al. Scientific discovery in the age of artificial intelligence. Nature 620, 47–60 (2023).
Bai, B. et al. Microcomb-based integrated photonic processing unit. Nat. Commun. 14, 1–10 (2023).
Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441–446 (2017).
Ma, W. et al. Deep learning for the design of photonic structures. Nat. Photonics 15, 77–90 (2021).
Liu, D., Tan, Y., Khoram, E. & Yu, Z. Training deep neural networks for the inverse design of nanophotonic structures. ACS Photonics 5, 1365–1369 (2018).
Zhao, H. & Zhang, J. Adaptively combined FIR and functional link artificial neural network equalizer for nonlinear communication channel. IEEE Trans. Neural Netw. 20, 665–674 (2009).
Liu, C. et al. 81-GHz W-band 60-Gbps 64-QAM wireless transmission based on a dual-GRU equalizer. Opt. Express 30, 2364–2377 (2022).
Liu, S. et al. A multilevel artificial neural network nonlinear equalizer for millimeter-wave mobile fronthaul systems. J. Lightwave Technol. 35, 4406–4417 (2017).
Vlasov, Y. A., O’Boyle, M., Hamann, H. F. & McNab, S. J. Active control of slow light on a chip with photonic crystal waveguides. Nature 438, 65–69 (2005).
Baba, T. Slow light in photonic crystals. Nat. Photonics 2, 465–473 (2008).
Krauss, T. F. Why do we need slow light. Nat. Photonics 2, 448–450 (2008).
Brosi, J. M. et al. High-speed low-voltage electro-optic modulator with a polymer-infiltrated silicon photonic crystal waveguide. Opt. Express 16, 4177–4191 (2008).
Jiang, Y., Jiang, W., Gu, L., Chen, X. & Chen, R. T. 80-micron interaction length silicon photonic crystal waveguide modulator. Appl. Phys. Lett. 87, 221105 (2005).
Lin, C. Y. et al. Electro-optic polymer infiltrated silicon photonic crystal slot waveguide modulator with 23 dB slow light enhancement. Appl. Phys. Lett. 97, 093304 (2010).
Gao, X. et al. Dirac-vortex topological cavities. Nat. Nanotechnol. 15, 1012–1018 (2020).
Yariv, A., Xu, Y., Lee, R. K. & Scherer, A. Coupled-resonator optical waveguide: a proposal and analysis. Opt. Lett. 24, 711–713 (1999).
Yang, Y., Peng, C., Liang, Y., Li, Z. & Noda, S. Analytical perspective for bound states in the continuum in photonic crystal slabs. Phys. Rev. Lett. 113, 037401 (2014).
Jin, J. et al. Topologically enabled ultrahigh-Q guided resonances robust to out-of-plane scattering. Nature 574, 501–504 (2019).
Deligiannidis, S., Bogris, A., Mesaritakis, C. & Kopsinis, Y. Compensation of fiber nonlinearities in digital coherent systems leveraging long short-term memory neural networks. J. Lightwave Technol. 38, 5991–5999 (2020).
Liu, X. et al. Bi-directional gated recurrent unit neural network based nonlinear equalizer for coherent optical communication system. Opt. Express 29, 5923–5933 (2021).
Liu, Y. et al. Attention-aided partial bidirectional RNN-based nonlinear equalizer in coherent optical systems. Opt. Express 30, 32908–32923 (2022).
Cho, K. et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Conference on Empirical Methods in Natural Language Processing 1724–1734 (Association for Computational Linguistics, 2014).
Murphy, S., Jamali, F., Townsend, P. & Antony, C. High dynamic range 100G PON enabled by SOA preamplifier and recurrent neural networks. J. Lightwave Technol. 41, 3522–3532 (2023).
Dong, P. et al. Silicon photonics for 800G and beyond. In Optical Fiber Communication Conference, M4H.1 (IEEE, 2022).

Acknowledgements

The authors thank Xi Xiao, Peiqi Zhou and Qiansheng Wang at the National Information Optoelectronics Innovation Center for testing support. This work was supported by the National Key Research and Development Program of China (2022YFB2803700 to H.S.), the National Natural Science Foundation of China (62235002 to X.W., 62327811 to X.W., 12204021 to H.S., 62322501 to H.S., 12374340 to J.Q.), the Beijing Municipal Science and Technology Commission (Z221100006722003 to X.W.), the Beijing Municipal Natural Science Foundation (Z210004 to X.W.), IPOC (2021A03 to J.Q.) and the Major Key Project of PCL (to X.W.).

Author information

These authors contributed equally: Changhao Han, Qipeng Yang, Jun Qin, Yan Zhou.

Authors and Affiliations

State Key Laboratory of Photonics and Communications, School of Electronics, Peking University, Beijing, China
Changhao Han, Qipeng Yang, Zhao Zheng, Yimeng Wang, Yichen Wu, Shaohua Yu, Weiwei Hu, Chao Peng, Haowen Shu & Xingjun Wang
Department of Electrical and Computer Engineering, University of California, Santa Barbara, CA, USA
Changhao Han & John E. Bowers
Key Laboratory of Information and Communication Systems, Ministry of Information Industry, Beijing Information Science and Technology University, Beijing, China
Jun Qin, Yu Sun & Junde Lu
Peking University Yangtze Delta Institute of Optoelectronics, Nantong, China
Yan Zhou, Zhangfeng Ge & Xingjun Wang
Peng Cheng Laboratory, Shenzhen, China
Yunhao Zhang, Haoren Wang, Lei Wang, Zhixue He, Shaohua Yu, Chao Peng & Xingjun Wang
Frontiers Science Center for Nano-optoelectronics, Peking University, Beijing, China
Chao Peng, Haowen Shu & Xingjun Wang

Contributions

The experiments were conceived by C.H. The devices were designed by C.H. The equalizers were developed by J.Q. The experiments were performed by C.H., Q.Y. and Y.Zhou, with assistance from Y.Zhang, H.W., L.W., Z.H., S.Y. and W.H. The slow-light theory was developed by Z.Z. and C.P. The data processing was conducted by C.H., J.Q., Y.S., J.L. and Y.Wu. The device characterization was conducted by C.H., Q.Y., Y.Zhou and Z.G. The results were analyzed by C.H., Q.Y., J.Q. and Y.Zhou. The figure optimization was conducted by C.H., Q.Y., Y.S., J.L. and Y.Wang. All authors participated in the discussion of the research. The project was supervised by H.S., J.E.B. and X.W.

Corresponding authors

Correspondence to Haowen Shu, John E. Bowers or Xingjun Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Emir Salih Magden and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Han, C., Yang, Q., Qin, J. et al. Exploring 400 Gbps/λ and beyond with AI-accelerated silicon photonic slow-light technology. Nat Commun 16, 6547 (2025). https://doi.org/10.1038/s41467-025-61933-5

Received: 22 September 2024
Accepted: 06 July 2025
Published: 16 July 2025
DOI: https://doi.org/10.1038/s41467-025-61933-5