Abstract
Silicon photonics is a promising platform for the extensive deployment of optical interconnections, with the feasibility of low-cost and large-scale production at the wafer level. However, the intrinsic efficiency-bandwidth trade-off and nonlinear distortions of pure silicon modulators result in the transmission limits, which raises concerns about the prospects of silicon photonics for ultrahigh-speed scenarios. Here, we propose an artificial intelligence (AI)-accelerated silicon photonic slow-light technology to explore 400 Gbps/λ and beyond transmission. By utilizing the artificial neural network, we achieve a data capacity of 3.2 Tbps based on an 8-channel wavelength-division-multiplexed silicon slow-light modulator chip with a thermal-insensitive structure, leading to an on-chip data-rate density of 1.6 Tb/s/mm2. The demonstration of single-lane 400 Gbps PAM-4 transmission reveals the great potential of standard silicon photonic platforms for next-generation optical interfaces. Our approach increases the transmission rate of silicon photonics significantly and is expected to construct a self-optimizing positive feedback loop with computing centers through AI technology.
Similar content being viewed by others
Introduction
With the rapid growth of global data volume in recent years, high-speed optical interconnection is seen as one promising approach to revolutionize high-performance computing centers1,2,3. Among the reported optoelectronic integration technologies, silicon photonics is an important platform with high complementary metal-oxide semiconductor (CMOS) compatibility, which brings the feasibility of low-cost and large-scale production at the wafer level4,5,6. However, as core electro-optical (EO) conversion devices, pure silicon modulators have a limited plasma dispersion effect7, thereby typically possessing low baud rates8,9. Nevertheless, the utilization of high-order formats can maximize the limited bandwidth resources of pure silicon modulators10. For optical interconnection of 1.6 TbE, intensity modulation/direct detection (IM/DD) represented by four-level pulse amplitude modulation (PAM-4) is standardized in IEEE Standard 802.3dj11. Compared to the complex coherent scheme, the IM/DD system, where the signal is modulated over the intensity of light, has been chosen as the definite technology route for short-reach optical connections in computing centers because of its ease of implementation12. But so far, due to the bandwidth-efficiency trade-off and nonlinear distortions, the growth trend of transmission rates based on silicon modulators has been slow over the years. Although the data rates have continued to increase with the proposal of various solutions13,14, the reported rates still exist a gap from single-lane 400 Gbps. This transmission bottleneck has raised concerns about the viability of pure silicon modulators for ultrahigh-speed scenarios, especially when compared to heterogeneous integration routes represented by thin film lithium niobate15,16,17,18,19, plasmonics materials20,21 and organic polymers22,23. For pure silicon modulators, it is challenging to increase the transmission rate significantly, while achieving a high integration density and a wide optical passband simultaneously13. Although silicon Mach-Zehnder modulators (Si-MZMs) possess the merits of large operating wavelength range, their excessively long modulation arms reduce integration density24,25,26,27,28,29,30,31,32; while silicon micro-ring modulators (Si-MRMs) achieve a compact footprint, the resulting narrow passbands and low thermal stability require additional feedback mechanisms33,34,35,36,37,38,39,40. These issues restrict the deployment of silicon modulators and raise concerns about the future development path of silicon photonics41. To overcome the above limitations, silicon slow-light modulators (Si-SLMs) are proposed as a promising approach on silicon-on-insulator (SOI) platforms42,43,44,45,46. Recently, the theoretical design and device-level research on Si-SLMs have been conducted, leading to a leap in bandwidth performance47. Therefore, the potential of high-bandwidth Si-SLM schemes inspires further exploration of the speed limit of silicon photonics by utilizing high-order formats. Also, developing compact Si-SLM systems to enhance the on-chip data-rate density of silicon platforms can reduce the wafer area budget for a certain data capacity. Accordingly, taking full advantages of Si-SLMs and adopting high-order formats to construct a high-capacity transmission system is needed for 1.6 TbE, and is expected to provide key solutions for next-generation 3.2 TbE optical interfaces.
In practical application scenarios, the adoption of equalization is necessary, especially for high-order modulation48. In most transmission works, linear equalizers such as feed-forward equalization (FFE) and decision-feedback equalization (DFE) are usually adopted and effectively mitigate linear distortions in the system49,50. However, in the process of compensating nonlinear distortions including nonlinear modulation and chirp characteristics, which is particularly important in pure silicon platforms, great challenges have occurred for above conventional equalization methods. On the other aspect, an increasing number of works utilize artificial intelligence (AI) technology to accelerate the discovery in sciences51. In the optoelectronics, silicon photonics has promoted the deployment of optical computing for AI applications52,53 (SiPh for AI), while AI technology can also be applied to realize the potential of silicon photonics54,55 (AI for SiPh). Specially, for signal processing applications, artificial neural network (ANN) equalizers have been proposed to construct a complex map with nonlinear boundaries between the input and output spaces56,57. Compared to conventional equalizers, ANN models the equalization as muti-level problems to mitigate the nonlinear distortions and can provide effective compensation solutions58. More importantly, ANN equalizers are particularly favorable for the optical transmission based on Si-SLMs. Although the bandwidth-efficiency trade-off of pure silicon modulation can be optimized effectively by introducing slow-light resonators, the physical principle of Si-SLMs is still the nonlinear plasma dispersion effect, and the nonlinear cosine transfer functions of the device architecture will also introduce signal distortions. Therefore, by using an ANN equalizer, the nonlinear distortions of Si-SLMs can be reduced. At the manufacturing level, achieving complete uniformity remains challenging in the doping process, while the depletion-mode PN junction is the modulation basis for Si-SLMs9,13. Simultaneously, although one-dimensional waveguide grating structure has improved the process stability compared to two-dimensional photonic crystals42,45, the structural fluctuations of doped silicon gratings still exist due to the fabrication limitations. Therefore, Si-SLMs require an adaptive technical approach at the back end to eliminate the variations caused by process deviations, thus making the deployment of large-scale high-density Si-SLM systems possible.
In this work, we propose an AI-accelerated silicon photonic slow-light technology to explore 400âGbps per wavelength transmission. Using a standard silicon photonic process, we fabricate an 8-channel wavelength-division-multiplexed (WDM) Si-SLM chip on a SOI platform. Benefiting from the innovative slow-light design, the intrinsic bandwidth-efficiency trade-off for pure silicon modulators is effectively mitigated. Under the precise optimization, our compact Si-SLMs exhibit an ultrahigh EO bandwidth of 90âGHz, a remarkable modulation efficiency of 0.82âVâ cm while possessing a wide optical passband of 7ânm around 1550ânm simultaneously. By utilizing the ANN equalizer, we demonstrate a total data capacity of 3.2 Tbps based on our thermal-insensitive Si-SLM chip, with all bit error rates (BERs) below hard-decision forward error correction (HD-FEC) threshold, leading to an on-chip data-rate density of 1.6âTb/s/mm2. Meanwhile, the transmission link does not require individual resonant wavelength adjustment and additional thermo-electric cooler (TEC) platforms, thus reducing the system budget. Notably, adopting the fundamental industry-standard IM/DD format PAM-4, we realize an unprecedented 400âGbps optical transmission per wavelength successfully, which is, to the best of our knowledge, the highest single-lane transmission rate achieved in standard silicon photonic platforms. Our work reveals the great potential of standard silicon photonic platforms in 3.2 TbE optical interconnections, validating the significant value of AI technology for silicon photonics.
Results
Device and chip design
The overall application architecture of the AI-accelerated slow-light technology is conceptually demonstrated in Fig. 1. Here, the AI-accelerated slow-light transceiver module provides an ultrahigh-speed solution for interconnections in computing centers (applications in photonics include neural networks, inverse design, dynamic simulation, etc.). Through collaborative integration with ANN, the Si-SLM system can achieve a leap in throughput per lane and elevates the interface rate of computing centers, thereby leading to faster training iterations for weight programming of ANN and enabling more immediate and efficient signal processing for the Si-SLM system. In turn, the enhanced transceiver module can further increase the interconnection rate for computing centers to achieve more powerful computility. This bi-directional promotion mechanism forms a self-optimizing positive feedback loop between AI-accelerated Si-SLM systems and computing centers, continuously improving the overall architecture performance.
Conceptual drawing of the application architecture of the AI-accelerated slow-light technology. The AI-accelerated slow-light transceiver module provides an ultrahigh-speed solution to increase interconnection rates in computing centers (applications in photonics include neural networks, inverse design, dynamic simulation, etc.). The enhanced computility can lead to faster training iterations for weight programming of artificial neural network (ANN), thus promoting the signal processing for silicon slow-light modulator (Si-SLM) systems to be more immediate. In turn, the enhanced transceiver module can further increase the interface rates of computing centers. Therefore, based on this bi-directional promotion mechanism, a self-optimizing positive feedback loop can be constructed to continuously improve the overall architecture performance.
Based on the above design concept, we designed an ultrahigh-speed WDM Si-SLM chip for computing centers. The slow-light effect is an effective approach to enhance the interaction between light and matter, which has been confirmed in both theory and experiment59,60,61,62,63,64. Especially, thanks to the CMOS compatibility of the silicon slow-light structure, Si-SLMs for ultrahigh-speed transmission can be fabricated under a standard silicon photonic process, without introducing complex manufacturing process and heterogeneous materials, and an 8-channel Si-SLM chip was realized on a 200-mm SOI wafer (Fig. 2a). Benefiting from the compact footprint and dense distribution of Si-SLMs, the actual modulation area (bottom) occupies only 4âmm à 0.5âmm, with the pitch of 500âμm between neighboring devices. Also, corresponding to the transmitter area, the 8-channel GeSi photodetectors (PDs) are also designed and arranged (top) for the complete transceiver. It can be seen that the adoption of Si-SLMs makes the transmitter area already close to the receiver area. For the optical ports, the edge coupler array (left) is used on the same side, and the three groups from bottom to top are modulator inputs, modulator outputs and PD inputs, respectively. Meanwhile, the direct-current (DC) pad array (right) is applied to control the modulator operating point effectively through the TiN heater of each device. For the single device, the architecture and morphology of one Si-SLM can be seen in Fig. 2b. Under a compact Mach-Zehnder interferometer (MZI) structure, the modulation arms can be shrunk to only 249âμm due to the slow-light effect, which is an order of magnitude shorter than conventional Si-MZMs, thus enhancing system integration density. Simultaneously, to enlarge the phase accumulation, also for the better photoelectric integration and low chirp, a dual-drive architecture of GSGSG-type radio-frequency (RF) electrodes is adopted here, which enables the modulator to operate under the push-pull driving mode. At the remote end of the RF electrodes, on-chip termination resistors are integrated to reduce microwave reflection. Near the waveguide output port, TiN heaters are adopted on the waveguides to control the operating point of the modulator (See Supplementary Section 1 for more details). In the modulation arms, the slow-light effect is generated by the coupled-resonator optical waveguide (CROW), which is a one-dimensional waveguide grating structure. The scanning electron microscope (SEM) image of the fabricated slow-light waveguide is also illustrated below the device photograph in Fig. 2b. Here, a complete resonator is constructed by a λ/4 phase shifter region with broader width in the middle position and an equal number of Bragg gratings on both sides with a period of around 300ânm, along the direction of light propagation in the waveguide. Each certain number of gratings construct one side beam of a resonator, and the two beams are separated by the phase shifter from their adjacent one. Therefore, the supercell is created through the phase shifter region, which leads to a mid-gap mode embedded in the bandgap. In our reconfigurable slow-light model, a finite number of resonators are cascaded to construct the modulation arm, and the structure parameters can be selected flexibly, according to specific application scenarios. Based on the grating waveguide configuration, the PN junction with a periodic structure is formed, which will work in the depletion mode under reverse bias voltage. Actually, the slow-light approach is a pure silicon solution with considerable high design freedom. For example, if the design goal is to push the bandwidth to the extreme limit, so as to achieve ultra-low-cost OOK transmission, then fewer resonators are required47. Here, for achieving a deeper modulation depth and better separation of ultrahigh-speed multi-level signals, we redesigned structures fully, cascading more resonators to enlarge the phase accumulation, and focus on the design of more resonators to achieve a balance between bandwidth and efficiency (Supplementary Section 2).
a Optical micrograph of the Si-SLM chip. An 8-channel Si-SLM array was fabricated on a 200-mm silicon-on-insulator (SOI) wafer, and the actual modulation area occupies only 4âmm à 0.5âmm, with the pitch of 500âμm between neighboring devices. b Device morphology of the fabricated Si-SLM. The modulator is under a Mach-Zehnder interferometer (MZI) structure, with the compact modulation arm of only 249âμm. A dual-drive architecture of GSGSG-type radio frequency (RF) electrodes is adopted, which enables the modulator to operate under the push-pull driving mode to enlarge the phase change. The scanning electron microscope (SEM) image of the fabricated slow-light waveguide is shown as the inset below. The waveguides are fabricated under a standard silicon photonic process and all feature sizes are suitable for the commercial silicon photonic foundry. c Measured electro-optic (EO) response under different bias voltages. The S21 response demonstrates that the modulator has an EO bandwidth of 90 GHz under the bias voltage of 3âV. The S11 response is maintained below â10âdB, indicating that the reflection of the modulator is weak. d Modulation efficiency under different wavelengths. An averaged enhanced modulation efficiency of 0.82âVâ cm is obtained due to the introduction of slow-light structure. e Linearity of the Si-SLM. The measured third-order intermodulation spurious free dynamic range (IMD3 SFDR) is 91.82âdB â Hz2/3, mainly due to the nonlinear limitation of the plasma dispersion effect in silicon materials. f Crosstalk of parallel Si-SLMs. The measured crosstalk is around 30âdB at 60âGHz and 20âdB at 90âGHz. Due to the ultra-short RF links, the crosstalk of the Si-SLM array is at a low level and the impact on parallel signal transmission is small. g Passbands of the 8-channel Si-SLM array. The modulator possesses a flat passband of 7ânm around 1550ânm, and the passbands are basically consistent between different devices. For wavelength-division-multiplexed (WDM) system, the specific wavelengths of 8-channels are from 1548ânm to 1555ânm, with the spacing of 1ânm.
Performance characterization
To prove the feasibility of the proposed Si-SLMs for high-order signal transmission, we comprehensively characterized the device performance in detail, including both dynamic and static parameters. First, we evaluated the high-frequency EO response of the designed device, which is a prerequisite for achieving ultrahigh-speed transmission. Also, for Si-SLMs, an ultrahigh EO bandwidth is the most prominent advantage, dictating the achievable data rate. Especially, considering that the ANN equalizer adopted later will improve signal quality, the first thing to ensure is that the signal can complete the entire EO conversion and transmission process with minimal RF loss. Here, to meet the demand of high-order modulation, we control the structure parameters to maximize the EO bandwidth while ensuring a balance between bandwidth and efficiency. We experimentally characterized the small-signal properties of the Si-SLMs through S-parameter measurements to obtain the EO bandwidth, including S21 transmission and S11 reflection responses. Through the careful regulation strategy, the modulator has an EO bandwidth of 90âGHz under the bias voltage of 3âV, as shown in Fig. 2c, which is a fairly high bandwidth value for pure silicon modulators. While the device bandwidth increases with the bias voltage, it already exceeds 70âGHz at only 1 V, and the low bias voltage requirement facilitates the co-packing with integrated driver chips. In terms of reflection performance, the S11 response is maintained below â10âdB, indicating that the reflection of the modulator is weak. Simultaneously, with the adoption of slow-light effect, the modulation efficiency can be improved by enhancing the interaction between light and matter. Here, we apply a low-frequency small signal to the device around the quadrature bias point and obtain the modulation efficiency by fitting the phase variation at multiple wavelengths. Figure 2d demonstrates the modulation efficiency measured under different wavelengths. The modulation efficiency increases slightly closer to the edge of the passband, mainly due to the relatively higher group index at the band edge region. In the passband around 1550ânm, an averaged modulation efficiency of 0.82âVâ cm is obtained, which is also a remarkable value for pure silicon modulation based on plasma dispersion effect. Therefore, by adopting the slow-light approach, we achieved improvements in both bandwidth and modulation efficiency for pure silicon modulators simultaneously. As an approach that can be utilized on SOI platforms, the slow-light structure can mitigate the intrinsic bandwidth-efficiency trade-off in silicon material, achieving the promotions in both EO bandwidth and modulation efficiency. For the optical loss of the modulator, the unoptimized coupling loss between a pair of edge couplers and the fibers is 10âdB (5-dB loss for each one), and the insertion loss of the Si-SLMs is measured to be 10.5âdB (9.1âdB from the modulation arms, remaining from directional couplers and routing waveguides). The optical loss can be further reduced by improving the manufacturing process or introducing a transition structure between the conventional waveguide and the slow-light waveguide. Moreover, by embedding CROW into the modulation arms, the advantages of MZI can be leveraged to maintain a sufficient static extinction ratio (ER) while reducing the device footprint. Through the standard thermal tuning mechanism by TiN heaters, the static ER of the compact modulator is measured to be 36âdB, provided by the resonator-assisted MZI architecture. Next, to characterize the linearity for measuring the multi-level separation capabilities in high-order transmission, we further examined the third order intermodulation (IMD3) spurious free dynamic range (SFDR). For our Si-SLMs, the measured IMD3 SFDR is 91.82âdBâ Hz2/3, shown in Fig. 2e, which is similar to conventional silicon modulators, mainly due to the nonlinear limitation of the plasma dispersion effect of silicon materials, together with the nonlinear cosine transfer functions of the MZI architecture. As a pure silicon device, the physical essence of Si-SLMs is still based on the nonlinear silicon modulation, and the mitigation for subsequent nonlinear distortions is a challenge especially for ultrahigh-speed transmission, which is exactly what the ANN model excels at.
Afterwards, we experimentally characterized the performance of the modulator array on the Si-SLM chip. For WDM modulators, the channel crosstalk is a significant performance indicator to evaluate the overall transmission quality. Also, in terms of the designed layout of the modulator chip, to increase the rate density, a dense arrangement of devices is necessary. Here, the pitch of the modulator array is 500âμm (Fig. 2a), resulting in a minimum distance of only 20âμm between modulators. Nevertheless, Si-SLMs are favorable for dense array designs, because the ultra-compact footprint of Si-SLMs can shorten RF links (approximately ten times that of Si-MZMs), which will reduce the crosstalk between parallel signals at an ultrahigh-frequency region. To quantify the crosstalk, we tested the EO response of parallel modulators by applying a small RF signal on one channel (Ch4) and simultaneously receiving the transmitted signal on the other channels (Ch1, Ch2, Ch3, Ch5, Ch6, Ch7 and Ch8). Figure 2f demonstrates the experimental result of crosstalk, in which the curve of Ch4 is its own anticipated EO bandwidth curve, and the response curves of other channels illustrates the measured crosstalk is around â30âdB at 60âGHz and â20âdB at 90âGHz. In the ultra-compact and ultrahigh-density chip layout, the crosstalk of the WDM Si-SLMs remains at a low level and the impact on parallel signal transmission is small due to the ultra-short RF links. Furthermore, we tested the spectral performance for all channels on the same WDM Si-SLM chip. In practice, the optical passband of the modulator is the reflection of the optical bandwidth, and a wide passband is essential for multi-wavelength applications with high thermal robustness. Theoretically, for the photonic bandgap of the designed structure, the topologically mid-gap mode is generated in the bandgap between the antisymmetric and symmetric transverse electric bands65,66,67,68, as the mode adopted for practical communication application. Thus, the supercell band opens multiple windows including one large passband around 1550ânm. By cascading more resonant cavities, the passband can be kept wide enough while ensuring sufficient efficiency. Here, under careful regulation, the fabricated Si-SLM possesses a flat passband (mid-gap mode) of 7ânm around 1550ânm, with an out-of-band rejection ratio of 50âdB, shown in Fig. 2g. Meanwhile, the high passband conformity between devices can demonstrate that the fabricated Si-SLMs have a relatively favorable process consistency in general. However, from the spectral results, it can be seen that a wavelength shift still occurs, due to the unavoidable fabrication deviations. Although the manufacturing uniformity of Si-SLMs has been improved by introducing one-dimensional CROW compared to two-dimensional photonic crystals, the performance fluctuations between doped silicon grating structures still exist. Therefore, for large-scale Si-SLM systems, the AI equalization at the back end is necessary to minimize device performance fluctuations caused by process errors. Despite this, on the other aspect, the appropriate slight wavelength shift can still extend the available passband to 10ânm on one chip and the passband resources have not been fully utilized yet. Here, for our designed WDM Si-SLM system, the specific wavelengths of 8-channels are from 1548ânm to 1555ânm, with the spacing of 1ânm (Fig. 2g). By increasing the wavelength operating range (e.g. 10ânm), a larger wavelength interval (e.g. 1.25ânm) can be adopted between channels. It is worth noting that, thanks to the wide passband and high uniformity of Si-SLMs, compared with Si-MRMs, there is no need to design separately to obtain individual operating wavelengths for different channels, nor to find the precise wavelength operating point between different resonance periods. Moreover, the compact Si-SLMs possess high thermal robustness, which can reduce the requirements for additional TEC operating platforms, thus saving the system budget.
In short, optimizing the bandwidth-efficiency trade-off while maintaining a sufficiently wide passband through Si-SLMs is the device-level guarantee for achieving ultrahigh-speed transmission per lane. Based on the designed Si-SLMs, the compact WDM chip with low crosstalk and flat passbands can realize ultra-high integration density and lead to a remarkable total capacity.
DNN transmission
For the system level, based on our carefully designed high-bandwidth Si-SLMs, we build an 8-channel WDM Si-SLM transmission system, which is demonstrated in Fig. 3a. On each channel, the Si-SLM encodes the carrier into PAM-4 signal format at different symbol rates. At the receiving side,the signal can be partly coupled to an on-chip GeSi PD on the same chip, whereas the remaining part is sent into a commercial PD. The transmitted ultrahigh-speed signals are recorded by a real-time oscilloscope and then processed with AI equalizers. For the algorithm level, leveraging AI technology to improve the transmission capacity, we adopt the ANN equalizers to improve the nonlinear limitation of conventional equalizers such as FFE and DFE. Although the bandwidth-efficiency trade-off has been mitigated, the Si-SLMs will still introduce nonlinear distortions during signal transmission due to the nonlinear modulation of the pure silicon material and the nonlinear cosine transfer functions of MZI architecture. At the system level, the primary electrical nonlinear sources originate from RF components due to the inherent nonlinear behavior of charge transport at the transistor junction, which is predominantly observed in the transistor driver (DRV) and transimpedance amplifier (TIA). Also, the utilization of erbium-doped fiber amplifiers (EDFA) could introduce nonlinear Kerr impairments to the optical link. Therefore, the resulting nonlinear signal distortions become the bottleneck for improving the signal quality, especially for the multi-level formats at ultra-high speeds. The ANN equalizers, which specialize in compensating nonlinear distortions, are exactly suitable for application in the Si-SLM system.
a Schematic of the WDM Si-SLM transmission system. On each channel, the Si-SLM encodes the carrier into four-level pulse amplitude modulation (PAM-4) signal format at different data rates. The transmitted signals are first sampled by a real-time oscilloscope and afterward processed with AI equalizers offline. b The principal structure of DNN equalizer. In the experiment, the input layer is set with 25 neurons and two hidden layers with 30 neurons are adopted.
To minimize computing resources, we develop a relatively simple ANN model first, extend the network to a deep-learning neural network (DNN) equalizer with only two hidden layers to enhance the system performance. In the DNN, to classify the signal into four categories for PAM-4 signal58, an activation function with four saturation level regions is implemented through equation \(f(x)=2{\eta }_{2}/(1+{e}^{-{\eta }_{1}(x-2\alpha )})-{\eta }_{2}+2\alpha\), where α equals to â1, 0, 1 when xââ¤ââ1, â1 < xââ¤1 and x > 1, respectively, and \({\eta }_{2}=(1+{e}^{-{\eta }_{1}})/(1-{e}^{-{\eta }_{1}})\). The constructed f(x) has four saturation regions which are close to the amplitude of PAM-4 (â3, â1, 1, 3) which makes it suitable for PAM-4 equalization. Similarly, it can be extended to 8-level sigmoid function which can be used to equalize the modulated signals such as PAM-8. Details can be found in Supplementary Section 3. Two hidden layers are incorporated in the DNN. Figure 3b shows the tap-delay two hidden layer DNN equalizer structure. The activation function in each neurons determines the nonlinear mapping characteristic across the network. The weight optimization process involves a gradient descent scheme coupled with the error back-propagation algorithm to iteratively update the network parameters.
Since 224âGbps PAM-4 is the standard rate for 1.6âT interfaces, we focus on the 224âGbps PAM-4 signal transmission here. Thanks to the ultrahigh-bandwidth and wide flat passband of Si-SLMs, we obtained clear eye diagrams of 224âGbps PAM-4 together with 200âGbps PAM-4 at all channels of different wavelengths (1548â1555ânm, with 1ânm spacing, Ch2, Ch4, Ch6 and Ch8 with corresponded wavelengths are demonstrated here), shown in Fig. 4a, with all BERs lower than 2 Ã 10â2 (transmission results for all channels are shown in Supplementary Section 3), leading to a total capacity of 1.6âTbps. Specifically, Fig. 4b demonstrates the BERs at different wavelengths in the passband for 8 channels, with the PAM-4 signal speed set as 130âGbps, 170âGbps, 200âGbps and 224âGbps, and the consistency between all WDM channels is favorable. Moreover, the BER curves with increasing data rates of different channels (Ch2, Ch4, Ch6 and Ch8 are shown for example here) are illustrated in Fig. 4c, while the almost overlapped lines confirm the uniformity between different channels.
a 224âGbps and 200âGbps PAM-4 eye diagrams of different channels. Ch2, Ch4, Ch6 and Ch8 are shown here. b Bit error rate (BER) curves of different data rates under different wavelengths for 8 channels. The PAM-4 signal speeds are set as 130âGbps, 170âGbps, 200âGbps and 224âGbps. The consistency of different wavelengths between channels is favorable. c BER curves with the increasing data rates. Ch2, Ch4, Ch6 and Ch8 are shown here. The almost overlapped lines confirm the uniformity between different channels. d BERs of different patterns under different data rates. There is a similar performance under different signal pattern, which indicates that the implemented DNN equalizer has a robust equalization ability for different data patterns and captures channel response instead of the generation pattern of the pseudo random bit sequence (PRBS). e BER curves of different neuron numbers in the input layer. The BER performance improves when the number of input neurons increases. After reaching a certain neuron count, the BER performance ceases to improve further. f BER curves of different neuron numbers in the hidden layer. No significant variation of BERs is observed as the number of neurons in the hidden layer increases.
Furthermore, we verify that the implemented DNN equalizer captures channel response instead of the generation pattern of the pseudo random bit sequence (PRBS). Different patterns of PRBS are generated, Pattern 0 is used to train the DNN, the others (Pattern 1, 2 and 3) are used for BER test. The results of 170âGbps, 200âGbps and 224âGbps are shown in Fig. 4d, there is a similar performance under different signal patterns, which indicates that the DNN equalizer learns the information of channel rather than the signal pattern and has a robust equalization ability for different data patterns. For the neuron numbers, the BER performance of the DNN initially for all data rates improves when the number of input neurons increases, as shown in Fig. 4e. However, after reaching a certain neuron count, such as 25 neurons, the BER performance ceases to improve further. Meanwhile, as the number of neurons in the hidden layer increases, there is no significant (exponential order) variation in the BER performance, as depicted in Fig. 4f. In the experiment, the input layer is set with 25 neurons and two hidden layers are set with 30 neurons.
As a relatively simple AI equalizer, our designed DNN equalizer with only two hidden layers takes up less computing resources, but still achieves high-quality transmission for 1.6 TbE interface with 224 Gbps per lane. As the simplest high-order modulation format, PAM-4 is highly feasible for data center scenarios. Therefore, it is crucial to realize optical transmission of 224 Gbps PAM-4 per channel based on a Si-SLM chip. Furthermore, based on 224 Gbps PAM-4 transmission per lane, a 3.2 TbE interface can be realized by scaling out the channels. More importantly, the high-quality PAM-4 results of DNN equalizer which takes less resources demonstrate the potential of ANN solutions and its adaptability to slow-light systems, which encourages us to continue developing more efficient ANN equalizers.
GRU transmission
By adopting the designed simple DNN equalizer with less computing resources, we achieve a total capacity of 1.6âTbps transmission with 224âGbps PAM-4 per lane based on our WDM Si-SLM chip. This remarkable result demonstrates the high compatibility between the AI-accelerated technology and Si-SLM systems. Therefore, for the purpose of exploring the potential of AI-accelerated slow-light systems, we develop a more efficient ANN equalizer to push the silicon photonics transmission to the next stage.
Recurrent neural networks (RNN) are widely utilized for their proficiency in modeling temporal dynamics and processing sequential data. In principle, bi-directional RNNs (bi-RNNs) can efficiently handle not only inter-symbol interference among preceding and succeeding symbols caused by chromatic dispersion, but also the nonlinear impairments caused by devices and fiber transmission links69,70,71. Additionally, compared to unidirectional RNN, bi-RNNs model the dependence on past and future states. A gated recurrent unit (GRU) contains two gates including reset gate and update gate70,72. GRU is a less complex variant compared to the long short-term memory (LSTM)70, which is an advanced type of RNN that demonstrates robust capabilities for capturing and modeling long-term dependencies57. The detailed structure of a GRU unit is demonstrated in Fig. 5a. The bidirectional GRU (bi-GRU) model comprises two unidirectional GRU layer operating in opposite directions. By combining forward and backward GRU processing, the model incorporates information from both the future and the past to influence its current states. Figure 5b shows the architecture of the implemented bi-GRU network for nonlinear equalization in our Si-SLM transmission system. The first layer is the input layer, where the current symbol xi is enclosed with its k preceding and k succeeding symbols. This sequence serves as the input to the bi-GRU network. The subsequent layer is the GRU layer, which consists of two GRU links. The input sequence is first passed through the initial GRU layer. Subsequently, the sequence is reversed and processed through the second GRU layer.This approach allows us to process both preceding and subsequent data simultaneously, enabling information from both the past and future to influence the current states. The outputs of the bi-GRU model layer are fully connected to a linear layer. As a result, the output layer produces the predicted class for the current symbol. Consequently, the predicted class yi for the i-th symbol xi is determined. Some reports employ bi-GRU to compensate the fiber nonlinear impairment during the long-distance transmission in coherent system70,73, but there are still less bi-GRU performance evaluation reports in short-reach IM/DD systems, especially for systems that employ silicon devices with data rates around 400âGbps in a high-density silicon chip. In this work, we employ the bi-GRU network to promote the transmission performance. Meanwhile, based on bi-GRU, a new three-layer GRU equalizer (T-biGRU) is further proposed and implemented. The T-biGRU model is determined based on the state of three GRU layers. The first GRU layer processes the data in a forward direction, starting from the beginning of the sequence. The second GRU layer processes data in a backward direction to capture reverse temporal dependencies. The third GRU layer, like the first, processes data in a forward direction from the beginning of the sequence (see Methods). The hidden state encapsulates the flow of symbolic information across recurrent time steps, ensuring continuity and context throughout the sequence. The bi-GRU model relies on the states of two GRU layers, whereas the T-biGRU model utilizes the states of three GRU layers. By integrating forward, backward and repeated forward GRU processing, the T-biGRU model more comprehensively extracts both global and local features of the sequence, thereby further enhancing equalization performance.
a Scheme of one gated recurrent unit (GRU) unit. The detailed structure of a GRU unit includes a reset gate and an update gate. b The architecture of bi-GRU equalizer. The input layer, the GRU layer consisting of two GRU links, the linear layer, the ReLU layer and the output layer are included in the implemented bi-GRU network. c bi-GRU PAM-4 eye diagrams of different data rates. 200âGbps, 240âGbps, 280âGbps, 320âGbps, 360âGbps and 400âGbps for Ch2, Ch4, Ch6 and Ch8 are shown here, with all BERs under hard-decision forward error correction (HD-FEC) threshold. d BER curves of all 8-channel PAM-4 signals with respect to different data rates. The passband consistency remains favorable and a total data capacity of 3.2 Tbps is achieved based on the Si-SLM chip. e BER curves of PAM-4 signal with increasing data rates of different channels. Ch2, Ch4, Ch6 and Ch8 are shown here. The variation trend of BER curves is relatively smooth, and no point where the BER abruptly raises is observed. The inset is the constellation of 400âGbps PAM-4 transmission.
Next, we evaluated the practical PAM-4 transmission for the Si-SLM chip by employing bi-GRU and T-biGRU, as depicted in Figs. 5â9. Here, our goal is to increase the single-lane data rate of Si-SLMs significantly by AI approach. However, the frequency broadening brought by IM/DD signals cannot be ignored at ultrahigh-speed data rates. Therefore, in the practical GRU WDM transmission experiment, we select on-chip channels Ch1, Ch3, Ch5, Ch7 (odd channels) as one WDM transmission path and Ch2, Ch4, Ch6, Ch8 (even channels) as the other WDM transmission path to reduce modulation crosstalk by increasing the spectral separation between channels. After assembling each group of on-chip signals through WDM, the two paths are then transmitted in parallel to reach a total capacity of 3.2âTbps, similar to the solution adopted by commercial optical module manufacturers74. By adopting bi-GRU equalization, we can realize high-quality eye diagrams of PAM-4 signal up to 400âGbps, indicating 3.2 Tbps aggregation data rate of 8 parallel channels. Figure 5c summarizes the optical eye diagrams of PAM-4 signals at gradually increasing rates for different channels, in which the horizontal axis shows the different channels (Ch2, Ch4, Ch6 and Ch8 are shown for examples here) in the WDM system together with their corresponding wavelengths, and the vertical axis shows the different PAM-4 transmission rates (280âGbps, 320âGbps, 360âGbps and 400âGbps are demonstrated). All the eye diagrams are quite clear and the BERs are all below HD-FEC threshold, even up to 400âGbps. Actually, taking 280âGbps PAM-4 as an instance, which is already a fairly high rate for pure silicon modulators, our solution can reduce the BERs to only around 10â5 order at this rate. More importantly, even when the rate increases to 400âGbps, the BERs still maintain below HD-FEC threshold, and the performance consistency between channels does not show downward trend. To the best of our knowledge, this is the highest single-lane IM/DD transmission rate for silicon modulators. In particular, we achieve this goal only using the industry-standard format PAM-4 for computing centers. Simultaneously, the wide optical passbands of Si-SLMs provide the prerequisite for multi-wavelength communication. The BERs at different wavelengths of all channels are well below the HD-FEC threshold for the whole passband, as demonstrated in Fig. 5d, and the passband consistency is favorable, which is critical for multi-wavelength applications. By leveraging single-lane 400âGbps PAM-4 transmission, a total data capacity of 3.2âTbps is achieved based on the 8-channel Si-SLM chip with a compact modulation area of only 4 mm à 0.5âmm, leading to a remarkable on-chip data-rate density of 1.6âTb/s/mm2. Based on the improved data-rate density, more modulator arrays can be integrated on a single wafer with the same tape-out budget, thereby significantly increasing the total transmission capacity for one chip, or reducing the average cost for a certain targeted capacity. For the trend of BERs with increasing rates, the BER curves for different channels are demonstrated in Fig. 5e, and the nearly overlapped curves illustrate the performance uniformity between different channels. Especially, the variation trend of BER curves is relatively smooth, and no point where the BER abruptly raises is observed with the increasing rates. Also, the PAM-4 constellation at 400âGbps is illustrated as the inset, high-quality differentiation between the four levels can be observed. For complete results, all the PAM-4 eye diagrams, constellations and corresponded BERs for all channels and data rates are summarized in Supplementary Section 4.
Moreover, similarly like the case of DNN, the system performance of bi-GRU for data rates of 360âGbps, 380âGbps and 400âGbps under different patterns of PRBS are analyzed, as shown in Fig. 6a. There is a similar performance under different signal patterns, which proves that the implemented bi-GRU equalizer captures the channel response rather than PRBS pattern. Normally, as shown in Fig. 6b, with more GRU units in the hidden layer, the BER will be better. From 340âGbps to 400âGbps, it is effective to reduce BERs by changing GRU numbers, and there is no obvious downward trend found in this effectiveness, which reflects the applicability of the GRU solution to higher rates. In the specific experiment, 200 GRU units is chosen for the equalizer here. In more detail, the comprehensive discussions on AI equalizer network configurations are provided in Supplementary Section 5. Simultaneously, the robustness of the AI network equalizers is also analyzed and the improvement methods are given in Supplementary Section 7.
a BER performance for date rates of 360âGbps, 380âGbps and 400âGbps under different patterns of PRBS. A similar performance under different signal pattern is shown, verifying that the implemented GRU equalizer has a robust equalization ability for different data patterns and captures the channel response rather than PRBS pattern. b BER curves of different GRU numbers in the hidden layer from 340âGbps to 400âGbps. With more GRU units in the hidden layer, the BERs will be better. In the experiment, 200 GRU units is chosen for the equalizer.
The PAM-4 results based on the AI-accelerated slow-light technology demonstrate the great potential of silicon modulators for ultrahigh-speed applications. The ability of pure silicon modulators to achieve such high rates and maintain low BERs in PAM-4 signals, the simplest high-order format that complies with IEEE standards, is an encouragement for the deployment of silicon photonics. Moreover, in order to verify the scalability of our AI-accelerated slow-light system for higher-order signals, we utilized high-speed PAM-8 signal onto the Si-SLM chip. Even for PAM-8 signals with more levels compared to PAM-4, the bi-GRU equalizer still possesses a powerful ability for mitigating the nonlinear distortions in muti-level signals. Here, the specific PAM-8 eye diagrams at 1550ânm (Ch3) from 240âGbps to 390âGbps are demonstrated in Fig. 7a, with all BERs below HD-FEC threshold. Also, the constellations corresponded to each data rates are illustrated, with favorable separation between the 8-levels. For WDM systems, similar to the PAM-4 results, the BERs for all wavelengths in 8 channels are still below HD-FEC threshold in Fig. 7b (all the PAM-8 results are demonstrated in Supplementary Section 4). Figure 7c illustrates the BER curves changing with data rates, and the consistency between channels are quite well. Compared with the PAM-4 format, when the data rates is around 240âGbps, the BERs of PAM-8 is slightly lower than BERs of PAM-4. This is because the baud rate of PAM-8 (80âGbaud) is at a fairly low level, while the PAM-4 baud rate already reaches 120âGbaud. For a relatively low baud rate especially under 100âGbaud, the PAM-8 signal is more advantageous. However, as the data rate continues to rise, the BERs of PAM-8 increase indeed faster than that of PAM-4, and the two are basically the same when the data rate reaches around 400âGbps. The reason is that, as the data rate rises, the nonlinear compensation required for PAM-8 with more signal levels at ultrahigh-speed region gradually increases compared with PAM-4 under the same linearity. Despite this, it is effective to increase the total rate under the limited baud rate by adopting advanced formats with more levels, and our approach can also adapt to the higher-order formats.
a bi-GRU PAM-8 eye diagrams and constellations of different data rates at 1550ânm. 240âGbps, 270âGbps, 300âGbps, 330âGbps, 360âGbps and 390âGbps are shown here, with all BERs under HD-FEC threshold. Favorable separations are achieved between the 8-levels. b BER curves of all 8-channel PAM-8 signals with respect to different data rates. Similarly to PAM-4 signal, all the BERs for different wavelengths in 8 channels are below HD-FEC threshold with high passband consistency. c BER curves of PAM-8 signal with increasing data rates. Ch2, Ch4, Ch6 and Ch8 are shown here. The nearly overlapped curves illustrate the performance uniformity between different channels.
The above experimental results are based on back-to-back (B2B) scenario, and the transmission penalty for 100âm, 200âm and 300âm standard single-mode fiber (SSMF) is then experimentally assessed for both PAM-4 and PAM-8 signals with the bi-GRU equalizer. Considering the consistency of all channels has been proven to be favorable, the BER penalty is illustrated for one channel (Ch3) at 1550ânm. The transmission results for different SSMF distances are demonstrated in Fig. 8, in which Fig. 8a shows the BERs of PAM-4 (280âGbps, 320âGbps, 360âGbps, 400âGbps) and Fig. 8b shows the BERs of PAM-8 (300âGbps, 330âGbps, 360âGbps, 390âGbps). Due to the impact of the power fading effect in optical fiber, the BER performance gradually degrades with increasing transmission distance compared to B2B scenario for both cases of PAM-4 and PAM-8. For the transmission distance of 300âm, the BERs still remain below HD-FEC threshold at 360âGbps, but deteriorate beyond HD-FEC threshold at 400âGbps.
a PAM-4 signal of 280âGbps, 320âGbps, 360âGbps, 400âGbps for B2B, 100âm, 200âm and 300âm SSMF. b PAM-8 signal of 300 Gbps, 330 Gbps, 360 Gbps, 390 Gbps for B2B, 100âm, 200âm and 300âm SSMF. The bi-GRU equalizer is utilized here. For the transmission distance of 300âm, the BERs remain below HD-FEC threshold at 360âGbps, but deteriorate beyond the HD-FEC threshold at 400âGbps.
Furthermore, the transmission results of employing T-biGRU equalizer (Fig. 9a) are evaluated for both PAM-4 (400âGbps in Fig. 9b) and PAM-8 (390âGbps in Fig. 9c). For BER curves of different channels (Ch2, Ch4, Ch6, Ch8), PAM-4 results (280 Gbps, 320âGbps, 360âGbps, 400âGbps) are shown in Fig. 9d, while PAM-8 results (300âGbps, 330âGbps, 360âGbps, 390âGbps) are shown in Fig. 9e. The results indicate that by employing T-biGRU equalizer, the BER performance for around 400âGbps transmission of all channels can be improved to smaller than 10â3, which is slightly better than the bi-GRU case. The T-biGRU results for all eye diagrams, constellations and BERs corresponding to Fig. 9 are demonstrated in Supplementary Section 4. Meanwhile, the performance comparison of three AI network equalizers (DNN, bi-GRU, T-biGRU) together with traditional algorithms (DFE, VNLE) is shown in Supplementary Section 6. However, it is worth noting that the computational complexity of the T-biGRU is 1.5 times that of the bi-GRU (see Methods), leading to increased multiplications and higher power consumption (Supplementary Section 8). In the equalizer selection, it is important to strike a balance among performance, computational complexity, and power consumption to achieve the desired system performance effectively.
a The architecture of T-biGRU equalizer. By integrating forward, backward and repeated forward GRU processing, the T-biGRU model extracts global and local features of the sequence more comprehensively, thereby further enhancing equalization performance. b T-biGRU PAM-4 eye diagram and constellation at 400âGbps. c T-biGRU PAM-8 eye diagram and constellation at 390âGbps. d BER curves of PAM-4 signal with increasing data rates (280âGbps, 320âGbps, 360 Gbps and 400âGbps) of different channels. e BER curves of PAM-8 signal with increasing data rates (300âGbps, 330âGbps, 360âGbps and 390âGbps) of different channels. Ch2, Ch4, Ch6 and Ch8 are shown here, with all BERs lower than 10â3.
Discussion
To explore 400âGbps per wavelength of silicon photonics, we comprehensively optimize the device metrics with careful design. By adopting the slow-light approach, the bandwidth-efficiency trade-off for pure silicon modulators is greatly optimized, paving the way for 400âGbps transmission per wavelength. Our compact Si-SLM presents an ultrahigh EO bandwidth of 90âGHz and a remarkable modulation efficiency of 0.82âVâ cm, while possessing a wide optical passband of 7ânm around 1550ânm. Thanks to the high CMOS compatibility of Si-SLMs, we fabricate an 8-channel Si-SLM chip on a SOI platform using a standard silicon photonic process. In the fabrication flow, no additional heterogeneous materials and complex process are introduced, and this superiority provides a prerequisite for large-scale and low-cost production at the wafer level. Our Si-SLM chip, with its favorable basic performance, provides a foundation for transmission capacity enhancement, and the advantage of wafer-level production utilizing the standard silicon photonic process ensures the feasibility of extensive deployment of this solution.
More specifically, Table 1 summaries the properties of the representative results of Si-MZMs, Si-MRMs and Si-SLMs on SOI platforms in recent years. From the comparison, we can see that the metrics including bandwidth, modulation efficiency, and passband of our designed Si-SLMs are all relatively prominent for pure silicon modulators. While the bandwidth of our device exceeds that of previous modulators, the modulation efficiency has been improved compared to Si-MZMs, and the optical passband is greater than that of Si-MRMs (a full width at half maximum in one resonance period). Also, the transmission performance of our device is ahead of other Si-SLMs. It can be seen that, in recent years, although the transmission rate has continued to raise with the proposal and optimization of various solutions, the overall upward trend is still slow. Notably, our modulator has achieved single-lane 400 Gbps transmission using the industry-standard format PAM-4, with all BERs below HD-FEC threshold in a wide optical passband. As the highest single-lane rate demonstrated in standard silicon photonic platforms so far, this record-high result reveals the great high-speed transmission potential of silicon photonics.
The precise adoption of AI technology is an important guarantee for realizing the potential of our Si-SLM chips, and the slow-light systems are well suitable for accelerated driving with ANN equalizer. While the slow-light approach has maximized the overall performance of pure silicon modulators, its physical essence is still based on nonlinear modulation of doped silicon material (the prerequisite for large-scale industrial application), and the ANN model specializes in compensating for nonlinear distortions exactly. By embedding ANN into the signal processing workflow of Si-SLMs, the intrinsic nonlinear distortions in pure silicon devices can be effectively mitigated. At the manufacturing level, since silicon material does not possess a high and consistent EO coefficient, forming PN junction is the basis for realizing pure silicon modulation, while achieving complete uniformity in doping process is still challenging. Simultaneously, although the CMOS-compatible waveguide grating structure has already improved process stability compared to photonic crystals, the structural fluctuations of doped silicon gratings still exist due to the unavoidable fabrication deviations. Therefore, through AI equalization at the back end, the variations between Si-SLMs caused by process errors can be eliminated, thereby making the deployment of large-scale transmission systems possible. Here, three ANN models have been developed here, including the DNN, bi-GRU and T-biGRU equalizers. First, we adopt the simple DNN equalizer with only two hidden layers, which takes up less computing resources, and achieve a total capacity of 1.6 Tbps with 224âGbps PAM-4 per lane under the IEEE rate standard. More importantly, by adopting the bi-GRU equalizer, we achieve an ultrahigh-speed transmission of 400âGbps per lane and a total capacity of 3.2âTbps based on the Si-SLM chip, utilizing the standard high-order format PAM-4, with all BERs under HD-FEC threshold. Considering the ultra-compact occupied area (4âmm à 0.5âmm), we achieved an on-chip data-rate density of 1.6âTb/s/mm2. Meanwhile, benefiting from the wide flat passband around 1550ânm, the whole links are without individual resonant wavelength adjustment and TEC operating platforms, thus reducing the system budget. Next, we have verified the feasibility of the PAM-8 signal, demonstrating the applicability of our AI-accelerated slow-light solution for higher-order data transmission with more levels. Furthermore, the transmission performances of employing the T-biGRU equalizer are evaluated, and the BERs for around 400âGbps transmission can be slightly improved for both PAM-4 and PAM-8 signals.
In addition, this work illustrates the potential of AI technology in photonics filed, and the proposed AI-accelerated Si-SLM system is a good exemplification of "AI for SiPh" together with "SiPh for AI". For the optical interconnections in computing centers, the AI-accelerated Si-SLM technology provides an ultrahigh-speed deployment solution that enables a leap in single-lane transmission rate of the interfaces. Therefore, the computility of the computing centers can be improved, thus leading to faster training iterations of the ANN. This enhanced ANN will promote the signal processing for Si-SLM systems to be more immediate, thereby further improving the interconnection rates for computing centers. In short, a self-optimizing positive feedback loop between computing centers and Si-SLM systems through ANN can be constructed based on this bi-directional promotion mechanism, leading to a continuous improvement in the whole architecture. Especially, through the efficient photonic neural networks, the deep fusion of optical interconnection and optical computing systems based on this synergy can capitalize on the inherent advantages of photonics to facilitate real-time system optimization.
In summary, we have demonstrated the first pure silicon photonic system with a total capacity of 3.2âTbps based on AI-accelerated slow-light technology on a SOI platform. Integrating AI technology with Si-SLM systems offers a promising pathway to overcome the limitations of silicon photonics in ultrahigh-speed transmission. By utilizing the industry-standard IM/DD format PAM-4, we achieve 400âGbps/λ optical transmission in all wavelength channels, with BERs below HD-FEC threshold. To our best knowledge, this is the highest single-lane transmission rate demonstrated in standard silicon photonic platforms. Under precise regulation, our compact thermal-insensitive Si-SLM chip possesses an ultrahigh bandwidth, a remarkable modulation efficiency, a wide optical passband simultaneously, leading to an on-chip data-rate density of 1.6âTb/s/mm2, without individual resonant wavelength adjustment requirements and additional TEC operating platforms. This proposed AI-accelerated slow-light technology increases the transmission rate of silicon photonic significantly, paving the way for 400âGbps optical transmission per lane. Our work demonstrates the great value of AI for silicon photonics and highlights the potential of standard silicon photonic platforms for next-generation optical interconnections of 3.2âTbE and beyond.
Methods
Fabrication of the devices
The silicon slow-light structure is CMOS-compatible, thus the Si-SLMs can be fabricated under a standard silicon photonic process, leveraging the industrial advantages of silicon photonics. The Si-SLM chips adopt a standard 90ânm photolithography process on a 200-mm SOI wafer with a silicon thickness of 220ânm at CompoundTek Pte. The 90-nm-thick rib area is etched partially for carrier doping and metal contact in 220-nm-thick silicon. The concentrations of P-type and N-type doping in the depletion-mode PN junction are both 5.0 Ã 1017/cm3, to ensure a sufficiently high electrical bandwidth. All feature parameters of the slow-light waveguide satisfy the requirements of the commercial silicon photonic foundry, providing a basis for large-scale and low-cost production at the wafer level.
Experimental details
In the electro-optical response test, a vector network analyzer (Keysight PNA-X Network Analyzer N5247B) with its lightwave component analyzer (Keysight LCA Optical Receiver N4372E) is adopted to measure the bandwidth. For the optical test, the spectra are obtained by a high-resolution optical spectrum analyzer (Yokogawa AQ6370C). For the transmission experiments, the high-speed signal is generated by an arbitrary wave generator (Keysight M8199B), and then the differential signals are amplified by a commercial SHF single-ended driver to obtain a 5-V Vpp and then injected into the modulator working under the dual-drive push-pull configuration. After the electro-optical conversion, the modulated optical signal is amplified by an Amonics pre-amp erbium-doped fiber amplifier to compensate for the loss. At the receiving end, the signals are fast sampled by an oscilloscope (Keysight UXR-Series 256-GSa/s real-time) and afterward offline processed.
Algorithm details
DNN
The implemented lite DNN equalizer consists of an input layer with M neurons, followed by two hidden layers with N1 and N2 neurons, respectively. The architecture culminates in an output layer with a single neuron. The DNN is trained using the mean-square-error (MSE) criterion, employing the back-propagation (BP) algorithm. The training process is executed in two distinct steps. Initially, the forward propagation phase calculates the output based on the input feed. This process is detailed as
where
f ( â ) is the activation function (Supplementary Section 3), as shown in the DNN transmission parts, X(n) is the input vector, Y1(n) and Y2(n) are the outputs of the first and second hidden layers respectively, yout(n) is the output signal, W1, W2 and W3 are the weight matrices between different layers of the DNN equalizer.
The second step in the training process is back-propagation, where the calculated error term is propagated from the output layer back to the input layer. This mechanism allows for the adjustment of the networkâs weights to minimize the error. The square error for a single training example is defined as
where info(n) is the desired signal.
The stochastic gradient descent is utilized to optimize the cost function with respect to the weight matrices, including W1, W2 and W3. The updated weight matrix for the subsequent iteration can be derived by
where η indicates the learning rate of the network.
Consider the four-layer DNN equalizer with nin, nH, nout nodes in the input, hidden and output layer, respectively. Typically, the output layer has a fixed single node. Here, nin can be viewed as the tap number in a digital filter. The primary computational burden in the equalizer arises from the error propagation needed to calculate the squared error derivative of each node in all the hidden layers. Compared to the DNN, which employs a sigmoid function with two saturation regions, the DNN that employs multi-level sigmoid does not incur additional computational requirements. The computational complexity of the DNN equalizer per iteration can be characterized from the program as:
GRU
A GRU unit is composed of a reset gate rt and an update gate zt. The output ht is determined by both current input xt and previous state htâ1 under the control of these two gates. The outputs of the gates and the GRU unit are calculated as follows:
where Wr, Ur, Wz, Uz, Wh and Uh are the weight matrices. br, bz, bh are the synthesis of bias vectors for input xt and previous state htâ1, Ï is the logistic sigmoid function, tanh is the hyperbolic tangent activation function, â denotes the Hadamard product.
Models with bidirectional structure are capable of learning information from both preceding and following data when processing the current data. The bi-GRU model employed in this work comprises two unidirectional GRU layer operating in opposite directions. By combining forward and backward GRU processing, the model incorporates information from both the future and the past to influence its current states. The bi-GRU model relies on the states of two GRU layers, whereas the T-biGRU model utilizes the states of three GRU layers. By integrating forward, backward and repeated forward GRU processing, the T-biGRU model more comprehensively extracts both global and local features of the sequence, thereby further enhancing equalization performance. The bi-GRU model can thus be mathematically described as follows:
where \({\overrightarrow{{h}_{t}}}_{bi-GRU}\) and \({\overleftarrow{{h}_{t}}}_{bi-GRU}\) is the state of the forward and backward GRU, respectively. \({{h}_{t}}_{bi-GRU}\) is the output of bi-GRU. â indicates the operation of concatenating two vectors.
For the case of T-biGRU:
where \({\overrightarrow{{h}_{t1}}}_{T-biGRU}\), \({\overleftarrow{{h}_{t}}}_{T-biGRU}\) and \({\overrightarrow{{h}_{t2}}}_{T-biGRU}\) are the states of the first forward GRU, the backward GRU and the second forward GRU, respectively. \({{h}_{t}}_{T-biGRU}\) represents the output of T-biGRU.
The main computation complexity of one GRU layer can be decribed as:
where nE refers to the input size of the GRU layer, and nH represents the number of GRU units that used in the layer. The complexity of bi-GRU and T-biGRU layer can be calculated as:
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The data that support the plots within this paper and other findings of this study are available on Zenodo database [https://doi.org/10.5281/zenodo.15631151]. All other data used in this study are available from the corresponding authors upon request.
Code availability
The codes that support the findings of this study are available from the corresponding authors upon request.
References
Cheng, Q., Bahadori, M., Glick, M., Rumley, S. & Bergman, K. Recent advances in optical technologies for data centers: a review. Optica 5, 1354â1370 (2018).
Shu, H. et al. Microcomb-driven silicon photonic systems. Nature 605, 457â463 (2022).
Shu, H. et al. Microcomb technology: from principles to applications. Photonics Insights 3, R09âR09 (2024).
Atabaki, A. H. et al. Integrating photonics with silicon nanoelectronics for the next generation of systems on a chip. Nature 556, 349â354 (2018).
Siew, S. Y. et al. Review of silicon photonics technology and platform development. J. Lightwave Technol. 39, 4374â4389 (2021).
Bogaerts, W. & Chrostowski, L. Silicon photonics circuit design: methods, tools and challenges. Laser Photonics Rev. 12, 1â29 (2018).
Soref, R. A. & Bennnett, B. R. Electrooptical effects in silicon. J. Quantum Electron. 23, 123â129 (1987).
Reed, G. T., Mashanovich, G., Gardes, F. Y. & Thomson, D. J. Silicon optical Modulators. Nat. Photonics 4, 518â526 (2010).
Reed, G. T. et al. Recent breakthroughs in carrier depletion based silicon optical modulators. Nanophotonics 3, 229â245 (2014).
Shi, Y. et al. Silicon photonics for high-capacity data communications. Photonics Res. 10, A106âA134 (2022).
Stanley, S. 800G Client Optics in the Data Center. A Heavy Reading white paper produced for Cisco. https://www.cisco.com/c/dam/en/us/products/interfaces-modules/transceiver-modules/white-paper-sp-800g-client-optics-data-center.pdf (2022).
Zhou, X., Urata, R. & Liu, H. Beyond 1 Tb/s intra-data center interconnect technology: IM-DD OR coherent? J. Lightwave Technol. 38, 475â484 (2020).
Rahim, A. et al. Taking silicon photonics modulators to a higher performance level: state-of-the-art and a review of new technologies. Adv. Photonics 3, 024003 (2021).
Zhou, X., Yi, D., Chan, D. W. U. & Tsang, H. K. Silicon photonics for high-speed communications and photonic signal processing. npj Nanophotonics 1, 1â14 (2024).
Wang, C. et al. Integrated lithium niobate electro-optic modulators operating at CMOS-compatible voltages. Nature 562, 101â104 (2018).
He, M. et al. High-performance hybrid silicon and lithium niobate MachâZehnder modulators for 100 Gbitsâ1 and beyond. Nat. Photonics 13, 359â364 (2019).
Xu, M. et al. High-performance coherent optical modulators based on thin-film lithium niobate platform. Nat. Commun. 11, 1â7 (2020).
Zhang, M., Wang, C., Kharel, P., Zhu, D. & Loncar, M. Integrated lithium niobate electro-optic modulators: when performance meets scalability. Optica 8, 652â667 (2021).
Kharel, P., Reimer, C., Luke, K., He, L. & Zhang, M. Breaking voltage-bandwidth limits in integrated lithium niobate modulators using micro-structured electrodes. Optica 8, 357â363 (2021).
Ayata, M. et al. High-speed plasmonic modulator in a single metal layer. Science 358, 630â632 (2017).
Haffner, C. et al. Low-loss plasmon-assisted electro-optic modulator. Nature 556, 483â486 (2018).
Alloatti, L. et al. 100 GHz silicon-organic hybrid modulator. Light Sci. Appl. 3, 1â4 (2014).
Lu, G. W. et al. High-temperature-resistant silicon-polymer hybrid modulator operating at up to 200 Gbitsâ1 for energy-efficient datacentres and harsh-environment applications. Nat. Commun. 11, 1â9 (2020).
Liu, A. et al. A high-speed silicon optical modulator based on a metal-oxide-semiconductor capacitor. Nature 427, 615â618 (2004).
Ding, R. et al. High-speed silicon modulator with slow-Wave electrodes and fully independent differential drive. J. Lightwave Technol. 32, 2240â2247 (2014).
Patel, D. et al. Design, analysis, and transmission system performance of a 41 GHz silicon photonic modulator. Opt. Express 23, 14263â14287 (2015).
Li, M., Wang, L., Li, X., Xiao, X. & Yu, S. Silicon intensity Mach-Zehnder modulator for single lane 100 Gb/s applications. Photonics Res. 6, 109â116 (2018).
Li, K. et al. Electronic-photonic convergence for silicon photonics transmitters beyond 100 Gbps on-off keying. Optica 7, 1514â1516 (2020).
Zhang, H. et al. 800 Gbit/s transmission over 1 km single-mode fiber using a four-channel silicon photonic transmitter. Photonics Res. 8, 1776â1782 (2020).
Alam, M. S. et al. Net 220 Gbps/λ IM/DD Transmssion in O-Band and C-Band With Silicon Photonic Traveling-Wave MZM. J. Lightwave Technol. 39, 4270â4278 (2021).
Mohammadi, A., Zheng, Z., Zhang, X., Rusch, L. A. & Shi, W. Segmented silicon modulator with a bandwidth beyond 67 GHz for high-speed signaling. J. Lightwave Technol. 41, 5059â5066 (2023).
Li, K. et al. An integrated CMOS-silicon photonics transmitter with a 112 gigabaud transmission and picojoule per bit energy efficiency. Nat. Electron. 6, 910â921 (2023).
Xu, Q., Schmidt, B., Pradhan, S. & Lipson, M. Micrometre-scale silicon electro-optic modulator. Nature 435, 325â327 (2005).
Bogaerts, W. et al. Silicon microring resonators. Laser Photonics Rev. 6, 47â73 (2012).
Sun, J. et al. A 128 Gb/s PAM4 silicon microring modulator with integrated thermo-optic resonance tuning. J. Lightwave Technol. 37, 110â115 (2019).
Cai, H., Fu, S., Yu, Y. & Zhang, X. Lateral-zigzag PN junction enabled high-efficiency silicon micro-ring modulator working at 100Gb/s. Photonics Techonol. Lett. 34, 525â528 (2022).
Chan, D. W. U. et al. C-band 67 GHz silicon photonic microring modulator for dispersion-uncompensated 100 Gbaud PAM-4. Opt. Lett. 47, 2935â2938 (2022).
Zhang, Y. et al. 240 Gb/s optical transmission based on an ultrafast silicon microring modulator. Photonics Res. 10, 1127â1133 (2022).
Chan, D. W. U. et al. Efficient 330-Gb/s PAM-8 modulation using silicon microring modulators. Opt. Lett. 48, 1036â1039 (2023).
Yuan, Y. et al. A 5 Ã 200 Gbps microring modulator silicon chip empowered by two-segment Z-shape junctions. Nat. Commun. 15, 1â9 (2024).
Shekhar, S. et al. Roadmapping the next generation of silicon photonics. Nat. Commun. 15, 1â15 (2024).
Hinakura, Y., Arai, H. & Baba, T. 64 Gbps Si photonic crystal slow light modulator by electro-optic phase matching. Opt. Express 27, 14321â14327 (2019).
Hinakura, Y., Akiyama, D., Ito, H. & Baba, T. Silicon photonic crystal modulators for high-speed transmission and wavelength division multiplexing. J. Sel. Top. Quantum Electron. 27, 4900108 (2021).
Kawahara, K. et al. High-speed, low-voltage, low-bit-energy silicon photonic crystal slow-light modulator with impedance-engineered distributed electrodes. Optica 11, 1212â1219 (2024).
Jafari, O., Shi, W. & LaRochelle, S. Mach-Zehnder silicon photonic modulator assisted by phase-shifted Bragg gratings. Photonics Techonol. Lett. 32, 445â448 (2020).
Jafari, O., Zhalehpour, S., Shi, W. & LaRochelle, S. DAC-less PAM-4 slow-light silicon photonic modulator providing high efficiency and stability. J. Lightwave Technol. 39, 5074â5082 (2021).
Han, C. et al. Slow-light silicon modulator with 110-GHz bandwidth. Sci. Adv. 9, eadi5339 (2023).
Zhou, J., Wang, J., Zhu, L. & Zhang, Q. Silicon photonics for 100Gbaud. J. Lightwave Technol. 39, 857â867 (2021).
Zhong, K. et al. Digital signal processing for short-reach optical communications: a review of current technologies and future trends. J. Lightwave Technol. 36, 377â400 (2018).
Che, D. & Chen, X. Modulation format and digital signal processing for IM-DD optics at post-200G era. J. Lightwave Technol. 42, 588â605 (2024).
Wang, H. et al. Scientific discovery in the age of artificial intelligence. Nature 620, 47â60 (2023).
Bai, B. et al. Microcomb-based integrated photonic processing unit. Nat. Commun. 14, 1â10 (2023).
Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441â446 (2017).
Ma, W. et al. Deep learning for the design of photonic structures. Nat. Photonics 15, 77â90 (2021).
Liu, D., Tan, Y., Khoram, E. & Yu, Z. Training deep neural networks for the inverse design of nanophotonic structures. ACS Photonics 5, 1365â1369 (2018).
Zhao, H. & Zhang, J. Adaptively combined FIR and functional link artificial neural network equalizer for nonlinear communication channel. IEEE Trans. Neural Netw. 20, 665â674 (2009).
Liu, C. et al. 81-GHz W-band 60-Gbps 64-QAM wireless transmission based on a dual-GRU equalizer. Opt. Express 30, 2364â2377 (2022).
Liu, S. et al. A multilevel artificial neural network nonlinear equalizer for millimeter-wave mobile fronthaul systems. J. Lightwave Technol. 35, 4406â4417 (2017).
Vlasov, Y. A., Oâboyle, M., Hamann, H. F. & McNab, S. J. Active control of slow light on a chip with photonic crystal waveguides. Nature 438, 65â69 (2005).
Baba, T. Slow light in photonic crystals. Nat. Photonics 2, 465â473 (2008).
Krauss, T. F. Why do we need slow light. Nat. Photonics 2, 448â450 (2008).
Brosi, J. M. et al. High-speed low-voltage electro-optic modulator with a polymer-infiltrated silicon photonic crystal waveguide. Opt. Express 16, 4177â4191 (2008).
Jiang, Y., Jiang, W., Gu, L., Chen, X. & Chen, R. T. 80-micron interaction length silicon photonic crystal waveguide modulator. Appl. Phys. Lett. 87, 221105 (2005).
Lin, C. Y. et al. Electro-optic polymer infiltrated silicon photonic crystal slot waveguide modulator with 23 dB slow light enhancement. Appl. Phys. Lett. 97, 093304 (2010).
Gao, X. et al. Dirac-vortex topological cavities. Nat. Nanotechnol. 15, 1012â1018 (2020).
Yariv, A., Xu, Y., Lee, R. K. & Scherer, A. Coupled-resonator optical waveguide: a proposal and analysis. Opt. Lett. 24, 711â713 (1999).
Yang, Y., Peng, C., Liang, Y., Li, Z. & Noda, S. Analytical perspective for bound states in the continuum in photonic crystal slabs. Phys. Rev. Lett. 113, 037401 (2014).
Jin, J. et al. Topologically enabled ultrahigh-Q guided resonances robust to out-of-plane scattering. Nature 574, 501â504 (2019).
Deligiannidis, S., Bogris, A., Mesaritakis, C. & Kopsinis, Y. Compensation of fiber nonlinearities in digital coherent systems leveraging long short-term memory neural networks. J. Lightwave Technol. 38, 5991â5999 (2020).
Liu, X. et al. Bi-directional gated recurrent unit neural network based nonlinear equalizer for coherent optical communication system. Opt. Express 29, 5923â5933 (2021).
Liu, Y. et al. Attention-aided partial bidirectional RNN-based nonlinear equalizer in coherent optical systems. Opt. Express 30, 32908â32923 (2022).
Cho, K. et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Conference on Empirical Methods in Natural Language Processing 1724â1734 (Association for Computational Linguistics, 2014).
Murphy, S., Jamali, F., Townsend, P. & Antony, C. High dynamic range 100G PON enabled by SOA preamplifier and recurrent neural networks. J. Lightwave Technol. 41, 3522â3532 (2023).
Dong, P. et al. Silicon photonics for 800G and beyond. In Optical Fiber Communication Conference, M4H.1 (IEEE, 2022).
Acknowledgements
The authors thank Xi Xiao, Peiqi Zhou, Qiansheng Wang in National Information Optoelectronics Innovation Center for testing support. This work was supported by National Key Research and Development Program of China (2022YFB2803700 to H.S.), National Natural Science Foundation of China (62235002 to X.W., 62327811 to X.W., 12204021 to H.S., 62322501 to H.S., 12374340 to J.Q.), Beijing Municipal Science and Technology Commission (Z221100006722003 to X.W.), Beijing Municipal Natural Science Foundation (Z210004 to X.W.), IPOC (2021A03 to J.Q.) and Major Key Project of PCL to X.W.
Author information
Authors and Affiliations
Contributions
The experiments were conceived by C.H. The devices were designed by C.H. The equalizers were developed by J.Q. The experiments were performed by C.H., Q.Y. and Y.Zhou, with the assistance from Y.Zhang, H.W., L.W., Z.H., S.Y. and W.H. The slow light theory was developed by Z.Z and C.P. The data process was conducted by C.H., J.Q., Y.S., J.L. and Y.Wu. The device characterization was conducted by C.H., Q.Y., Y.Zhou. and Z.G. The results were analyzed by C.H., Q.Y., J.Q. and Y.Zhou. The figure optimization was conducted by C.H., Q.Y., Y.S, J.L. and Y.Wang. All authors participated the discussion of the research. The project was supervised by H.S., J.E.B. and X.W.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Emir Salih Magden and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Han, C., Yang, Q., Qin, J. et al. Exploring 400 Gbps/λ and beyond with AI-accelerated silicon photonic slow-light technology. Nat Commun 16, 6547 (2025). https://doi.org/10.1038/s41467-025-61933-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-025-61933-5











