Abstract
Living cells understand their environment by combining, integrating and interpreting chemical and physical stimuli. Despite considerable advances in the design of enzymatic reaction networks that mimic hallmarks of living systems, these approaches lack the complexity to fully capture biological information processing. Here we introduce a scalable approach to design complex enzymatic reaction networks capable of reservoir computation based on recursive competition of substrates. This protease-based network can perform a broad range of classification tasks based on peptide and physicochemical inputs and can simultaneously perform an extensive set of discrete and continuous information processing tasks. The enzymatic reservoir can act as a temperature sensor from 25â°C to 55â°C with 1.3â°C accuracy, and performs decision-making, activation and tuning tasks common to neurological systems. We show a possible route to temporal information processing and a direct interface with optical systems by demonstrating the extension of the network to incorporate sensitivity to light pulses. Our results show a class of competition-based molecular systems capable of increasingly powerful information-processing tasks.

Similar content being viewed by others
Main
Living systems sense, process and classify information from their environment using complex reaction networks1,2. These networks can respond to diverse physical and chemical stimuli such as the availability of nutrients, pH levels, changes in temperature and variations in light levels3. Seminal work has shown that complex networks found in nature share structural design patterns4. These so-called ânetwork motifsâ have served as a template for a wide range of synthetic reaction networks5 that exhibit well-defined behaviour, such as spatiotemporal dynamics6,7,8,9, robustness and adaptation10,11,12, and computational capabilities13,14,15,16,17,18. Importantly, none of the experimental approaches to functional enzymatic reaction networks reported to date fully capture the complexity of living systems. The focus on bottom-up construction of small network motifs is hampered by increasingly complex and laborious molecular designs. To emulate the rich information-processing behaviour found in cells, a new, scalable approach to artificial enzymatic reaction networks is needed.
Recently, we have shown that a self-organizing reaction network based on the prebiotic formose reaction forms a powerful chemical reservoir computer capable of complex systems predictions and forecasting19. Small biochemical networks are unsuitable for such in chemico reservoir computation as these are designed for very specific tasks and operate within narrowly defined parameter space20. In contrast, reaction networks with recursive interactions (such as the prebiotic formose reaction) may generate a wide array of chemical products that are non-linearly dependent on a small number of inputs.
Inspired by the adaptive, multifaceted information processing found in cells, we present a blueprint for scalable, enzymatic reservoir computing. By combining resource competition with recursive interactions, a complex, non-linear enzymatic reaction network emerges in response to changes in reaction conditions, without the need for a fixed network motif. Theoretical studies have shown how crosstalk and resource competition endow highly interconnected protein interaction networks with computational capabilities21,22,23, and recent research has shown competition may play a role in neuronal processes24.
In this study, we experimentally demonstrate that recursive enzymatic competition networks can process physicochemical information in the environment by acting as a reservoir sensor. We designed a protease-based enzymatic reaction network which receives up to seven short peptide substrates as input. Each of the peptides contains multiple cleavage sites for different proteases with different reaction rates. The many possible cleavage pathways lead to competitive and recursive interactions, yielding a highly non-linear network of enzymatic reactions that can be used for various sensing tasks. We first show the capacity of this network for reservoir computation by performing several non-linear classification tasks of different substrate input compositions. Next, we extend the physical reservoir paradigm to physicochemical sensing tasks by incorporating temperature and pH levels as new inputs. We demonstrate the potential of the enzymatic reservoir for general in chemico information processing by the simultaneous sensing, non-linear processing, and classification of changes in its environment. The reservoir can differentiate temperatures reliably with an average error of 1.3â°C over a broad range of temperatures (25â55â°C), and can accurately perform activation, tuning and decision-making tasks over that same range. Finally, we extend the network to detect changes in light-pulse periodicity for the range between 2 and 8âmin. Our work combines principles from cellular reaction networks and neuromorphic computing into a novel paradigm to develop physicochemical sensors suitable for tasks in a biological environment, a next step towards intelligent autonomous molecular systems.
Results and discussion
Design of a recursive enzymatic competition network
The design of our enzymatic reservoir is based on a new class of scalable enzymatic reaction networks that are connected through shared substrates and the recursive use of reaction products. We selected seven enzymes (listed in Fig. 1a): six proteases which selectively cleave peptides at different amino acid residues (trypsin, chymotrypsin, elastase, thrombin, thermolysin and prolyl endopeptidase) and alkaline phosphatase (Supplementary Table 3). We then selected seven short peptides, each containing multiple cleavage sites for two or more enzymes (listed in Fig. 1b). The cleavage fragments can serve as substrates for further cleavage (Fig. 1c). The multitude of cleavage reactions leads to competition between the peptides for available enzymes. Furthermore, the peptides CCFSWRCRC (3), IYPFVEPI (5) and TKIFKI (7) serve as slow substrates for chymotrypsin, prolyl endopeptidase and trypsin, respectively25,26. Such substrates act as effective reversible inhibitors that are slowly degraded, yielding non-linear kinetics that resemble autoactivation. The peptide library also includes a reversible peptide proinhibitor of chymotrypsin which features a phosphorylated serine residue (CCF(pS)WRCRC, 4). This phosphate group can be dephosphorylated by alkaline phosphatase, restoring its inhibitory effect on chymotrypsin and introducing competition with other substrates. The repeated cleavage of peptides by the enzymes leads to the formation of a recursive enzymatic reaction network that is characterized by resource competition for catalysts and emergent inhibitory feedback mechanisms (a schematic of possible cleavage pathways is shown in Fig. 1d), and can generate complex mixtures of output fragments. The actual reaction network varies based on the specific competing interactions between the enzymes and substrates at any particular combination of input concentrations and physicochemical conditions.
a,b, Lists of enzymes (a) and substrates (b) used in the reservoir design. Colour-coded vertical bars indicate potential cleavage sites for corresponding enzymes. c, Schematic representation of the competitive and recursive nature of enzymeâsubstrate interactions orchestrated within the reservoir, leading to a complex mixture of peptide fragments. d, Schematic overview of enzymatic interactions in the designed recursive ERN. Not all possible cleavage pathways are shown. e, Top: Chromatograms showing cleavage patterns in the network for individual peptides. Bottom: comparison between the accumulated cleavage patterns of individual peptides (grey) and the cleavage pattern resulting from using all peptides as input simultaneously (blue).
We immobilized the enzymes onto monodisperse polyacrylamide hydrogel beads, using a previously described approach27. Immobilization of enzymes allows us to separate the enzymatic âhardwareâ from the peptide substrates âsoftwareâ by compartmentalizing the enzymatic reaction network (ERN) inside a continuous stirred tank reactor (CSTR), effectively making the enzymatic reservoir reusable (see Methods for details). Prior to constructing the full network in the CSTR, we tested the activity of each individual enzyme (Supplementary Fig. 1) and determined peptide concentration ranges that gave a substantial response in output. We further examined the cleavage fingerprint of individual peptides produced by the full set of immobilized enzymes using high-performance liquid chromatography (HPLC) (Fig. 1e, top; see Supplementary Information, section 3.2.2 for details), and the fingerprint obtained when introducing all peptides simultaneously (Fig. 1e, bottom). The clear difference in output produced establishes that the competition resulting from the introduction of multiple peptides leads to a significant change in molecular interactions.
Chemical and physicochemical non-linear classification tasks
First, we demonstrate that our ERN has sufficient non-linear dynamics to exhibit reservoir computing capabilities. We assembled the ERN in a CSTR and distributed the peptides into input syringes based on their effective concentration regimes. We combined peptides 1 and 2 in syringe S1, peptides 3â5 in syringe S2, and peptides 6 and 7 were distributed in both S1 and S2. Syringes S1 and S2 served as potential input variables, with input concentrations (effective concentration in the reactor) varying from 40 to 100âμM for S1 and from 15 to 75âμM for S2. The concentrations of peptides 6 and 7 were kept constant at an input concentration of 160âμM, and the total flow rate of the system was kept constant by compensating the change in flow rate of S1 and S2 with an opposite change in flow rate of a buffer solution. The concentration ranges were specifically selected based on experimental observability, solubility, detectable cleaved fragments and non-linear effects (see Supplementary Information, sections 3.2.1 and 3.2.2 and Supplementary Fig. 4 for more details). To increase the experimental throughput of our system, we coupled the reactor directly to an electrospray ionization mass spectrometer (ESI-MS) (see Supplementary Fig. 3 for an overall schematic of the set-up). For each input combination, the network was allowed to reach steady-state over a 1-h equilibration period, during which 103 different ion traces resulting from specific cleaved fragments were recorded. The final output corresponding to each input was obtained as the averaged ion intensities over the final 10âmin of the equilibration period (Methods and Supplementary Fig. 17). The output of the ERN was converted to a âcomputationalâ output by training a single linear readout layer (a standard approach in physical reservoir computing), which multiplies every output feature with a weight and sums the resulting weighted signals (Fig. 2a). For the classification tasks, we applied a linear support vector classifier (LSVC) algorithm to obtain the correct weights, resulting in a binary classification for every input.
a, Schematic of the procedure used to classify input conditions. Every input condition (varying parameters U1 and U2) corresponds to a chemical output composition, which is multiplied by pretrained weights based on a target classification function. The result is summed to a final computational output, resulting in the classification of the input. b, Non-linear classification based on two different sets of substrate inputs (S1 and S2). On the left side is shown the input space. Next are shown the results of classification for an XOR gate, circle classification and hourglass classification, with the background indicating the desired classification output (blueâ=â0, redâ=â1). Locations of the markers indicate the corresponding location in input space, with the colour indicating the average test-set accuracy of that input point for 130 different L5O train-test splits, with +1 corresponding to perfect predictions, and 0 to total failure (Methods). The bar charts below every classification result show the average L5O-CV Φ accuracy for the enzymatic reservoir (ERC), the training algorithm without reservoir (LSVC), and several machine-learning algorithms (SVC, support vector classifier; GP, Gaussian process; MLP, multilayer perceptron; ELM, extreme learning machine). c, Non-linear classification results using pH and temperature as inputs, where similar to b the input space is shown on the left, and the respective accuracies per input point and a comparison of average accuracy to different in silico algorithms are shown on the right.
We first performed the XOR, circle and hourglass classification tasks using S1 and S2 as inputs, randomly sampling the input space in an appropriate range of input concentrations (Fig. 2b). For every input, the reservoir output was collected and trained as described above. To validate the performance of the classification tasks, we calculated the average Φ accuracy (also known as the phi coefficient, or Matthews correlation coefficient) using a repeated stratified leave-5-out cross-validation (L5O-CV) following Baltussen et al.19, and we compared these results to standard in silico classification methods. The results shown in Fig. 2b demonstrate that the ERN reservoir can perform these classification tasks on peptide inputs with a similar accuracy as established machine-learning techniques performing classification on an equivalent input space.
To further verify that the full set of enzymes is necessary to perform these classification tasks, we tested the performance of these classification tasks for subsets of the system with respectively three and five enzymes, instead of the full set of seven enzymes (Supplementary Fig. 15). The inability to perform non-linear classifications with these smaller networks demonstrates that the computational capacity is not straightforward, but an emergent property of the constructed network. We also investigated how many different output species need to be generated or observed to obtain sufficient performance. Depending on the task, anywhere from fewer than 50 to the full dataset of 103 different peptide traces are required to achieve optimal performance (Supplementary Fig. 16). To evaluate the ability of the reservoir for time-series prediction, we applied a dynamic input based on a sine wave. Output signals were sampled every 4.5âmin over 7âh. A ridge regression model was trained to predict the input signal at future time points (9 and 13.5âmin ahead). The results of the prediction are shown in Supplementary Fig. 12, demonstrating that the ERN reservoir is indeed capable of short-term forecasting (see Supplementary Information, section 3.2.5 for full details).
Having established the reservoir computing capabilities of the network based on peptide inputs, we wished to explore the capacity of our ERN to classify inputs from the physical environment in the form of temperature and pH, demonstrating that a change in peptide concentration is not necessary for the network to exhibit reservoir capabilities. The activity of each enzyme in the network will have a characteristic dependence on temperature and pH. Changes in these parameters therefore alter the relative strengths of connections in the network and lead to an observable response in the reservoir output (Supplementary Figs. 5â8). To perform purely physicochemical classification, we restrict the input space to only changes in temperature and pH, maintaining a constant input concentration of all peptides (Fig. 2c). Similar to the S1âS2 inputs described above, we recorded 60 steady-state outputs by randomly sampling 10 different temperature inputs within the range of 25â55â°C in combination with six different pH values ranging from 6 to 8.5 (controlled by separate buffer-syringes, Methods). Figure 2c shows the resulting L5O-CV Φ accuracies for three tasks (XOR, circle,and hourglass classification) of the ERN in comparison to in silico techniques on a similar input space. Comparable results for peptideâtemperature and peptideâpH input combinations are shown in Supplementary Fig. 9. Clearly, changes in physicochemical parameters are sufficient inputs for the network to perform reliable classification of its environment. To demonstrate the broad application to non-linear classification tasks, we provide an extended set of tasks in Supplementary Fig. 14.
Generalized physical information processing
Information-processing networks in cells are not generally limited to single tasks but demonstrate adaptive responses over a broad range of possible input conditions that may then be picked up by subsequent networks for further fine-tuning of cellular behaviour28,29. We demonstrate the capacity of the reservoir for four unique classes of information-processing tasks: linear sensing, activation, tuning and finally decision-making, in a generalized fashion, using temperature as input, without requiring further modification of the ERN (Fig. 3a and Methods). Specifically, the sensing task requires the network to estimate the environmental temperature based on its output composition, which depends on the reservoir producing a linear response over the full range of temperatures. The activation task corresponds to estimating a sigmoidal response as a function of temperature, a function commonly encountered in both biological and machine-learning applications30, and requiring a non-linear response of the reservoir around the infliction point T0 and a static response towards the asymptotes. The tuning task corresponds to estimating the gradient of the activation task, requiring a local pulse-like response around T0, an important feature for learning, and similar to neuronal activity patterns found in the brain31. Finally the decision-making task was implemented as a binary classification task, requiring a switch-like response of the reservoir at the classification boundary T0. All tasks thus require different sets of behaviours that can consistently be decoded by a linear readout layer. In the case of the continuous tasks (sensing, activation, tuning) a ridge regressor was used to train the linear readout layer. For the decision-making task, similar to the previously described classification tasks, a LSVC was used. The reservoir was tested over a large range of temperature conditions (25.5â55â°C) at 1.5â°C intervals at fixed pH and peptide inputs, with outputs obtained as quadruplicates. Specifically for the activation, tuning and decision-making tasks the capacity of the reservoir to correctly estimate a large range of different inflection points, tuning points and classification boundaries was evaluated (examples shown in Fig. 3c).
a, Schematic overview showing the classes of information-processing tasks the enzymatic reservoir can perform using temperature as an input. b, Plot showing the true versus predicted temperature. The datapoints shown are obtained from separate L5O validation runs (only the test set is shown). The average s.d. over all validation runs is 1.33â°C. c, Plots showing example activation, tuning and decision-making tasks at different inflection/tuning/classification boundaries T0, with T0â=â40â°C emphasized by the dashed vertical line (double-headed arrows indicate that boundaries can be shifted left or right). d, Plots showing predictions over the full range of temperature inputs, obtained from multiple L5O validation runs for a specific version of each task class, with T0â=â40â°C. Every point represents a test-set prediction for a different trained readout layer. The black line indicates the true output of the task. e, Plots showing the average L5O test-set R2 and Φ scores over the full range of tasks, with the x axis representing the inflection point, tuning point or classification boundary T0.
As shown in Fig. 3b, the reservoir is capable of sensing temperature across the full range of tested conditions (a 30â°C interval) with an average s.d. of 1.33â°C, obtained via multiple L5O tests. Moreover, the same broad performance is observed for the activation, tuning and decision-making tasks, with the reservoir capable of making accurate predictions across the full range of temperatures. The network achieves slightly lower scores for the tuning task compared with the activation task. This functionality is more challenging to perform because the network must account for twice the amount of non-linearity compared with the activation task, while still relying on only a linear readout. The reliable performance of the ERC across all classes of tasks shows that the recursive network creates both linear and various non-linear responses as a function of temperature, which can subsequently be extracted using only a linear readout layer.
Extending the input space to light pulses
Finally, as a first proof-of-principle of the extensibility of our enzymatic reservoir sensor towards detecting patterns in light intensity and direct interfacing with optical devices, we demonstrate the detection of the periodicity of a periodic blue light pulse. To achieve this task, we extend the reservoir by the inclusion of a merocyanine dye in the buffer solution (Fig. 4a), a photochromic compound that can decrease the solution pH by 3.5 units upon irradiation with blue light32. As we have shown above that the reservoir is sensitive to changes in pH, we expect the reservoir to respond to changes in light upon addition of this dye (Supplementary Fig. 11). Following a procedure similar to the temperature switch introduced above, we maintained a fixed temperature (30â°C) and substrate input concentrations, and exposed the reactor to 17 different light-pulse periodicities (from 30âs to 600âs) with constant average intensity for 15âmin (Fig. 4b). The network outputs were measured offline using HPLC. Importantly, the measured data reflect the cumulative dynamic response of the enzymatic reservoir induced by the light pulses with different periodicities. We then perform a binary classification with respect to a threshold periodicity P0, establishing a light-sensitive switch over a broad range of threshold periodicities (Fig. 4c). We find that the reservoir accurately differentiates timescales across a range of periodicities between 2 and 8âmin (Fig. 4d). These results are remarkable as the network is apparently sensitive to time-dependent changes in pH while the average pH in the reactor is constant. This experiment demonstrates the potential of the enzymatic reservoir to detect temporal information in the environment, offering opportunities for interfacing the enzymatic reservoir with information generated by electronic devices.
a, Blue light irradiation triggers merocyanine-to-spiropyran switching, leading to a decrease in pH. b, Schematic experimental overview for the light frequency sensor. c,d, Plots showing a binary switching task (c) and Φ accuracies of the binary switching task (d) as a function of light-pulse periodicity thresholds (P0). Red and blue dots in c indicate data points on different sides of the decision boundary.
Conclusions
We have introduced a new scalable approach to the design of complex enzymatic reaction networks with emergent behaviour, based on recursive competition for resources. We have shown that this network has in chemico reservoir computation capabilities, can be used as an enzymatic sensor to detect physicochemical changes in its environment and is scalable to new physical inputs. The network is capable of a diverse set of information-processing tasks, showing strong performance on linear sensing, non-linear processing and classification based on both chemical and physical inputs.
We have further showcased how the computational capacity scales up with network complexity by varying the number of enzymes or taking into account smaller subsets of output features. Our approach is inherently scalable: additional enzymes (other proteases, phosphatases or kinases) can be readily added along with additional or longer peptides that contain multiple cleavage or (de)phosphorylation sites. One can also envisage activating or deactivating peptides as substrates for the proteases by enzymatic (de)glycosylation. We therefore expect that the computational performance of the enzymatic reservoir sensor can be further improved. The development of enzymatic reservoirs could be further aided by in silico surrogate models. These could be used to efficiently map and search the high-dimensional space of chemical and physical parameters in which the network would perform optimally for a specific set of computational tasks.
We have demonstrated that the networkâs response can be modulated by physical stimuli such as pH, temperature and light. This tunability opens up promising avenues for engineering dynamic responses from the system by expanding the range of input parameters. One could, for instance, incorporate enzymes or enzyme cascades that modulate the local pH environment in response to an analyte of interest (for example, ATP/ADP ratio, or the presence of certain sugars in a mixture), thereby influencing the reservoir response.
The enzymatic network reported here is the second example of a chemical reaction network capable of reservoir computing. This network shows somewhat inferior computational capabilities compared to the previously reported formose reservoir, which outperformed several in silico machine-learning algorithms in non-linear classification tasks and performed particularly well on dynamic tasks. To further increase the computation capabilities of the enzymatic network, future work could focus on adding additional peptides and proteases to deepen the complexity of the system. An expansion of the number of peptides used could also offer routes to task-adaptive fine-tuning of the responsiveness of the enzymatic reservoir. Comparing the two chemical systems, the formose reaction operates under rather harsh reaction conditions, not compatible with many other chemical systems. In contrast, the reaction conditions of the enzymatic reservoir are mild, and we anticipate that the enzymes could be embedded in hydrogels or other soft materials, allowing for a direct interface between in chemico reservoir computing and responsive materials.
Potentially, a direct cellular interface may be envisioned by replacing the mass spectrometry-based readout layer with an in vitro variant of biochemical decision-making networks that have been shown to function in living cells, such as a metabolic perceptron33 or a protein-based winner-takes-all network34. These biochemical networks would replace our in silico regression models, and their output could then be trained to influence cellular behaviour, thus creating a link between the information-processing capabilities of the reservoir and living systems.
Finally, the capacity for enzymatic reaction networks to process time-based information and the sensitivity to changes in the periodicity of light signals, opens a wide range of possibilities for dynamic sensing and encoding of temporal information in chemical systems using optical pulses. Optically interfacing electronic computers with chemical systems represents a major step towards autonomous molecular systems.
Methods
Materials
All chemicals and reagents were used as received from commercial suppliers without any further treatment unless stated otherwise. Enzymes were purchased from Merck Sigma-Aldrich. Specifications are listed in Supplementary Information, section 1.1.1. All enzymes are assumed to be salt-free for calculating enzyme concentration, and calculations are based on specified molecular weights.
All peptides used in this study were synthesised by CASLO as trifluoroacetate salts with >95% purity. Peptide stock solutions (10âmM) were prepared in dimethylsulfoxide and kept at â20â°C. Product numbers are listed in the Supplementary Table 1.
General experiments were performed using 50âmM phosphate buffer, pHâ7.4. Experiments requiring different pH conditions were performed using BrittonâRobinson buffer (pH). The phosphate buffer was prepared as follows: potassium phosphate dibasic (5.352âg) and potassium phosphate monobasic (1.475âg) were mixed in MilliQ water to total volume of 1,000âml, yielding 50âmM of final concentration. The mixture is adjusted to pHâ7.4 solutions by addition of 2âM HCl and 10âM NaOH. The change in volume due to the addition of 10âM NaOH and 2âM HCl was neglected.
Flow reactions
A poly(methyl methacrylate) CSTR (volume, 100âμl) was prepared as described in Baltussen et al.27 For reactions involving temperature control, in place of a poly(methyl methacrylate) plate, the reactor was constructed above an aluminium plate such that the plate comprised the floor of the reactor. An image of the reactor set-up is available in Supplementary Fig. 3. This enabled high-precision control and monitoring of the temperature inside the reactor. Enzyme beads (4âµl each; total, 28âµl) were fixed inside the CSTR with a polycarbonate membrane of 10-µm pore size over the inlet and outlet. A LabM8 syringe pump platform was used to control input flow rates and temperature. Syringes were prepared with peptide solutions with each concentration configured to enable a combined flow rate of 400 μlâhâ1.
Chemical inputs
The peptides listed in Fig. 1b were used as chemical inputs/substrates for enzymes. Peptides 1 and 2 were combined in syringe S1, while peptides 3, 4 and 5 were combined in syringe S2. Peptides 6 and 7 were distributed across both syringes. Peptides in syringe S1 and S2 served as potential input variables, with input concentrations (effective concentration in the reactor) varying from 40 to 100âμM for S1 and from 15 to 75âμM for S2. The concentration of peptides 6 and 7 was kept constant at an input concentration of 160âμM for all the experiments. The total flow rate of the system was kept constant by compensating the change in flow rate of S1 and S2 with an opposite change in flow rate of a buffer solution. The total flow rate was maintained at 400âµlâhâ1 for most experiments, except for the light-pulse experiments, where a flow rate of 800âµlâhâ1 was used. For the S2 versus T and S2 versus pH classifications, the concentration of peptides (1 and 2) in S1 were kept constant at 70âµM, whereas the peptides (3, 4 and 5) in S2 were randomly varied between 15 and 75âµM, with respective pH and temperature conditions. For the rest of the experiments, peptide concentrations were held constant: peptides 1 and 2 were maintained at 70âµM, peptides 3, 4 and 5 at 45âµM, and peptides 6 and 7 at 160âµM each. For each input combination, the network was allowed to reach steady state over a 1-h equilibration period. To increase the experimental throughput of our system, we coupled the reactor directly to a trapped ion-mobility time-of-flight (TIMS-ToF) mass spectrometer. The final output of the reservoir was obtained as the averaged ion intensities over the final 10-min sample period of the steady-state output (Supplementary Fig. 17).
Physicochemical inputs
Temperature
The reactor temperature was controlled via a custom, tuned heating unit in the aluminium plate compromising the base of the CSTR. This system featured a 50-W heater cartridge with a PT100 temperature sensor. This heating unit was controlled by the same central unit as the flow rate. For S2 versus T and pH versus T classifications, temperatures between 25â°C and 55â°C were randomly sampled with changing peptide concentration. Finally, for generalized physical information processing, a large range of temperature conditions (25.5â55â°C) at 1.5â°C intervals were tested.
pH
BrittonâRobinson buffers of pH 6.0, 6.5, 7.0, 7.5, 8.0 and 8.5 were prepared as follows: boric acid (3.0915âg), 85% aqueous orthophosphoric acid (3.42âml) and glacial acetic acid (2.86âml) were mixed and diluted to total volume of 1,000âml, yielding 50âmM of every acid in the solution. The mixture is adjusted to the respective pH solutions by addition of 2âM HCl and 10âM NaOH. The change in volume due to the addition of 10âM NaOH and 2âM HCl was neglected.
For S2 versus pH and pH versus T classifications, the above-specified pH values were varied with changing peptide concentration and temperature.
Light
Light-emitting diode lights were purchased from Avonec (5âW, λmaxâ=â460ânm; 3âW) and powered by a Thorlabs LEDD1B driver. The timing and the intensity of this light source were externally controlled by an Arduino device. Blue light pulses were applied in alternating on/off cycles of 30â600âs for a total duration of 15âmin. A 30-s pulse induced a significant pH drop, while the longest cycles allowed the pH to return closer to its initial value.
Mass spectrometry
The output of the reactor was measured in real-time using a TIMS-ToF mass spectrometer (Bruker Daltonics). Once the CSTR was filled, the output of the reactor was coupled to an inline check valve, a dilution line (providing 0.046âmlâminâ1 of MilliQ water) and a back-pressure regulator as specified in Baltussen et al. prior to connection to the TIMS-ToF mass spectrometer19. The outflow of the CSTR, following dilution, was continuously injected into the ESI source of the TIMS-ToF instrument. The ESI needle was replaced with a 10-cm-long fused silica capillary (0.19âmm outside diameter, 0.1âmm inside diameter; Postnova Z-FSS-100190). Ions were electrosprayed in positive mode with the following settings: voltage, +4.5âkV; nebulizer pressure, 2.0âbar; drying gas flow, 8âlâminâ1; and source temperature, 200â°C. Ion transfer voltages were: quadrupole ion energy, 5âeV; collision cell in, 70âV; collision energy, 8âeV. The ion transmission was optimized for the mass range of interest (m/z 200â2,000) by using a transfer time of 110âμs, a collision radiofrequency of of 2,000âV peak-to-peak and a prepulse storage time of 10âμs prior to ToF analysis. The mass range scanned by the ToF analyser was m/z 200â2,000. The instrument was calibrated quadratically using three selected ions from an Agilent ESI liquid chromatography/mass spectrometry tuning mix: (118.0863, 0.542âV·sâcmâ2), (322.0481, 0.732âV·.sâcmâ2) and (622.0290, 0.985âV·sâcmâ2).
Reservoir computing pipeline
We use a general reservoir computing approach to perform several computational tasks. In all these tasks, we want to approximate a function f over a space of inputs \(u\in {U}\). Here, the function f can represent a binary classification or a smooth function (often constrained to the interval \([\mathrm{0,1}]\)). The input space can be one-dimensional or multidimensional, and is normalized to the interval \([\mathrm{0,1}]\) as well.
In all tasks, for every input ui we assign a true output yiâ=âf(ui). We then use the reservoir to approximate the true outputs as follows. For every input ui we obtain an enzyme reservoir response xi, consisting of the observed ion signals at steady state as described in the previous sections. This reservoir response can then be transformed into a computational output by multiplying with a weight vector W to obtain a prediction \({\hat{y}}_{i}=W{x}_{i}\). By training this weight vector on a (training) set of input/output combinations \(\left\{{x}_{i}({u}_{i}),\,{y}_{i}({u}_{i})\right\}\), we can ensure that the reservoir output can approximate the true output.
Depending on the nature of the task, either a linear support vector classifier (for classifications) or a ridge regression (for continuous tasks) is chosen to obtain the weight vector W. Both of these training algorithms implement regularization techniques to promote sparser sets of weights, reduce variability in the predicted output and prevent overfitting induced by the high dimensionality of the reservoir output x.
A Jupyter notebook with an example pipeline is provided (notebooks/0-classification-example.ipynb).
Non-linear classification tasks
Three different classification tasks were performed, and four different classification datasets were collected (S1 versus S2, S2 versus T, S2 versus pH, and pH versus T). For every input in the classification datasets, reactor output was measured for at least 1âh to ensure equilibration of the reservoir response. The output in the last 10âmin of this period was averaged to reduce noise. In total 103 features (peptide fragments) were tracked for each dataset. The datasets were independently normalized to remove the mean and scaled to unit variance across features. For the non-linear classification tasks, a LSVC from the scikit-learn Python package35 was trained to obtain classifications. For every task and every dataset a repeated stratified L5O CV was performed. Here, a dataset is divided into batches of five inputs (12 batches in total), ensuring every batch has representative inputs from both class labels (stratified), after which the LSVC is trained on all except one batch. The remaining batch of five inputs is then used to calculate a test score. This is done for every batch, and repeated 10 times for different batch-divisions, resulting in 10âÃâ12â=â120 test-accuracies. These test-accuracies are then used to obtain the final average test-accuracies as well as its s.d.
As test score, Matthewâs correlation coefficient, also known as the Φ score, was used:
where TP denotes the number of true positives, TN true negatives, FP false positives and FN false negatives in the test set. This score returns +1 for perfect predictions and â1 for completely wrong predictions. To convert the score to the accuracies reported in the manuscript, the scores were transformed as
before calculating the average and standard deviation over all repeats.
A Jupyter notebook reproducing the analysis is provided (notebooks/1-classification-all.ipynb).
General information-processing tasks
To demonstrate general information processing using the enzymatic reservoir, the reservoir output was collected at different temperatures, ranging from 25.5â°C to 55â°C at 1.5â°C intervals (21 inputs in total). Similar to the classification tasks, the reservoir was equilibrated for 1âh at every temperature, and the final output was collected as the average over the last 10âmin of the equilibration period. Per temperature input, four repeats were collected across different reactors and different days, resulting in a dataset of 82 inputs. Features with a signal-to-noise ratio below 1 (as determined by comparing for every feature, the s.d. in the total dataset to the average s.d. per temperature) were subsequently removed, resulting in 53 significant features in the dataset. The dataset was normalized to remove the mean and scaled to unit variance across features. Different from the classification tasks, a ridge regressor from the scikit-learn Python package35 was trained to obtain predictions.
The temperature-sensing task was assessed using a normal repeated L5O-CV (non-stratified) with 100 repeats, and the sensor error was obtained as the average s.d. between the predicted and true temperature for every test set. The binary classification task was assessed similarly to the non-linear classification tasks, but now with 100 repeats. The smooth tasks (activation and tuning) were assessed using a normal repeated L5O-CV (non-stratified) with 100 repeats. Here, the final score was obtained as the R2 calculated over all repeated L5O-CV test set predictions, where +1 indicates perfect predictions and 0 indicates predictions comparable to a mean estimator.
For the activation, tuning and binary classification task, the task itself depends on the activation/tuning/threshold temperature T0, defined as:
The final scores of the reservoir for each task were calculated for T0s between 30.75â°C and 51â°C at 1.5â°C intervals, and with k determining the sharpness of the activation and tuning tasks. The tuning task has been multiplied by a factor 4 to represent a maximum output of 1 instead of 0.25.
A Jupyter notebook reproducing the analysis is provided (notebooks/2-temperature-processing.ipynb).
Light-switch tasks
The experiment was conducted at a constant flow rate of 800âµlâhâ1, with a fixed peptide input and the temperature maintained at 30â°C. The reactor was first allowed to reach steady state by running the system for 1âh without any light exposure. Following this, 17 different light-pulse periodicities (from 30âs to 600âs) were applied, keeping a constant cumulative light exposure. Per periodicity, two repeats were collected over a total of 30âmin (15âmin per repeat) to obtain duplicate measurements. After each pulse, the flow experiment was continued without light exposure for an additional 30âmin to allow the system to return to its starting state before proceeding with the next pulse condition. The reservoir output was analysed using HPLC (Supplementary Information, section 3.1.2). The binary classification task was assessed similar to the non-linear classification tasks and the temperature decision-making tasks by repeated stratified L5O-CV with 100 repeats and calculating the average Φ accuracy.
A Jupyter notebook reproducing the analysis is provided (notebooks/3-light_sensor.ipynb).
Data availability
The data supporting the findings of this study are available within the article and its Supplementary Information. Processed data are available via Github at https://doi.org/10.5281/zenodo.14906651 (ref. 36). Source data are provided with this paper.
Code availability
Jupyter notebooks for working with the datasets as described in Methods, allowing reproduction of all reservoir computation and reservoir sensing analyses, are provided via Github at https://doi.org/10.5281/zenodo.14906651 (ref. 36).
References
Nurse, P. Life, logic and information. Nature 454, 424â426 (2008).
Azeloglu, E. U. & Iyengar, R. Signaling networks: Information flow, computation, and decision making. Cold Spring Harb. Perspect. Biol. 7, a005934 (2015).
Schadt, E. E. Molecular networks as sensors and drivers of common human diseases. Nature 461, 218â223 (2009).
Milo, R. et al. Network motifs: simple building blocks of complex networks. Science 298, 824â827 (2002).
Ghosh, S. et al. Exploring emergent properties in enzymatic reaction networks: design and control of dynamic functional systems. Chem. Rev. 124, 2553â2582 (2024).
Semenov, S. N. et al. Rational design of functional and tunable oscillating enzymatic networks. Nat. Chem. 7, 160â165 (2015).
Howlett, M. G., Engwerda, A. H. J., Scanes, R. J. H. & Fletcher, S. P. An autonomously oscillating supramolecular self-replicator. Nat. Chem. 14, 805â810 (2022).
Harmsel et al. A catalytically active oscillator made from small organic molecules. Nature 621, 87â93 (2023).
Lobato-Dauzier, N. et al. Neural coding of temperature with a DNA-based spiking chemical neuron. Nat. Chem. Eng. 1, 510â521 (2024).
Barkai, N. & Leibler, S. Robustness in simple biochemical networks. Nature 387, 913â917 (1997).
Engelhart, A. E., Adamala, K. P. & Szostak, J. W. A simple physical mechanism enables homeostasis in primitive cells. Nat. Chem. 8, 448â453 (2016).
Lillacci, G., Aoki, S. K., Gupta, A., Baumschlager, A. & Khammash, M. A universal rationally-designed biomolecular integral feedback controller for robust perfect adaptation. Nature 570, 533â537 (2019).
Qian, L., Winfree, E. & Bruck, J. Neural network computation with DNA strand displacement cascades. Nature 475, 368â372 (2011).
Chen, Y.-J. J. et al. Programmable chemical controllers made from DNA. Nat. Nanotechnol. 8, 755â762 (2013).
Woods, D. et al. Diverse and robust molecular algorithms using reprogrammable DNA self-assembly. Nature 567, 366â372 (2019).
Okumura, S. et al. Nonlinear decision-making with enzymatic neural networks. Nature 610, 496â501 (2022).
Lv, H. et al. DNA-based programmable gate arrays for general-purpose DNA computing. Nature 622, 292â300 (2023).
Evans, C. G., OâBrien, J., Winfree, E. & Murugan, A. Pattern recognition in the nucleation kinetics of non-equilibrium self-assembly. Nature 625, 500â507 (2024).
Baltussen, M. G., Jong, T. J. de, Duez, Q., Robinson, W. E. & Huck, W. T. S. Chemical reservoir computation in a self-organizing reaction network. Nature 631, 549â555 (2024).
Cucchi, M., Abreu, S., Ciccone, G., Brunner, D. & Kleemann, H. Hands-on reservoir computing: a tutorial for practical implementation. Neuromorphic Comput. Eng. 2, 032002 (2022).
Klumpe, H. E., Garcia-Ojalvo, J., Elowitz, M. B. & Antebi, Y. E. The computational capabilities of many-to-many protein interaction networks. Cell Syst. 14, 430â446 (2023).
Genot, A. J., Fujii, T. & Rondelez, Y. Computing with competition in biochemical networks. Phys. Rev. Lett. 109, 208102 (2012).
Cai, H., Zhang, X., Qiao, R., Wang, X. & Wei, L. Efficient computation by molecular competition networks. Phys. Rev. Res. 6, 033208 (2024).
Zhang, S. X. et al. Stochastic neuropeptide signals compete to calibrate the rate of satiation. Nature 637, 137â144 (2025).
Pogodaev, A. A., Fernández Regueiro, C. L., JakÅ¡taitÄ, M., Hollander, M. J. & Huck, W. T. S. Modular design of small enzymatic reaction networks based on reversible and cleavable inhibitors. Angew. Chem. Int. Ed. 58, 14539â14543 (2019).
Asano, M., Nio, N. & Ariyoshi, Y. Inhibition of prolyl endopeptidase by synthetic peptide fragments of human β-casein. Agric. Biol. Chem. 55, 825â828 (1991).
Baltussen, M. G., van de Wiel, J., Regueiro, C. L. F., JakÅ¡taitÄ, M. & Huck, W. T. S. A Bayesian approach to extracting kinetic information from artificial enzymatic networks. Anal. Chem. 94, 7311â7318 (2022).
Jordan, J. D., Landau, E. M. & Iyengar, R. Signaling networks: the origins of cellular multitasking. Cell 103, 193â200 (2000).
Koseska, A. & Bastiaens, P. I. Cell signaling as a cognitive process. EMBO J. 36, 568â582 (2017).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436â444 (2015).
Kriegeskorte, N. & Wei, X.-X. Neural tuning and representational geometry. Nat. Rev. Neurosci. 22, 703â718 (2021).
Wimberger, L., Andréasson, J. & Beves, J. E. Basic-to-acidic reversible pH switching with a merocyanine photoacid. Chem. Commun. 58, 5610â5613 (2022).
Pandi, A. et al. Metabolic perceptrons for neural computing in biological systems. Nat. Commun. 10, 3880 (2019).
Chen, Z. et al. A synthetic protein-level neural network in mammalian cells. Science 386, 1243â1250 (2024).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825â2830 (2011).
Baltussen, M. G. Enzymatic competition reservoir. Zenodo https://doi.org/10.5281/zenodo.14906651 (2025).
Acknowledgements
We dedicate this publication to the memory of Professor Roeland Nolte, who inspired us to explore information processing in molecular systems. We thank the Radboud TechnoCentre, specifically A. de Kleine, for the design and manufacturing of the reactors. We thank M. Derks for his help and work in the design of the syringe pumps, temperature stage and flow set-up. This project has received funding from the European Unionâs Horizon 2020 research and innovation programmes under grant agreement number 833466 (ERC Adv. Grant Life-Inspired), the Dutch Ministry of Education, Culture and Science (Functional Molecular Systems, Gravitation programme 024.001.035) and the Dutch Research Council (grant OCENW.KLEIN.348), and the Australian Research Council (DP220101847).
Author information
Authors and Affiliations
Contributions
W.T.S.H., S.G. and M.G.B. conceived the research. S.G., R.H. and Q.D. developed the experimental methodology. S.G., A.C.K. and R.H. performed bead synthesis and enzymatic essays. S.G., M.G.B. and A.C.K. designed the experiments. S.G., A.C.K. and A.T.T. performed the experiments. Q.D. and M.G.B. wrote the data analysis software. M.G.B. performed analysis and trained the models. M.H.C. and J.E.B. synthesized and analysed the merocyanine dye. S.G., M.G.B. and W.T.S.H. wrote the original manuscript, and all authors discussed results and commented on the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Chemistry thanks Zibo Chen, Irene Otero-Muras and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Information
Supplementary Figs. 1â17, Discussion and Tables 1â5.
Source data
Source Data Fig. 1
HPLC chromatograms corresponding to Fig. 1e.
Source Data Fig. 2
Steady state output data from ESI-MS.
Source Data Fig. 3
Steady state output data from ESI-MS.
Source Data Fig. 4
Peak integral data from HPLC chromatograms.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ghosh, S., Baltussen, M.G., Knox, A.C. et al. A recursive enzymatic competition network capable of multitask molecular information processing. Nat. Chem. (2025). https://doi.org/10.1038/s41557-025-01981-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41557-025-01981-y






