Introduction

Major depressive disorder (MDD) is a highly prevalent and debilitating condition. Still, 30% of participants with MDD do not respond to treatment, highlighting the need for novel treatment options. In the search for new treatment modalities, the conceptualization of MDD as a condition of chronic stress is of interest [1,2,3]. Specifically, participants with MDD report excessive subjective levels of chronic stress and show corresponding physiological alterations, including autonomic dysregulation. This is evident in reduced heart rate variability (HRV) and impaired responsivity to acute stress on both emotional and physiological measures (e.g., cortisol resistance, higher heart rate, blunted autonomic reactivity) [4, 5]. Hence, specifically targeting biological pathways affected by chronic stress may offer new avenues for MDD treatment.

The vagus nerve (VN) - the principal parasympathetic component of the autonomous nervous system - is a key player in the stress response and recovery. Connecting the brain stem and the inner organs, the VN facilitates the bidirectional communication necessary for rapid adaptations after an acute stress exposure. VN efferents influence cardiac reactivity and inflammatory responses via multiple pathways including the cholinergic anti-inflammatory reflex [6,7,8]. Conversely, interoception and the emotional (and attentional) responses are integrated through strong projections that exist between the VN, brain stem nuclei (nucleus tractus solitarii and dorsal-motor nucleus), the hypothalamus, mesolimbic regions, and higher cortical areas including the insular and ventromedial prefrontal cortices [9,10,11,12,13,14]. Hence, impaired vagus function, indicated by low cardiac parasympathetic activity (CPA) [15] significantly affects the stress response and homeostasis. Currently - with the exception of the invasive ultrasound-guided microneurography - there is no method to assess vagal activity directly for humans. The current non-invasive standard to approximate CPA is through the root mean square of successive differences (RMSSD) or through Respiratory Sinus Arrythmia, derived from an electrocardiogram. RMSSD is a widely used time-domain measure of heart rate variability (HRV) that is thought to specifically index vagally mediated cardiac–parasympathetic activity during normal breathing patterns. Shaffer and Ginsberg [16] highlight that RMSSD is particularly well-suited for short-term recordings (~5 min or less) because it is relatively insensitive to respiratory influences (see also [15]) and reflects fast time-scale fluctuations in heart rate, making it an indicator of parasympathetic tone.

Given the VN’s key role in the stress response and recovery, it is noteworthy that (invasive) vagus nerve stimulation is an approved treatment for treatment-resistant cases of MDD. Indeed, VN stimulation (VNS) has shown promising results in reducing depressive symptoms, but the underlying mechanisms for this effect are not understood.

With the introduction of non-invasive techniques, evidence on the physiological mechanisms is accumulating. Unlike invasive VNS that activates both efferents and afferents in the cervical trunc, transcutaneous auricular VNS (taVNS) selectively stimulates sensory vagal afferents Iin the outer ear. This targeted approach has been shown to 1) elicit activation of vagal brain stem nuclei and the downstream projections in the brain [17,18,19], 2) influence learning, motivation and social behaviour [20,21,22], and 3) exert anti-inflammatory effects under specific conditions [23]. These latter effects are thought to be mediated predominantly through central mechanisms, as downstream effects in peripheral organs such as the heart or spleen result from brainstem activation rather than direct efferent stimulation.

taVNS has been demonstrated to influence emotional regulation during acute stimulation. Specifically, taVNS significantly enhanced positive emotions in healthy participants who exerted effort on low-reward tasks, and the effect was greater in participants with lower baseline levels of positive affect [24]. The impact of taVNS on peripheral inflammation remains inconclusive. While rodent studies have consistently demonstrated inflammation-attenuating effects of VNS [25], our recent meta-analysis of human studies [23] did not show a consistent effect, except for studies involving an acute inflammatory challenge. Methodological heterogeneity across studies and large diversity in participant samples complicates interpretation.

Finally, despite the crucial role of the autonomic system in cardiac stress reactivity, few studies (i.e., [26, 27]) have investigated the impact of taVNS on heart rate and heart-rate variability during acute stress, and to the best of our knowledge, no studies have investigated this in MDD. While generally positive effects of taVNS on depressive symptoms have been reported [28], studies on the mechanism of therapeutic action in MDD remain scarce.

Considering that (1) MDD is a chronic stress condition with reduced HRV [29], and reduced autonomic responses to acute stress [5] and (2) taVNS can selectively activate vagal afferents and modulate the central autonomic network, we hypothesized that taVNS would increase acute stress reactivity, restore blunted HRV, and reduce inflammatory response to acute stress in participants with MDD and healthy controls. In addition, given the previously reported effect on positive emotions, we aimed to explore if positive and negative emotions in response to a stress paradigm are influenced by taVNS. After analyzing the groups by diagnosis, we performed a stratification of participants by RMSSD, as a biological marker of cardiac vagal activity, and hypothesized that particularly participants with low RMSSD will benefit from taVNS compared to those with high RMSSD.

Methods

Ethical approval and pre-registration and trial design

The single-blind randomized controlled trial “MODULATE Depression” was approved by the Ethical Committee of the Goethe University Frankfurt (Approval number 2021-48) and pre-registered at the German Clinical Trials Register (Identifier: DRKS00024823) and can be retrieved in the WHO Clinical Trial Registration Platform.

All participants provided written informed consent followed by a screening visit, and two test-days. On the test days, sham stimulation or transcutaneous auricular vagus nerve stimulation (taVNS) were administered in a counter-balanced order (see Fig. 1.). On the screening day, participants were assessed for eligibility using the Mini-Neuropsychiatric International Interview (MINI 7.0.2 for DSM-5) and the Montgomery Asberg Depression Rating Scale (MADRS), completed various questionnaires (see below for details) and registered in the electronic patient-reported outcome system (P1vital® ePRO system). If eligible, participants were invited to the two identical test days, which took place in the morning between 07.00 am and 12.00 am, with similar starting times for both test days, and lasted approximately 2 h. Participants were required to fast overnight (drinking water was permitted). On arrival of the test day, participants provided a blood sample in fasted state, filled out questionnaires, and were then asked to complete three tasks on the computer: two reward learning tasks (results of the reward learning task are reported elsewhere) and a modified version of the Montreal Imaging Stress Task (mMIST). The mMIST consists of 3 segments (baseline, stress exposure and recovery, each segment lasted 5 min). During the mMIST, an ECG was recorded, starting approximately 3 min prior to the baseline of the task, while participants were seated in front of the computer, to allow for some adaptation to the recording system. After the stress task, another blood sample was taken, approximately 45 min after the stress task onset. In addition, before, shortly after, and 20 min after completion of the stress task, cortisol samples were obtained using Salivette® (Sarstedt). We analysed cortisol levels using a standard ELISA; however, because our stress task occurred in the morning, our results showed a decrease across all participants and the stress condition, likely driven by the cortisol awakening response, rendering the findings uninterpretable. Therefore, we do not report cortisol results.

Fig. 1: Study Design.
figure 1

Left panel: A Sham stimulation at the earlobe. B Verum stimulation at the right cymba conchae. Right panel: Illustration of cross-over, single-blind, randomized, sham-controlled trial. Stimulation was started after the blood sample (−60 min) and ended after the stress task. VAS scales were collected after the baseline (0 min), after stress exposure (5 min) and after recovery (5 min). A final blood sample was collected 40 min after stress exposure. Timescales are approximate and did vary according to time needed for completing experimental procedures such as VAS scales. Image created with Biorender.com.

Participants

Participants were included at the Department of Psychiatry, Psychosomatic Medicine and Psychotherapy at the University Hospital Frankfurt between 24/05/2021 and 27/07/2023. Only clinically diagnosed MDD patients fulfilling the DSM-5 criteria were included in the MDD-group. Potential comorbid diagnoses were assessed with the MINI Diagnostic Interview according to DSM-5 criteria and assessed by trained raters. All participants in the healthy control group had to have an inconspicuous MINI result. All participants were between 18–65 years old. They were excluded if they satisfied the criteria for a bipolar disorder, schizophrenic psychosis, substance abuse disorder or dementia, neurological, autoimmune, rheumatological, neoplastic, dermatologic, or metabolic disorders and for healthy controls only, if they had a previous or current diagnosis of depression. Furthermore, participants with depression induced by substance abuse were excluded. Given that participants with MDD frequently show a high BMI, we excluded only those with a BMI higher than 35 (severe obesity) both for controls and participants with MDD to reflect variability of a naturalistic sample. Participants excluded if they were pregnant or lactating, or had an acute infection as evidenced by CRP levels above 10 mg/L on testday 1 or 2. In case of any respiratory infection, all procedures were postponed until the participant was symptom-free for at least 48 h. For participants with MDD, there was no requirement for treatment resistance.

Intervention

Transcutaneous vagus nerve and sham stimulation

TaVNS was applied at the cymba conchae of the right ear, while sham stimulation was applied with identical parameters at the earlobe (see Fig. 1). Stimulation of the right rather than the left ear was based on preclinical evidence indicating that the right VN has stronger anatomical and functional connections to limbic structures involved in reward and emotional processing [30]. Although both left and right VNS influence cardiac activity, right-sided stimulation has historically been avoided due to the denser innervation of the sinoatrial node by the right VN and a theoretical risk of bradycardia. However, no empirical evidence to date supports safety concerns with transcutaneous right- or bilateral VNS [31] and increases in HRV have been reported following stimulation on both sides. Importantly, in contrast to implanted cervical VNS, taVNS is thought to produce no direct efferent effects on the heart or spleen, sites relevant to cardiac regulation and immune modulation via the cholinergic anti-inflammatory pathway. Therefore, observed physiological changes, including effects on HRV, are likely driven by afferent stimulation of auricular vagal fibers projecting to the brainstem, which then exert downstream influence through central autonomic and neuromodulatory networks. This distinction highlights that taVNS engages brain circuits differently than cervical VNS, where off-target efferent effects cannot be ruled out.

Recent findings further reveal striking lateralization in VN connectivity with the dopaminergic midbrain [31]. Specifically, right cervical VNS appears to engage midbrain dopaminergic pathways more effectively than left-sided stimulation. It has been proposed that right-sided VNS may preferentially activate plasticity-promoting neuromodulatory systems. These findings position right-sided VNS as a promising strategy for enhancing dopamine-dependent processes. Given that one aim of our study was to investigate reward-related processes (reported elsewhere), we opted for right-sided stimulation.

All participants received both interventions (e.g., taVNS either at day 1 or 2 and sham on the other day) in a randomized order. The NEMOS® device for research use (tVNS Technologies, Erlangen) was used, with a pulse duration of 250µs, a pulse frequency of 25 Hz, and a stimulation cycle duration of 32 s on and 28 s off. The stimulation intensity was minimally 100 µA and maximally 5000 µA and was ramped up during the 32 s stimulation cycle. Stimulation intensity was individually determined for each participant and each test day, by increasing the intensity until the participant reported an uncomfortable sensation and then brought down in 100 µA steps until the participant reported a tingling sensation that was not painful. Individual thresholds were recorded. The median stimulation intensity tolerated for sham was 850 µA (range 300–5000 µA) for tVNS it was 800 µA (200–3900 µA). The stimulation started after the blood sample was taken and the baseline assessments (e.g., BDI, demographic questionnaires) had taken place, approximately 10 min before the first reward learning task was initiated. Stimulation was applied continuously for approximately 90 min during the experimental session. Given that stimulation started while participants were filling out questionnaires, slight deviations, corresponding to the time participants required to fill out questionnaires, are possible.

ECG recording and processing

An electrocardiogram (ECG) was recorded using a 10-channel isolated bioamplifier (NeXus-10 MKII, Mind Media BV, Netherlands). Three disposable electrodes (Kendall™, H66LG electrodes) in a modified lead II configuration were placed on the chest to record the ECG at a sampling frequency of 1024 Hz. BioTrace + Software (Mind Media BV, Netherlands) was used to check the signal and export the data. For the modified lead II recording electrodes were placed on the right and left upper chest (approximating the arms), and the ground electrode was placed on the lower left abdomen. ECG signal processing was performed with Kubios Pro Scientific Analysis software [32], which has an inbuilt R peak detection and inbuilt artifact and noise correction. A visual inspection of raw data was performed per segment of interest (the 5-min baseline, stress and recovery). Noise detection was used with the standard settings, and unclear beats were removed manually. Segments with more than 5% of beats in need of correction were excluded from the analysis. For the present analysis we extracted heart rate (HR) in beats per minute, root mean square of successive differences between normal heartbeats (RMSSD) in ms, the absolute power of high frequency heartrate variability (HF)-HRV (0.15–0.4 Hz), and low frequency (LF)-HRV (0.04–0.15 Hz) both fast Fourier transformed. We opted for these measures given our interest in cardiac vagal modulation, which can be approximated by RMSSD. For the sake of comparability with other articles reporting frequency-domain rather than time-domain measures, we also report the HF-HRV. Furthermore, for completeness, we also report LF-HRV, and HR was selected as a general index of cardiac output and physiological homeostasis [33].

Stratification based on RMSSD

After conducting the analysis separated by diagnosis (MDD vs. HC), we performed a post-hoc stratification based on the biological parameter RMSSD. To stratify the participants, we used a median split for RMSSD in milliseconds (ms) across all participants to obtain baseline RMSSD groups. We used the 5 min baseline of the sham intervention for stratification purposes. The RMSSD median used for stratification was 39.14 ms (for the low RMSSD group the range was 5.51 ms - 38.75 ms and for the high group 39.14 ms - 150.80 ms). If no data was present for this timepoint, missing values for RMSSD (n = 11) were imputed using all ECG data with chained random forests as implemented in the missRanger package [34] in R. We opted for RMSSD instead of HF-HRV or respiratory sinus arrhythmia because it is slightly less dependent on respiration [35] and respiration was not measured in this study. Given the known influence of tricyclic antidepressants (TCA) on HRV, we excluded three participants on TCA in the analysis stratified by RMSSD group. The results in- and excluding participants on TCA for both stratification approaches (1. stratification based on diagnosis, and 2. stratification based on RMSSD group) are provided in the supplementary tables.

Blood sample preprocessing and analysis

Blood samples of the fasted participants were collected by venipuncture upon arrival and 45 min after the stress task. The blood draw was performed in a fasted state to reduce variability for the inflammatory protein analyses, given that postprandial levels of proteins can differ, and to standardize metabolic states between testing days [36, 37]. Blood was collected in K3EDTA Monvettes (S-Monovette® K3 EDTA) and immediately transferred to cooling containers. The samples were then spun down (2000g, 10 min) in a precooled centrifuge (4 °C) no later than 30 min after blood sampling. The resulting plasma was then aliquoted into Eppendorf tubes and stored at −70 °C until further analysis. To obtain levels of high sensitive (hs) IL-6 and hs TNF-α, we used enzyme-linked immunosorbent assay with precoated kits (IL-6: Human IL-6 High Sensitivity ELISA, Art.Nr.: BMS213HS; TNF-α: Human TNF-α High Sensitivity ELISA Kit, Art.Nr.: BMS223HS and BMS223-2HS both, Invitrogen/ThermoFisher Scientific) as per manufacturer’s instructions. Samples were measured in duplicate, blank values were subtracted, and a standard curve was obtained for each plate using Microsoft excel, optical densities were converted to concentrations in pg/mL. If the sample duplicate’s coefficient of variation deviated more than 15%, they were repeated. Values below the detection level were replaced by the level of detection (LOD) divided by 2. For TNF-α analytical sensitivity was 0.13 pg/mL, for IL-6 0.03 pg/ml. For TNF-α, of the 361 measurements, 47 were below level of detection and were replaced by the LOD divided by 2 (i.e., 0.13/2). No levels were below the detection level for IL-6. Concentrations ranged from 0.35 pg/mL - 55.70 pg/mL for IL6, and from 0 (undetectable levels) - 8.37 pg/mL for TNF-α.

Sample size

The sample size was determined with a predefined power calculation, for a repeated measures Analysis of Variance (ANOVA) with a within and between effects interaction. A sample size of 76 was determined, given an assumed effect size of 0.25, alpha 0.05, and a reasonable power of 0.95, with a correlation among measures of r = 0.3, 2 groups and 2 measurements (i.e., taVNS/sham) (calculated with G*Power 3.1.9.6). Given the potential dropout effects, and assuming that around 20% of values detected with ELISAs are below the threshold of detection, we aimed to recruit 100 participants at T1 (with 1:1 group distributions). A simulation analysis showed that with 100 participants, a smaller effect size of just below f = 0.152 would still be detected with 80% power (see Supplementary Fig. 1).

Randomization

Participants were randomized to receive either sham stimulation or verum stimulation based on a pre-established list, composed by a random sequence in Excel. Participant IDs were linked to the allocation and could not be changed. The list was assembled in three blocks of 50 participants. Researchers were not blinded to the allocation, but participants were blinded, i.e., they were not informed of which intervention they received on each day. The random allocation sequence was generated by the senior scientists, while allocation was performed by the study staff. The procedure of sham stimulation and taVNS were identical, except for the location of stimulation. Care was taken to spend an equal amount of time on the establishment of the stimulation intensity thresholds and experimental procedures in both intervention arms.

Questionnaires

The Mini-Neuropsychiatric International Interview (MINI, version 7.0.2 for DSM-5) [38] was assessed to exclude psychiatric comorbidities. The MINI is a short, well-validated, semi-structured diagnostic interview for major psychiatric disorders. Items refer to the last month and/or the entire life and are rated on a yes/no basis, allowing us to assess the absence or presence of psychiatric disorders, based on DSM-5 criteria. The Montgomery Asberg Depression Rating Scale (MADRS) is a well-validated 10-item clinical interview assessing symptoms of depression in the past 7 days, with a cumulative score indicative of the presence of no, light, moderate, or severe depression [39]. Proposed cut-offs are 0–6 (absence of depression), 7–19 (mild depression), 20–34 (moderate depression) and 35–60 (severe depression). In addition, the self-rated Beck’s Depression Inventory (BDI-II) and the Perceived Stress Scale (PSS) [40] were administered. The BDI-II is a 21-item, self-rated scale for depressive symptoms with good sensitivity and specificity [41]. The PSS-10 is a commonly used 10-item scale to assess perceived stress in the past 4 weeks. Scores range from 0 (no stress) to 40 (high stress). The Childhood Trauma Questionnaire (CTQ) was collected to assess childhood trauma [42].

Visual Analogue Scales

After the baseline, after the stress and after the recovery phases, participants rated eleven items on a 100-mm Visual Analogue Scale (VAS). Two composite scores were calculated for the items of the VAS scale (one containing negative emotions, the other one positive emotions, consisting of the sum of the items (positive: energetic, motivated, happy, in control; negative: scared, alert, stressed, powerless, tense. Additionally, given the experimental stress exposure, we explored the answers on the VAS item “stressed” separately, hereafter named “stress perception”.

Experimental stress task

To induce stress, we used an adapted, customized, and computerized variation of the mental arithmetic stress task similar to the Montreal Stress Imaging task (MIST) [43]. The task includes a 5 min baseline, a 1 min-control/demonstration task (i.e., a mental arithmetic without stressful elements), a 5min mental arithmetic task with stressful elements (stress phase), and a 5min recovery phase. In the current version, the following elements are included to induce stress: social comparison (e.g., playing against an “imaginative peer” whose progress is monitored by a progress bar on the right side of the screen), adaptive time pressure (i.e., if the participants answered correctly, they would receive less time to answer the next question; if they answered wrong several times, the time limit was increased) and feedback by the researcher (i.e., two times verbal feedback to “answer faster” or “try to answer correctly”). This version is a computerized task, which requires participants to select the right answer on the computer screen without requirement for verbalization and participants were instructed not to talk in order to not disrupt breathing patterns.

Statistical methods

Demographic variables between groups were compared using t-tests, the Wilcoxon test (W), or the Chi-square (χ2) test for proportions. Model assumptions were checked, and if residuals were not normally distributed, variables were transformed as indicated. To assess the VAS score, a composite score was calculated for a negative and a positive component, by summing up relevant items. To assess our main hypothesis, we fitted a linear mixed model (estimated using REML and if necessary nloptwrap optimizer) to predict variables of interest with Intervention (categorical, taVNS vs. sham), RMSSD (categorical, high vs. low RMSSD group; or continuous RMSSD, see Supplementary Tables), Timepoint (categorical, baseline, stress, recovery), Group (categorical, MDD vs HC), Sex (categorical), Age (continuous), Testday (categorical, day 1 vs day 2) and Stimulation Intensity (continuous) (for the analysis comparing MDD vs HC: formula: DV~ Intervention * Group * Timepoint + Sex + Age + Testday + Stimulation Intensity; for the analysis by RMSSD: DV~ Intervention * RMSSD * Timepoint + Sex + Age + Testday + Stimulation Intensity). Testday was added to the model in order to control for any habituation effects to the task. The model included ID as a random effect (formula: ~1 | ID). Models were fitted using lme4 [44] and lmerTest packages [45]. Furthermore, for HRV, we also conducted a model including an Age*Intervention interaction reported separately. A type-III ANOVA table was computed for F and p values for fixed effects using Satterthwaite’s method for the approximation to degrees of freedom. We repeated the same model for post-hoc stratified analyses, where instead of diagnostic group, we used RMSSD-group (low vs. high). The insignificant 3-way interaction for RMSSD was compared with post-hoc tests, given the specific interest to our study design. Where reported, post-hoc tests were computed as contrasts with the emmeans package [46] and if applicable to the contrasts, the standard inbuilt Tukey method for comparing a family of tests of the emmeans package was automatically applied. Additionally, we also investigated antidepressant medication status yes/no (here, most participants were taking SSRIs or SNRIs). Results did not change, and antidepressant medication was not a significant predictor for HRV at the exception of LF-HRV (see Supplementary Information) therefore the models are reported without antidepressant intake as covariate. An overview of the association between antidepressant medication class (none, SSRI, SNRI, TCA, other) and baseline HRV parameters can be found in Supplementary Table 1D).

All statistical analyses were performed with RStudio (Version 4.3.1.) and graphical representations were made in Graphpad Prism, based on summary data extracted from R [47].

Results

In the period from May 2021 to August 2023, 146 participants were assessed for eligibility, of which 26 were excluded or piloted with a different study design, 120 were randomized, 110 completed their first intervention and 101 participants completed their second cross-over phase (Fig. 2). The trial was ended after the target of included participants was reached.

Fig. 2: Consort Flow Diagram of the MODULATE study.
figure 2

Ninety-four percent (61/65) of the randomized participants completed a first test day (T1) with taVNS versus 89 percent (49/55) with a sham stimulation; the difference was not significant (p = 0.543). At the second test day (T2) 84.61 (55/65) of randomized participants received the planned sham stimulation and 83.64% (46/55) received the planned verum stimulation (p > 0.999). In total, 51 controls and 59 participants with MDD were randomized to the intervention and completed the first session (T1) and were thus used for analysis.

Baseline data

For all presented analyses, we compared participants with MDD versus healthy controls (HC) (Table 1). Participants with MDD were older compared to the control group, but had a similar BMI, sex, and smoking status distribution. There was no difference between education level or contraception intake between groups. All questionnaires (BDI-II, MADRS CTQ, PSS) differed significantly.

Table 1 Demographic Characteristics of Participants by Diagnosis.

Results of blinding

Most participants in both intervention arms (sham and taVNS) believed that they were receiving the taVNS stimulation (84% of all cases, 81% during sham, 86% during taVNS X2 = 1.00, p = 0.317) and stimulation intensity was comparable for both interventions (sham median: 850 µA, tVNS median: 800 µA, Wilcoxon test (W) = 5957.5, p = 0.094).

Planned analysis

Diagnosis was not associated with effect of taVNS

To assess if the efficacy of taVNS on our outcomes was modified by diagnosis, we conducted analyses comparing the two groups (MDD vs. HC), adjusted for covariates (see Supplementary Table 1). Timepoint had a significant effect (p < 0.001) on subjective stress levels, which increased during the stress task and reduced following recovery. We found a main effect of group (p < 0.001): Participants with MDD reported higher levels of stress compared to healthy controls across all time points. For emotions, stress perception and HRV, we found a significant main effect of group and timepoint, but no main effect for intervention nor for the interaction between diagnosis and intervention (taVNS vs. sham * MDD vs HC, see Table 2 for the main effect of Intervention and the Intervention*Group interactions). Full models can be found in Supplementary Table 1. Due to the absence of intervention effects, in the following, we decided to apply a post-hoc stratification based on RMSSD group instead of diagnosis, to assess whether those with lower RMSSD would profit more from taVNS than those with higher RMSSD, regardless of diagnosis. Of note, we opted for a median split because we believe that it has clinical utility but also performed analyses with RMSSD as a continuous predictor, which yielded comparable and highly significant results, and is reported in the supplementary material.

Table 2 Results of Planned analysis comparing MDD and HC.

Demographics based on RMSSD group

When split by RMSSD, 55 participants were in the high and 54 in the low RMSSD group. In the low RMSSD group, 39 of 54 (72%) were MDD participants, in the high RMSSD group, 19 of 55 (35%) were MDD participants (see Supplementary Table 2A for demographics by RMSSD). The low RMSSD group had higher depression scores and higher chronic stress levels, which are likely attributable to the higher percentage of participants with MDD in the low RMSSD group (Supplementary Table 2B and C), but childhood trauma was also higher in the low RMSSD control group (p = 0.033), see Supplementary Table 2B. Baseline RMSSD during sham as continuous variable correlated negatively with MADRS values (rho = −0.33, p < 0.001), but not with TNF-α or IL-6 (IL-6: rho = −1.08, p = 0.313, TNF: rho = −0.081, p = 0.458).

The impact of taVNS on stress perception depends on RMSSD

The interaction of intervention (taVNS vs. sham) and RMSSD group was significant (p = 0.024). Follow-up tests showed that for those with high RMSSD, taVNS did not affect stress levels (t = 0.06, p = 0.549), but significantly increased stress perception in those with low RMSSD (t = −2.63, p = 0.009). No other interactions were significant.

Timepoint had a significant main effect for negative emotions (p < 0.001), but not for positive emotions(p = 0.676). The main effect of group was significant in both models (both p < 0.001), but neither the grouping by RMSSD, nor the intervention had an effect on either emotion (see Supplementary Table 3A for fixed effect estimates of the full models and Fig. 3A-I. for graphical representations). Analyses with RMSSD as a continuous score yielded similar results (Supplementary Table 3B).

Fig. 3: Impact of acute stress and taVNS on emotional reactivity.
figure 3

A-C Effects on Stress Perception. All values were transformed with Tukey ladder of power transformation for statistical analyses but are presented as untransformed scores. From left to right: A. Main effect of acute stress, B. Main effect of Group, C. 2-way interaction effect of intervention and RMSSD group. D-F Effects on Negative Emotions (Composite Score). From left to right: D. Main effect of acute stress, E. Main effect of Group, F. 2-way interaction effect of intervention and RMSSD group, G-I Effects on Positive Emotions (Composite Score). From left to right: G. Main effect of acute stress, H. Main effect of Group, I. 2-way interaction effect of intervention and RMSSD group, Significance:***:p < 0.001, ** p < 0.01, *p < 0.05, #p < 0.1, ns:p > 0.1.

The impact of taVNS on heart rate variability depends on RMSSD

To assess the impact of RMSSD group and taVNS on the autonomic stress response, we investigated RMSSD and other measures of heart rate variability (HR, HF-HRV, LF-HRV and LF/HF ratio).

Main effects for timepoint and RMSSD group were significant (all p < 0.05) for all measures, only for the LF-HF ratio, timepoint was not significant (p = 0.149), see Supplementary Table 4A. We found an interaction of RMSSD group and intervention for RMSSD (p < 0.001), HF-HRV (p = 0.002), LF-HRV (p = 0.026), but not HR (p = 0.121) or LF/HF ratio (p = 0.127) (see Supplementary Table 4B for post-hoc comparisons of interactions).

Follow-up tests showed that RMSSD and HF-HRV numerically increased in the low RMSSD group during taVNS (sham-taVNS; RMSSD: t = −1.96, p = 0.050, HF-HRV t = −1.84, p = 0.067), but taVNS reduced RMSSD and HF-HRV in the high RMSSD group (sham-taVNS; RMSSD t = 3.58, p < 0.001; HF-HRV: t = 2.79, p = 0.006), see Fig. 4B. Furthermore, for all variables, except LF-HRV and LF/HF ratio, a significant RMSSD*timepoint interaction occurred (all P < 0.05, see Supplementary Table 4A and 4B). Post-hoc tests for RMSSD showed that participants in the low RMSSD group did not react to stress (baseline-stress: RMSSD: t = 2.23, p = 0.067), compared to strong reactivity in the high RMSSD group (baseline-stress: RMSSD: t = 7.08, p < 0.001), while both had significant recovery, albeit numerically stronger in the high RMSSD group (high: t = −5.15 vs. low: t = −2.98), see Fig. 4A. Results with RMSSD used as a continuous predictor were comparable to the analysis using a factor for HF-HRV and RMSSD (see Supplementary Table 4C for statistics).

Fig. 4: Impact of acute stress and taVNS on heart rate variabitlity (RMSSD in ms).
figure 4

Results are presented as raw values (in ms). A Interaction of Stress and RMSSD group; B Interaction of intervention with RMSSD group. C 3-way interaction for Stress, Intervention and RMSSD group. Note that scales differ between (A,B and C), to be able to display all error bars appropriately. Significance:***:p < 0.001, ** p < 0.01, *p < 0.05, #p < 0.1, ns:p > 0.1.

Furthermore, given the broad age range of participants (18–65), we tested whether taVNS effects on HR and HRV were moderated by age. Significant intervention*age interactions were observed for HF-HRV and RMSSD (Supplementary Table 4D). Estimated marginal means revealed clear age-dependent effects for RMSSD and HF-HRV: at age 20, RMSSD/HF-HRV were higher under taVNS than sham, whereas at ages 40–60, higher values were observed in the sham condition (Supplementary Table 4E). Importantly, in models including the intervention*age interaction, the intervention*RMSSD interaction remained significant, indicating that, although age modulates responsiveness, taVNS effects are also robustly influenced by individual baseline RMSSD rather than being solely driven by age differences.

Exploratory post-hoc analyses 3-way interaction for RMSSD shows recovery of blunted stress response

Given our interest in the treatment effect, we also conducted exploratory contrasts for the non-significant three-way interaction of RMSSD. Contrasts showed that while during sham stimulation, there was indeed a blunted reactivity and recovery for RMSSD in the low RMSSD group (sham; baseline-stress: t = 0.19, p = 0.981, stress-recovery: t = −0.82, p = 0.692), this was restored during taVNS (baseline-stress: t = 2.96, p = 0.009, stress-recovery: t = −3.39, p = 0.002). In the high RMSSD group both sham and taVNS showed significant reactivity and recovery during sham (sham; baseline-stress: t = 4.79, p < 0.001, stress-recovery: t = −3.52, p = 0.001) and taVNS (baseline-stress: t = 5.25, p < 0.001, stress-recovery: t = −4.29, p < 0.001), see Fig. 4C. It should be noted that this analysis was exploratory in nature and should be interpreted with caution, as our sample size may have been too low to detect reliable effects.

Effect of taVNS on inflammatory proteins is opposite dependent on RMSSD

Overall, as expected in a population without overt inflammatory conditions, levels of IL-6 and TNF-α were low. No main effects of RMSSD group, or timepoint occurred for IL-6 or TNF-α (see Supplementary Table 5A for fixed effects). For TNF-α, an intervention*RMSSD group interaction occurred (p = 0.038). Post-hoc comparisons showed that regardless of group or timepoint, in those with higher RMSSD, TNF-α was significantly higher during taVNS than during sham stimulation (sham-taVNS: t = −2.09, p = 0.038) while in those with low RMSSD, TNF-α was not significantly altered (sham-taVNS: t = 0.70, p = 0.483). See Fig. 5 for an overview. If RMSSD (sham, baseline) was used as a continuous variable, the interaction of RMSSD*Intervention was not significant for TNF-α (p = 0.232), see Supplementary Table 5B.

Fig. 5: Levels of TNF-α and IL-6 according to intervention.
figure 5

Values are back-transformed and were measured in pg/mL. They represent the averaged effect of RMSSD group*Intervention regardless of condition (BL/STR). A. Intervention*RMSSD(low/high) for TNF-α, B. Intervention*RMSSD(low/high) for IL-6. Significance: ***:p < 0.001, ** p < 0.01, *p < 0.05, #p < 0.1, ns:p > 0.1.

Discussion

In this counter-balanced, randomized-controlled study, we aimed to assess whether taVNS could improve the acute stress response in participants with MDD by improving subjective stress perception, enhancing cardiac reactivity and reducing inflammation compared to healthy controls. Contrary to our hypothesis, we found no effect of taVNS on these outcomes, nor any interaction of taVNS and diagnosis. In other words, taVNS had no significant impact on our outcomes, with no difference between MDD participants and HC.

Subsequently, we decided to use post-hoc stratification by a biological marker, RMSSD, to approximate cardiac vagal modulation (with RMSSD as a surrogate marker). We reasoned that participants with low RMSSD, indicating low VN activity, were most likely to profit from taVNS. In the low RMSSD group (regardless of diagnosis), taVNS improved the previously absent cardiac vagal reactivity to stress, increased the perception of stress, but only had a minor, non-significant reductive effect on TNF-α. In contrast, and unexpectedly, if RMSSD was high at baseline, taVNS dampened heart rate variability (HRV), increased HR and significantly increased levels of TNF-α. Furthermore, for HRV-based measures, taVNS had a different effect in younger compared to older participants. Our findings clearly suggest that the effects of taVNS depend on baseline RMSSD and age, suggesting that consideration of both is essential for future studies.

taVNS: No effect on biological outcomes in MDD?

An adequate response to acute stress is fundamental for survival and is tightly regulated by the autonomic nervous system. Persisting exposure to physical or emotional stress, however, can disrupt homeostasis, mitigate basal cardiac parasympathetic activity, and impair the physiological ability to face sudden challenges, as often observed in participants suffering from MDD. VNS provides a targeted approach to directly stimulate the underlying biological pathways that are impaired by chronic stress exposure. Surprisingly, against our hypotheses, there was an absence of a main and/or interaction effects for the intervention in our primary analyses comparing participants with MDD and controls. This could indicate that the non-invasive form of taVNS does not have any acute effects, even in a relatively large sample such as ours, or, that acute stimulation sessions are insufficient to produce these effects. However, our post-hoc stratification revealed a critical feature linked to the response to taVNS: the individual RMSSD at baseline. Only in the low RMSSD group, taVNS improved stress perception and recovered cardiac reactivity. While we expected the strongest effects of taVNS for those with low RMSSD, and somewhat attenuated effects in those with high RMSSD, this was not the case. Contrary to our expectations, we did not find weaker effects in those with high RMSSD, but instead, taVNS had stronger and opposed effects to the anti-inflammatory, HRV-enhancing effects we expected from the literature. This is a crucial insight, as effects in opposed directions in those with high and low RMSSD may lead to null findings for (ta)VNS paradigms, if baseline levels are disregarded.

The absence of a significant modifying effect of MDD on the intervention may be partly attributed to the diagnostic criteria for depression, which currently rely solely on symptomatology and likely encompass various distinct pathophysiological mechanisms. This may explain why the majority, but not all, MDD participants exhibit low RMSSD in our study, which is in line with recent meta-analytic findings [48]. Consequently, RMSSD could serve as a valuable tool for selecting participants based on a biological stratification marker for VNS treatment.

Chronic stress is linked to RMSSD levels

In line with previous research [49], we found that chronic stress and childhood trauma were associated with low RMSSD in our sample. This association has been predominantly reported in clinical populations suffering from chronic stress conditions (i.e., Posttraumatic Stress Disorder, Anxiety Disorders, MDD) [50]. The well-orchestrated response to acute stress starts with an increase in sympathetic activity, catecholamine and cortisol production, energy supply through gluconeogenesis and lipolysis, and cardiac (e.g. increased heart rate and blood pressure) and immune arousal, but is temporally limited and terminated by parasympathetic activation. Yet, sustained stress and the lack of habituation to stressors can lead to deficient RMSSD (i.e., [51]), failing to regulate the aforementioned systems and therefore leading to a chronic elevation of sympathetic activity, catecholamine, and cortisol production and reduced cardiac vagal modulation. As in previous studies [5, 52, 53], we here observed the biological effects of chronic stress often described in the literature: We found that in the low RMSSD group, the baseline level of HR was higher, cardiac stress reactivity was blunted, and baseline levels of perceived stress and depressive symptoms were higher.

Differential effects of taVNS depend on RMSSD: mechanistic considerations

Intriguingly, taVNS influenced cardiac reactivity and recovery for those with low RMSSD. Stimulation of the VN’s sensory afferents at the outer ear is considered to activate the dorsomotor nucleus. This activation relayed through the afferent nucleus (NTS) 54, modulates the parasympathetic projections directed towards the sinoatrial and atrioventricular nodal cells as well as the atrial myocytes [54, 55]. It is thus possible that taVNS can counteract the effects of chronic stress by activating parasympathetic projections. The opposed effects in the high RMSSD group suggest that this mechanism may differ in this group, either by engagement of a different mechanism, overactivation of the projections and/or by activation of the sympathetic rather than parasympathetic nervous system. Indeed, previous studies have discussed the possibility of low-level sympatho-excitation by vagal afferents due to bioelectric stimulation as applied during taVNS [56]. This could provide a potential explanation for the inconsistency reported regarding (ta)VNS effects on heart rate variability, which have shown reductions [57,58,59], increases [60,61,62,63], or no distinguishable effect [64,65,66,67], summing up to a lack of a consistent effect on HRV [68].

Age moderates the effect of taVNS for HRV-based measured

Another intriguing finding was the significant interaction between age and our intervention. Particularly for RMSSD, older participants showed a reduction during taVNS, whereas younger participants showed either increases or no change. This suggests that parasympathetic responsiveness to acute taVNS declines with age, potentially reflecting age-related changes in autonomic flexibility or vagal nerve sensitivity. This finding is in line with consistent evidence showing that HRV and autonomic flexibility decline with age [16, 69]. However, the impact of age on cardiac responses to taVNS has not been systematically examined and warrants further investigation.

No acute effects of taVNS on emotional states

Furthermore, as a treatment for depression, the integration between the autonomic state, interoceptive signals, and emotional states is crucial and is mediated by highly interconnected brain stem nuclei and higher cortical regions that receive vagal afferents. Stimulation of the vagus nerve (direct and/or via taVNS) provides an intriguing option to investigate the effect of stimulation on this integrative process since it can stimulate the nucleus tractus solitarii, and the interconnected limbic and cortical areas that belong to the central autonomic network in both rodents and humans [10,11,12,13,14]. Chronic VNS has been shown to improve negative mood states in humans [2, 3, 70] and low RMSSD has been linked to negative emotional states, in line with our data. A recent rodent study elegantly demonstrated that reduced RMSSD due to prolonged stress resulted in alterations in the connection between the prefrontal cortex and amygdala, leading to anxious behavior similar to that seen in vagotomy, and chronic VNS restored brain signals and the anxious phenotype [51]. In our sample, however, no intervention effect was found for negative emotions after the short intervention period. This is not entirely surprising, as our study was designed to assess the acute mechanistic effects of VNS, and not the anti-depressant effects of VNS, which would require repeated exposure to stimulation. It is thus possible that long-term stimulation would yield different results. Future studies in humans need to shed more light on this process using prolonged exposure.

Differential effects of taVNS on inflammatory cytokines depending on RMSSD

The seminal discovery by Borovikova et al. [8] highlighted the relevance of the VN for regulating inflammatory responses through the cholinergic anti-inflammatory pathway. Since then, several studies, also in humans, have shown that low cardiac parasympathetic activity is associated with increased cytokines at baseline and after immunological challenges [71, 72]. Selective stimulation of the VN - both invasive and transcutaneous - suppresses circulating cytokines, but most of the significant effects were only applicable to subgroups and in conditions of acute immunological events [23]. Our results showing differential effects of taVNS depending on baseline RMSSD raise new questions: first, our results suggest that taVNS could have a modest anti-inflammatory potential if baseline RMSSD is low, whereas if the RMSSD is already high, taVNS can increase TNF-α. Second, we did not find a change in IL-6 after taVNS, which is in line with preclinical literature and suggests a rather specific effect of (ta)VNS on TNF-α. Analog to the findings in HRV, this might explain the rise in cytokines reported following taVNS in healthy human participants (e.g., [73, 74]), who were not recruited based on baseline RMSSD and might have had high RMSSD. Finally, for those in the low RMSSD group, stress perception increased rather than decreased - a finding that is rather counter-intuitive, since successful taVNS is thought to reduce stress. A possible explanation could be that taVNS may increase the awareness of bodily states (i.e., interoception) [75] including the subjective perception of stress, but further research is needed on this topic. We are intrigued whether similar findings can be found in studies using longer durations and different pathologies.

Inverse effects of taVNS with higher RMSSD? Clinical implications and call for stratification in existing cohorts using (ta)VNS

Although there is a general correlation between high RMSSD, favourable health outcomes and a well-regulated autonomic nervous system, taken together our results suggest that RMSSD beyond a certain threshold, triggers an alternative mechanism (potentially SNS rather than PNS activation) in response to taVNS. Our results align with prior research that documented decreased heart rate variability following (ta)VNS [58, 59] and increases in cytokines in healthy participants [73]. Most of these results involved youthful, healthy participants with probably high RMSSD at baseline, who may have surpassed a threshold with (ta)VNS. In the clinical context, vagal hyperactivity (which we don’t see in our participants) has been linked to bradycardia, syncopes, hypotension, and excessive gastric acid secretion [76,77,78]. Here, we used post-hoc stratification with a median split for RMSSD as an index of cardiac vagal activation, due to the sample size constraints, but the exploratory analyses using continuous RMSSD produced similar results. It is thus possible that a dose-response relationship is present, which should be explored in larger sample sizes.

We identified one small clinical trial which also used stratification by baseline RMSSD after a long-term intervention with invasive VNS in participants suffering from Crohn’s disease [79]. Intriguingly, the effect of VNS on heart rate variability was restorative in 4 of 5 participants with low RMSSD but decreased in the one person with high RMSSD at baseline. Others have reported that responders to VNS (as defined by more than 20% decrease in LF/HF), had significantly lower basal RMSSD than non-responders [63]. Although it is obvious, that VNS should be particularly beneficial if RMSSD is compromised, the majority of studies so far have not used RMSSD for stratification. From a clinical perspective, if RMSSD is indeed found to be a useful predictor of long-term treatment outcomes of taVNS in other studies, this could contribute strongly to a precision medicine approach beyond psychiatry, reducing waiting times for procedures and permitting clinicians a better prediction of success rates. This is particularly relevant given that between initiation of VNS and clinical response, there can be a time lapse of 3–12 months and cumulative response rates show response rates only for approximately half the population [80].

If HRV data is available, we therefore call to researchers for a secondary analysis of (ta)VNS datasets, which could offer more insight, particularly for studies assessing long-term effects and antidepressant response.

Methodological considerations

A distinctive aspect of our study lies in our choice to stimulate the right outer ear. This decision was informed by rodent studies, that demonstrated a closer link between the right VN and limbic areas critical for reward behaviour and emotional processing. Rodent investigations have demonstrated that both right and left VNS impact cardiac reactivity, while effects were more often observed on the right side [54, 81, 82]. Because of the greater innervation of the sinus-atrial node by the right VN, invasive and most non-invasive taVNS applications have avoided this stimulation side fearing bradycardia. Nevertheless, there is currently no data suggesting safety concerns with transcutaneous VNS administered to the right side or bilaterally [83] and increases in HRV through VNS have been demonstrated for both right [61, 62, 84] and left VNS [60, 63, 85].

In addition, it was shown that specific stimulation parameters elicit an increase rather than a decrease of cytokines [86] indicating that a systematic investigation of stimulation parameters is necessary in future studies to further explore this dynamic in humans. Finally, in our study, we used post-hoc stratification with a median split for RMSSD as an index of cardiac vagal activation. Our cut-off value of 39.14 ms situates well within population-based reference data for the mean age of our sample (i.e., 50th percentile for age group 30–40: 37–41 ms albeit for 24-h recordings [87], and age group 35–39: 39.9–46 ms [88]). While continuous analyses yielded similar results, cut-off values (per age category) may be an attractive option for clinical application. To establish such cut-offs, future, population-specific studies controlling for confounding variables are needed.

Limitations

While our study is highly novel, several limitations need to be mentioned. First, we used transcutaneous, not invasive VNS. We suggested the observed effects to be mediated through the activation of specific brain regions triggered by afferent fibre signaling. Although this signaling pathway lacks the precision of direct vagal fibre stimulation, recent rodent studies suggest it may achieve comparable outcomes [89]. We also did not measure respiration, but for some parameters, RMSSD is only reliably measured when the respiration rates are in the normal range (e.g., between 7.2–24 breaths/minute) [15], which could affect the results if breathing occurred outside of these ranges. Future studies should therefore measure breathing rates. In addition, while we conducted a power analysis for the primary aims of the study, the here reported analysis is conducted on secondary outcomes and may not be sufficiently powered to detect effects, particularly for the three-way-interaction. Therefore, a replication of results is needed. Finally, the effects we observed (notably for inflammation) are small and observed during acute stimulation. It remains to be seen whether taVNS or invasive VNS will also have this effect during longer stimulations and in clinical populations with higher levels of inflammation.

Conclusion and outlook

In conclusion, our findings indicate that baseline RMSSD matters, at least for the biological outcomes of taVNS. A compromised CPA (here approximated by RMSSD) is associated with impaired cardiac and inflammatory responses to acute stress exposure and can be improved by acute taVNS. However, participants with high RMSSD may react with the opposite pattern, which could lead to null findings, as we observed in our analysis using diagnostic groups (e.g., MDD vs. controls). We call to all researchers who measured an ECG at baseline in studies using (ta)VNS to perform a secondary analysis. If our findings are replicated, this may help to find a stratification marker for successful taVNS for MDD, and other diagnoses.