Chapter 6 Articulatory suppression effects on induced rumination

his study explores whether the speech motor system is involved in verbal rumination, a particular kind of inner speech. The motor simulation hypothesis considers inner speech as an action, accompanied by simulated speech percepts, that would as such involve the speech motor system. If so, we could expect verbal rumination to be disrupted by concurrent involvement of the speech apparatus. We recruited 106 healthy adults and measured their self-reported level of rumination before and after a rumination induction, as well as after five minutes of a subsequent motor task (either an articulatory suppression -silent mouthing- task or a finger tapping control task). We also evaluated to what extent ruminative thoughts were experienced with a verbal quality or in another modality (e.g., visual images, non-speech sounds). Self-reported levels of rumination showed a decrease after both motor activities (silent mouthing and finger-tapping), with only a slightly stronger decrease after the articulatory suppression than the control task. The rumination level decrease was not moderated by the modality of the ruminative thoughts. We discuss these results within the framework of verbal rumination as simulated speech and suggest alternative ways to test the engagement of the speech motor system in verbal rumination.⁴⁶

6.1 Introduction

A large part of our inner conscious experience involves verbal content, with internal monologues and conversations. Inner speech is considered as a major component of conscious experience and cognition (Hubbard, 2010; Hurlburt et al., 2013; Klinger & Cox, 1987). An important issue concerns the format and nature of inner speech and whether it is better described as a mere evocation of abstract amodal verbal representations (i.e., without articulatory or auditory sensation) or as a concrete motor simulation of actual speech production (for reviews, see Alderson-Day & Fernyhough, 2015; Lœvenbruck et al., 2018; Perrone-Bertolotti et al., 2014). In the first case, inner speech is seen as divorced from bodily experience, and includes, at most, faded auditory representations. In the second case, inner speech is considered as a physical process that unfolds over time, leading to an enactive re-creation of auditory percepts, via the simulation of articulatory actions. The latter hypothesis is interesting in the context of persistent negative and maladaptive forms of inner speech, such as rumination. If this hypothesis is correct, we could expect rumination –as a particular type of inner speech– to be disrupted by concurrent involvement of the speech muscles. The present study aims at testing this specific idea.

Introspective explorations of the characteristics of inner speech have led to different views on the relative importance of its auditory and articulatory components, and on the involvement of motor processes. It has been suggested successively that speech motor representations would be purely motoric (Stricker, 1880), that they would be expressed dominantly in an auditory format (Egger, 1881), or that they would consist in a mix of these in the overall population (Ballet, 1886). The intuitive distinction between auditory and motor phenomena is sometimes referred to in contemporary research by the terms of inner ear and inner voice, in line with Baddeley’s classic model of working memory (e.g., Baddeley et al., 1984; see also Buchsbaum, 2013). Baddeley’s model relies on a partnership between an inner ear (i.e., storage) and an inner voice (i.e., subvocal rehearsal), which can be assessed by selectively blocking either one of these components (e.g., Smith, Wilson, & Reisberg, 1995).

Empirical arguments supporting the crucial role of the inner voice in verbal working memory (subvocal articulatory rehearsal) can be found in studies using articulatory suppression, in which the action component (i.e., the inner voice) of inner speech is disrupted. Articulatory suppression usually refers to a task which requires participants to utter speech sounds (or to produce speech gestures without sound), so that this activity disrupts ongoing speech production processes. Articulatory suppression can be produced with different degrees of vocalisation, going from overt uttering to whispering, mouthing (i.e., silent articulation), and simple clamping of the speech articulators. Many studies have shown that articulatory suppression can be used to disrupt the subvocal rehearsal mechanism of verbal working memory and –as a consequence– impair the recall of verbal material (e.g., Baddeley et al., 1984; Larsen & Baddeley, 2003).

Based on the study of errors accompanying the covert production of tongue twisters, inner speech has also been suggested to be impoverished (as compared to overt speech) and to lack a full specification of articulatory features (e.g., Oppenheim & Dell, 2008, 2010). More precisely, these studies have shown the phonemic similarity effect (the tendency, in overt speech, to exchange phonemes with similar articulatory features) to be absent in inner speech. In contrast to these results, however, Corley et al. (2011) found the phonemic similarity effect to be present in inner speech, suggesting that inner speech may not necessarily be impoverished at the articulatory level.

In a study aiming at investigating the role of covert enactment in auditory imagery (defined as imagined speech, produced by oneself or another individual), Reisberg, Smith, Baxter, & Sonenshine (1989) observed that the verbal transformation effect (Warren & Gregory, 1958), namely the alteration of speech percepts when certain speech sounds are uttered in a repetitive way, also occurred during inner speech (although the verbal transformation effect was smaller than during overt speech), but was suppressed by concurrent articulation (e.g., chewing) or clamping the articulators. The fact that the verbal transformation effect was observed during inner speech and that it was reduced by concurrent chewing, even in inner speech, speaks in favour of the view of inner speech as an enacted simulation of overt speech.

Another piece of evidence for the effect of articulatory suppression on inner speech comes from a recent study by Topolinski & Strack (2009) on the mere exposure effect, namely the fact that repeated exposure to a stimulus influences the evaluation of this stimulus in a positive way (Zajonc, 1968). Topolinski and Strack’s study showed that the mere exposure effect for visually presented verbal material could be completely suppressed by blocking subvocal rehearsal (i.e., inner speech) when asking participants to chew a gum. The effect was preserved, however, when participants kneaded a soft ball with their hand (Topolinski & Strack, 2009). This finding suggests that blocking speech motor simulation interfered with the inner rehearsal of the visually presented verbal stimuli, thereby destroying the positive exposure effect. It provides additional experimental support to the view that inner speech involves a motor component.

The occurrence of motor simulation during inner speech is further backed by several studies using physiological measures to evaluate inner speech production properties. Using electrodes inserted in the tongue tip or lips of five participants, Jacobson (1931) was able to detect electromyographic (EMG) activity during several tasks requiring inner speech. Similarly, Sokolov (1972) recorded intense lip and tongue muscle activation when participants had to perform complex tasks that necessitated substantial inner speech production (e.g., problem solving). Another study using surface electromyography (sEMG) demonstrated an increase in activity of the lip muscles during silent recitation tasks compared to rest, but no increase during the non-linguistic visualisation task (Livesay et al., 1996). An increase in the lip and forehead muscular activity has also been observed during induced rumination (Nalborczyk et al., 2017). Furthermore, this last study also suggested that speech-related muscle relaxation was slightly more efficient in reducing subjective levels of rumination than non speech-related muscle relaxation, suggesting that relaxing or inhibiting the speech muscles could disrupt rumination.

Rumination is a “class of conscious thoughts that revolve around a common instrumental theme and that recur in the absence of immediate environmental demands requiring the thoughts” (Martin & Tesser, 1996). Despite the fact that depressed patients report positive metacognitive beliefs about ruminating, which is often seen as a coping strategy in order to regulate mood (e.g., Papageorgiou & Wells, 2001), rumination is known to significantly worsen mood (e.g., Moberly & Watkins, 2008; Nolen-Hoeksema & Morrow, 1993), impair cognitive flexibility (e.g., Davis & Nolen-Hoeksema, 2000; Lyubomirsky et al., 1998), and to lead toward pronounced social exclusion and more interpersonal distress (Lam, Schuck, Smith, Farmer, & Checkley, 2003). Although partly visual, rumination is a predominantly verbal process (Goldwin & Behar, 2012; McLaughlin et al., 2007) and can therefore be considered as a maladaptive type of inner speech. In a study on worry, another form of repetitive negative thinking, Rapee (1993) observed a tendency for articulatory suppression, but not for visuo-spatial tasks, to produce some interference with worrying. He concluded that worry involves the phonological aspect of the central executive of working memory. We further add that, since repeating a word seems to reduce the ability to worry, this study suggests that articulatory aspects are at play during worry.

In this context, the question we addressed in this study is whether verbal rumination consists of purely abstract verbal representations or whether it is better described as a motor simulation of speech production, engaging the speech apparatus. If the latter hypothesis is correct, rumination experienced in verbal form (in contrast to other forms, such as pictoral representations) should be disrupted by mouthing (i.e., silent articulation), and should not be disrupted by a control task that does not involve speech muscles (e.g., finger-tapping). Specifically, we thus sought to test the hypotheses that rumination could be disrupted by articulatory suppression (but not by finger-tapping), and that this disruption would be more pronounced when rumination is experienced in a verbal form than in a non-verbal form.

6.2 Methods

In the Methods and Data analysis sections, we report how we determined our sample size, all data exclusions, all manipulations, and all measures in the study (Simmons et al., 2012). A pre-registered version of our protocol can be found on OSF: https://osf.io/3bh67/.

6.2.1 Sample

We originally planned for 128 participants to take part in the study. This sample size was set on the basis of results obtained by Topolinski & Strack (2009), who observed an effect size around $\eta_{p}^{2}=.06$. We expected a similar effect size for the current rumination disruption, since rumination can be conceived of as a subtype of inner speech.⁴⁷

As we anticipated drop-out of participants due to our inclusion criteria (see below), a total of 184 undergraduate students in psychology from Univ. Grenoble Alpes took part in this experiment, in exchange for course credits. They were recruited via mailing list, online student groups, and posters. Each participant provided a written consent and this study was approved by the local ethics committee (CERNI N° 2016-05-31-9). To be eligible, participants had to be between 18 and 35 years of age, with no self-reported history of motor, neurological, psychiatric, or speech-development disorders. All participants spoke French as their mother tongue. After each participant gave their written consent, they completed the Center for Epidemiologic Studies - Depression scale (CES-D; Radloff, 1977). The CES-D is a 12-item questionnaire, validated in French (Morin et al., 2011), aiming to assess the level of depressive symptoms in a subclinical population. Participants exceeding the threshold of clinical depressive symptoms (i.e., >23 for females and >17 for males; Radloff, 1977) were not included in the study for ethical reasons (N = 26). These participants were then fully debriefed about the aims of the experiment and were given the necessary information concerning available psychological care on campus.

To investigate articulatory suppression effects in the context of rumination, a successful induction of rumination is a prerequisite. Therefore, analyses were only conducted on participants who showed an effect of the rumination induction (i.e., strictly speaking, participants who reported more rumination after the induction than before). We thus discarded participants who did not show any increase in rumination level (N = 52, 32.91% of total sample). The final sample comprised 106 participants (Mean age = 20.3, SD = 2.57, Min-Max = 18-31, 96 females).

6.2.2 Material

The experiment was programmed with OpenSesame software (Mathôt et al., 2012) and stimuli were displayed on a DELL latitude E6500 computer screen.

6.2.2.1 Questionaires

To control for confounding variables likely to be related to the intensity of the induction procedure, we administered the French version of the Positive and Negative Affect Schedule (PANAS; Watson, Clark, & Tellegen, 1988), adapted to French by Gaudreau, Sanchez, & Blondin (2006). This questionnaire includes 20 items, from which we can compute an overall index of both positive (by summing the scores on 10 positive items, thereafter PANASpos) and negative affect (PANASneg) at baseline. This questionnaire was administered at baseline. In order to evaluate trait rumination, at the end of the experiment participants completed the short version of the Ruminative Response Scale (RRS-R, Treynor et al., 2003), validated in French (Douilliez, Guimpel, Baeyens, & Philippot, in preparation). From this questionnaire, scores on two dimensions were analysed (RRSbrooding and RRSreflection).

6.2.2.2 Measures

Measures of state rumination were recorded using a Visual Analogue Scale (VAS) previously used in Nalborczyk et al. (2017). This scale measured the degree of agreement with the sentence “At this moment, I am brooding on negative things” (translated from French), on a continuum between “Not at all” and “A lot” (afterwards coded between 0 and 100). This scale is subsequently referred to as the RUM scale. It was used three times in the experiment, at baseline (after training but before the experiment started), after rumination induction, and after a motor task.

Additionally, participants answered questions about the modality of the thoughts that occurred while performing the motor task. This last questionnaire consisted of one question evaluating the occurrence frequency of different modalities of inner thoughts (e.g., visual imagery, verbal thoughts, music). Then, a verbal/non-verbal ratio (i.e., the score on the verbal item divided by the mean of the score on the non-verbal items) was computed and used in the analyses, hereafter referred to as the Verbality continuous predictor (this scale is available online: https://osf.io/3bh67/).⁴⁸

6.2.2.3 Tasks

In the first part of the experiment, ruminative thoughts were induced using a classical induction procedure (Nolen-Hoeksema & Morrow, 1993). Then a motor task was executed. Participants were randomly allocated to one of two conditions. In the Mouthing condition, the task consisted of repetitively making mouth opening-closing movements at a comfortable pace. This condition was selected as it is commonly used in articulatory suppression studies. As a control, a finger-tapping condition was used (the Tapping condition), that consisted of tapping on the desk with the index finger of the dominant hand at a comfortable pace.

Although finger-tapping tasks are generally considered as good control conditions when using speech motor tasks, since they are comparable in terms of general attentional demands, it may be that orofacial gestures are intrinsically more complex than manual gestures (i.e., more costly, Emerson & Miyake, 2003). To discard the possibility that orofacial gestures (related to the Mouthing condition) would be cognitively more demanding than manual ones (related to the Tapping condition), we designed a pre-test experiment in order to compare the two interference motor tasks used in the main experiment. Results of this control experiment showed no difference on reaction times during a visual search task between the two interference tasks (i.e., mouthing and finger-tapping). Full details are provided in Appendix B.

6.2.3 Procedure

The experiment took place individually in a quiet and dimmed room. The total duration of the session ranged between 35min and 40min. Before starting the experiment, participants were asked to perform the motor task during 1 min, while following a dot moving at a random pace on the screen in front of them. This task was designed to train the participants to perform the motor task adequately. Following this training and after describing the experiment, the experimenter left the room and each participant had to fill-in a baseline questionnaire (adaptation of PANAS, see above) presented on the computer screen. Baseline state rumination was then evaluated using the RUM scale. The whole experiment was video-monitored using a Sony HDR-CX240E video camera, in order to check that the participants effectively completed the task.

6.2.3.1 Rumination induction

Rumination induction consisted of two steps. The first step consisted of inducing a negative mood in order to enhance the effects of the subsequent rumination induction. Participants were asked to recall a significant personal failure experienced in the past five years. Then, participants were invited to evaluate the extent to which this memory was “intense for them” on a VAS between “Not at all” and “A lot”, afterwards coded between 0 and 100, and referred to as Vividness.

The second step consisted of the rumination induction proper. We used a French translation of Nolen-Hoeksema & Morrow (1993)’s rumination induction procedure. Participants had to read a list of 44 sentences related to the meaning, the causes and the consequences of their current affective or physiological state. Each phrase was presented on a computer screen for 10 seconds and the total duration of this step was 7 minutes and 20 seconds. State rumination was then evaluated again using the same VAS as the one used at baseline (RUM).

6.2.3.2 Motor task

After the rumination induction, participants were asked to continue to think about “the meaning, causes, and consequences” of their feelings while either repetitively making mouth movements (for participants allocated in the “Mouthing” condition) or finger-tapping with the dominant hand for five minutes (for participants allocated in the “Tapping” condition). Afterwards, state rumination was again evaluated using the RUM scale.

In order to evaluate trait rumination, participants completed the short version of the RRS (see above). Then, they filled in the questionnaire on the modality of the thoughts that occurred while performing the motor task (see above). Figure 6.1 summarises the full procedure.

Figure 6.1: Timeline of the experiment, from top to bottom.

6.2.4 Data analysis

Statistical analyses were conducted using R version 3.5.3 (R Core Team, 2018) and are reported with the papaja (Aust & Barth, 2018) and knitr (Xie, 2018) packages.

6.2.4.1 Rumination induction

We centred and standardised each predictor in order to facilitate the interpretation of parameters. To assess the effects of the rumination induction on self-reported state rumination, data were then analysed using Induction (2 modalities, before and after induction, contrast-coded) as a within-subject categorical predictor and RUM as a dependent variable in a Bayesian multilevel linear model (BMLM), using the brms package (Bürkner, 2018 b).⁴⁹ This model was compared with more complex models including effects of control variables, including baseline affect state (PANAS scores), trait rumination (RRS scores), the vividness of the memory chosen during the induction (Vividness score), or the degree of verbality of the ruminative thoughts (Verbality index).

Models were compared using the Widely Applicable Information Criterion (WAIC; Watanabe, 2010) –a generalisation of the Akaike information criterion (Akaike, 1974)– and evidence ratios (Burnham & Anderson, 2002; Burnham, Anderson, & Huyvaert, 2011; Hegyi & Garamszegi, 2011). The WAIC provides a relative measure of predictive accuracy of the models (the WAIC is an approximation of the out-of-sample deviance of a model) and balances underfitting and overfitting by sanctioning models for their number of parameters. Evidence ratios (ERs) were computed as the ratios of weights: $ER_{ij} = \dfrac{w_{i}}{w_{j}}$, where $w_{i}$ and $w_{j}$ are the Akaike weights of models $i$ and $j$, respectively. These weights can be interpreted as the probability of the model being the best model in terms of out-of-sample prediction (Burnham & Anderson, 2002). Whereas the use of WAIC is appropriate for model comparison and selection, it tells us nothing about the absolute fit of the model. To estimate this fit, we computed the Bayesian $R^2$ for MLMs using the bayes_R2() method in the brms package (Bürkner, 2018 b).

Models were fitted using weakly informative priors (see the supplementary materials for code details). Two Markov Chain Monte-Carlo (MCMC) were ran for each model to approximate the posterior distribution, including each 5.000 iterations and a warmup of 2.000 iterations. Posterior convergence was assessed examining trace plots as well as the Gelman-Rubin statistic $\hat{R}$. Constant effect estimates were summarised via their posterior mean and 95% credible interval (CrI), where a credible interval can be considered as the Bayesian analogue of a classical confidence interval. When applicable, we also report Bayes factors (BFs), computed using the Savage-Dickey method, which consists in taking the ratio of the posterior density at the point of interest divided by the prior density at that point. These BFs can be interpreted as an updating factor, from prior knowledge (what we knew before seeing the data) to posterior knowledge (what we know after seeing the data).

6.2.4.2 Articulatory suppression effects

To assess the effects of articulatory suppression on self-reported state rumination, data were analysed in the same fashion as in the first part of the experiment, using Session (2 modalities, before and after motor activity, contrast-coded) as a within-subject categorical predictor, and Condition (2 modalities, Mouthing and Tapping) as a between-subject categorical predictor and RUM as a dependent variable. Moreover, effects of baseline affect state (PANAS and RRS scores), the vividness of the rumination induction memory and the verbality index were assessed by comparing models with and without these additional predictors.

6.3 Results

The results section follows the data analysis workflow. More precisely, for each part of the experiment (i.e., first the analysis of the induction effects and then, the analysis of the impact of mouthing vs. finger-tapping), we first present the results of the model comparison stage in which we compare different models of increasing complexity. Subsequently, we report the estimates of the best model (the model with the lowest WAIC) and base our conclusions on this model.

Recall that, to assess rumination induction, the dependent variable is RUM, the main categorical predictor is Induction and additional continuous predictors are PANAS, RRS, Vividness, and Verbality. To assess articulatory suppression effects, the dependent variable is RUM, the main categorical predictors are Session (within-subject) and Condition (between-subject), and additional continuous predictors are PANAS, RRS and Vividness. Summary statistics (mean and standard deviation) for all these variables can be found in Table 6.1.

Table 6.1: Descriptive statistics (mean and standard deviation) of each recorded variable, for the final sample of participants that were included in the study.

Variables	Baseline	Post-induction	Post-motor	Baseline	Post-induction	Post-motor
RUM	28.5 (26.49)	54.66 (25.16)	45.47 (27.25)	20.96 (21.82)	46.77 (25.74)	43.54 (29.57)
Age	20.3 (2.65)	-	-	20.31 (2.53)	-	-
PANASneg	15.65 (5.67)	-	-	15.46 (5.08)	-	-
PANASpos	30.91 (4.48)	-	-	31.25 (4.4)	-	-
RRSbrooding	12.2 (2.43)	-	-	12.06 (2.62)	-	-
RRSreflection	12.22 (3.22)	-	-	11.71 (3.26)	-	-
Verbality	1.67 (1.18)	-	-	1.67 (1.26)	-	-
Vividness	54.17 (28.94)	-	-	59.78 (24.63)	-	-

Figure 6.2 shows the overall evolution of the mean RUM scores (i.e., self-reported state rumination) through the experiment according to each Session (Baseline, Post-induction, Post-motor) and Condition (Mouthing, Tapping). As displayed in this figure, an important inter-individual variability was observed in all conditions. After the rumination induction, RUM score increased in both groups, and decreased after the motor task, with a stronger decrease in the Mouthing condition.

$Mean RUM score by Session and Condition, along with violin plots and individual data. Error bars represent 95\% confidence intervals.$

Figure 6.2: Mean RUM score by Session and Condition, along with violin plots and individual data. Error bars represent 95% confidence intervals.

6.3.1 Correlation matrix between continuous predictors

To prevent multicollinearity, we estimated the correlation between each pair of continuous predictors. Figure 6.3 displays these correlations along with the marginal distribution of each variable. The absence of strong correlations ($r > 0.8$) between any of these variables suggests that they can each be included as control variables in the following statistical models.

$Diagonal: marginal distribution of each variable. Panels above the diagonal: Pearson's correlations between main continuous predictors, along with 95\% CIs. The absolute size of the correlation coefficient is represented by the size of the text (lower coefficients appear as smaller). Panels below the diagonal: scatterplot of each variables pair.$

Figure 6.3: Diagonal: marginal distribution of each variable. Panels above the diagonal: Pearson’s correlations between main continuous predictors, along with 95% CIs. The absolute size of the correlation coefficient is represented by the size of the text (lower coefficients appear as smaller). Panels below the diagonal: scatterplot of each variables pair.

6.3.2 Rumination induction

To examine the efficiency of the induction procedure (i.e., the effect of Induction) while controlling for the other variables (i.e., Vividness, RRSbrooding, RRSreflection, PANASpos, and PANASneg),⁵⁰ we then compared the parsimony of models containing main constant effects and a varying intercept for Participant. Model comparison showed that the best model (i.e., the model with the lowest WAIC) was the model including Induction, PANASpos, PANASneg, RRSbooding, and an interaction term between Induction and Vividness as predictors (see Table 6.2). Fit of the best model was moderate ($R^2$ = 0.667, 95% CrI [0.574, 0.736]).

Table 6.2: Comparison of models, ordered by WAIC. The best model has the lowest WAIC.

	$WAIC$	$pWAIC$	$\Delta_{WAIC}$	$Weight$
$Int+Ind+PANASpos+PANASneg+Ind:Viv+RRSbro$	1857.01	61.35	0.00	0.350
$Int+Ind+PANASpos+PANASneg+Ind:Viv+RRSbro+RRSref$	1857.37	61.13	0.35	0.293
$Int+Ind+PANASpos+PANASneg+Ind:Viv+RRSref$	1858.01	61.40	0.99	0.213
$Int+Ind+PANASneg+Ind:Viv$	1859.84	63.54	2.83	0.085
$Int+Ind+PANASpos+Ind:Viv$	1862.42	64.51	5.41	0.023
$Int+Ind+PANASneg$	1863.08	66.34	6.07	0.017
$Int+Ind+PANASpos$	1863.98	62.28	6.96	0.011
$Int+Ind+Ind:Viv$	1865.41	63.04	8.40	0.005
$Int+Ind$	1867.09	65.06	10.08	0.002

Note. $pWAIC$ is the number of (effective) parameters in the model. $Int$ = Intercept, $Ind$ = Induction, $Viv$ = Vividness, $RRSbro$ = RRSbrooding, $RRSref$ = RRSreflection. All models include a varying intercept for Participant.

Constant effect estimates for the best model are reported in Table 6.3. Based on these values, it seems that Induction (i.e., the effects of the rumination induction) increased RUM scores by approximately 24.73 points on average ($d_{av} =$ 1.037, 95% CI [0.748, 1.325]). The main positive effect of PANASneg and the main negative effects of PANASpos indicate, respectively, that negative baseline mood was associated with higher levels of rumination while positive baseline mood was associated with lower levels of self-reported rumination.

Table 6.3: Coefficient estimates, standard errors (SE), 95% CrI (Lower, Upper), Rhat and Bayes factor (BF10) for the best model.

Term	Estimate	SE	Lower	Upper	Rhat	BF10
Intercept	37.799	1.893	33.951	41.551	1.00	8.095*10{}16
Induction	24.734	2.235	20.543	29.092	1.00	-3.19*10{}18
PANASpos	-7.058	1.862	-10.757	-3.438	1.00	131.6
PANASneg	7.313	1.878	3.683	11.019	1.00	773.9
RRSreflection	2.276	1.913	-1.341	6.174	1.00	0.381
Induction:Vividness	4.113	2.118	-0.028	8.404	1.00	1.397

Higher scores on Vividness were associated with higher increase in self-reported rumination after induction, as revealed by the positive coefficient of the interaction term. This suggests that participants who recalled a more vivid negative memory tended to show a higher increase in rumination after the induction procedure than participants with a less vivid memory.

6.3.3 Articulatory suppression effects on induced rumination

In order to examine the effect of the two motor tasks (articulatory suppression and finger-tapping, Condition variable) on RUM while controlling for other variables (i.e., Vividness, RRSbrooding, RRSreflection, Verbality, PANASpos, and PANASneg), we compared the parsimony of several models, with or without these variables or their interaction. Given the group differences on RUM score at baseline (i.e., after training), we also included this score as a control variable in the models, as the RUMb variable ($M_{mouthing}$ = 28.5, $M_{tapping}$ = 20.96). Based on our hypotheses, we examined the three-way interaction between Session, Condition and Verbality. More precisely, we expected that greater amounts of verbal thoughts would be associated with a greater difference in the effects of the motor task on self-reported state rumination (i.e., RUM) with respect to the group (i.e., mouthing vs. finger-tapping). Model comparison showed that the best model was the model including Session, Cond, an interaction term between Session and Condition, RUMb, PANASneg, RRSbrooding and RRSreflection as predictors (cf. Table 6.5). Absolute fit of the best model was moderate ($R^2$ = 0.654, 95% CrI [0.558, 0.724]). Therefore, contrary to our hypothesis, the best model did not include the three-way interaction between Session, Condition and Verbality as a constant effect. It did include an interaction between Session and Condition, however.

Table 6.4: Comparison of models, ordered by WAIC. The best model has the lowest WAIC.

	$WAIC$	$pWAIC$	$\Delta_{WAIC}$	$Weight$
$Session+Cond+Session:Cond+RUMb+PANASn+RRSb+RRSr$	1857.78	64.22	0.00	0.403
$Session+Cond+RUMb+PANASn+RRSb+RRSr$	1858.70	63.98	0.92	0.254
$Session+Cond+Session:Cond+Session:Cond:Verb+RUMb+PANASp+RRSb+RRSr$	1859.10	63.52	1.32	0.208
$Session+Cond+Session:Cond+Session:Cond:Verb+RUMb+PANASn+RRSb+RRSr$	1860.61	64.81	2.83	0.098
$Session+Cond+Session:Cond$	1863.63	69.18	5.85	0.022
$Session+Cond$	1866.36	69.17	8.58	0.006
$Session$	1866.40	69.70	8.62	0.005
$Session+Cond+Session:Cond:Verb$	1866.64	69.22	8.86	0.005
$Null\ model$	1876.91	67.11	19.13	0.000

Note. $K$ is the number of estimated parameters in the model. $Int$ = Intercept, $Cond$ = Condition, $RUMb$ = RUM baseline score, $Verb$ = Verbality, $RRSb$ = RRSbrooding, $RRSr$ = RRSreflection. All models include a constant intercept and a varying intercept for Participant.

Parameter values of the best model for the second part of the experiment are reported in Table 6.5. Based on these values, it seems that self-reported rumination decreased after both motor tasks (the coefficient for Session is negative), but this decrease was substantially larger in the Mouthing condition ($d_{av} =$ -0.351, 95% CI [-0.735, 0.034]) than in the Tapping condition ($d_{av} =$ -0.117, 95% CI [-0.506, 0.273]), as can be read from the coefficient of the interaction term between Session and Condition (Est = 4.978, SE = 3.929, 95% CrI [-2.753, 12.926]). Importantly, the large uncertainty associated with this result (as expressed by the width of the confidence interval) warrants a careful interpretation of this result, that should be considered as suggestive evidence, rather than conclusive evidence.

However, the Bayesian framework provides tools that permit richer inference. First, we can compare the relative weight of the best model (the model with the lowest WAIC) with a similar model without the interaction term (the second model in Table 6.4). This reveals that the model including an interaction term between Session and Condition is 1.5845591 more credible than the model without the interaction term, which can be considered as weak but meaningful evidence (Burnham & Anderson, 2002).

Second, we can look at the BF for this particular parameter. As can be seen from Table 6.5, the BF for the interaction term is equal to 1.12, which is evidence for neither the presence or the absence of effect. However, this BF is computed using the Savage-Dickey method⁵¹ and as such is extremely sensitive to the prior choice. Thus, other priors (for instance a prior that is more peaked on zero) would provide stronger evidence for the interaction effect.

Third, and more interestingly, we can also directly look at the posterior distribution of the parameter of interest (the interaction term). Figure 6.4 shows this posterior distribution, its mean and 95% credible interval, as well as the proportion of the distribution which is above 0. This reveals that although the 95% credible interval largely encompasses 0, there is a 0.898 probability that the interaction between Session and Condition is positive (given the data and the priors). This suggests that the decrease in RUM score was indeed larger in the mouthing than in the tapping group.

$Posterior distribution of the interaction parameter between Session (before vs. after the motor task) and Condition (mouthing vs. finger-tapping). The mean and the 95\% credible interval are displayed at the top and the bottom of the histogram. The green text indicates the proportion of the distribution that is either below or above zero.$

Figure 6.4: Posterior distribution of the interaction parameter between Session (before vs. after the motor task) and Condition (mouthing vs. finger-tapping). The mean and the 95% credible interval are displayed at the top and the bottom of the histogram. The green text indicates the proportion of the distribution that is either below or above zero.

Table 6.5: Coefficient estimates, standard errors (SE), 95% CrI (Lower, Upper), Rhat and Bayes factor (BF10) for the best model.

Term	Estimate	SE	Lower	Upper	Rhat	BF10
Intercept	47.673	1.973	43.938	51.742	1.000	2.159*10{}17
Session	-5.900	2.141	-10.083	-1.634	1.000	9.522
Condition	-0.968	3.662	-7.923	6.707	1.000	0.395
RUMbaseline	12.807	2.164	8.652	17.134	1.000	1.634*10{}21
RRSbrooding	2.300	1.995	-1.964	6.239	1.001	0.417
RRSreflection	-1.870	1.988	-5.932	1.975	1.001	0.327
PANASneg	0.615	2.244	-3.949	4.951	1.000	0.23
Session:Condition	4.978	3.929	-2.753	12.926	1.000	0.895

The large variation between participants can be appreciated by computing the intra-class correlation (ICC), expressed as $\sigma_{intercept}^{2}/(\sigma_{intercept}^{2}+\sigma_{residuals}^{2})$. For the best model, the ICC is equal to 0.365 (95% CrI [0.123, 0.547]), indicating that 36.5% of the variance in the outcome that remains after accounting for the effect of the predictors, is attributable to systematic inter-individual differences.

Figure 6.5 shows the effects of Verbality on the relative change (i.e., after - before) in self-reported rumination after both motor activities (i.e., Mouthing and Tapping). As Verbality was centred before analysis, its score cannot be interpreted in absolute terms. However, a high score on this index indicates more verbal than non-verbal (e.g., visual images, non-speech sounds) thoughts, whereas a low score indicates more non-verbal than verbal thoughts. Contrary to our predictions but consistent with the model comparison, this figure depicts a similar relationship between Verbality and the change in RUM score (between before and after the motor task), according to the Condition. In the Mouthing condition, the change in RUM score did decrease for participants with a higher self-reported degree of verbal content. This suggests that the more verbal the rumination is, the more it is affected by mouthing interference. But contrary to our expectation, a similar trend (although perhaps weaker) was observed in the Tapping condition. This suggests that the more verbal the rumination is, the more it is affected by any motor task.

Mean RUM relative change after motor activity, as a function of the degree of Verbality, in the mouthing (the green dots and regression line) and finger tapping (the orange dots and regression line) conditions.

Figure 6.5: Mean RUM relative change after motor activity, as a function of the degree of Verbality, in the mouthing (the green dots and regression line) and finger tapping (the orange dots and regression line) conditions.

6.4 Discussion

The purpose of the current study was to investigate the effects of articulatory suppression on induced verbal rumination. We predicted that if verbal rumination, which can be construed as a type of inner speech, does involve the mental simulation of overt speech production, its generation should be disrupted by articulatory suppression, but not by finger tapping. This prediction was not strictly corroborated by the data, as we observed a decrease of self-reported rumination after both types of motor activities (see Figure 6.2 and Table 6.4), with a somewhat stronger decrease in the Mouthing condition. In the following, we examine the validity of our methods and discuss interpretations of our results. Finally, we formulate how subsequent research should address this kind of question and suggest alternative ways to test the above mentioned hypothesis. We begin by discussing the results of the rumination induction procedure.

6.4.1 Rumination induction

It is noteworthy that 32.91% of the total sample of participants who were recruited did not respond to this induction, and were therefore not included in the analyses. Moreover, as reported in Table 6.3, it seems that the Vividness of the memory chosen by the participant during the mood induction was moderating the effect of the rumination induction. In other words, the more vivid (i.e., the more “intense”) the memory, the more successful the rumination induction was. This highlights the fact that this aspect should be carefully controlled each time a mood induction is used in order to foster subsequent repetitive negative thinking.

Moreover, we observed a group difference of approximately 7.5 points in the average RUM score at baseline. This difference might be explained by motor training, which took place before baseline measurement of state rumination. During this training, participants had to perform the motor task (either finger-tapping or mouthing) in front of a black screen on which a white dot was moving randomly for 1 min. During the task, the experimenter stayed in the room to check that participants were performing the motor task adequately. Being an unusual and potentially embarrassing motor activity, mouthing might have been a higher source of stress for the participants, as compared to the more common activity of finger-tapping. This group difference in baseline state rumination subsisted after the induction, as the group difference after the induction was of approximately 8 points (see summary statistics in Table 6.1 and full dataset in the supplementary materials).

6.4.2 Articulatory suppression effects

In the following section, we discuss in more depth the results of the second part of the study, which aimed at comparing the effects of articulatory suppression and finger-tapping on self-reported rumination.

First, it is important to examine whether the weakness of the effect of the interaction we had predicted between session and condition could come from a lack of statistical power. We planned 128 participants in order to reach a power of .80 for a targeted effect size of $\eta_{p}^{2}=.06$. As explained above, out of the 184 recruited participants, only 106 could be included in the study. With 106 participants, the a priori power for detecting an effect size of $\eta_{p}^{2}=.06$ was approximately of .70, which is much higher than the median power in typical psychological studies.

Second, it is important to acknowledge that despite the weakness of the difference between the two conditions in their influence on the level of self-reported rumination (i.e., RUM), both activities did lead, on average, to a decrease in self-reported rumination of approximately 6 points on the VAS (as indicated by the slope for Session in Table 6.5). This decrease might be interpreted in two ways. First, it might be explained by the simple exposition to the VAS and by compliance effects. When asked to rate their level of rumination again after five minutes of motor activity, some participants might be prompted to indicate a lower level of rumination than before the motor task. But compliance effects could similarly lead participants to consider the motor task as irritating, and therefore as prone to rumination increase. Some participants could therefore also be biased towards indicating a higher level of rumination after the motor task. Second, it might be considered that this decrease reflects a genuine decrease in rumination. In the following, we adopt the latter perspective and discuss explanations for the weak difference between the two conditions.

6.4.2.1 Effect of the rumination quality (verbality)

Our prediction was that rumination in verbal form would be more disrupted by mouthing than rumination in non-verbal form, while both kinds of rumination would not be disrupted (or similarly disrupted) by finger-tapping. In other words, we hypothesised a three-way interaction, between the effect of time (i.e., Session), Condition, and Verbality. In the following, we discuss the absence of this interaction. Then, we focus on the weak difference between the two conditions (omitting Verbality), and discuss some explanations for this weak difference.

First, the absence of the three-way interaction might come from a difficulty for the participants to have clear introspective access to the ruminative thoughts they experienced during the motor task. For instance, we know that introspective description of inner speech differs considerably, between people trained to regularly report on their episodes of inner speaking, and people without such training (e.g., Hurlburt et al., 2013). Moreover, as the Verbality questionnaire was presented at the end of the experiment, one cannot exclude that it was partly contaminated by recall, which, when done verbally, has been shown to artificially increase the subjective verbality index (Hurlburt, 2011).

6.4.2.2 Difference between motor conditions

Leaving the self-reported quality of rumination aside, we now turn to a discussion of the weak difference between the two motor conditions. We think this result can be explained in at least two non-exclusive ways. First, we could argue that the decrease observed in both conditions was due to an unexpected effect of finger-tapping on rumination. Second, we could argue that the effect of the articulatory suppression was somehow weaker than expected. In the following, we provide arguments and explanations for each of these possibilities.

Steady finger-tapping is usually considered as a relevant control condition for evaluating articulatory suppression, since it specifically recruits the hand motor system and should not interfere with the oral motor system, while being comparable in terms of general attentional demands (e.g., Gruber, 2001; Logie & Baddeley, 1987). However, using more complex rhythmic patterns of finger-tapping, Saito (1994) observed a fade-out of the phonological similarity effect in a verbal memory task with spoken recall, when subjects were asked to tap with either their right (dominant) or left hand, while the phonological similarity effect was conserved in the control condition (no tapping). The author concluded that a complex rhythmic tapping task can interfere with the running of speech motor programs (Saito, 1994, p. 185). More specifically, he suggested that complex, non-automatised, rhythmic finger tapping could use speech motor programs, which are useful to control speech prosody and rhythmic activity. We further suggest that a novel complex rhythmic task might require silent verbalisation and, therefore, might itself be an articulatory suppression task. In line with these findings, another study showed that for right-handed subjects, tapping with a finger of the right hand is more effective at interfering with performance of a verbal memory task than is tapping with a finger of the left hand (Friedman, Polson, & Dafoe, 1988). Although Friedman et al.’s findings are difficult to interpret, because task priority was manipulated and this may have led to conflict resolution, which might have been dealt with differentially according to the hand involved, they do suggest that a finger tapping task is not always the best control for articulatory suppression. This might explain the decrease of self-reported rumination observed in our own study, after the finger-tapping, and suggests that we might observe different results by asking participants to tap with the finger of their non-dominant hand. We think it is important to note for future studies that our results, together with those of Saito (1994) and Friedman et al. (1988), suggest that finger-tapping could in fact interfere with inner speech. In other words, finger-tapping, with the dominant hand, is probably not an appropriate control condition when studying articulatory suppression.

An alternative way to explain the absence of differences between the two motor conditions is to suppose that the effects of the articulatory suppression were weaker than we expected. The rhythmic mouthing task might have become too automatised to disrupt inner speech programming. This idea finds some support in the results of Saito (1997), who observed an effect of articulatory suppression on the phonological similarity effect in a memory task only when the articulatory suppression was intermittent (i.e., “ah, ah, ah…”) but no effect when participants had to utter a continuous “ah–”. This can be explained by considering that the intermittent articulatory suppression would impose a greater load on speech motor programming than the continuous articulatory suppression (Saito, 1997, p. 569). In a similar vein, Macken & Jones (1995) found stronger effects of articulatory suppression when participants were asked to repeat a sequence of different letters than when they were asked to repeat a single letter. One way to examine this hypothesis within our own protocol would be to ask participants to make sequences of various mouth movements, rather than repeating a single movement. Alternatively, the relatively weak effects of articulatory suppression on rumination may also be explained by the specific time course of our experimental design. Indeed, the articulatory suppression was performed after participants went through the entire rumination induction procedure (i.e., after reading all the rumination induction prompts). We speculate that the effects of the articulatory suppression might have been stronger if it was performed during the rumination (e.g., between each prompt) instead.

In a broader perspective, relating to the original research question, we should mention two additional interpretations of our results. So far, we considered different ways to explain either how the finger-tapping task could interfere with rumination or how the articulatory suppression task might have failed to disrupt rumination. However, if we assume that our scales (especially the RUM outcome response and the Verbality scale) are reliable and that the articulatory suppression was efficient in its intended purpose (i.e., suppressing speech motor activity), we are forced to admit that either i) rumination is not a type of inner speech that can be disrupted by peripheral muscle perturbation (i.e., it could be described as a more abstract form of inner speech) or that ii) inner speech, more broadly, does not depend on peripheral speech muscle activity. Although we think that these questions cannot be answered from our present results, we acknowledge that these two possibilities are compatible with our results.

In summary, the current research is one of the first behavioural studies exploring the association between verbal rumination and the speech motor system. While the observed data did not strictly corroborate our original hypotheses, we explored several explanations for the weak difference between articulatory suppression and the control task, and related our findings to previous works on the role of inner speech in verbal working memory. These results have important implications for future studies on articulatory suppression during inner speech or verbal working memory tasks. More precisely, they highlight the need for further investigation of the most appropriate control task when studying the effects of articulatory suppression.

6.5 Acknowledgements

We thank David Meary for his technical support in programming the eye-tracking experiment, Elena Keracheva for her help during data collection as well as Rafael Laboissiere and Brice Beffara for their advice concerning data analysis.

A lot of useful packages have been used for the writing of this paper, among which the papaja and knitr packages for writing and formatting (Aust & Barth, 2018; Xie, 2018), the ggplot2, ggforce, GGally, DiagrammeR, patchwork, BEST, and plotly packages for plotting (Iannone, 2018; Kruschke & Meredith, 2018; Pedersen, 2017, 2018; Schloerke et al., 2018; Sievert et al., 2017; Wickham et al., 2018), the sjstats and tidybayes packages for data analysis (Kay, 2018; Lüdecke, 2018), as well as the tidyverse and glue packages for code writing and formatting (Hester, 2017; Wickham, 2017).

6.6 Funding information

This project was funded by the ANR project INNERSPEECH (grant number ANR-13-BSH2-0003-01). The first author of the manuscript is funded by a doctoral fellowship from Univ. Grenoble Alpes.

6.7 Data Accessibility Statement

Pre-registered protocol, preprint, data, as well as reproducible code and figures are available at: https://osf.io/3bh67/.

This experimental chapter is a submitted manuscript reformatted for the need of this thesis. Source: Nalborczyk, L., Perrone-Bertolotti, M., Baeyens, C., Grandchamp, R., Spinelli, E., Koster, E.H.W., & Lvenbruck, H. (submitted). Articulatory suppression effects on induced rumination. Pre-registered protocol, preprint, data, as well as reproducible code and figures are available at: https://osf.io/3bh67/.↩
In the original power calculations included in the OSF preregistration platform, we had inadequately specified the effect size in GPower, but we only realised this erroneous specification after the freezing of the preregistration on the OSF platform. Therefore, the current sample size slightly differs from the preregistered one.↩
We computed this ratio because we were interested in the proportion of verbal thoughts relative to all thoughts and not in the total amount of verbal thoughts per se.↩
An introduction to Bayesian statistics is outside the scope of this paper. However, the interested reader is referred to Nalborczyk et al. (2019 a) for an introduction to Bayesian multilevel modelling using the brms package.↩
Note that we only included predictors that were theoretically relevant (as recommended, amongst others, by Burnham & Anderson, 2002, 2004). We did not blindly assess every combination of predictors.↩
This method simply consists in taking the ratio of the posterior density at the point of interest divided by the prior density at that point (for a practical introduction, see Wagenmakers et al., 2010).↩

	\(WAIC\)	\(pWAIC\)	\(\Delta_{WAIC}\)	\(Weight\)
\(Int+Ind+PANASpos+PANASneg+Ind:Viv+RRSbro\)	1857.01	61.35	0.00	0.350
\(Int+Ind+PANASpos+PANASneg+Ind:Viv+RRSbro+RRSref\)	1857.37	61.13	0.35	0.293
\(Int+Ind+PANASpos+PANASneg+Ind:Viv+RRSref\)	1858.01	61.40	0.99	0.213
\(Int+Ind+PANASneg+Ind:Viv\)	1859.84	63.54	2.83	0.085
\(Int+Ind+PANASpos+Ind:Viv\)	1862.42	64.51	5.41	0.023
\(Int+Ind+PANASneg\)	1863.08	66.34	6.07	0.017
\(Int+Ind+PANASpos\)	1863.98	62.28	6.96	0.011
\(Int+Ind+Ind:Viv\)	1865.41	63.04	8.40	0.005
\(Int+Ind\)	1867.09	65.06	10.08	0.002

	\(WAIC\)	\(pWAIC\)	\(\Delta_{WAIC}\)	\(Weight\)
\(Session+Cond+Session:Cond+RUMb+PANASn+RRSb+RRSr\)	1857.78	64.22	0.00	0.403
\(Session+Cond+RUMb+PANASn+RRSb+RRSr\)	1858.70	63.98	0.92	0.254
\(Session+Cond+Session:Cond+Session:Cond:Verb+RUMb+PANASp+RRSb+RRSr\)	1859.10	63.52	1.32	0.208
\(Session+Cond+Session:Cond+Session:Cond:Verb+RUMb+PANASn+RRSb+RRSr\)	1860.61	64.81	2.83	0.098
\(Session+Cond+Session:Cond\)	1863.63	69.18	5.85	0.022
\(Session+Cond\)	1866.36	69.17	8.58	0.006
\(Session\)	1866.40	69.70	8.62	0.005
\(Session+Cond+Session:Cond:Verb\)	1866.64	69.22	8.86	0.005
\(Null\ model\)	1876.91	67.11	19.13	0.000