Video Abstract
Children who receive more responsive care during their early childhood tend to exhibit stronger cognitive development, mental well-being, and physical health across their life course.
Determine how to design effective responsivity training programs for caregivers.
We searched seven electronic databases through October 2020.
Randomized trials (k = 120) of programs training parents of children ages 0 to 6 to be more responsive.
Two reviewers independently extracted data. Data were pooled by using random-effects pairwise and network meta-analyses.
Programs had, on average, a medium effect (d = 0.56; 95% confidence interval [CI]: 0.47 to 0.65). The most effective programs included didactic teaching and opportunities for parents to observe models, practice skills, and receive feedback (d = 1.07; 95% CI: 0.37 to 1.77), or all these instructional methods in addition to reflection (d = 0.86; 95% CI: 0.64 to 1.09). Programs that had participants observe examples of responsivity (d = 0.70; 95% CI: 0.57 to 0.83), used researchers as facilitators (d = 0.89; 95% CI: 0.66 to 1.12), assigned homework (d = 0.85; 95% CI: 0.66 to 1.02), and had a narrow scope (d = 0.72; 95% CI: 0.57 to 0.87) were more effective than those that did not.
Most samples included only mothers from Western countries and lacked follow-up data.
Having parents observe examples of responsive caregiving and complete home-practice in short, focused programs may be an effective, scalable approach to enhancing responsivity in the general population and reducing inequalities in child development.
Currently, >250 million children worldwide are not reaching their developmental potential1 ; to do so, children must consistently experience positive, responsive interactions with stable, supportive caregivers. Indeed, children whose bids for connection, care, or comfort are consistently left unmet are more likely to suffer adverse outcomes over the life course, such as lower educational achievement, anxiety, depression, or cardiovascular disease.2–4 Receiving consistent and constructive responses, in contrast, has been identified as a cornerstone of physical, neurophysiological, and psychological well-being.5 Responsive caregivers are attuned to both children’s emotional and cognitive states: they are not only aware of how children are feeling and respond empathically6 but also recognize what children are interested in and capable of, and respond with the right level of support or challenge to help them learn.7 Parent responsivity (also termed sensitivity because both terms are used interchangeably to measure the same construct) is positively associated with children’s global neural structure (eg, increased brain volume and cortical thickness) and localized neural function (eg, activation in language and emotion regulation networks).8–10 It is therefore hypothesized that engaging children in back-and-forth interactions that address their needs or expand on their interests is a mechanism that strengthens neural connections in regions of the brain responsible for language, executive function, and self-regulation, building blocks for positive adjustment across the life course.11,12 For this reason, responsive caregiving, as one important component of nurturing care, can contribute to breaking the intergenerational cycle of poverty and building strong societies.13
Numerous interventions have demonstrated that responsivity is a skill that can be taught and that improving parent responsivity leads to subsequent improvements in child outcomes.5,14,15 In a meta-analysis from 2003, researchers examined the impact of interventions targeting parental sensitivity or children’s attachment security (including a high proportion of clinical samples) and found that programs that had fewer sessions, had a clear-cut focus, and included video feedback were more effective than those that did not.16 Since then, there has been a proliferation of both research and investment in responsive caregiving, including in low- and middle-income countries, many of which are making substantial investments into enhancing early childhood development at a population level.17,18 To ensure such efforts are as effective and efficient as possible, there is a need for an updated integration of evidence on how to enhance responsive caregiving in the general population.18
An important question in the design of parenting programs is how parenting skills are best taught and learnt. Although meta-analyses exist summarizing evidence on which instructional methods (eg, didactic teaching, skill practice) support learning in academic or professional contexts,19,20 no meta-analysis to date has focused on pedagogical practices in responsivity training. The primary goal with this study was to apply a new approach to data synthesis (network meta-analysis) to identify the optimal combination of instructional methods for teaching responsive caregiving. Although pairwise meta-analyses can be used to identify the relative effectiveness of interventions that have been tested against a common comparator (eg, control group), with network meta-analyses, researchers can compare and rank the relative efficacy of different types of interventions, even in the absence of direct pairwise comparisons between those interventions.21 Furthermore, network meta-analyses allow for the inclusion of studies with >2 groups and those without a control group. We used this approach to better understand what helps adults learn to interact responsively with young children. In addition to shedding light on the value of different instructional methods, we provide updated answers about whether the effectiveness of responsivity training programs in general populations varies on the basis of study, program, or sample characteristics. Given the previous review on this topic from 200316 and the current most common practices in the field, we hypothesized that focused programs that emphasized rehearsal and feedback, particularly video feedback, would be most effective.
Methods
Identification and Selection of Studies
A protocol for this study was developed and registered with the International Prospective Register of Systematic Reviews (CRD42018100100). A search strategy (detailed in Section 1 of the Supplemental Information) was developed in consultation with a librarian and experts in developmental psychology and population health; the strategy was then reviewed by another librarian using the Peer Review of Electronic Search Strategies checklist.22 We searched in PsycINFO, Medline, the Cumulative Index to Nursing and Allied Health Literature, Social Work Abstracts, the Education Resources Information Center, ABI/INFORM, and the Cochrane Central Register of Controlled Trials from the inception of each of these databases to October 22, 2020. To identify additional studies, we scanned the reference lists of both included studies and relevant reviews retrieved in our search. The initial intent was to assess effective methods for teaching general populations of adults to be empathetic and responsive to what others are thinking and feeling. However, because of the number of studies identified and to ensure homogeneity of networks, we subdivided the analyses. The current article is focused on studies training parents only.
We included randomized trials of training programs teaching adults how to be more responsive toward their children. Studies had to be available in English (the language of the authors, a limitation because of coding capacities) and report on a quantitative measure of parents’ ability to understand what their children were thinking or feeling and respond in an appropriate (ie, sensitive, responsive, and empathic) manner. When data were not reported in a way that could be converted into an effect size, we contacted the author (20 contacts and 8 responses). We excluded studies in which the reported prevalence of any mental illness (eg, depression, anxiety, addiction, schizophrenia, and/or autism spectrum disorder) among parents was >25% (details are provided in Section 2 of the Supplemental Information). We applied this criterion to limit our findings to the general population, in which rates of psychopathology are ∼20%,23 because responsivity learning may be dampened or different in clinical populations; indeed, parents facing the challenges of depression are, on average, less responsive and show smaller gains from responsivity training.24,25 We also only included studies in which researchers assessed responsivity toward children in early childhood (ie, average age of <6 years) because this is the developmental period during which developmental trajectories are the most malleable and responsivity is most strongly associated with children’s later cognitive and socioemotional outcomes.5,18
Node Formation
We convened a group of 5 parenting researchers, adult educators, and clinical psychologists to conduct a qualitative consensus-based categorization procedure.26 All participants first familiarized themselves with existing meta-analyses of instructional methods.19,20 Then they reviewed the same 10 articles in which researchers describe different parenting interventions, defined their instructional approaches, independently categorized them, and discussed discrepancies to identify a final consensus. This resulted in coding based on 5 types of instructional methods: (1) didactic learning (reading about or being told how to be responsive), (2) observation (watching examples of responsive interactions), (3) reflection (reflecting on and discussing one’s own experience with responsivity), (4) rehearsal (practicing being responsive during interactions), and (5) feedback (receiving feedback on one’s display of responsivity). These methods were coded on the basis of their presence or absence in the program description; unfortunately, most articles did not provide sufficient detail to evaluate the relative intensity of, or time spent using, these different instructional approaches. We also distinguished 3 types of control groups, as per common practice in network meta analyses27 : (1) those who received no training (eg, waitlist); (2) those who were receiving standard care (ie, treatment as usual), such as neonatal care or services to support a child with a developmental delay; and (3) those who were active controls, receiving training similar in format to the intervention group but covering other topics in child development (eg, nutrition or health) and not responsivity.
Risk of Bias and Data Extraction
The first author trained advanced undergraduate and graduate student research assistants on each phase of study selection and data extraction; assistants had to obtain an interrater reliability >80% on a sample of articles to pass the training. Once trained, 2 research assistants independently screened all abstracts and full texts, abstracted data, and appraised study quality using the Cochrane risk-of-bias tool.28 Coders extracted moderators, including characteristics of the study (eg, year and publication type), program (eg, individual or group delivery, facilitator, dose, and inclusion of home visits or video feedback), and sample (eg, age, education, income, and child health) using a predefined data extraction sheet, according to the definitions outlined in Section 2 of the Supplemental Information. Although all material was double coded and all discrepancies were resolved by the first author, it is notable that coder agreement was high (93% for abstract screening and 86% for both full-text review and data extraction).
For studies that reported multiple outcomes, we selected the most methodologically rigorous measures of responsivity, choosing reliable observational outcomes over parent-reported outcomes. Although observational measures are not without their limitations,29 they are the most sensitive to change after intervention and provide the best prediction of child outcomes.30,31 When multiple outcomes of the same methodologic quality (eg, multiple observational measures of responsivity) were reported in a given study, we calculated a pooled effect size across these measures.32 For 3-arm trials in which 2 of the 3 interventions were functionally equivalent (ie, minor difference in content and format of delivery), we calculated a pooled effect size for those 2 interventions relative to the third. Details of the measures and study arms used for each comparison are available in Section 3 of the Supplemental Information.
Statistical Analysis
We used Comprehensive Meta-Analysis software to calculate the standardized mean difference (Cohen’s d) effect sizes and standard errors. Then, we evaluated the assumptions of network meta-analysis. We assessed transitivity by examining the comparability of the distributions of study, program, and sample characteristics across comparisons. We assessed consistency globally using the design-by-treatment interaction model and locally using the loop-specific method.33,34 If assumptions were met, we conducted a random-effects network meta-analysis using the network and network graphs packages in Stata (version 16; Stata Corp, College Station, TX), assuming a common within-network between-study variance (τ2). The efficacy of different interventions was ranked by using surface under the cumulative ranking (SUCRA) curve values.35 To assess reporting bias, we plotted a comparison-adjusted funnel plot.36 Sensitivity analyses were conducted, excluding outliers (defined as having a standardized mean difference 3 SDs above or below the mean) and studies with a high risk of bias. We also conducted meta-regressions for variables identified as potentially unevenly distributed on the basis of transitivity analyses.
In addition, we conducted a random-effects pairwise meta-analysis in Comprehensive Meta-Analysis using the subset of studies that had a control group. For three-arm studies in which 2 of the interventions were not equivalent, we used comparisons between the most intensive intervention group and control group. We assessed for statistical heterogeneity with the I2 statistic and explored it using mixed-effects model meta-regressions.28 We assessed for publication bias using the funnel plot and Egger’s test.28
Results
After screening 10 725 citations, 1637 articles were assessed for eligibility, and ultimately 119 articles in which researchers reported on data from 120 unique samples and 12 376 participants were included (Fig 1). A summary of the characteristics of these studies is reported in Table 1 (see Section 3 of the Supplemental Information for details). Most studies were conducted from 2010 to 2020 in North America or Europe; participants were predominantly white mothers with less than a university education. Most programs were delivered one-on-one by professionals, with a median duration of 8 sessions or 11.5 weeks. The majority of studies exhibited a low risk of bias in all categories except allocation concealment, for which 65% had an unclear risk (see Section 4 of the Supplemental Information for details).
Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow used to identify studies for inclusion.
Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow used to identify studies for inclusion.
Study and Participant Characteristics
. | All Studies (N = 120), No. (%) . | Pairwise Meta-analysis (n = 113), No. (%) . |
---|---|---|
Study characteristics | ||
Year of publication | ||
1978–1989 | 7 (6) | 6 (5) |
1990–1999 | 13 (11) | 13 (12) |
2000–2009 | 21 (17) | 19 (17) |
2010–2020 | 79 (66) | 75 (66) |
Country region | ||
North America | 59 (49) | 55 (48) |
Latin America and the Caribbean | 2 (2) | 2 (2) |
Europe and Central Asia | 36 (30) | 35 (31) |
Sub-Saharan Africa | 2 (2) | 2 (2) |
South Asia | 3 (2) | 2 (2) |
East Asia and Pacific | 18 (15) | 17 (15) |
Sample size | ||
1–49 | 32 (27) | 31 (28) |
50–99 | 39 (32) | 34 (30) |
100–149 | 16 (13) | 16 (14) |
150–199 | 9 (8) | 9 (8) |
≥200 | 24 (20) | 23 (20) |
Participant characteristics | ||
Mean age, y | ||
<20 | 2 (2) | 2 (2) |
20–29 | 38 (32) | 36 (32) |
30–39 | 47 (39) | 44 (39) |
≥40 | 7 (6) | 7 (6) |
Not reported | 26 (21) | 24 (21) |
Mean education, y | ||
<12 | 32 (27) | 31 (27) |
12–15.9 | 53 (44) | 51 (45) |
≥16 | 14 (12) | 12 (11) |
Not reported | 21 (17) | 19 (17) |
Caregiver | ||
Mothers only | 70 (58) | 65 (58) |
Fathers only | 6 (5) | 6 (5) |
Mother or father | 27 (22) | 27 (24) |
Mixed caregivers | 9 (8) | 7 (6) |
Mothers and fathers (attend together) | 8 (7) | 8 (7) |
Ethnicity | ||
>50% white | 52 (43) | 46 (41) |
>50% Black and/or African American | 7 (6) | 7 (6) |
>50% Hispanic or Latino | 3 (2) | 3 (3) |
>50% Asian | 1 (1) | 1 (1) |
>50% Aboriginal or indigenous | 1 (1) | 1 (1) |
Mixed ethnicities | 15 (13) | 15 (13) |
Not reported | 41 (34) | 40 (35) |
. | All Studies (N = 120), No. (%) . | Pairwise Meta-analysis (n = 113), No. (%) . |
---|---|---|
Study characteristics | ||
Year of publication | ||
1978–1989 | 7 (6) | 6 (5) |
1990–1999 | 13 (11) | 13 (12) |
2000–2009 | 21 (17) | 19 (17) |
2010–2020 | 79 (66) | 75 (66) |
Country region | ||
North America | 59 (49) | 55 (48) |
Latin America and the Caribbean | 2 (2) | 2 (2) |
Europe and Central Asia | 36 (30) | 35 (31) |
Sub-Saharan Africa | 2 (2) | 2 (2) |
South Asia | 3 (2) | 2 (2) |
East Asia and Pacific | 18 (15) | 17 (15) |
Sample size | ||
1–49 | 32 (27) | 31 (28) |
50–99 | 39 (32) | 34 (30) |
100–149 | 16 (13) | 16 (14) |
150–199 | 9 (8) | 9 (8) |
≥200 | 24 (20) | 23 (20) |
Participant characteristics | ||
Mean age, y | ||
<20 | 2 (2) | 2 (2) |
20–29 | 38 (32) | 36 (32) |
30–39 | 47 (39) | 44 (39) |
≥40 | 7 (6) | 7 (6) |
Not reported | 26 (21) | 24 (21) |
Mean education, y | ||
<12 | 32 (27) | 31 (27) |
12–15.9 | 53 (44) | 51 (45) |
≥16 | 14 (12) | 12 (11) |
Not reported | 21 (17) | 19 (17) |
Caregiver | ||
Mothers only | 70 (58) | 65 (58) |
Fathers only | 6 (5) | 6 (5) |
Mother or father | 27 (22) | 27 (24) |
Mixed caregivers | 9 (8) | 7 (6) |
Mothers and fathers (attend together) | 8 (7) | 8 (7) |
Ethnicity | ||
>50% white | 52 (43) | 46 (41) |
>50% Black and/or African American | 7 (6) | 7 (6) |
>50% Hispanic or Latino | 3 (2) | 3 (3) |
>50% Asian | 1 (1) | 1 (1) |
>50% Aboriginal or indigenous | 1 (1) | 1 (1) |
Mixed ethnicities | 15 (13) | 15 (13) |
Not reported | 41 (34) | 40 (35) |
Network Meta-analysis
A total of 16 different combinations of instructional methods or types of control groups were used across the studies, with 35 unique direct comparisons. The network geometry is presented in Fig 2. The most frequent type of intervention included all instructional methods, whereas there were relatively fewer studies of interventions in which researchers used only 1 or 2 approaches. The network meta-analysis revealed no evidence of inconsistency (global: χ2[19] = 11.03; P = .92; local: all P >.10) but some heterogeneity (τ2 = 0.21).
Network diagram. The size of the node corresponds to the number of studies which included that combination of instructional approaches; nodes for which there were direct comparisons are linked with a line, the width of which indicates the number of studies in which researchers tested that comparison.
Network diagram. The size of the node corresponds to the number of studies which included that combination of instructional approaches; nodes for which there were direct comparisons are linked with a line, the width of which indicates the number of studies in which researchers tested that comparison.
Figure 3 displays the average effect size for each combination of instructional methods compared with the pure control groups with no training. In addition, a table listing the effect sizes for all possible two-way comparisons between different combinations of instructional methods is available in Section 5 of the Supplemental Information. Several combinations of instructional methods were significantly more effective than all 3 types of control groups, with medium to large effect sizes: programs using didactic, observation, rehearsal, and feedback (d = 0.89 to 1.07); all 5 instructional methods (d = 0.68 to 0.86); didactic, observation, and reflection (d = 0.63 to 0.81); didactic, observation, reflection, and rehearsal (d = 0.54 to 0.72); and didactic, reflection, rehearsal, and feedback (d = 0.31 to 0.50). Relative efficacy ranking, based on the SUCRA curves, revealed that the most effective types of program were those that included: (1) didactic, observation, rehearsal, and feedback, (2) all 5 instructional methods, and (3) didactic, observation, and reflection. The SUCRA values and curves are presented in Section 5 of the Supplemental Information.
Forest plot from network meta-analysis in which we compare each combination of instructional methods to control.
Forest plot from network meta-analysis in which we compare each combination of instructional methods to control.
The comparison-adjusted funnel plot (Section 6 of the Supplemental Information) suggested that small studies may exaggerate the effectiveness of less intensive interventions, although this was heavily influenced by a few outliers. In the sensitivity analysis without outliers, we identified the same top 3 combinations of instructional methods. In the analysis with only low risk-of-bias studies, we identified programs using all instructional methods as being the most effective and those without feedback being more effective than those without reflection (see Section 6 of the Supplemental Information for details). Program rankings also differed somewhat in adjusted versus unadjusted models (see Section 6 of the Supplemental Information). Nonetheless, programs that used didactic instruction, observation, rehearsal, and feedback ranked in the top 3 in 83% of network meta-regression analyses, whereas those using all instructional methods were ranked in the top 3 75% of the time. In all scenarios, at least 1 of these types of programs was in the top 3 and in many cases both were in the top 3, underscoring the consistent superiority of programs using these combinations of instructional approaches.
Pairwise Random-Effects Meta-analysis
The majority of studies included a control group (113 samples involving 11 693 parents) and could be included in the pairwise meta-analysis. A forest plot for these studies is available in Section 7 of the Supplemental Information. Overall, there was a medium effect size of 0.56 (95% confidence interval [CI]: 0.47 to 0.65; without 3 outliers: d = 0.52; 95% CI: 0.43 to 0.60).37 Although there was evidence of publication bias (Egger’s test P < .0001; funnel plot in Section 8 of the Supplemental Information), Duval and Tweedie’s trim-and-fill procedure did not suggest trimming any studies or produce a different estimate.
Because there was substantial heterogeneity (I2 = 79.52; P < .001), we tested whether this could be explained by study, program, or sample characteristics. The results of the meta-regressions are presented in Table 2. Programs that were more effective required participants to complete homework assignments (d = 0.84; 95% CI: 0.66 to 1.02, compared with d = 0.40; 95% CI: 0.31 to 0.49), were facilitated by a researcher (d = 0.89; 95% CI: 0.66 to 1.12), rather than a different type of facilitator (d ranging from 0.20 to 0.53), and were focused narrowly on teaching responsivity (d = 0.72; 95% CI: 0.57 to 0.87), rather than covering a broad range of unrelated topics (d = 0.40; 95% CI: 0.30 to 0.50). Programs in which the participants’ children were preschoolers were also more effective (d = 0.87; 95% CI: 0.64 to 1.10), on average, than those in which children were infants (d = 0.47; 95% CI: 0.34 to 0.60) or spanned multiple age categories (d = 0.31; 95% CI: 0.07 to 0.54). Finally, program effects were stronger for participants whose children did not have any physical impairments (d = 0.62; 95% CI: 0.51 to 0.73), compared with those whose children were physical impaired (d = 0.35; 95% CI: 0.21 to 0.49). In a multivariate regression considering all moderators that were significant in the individual analyses, only use of homework (P = .006), type of facilitator (P = .007), and program scope (P = .02) remained statistically significant predictors of program effect size. Together, these 3 variables explained 40% of the variance in program effectiveness. Sensitivity analyses were also conducted, replicating all meta-regressions without the 3 outlier studies (see Section 8 in the Supplemental Information). In these analyses, significant effects for child age and physical impairment were not replicated (P = .12 and .07, respectively). However, there was an effect of program length whereby programs that lasted >1 year were significantly less effective (d = 0.17; 95% CI: −0.008 to 0.35) than those taking place within 1 year (d ranging from 0.54 to 0.66).
Meta-Regressions for Categorical and Continuous Moderators
. | k . | d or Ba . | 95% CI . | P . | R2 . |
---|---|---|---|---|---|
Study characteristics | |||||
Year of publication | 113 | −0.01 | −0.02 to 0.0005 | .06 | 0.03 |
Type of publication | |||||
Journal | 110 | 0.56 | 0.47 to 0.65 | .95 | 0.00 |
Dissertation | 3 | 0.61 | 0.20 to 1.02 | ||
Country income | |||||
High | 103 | 0.56 | 0.46 to 0.65 | .42 | 0.00 |
Upper middle | 8 | 0.87 | 0.40 to 1.33 | ||
Lower middle | 2 | 0.35 | −0.21 to 0.91 | ||
Type of control group | |||||
Pure control | 36 | 0.73 | 0.55 to 0.92 | .14 | 0.07 |
Active control | 36 | 0.49 | 0.37 to 0.61 | ||
Standard care | 41 | 0.49 | 0.34 to 0.64 | ||
Risk of bias | |||||
High | 33 | 0.60 | 0.46 to 0.75 | .61 | 0.01 |
Low | 80 | 0.55 | 0.44 to 0.66 | ||
Program characteristics | |||||
No. sessions | |||||
<5 sessions | 19 | 0.68 | 0.45 to 0.91 | .69 | 0.03 |
5–10 sessions | 54 | 0.52 | 0.39 to 0.64 | ||
11–20 sessions | 21 | 0.61 | 0.38 to 0.84 | ||
>20 sessions | 18 | 0.47 | 0.28 to 0.67 | ||
Program length | |||||
<1 mo | 7 | 0.64 | 0.17 to 1.12 | .06 | 0.11 |
1–5 mo | 78 | 0.60 | 0.49 to 0.71 | ||
6 mo to 1 y | 16 | 0.72 | 0.46 to 0.97 | ||
>1 y | 9 | 0.17 | −0.008 to 0.35 | ||
Program hours | 88 | 0.003 | −0.003 to 0.009 | .32 | 0.01 |
Frequency (average sessions per wk) | 109 | 0.10 | −0.10 to 0.30 | .31 | 0.01 |
Delivery format | |||||
In-person group | 17 | 0.57 | 0.32 to 0.82 | .71 | 0.03 |
In-person individual | 72 | 0.60 | 0.49 to 0.71 | ||
In-person group and individual | 20 | 0.48 | 0.26 to 0.70 | ||
Online and in person | 1 | 0.23 | −0.73 to 1.18 | ||
Online only | 3 | 0.27 | −0.32 to 0.86 | ||
Program facilitator | |||||
Professional | 69 | 0.53 | 0.41 to 0.64 | .009 | 0.16 |
Paraprofessional | 9 | 0.30 | 0.14 to 0.46 | ||
Professional or paraprofessional | 2 | 0.20 | 0.07 to 0.33 | ||
Researcher | 28 | 0.89 | 0.66 to 1.12 | ||
No facilitator | 3 | 0.27 | −0.32 to 0.86 | ||
Program designer | |||||
Affiliated | 78 | 0.58 | 0.48 to 0.68 | .59 | 0.01 |
Unaffiliated | 35 | 0.52 | 0.35 to 0.70 | ||
Scope of training | |||||
Narrow | 58 | 0.72 | 0.57 to 0.87 | .005 | 0.12 |
Broad | 55 | 0.40 | 0.30 to 0.50 | ||
Home visits | |||||
Yes | 72 | 0.51 | 0.41 to 0.61 | .22 | 0.02 |
No | 41 | 0.67 | 0.49 to 0.84 | ||
Percent home visits | 110 | −0.05 | −0.27 to 0.16 | .63 | 0.00 |
Video feedback | |||||
Yes | 52 | 0.61 | 0.48 to 0.74 | .43 | 0.02 |
No | 61 | 0.52 | 0.40 to 0.63 | ||
Homework | |||||
Yes | 43 | 0.84 | 0.66 to 1.02 | .0001 | 0.18 |
No | 70 | 0.40 | 0.31 to 0.49 | ||
Sample characteristics | |||||
Parent age | 89 | 0.02 | −0.0002 to 0.04 | .05 | 0.08 |
Parent education | 93 | 0.006 | −0.03 to 0.05 | .76 | 0.01 |
Parent participant | |||||
Mothers only | 65 | 0.50 | 0.39 to 0.60 | .29 | 0.07 |
Fathers only | 6 | 0.61 | 0.24 to 0.98 | ||
Mother or father | 27 | 0.76 | 0.53 to 1.00 | ||
Mixed caregivers | 7 | 0.31 | 0.09 to 0.54 | ||
Mothers and fathers | 8 | 0.62 | 0.21 to 1.03 | ||
Low income | |||||
Yes | 40 | 0.55 | 0.41 to 0.69 | .89 | 0.00 |
No | 69 | 0.57 | 0.45 to 0.68 | ||
Challenged parenting | |||||
Yes | 61 | 0.58 | 0.46 to 0.71 | .82 | 0.01 |
No | 52 | 0.54 | 0.41 to 0.66 | ||
Child age category | |||||
Infant | 42 | 0.47 | 0.34 to 0.60 | .01 | 0.11 |
Toddler | 30 | 0.52 | 0.38 to 0.66 | ||
Preschooler | 31 | 0.87 | 0.64 to 1.10 | ||
Multiple categories | 10 | 0.31 | 0.07 to 0.54 | ||
Child physical risk | |||||
Yes | 20 | 0.35 | 0.21 to 0.49 | .05 | 0.04 |
No | 93 | 0.62 | 0.51 to 0.73 | ||
Child neurocognitive risk | |||||
Yes | 48 | 0.57 | 0.43 to 0.71 | .97 | 0.00 |
No | 65 | 0.56 | 0.44 to 0.68 | ||
Instructional methods | |||||
Didactic | |||||
Yes | 103 | 0.58 | 0.48 to 0.67 | .36 | 0.01 |
No | 10 | 0.41 | 0.15 to 0.68 | ||
Observation | |||||
Yes | 73 | 0.70 | 0.57 to 0.83 | .001 | 0.10 |
No | 40 | 0.33 | 0.23 to 0.43 | ||
Reflection | |||||
Yes | 106 | 0.56 | 0.47 to 0.65 | .78 | 0.00 |
No | 7 | 0.63 | 0.24 to 1.02 | ||
Rehearsal | |||||
Yes | 101 | 0.57 | 0.48 to 0.67 | .50 | 0.01 |
No | 12 | 0.49 | 0.15 to 0.84 | ||
Feedback | |||||
Yes | 94 | 0.56 | 0.47 to 0.66 | .99 | 0.00 |
No | 19 | 0.57 | 0.34 to 0.81 |
. | k . | d or Ba . | 95% CI . | P . | R2 . |
---|---|---|---|---|---|
Study characteristics | |||||
Year of publication | 113 | −0.01 | −0.02 to 0.0005 | .06 | 0.03 |
Type of publication | |||||
Journal | 110 | 0.56 | 0.47 to 0.65 | .95 | 0.00 |
Dissertation | 3 | 0.61 | 0.20 to 1.02 | ||
Country income | |||||
High | 103 | 0.56 | 0.46 to 0.65 | .42 | 0.00 |
Upper middle | 8 | 0.87 | 0.40 to 1.33 | ||
Lower middle | 2 | 0.35 | −0.21 to 0.91 | ||
Type of control group | |||||
Pure control | 36 | 0.73 | 0.55 to 0.92 | .14 | 0.07 |
Active control | 36 | 0.49 | 0.37 to 0.61 | ||
Standard care | 41 | 0.49 | 0.34 to 0.64 | ||
Risk of bias | |||||
High | 33 | 0.60 | 0.46 to 0.75 | .61 | 0.01 |
Low | 80 | 0.55 | 0.44 to 0.66 | ||
Program characteristics | |||||
No. sessions | |||||
<5 sessions | 19 | 0.68 | 0.45 to 0.91 | .69 | 0.03 |
5–10 sessions | 54 | 0.52 | 0.39 to 0.64 | ||
11–20 sessions | 21 | 0.61 | 0.38 to 0.84 | ||
>20 sessions | 18 | 0.47 | 0.28 to 0.67 | ||
Program length | |||||
<1 mo | 7 | 0.64 | 0.17 to 1.12 | .06 | 0.11 |
1–5 mo | 78 | 0.60 | 0.49 to 0.71 | ||
6 mo to 1 y | 16 | 0.72 | 0.46 to 0.97 | ||
>1 y | 9 | 0.17 | −0.008 to 0.35 | ||
Program hours | 88 | 0.003 | −0.003 to 0.009 | .32 | 0.01 |
Frequency (average sessions per wk) | 109 | 0.10 | −0.10 to 0.30 | .31 | 0.01 |
Delivery format | |||||
In-person group | 17 | 0.57 | 0.32 to 0.82 | .71 | 0.03 |
In-person individual | 72 | 0.60 | 0.49 to 0.71 | ||
In-person group and individual | 20 | 0.48 | 0.26 to 0.70 | ||
Online and in person | 1 | 0.23 | −0.73 to 1.18 | ||
Online only | 3 | 0.27 | −0.32 to 0.86 | ||
Program facilitator | |||||
Professional | 69 | 0.53 | 0.41 to 0.64 | .009 | 0.16 |
Paraprofessional | 9 | 0.30 | 0.14 to 0.46 | ||
Professional or paraprofessional | 2 | 0.20 | 0.07 to 0.33 | ||
Researcher | 28 | 0.89 | 0.66 to 1.12 | ||
No facilitator | 3 | 0.27 | −0.32 to 0.86 | ||
Program designer | |||||
Affiliated | 78 | 0.58 | 0.48 to 0.68 | .59 | 0.01 |
Unaffiliated | 35 | 0.52 | 0.35 to 0.70 | ||
Scope of training | |||||
Narrow | 58 | 0.72 | 0.57 to 0.87 | .005 | 0.12 |
Broad | 55 | 0.40 | 0.30 to 0.50 | ||
Home visits | |||||
Yes | 72 | 0.51 | 0.41 to 0.61 | .22 | 0.02 |
No | 41 | 0.67 | 0.49 to 0.84 | ||
Percent home visits | 110 | −0.05 | −0.27 to 0.16 | .63 | 0.00 |
Video feedback | |||||
Yes | 52 | 0.61 | 0.48 to 0.74 | .43 | 0.02 |
No | 61 | 0.52 | 0.40 to 0.63 | ||
Homework | |||||
Yes | 43 | 0.84 | 0.66 to 1.02 | .0001 | 0.18 |
No | 70 | 0.40 | 0.31 to 0.49 | ||
Sample characteristics | |||||
Parent age | 89 | 0.02 | −0.0002 to 0.04 | .05 | 0.08 |
Parent education | 93 | 0.006 | −0.03 to 0.05 | .76 | 0.01 |
Parent participant | |||||
Mothers only | 65 | 0.50 | 0.39 to 0.60 | .29 | 0.07 |
Fathers only | 6 | 0.61 | 0.24 to 0.98 | ||
Mother or father | 27 | 0.76 | 0.53 to 1.00 | ||
Mixed caregivers | 7 | 0.31 | 0.09 to 0.54 | ||
Mothers and fathers | 8 | 0.62 | 0.21 to 1.03 | ||
Low income | |||||
Yes | 40 | 0.55 | 0.41 to 0.69 | .89 | 0.00 |
No | 69 | 0.57 | 0.45 to 0.68 | ||
Challenged parenting | |||||
Yes | 61 | 0.58 | 0.46 to 0.71 | .82 | 0.01 |
No | 52 | 0.54 | 0.41 to 0.66 | ||
Child age category | |||||
Infant | 42 | 0.47 | 0.34 to 0.60 | .01 | 0.11 |
Toddler | 30 | 0.52 | 0.38 to 0.66 | ||
Preschooler | 31 | 0.87 | 0.64 to 1.10 | ||
Multiple categories | 10 | 0.31 | 0.07 to 0.54 | ||
Child physical risk | |||||
Yes | 20 | 0.35 | 0.21 to 0.49 | .05 | 0.04 |
No | 93 | 0.62 | 0.51 to 0.73 | ||
Child neurocognitive risk | |||||
Yes | 48 | 0.57 | 0.43 to 0.71 | .97 | 0.00 |
No | 65 | 0.56 | 0.44 to 0.68 | ||
Instructional methods | |||||
Didactic | |||||
Yes | 103 | 0.58 | 0.48 to 0.67 | .36 | 0.01 |
No | 10 | 0.41 | 0.15 to 0.68 | ||
Observation | |||||
Yes | 73 | 0.70 | 0.57 to 0.83 | .001 | 0.10 |
No | 40 | 0.33 | 0.23 to 0.43 | ||
Reflection | |||||
Yes | 106 | 0.56 | 0.47 to 0.65 | .78 | 0.00 |
No | 7 | 0.63 | 0.24 to 1.02 | ||
Rehearsal | |||||
Yes | 101 | 0.57 | 0.48 to 0.67 | .50 | 0.01 |
No | 12 | 0.49 | 0.15 to 0.84 | ||
Feedback | |||||
Yes | 94 | 0.56 | 0.47 to 0.66 | .99 | 0.00 |
No | 19 | 0.57 | 0.34 to 0.81 |
Column contains d (effect size) for categorical variables and B (unstandardized regression coefficient) for continuous variables.
We also used meta-regression to test for the effects of individual instructional methods. Programs that had caregivers observe examples of responsivity were significantly more effective (d = 0.70; 95% CI: 0.57 to 0.83), on average, than those that did not (d = 0.33; 95% CI: 0.23 to 0.43), even when outlier studies were removed. Programs were not consistently more or less effective whether they included didactic, reflection, rehearsal, or feedback methods.
Discussion
Enhancing parental responsivity is one way our society can invest in improving the mental and physical well-being of future generations and reduce intergenerational inequalities.13 The current review was conducted to help maximize the efficiency of efforts to enhance parental responsivity at a population level. The 3 most effective types of programs identified in the network meta-analysis all included didactic and observation activities, with rehearsal and feedback being an added feature of the single most effective category of programs. Across all sensitivity analyses, programs using all 5 instructional methods (didactic, observation, reflection, rehearsal, and feedback) were consistently identified as the most or second most effective and had a large effect size. Taken together, these results suggest that it is valuable to include all instructional methods when designing training programs; however, in cases in which reflection, rehearsal, or feedback are not feasible or too costly omitting one or a few of these formats should not come at a high cost to parent learning. In contrast, observational teaching should not be omitted, as the results of the pairwise meta-analysis suggest that one of the best ways for parents to learn how to be responsive caregivers is to observe good examples of responsive caregiving, an important and novel finding for program designers.
The pairwise meta-analysis also revealed 3 aspects of program design robustly associated with above-average effect sizes. First, in line with previous research,16 programs in which parents were only instructed on responsive caregiving were more effective than those with a broader scope. This replicated finding underscores the principle that when programs try to do everything, it is hard to do any one thing well. Second, programs in which homework was assigned to participants were significantly more effective than those in which it was not. Indeed, in the present meta-analysis, this aspect of program design individually accounted for 18% of the variance in program effectiveness. Although there is evidence to suggest that participants who complete more homework in parent training programs demonstrate stronger outcomes,38 existing research has not compared the effectiveness of programs that do or do not assign homework. The current findings imply that including such assignments is valuable and that responsivity is a skill that is best learnt through consistent practice. Finally, the programs that were most effective tended to be facilitated by researchers. Past reviews have found mixed evidence about the relative effectiveness of professionals, compared with nonprofessionals, as parenting program facilitators; however, these reviews have all categorized researchers as professionals.16,39 The current results suggest that researchers may be particularly effective at supporting parents to be more responsive. Although there is little literature to draw on in explaining this result, we speculate this distinction could be attributable to researchers’ enhanced understanding of key concepts related to responsivity and ability to convey a narrow, focused message regarding a program’s learning goals, thereby "teaching to the test."
This review also highlighted several program characteristics that do not enhance the learning of responsivity among parents without elevated levels of psychopathology: program dosage (sessions, hours, weeks, and frequency), individual versus group delivery, and the use of home visits or video feedback. In other words, long, individualized training programs may not help parents become substantially more responsive than short, group programs do. Furthermore, results suggest responsivity training programs benefit parents similarly, regardless of their level of education and income. Although these findings are in contrast with our hypotheses and findings from some older reviews,16,39,40 because researchers in these previous reviews assessed the effects of programs with different foci (eg, attachment or disruptive behavior) and included quasi-experimental studies as well as studies with clinical populations (eg, depressed mothers), direct comparisons cannot be made. Furthermore, these past meta-analyses are not representative of the abundance of research that has emerged since 2005. Indeed, in a more recent review, researchers have called into question the notion that parent and child characteristics moderate the effects of parenting programs.41 On the basis of the current results, we conclude that, although one-on-one training using personalized video feedback or home visits undoubtedly helps parents learn to be more responsive, these labor-intensive programs may not be absolutely necessary to elevate responsivity among parents in the general population. That being said, there may be other benefits of individualized programs that were not captured in the current analysis; for example, such programs may be necessary to reach certain parents.42
Contributions and Limitations
In this study, we used network meta-analysis to compare the relative effectiveness of various combinations of instructional methods in parent responsivity training programs. We summarized evidence from >100 randomized trials, the majority of which had a low risk of bias, and results were robust to the removal of outliers and studies with a high risk of bias. However, all research was published in English and conducted predominantly with white mothers in Western populations; as there was also evidence of publication bias, our search could have been strengthened through greater identification of international and unpublished data sources. Nonetheless, because the gap in research on interventions for fathers and caregivers from diverse backgrounds was also identified in a different recent review,43 this limitation was likely not a consequence of our search strategy or inclusion criteria and, moving forward, the academic community will need to conduct more research with fathers and ethnically diverse populations for us to better optimize program design for these populations. It is also important to note that our results are only applicable to parents without identified levels of psychopathology and this criterion led to the exclusion of some more vulnerable and diverse samples. Given that responsivity inversely correlates with psychopathology in parents,24 it will be important for researchers in future studies to establish whether optimal instructional methods differ for clinical and general populations. Finally, we also identified a gap in the reporting of details of program design and delivery as well as a shortage of follow-up data. Increasing the availability of such data would allow us to assess the relative efficacy of different instructional methods on the basis of the amount of time spent using each method and determine if certain instructional methods more effectively support sustained behavior change, compared with short-term learning.
Conclusions
The current findings provide several actionable recommendations for health professionals, policy makers, and program developers looking to improve children’s developmental trajectories. Specifically, results suggest that using expensive methods, such as year-long individual home visits, to help parents in the general population be more responsive toward their young children is not necessary. Developing short, focused programs led by researchers that include opportunities for parents to learn about and observe examples of responsive parent-child interactions with some take-home assignments between sessions could be just as or even more effective. Such recommendations may be applied to improving population health across low-, middle-, and high-income countries, during routine encounters with health professionals or using low-resource group programs implemented in health care or community settings. These results may make it easier to support more parents without using more resources, ultimately improving the lives of more children.
Dr Sokolovic conceptualized and designed the study, coordinated and supervised data extraction, conducted the statistical analyses, drafted the initial manuscript, and reviewed and revised the manuscript; Dr Rodrigues conceptualized and designed the study, selected the articles and extracted the data, and reviewed and revised the manuscript; Dr Tricco conceptualized and designed the study and reviewed and revised the manuscript; Ms Dobrina selected the articles, extracted the data, and reviewed and revised the manuscript; Dr Jenkins conceptualized and designed the study, conducted the statistical analyses, and critically reviewed the manuscript for important intellectual content; and all authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.
FUNDING: Ms Sokolovic was supported by the Social Sciences and Humanities Research Council of Canada (grant 767-2018-2577) to undertake this review. Dr Rodrigues is also supported in part by the Social Sciences and Humanities Research Council of Canada. Dr Tricco is supported by a Tier 2 Canada Research Chair in Knowledge Synthesis. Dr Jenkins is supported by the Atkinson Foundation.
References
Competing Interests
POTENTIAL CONFLICT OF INTEREST: Ms Sokolovic is the author of a parenting program (Support, not Perfection) and receives no royalties for the program; the other authors have indicated they have no potential conflicts of interest to disclose.
FINANCIAL DISCLOSURE: The authors have indicated they have no financial relationships relevant to this article to disclose.
Comments