Video Abstract

Video Abstract

Close modal
CONTEXT:

Children who receive more responsive care during their early childhood tend to exhibit stronger cognitive development, mental well-being, and physical health across their life course.

OBJECTIVE:

Determine how to design effective responsivity training programs for caregivers.

DATA SOURCES:

We searched seven electronic databases through October 2020.

STUDY SELECTION:

Randomized trials (k = 120) of programs training parents of children ages 0 to 6 to be more responsive.

DATA EXTRACTION:

Two reviewers independently extracted data. Data were pooled by using random-effects pairwise and network meta-analyses.

RESULTS:

Programs had, on average, a medium effect (d = 0.56; 95% confidence interval [CI]: 0.47 to 0.65). The most effective programs included didactic teaching and opportunities for parents to observe models, practice skills, and receive feedback (d = 1.07; 95% CI: 0.37 to 1.77), or all these instructional methods in addition to reflection (d = 0.86; 95% CI: 0.64 to 1.09). Programs that had participants observe examples of responsivity (d = 0.70; 95% CI: 0.57 to 0.83), used researchers as facilitators (d = 0.89; 95% CI: 0.66 to 1.12), assigned homework (d = 0.85; 95% CI: 0.66 to 1.02), and had a narrow scope (d = 0.72; 95% CI: 0.57 to 0.87) were more effective than those that did not.

LIMITATIONS:

Most samples included only mothers from Western countries and lacked follow-up data.

CONCLUSIONS:

Having parents observe examples of responsive caregiving and complete home-practice in short, focused programs may be an effective, scalable approach to enhancing responsivity in the general population and reducing inequalities in child development.

Currently, >250 million children worldwide are not reaching their developmental potential1 ; to do so, children must consistently experience positive, responsive interactions with stable, supportive caregivers. Indeed, children whose bids for connection, care, or comfort are consistently left unmet are more likely to suffer adverse outcomes over the life course, such as lower educational achievement, anxiety, depression, or cardiovascular disease.24  Receiving consistent and constructive responses, in contrast, has been identified as a cornerstone of physical, neurophysiological, and psychological well-being.5  Responsive caregivers are attuned to both children’s emotional and cognitive states: they are not only aware of how children are feeling and respond empathically6  but also recognize what children are interested in and capable of, and respond with the right level of support or challenge to help them learn.7  Parent responsivity (also termed sensitivity because both terms are used interchangeably to measure the same construct) is positively associated with children’s global neural structure (eg, increased brain volume and cortical thickness) and localized neural function (eg, activation in language and emotion regulation networks).810  It is therefore hypothesized that engaging children in back-and-forth interactions that address their needs or expand on their interests is a mechanism that strengthens neural connections in regions of the brain responsible for language, executive function, and self-regulation, building blocks for positive adjustment across the life course.11,12  For this reason, responsive caregiving, as one important component of nurturing care, can contribute to breaking the intergenerational cycle of poverty and building strong societies.13 

Numerous interventions have demonstrated that responsivity is a skill that can be taught and that improving parent responsivity leads to subsequent improvements in child outcomes.5,14,15  In a meta-analysis from 2003, researchers examined the impact of interventions targeting parental sensitivity or children’s attachment security (including a high proportion of clinical samples) and found that programs that had fewer sessions, had a clear-cut focus, and included video feedback were more effective than those that did not.16  Since then, there has been a proliferation of both research and investment in responsive caregiving, including in low- and middle-income countries, many of which are making substantial investments into enhancing early childhood development at a population level.17,18  To ensure such efforts are as effective and efficient as possible, there is a need for an updated integration of evidence on how to enhance responsive caregiving in the general population.18 

An important question in the design of parenting programs is how parenting skills are best taught and learnt. Although meta-analyses exist summarizing evidence on which instructional methods (eg, didactic teaching, skill practice) support learning in academic or professional contexts,19,20  no meta-analysis to date has focused on pedagogical practices in responsivity training. The primary goal with this study was to apply a new approach to data synthesis (network meta-analysis) to identify the optimal combination of instructional methods for teaching responsive caregiving. Although pairwise meta-analyses can be used to identify the relative effectiveness of interventions that have been tested against a common comparator (eg, control group), with network meta-analyses, researchers can compare and rank the relative efficacy of different types of interventions, even in the absence of direct pairwise comparisons between those interventions.21  Furthermore, network meta-analyses allow for the inclusion of studies with >2 groups and those without a control group. We used this approach to better understand what helps adults learn to interact responsively with young children. In addition to shedding light on the value of different instructional methods, we provide updated answers about whether the effectiveness of responsivity training programs in general populations varies on the basis of study, program, or sample characteristics. Given the previous review on this topic from 200316  and the current most common practices in the field, we hypothesized that focused programs that emphasized rehearsal and feedback, particularly video feedback, would be most effective.

A protocol for this study was developed and registered with the International Prospective Register of Systematic Reviews (CRD42018100100). A search strategy (detailed in Section 1 of the Supplemental Information) was developed in consultation with a librarian and experts in developmental psychology and population health; the strategy was then reviewed by another librarian using the Peer Review of Electronic Search Strategies checklist.22  We searched in PsycINFO, Medline, the Cumulative Index to Nursing and Allied Health Literature, Social Work Abstracts, the Education Resources Information Center, ABI/INFORM, and the Cochrane Central Register of Controlled Trials from the inception of each of these databases to October 22, 2020. To identify additional studies, we scanned the reference lists of both included studies and relevant reviews retrieved in our search. The initial intent was to assess effective methods for teaching general populations of adults to be empathetic and responsive to what others are thinking and feeling. However, because of the number of studies identified and to ensure homogeneity of networks, we subdivided the analyses. The current article is focused on studies training parents only.

We included randomized trials of training programs teaching adults how to be more responsive toward their children. Studies had to be available in English (the language of the authors, a limitation because of coding capacities) and report on a quantitative measure of parents’ ability to understand what their children were thinking or feeling and respond in an appropriate (ie, sensitive, responsive, and empathic) manner. When data were not reported in a way that could be converted into an effect size, we contacted the author (20 contacts and 8 responses). We excluded studies in which the reported prevalence of any mental illness (eg, depression, anxiety, addiction, schizophrenia, and/or autism spectrum disorder) among parents was >25% (details are provided in Section 2 of the Supplemental Information). We applied this criterion to limit our findings to the general population, in which rates of psychopathology are ∼20%,23  because responsivity learning may be dampened or different in clinical populations; indeed, parents facing the challenges of depression are, on average, less responsive and show smaller gains from responsivity training.24,25  We also only included studies in which researchers assessed responsivity toward children in early childhood (ie, average age of <6 years) because this is the developmental period during which developmental trajectories are the most malleable and responsivity is most strongly associated with children’s later cognitive and socioemotional outcomes.5,18 

We convened a group of 5 parenting researchers, adult educators, and clinical psychologists to conduct a qualitative consensus-based categorization procedure.26  All participants first familiarized themselves with existing meta-analyses of instructional methods.19,20  Then they reviewed the same 10 articles in which researchers describe different parenting interventions, defined their instructional approaches, independently categorized them, and discussed discrepancies to identify a final consensus. This resulted in coding based on 5 types of instructional methods: (1) didactic learning (reading about or being told how to be responsive), (2) observation (watching examples of responsive interactions), (3) reflection (reflecting on and discussing one’s own experience with responsivity), (4) rehearsal (practicing being responsive during interactions), and (5) feedback (receiving feedback on one’s display of responsivity). These methods were coded on the basis of their presence or absence in the program description; unfortunately, most articles did not provide sufficient detail to evaluate the relative intensity of, or time spent using, these different instructional approaches. We also distinguished 3 types of control groups, as per common practice in network meta analyses27 : (1) those who received no training (eg, waitlist); (2) those who were receiving standard care (ie, treatment as usual), such as neonatal care or services to support a child with a developmental delay; and (3) those who were active controls, receiving training similar in format to the intervention group but covering other topics in child development (eg, nutrition or health) and not responsivity.

The first author trained advanced undergraduate and graduate student research assistants on each phase of study selection and data extraction; assistants had to obtain an interrater reliability >80% on a sample of articles to pass the training. Once trained, 2 research assistants independently screened all abstracts and full texts, abstracted data, and appraised study quality using the Cochrane risk-of-bias tool.28  Coders extracted moderators, including characteristics of the study (eg, year and publication type), program (eg, individual or group delivery, facilitator, dose, and inclusion of home visits or video feedback), and sample (eg, age, education, income, and child health) using a predefined data extraction sheet, according to the definitions outlined in Section 2 of the Supplemental Information. Although all material was double coded and all discrepancies were resolved by the first author, it is notable that coder agreement was high (93% for abstract screening and 86% for both full-text review and data extraction).

For studies that reported multiple outcomes, we selected the most methodologically rigorous measures of responsivity, choosing reliable observational outcomes over parent-reported outcomes. Although observational measures are not without their limitations,29  they are the most sensitive to change after intervention and provide the best prediction of child outcomes.30,31  When multiple outcomes of the same methodologic quality (eg, multiple observational measures of responsivity) were reported in a given study, we calculated a pooled effect size across these measures.32  For 3-arm trials in which 2 of the 3 interventions were functionally equivalent (ie, minor difference in content and format of delivery), we calculated a pooled effect size for those 2 interventions relative to the third. Details of the measures and study arms used for each comparison are available in Section 3 of the Supplemental Information.

We used Comprehensive Meta-Analysis software to calculate the standardized mean difference (Cohen’s d) effect sizes and standard errors. Then, we evaluated the assumptions of network meta-analysis. We assessed transitivity by examining the comparability of the distributions of study, program, and sample characteristics across comparisons. We assessed consistency globally using the design-by-treatment interaction model and locally using the loop-specific method.33,34  If assumptions were met, we conducted a random-effects network meta-analysis using the network and network graphs packages in Stata (version 16; Stata Corp, College Station, TX), assuming a common within-network between-study variance (τ2). The efficacy of different interventions was ranked by using surface under the cumulative ranking (SUCRA) curve values.35  To assess reporting bias, we plotted a comparison-adjusted funnel plot.36  Sensitivity analyses were conducted, excluding outliers (defined as having a standardized mean difference 3 SDs above or below the mean) and studies with a high risk of bias. We also conducted meta-regressions for variables identified as potentially unevenly distributed on the basis of transitivity analyses.

In addition, we conducted a random-effects pairwise meta-analysis in Comprehensive Meta-Analysis using the subset of studies that had a control group. For three-arm studies in which 2 of the interventions were not equivalent, we used comparisons between the most intensive intervention group and control group. We assessed for statistical heterogeneity with the I2 statistic and explored it using mixed-effects model meta-regressions.28  We assessed for publication bias using the funnel plot and Egger’s test.28 

After screening 10 725 citations, 1637 articles were assessed for eligibility, and ultimately 119 articles in which researchers reported on data from 120 unique samples and 12 376 participants were included (Fig 1). A summary of the characteristics of these studies is reported in Table 1 (see Section 3 of the Supplemental Information for details). Most studies were conducted from 2010 to 2020 in North America or Europe; participants were predominantly white mothers with less than a university education. Most programs were delivered one-on-one by professionals, with a median duration of 8 sessions or 11.5 weeks. The majority of studies exhibited a low risk of bias in all categories except allocation concealment, for which 65% had an unclear risk (see Section 4 of the Supplemental Information for details).

FIGURE 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow used to identify studies for inclusion.

FIGURE 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow used to identify studies for inclusion.

Close modal
TABLE 1

Study and Participant Characteristics

All Studies (N = 120), No. (%)Pairwise Meta-analysis (n = 113), No. (%)
Study characteristics   
 Year of publication   
  1978–1989 7 (6) 6 (5) 
  1990–1999 13 (11) 13 (12) 
  2000–2009 21 (17) 19 (17) 
  2010–2020 79 (66) 75 (66) 
 Country region   
  North America 59 (49) 55 (48) 
  Latin America and the Caribbean 2 (2) 2 (2) 
  Europe and Central Asia 36 (30) 35 (31) 
  Sub-Saharan Africa 2 (2) 2 (2) 
  South Asia 3 (2) 2 (2) 
  East Asia and Pacific 18 (15) 17 (15) 
 Sample size   
  1–49 32 (27) 31 (28) 
  50–99 39 (32) 34 (30) 
  100–149 16 (13) 16 (14) 
  150–199 9 (8) 9 (8) 
  ≥200 24 (20) 23 (20) 
Participant characteristics   
 Mean age, y   
  <20 2 (2) 2 (2) 
  20–29 38 (32) 36 (32) 
  30–39 47 (39) 44 (39) 
  ≥40 7 (6) 7 (6) 
  Not reported 26 (21) 24 (21) 
 Mean education, y   
  <12 32 (27) 31 (27) 
  12–15.9 53 (44) 51 (45) 
  ≥16 14 (12) 12 (11) 
  Not reported 21 (17) 19 (17) 
 Caregiver   
  Mothers only 70 (58) 65 (58) 
  Fathers only 6 (5) 6 (5) 
  Mother or father 27 (22) 27 (24) 
  Mixed caregivers 9 (8) 7 (6) 
  Mothers and fathers (attend together) 8 (7) 8 (7) 
 Ethnicity   
  >50% white 52 (43) 46 (41) 
  >50% Black and/or African American 7 (6) 7 (6) 
  >50% Hispanic or Latino 3 (2) 3 (3) 
  >50% Asian 1 (1) 1 (1) 
  >50% Aboriginal or indigenous 1 (1) 1 (1) 
  Mixed ethnicities 15 (13) 15 (13) 
  Not reported 41 (34) 40 (35) 
All Studies (N = 120), No. (%)Pairwise Meta-analysis (n = 113), No. (%)
Study characteristics   
 Year of publication   
  1978–1989 7 (6) 6 (5) 
  1990–1999 13 (11) 13 (12) 
  2000–2009 21 (17) 19 (17) 
  2010–2020 79 (66) 75 (66) 
 Country region   
  North America 59 (49) 55 (48) 
  Latin America and the Caribbean 2 (2) 2 (2) 
  Europe and Central Asia 36 (30) 35 (31) 
  Sub-Saharan Africa 2 (2) 2 (2) 
  South Asia 3 (2) 2 (2) 
  East Asia and Pacific 18 (15) 17 (15) 
 Sample size   
  1–49 32 (27) 31 (28) 
  50–99 39 (32) 34 (30) 
  100–149 16 (13) 16 (14) 
  150–199 9 (8) 9 (8) 
  ≥200 24 (20) 23 (20) 
Participant characteristics   
 Mean age, y   
  <20 2 (2) 2 (2) 
  20–29 38 (32) 36 (32) 
  30–39 47 (39) 44 (39) 
  ≥40 7 (6) 7 (6) 
  Not reported 26 (21) 24 (21) 
 Mean education, y   
  <12 32 (27) 31 (27) 
  12–15.9 53 (44) 51 (45) 
  ≥16 14 (12) 12 (11) 
  Not reported 21 (17) 19 (17) 
 Caregiver   
  Mothers only 70 (58) 65 (58) 
  Fathers only 6 (5) 6 (5) 
  Mother or father 27 (22) 27 (24) 
  Mixed caregivers 9 (8) 7 (6) 
  Mothers and fathers (attend together) 8 (7) 8 (7) 
 Ethnicity   
  >50% white 52 (43) 46 (41) 
  >50% Black and/or African American 7 (6) 7 (6) 
  >50% Hispanic or Latino 3 (2) 3 (3) 
  >50% Asian 1 (1) 1 (1) 
  >50% Aboriginal or indigenous 1 (1) 1 (1) 
  Mixed ethnicities 15 (13) 15 (13) 
  Not reported 41 (34) 40 (35) 

A total of 16 different combinations of instructional methods or types of control groups were used across the studies, with 35 unique direct comparisons. The network geometry is presented in Fig 2. The most frequent type of intervention included all instructional methods, whereas there were relatively fewer studies of interventions in which researchers used only 1 or 2 approaches. The network meta-analysis revealed no evidence of inconsistency (global: χ2[19] = 11.03; P = .92; local: all P >.10) but some heterogeneity (τ2 = 0.21).

FIGURE 2

Network diagram. The size of the node corresponds to the number of studies which included that combination of instructional approaches; nodes for which there were direct comparisons are linked with a line, the width of which indicates the number of studies in which researchers tested that comparison.

FIGURE 2

Network diagram. The size of the node corresponds to the number of studies which included that combination of instructional approaches; nodes for which there were direct comparisons are linked with a line, the width of which indicates the number of studies in which researchers tested that comparison.

Close modal

Figure 3 displays the average effect size for each combination of instructional methods compared with the pure control groups with no training. In addition, a table listing the effect sizes for all possible two-way comparisons between different combinations of instructional methods is available in Section 5 of the Supplemental Information. Several combinations of instructional methods were significantly more effective than all 3 types of control groups, with medium to large effect sizes: programs using didactic, observation, rehearsal, and feedback (d = 0.89 to 1.07); all 5 instructional methods (d = 0.68 to 0.86); didactic, observation, and reflection (d = 0.63 to 0.81); didactic, observation, reflection, and rehearsal (d = 0.54 to 0.72); and didactic, reflection, rehearsal, and feedback (d = 0.31 to 0.50). Relative efficacy ranking, based on the SUCRA curves, revealed that the most effective types of program were those that included: (1) didactic, observation, rehearsal, and feedback, (2) all 5 instructional methods, and (3) didactic, observation, and reflection. The SUCRA values and curves are presented in Section 5 of the Supplemental Information.

FIGURE 3

Forest plot from network meta-analysis in which we compare each combination of instructional methods to control.

FIGURE 3

Forest plot from network meta-analysis in which we compare each combination of instructional methods to control.

Close modal

The comparison-adjusted funnel plot (Section 6 of the Supplemental Information) suggested that small studies may exaggerate the effectiveness of less intensive interventions, although this was heavily influenced by a few outliers. In the sensitivity analysis without outliers, we identified the same top 3 combinations of instructional methods. In the analysis with only low risk-of-bias studies, we identified programs using all instructional methods as being the most effective and those without feedback being more effective than those without reflection (see Section 6 of the Supplemental Information for details). Program rankings also differed somewhat in adjusted versus unadjusted models (see Section 6 of the Supplemental Information). Nonetheless, programs that used didactic instruction, observation, rehearsal, and feedback ranked in the top 3 in 83% of network meta-regression analyses, whereas those using all instructional methods were ranked in the top 3 75% of the time. In all scenarios, at least 1 of these types of programs was in the top 3 and in many cases both were in the top 3, underscoring the consistent superiority of programs using these combinations of instructional approaches.

The majority of studies included a control group (113 samples involving 11 693 parents) and could be included in the pairwise meta-analysis. A forest plot for these studies is available in Section 7 of the Supplemental Information. Overall, there was a medium effect size of 0.56 (95% confidence interval [CI]: 0.47 to 0.65; without 3 outliers: d = 0.52; 95% CI: 0.43 to 0.60).37  Although there was evidence of publication bias (Egger’s test P < .0001; funnel plot in Section 8 of the Supplemental Information), Duval and Tweedie’s trim-and-fill procedure did not suggest trimming any studies or produce a different estimate.

Because there was substantial heterogeneity (I2 = 79.52; P < .001), we tested whether this could be explained by study, program, or sample characteristics. The results of the meta-regressions are presented in Table 2. Programs that were more effective required participants to complete homework assignments (d = 0.84; 95% CI: 0.66 to 1.02, compared with d = 0.40; 95% CI: 0.31 to 0.49), were facilitated by a researcher (d = 0.89; 95% CI: 0.66 to 1.12), rather than a different type of facilitator (d ranging from 0.20 to 0.53), and were focused narrowly on teaching responsivity (d = 0.72; 95% CI: 0.57 to 0.87), rather than covering a broad range of unrelated topics (d = 0.40; 95% CI: 0.30 to 0.50). Programs in which the participants’ children were preschoolers were also more effective (d = 0.87; 95% CI: 0.64 to 1.10), on average, than those in which children were infants (d = 0.47; 95% CI: 0.34 to 0.60) or spanned multiple age categories (d = 0.31; 95% CI: 0.07 to 0.54). Finally, program effects were stronger for participants whose children did not have any physical impairments (d = 0.62; 95% CI: 0.51 to 0.73), compared with those whose children were physical impaired (d = 0.35; 95% CI: 0.21 to 0.49). In a multivariate regression considering all moderators that were significant in the individual analyses, only use of homework (P = .006), type of facilitator (P = .007), and program scope (P = .02) remained statistically significant predictors of program effect size. Together, these 3 variables explained 40% of the variance in program effectiveness. Sensitivity analyses were also conducted, replicating all meta-regressions without the 3 outlier studies (see Section 8 in the Supplemental Information). In these analyses, significant effects for child age and physical impairment were not replicated (P = .12 and .07, respectively). However, there was an effect of program length whereby programs that lasted >1 year were significantly less effective (d = 0.17; 95% CI: −0.008 to 0.35) than those taking place within 1 year (d ranging from 0.54 to 0.66).

TABLE 2

Meta-Regressions for Categorical and Continuous Moderators

kd or Ba95% CIPR2
Study characteristics      
 Year of publication 113 −0.01 −0.02 to 0.0005 .06 0.03 
 Type of publication      
  Journal 110 0.56 0.47 to 0.65 .95 0.00 
  Dissertation 0.61 0.20 to 1.02   
 Country income      
  High 103 0.56 0.46 to 0.65 .42 0.00 
  Upper middle 0.87 0.40 to 1.33   
  Lower middle 0.35 −0.21 to 0.91   
 Type of control group      
  Pure control 36 0.73 0.55 to 0.92 .14 0.07 
  Active control 36 0.49 0.37 to 0.61   
  Standard care 41 0.49 0.34 to 0.64   
 Risk of bias      
  High 33 0.60 0.46 to 0.75 .61 0.01 
  Low 80 0.55 0.44 to 0.66   
Program characteristics      
 No. sessions      
  <5 sessions 19 0.68 0.45 to 0.91 .69 0.03 
  5–10 sessions 54 0.52 0.39 to 0.64   
  11–20 sessions 21 0.61 0.38 to 0.84   
  >20 sessions 18 0.47 0.28 to 0.67   
 Program length      
  <1 mo 0.64 0.17 to 1.12 .06 0.11 
  1–5 mo 78 0.60 0.49 to 0.71   
  6 mo to 1 y 16 0.72 0.46 to 0.97   
  >1 y 0.17 −0.008 to 0.35   
 Program hours 88 0.003 −0.003 to 0.009 .32 0.01 
 Frequency (average sessions per wk) 109 0.10 −0.10 to 0.30 .31 0.01 
 Delivery format      
  In-person group 17 0.57 0.32 to 0.82 .71 0.03 
  In-person individual 72 0.60 0.49 to 0.71   
  In-person group and individual 20 0.48 0.26 to 0.70   
  Online and in person 0.23 −0.73 to 1.18   
  Online only 0.27 −0.32 to 0.86   
 Program facilitator      
  Professional 69 0.53 0.41 to 0.64 .009 0.16 
  Paraprofessional 0.30 0.14 to 0.46   
  Professional or paraprofessional 0.20 0.07 to 0.33   
  Researcher 28 0.89 0.66 to 1.12   
  No facilitator 0.27 −0.32 to 0.86   
 Program designer      
  Affiliated 78 0.58 0.48 to 0.68 .59 0.01 
  Unaffiliated 35 0.52 0.35 to 0.70   
 Scope of training      
  Narrow 58 0.72 0.57 to 0.87 .005 0.12 
  Broad 55 0.40 0.30 to 0.50   
 Home visits      
  Yes 72 0.51 0.41 to 0.61 .22 0.02 
  No 41 0.67 0.49 to 0.84   
 Percent home visits 110 −0.05 −0.27 to 0.16 .63 0.00 
 Video feedback      
  Yes 52 0.61 0.48 to 0.74 .43 0.02 
  No 61 0.52 0.40 to 0.63   
 Homework      
  Yes 43 0.84 0.66 to 1.02 .0001 0.18 
  No 70 0.40 0.31 to 0.49   
Sample characteristics      
 Parent age 89 0.02 −0.0002 to 0.04 .05 0.08 
 Parent education 93 0.006 −0.03 to 0.05 .76 0.01 
 Parent participant      
  Mothers only 65 0.50 0.39 to 0.60 .29 0.07 
  Fathers only 0.61 0.24 to 0.98   
  Mother or father 27 0.76 0.53 to 1.00   
  Mixed caregivers 0.31 0.09 to 0.54   
  Mothers and fathers 0.62 0.21 to 1.03   
 Low income      
  Yes 40 0.55 0.41 to 0.69 .89 0.00 
  No 69 0.57 0.45 to 0.68   
 Challenged parenting      
  Yes 61 0.58 0.46 to 0.71 .82 0.01 
  No 52 0.54 0.41 to 0.66   
 Child age category      
  Infant 42 0.47 0.34 to 0.60 .01 0.11 
  Toddler 30 0.52 0.38 to 0.66   
  Preschooler 31 0.87 0.64 to 1.10   
  Multiple categories 10 0.31 0.07 to 0.54   
 Child physical risk      
  Yes 20 0.35 0.21 to 0.49 .05 0.04 
  No 93 0.62 0.51 to 0.73   
 Child neurocognitive risk      
  Yes 48 0.57 0.43 to 0.71 .97 0.00 
  No 65 0.56 0.44 to 0.68   
Instructional methods      
 Didactic      
  Yes 103 0.58 0.48 to 0.67 .36 0.01 
  No 10 0.41 0.15 to 0.68   
 Observation      
  Yes 73 0.70 0.57 to 0.83 .001 0.10 
  No 40 0.33 0.23 to 0.43   
 Reflection      
  Yes 106 0.56 0.47 to 0.65 .78 0.00 
  No 0.63 0.24 to 1.02   
 Rehearsal      
  Yes 101 0.57 0.48 to 0.67 .50 0.01 
  No 12 0.49 0.15 to 0.84   
 Feedback      
  Yes 94 0.56 0.47 to 0.66 .99 0.00 
  No 19 0.57 0.34 to 0.81   
kd or Ba95% CIPR2
Study characteristics      
 Year of publication 113 −0.01 −0.02 to 0.0005 .06 0.03 
 Type of publication      
  Journal 110 0.56 0.47 to 0.65 .95 0.00 
  Dissertation 0.61 0.20 to 1.02   
 Country income      
  High 103 0.56 0.46 to 0.65 .42 0.00 
  Upper middle 0.87 0.40 to 1.33   
  Lower middle 0.35 −0.21 to 0.91   
 Type of control group      
  Pure control 36 0.73 0.55 to 0.92 .14 0.07 
  Active control 36 0.49 0.37 to 0.61   
  Standard care 41 0.49 0.34 to 0.64   
 Risk of bias      
  High 33 0.60 0.46 to 0.75 .61 0.01 
  Low 80 0.55 0.44 to 0.66   
Program characteristics      
 No. sessions      
  <5 sessions 19 0.68 0.45 to 0.91 .69 0.03 
  5–10 sessions 54 0.52 0.39 to 0.64   
  11–20 sessions 21 0.61 0.38 to 0.84   
  >20 sessions 18 0.47 0.28 to 0.67   
 Program length      
  <1 mo 0.64 0.17 to 1.12 .06 0.11 
  1–5 mo 78 0.60 0.49 to 0.71   
  6 mo to 1 y 16 0.72 0.46 to 0.97   
  >1 y 0.17 −0.008 to 0.35   
 Program hours 88 0.003 −0.003 to 0.009 .32 0.01 
 Frequency (average sessions per wk) 109 0.10 −0.10 to 0.30 .31 0.01 
 Delivery format      
  In-person group 17 0.57 0.32 to 0.82 .71 0.03 
  In-person individual 72 0.60 0.49 to 0.71   
  In-person group and individual 20 0.48 0.26 to 0.70   
  Online and in person 0.23 −0.73 to 1.18   
  Online only 0.27 −0.32 to 0.86   
 Program facilitator      
  Professional 69 0.53 0.41 to 0.64 .009 0.16 
  Paraprofessional 0.30 0.14 to 0.46   
  Professional or paraprofessional 0.20 0.07 to 0.33   
  Researcher 28 0.89 0.66 to 1.12   
  No facilitator 0.27 −0.32 to 0.86   
 Program designer      
  Affiliated 78 0.58 0.48 to 0.68 .59 0.01 
  Unaffiliated 35 0.52 0.35 to 0.70   
 Scope of training      
  Narrow 58 0.72 0.57 to 0.87 .005 0.12 
  Broad 55 0.40 0.30 to 0.50   
 Home visits      
  Yes 72 0.51 0.41 to 0.61 .22 0.02 
  No 41 0.67 0.49 to 0.84   
 Percent home visits 110 −0.05 −0.27 to 0.16 .63 0.00 
 Video feedback      
  Yes 52 0.61 0.48 to 0.74 .43 0.02 
  No 61 0.52 0.40 to 0.63   
 Homework      
  Yes 43 0.84 0.66 to 1.02 .0001 0.18 
  No 70 0.40 0.31 to 0.49   
Sample characteristics      
 Parent age 89 0.02 −0.0002 to 0.04 .05 0.08 
 Parent education 93 0.006 −0.03 to 0.05 .76 0.01 
 Parent participant      
  Mothers only 65 0.50 0.39 to 0.60 .29 0.07 
  Fathers only 0.61 0.24 to 0.98   
  Mother or father 27 0.76 0.53 to 1.00   
  Mixed caregivers 0.31 0.09 to 0.54   
  Mothers and fathers 0.62 0.21 to 1.03   
 Low income      
  Yes 40 0.55 0.41 to 0.69 .89 0.00 
  No 69 0.57 0.45 to 0.68   
 Challenged parenting      
  Yes 61 0.58 0.46 to 0.71 .82 0.01 
  No 52 0.54 0.41 to 0.66   
 Child age category      
  Infant 42 0.47 0.34 to 0.60 .01 0.11 
  Toddler 30 0.52 0.38 to 0.66   
  Preschooler 31 0.87 0.64 to 1.10   
  Multiple categories 10 0.31 0.07 to 0.54   
 Child physical risk      
  Yes 20 0.35 0.21 to 0.49 .05 0.04 
  No 93 0.62 0.51 to 0.73   
 Child neurocognitive risk      
  Yes 48 0.57 0.43 to 0.71 .97 0.00 
  No 65 0.56 0.44 to 0.68   
Instructional methods      
 Didactic      
  Yes 103 0.58 0.48 to 0.67 .36 0.01 
  No 10 0.41 0.15 to 0.68   
 Observation      
  Yes 73 0.70 0.57 to 0.83 .001 0.10 
  No 40 0.33 0.23 to 0.43   
 Reflection      
  Yes 106 0.56 0.47 to 0.65 .78 0.00 
  No 0.63 0.24 to 1.02   
 Rehearsal      
  Yes 101 0.57 0.48 to 0.67 .50 0.01 
  No 12 0.49 0.15 to 0.84   
 Feedback      
  Yes 94 0.56 0.47 to 0.66 .99 0.00 
  No 19 0.57 0.34 to 0.81   
a

Column contains d (effect size) for categorical variables and B (unstandardized regression coefficient) for continuous variables.

We also used meta-regression to test for the effects of individual instructional methods. Programs that had caregivers observe examples of responsivity were significantly more effective (d = 0.70; 95% CI: 0.57 to 0.83), on average, than those that did not (d = 0.33; 95% CI: 0.23 to 0.43), even when outlier studies were removed. Programs were not consistently more or less effective whether they included didactic, reflection, rehearsal, or feedback methods.

Enhancing parental responsivity is one way our society can invest in improving the mental and physical well-being of future generations and reduce intergenerational inequalities.13  The current review was conducted to help maximize the efficiency of efforts to enhance parental responsivity at a population level. The 3 most effective types of programs identified in the network meta-analysis all included didactic and observation activities, with rehearsal and feedback being an added feature of the single most effective category of programs. Across all sensitivity analyses, programs using all 5 instructional methods (didactic, observation, reflection, rehearsal, and feedback) were consistently identified as the most or second most effective and had a large effect size. Taken together, these results suggest that it is valuable to include all instructional methods when designing training programs; however, in cases in which reflection, rehearsal, or feedback are not feasible or too costly omitting one or a few of these formats should not come at a high cost to parent learning. In contrast, observational teaching should not be omitted, as the results of the pairwise meta-analysis suggest that one of the best ways for parents to learn how to be responsive caregivers is to observe good examples of responsive caregiving, an important and novel finding for program designers.

The pairwise meta-analysis also revealed 3 aspects of program design robustly associated with above-average effect sizes. First, in line with previous research,16  programs in which parents were only instructed on responsive caregiving were more effective than those with a broader scope. This replicated finding underscores the principle that when programs try to do everything, it is hard to do any one thing well. Second, programs in which homework was assigned to participants were significantly more effective than those in which it was not. Indeed, in the present meta-analysis, this aspect of program design individually accounted for 18% of the variance in program effectiveness. Although there is evidence to suggest that participants who complete more homework in parent training programs demonstrate stronger outcomes,38  existing research has not compared the effectiveness of programs that do or do not assign homework. The current findings imply that including such assignments is valuable and that responsivity is a skill that is best learnt through consistent practice. Finally, the programs that were most effective tended to be facilitated by researchers. Past reviews have found mixed evidence about the relative effectiveness of professionals, compared with nonprofessionals, as parenting program facilitators; however, these reviews have all categorized researchers as professionals.16,39  The current results suggest that researchers may be particularly effective at supporting parents to be more responsive. Although there is little literature to draw on in explaining this result, we speculate this distinction could be attributable to researchers’ enhanced understanding of key concepts related to responsivity and ability to convey a narrow, focused message regarding a program’s learning goals, thereby "teaching to the test."

This review also highlighted several program characteristics that do not enhance the learning of responsivity among parents without elevated levels of psychopathology: program dosage (sessions, hours, weeks, and frequency), individual versus group delivery, and the use of home visits or video feedback. In other words, long, individualized training programs may not help parents become substantially more responsive than short, group programs do. Furthermore, results suggest responsivity training programs benefit parents similarly, regardless of their level of education and income. Although these findings are in contrast with our hypotheses and findings from some older reviews,16,39,40  because researchers in these previous reviews assessed the effects of programs with different foci (eg, attachment or disruptive behavior) and included quasi-experimental studies as well as studies with clinical populations (eg, depressed mothers), direct comparisons cannot be made. Furthermore, these past meta-analyses are not representative of the abundance of research that has emerged since 2005. Indeed, in a more recent review, researchers have called into question the notion that parent and child characteristics moderate the effects of parenting programs.41  On the basis of the current results, we conclude that, although one-on-one training using personalized video feedback or home visits undoubtedly helps parents learn to be more responsive, these labor-intensive programs may not be absolutely necessary to elevate responsivity among parents in the general population. That being said, there may be other benefits of individualized programs that were not captured in the current analysis; for example, such programs may be necessary to reach certain parents.42 

In this study, we used network meta-analysis to compare the relative effectiveness of various combinations of instructional methods in parent responsivity training programs. We summarized evidence from >100 randomized trials, the majority of which had a low risk of bias, and results were robust to the removal of outliers and studies with a high risk of bias. However, all research was published in English and conducted predominantly with white mothers in Western populations; as there was also evidence of publication bias, our search could have been strengthened through greater identification of international and unpublished data sources. Nonetheless, because the gap in research on interventions for fathers and caregivers from diverse backgrounds was also identified in a different recent review,43  this limitation was likely not a consequence of our search strategy or inclusion criteria and, moving forward, the academic community will need to conduct more research with fathers and ethnically diverse populations for us to better optimize program design for these populations. It is also important to note that our results are only applicable to parents without identified levels of psychopathology and this criterion led to the exclusion of some more vulnerable and diverse samples. Given that responsivity inversely correlates with psychopathology in parents,24  it will be important for researchers in future studies to establish whether optimal instructional methods differ for clinical and general populations. Finally, we also identified a gap in the reporting of details of program design and delivery as well as a shortage of follow-up data. Increasing the availability of such data would allow us to assess the relative efficacy of different instructional methods on the basis of the amount of time spent using each method and determine if certain instructional methods more effectively support sustained behavior change, compared with short-term learning.

The current findings provide several actionable recommendations for health professionals, policy makers, and program developers looking to improve children’s developmental trajectories. Specifically, results suggest that using expensive methods, such as year-long individual home visits, to help parents in the general population be more responsive toward their young children is not necessary. Developing short, focused programs led by researchers that include opportunities for parents to learn about and observe examples of responsive parent-child interactions with some take-home assignments between sessions could be just as or even more effective. Such recommendations may be applied to improving population health across low-, middle-, and high-income countries, during routine encounters with health professionals or using low-resource group programs implemented in health care or community settings. These results may make it easier to support more parents without using more resources, ultimately improving the lives of more children.

Dr Sokolovic conceptualized and designed the study, coordinated and supervised data extraction, conducted the statistical analyses, drafted the initial manuscript, and reviewed and revised the manuscript; Dr Rodrigues conceptualized and designed the study, selected the articles and extracted the data, and reviewed and revised the manuscript; Dr Tricco conceptualized and designed the study and reviewed and revised the manuscript; Ms Dobrina selected the articles, extracted the data, and reviewed and revised the manuscript; Dr Jenkins conceptualized and designed the study, conducted the statistical analyses, and critically reviewed the manuscript for important intellectual content; and all authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.

FUNDING: Ms Sokolovic was supported by the Social Sciences and Humanities Research Council of Canada (grant 767-2018-2577) to undertake this review. Dr Rodrigues is also supported in part by the Social Sciences and Humanities Research Council of Canada. Dr Tricco is supported by a Tier 2 Canada Research Chair in Knowledge Synthesis. Dr Jenkins is supported by the Atkinson Foundation.

CI

confidence interval

SUCRA

surface under the cumulative ranking

1
Black
MM
,
Walker
SP
,
Fernald
LCH
, et al;
Lancet Early Childhood Development Series Steering Committee
.
Early childhood development coming of age: science through the life course
.
Lancet
.
2017
;
389
(
10064
):
77
90
2
Houtepen
LC
,
Heron
J
,
Suderman
MJ
,
Fraser
A
,
Chittleborough
CR
,
Howe
LD
.
Associations of adverse childhood experiences with educational attainment and adolescent health and the role of family and socioeconomic factors: a prospective cohort study in the UK
.
PLoS Med
.
2020
;
17
(
3
):
e1003031
3
Raby
KL
,
Roisman
GI
,
Labella
MH
,
Martin
J
,
Fraley
RC
,
Simpson
JA
.
The legacy of early abuse and neglect for social and academic competence from childhood to adulthood
.
Child Dev
.
2019
;
90
(
5
):
1684
1701
4
Hughes
K
,
Bellis
MA
,
Hardcastle
KA
, et al
.
The effect of multiple adverse childhood experiences on health: a systematic review and meta-analysis
.
Lancet Public Health
.
2017
;
2
(
8
):
e356
e366
5
Eshel
N
,
Daelmans
B
,
de Mello
MC
,
Martines
J
.
Responsive parenting: interventions and outcomes
.
Bull World Health Organ
.
2006
;
84
(
12
):
991
998
6
Ainsworth
MDS
,
Blehar
MC
,
Waters
E
,
Wall
SN
.
Patterns of Attachment: A Psychological Study of the Strange Situation
.
New York, NY
:
Lawrence Erlbaum
;
1978
7
Vygotsky
LS
.
Mind in Society: The Development of Higher Psychological Processes
.
Cambridge, MA
:
Harvard University Press
;
1980
8
Kok
R
,
Thijssen
S
,
Bakermans-Kranenburg
MJ
, et al
.
Normal variation in early parental sensitivity predicts child structural brain development
.
J Am Acad Child Adolesc Psychiatry
.
2015
;
54
(
10
):
824
831.e1
9
Tan
PZ
,
Oppenheimer
CW
,
Ladouceur
CD
,
Butterfield
RD
,
Silk
JS
.
A review of associations between parental emotion socialization behaviors and the neural substrates of emotional reactivity and regulation in youth
.
Dev Psychol
.
2020
;
56
(
3
):
516
527
10
Romeo
RR
,
Leonard
JA
,
Robinson
ST
, et al
.
Beyond the 30-million-word gap: children’s conversational exposure is associated with language-related brain function
.
Psychol Sci
.
2018
;
29
(
5
):
700
710
11
Shonkoff
JP
.
Protecting brains, not simply stimulating minds
.
Science
.
2011
;
333
(
6045
):
982
983
12
Kerr
KL
,
Ratliff
EL
,
Cosgrove
KT
,
Bodurka
J
,
Morris
AS
,
Kyle Simmons
W
.
Parental influences on neural mechanisms underlying emotion regulation
.
Trends Neurosci Educ
.
2019
;
16
:
100118
13
Daelmans
B
,
Darmstadt
GL
,
Lombardi
J
, et al;
Lancet Early Childhood Development Series Steering Committee
.
Early childhood development: the foundation of sustainable development
.
Lancet
.
2017
;
389
(
10064
):
9
11
14
Milgrom
J
,
Newnham
C
,
Anderson
PJ
, et al
.
Early sensitivity training for parents of preterm infants: impact on the developing brain
.
Pediatr Res
.
2010
;
67
(
3
):
330
335
15
Obradović
J
,
Yousafzai
AK
,
Finch
JE
,
Rasheed
MA
.
Maternal scaffolding and home stimulation: key mediators of early intervention effects on children’s cognitive development
.
Dev Psychol
.
2016
;
52
(
9
):
1409
1421
16
Bakermans-Kranenburg
MJ
,
van IJzendoorn
MH
,
Juffer
F
.
Less is more: meta-analyses of sensitivity and attachment interventions in early childhood
.
Psychol Bull
.
2003
;
129
(
2
):
195
215
17
Jeong
J
,
Pitchik
HO
,
Yousafazai
AK
.
Stimulation interventions and parenting in low- and middle-income countries: a meta-analysis
.
Pediatrics
.
2018
;
141
(
4
):
e20173510
18
Britto
PR
,
Singh
M
,
Dua
T
,
Kaur
R
,
Yousafzai
AK
.
What implementation evidence matters: scaling-up nurturing interventions that promote early childhood development
.
Ann N Y Acad Sci
.
2018
;
1419
(
1
):
5
16
19
Trivette
CM
,
Dunst
CJ
,
Hamby
DW
,
O’Herin
CE
.
Characteristics and Consequences of Adult Learning Methods and Strategies
.
Morganton, NC
:
Winterberry Press
;
2009
20
Hattie
JAC
.
Visible Learning: A Synthesis of Over 800 Meta-Analyses Relating to Achievement
.
London, United Kingdom
:
Routledge
;
2008
21
Cipriani
A
,
Higgins
JPT
,
Geddes
JR
,
Salanti
G
.
Conceptual and technical challenges in network meta-analysis
.
Ann Intern Med
.
2013
;
159
(
2
):
130
137
22
McGowan
J
,
Sampson
M
,
Salzwedel
DM
,
Cogo
E
,
Foerster
V
,
Lefebvre
C
.
PRESS peer review of electronic search strategies: 2015 guideline statement
.
J Clin Epidemiol
.
2016
;
75
:
40
46
23
Substance Abuse and Mental Health Services Administration
.
Key Substance Use and Mental Health Indicators in the United States: Results From the 2019 National Survey on Drug Use and Health
.
Rockville, MD
:
Center for Behavioral Health Statistics and Quality
;
2020
24
Bernard
K
,
Nissim
G
,
Vaccaro
S
,
Harris
JL
,
Lindhiem
O
.
Association between maternal depression and maternal sensitivity from birth to 12 months: a meta-analysis
.
Attach Hum Dev
.
2018
;
20
(
6
):
578
599
25
Christodoulou
J
,
Rotheram-Borus
MJ
,
Bradley
AK
,
Tomlinson
M
.
Home visiting and antenatal depression affect the quality of mother and child interactions in South Africa
.
J Am Acad Child Adolesc Psychiatry
.
2019
;
58
(
12
):
1165
1174
26
Watt
J
,
Goodarzi
Z
,
Tricco
AC
,
Veroniki
AA
,
Straus
SE
.
Comparative safety and efficacy of pharmacological and non-pharmacological interventions for the behavioral and psychological symptoms of dementia: protocol for a systematic review and network meta-analysis
.
Syst Rev
.
2017
;
6
(
1
):
182
27
Caldwell
DM
,
Davies
SR
,
Hetrick
SE
, et al
.
School-based interventions to prevent anxiety and depression in children and young people: a systematic review and network meta-analysis [published correction appears in Lancet Psychiatry. 2020;7(9):e59]
.
Lancet Psychiatry
.
2019
;
6
(
12
):
1011
1020
28
Higgins
JPT
,
Thomas
J
,
Chander
J
, eds., et al
.
Cochrane Handbook for Systematic Reviews of Interventions
, 2nd ed.
Chichester, United Kingdom
:
John Wiley & Sons
;
2019
29
Bennetts
SK
,
Mensah
FK
,
Green
J
,
Hackworth
NJ
,
Westrupp
EM
,
Reilly
S
.
Mothers’ experiences of parent-reported and video-recorded observational assessments
.
J Child Fam Stud
.
2017
;
26
(
12
):
3312
3326
30
Aspland
H
,
Gardner
F
.
Observational measures of parent-child interaction: an introductory review
.
Child Adolesc Ment Health
.
2003
;
8
(
3
):
136
143
31
Zaslow
MJ
,
Weinfield
NS
,
Gallagher
M
, et al
.
Longitudinal prediction of child outcomes from differing measures of parenting in a low-income sample
.
Dev Psychol
.
2006
;
42
(
1
):
27
37
32
Borenstein
M
,
Hedges
LV
,
Higgins
JPT
,
Rothstein
HR
.
Multiple Outcomes or Time-Points within a Study
. In:
Introduction to Meta-Analysis
.
Chichester, United Kingdom
:
John Wiley & Sons
;
2011
:
225
238
33
Veroniki
AA
,
Vasiliadis
HS
,
Higgins
JPT
,
Salanti
G
.
Evaluation of inconsistency in networks of interventions [published correction appears in Int J Epidemiol. 2013;42(3):919]
.
Int J Epidemiol
.
2013
;
42
(
1
):
332
345
34
White
IR
,
Barrett
JK
,
Jackson
D
,
Higgins
JPT
.
Consistency and inconsistency in network meta-analysis: model estimation using multivariate meta-regression
.
Res Synth Methods
.
2012
;
3
(
2
):
111
125
35
Salanti
G
,
Ades
AE
,
Ioannidis
JPA
.
Graphical methods and numerical summaries for presenting results from multiple-treatment meta-analysis: an overview and tutorial
.
J Clin Epidemiol
.
2011
;
64
(
2
):
163
171
36
Chaimani
A
,
Higgins
JPT
,
Mavridis
D
,
Spyridonos
P
,
Salanti
G
.
Graphical tools for network meta-analysis in STATA
.
PLoS One
.
2013
;
8
(
10
):
e76654
37
Cohen
J
.
Statistical Power Analysis for the Behavioral Sciences
.
New York, NY
:
Academic Press
;
2013
38
Ros
R
,
Hernandez
J
,
Graziano
PA
,
Bagner
DM
.
Parent training for children with or at risk for developmental delay: the role of parental homework completion
.
Behav Ther
.
2016
;
47
(
1
):
1
13
39
Sweet
MA
,
Appelbaum
MI
.
Is home visiting an effective strategy? A meta-analytic review of home visiting programs for families with young children
.
Child Dev
.
2004
;
75
(
5
):
1435
1456
40
Lundahl
B
,
Risser
HJ
,
Lovejoy
MC
.
A meta-analysis of parent training: moderators and follow-up effects
.
Clin Psychol Rev
.
2006
;
26
(
1
):
86
104
41
Leijten
P
,
Raaijmakers
M
,
Wijngaards
L
, et al
.
Understanding who benefits from parenting interventions for children’s conduct problems: an integrative data analysis
.
Prev Sci
.
2018
;
19
(
4
):
579
588
42
Mytton
J
,
Ingram
J
,
Manns
S
,
Thomas
J
.
Facilitators and barriers to engagement in parenting programs: a qualitative systematic review
.
Health Educ Behav
.
2014
;
41
(
2
):
127
137
43
Bergström
M
,
Gebreslassie
M
,
Hedqvist
M
,
Lindberg
L
,
Sarkadi
A
,
Hjern
A
.
Narrative review of interventions suitable for well-baby clinics to promote infant attachment security and parents’ sensitivity
.
Acta Paediatr
.
2020
;
109
(
9
):
1745
1757

Competing Interests

POTENTIAL CONFLICT OF INTEREST: Ms Sokolovic is the author of a parenting program (Support, not Perfection) and receives no royalties for the program; the other authors have indicated they have no potential conflicts of interest to disclose.

FINANCIAL DISCLOSURE: The authors have indicated they have no financial relationships relevant to this article to disclose.