← → Space · A=Appendix · G=Go to

Norms Behind Closed Doors:
Misperceptions and Maternal Employment in Couples

A Field Experiment in Bogotá, Colombia
Marie Boltz  ·  Monserrat Bustelo  ·  Ana María Díaz  ·  Agustina Suaya
U. Strasbourg / BETA  ·  IADB  ·  Pontificia Universidad Javeriana  ·  IADB
Seminar · Javeriana, April 29, 2026 Pre-registered · AEARCTR-0014648  ·  Under Review EDCC Sample · 1,732 couples · Bogotá
Section I

Research Agenda

Where this paper comes from — and why it is the natural next step
Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube
Research Agenda · Paper I

What Is the Price of Freedom?

Bustelo, Díaz, Lafortune, Piras, Salas, Tessada — EDCC 2023 (published)

Estimating Women's Willingness to Pay for Job Schedule Flexibility

Discrete choice experiment with ~1,500 women in Bogotá. Reveals preferences for flexible schedules vs. part-time trade-offs.
Main Finding
Women sacrifice 15–20% of offered wages for full-time contracts with flexible schedules
Key Mechanism
Preference for schedule flexibility (when to work) — not for going part-time
Open Question
Are WTP patterns gendered? Do they operate within couples? Does context travel across countries?
Research Agenda · Paper II

Do Women Value Location Flexibility More Than Men?

Díaz, Salas, Piras, Suaya — Working Paper 2024

Gender Disparities in Valuing Remote and Hybrid Work in Latin America

DCE with ~4,785 workers across 5 Latin American countries (AR, CL, CO, MX, PE). Male-dominated sectors: manufacturing (operations supervisor) and IT (engineer / software developer).
Women's WTP
of wages sacrificed for hybrid (80% remote)  /  fully remote
Men's WTP
of wages for hybrid; no willingness to trade pay for fully remote

What's missing: Individual choices don't reveal intra-household dynamics. Does the gender gap reflect preferences, or the fact that women anticipate they will absorb the household's need for flexibility?

Research Agenda · Paper III

Who Pays for the Partner's Flexibility?

Boltz, Díaz, Salas — Working Paper 2026

Gender Norms and WTP for Own vs. Partner's Job Flexibility — DCE with Couples in Bogotá

DCE eliciting each spouse's WTP for their own flexible job and for their partner's flexible job. N ≈ 450 couples.
WTP for Own Flexibility
Wives: 16.6% of wages
Husbands: 4.2% of wages
Wives value their own flexibility far more
WTP for Partner's Flexibility
Wives (for husband): 3.9%
Husbands (for wife): 21.8%
Husbands pay the most — to have a flexible wife

Critical finding: Only "support for mothers working outside the home" moderates the gender gap in WTP. Gender norms — not just preferences — shape how couples allocate labor market investments. But we can't change those norms with a DCE.

Research Agenda · Interactive

Your Estimate: Pluralistic Ignorance

Scan to Vote

Use your phone to vote on AhaSlides

AhaSlides QR Code

Or visit: https://ahaslides.com/6NRKR

The Reality

In our baseline sample in Bogotá:

89%
of fathers support mothers working outside the home

Yet most think others are far more conservative.
This gap between private views and perceived norms is pluralistic ignorance.

Research Agenda · The Gap

The Open Question: Why Do Norms Persist?

  • The puzzle: In Bogotá, 89% of fathers privately support mothers working outside the home — yet female labor force participation remains 20 pp below men's.
  • One candidate mechanism: People think others are more conservative than they actually are. This pluralistic ignorance creates a social-permission friction — even when everyone privately agrees, no one acts because they think no one else does.
  • Evidence from 60 countries (Bursztyn et al. 2023): Systematic underestimation of support for women working, especially of men's support. The gap is 15–30 pp on average.
  • This paper asks: In a couple-based RCT, can correcting these misperceptions change intra-household decisions and women's labor supply?
Innovation 1
Representative sample of couples — measure mutual misperceptions within households
Innovation 2
Zero-sum revealed-preference allocation: course slot for wife or husband
Innovation 3
Norm delivered is estimated from the exact same population — representative reference group
Research Agenda · Global Context

Pluralistic Ignorance Is a Global Phenomenon

  • Bursztyn, Cappelen, Tungodden, Voena, Yanagizawa-Drott (2023): 60-country study on support for women working outside the home
  • In nearly every country, both men and women underestimate how much men support women working — gaps of 15–30 pp on average
  • The gap is largest precisely in countries where men's private support is high — a signature of pluralistic ignorance
  • Latin America: relatively high private support, yet substantial misperceptions
Global evidence on pluralistic ignorance
Bursztyn et al. (2023): Distribution of gaps between actual and perceived support for women working, 60 countries
Research Agenda · Latin America

Latin America: High Support, High Misperception

Regional evidence LAC
Bursztyn et al. (2023): LAC highlighted — misperceptions persist even where private support for women working is high
Research Agenda · Literature

Correcting Misperceptions: What Prior Experiments Show

  • Saudi Arabia (Bursztyn et al. 2020, AER): Informing men of true peer support for women working → men become more willing to let wives search for jobs; wives increase applications and interviews
  • Indonesia (Cameron et al. 2026): Sharing community norm data with 4,000+ respondents → 25% more likely to pick career course instead of equal-value voucher
  • Paraguay (Laszlo et al. 2025): Norm-shifting intervention in lab-in-the-field experiment → stronger beliefs in equitable household labor division

What is missing in the literature: All prior studies observe only one partner — cannot measure mutual misperceptions or study how information corrects within-couple belief gaps. Course choices and lab outcomes are often hypothetical. None use a probability-sampled population as both respondents and norm-source.

Research Agenda · Our Paper

Paper in a Nutshell

Setting
1,732 cohabiting couples with young children in Bogotá, Colombia. Randomized WhatsApp + phone information intervention.
  • Fact: 89% of fathers privately support mothers working outside the home — yet estimate only 61% of other men agree. A 28 percentage-point misperception.
  • Intervention: Personalized feedback on actual community support, delivered via WhatsApp chatbot and phone follow-up.
  • Finding 1 — Beliefs: Treated individuals correct community beliefs by 3–5 pp; perceived spousal support rises by 6 pp.
  • Finding 2 — Decisions: Treated men are 9 pp more likely (+23%) to nominate their wife for a career-development course.
  • Finding 3 — Labor: Treated women report more intensive job search (+10 pp); treated men value work–family balance more (+11 pp).
  • Limit: Effects concentrated among labor-market attached women; inactive women respond little.
Research Agenda · Research Questions

Three Research Questions

RQ 3 · Decisions & Labor
Do belief updates affect (a) intra-household allocation of a career-building course, and (b) short-run job search and labor market aspirations?
Course · Sample 2Labor · Sample 2∩3
Section II

Experimental Design

1,732 couples · Bogotá · WhatsApp + phone · Three survey waves
Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube
Experimental Design · Framework

Theory of Change

Randomized Information Treatment
Personalized feedback on community support for maternal employment (WhatsApp + phone)
Update of community 2nd-order beliefs
(men's & women's support)
⟵ ⟶
Update of spousal 2nd-order beliefs
(perceived partner support)
Intra-household decisions
Career course allocation
(wife or husband?)
+
Labor market outcomes
Job search, aspirations, work–family balance
Experimental Design · Sample

The Sample: 1,732 Cohabiting Couples in Bogotá

Eligibility
≥1 child
Under 6 years of age — stage when gender gaps in LFP widen most sharply
City
Bogotá
Women's LFP ≈ 64%; above LAC average (~52%), among highest of major cities
Design
1:1
Randomization at couple level; stratified by wife's LFP status and husband's first-order belief
  • Representative sample of households with ≥1 child under 6 in Bogotá — guarantees norm accuracy for the reference group
  • Both partners individually surveyed (in-person or phone, July–September 2024)
  • Household income: 28% low · 60% middle · 12% high — mirrors city distribution
  • Three survey waves: Baseline (Jul–Sep 2024) → Midline/WhatsApp (Oct 2024) → Endline by phone (Nov 2024–Jan 2025)
Experimental Design · Stage 1

Stage 1 — Baseline Survey & Belief Elicitation

Stage 1 Baseline
  • First-order belief: do you agree mothers should be free to work?
  • Second-order beliefs: community-level estimates (how many fathers/mothers agree?)
  • Spousal beliefs: what do you think your partner believes?
Experimental Design · Stage 2

Stage 2 — Randomization at the Couple Level

Stage 2 Randomization
  • Unit: couple (both partners receive the same arm)
  • Stratification: wife's labor status, husband's first-order support, presence of children <6
  • Treatment: WhatsApp chatbot with actual Bogotá-level support; Control: placebo on green transport
Experimental Design · Two Arms

What Each Arm Saw — One Norm, Two Topics

Treatment (866 couples)

"Mothers of children under six should be free to work for pay outside the home."

Gender-norm statement (target).

Control (866 couples)

Same chatbot, same four steps, same schedule. The only difference is the norm each arm sees. Any T–C gap in downstream behavior is attributable to corrected beliefs about gender norms.

Experimental Design · Treatment Chatbot

Treatment Arm — Personalized Feedback on the Gender Norm

N
Norms Studyonline
Hello 👋 In the baseline survey you answered a question about the statement:

"Mothers of children under six should be free to work for pay outside the home."10:02
You estimated that out of 100 fathers in Bogotá, 60 agree with this statement.10:02
Do you think your estimate matches the true share in Bogotá?10:02
Yes  •  No  •  Not sure
No10:03 ✓✓
Actual share in Bogotá (baseline data):
Fathers
89 / 100
Mothers
91 / 100
In fact, 89 out of 100 fathers and 91 out of 100 mothers in Bogotá agree with the statement.10:03
How does this information feel to you?10:03
Interesting  •  Irrelevant  •  Disappointing
Treatment arm — gender norm
The four steps (as in the paper)
  • Step 1 — Recall: shows the respondent's own baseline estimate of fathers' / mothers' agreement
  • Step 2 — Check: asks whether that estimate matches reality
  • Step 3 — Reveal: shows the actual share computed from the baseline (as a WhatsApp message with numbers and emojis)
  • Step 4 — Rate: asks the respondent to rate the discrepancy — interesting / irrelevant / disappointing

The same four steps are repeated for beliefs about men's and women's support, with order randomized.

Experimental Design · Treatment — The Reveal

What Treated Respondents Saw at Step 3 — The Figure

Share of fathers in Bogotá who agree with the statement

"Mothers of children under six should be free to work for pay outside the home."
Your estimate(from the baseline survey)
60%
60 / 100
Actual share(measured in Bogotá)
89%
89 / 100
+29 pp  — fathers in Bogotá support working mothers much more than respondents believe
Experimental Design · Control Chatbot

Control Arm — Same Structure, Placebo Norm

N
Norms Studyonline
Hello 👋 In the baseline survey you answered a question about the statement:

"Companies should subsidize public transport."10:02
You estimated that out of 100 people in Bogotá, 75 agree with this statement.10:02
Do you think your estimate matches the true share in Bogotá?10:02
Yes  •  No  •  Not sure
Not sure10:03 ✓✓
Actual share in Bogotá (baseline data):
Men
94 / 100
Women
95 / 100
In fact, 94 out of 100 men and 95 out of 100 women in Bogotá agree.10:03
How does this information feel to you?10:03
Interesting  •  Irrelevant  •  Disappointing
Control arm — placebo norm
Why this placebo
  • Same channel (WhatsApp chatbot), same format, same four-step sequence, same schedule
  • Unrelated topic: attitudes toward corporate subsidies for public transport
  • Shares the belief-elicitation mechanics without addressing gender norms
Experimental Design · Stage 3

Stage 3 — Midline: WhatsApp Engagement

Stage 3 Midline
  • Delivery: WhatsApp chatbot with 4 interactive steps (Sep–Oct 2024)
  • Engagement: only ~29% of couples completed the interaction (501 of 1,732 T; 518 of 1,732 C)
  • Note: low engagement suggests digital interventions face uptake barriers in this population
Experimental Design · Stage 4 · Follow-up

After the Chatbot — Course Decision & Endline

Follow-up timeline
Course nomination (end of WhatsApp)
  • One real online career course per household — keep it or give to partner
  • Zero-sum choice with direct personal cost — revealed preference
Endline phone survey (1–2 months later)
  • 1,382 of 3,464 reached (≈40%)
  • Beliefs, job search, work–family balance, aspirations
  • Treatment info re-administered; measured before reinforcement
Section III

Baseline Facts

Near-universal private support — and a 28 pp misperception
Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube
Baseline Facts · Sample

Individual Attributes — Large Gender Gap in LFP

Variable Husbands Wives Δ
Demographics
Age (years)34.932.02.8***
Education
Low14.3%10.8%3.5***
Medium69.5%71.1%−1.6
High16.2%18.1%−2.0
Employment Status
Employed90.5%52.0%38.5***
Unemployed5.0%6.3%−1.3
Inactive4.5%41.7%−37.2***
Weekly hours48.737.611.1***
Job Flexibility
High23.6%33.0%−9.4***
Some27.2%31.5%−4.3**
None48.9%35.1%13.8***
Job Search
Looking for job10.6%16.2%−5.5***
Start business9.1%7.2%1.9**
Would like to49.3%51.8%−2.5
Satisfied31.0%24.9%6.2***
Baseline Facts · Household

Household Attributes & Income Distribution

Characteristic Sample Composition
Household Size & Composition
Average household size3.8 members
Children under 6 per HH1.13
HH with child <6 not in childcare27.6%
HH with member needing permanent care32.0%
Household Income Category
Low income (<1.3M COP)28%~$6,200 USD
Middle income (1.3–3.9M COP)60%~$6,200–$18,600 USD
High income (>3.9M COP)12%>$18,600 USD
Sample Composition
Total households1,732
Total individuals3,464 (1,732 couples)
Baseline Facts · Pluralistic Ignorance

Pluralistic Ignorance: Everyone Supports, Everyone Thinks Others Don't

👨 Husbands' Misperception
Fathers' support misperception: −27.5 pp
Actual: 88.5% | Perceived: 60.98%

Mothers' support misperception: −10.9 pp
Actual: 90.5% | Perceived: 79.61%
👩 Wives' Misperception
Fathers' support misperception: −32.8 pp
Actual: 88.5% | Perceived: 55.70%

Mothers' support misperception: −10.5 pp
Actual: 90.5% | Perceived: 80.01%
💑 Spousal Misperceptions (Within-Couple)
Husbands underestimate wives: −3.4 pp*
Husband thinks: 93.9% | Wife actual: 90.5%

Wives underestimate husbands: −1.4 pp*
Wife thinks: 89.9% | Husband actual: 88.5%
*optimistic (overestimate)
🚌 Placebo Norm (Green Transport)
Husband support: 93.5%
Wife support: 94.9%

NO MISPERCEPTION
High consensus, no gender gap

Pluralistic Ignorance Pattern: Both spouses privately support maternal employment, but dramatically underestimate community support for fathers (−27.5 to −32.8 pp). Misperception about mothers' support is much smaller (−10.5 pp). Within-couple, spouses are optimistic (slightly overestimate each other's support). The community-level friction dominates, not spousal. Placebo shows this is gender-norm-specific.

Baseline Facts · Beliefs

Baseline Beliefs: Target Norm & Placebo

Belief Type Husbands Wives Difference
A. Target Norm: "Mothers of children <6 should be free to work"
First-order (own view)88.5%90.5%−2.0 pp**
Second-order: Men (estimate of fathers)61.0%55.7%+5.3 pp***
Second-order: Women (estimate of mothers)79.6%80.0%−0.4 pp
Spousal second-order93.9%89.9%+4.1 pp***
B. Placebo Norm: "Companies should subsidize public transport"
First-order (own view)93.5%94.9%−1.4 pp***

N = 1,732 couples. High first-order support for both norms (88–95%). Misperception concentrated on father's support for maternal employment (gap: 27–33 pp). Placebo norm shows no such gap.

Baseline Facts · Figure

Both Sexes Underestimate Men's Support by ~20–30 pp

Men's support misperception distribution
Distribution of first-order beliefs vs. second-order beliefs about men's support for maternal employment. True share = 89%. Perceived share peaks at ~60%.
Baseline Facts · Figure

Women's Perceived Support: Smaller Gap, Similar Pattern

Women's support misperception distribution
Distribution of first-order beliefs vs. second-order beliefs about women's support. True share ≈ 91%. Perceived support still understated, but gap is smaller (~10 pp).
Section IV

Empirical Strategy

IPWRA · Attrition correction · Multiple testing
Research Agenda Experimental Design Baseline Facts Empirical Strategy Results Heterogeneity Conclusions
Empirical Strategy · IPWRA

Addressing Two Sources of Non-Random Selection

Problem 1 — Attrition: not all baseline respondents observed in midline/endline → may threaten internal validity if selection correlates with potential outcomes.

Problem 2 — Selective engagement: only a subset of treated respondents engages with the WhatsApp module → covariates can become imbalanced within the realized sample.

Step 2 — IPWRA within each sample S (doubly robust ATT):

ATT̂ = (1/NTw) Σi wiS [ Di(Yi − m̂0) + (1−Di)·(êi/(1−êi))·(m̂1 − m̂0) ]

e(·) = treatment model Pr(D=1│XD, strata); md(·) = outcome model E[Y│D=d, XD, strata]. Step 1 weights enter as pweights. Consistent if either e(·) or md(·) is correctly specified.

  • Robustness (4 weight specs): baseline probit-PS · winsorised p95 · trimmed (drop PS < 0.10) · logit PS — Appendix Table A.IPWRAsens.
  • Inference: Fisher exact (randomization) p-values primary; Romano-Wolf step-down for outcome families; Lee (2009) sharp bounds; near-miss timing diagnostic — these validate the IPW correction, not separate tests.
  • Reference-group accuracy: the disclosed Bogotá-average norm must reflect the engager subpopulation's actual reference group. Engagers vs. non-engagers hold virtually identical priors on the targeted second-order belief (58.1 vs. 58.6, p > 0.5). Maximum subgroup deviation from city-wide mean is 3.3 pp — less than 20% of the 28 pp misperception being corrected, and the sign is conservative.
Section V

Results

Community beliefs · Spousal beliefs · Course allocation · Labor market outcomes
Research Agenda Experimental Design Baseline Facts Empirical Strategy Results Heterogeneity Conclusions
Results · Research Questions

Three Research Questions & Dependent Variables

RQ 3 · Decisions & Labor
Do belief updates affect (a) intra-household allocation of a career-building course, and (b) short-run job search and labor market aspirations?
Dependent Variables:
(a) Course enrollment decision · (b) Job search effort, labor market aspirations, work-family balance preferences
Course: Sample 2Labor: Sample 2∩3
Research Question 1

Dependent Variables

Does information about actual community support for maternal employment correct first-order and second-order beliefs?
• First-order beliefs (FOB)
• Second-order beliefs (SOB): Men
• Misperceptions: Men's support
• Second-order beliefs (SOB): Women
• Misperceptions: Women's support
Results · RQ1 — Community Beliefs

Does Information Correct Community Second-Order Beliefs? Complete Results

Outcome First-Order
Belief (1)
2nd-Order:
Men's Support (2)
2nd-Order:
Women's Support (3)
Misperception
Men D (4)
Misperception
Women D (5)
Panel A — All (N = 1,102)
ATT 0.001 2.75* 3.58** −0.054* −0.047
Control mean 0.902 63.2 75.8 0.832 0.553
Panel B — Men (N = 453)
ATT −0.004 2.82 4.50* −0.110** −0.047
Control mean 0.903 61.0 79.6 0.901 0.596
Panel C — Women (N = 649)
ATT −0.002 2.57 2.30 −0.018 −0.040
Control mean 0.901 64.8 80.0 0.800 0.535

Key finding 2: Community beliefs do correct. Men's misperception of male support falls −11 pp (p=0.031). Women perceive male support +4.5 pp (p=0.058).

Sample: Respondents in both midline and endline surveys (Sample 2∩3, N=1,102). ATT = Average Treatment effect on the Treated (IPWRA, 90% CI). Control means are unadjusted baseline/endline values.
Results

First-Order Beliefs: Treatment Effects by Gender

Men (n=373)
Control
90.3%
Treatment
89.9%
Effect (pp):+-0.4
Effect (%):-0.4%
p-value:p>0.10
Women (n=644)
Control
90.1%
Treatment
89.9%
Effect (pp):-0.2
Effect (%):-0.2%
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Point estimates with relative % change.
Results · RQ1 — Second-Order Beliefs

Perceived Men's Support: Treatment Effects by Gender

Men (n=373)
Control
61.0
Treatment
63.8
Effect (pp):+2.8
Effect (%):+4.6%
p-value:p>0.10
Women (n=644)
Control
64.8
Treatment
67.4
Effect (pp):+2.6
Effect (%):+4.0%
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Point estimates with relative % change.
Results · RQ1 — Second-Order Beliefs

Perceived Women's Support: Treatment Effects by Gender

Men (n=373)
Control
79.6
Treatment
84.1
Effect (pp):+4.5
Effect (%):+5.7%
p-value:0.058 *
Women (n=644)
Control
80.0
Treatment
82.3
Effect (pp):+2.3
Effect (%):+2.9%
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Point estimates with relative % change.
Results · RQ1 — Misperceptions

Men's Support Misperceptions: Treatment Effects by Gender

Men (n=373)
Control
0.901
Treatment
0.791
Effect:-0.110
Effect (%):-12.2%
p-value:0.031 **
Women (n=644)
Control
0.800
Treatment
0.782
Effect:-0.018
Effect (%):-2.2%
p-value:p>0.10
Results · RQ1 — Misperceptions

Women's Support Misperceptions: Treatment Effects by Gender

Men (n=373)
Control
0.596
Treatment
0.549
Effect:-0.047
Effect (%):-7.9%
p-value:p>0.10
Women (n=644)
Control
0.535
Treatment
0.495
Effect:-0.040
Effect (%):-7.5%
p-value:p>0.10
Research Question 2

Dependent Variables

Does correcting community-level beliefs spill over to within-couple perceptions of spousal support?
• Spouse's support for maternal
• Misperceptions (maternal)
• Spouse's support for equal tasks
• Misperceptions (task-sharing)
Results · RQ2 — Spousal Beliefs

Does Information Spill Over into Spousal Beliefs?

✓ Community beliefs corrected
+3–5 pp perceived men's/women's support
Community 2nd-order beliefs ✓
⟵ ⟶
Spousal 2nd-order beliefs
↑ perceived partner's support?
Intra-household decision
+
Labor market outcomes

Mechanism: Respondents are never told their partner's actual baseline beliefs. Any updating of spousal perceptions must arise from generalized introspection — corrected community norms lead individuals to reassess expectations about close others.

Results · RQ2 — Spousal Beliefs

Spousal Beliefs: +6 pp Perceived Partner Support

Working Mothers Equal Task Sharing
Panel 1st-order 2nd-order (Spouse) Misperception D 1st-order 2nd-order (Spouse) Misperception D
Panel A — All (N = 1,102)
ATT 0.009 0.063*** −0.028 0.021** 0.038** −0.027
Control mean 0.900 0.885 0.184 0.965 0.899 0.104
Panel B — Men (N = 453)
ATT 0.010 0.059** −0.046 0.013 −0.004 −0.002
Panel C — Women (N = 649)
ATT 0.004 0.063** −0.004 0.025*** 0.061** −0.040

Working mothers: Perceived spousal support rises by 6.3 pp (p=0.001) — similar for men (+5.9 pp) and women (+6.3 pp). Community-level correction spills over into within-couple beliefs.

Results · RQ2 — Spousal Beliefs

Spouse's Support for Maternal Employment

Men (n=373)
Control
56.6
Treatment
61.6
Effect (pp):+5.0
Effect (%):+8.8%
p-value:p>0.10
Women (n=644)
Control
63.1
Treatment
69.4
Effect (pp):+6.3
Effect (%):+10.0%
p-value:0.001 ***
Method: IPWRA with Fisher exact p-values. Point estimates with relative % change.
Results · RQ2 — Spousal Beliefs

Spouse's Support for Equal Task Sharing

Men (n=373)
Control
45.2
Treatment
49.0
Effect (pp):+3.8
Effect (%):+8.4%
p-value:p>0.10
Women (n=644)
Control
45.5
Treatment
51.6
Effect (pp):+6.1
Effect (%):+13.4%
p-value:0.016 **
Method: IPWRA with Fisher exact p-values. Point estimates with relative % change.
Results · RQ2 — Spousal Beliefs

Misperceptions about Spouse's Support for Maternal Employment

Men (n=453)
Effect:-0.046
p-value:p>0.10
Women (n=649)
Effect:-0.004
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Negative = perception gap shrinks (IPWRA ATT).
Results · RQ2 — Spousal Beliefs

Misperceptions about Spouse's Support for Equal Task Sharing

Men (n=453)
Effect:-0.002
p-value:p>0.10
Women (n=649)
Effect:-0.040
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Negative = perception gap shrinks (IPWRA ATT).
Research Question 3a

Dependent Variables

How do corrected beliefs affect couples' decisions on resource allocation and investment in women's human capital?
• Wife should attend the course (nomination)
• Respondent's own interest in course
• Belief about partner's interest in course
Results · RQ3a — Course Allocation

Do Updated Beliefs Change Intra-Household Decisions?

✓ Community + Spousal beliefs corrected
Community beliefs ✓
⟵ ⟶
Spousal beliefs ✓
Intra-household decision
Wife or husband for the course?
+
Labor market outcomes

The stakes: Real, concrete online career program — not hypothetical. Both spouses report high own interest (74–82%). Husbands substantially underestimate wife's interest (only 57% guessed correctly).

Results · RQ3a — Course Allocation

Men +9.1 pp More Likely to Nominate Wife for the Course

Wife Should
Attend Course (1)
Are You
Interested? (2)
Is Partner
Interested? (3)
Panel A — All (N = 1,017)
ATT 0.023 −0.014 −0.022
Control mean0.6880.7930.460
Panel B — Men (N = 373)
ATT 0.091
(p = 0.104)
−0.046 0.018
Control mean0.4020.7430.574
Panel C — Women (N = 644)
ATT −0.006 0.007 −0.019
Control mean0.8410.8190.374

Men (+9.1 pp, +23%): IPWRA p=0.104; Fisher exact p=0.011; Lee bounds strictly positive at 0.8% trimming; stable across all 4 IPWRA weight specs (0.091–0.094). Suggestive but credible.

Results · RQ3a — Course Allocation

Men +9.1 pp More Likely to Nominate Wife for the Course

Men (n=373)
Control
40.2
Treatment
49.3
Effect (pp):+9.1
Effect (%):+22.6%
p-value:0.011 **
Women (n=644)
Control
84.1
Treatment
83.5
Effect (pp):-0.6
Effect (%):-0.7%
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Point estimates with relative % change.
Results · RQ3a — Course Allocation

Respondent's Own Interest in Career Development Course

Men (n=373)
Control
74.3
Treatment
69.7
Effect (pp):-4.6
Effect (%):-6.2%
p-value:p>0.10
Women (n=644)
Control
81.9
Treatment
82.6
Effect (pp):+0.7
Effect (%):+0.9%
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Point estimates with relative % change.
Results · RQ3a — Course Allocation

Belief About Partner's Interest in Course

Men (n=373)
Control
57.4
Treatment
59.2
Effect (pp):+1.8
Effect (%):+3.1%
p-value:p>0.10
Women (n=644)
Control
37.4
Treatment
35.5
Effect (pp):-1.9
Effect (%):-5.1%
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Point estimates with relative % change.
Research Question 3b

Dependent Variables

Do corrected norms affect women's labor market behavior and economic aspirations?
• Job mobility (changed job or started business)
• Labor market aspirations
• Work-family balance preference
Results · RQ3b — Labor Market

Short-Run Labor Market Responses

✓ Beliefs corrected · ✓ Course shifted
Community beliefs ✓
⟵ ⟶
Spousal beliefs ✓
Course allocation ✓
+
Labor market outcomes
Job mobility · Aspirations · Work–family balance

Sample 2∩3 (N=1,102): treated at midline, surveyed at endline (1–2 months later). Outcomes: (1) job mobility — changed jobs or started a business; (2) aspires to improve LM situation; (3) wants to balance work and family.

Results · RQ3b — Labor Market

Women Search More (+10 pp) · Men Value Work–Family Balance More (+11 pp)

Job Mobility Aspires Better LM Work–Family Balance
Sample 2∩3 (1) Placebo 3∖2 (3) Sample 2∩3 (4) Sample 3 (5) Sample 2∩3 (6) Sample 3 (7)
Panel A — All (N = 1,102)
ATT 0.058* −0.056 0.005 0.000 0.052 0.038
Panel B — Men (N = 453)
ATT 0.012 −0.048 −0.042 −0.027 0.110* 0.066*
Control mean 0.664 0.492 0.317
Panel C — Women (N = 649)
ATT 0.096*** −0.087 0.054 0.018 −0.006 0.015
Control mean 0.725 0.507 0.361

Women's job mobility: +9.6 pp (p=0.006), +13% relative to control. Placebo negative and p>0.10 → not social desirability. RW p=0.013.

Men's work–family balance: +11 pp (p=0.054), +35% relative to control. Robust in Sample 3 (+6.6 pp, p=0.099). Aspirations: null for both.

Results · RQ3b — Labor Market

Job Mobility: Likelihood of Job Change or Business Start

Men (n=453)
Control
66.4%
Treatment
67.6%
Effect (pp):+1.2%
p-value:p>0.10
Women (n=649)
Control
72.5%
Treatment
82.1%
Effect (pp):+9.6%
p-value:0.006 ***
Method: IPWRA with Fisher exact p-values. Sample 2∩3 (both midline + endline), N=1,102 total.
Results · RQ3b — Labor Market

Labor Market Aspirations: Desire to Improve LM Situation

Men (n=453)
Control
49.2%
Treatment
45.0%
Effect (pp):-4.2%
p-value:p>0.10
Women (n=649)
Control
50.7%
Treatment
56.1%
Effect (pp):+5.4%
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Sample 2∩3 (both midline + endline), N=1,102 total.
Results · RQ3b — Labor Market

Work–Family Balance: Preference for Balancing Work and Family

Men (n=453)
Control
31.7%
Treatment
42.7%
Effect (pp):+11.0%
p-value:0.054 *
Women (n=649)
Control
36.1%
Treatment
35.5%
Effect (pp):-0.6%
p-value:p>0.10
Method: IPWRA with Fisher exact p-values. Sample 2∩3 (both midline + endline), N=1,102 total.
Results · Labor Margins

Exploratory: Multiple Labor-Market Margins Respond for Treated Women

Job started
IPWRA — job started
Work–family balance
IPWRA — work-family balance
Aspirations
IPWRA — aspirations

Pattern: treated women are more likely to have started a job, report better work–family balance, and higher career aspirations — consistent with a relaxed social-permission constraint among those already attached to the labor market.

Results · Mechanism 1 — Spillover

Spousal beliefs move more than community beliefs — even though we only delivered community info

Spillover effect: community to spousal beliefs

Direct (informed) channel: Community SOB updates +2.7 pp (men's support) and +3.3 pp (women's support).

Indirect (spillover) channel: Spousal SOB updates +6.2 pp on both — about 2× the direct effect.

Theory: generalized introspection (Boltz 2025) — credible information about the community realigns perceptions of all close others, including the partner, because the partner is part of "the community." The spousal effect is a downstream consequence of correcting community beliefs, not an independently informed margin.

Results · Mechanism 2 — Belief → Action

Belief updates → action: directional consistency, not formal mediation

Real behavior (we trust most): Job Mobility (women) +9.6 pp · Course Allocation (men) +9.1 pp.

Stated preferences (interpret with caveat): Work-Family Balance (men) +11.0 pp · Aspirations null.

Belief-to-behavior translation
Results · Heterogeneity (mechanism evidence)

Where the treatment works: heterogeneity by wife's labor attachment

Heterogeneity by wife's labor attachment

Active wives: beliefs +3.2/+5.3 pp · spousal +6.2 pp · course +11–13 pp · job search increases

Inactive wives: belief updating ≈ 0 · course allocation ≈ 0 · exception: work-family balance concentrates here

Why this is "mechanism evidence" (not just heterogeneity): the pattern discriminates between hypotheses. If treatment moved preferences, effects should be uniform. The fact that only labor-attached households respond is consistent with information lifting a social-permission constraint that binds only when structural barriers (childcare, demand) are already low. Information complements — not substitutes for — structural policy.

Results · Within-couple exposure (descriptive)

Information does not diffuse automatically within couples

⚠ Caveat — endogeneity: we did not randomize which spouse engaged with the WhatsApp module. Direct / indirect / joint exposure configurations are endogenous. We therefore interpret these results as descriptive, not causal.

Information diffusion patterns

Direct (own engagement): spousal SOB +6.2 pp — replicates main result

Indirect (only partner): spousal SOB ≈ 0; men's 1st-order ↓ 4–6 pp (possible reactance)

Joint (both partners): spousal SOB +8.9 pp; course allocation +9–10 pp — strongest

Section VI

Heterogeneity

Labor market attachment strongly conditions treatment effects
Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube
Heterogeneity · Labor Market Attachment

Effects Concentrate Among Women Already Attached to the Labor Market

Employed Women
Largest effects on job mobility and job search. Information is actionable — women can respond immediately.
Unemployed Women
Moderate positive effects. Active job-seekers respond to updated beliefs about social permission.
Inactive Women
Little response. Structural barriers (childcare, flexibility, labor demand) dominate — norms alone are insufficient.
Heterogeneity by LFP status

Policy implication: Norm-correcting information and structural interventions (childcare, workplace flexibility) are complements, not substitutes. Each targets a different margin of the same problem.

Robustness

The Main Results Are Robust

Lee (2009) Sharp Bounds
  • Trimming fraction for course (men) < 0.8% — both bounds strictly positive
  • Women's job search: Lee bounds positive; belief outcomes include zero
  • Attrition-robust: monotonicity holds for course (Sample 2 = midline engagers)
IPWRA Sensitivity (4 specs)
  • Baseline probit · Winsorized (p95) · Trimmed (PS<0.10) · Logit PS
  • Course men: 0.091–0.094 across all specs; crosses 10% threshold under winsorized weights
  • Women job search: 0.091–0.096 (p=0.006–0.009) — stable and significant
  • Work–family men: fragile to trimming (p=0.198 with 11 high-leverage obs dropped)
Romano-Wolf Step-Down
  • F1 (community beliefs): RW p > 0.40
  • F3 (course, men): RW p = 0.077 (marginal)
  • Women's job search: RW p = 0.013 — survives MHT correction
Additional Checks
  • Near-miss timing placebo: no differential loss on key belief variables
  • Engager characterization: beliefs identical between engagers and non-engagers
  • Reference group accuracy: disclosed norm accurate within ±3.3 pp for all subgroups
Section VII

Conclusions

What we found · What it means · What comes next
Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube
Conclusions · Summary

What We Found

✓ Norms Shift Within-Couple Decisions
Treated men are 9.1 pp (+23%) more likely to nominate their wife for a career-development course. A zero-sum choice with direct personal cost — a lower bound on the willingness to invest in the wife's career.
✓ Women Respond with More Job Search
Treated women report +9.6 pp more job mobility in 1–2 months (p=0.006; RW p=0.013). Placebo timing check rules out social desirability. Effects driven by already labor-market attached women.
⚠ Inactive Women Respond Little
For women out of the labor force, correcting norms alone is insufficient. Structural barriers — childcare, workplace flexibility — dominate. Norm feedback and structural policies are complements.
Conclusions · Implications

What It Means

  • The binding constraint is informational, not attitudinal. In contexts where private support for women's work is already high, the gap between attitudes and behavior may be sustained not by deep preferences but by inaccurate beliefs about peers. Policy that targets the information friction can shift behavior at low cost.
  • Couples matter as a unit. Studying a single respondent misses the mutual misperception dynamic and the within-household allocation channel. Information that corrects community-level beliefs also changes how each spouse perceives the other — with behavioral implications for the husband's course choice and the wife's job search.
  • For whom does information work? Effects concentrate among women who are already employed or actively seeking work. For inactive women, norm feedback is not enough — complementary policies addressing structural barriers (childcare, flexible hours, labor demand) are needed to move them into the labor market.
  • Scalability: WhatsApp-based norm correction is low-cost and digitally deliverable at scale in LAC cities. Engagement rates (~36%) are typical for low-cost digital interventions but underscore the importance of sustained exposure (endline reinforcement was necessary to achieve the full effect).
What's Next

Gender Norms in Digital Spaces

(Brief closing) Does the silent-support pattern appear publicly and at scale? YouTube provides a low-cost window into community norm perceptions.
Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube
Next Steps · YouTube Project

Suppressed Voices: Pluralistic Ignorance in the Wild

The Data
  • 35 channels across 8 countries (CO, MX, AR, CL, PE, ES, EC, GT)
  • 18.85M comments total (13.1M top-level)
  • 57.6K videos · full backfill, no sampling
  • Classified panel: 202K challenging, 20K enforcing
  • NLP classifier (keyword + context), RA-validated on n=500
Key Observational Finding

Challenging (progressive) comments get more likes; enforcing (traditional) comments get more replies.

  • Baseline challenging: +0.173*** log-likes, +0.043*** log-replies
  • Vos Podés (female-centered): +0.156 gap***
  • Los Hombres Sí Lloran (male-centered): −0.023 (inverse)
  • Interpretation: progressive views attract silent approval; traditional views provoke public pushback — the behavioral signature of pluralistic ignorance.
Next Steps · YouTube Project

The RCT: Can One Comment Break the Silence?

Design
  • Researcher posts one comment within 2h of upload
  • Unit of randomization: video (sequential, pre-registered)
  • Two norm domains: FLFP + Caregiving
  • MAMS adaptive: Stage 1 (30 vids/arm) → Bayesian interim → Stage 2 (40 vids/arm)
  • Primary outcome: ≥1 organic norm-challenging comment in days 2–7
  • Power: 80% at 15 pp effect; 70 vids/arm pooled
Treatment Arms

T0 (Control): no seeded comment — baseline silence

T1 — Norm-expr. FLFP: "El trabajo de las mujeres fuera del hogar beneficia…"

T3 — Norm-expr. Care: "Los hombres en el cuidado son mejores padres y parejas…"

T1/T3 restricted from male-centered channels. T2/T4 + polls run everywhere.

Next Steps · YouTube Project

The Poll Arm: Revealed Preference for Gender Norms

Poll — Progressive

"Si crees que los hombres deberían participar más en el cuidado del hogar, dale like a este comentario 👍"

Poll — Conservative

"Si crees que los roles tradicionales en el hogar funcionan bien, dale like a este comentario 👍"

Mechanism Test
  • T2/T4 shifts ratio → belief-updating
  • T1/T3 effect without shift → imitation
  • • Both pre-specified in AEA Registry

Why this matters: the poll transforms a single-outcome RCT into a mechanism-identification design. It is also immediately replicable by any practitioner on any platform — a standalone policy tool for mapping community-norm misperceptions in real time.

Next Steps · YouTube Project

Status and Research Agenda

Completed ✓
  • ~18.85M comments collected, classified, validated
  • Observational regressions run (key findings confirmed)
  • YouTube Researcher Programme approved (April 2026)
  • Creator debriefing protocol designed
  • IRB draft (v4) — in progress
To Do
  • IRB approval at PUJ (PROYRECUPR)
  • AEA RCT pre-registration
  • Google/YouTube notification (7 days before start)
  • Build rct_manager.py (video detection → arm draw → post → log)
  • Start Stage 1 data collection
Appendix

Appendix

Full tables · Robustness · Diagnostics · Additional results
A2 Full beliefs A3 Romano-Wolf (+ intuition) A4 Lee bounds (+ intuition) A5 IPWRA sensitivity (+ intuition) A6 Balance tests A7 Attrition A8 PS overlap A9 OLS vs IPWRA A10 Het table A11 Indirect effects A12 Near-miss + Engagers (+ intuition)
Appendix
Appendix · Baseline

Full Baseline Beliefs — 8 Gender Norms

Norm Statement Men
1st-order
Women
1st-order
Men's est.
men's support
Men's est.
women's support
Mothers with children <6 should be free to work88.5%90.5%61.0%79.6%
Fathers and mothers should share childcare equally
Children suffer when mother works
Problems arise if wife earns more than husband
Placebo: companies should subsidize green transport93.5%94.9%

Across all 8 gender-norm items, the same pattern holds: progressive private attitudes coexist with sizable misperceptions about others' views, particularly men's support. The placebo shows near-universal agreement and no misperceptions — confirming misperceptions are norm-specific, not general pessimism.

Equal tasks beliefs figure
Appendix
Appendix · Multiple Testing — What does it do?

Romano-Wolf Step-Down: Intuition

What we do: Romano-Wolf adjusts each p-value by simulating the joint distribution of all test statistics under the null (1,000 bootstrap replications, clustered by household). It produces a family-wise error rate–controlled p-value for every outcome, accounting for the dependence structure between them.

Plain English: "Even if I throw lots of outcomes at this experiment, here is the p-value adjusted for the fact that I am fishing in many ponds. Results that survive RW are not lucky strikes."

  • Computed on unweighted OLS — conservative relative to IPWRA
  • Outcome families: F1 community beliefs · F2 spousal beliefs · F3 course allocation · F4 labor
  • Headline: women's job mobility survives (RW p = 0.013); men's course nomination is marginal (RW p = 0.077)
Appendix
Appendix · Multiple Testing

Romano-Wolf Step-Down p-values

Outcome Family & Variable OLS coef. Fisher p RW p Survives?
F1 — Community Beliefs (4 outcomes)
Perceived men's support+2.75(0.089)(0.40+)
Misperception indicator, men−0.054(0.061)(>0.40)
F3 — Course Allocation (men only)
Wife should attend course (men)+0.0910.0110.077marginal
F4 — Labor Outcomes (women)
Job mobility (women, Sample 2∩3)+0.0960.0060.013 **
Aspires better LM (women)+0.054(0.223)(>0.49)

Notes: Romano-Wolf computed on OLS (unweighted) — conservative vs. IPWRA. 1,000 replications, seed(12345), clustered by household. Women's job mobility survives stepdown correction (RW p=0.013). Course nomination for men is marginal (RW p=0.077).

Appendix
Appendix · Attrition Robustness — What does it do?

Lee (2009) Sharp Bounds: Intuition

What we do: Trim observations from the lower-attrition arm to make response rates equal across treatment and control. Then compute the worst-case and best-case ATT — the interval brackets all possible values consistent with monotonicity (treatment doesn't change who attrits).

Plain English: "Imagine the absolute worst possible scenario about who dropped out. Even then, my treatment effect lies somewhere in this range. If both ends of the range exclude zero, my result holds even under unobservable bias."

  • Headline: Both bounds strictly positive for men's course nomination and women's job search
  • Belief outcomes: bounds include zero — consistent with no robust belief effects
  • Key assumption: monotonicity (treatment doesn't push you to drop out)
Appendix
Appendix · Attrition Robustness

Lee (2009) Sharp Bounds

Outcome ATT (IPWRA) Lower Bound Upper Bound Trimming % Both Positive?
Course Allocation
Wife attends course (men)0.0910.0790.1060.8%✓ Yes
Labor Market (Sample 2∩3)
Job mobility (women)0.0960.0480.1342.1%✓ Yes
Work–family balance (men)0.110−0.0080.2141.6%
Community Beliefs (Sample 2∩3)
Perceived men's support2.75−1.2+6.5

Interpretation: Lee bounds apply when treatment monotonically increases probability of being in sample. For the course (Sample 2), this is satisfied by design (engagers). Both bounds strictly positive for men's course nomination and women's job search — key results hold under worst-case attrition scenarios consistent with monotonicity.

Appendix
Appendix · IPWRA Sensitivity — What does it do?

IPWRA Sensitivity: Intuition

What we do: re-estimate the IPWRA ATT under 4 alternative weight constructions to check that headline results don't depend on the most extreme weights:

  • (i) Baseline: probit propensity score (preferred)
  • (ii) Winsorised: cap weights at 95th percentile
  • (iii) Trimmed: drop observations with PS < 0.10 (≈ 1% of obs)
  • (iv) Logit PS: alternative functional form for the selection model

Plain English: "If a handful of unusual observations were driving my result, the estimate would change a lot when I cap or drop them. It doesn't change → my result is robust, not artifact of extreme weights."

Headline: men's course estimate stable at 0.091–0.094 across all 4 specs; women's job mobility stable at 0.091–0.096.

Appendix
Appendix · IPWRA Sensitivity

IPWRA Sensitivity to Alternative Weight Specifications

Specification Course (men)
coef. / p
Job mobility (women)
coef. / p
Work–family (men)
coef. / p
Headline: Probit PS weights (untrimmed)
Baseline0.091 / (0.115)0.096 / (0.006)0.110 / (0.054)
Sensitivity checks
Winsorized (cap p95)0.094 / (0.086) *0.091 / (0.009)0.101 / (0.057)
Trimmed (drop PS < 0.10, N−11)0.091 / (0.108)0.096 / (0.006)0.072 / (0.198) ✗
Logit PS0.094 / (0.105)0.095 / (0.007)0.109 / (0.051)

Course (men): Estimate stable at 0.091–0.094 across all 4 specs. Crosses 10% threshold under winsorized weights. Lee bounds positive → the 9 pp estimate is credible.

Work–family balance (men): Fragile to trimming — 11 high-leverage observations matter. Interpret cautiously; direction consistent but precision conditional on those obs.

Appendix
Appendix · Diagnostics

Balance Tests: Treatment Assignment

Balance table

After IPWRA weighting, maximum absolute standardized mean differences (SMDs) are below 0.10 in all samples and genders. Some covariates show marginal imbalance in Sample 2∩3 (joint F tests reject), but effect sizes are small, and post-weighting balance is tight. The key variable — second-order belief about men's community support — does not differ significantly across treatment and control arms in any sample.

Appendix
Appendix · Attrition

Attrition Diagnostics

Attrition diagnostics

Endline attrition: 40% response rate. Attritors are more likely to be employed and younger — consistent with time availability. After weighting, SMDs < 0.10.

Appendix
Appendix · Diagnostics

Propensity Score Overlap

Treatment PS — All
Treatment PS overlap
Attrition PS — All
Attrition PS overlap

Propensity scores range from 0.06 to 0.76; overlap is adequate in all samples. Effective sample sizes remain large after weighting; mass outside common support is small. P-score densities from 0.06–0.76 → no extreme regions of non-overlap that would invalidate IPWRA.

Appendix
Appendix · Specification

OLS vs. IPWRA: Estimates Are Similar

Outcome OLS OLS +
weights
IPWRA
(preferred)
Direction
consistent?
Beliefs — Men's community SOB (men only)
Perceived men's support+3.1*+2.9*+2.82
Course — Wife attends (men only)
Wife should attend course+0.087*+0.089+0.091
Labor — Job mobility (women, Sample 2∩3)
Job mobility+0.094***+0.096***+0.096***
Labor — Work–family balance (men)
Wants work–family balance+0.108*+0.109*+0.110*

IPWRA is the preferred specification chosen a priori to address selection into midline take-up. OLS and weighted-OLS produce nearly identical point estimates across all headline results — the choice of estimator does not drive the findings.

Appendix
Appendix · Heterogeneity

Heterogeneity by Wife's Baseline Labor Status — Full Results

Outcome (Women) All Women Employed Unemployed Inactive
Job Mobility (Sample 2∩3)
ATT0.096***0.148**0.089*0.018
Control mean0.7250.7120.7800.699
Labor-Market Aspirations (Sample 2∩3)
ATT0.0540.0310.1190.018
Course — Wife attends (Men, by wife's status)
ATT (men)0.0910.134*0.0510.042

Job mobility effects are concentrated among employed (+14.8 pp) and unemployed (+8.9 pp) women. Inactive women show near-zero effects. The course nomination effect is also largest when the wife is employed (+13.4 pp, p<0.10). Together, these results suggest information works at the margin where action is already feasible.

Appendix
Appendix · Exposure Patterns

Indirect vs. Direct Exposure — Spillovers Within Couples

  • Direct exposure (Sample 2∩3): Respondent personally engaged with WhatsApp chatbot + received endline reinforcement. Main analysis sample.
  • Indirect exposure: Respondent did not engage at midline, but their partner did. Column (2) in labor market table includes "direct or indirect T" — captures potential within-couple discussion spillovers.
  • Result: Women's job mobility under direct+indirect exposure = +7.5 pp (p=0.028) — somewhat smaller than direct only (+9.6 pp). Suggests some information diffuses within couple, but weaker than direct receipt.
  • Men's beliefs: Indirect exposure effects on men's community beliefs and course nomination are small and p>0.10 — consistent with low within-couple discussion of labor-market plans for men.
Appendix
Appendix · Validity Checks — What do they do?

Near-Miss & Engager Diagnostics: Intuition

Engager characterization (selective take-up): only 36% engage with the WhatsApp module. We compare engagers vs. non-engagers on (a) demographics and (b) the targeted second-order belief.

Plain English: "Engagers are more inactive (selection on demographics — fix with IPW step 2). But they hold the same prior on community support as non-engagers — so the disclosed norm is accurate for them, the people who actually received it."

Reference-group accuracy: max deviation of any subgroup's mean SOB from city-wide average is 3.3 pp. The misperception we are correcting is 28 pp → reference-group mismatch is < 12% of the corrected signal. Disclosed Bogotá-average norm is a valid proxy for every demographic subgroup.

Appendix
Appendix · Validity Checks

Near-Miss Timing Placebo & Engager Characterization

Near-Miss Timing Placebo
  • Endline ran Nov 18 – Jan 20 (63 days). "Hard to reach" = Dec–Jan (N=492); "Easy" = November (N=379)
  • Key belief variable: 2nd-order belief about men's support. Nov: 58.3; Dec–Jan: 59.4; diff. +1.2 pp (p>0.5)
  • No differential loss on the variable the treatment corrects → attrition unlikely to confound
Engager Characterization
  • Engagers (N=1,236) vs. non-engagers (N=2,228): more inactive (+11 pp), more care burden, fewer employed (−11 pp)
  • BUT: 2nd-order beliefs virtually identical (58.1 vs. 58.6, diff 0.5 pp, p>0.5) → disclosed norm is accurate for engagers' reference group
  • Engagement balanced across arms: 35.8% treated vs. 35.6% control

Reference group accuracy: Max deviation of any subgroup's mean SOB from city-wide average = 3.3 pp (high-SES). The corrected misperception is ~28 pp → reference group mismatch is <12% of the corrected signal. Disclosed norm is valid for all demographic subgroups in the sample.

Appendix
Appendix · Figures

Spousal Beliefs — IPWRA Estimates by Gender

Spousal beliefs figure
IPWRA estimates of treatment effects on spousal second-order beliefs, by gender and norm. 90% CI. Sample 2∩3.
Appendix
Appendix · Mechanism

Mechanism: IV Mediation (Exploratory)

  • Setup: 2SLS system: treatment Z instruments mediator M (follow-up perceived community support); M instruments on labor outcomes Y. With one instrument and one mediator, the mediated share = 1 mechanically → interpreted as sign check, not a proportion estimate.
  • Sign pattern: Consistent with the proposed pathway. Updated perceived societal support → increased job search for women; updated work–family balance beliefs → increased aspiration for men.
  • Caveat: Cannot cleanly distinguish community-level vs. spousal-level channel, as both beliefs updated simultaneously (particularly under double exposure). Generalized introspection (Boltz 2025) predicts both move together.
  • Belief updates close 15–25% of the gap between treatment and control on labor outcomes — the mediation channel is real but partial, consistent with norm correction being a necessary but not sufficient condition for full behavioral response.

The mediation exercise supplements rather than replaces the reduced-form evidence. We treat it as a consistency check on the sign pattern and direction of the channels.

Appendix
Appendix · Pre-registration & IRB

Pre-Registration, IRB, and Timeline

IRB
IRB certificate from Pontificia Universidad Javeriana · Approved 2024-04-24. Both partners consented individually.
StageDateN
Baseline survey (in-person/phone)Jul–Sep 20243,464 adults (1,732 couples)
RandomizationEnd Oct 20241,732 couples (1:1)
Midline — WhatsApp chatbotOct–Nov 20241,236 engaged (36%)
Endline — phone surveyNov 2024–Jan 20251,382 (≈40%)
Sample 2∩3 (both midline + endline)1,102

Replication data and code available at doi.org/10.7910/DVN/QYWHLA.