Norms Behind Closed Doors:
Misperceptions and Maternal Employment in Couples

A Field Experiment in Bogotá, Colombia

Marie Boltz · Monserrat Bustelo · Ana María Díaz · Agustina Suaya

U. Strasbourg / BETA · IADB · Pontificia Universidad Javeriana · IADB

Seminar · Javeriana, April 29, 2026 Pre-registered · AEARCTR-0014648 · Under Review EDCC Sample · 1,732 couples · Bogotá

Section I

Research Agenda

Where this paper comes from — and why it is the natural next step

Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube

Research Agenda · Paper I

What Is the Price of Freedom?

Bustelo, Díaz, Lafortune, Piras, Salas, Tessada — EDCC 2023 (published)

Estimating Women's Willingness to Pay for Job Schedule Flexibility

Discrete choice experiment with ~1,500 women in Bogotá. Reveals preferences for flexible schedules vs. part-time trade-offs.

Main Finding

Women sacrifice 15–20% of offered wages for full-time contracts with flexible schedules

Key Mechanism

Preference for schedule flexibility (when to work) — not for going part-time

Open Question

Are WTP patterns gendered? Do they operate within couples? Does context travel across countries?

What we learned: Flexibility is highly valued and commands a real wage sacrifice. Care burdens drive the premium. But we only studied women, only in one city, and couldn't say anything about household dynamics or gender norms.

Research Agenda · Paper II

Do Women Value Location Flexibility More Than Men?

Díaz, Salas, Piras, Suaya — Working Paper 2024

Gender Disparities in Valuing Remote and Hybrid Work in Latin America

DCE with ~4,785 workers across 5 Latin American countries (AR, CL, CO, MX, PE). Male-dominated sectors: manufacturing (operations supervisor) and IT (engineer / software developer).

Women's WTP

~13% / 10%

of wages sacrificed for hybrid (80% remote) / fully remote

Men's WTP

~8% / ≈0%

of wages for hybrid; no willingness to trade pay for fully remote

Substantive finding: Women are willing to sacrifice ~62.5% more wages than men for flexibility. For fully remote work, the gender gap is most stark: women give up ~10% of their salary, men give up essentially nothing.

What's missing: Individual choices don't reveal intra-household dynamics. Does the gender gap reflect preferences, or the fact that women anticipate they will absorb the household's need for flexibility?

Research Agenda · Paper III

Who Pays for the Partner's Flexibility?

Boltz, Díaz, Salas — Working Paper 2026

Gender Norms and WTP for Own vs. Partner's Job Flexibility — DCE with Couples in Bogotá

DCE eliciting each spouse's WTP for their own flexible job and for their partner's flexible job. N ≈ 450 couples.

WTP for Own Flexibility

Wives: 16.6% of wages
Husbands: 4.2% of wages
Wives value their own flexibility far more

WTP for Partner's Flexibility

Wives (for husband): 3.9%
Husbands (for wife): 21.8%
Husbands pay the most — to have a flexible wife

Critical finding: Only "support for mothers working outside the home" moderates the gender gap in WTP. Gender norms — not just preferences — shape how couples allocate labor market investments. But we can't change those norms with a DCE.

Research Agenda · Interactive

Your Estimate: Pluralistic Ignorance

Scan to Vote

Use your phone to vote on AhaSlides

Or visit: https://ahaslides.com/6NRKR

The Reality

In our baseline sample in Bogotá:

89%

of fathers support mothers working outside the home

Yet most think others are far more conservative.
This gap between private views and perceived norms is pluralistic ignorance.

Research Agenda · The Gap

The Open Question: Why Do Norms Persist?

Three papers document that gender norms shape labor market choices — from WTP for flexibility to intra-household allocations. But why do these norms persist, especially when private attitudes are already progressive?

The puzzle: In Bogotá, 89% of fathers privately support mothers working outside the home — yet female labor force participation remains 20 pp below men's.
One candidate mechanism: People think others are more conservative than they actually are. This pluralistic ignorance creates a social-permission friction — even when everyone privately agrees, no one acts because they think no one else does.
Evidence from 60 countries (Bursztyn et al. 2023): Systematic underestimation of support for women working, especially of men's support. The gap is 15–30 pp on average.
This paper asks: In a couple-based RCT, can correcting these misperceptions change intra-household decisions and women's labor supply?

Innovation 1

Representative sample of couples — measure mutual misperceptions within households

Innovation 2

Zero-sum revealed-preference allocation: course slot for wife or husband

Innovation 3

Norm delivered is estimated from the exact same population — representative reference group

Research Agenda · Global Context

Pluralistic Ignorance Is a Global Phenomenon

Bursztyn, Cappelen, Tungodden, Voena, Yanagizawa-Drott (2023): 60-country study on support for women working outside the home
In nearly every country, both men and women underestimate how much men support women working — gaps of 15–30 pp on average
The gap is largest precisely in countries where men's private support is high — a signature of pluralistic ignorance
Latin America: relatively high private support, yet substantial misperceptions

Global evidence on pluralistic ignorance

Bursztyn et al. (2023): Distribution of gaps between actual and perceived support for women working, 60 countries

Research Agenda · Latin America

Latin America: High Support, High Misperception

Bursztyn et al. (2023): LAC highlighted — misperceptions persist even where private support for women working is high

Research Agenda · Literature

Correcting Misperceptions: What Prior Experiments Show

Saudi Arabia (Bursztyn et al. 2020, AER): Informing men of true peer support for women working → men become more willing to let wives search for jobs; wives increase applications and interviews
Indonesia (Cameron et al. 2026): Sharing community norm data with 4,000+ respondents → 25% more likely to pick career course instead of equal-value voucher
Paraguay (Laszlo et al. 2025): Norm-shifting intervention in lab-in-the-field experiment → stronger beliefs in equitable household labor division

What is missing in the literature: All prior studies observe only one partner — cannot measure mutual misperceptions or study how information corrects within-couple belief gaps. Course choices and lab outcomes are often hypothetical. None use a probability-sampled population as both respondents and norm-source.

Our contribution: Both partners randomized, surveyed, and treated. Zero-sum career course allocation. Representative Bogotá sample delivers a norm calibrated on the exact reference group.

Research Agenda · Our Paper

Paper in a Nutshell

Question

Can correcting pluralistic ignorance about support for maternal employment shift intra-household decisions and women's labor supply?

Setting

1,732 cohabiting couples with young children in Bogotá, Colombia. Randomized WhatsApp + phone information intervention.

Fact: 89% of fathers privately support mothers working outside the home — yet estimate only 61% of other men agree. A 28 percentage-point misperception.
Intervention: Personalized feedback on actual community support, delivered via WhatsApp chatbot and phone follow-up.
Finding 1 — Beliefs: Treated individuals correct community beliefs by 3–5 pp; perceived spousal support rises by 6 pp.
Finding 2 — Decisions: Treated men are 9 pp more likely (+23%) to nominate their wife for a career-development course.
Finding 3 — Labor: Treated women report more intensive job search (+10 pp); treated men value work–family balance more (+11 pp).
Limit: Effects concentrated among labor-market attached women; inactive women respond little.

Research Agenda · Research Questions

Three Research Questions

RQ 1 · Societal Beliefs

Does information on actual community support correct second-order beliefs about the share of men and women who support maternal employment?

Community beliefs · Sample 2∩3

RQ 2 · Spousal Beliefs

Does correcting community-level misperceptions spill over into updated beliefs about the partner's views on maternal employment and task-sharing?

Spousal second-order beliefs · Sample 2∩3

RQ 3 · Decisions & Labor

Do belief updates affect (a) intra-household allocation of a career-building course, and (b) short-run job search and labor market aspirations?

Course · Sample 2 Labor · Sample 2∩3

Causal pathway: Information → community belief update → spousal belief update → intra-household decision → labor market behavior

Section II

Experimental Design

1,732 couples · Bogotá · WhatsApp + phone · Three survey waves

Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube

Experimental Design · Framework

Theory of Change

Randomized Information Treatment
Personalized feedback on community support for maternal employment (WhatsApp + phone)

▼

Update of community 2nd-order beliefs
(men's & women's support)

⟵ ⟶

Update of spousal 2nd-order beliefs
(perceived partner support)

▼

Intra-household decisions
Career course allocation
(wife or husband?)

+

Labor market outcomes
Job search, aspirations, work–family balance

Key mechanism: Within-couple misperceptions are optimistic (spouses slightly overestimate each other's support). The dominant friction is at the community level — each spouse hesitates to advocate for the wife's career because they believe broad societal disapproval is high.

Experimental Design · Sample

The Sample: 1,732 Cohabiting Couples in Bogotá

Couples

1,732

Cohabiting heterosexual couples; 3,464 adults surveyed

Eligibility

≥1 child

Under 6 years of age — stage when gender gaps in LFP widen most sharply

City

Bogotá

Women's LFP ≈ 64%; above LAC average (~52%), among highest of major cities

Design

1:1

Randomization at couple level; stratified by wife's LFP status and husband's first-order belief

Representative sample of households with ≥1 child under 6 in Bogotá — guarantees norm accuracy for the reference group
Both partners individually surveyed (in-person or phone, July–September 2024)
Household income: 28% low · 60% middle · 12% high — mirrors city distribution
Three survey waves: Baseline (Jul–Sep 2024) → Midline/WhatsApp (Oct 2024) → Endline by phone (Nov 2024–Jan 2025)

Experimental Design · Stage 1

Stage 1 — Baseline Survey & Belief Elicitation

First-order belief: do you agree mothers should be free to work?
Second-order beliefs: community-level estimates (how many fathers/mothers agree?)
Spousal beliefs: what do you think your partner believes?

Experimental Design · Stage 2

Stage 2 — Randomization at the Couple Level

Unit: couple (both partners receive the same arm)
Stratification: wife's labor status, husband's first-order support, presence of children <6
Treatment: WhatsApp chatbot with actual Bogotá-level support; Control: placebo on green transport

Experimental Design · Two Arms

What Each Arm Saw — One Norm, Two Topics

Treatment (866 couples)

"Mothers of children under six should be free to work for pay outside the home."

Gender-norm statement (target).

Control (866 couples)

"Companies should subsidize public transport."

Placebo norm — unrelated topic.

Same chatbot, same four steps, same schedule. The only difference is the norm each arm sees. Any T–C gap in downstream behavior is attributable to corrected beliefs about gender norms.

Experimental Design · Treatment Chatbot

Treatment Arm — Personalized Feedback on the Gender Norm

N

Norms Studyonline

Hello 👋 In the baseline survey you answered a question about the statement:

"Mothers of children under six should be free to work for pay outside the home."10:02

You estimated that out of 100 fathers in Bogotá, 60 agree with this statement.10:02

Do you think your estimate matches the true share in Bogotá?10:02

Yes • No • Not sure

No10:03 ✓✓

Actual share in Bogotá (baseline data):

Fathers

89 / 100

Mothers

91 / 100

In fact, 89 out of 100 fathers and 91 out of 100 mothers in Bogotá agree with the statement.10:03

How does this information feel to you?10:03

Interesting • Irrelevant • Disappointing

Treatment arm — gender norm

The four steps (as in the paper)

Step 1 — Recall: shows the respondent's own baseline estimate of fathers' / mothers' agreement
Step 2 — Check: asks whether that estimate matches reality
Step 3 — Reveal: shows the actual share computed from the baseline (as a WhatsApp message with numbers and emojis)
Step 4 — Rate: asks the respondent to rate the discrepancy — interesting / irrelevant / disappointing

The same four steps are repeated for beliefs about men's and women's support, with order randomized.

Experimental Design · Treatment — The Reveal

What Treated Respondents Saw at Step 3 — The Figure

Share of fathers in Bogotá who agree with the statement

"Mothers of children under six should be free to work for pay outside the home."

Your estimate(from the baseline survey)

60%

60 / 100

Actual share(measured in Bogotá)

89%

89 / 100

+29 pp — fathers in Bogotá support working mothers much more than respondents believe

Repeated for mothers: a second pair of bars shows the respondent's estimate for mothers (≈80%) and the actual (≈91%). Order randomized across respondents.

Experimental Design · Control Chatbot

Control Arm — Same Structure, Placebo Norm

N

Norms Studyonline

Hello 👋 In the baseline survey you answered a question about the statement:

"Companies should subsidize public transport."10:02

You estimated that out of 100 people in Bogotá, 75 agree with this statement.10:02

Do you think your estimate matches the true share in Bogotá?10:02

Yes • No • Not sure

Not sure10:03 ✓✓

Actual share in Bogotá (baseline data):

Men

94 / 100

Women

95 / 100

In fact, 94 out of 100 men and 95 out of 100 women in Bogotá agree.10:03

How does this information feel to you?10:03

Interesting • Irrelevant • Disappointing

Control arm — placebo norm

Why this placebo

Same channel (WhatsApp chatbot), same format, same four-step sequence, same schedule
Unrelated topic: attitudes toward corporate subsidies for public transport
Shares the belief-elicitation mechanics without addressing gender norms

Identification: the T – C contrast isolates the effect of correcting beliefs about maternal employment, net of any attention, engagement, or framing effect from the chatbot itself.

Experimental Design · Stage 3

Stage 3 — Midline: WhatsApp Engagement

Delivery: WhatsApp chatbot with 4 interactive steps (Sep–Oct 2024)
Engagement: only ~29% of couples completed the interaction (501 of 1,732 T; 518 of 1,732 C)
Note: low engagement suggests digital interventions face uptake barriers in this population

Experimental Design · Stage 4 · Follow-up

After the Chatbot — Course Decision & Endline

Course nomination (end of WhatsApp)

One real online career course per household — keep it or give to partner
Zero-sum choice with direct personal cost — revealed preference

Endline phone survey (1–2 months later)

1,382 of 3,464 reached (≈40%)
Beliefs, job search, work–family balance, aspirations
Treatment info re-administered; measured before reinforcement

Section III

Baseline Facts

Near-universal private support — and a 28 pp misperception

Research Agenda Experimental Design Baseline Facts Results Heterogeneity Conclusions Next: YouTube

Baseline Facts · Sample

Individual Attributes — Large Gender Gap in LFP

Variable	Husbands	Wives	Δ
Demographics
Age (years)	34.9	32.0	2.8***
Education
Low	14.3%	10.8%	3.5***
Medium	69.5%	71.1%	−1.6
High	16.2%	18.1%	−2.0
Employment Status
Employed	90.5%	52.0%	38.5***
Unemployed	5.0%	6.3%	−1.3
Inactive	4.5%	41.7%	−37.2***
Weekly hours	48.7	37.6	11.1***
Job Flexibility
High	23.6%	33.0%	−9.4***
Some	27.2%	31.5%	−4.3**
None	48.9%	35.1%	13.8***
Job Search
Looking for job	10.6%	16.2%	−5.5***
Start business	9.1%	7.2%	1.9**
Would like to	49.3%	51.8%	−2.5
Satisfied	31.0%	24.9%	6.2***

Baseline Facts · Household

Household Attributes & Income Distribution

Characteristic	Sample Composition
Household Size & Composition
Average household size	3.8 members
Children under 6 per HH	1.13
HH with child <6 not in childcare	27.6%
HH with member needing permanent care	32.0%
Household Income Category
Low income (<1.3M COP)	28%	~$6,200 USD
Middle income (1.3–3.9M COP)	60%	~$6,200–$18,600 USD
High income (>3.9M COP)	12%	>$18,600 USD
Sample Composition
Total households	1,732
Total individuals	3,464 (1,732 couples)

Baseline Facts · Pluralistic Ignorance

Pluralistic Ignorance: Everyone Supports, Everyone Thinks Others Don't

👨 Husbands' Misperception

Fathers' support misperception: −27.5 pp
Actual: 88.5% | Perceived: 60.98%

Mothers' support misperception: −10.9 pp
Actual: 90.5% | Perceived: 79.61%

👩 Wives' Misperception

Fathers' support misperception: −32.8 pp
Actual: 88.5% | Perceived: 55.70%

Mothers' support misperception: −10.5 pp
Actual: 90.5% | Perceived: 80.01%

💑 Spousal Misperceptions (Within-Couple)

Husbands underestimate wives: −3.4 pp*
Husband thinks: 93.9% | Wife actual: 90.5%

Wives underestimate husbands: −1.4 pp*
Wife thinks: 89.9% | Husband actual: 88.5%
*optimistic (overestimate)

🚌 Placebo Norm (Green Transport)

Husband support: 93.5%
Wife support: 94.9%

NO MISPERCEPTION
High consensus, no gender gap

Pluralistic Ignorance Pattern: Both spouses privately support maternal employment, but dramatically underestimate community support for fathers (−27.5 to −32.8 pp). Misperception about mothers' support is much smaller (−10.5 pp). Within-couple, spouses are optimistic (slightly overestimate each other's support). The community-level friction dominates, not spousal. Placebo shows this is gender-norm-specific.

Baseline Facts · Beliefs

Baseline Beliefs: Target Norm & Placebo

Belief Type	Husbands	Wives	Difference
A. Target Norm: "Mothers of children <6 should be free to work"
First-order (own view)	88.5%	90.5%	−2.0 pp**
Second-order: Men (estimate of fathers)	61.0%	55.7%	+5.3 pp***
Second-order: Women (estimate of mothers)	79.6%	80.0%	−0.4 pp
Spousal second-order	93.9%	89.9%	+4.1 pp***
B. Placebo Norm: "Companies should subsidize public transport"
First-order (own view)	93.5%	94.9%	−1.4 pp***

N = 1,732 couples. High first-order support for both norms (88–95%). Misperception concentrated on father's support for maternal employment (gap: 27–33 pp). Placebo norm shows no such gap.

Baseline Facts · Figure

Both Sexes Underestimate Men's Support by ~20–30 pp

Men's support misperception distribution

Distribution of first-order beliefs vs. second-order beliefs about men's support for maternal employment. True share = 89%. Perceived share peaks at ~60%.

Baseline Facts · Figure

Women's Perceived Support: Smaller Gap, Similar Pattern

Women's support misperception distribution

Distribution of first-order beliefs vs. second-order beliefs about women's support. True share ≈ 91%. Perceived support still understated, but gap is smaller (~10 pp).

Section IV

Empirical Strategy

IPWRA · Attrition correction · Multiple testing

Research Agenda Experimental Design Baseline Facts Empirical Strategy Results Heterogeneity Conclusions

Empirical Strategy · IPWRA

Addressing Two Sources of Non-Random Selection

Target estimand — ATT (Average Treatment effect on the Treated):

ATT = E[ Y_i(1) − Y_i(0) │ D_i = 1 ] ⟸ CATE: τ(x) = E[ Y(1) − Y(0) │ X = x ]

Baseline regression (Eq. 1): y_i = β₀ + β₁D_i + ρ y_i0 + X_i'γ + ε_i

β₁ = ATT (within strata, clustered SE at household, Fisher exact p-values).

Problem 1 — Attrition: not all baseline respondents observed in midline/endline → may threaten internal validity if selection correlates with potential outcomes.

Problem 2 — Selective engagement: only a subset of treated respondents engages with the WhatsApp module → covariates can become imbalanced within the realized sample.

Step 1 — Selection model. For each sample S ∈ {midline, endline, both, endline-only}, estimate via probit:

p_i^S = Pr(S_i = 1 │ D_i, X_i) ⟹ w_i^S = Pr(S=1) / p̂_i^S

X_i ≈ 50 baseline covariates (demographics, household, labor, beliefs, strata). Weights stabilized around 1; estimated separately by gender.

Step 2 — IPWRA within each sample S (doubly robust ATT):

ATT̂ = (1/N_T^w) Σ_i w_i^S [ D_i(Y_i − m̂₀) + (1−D_i)·(ê_i/(1−ê_i))·(m̂₁ − m̂₀) ]

e(·) = treatment model Pr(D=1│X^D, strata); m_d(·) = outcome model E[Y│D=d, X^D, strata]. Step 1 weights enter as pweights. Consistent if either e(·) or m_d(·) is correctly specified.

Robustness (4 weight specs): baseline probit-PS · winsorised p95 · trimmed (drop PS < 0.10) · logit PS — Appendix Table A.IPWRAsens.
Inference: Fisher exact (randomization) p-values primary; Romano-Wolf step-down for outcome families; Lee (2009) sharp bounds; near-miss timing diagnostic — these validate the IPW correction, not separate tests.
Reference-group accuracy: the disclosed Bogotá-average norm must reflect the engager subpopulation's actual reference group. Engagers vs. non-engagers hold virtually identical priors on the targeted second-order belief (58.1 vs. 58.6, p > 0.5). Maximum subgroup deviation from city-wide mean is 3.3 pp — less than 20% of the 28 pp misperception being corrected, and the sign is conservative.

Section V

Results

Community beliefs · Spousal beliefs · Course allocation · Labor market outcomes

Research Agenda Experimental Design Baseline Facts Empirical Strategy Results Heterogeneity Conclusions

Results · Research Questions