Add TROP estimator #72

igerber · 2026-01-18T13:16:21Z

Addition of TROP estimator based on https://arxiv.org/abs/2508.21536

Implements the TROP estimator from Athey, Imbens, Qu & Viviano (2025) following the exact paper methodology: - Nuclear norm regularized factor model (interactive fixed effects) - Exponential distance-based unit weights: ω_j = exp(-λ_unit × d(j,i)) - Exponential time decay weights: θ_s = exp(-λ_time × |s-t|) - LOOCV tuning parameter selection over (λ_time, λ_unit, λ_nn) grid - Bootstrap and jackknife variance estimation New files: - diff_diff/trop.py: TROP class and TROPResults dataclass - tests/test_trop.py: 23 tests covering all functionality - docs/tutorials/10_trop.ipynb: Tutorial notebook Updated files: - diff_diff/__init__.py: Add TROP exports, bump version to 2.1.0 - pyproject.toml: Bump version to 2.1.0 - README.md: Add TROP documentation and API reference - CLAUDE.md: Add module documentation - CHANGELOG.md: Add v2.1.0 release notes Reference: https://arxiv.org/abs/2508.21536 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Key changes to match the reference implementation: 1. Unit distance computation: Now computes RMSE from the average of treated units over pre-treatment periods, matching the paper's Equation 3. 2. Time distance computation: Centers weights around the treatment period midpoint (T - T_treat/2) rather than individual observations. 3. Global weight matrix: Computes weights once as outer product of unit and time weights, matching reference implementation. 4. Numerical stability improvements: - Truncated SVD reconstruction using only non-zero singular values - Input/output sanitization for NaN/Inf values - Suppress expected numerical warnings during ill-conditioned matmul - Proper handling of edge cases in soft-thresholding All 23 tests pass with no warnings. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Include the Athey et al. (2025) TROP paper and a detailed analysis document comparing the reference implementation to the paper. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Review identifies critical methodology deviations from the paper: - Unit distance computed from treated average instead of pairwise - Time distance from treatment center instead of specific period - Global weights instead of observation-specific weights per (i,t) - Algorithm structure differs from paper's Algorithm 2 Recommends changes before merging to preserve triple robustness property.

This commit addresses the methodology review and aligns the TROP implementation with the paper's specification (Athey, Imbens, Qu & Viviano 2025): **Algorithm Changes:** - Restructured fit() to follow Algorithm 2: for each treated (i,t), compute observation-specific weights, fit model, compute τ̂_{it}, then average - Changed unit distance from "average of treated" to pairwise RMSE between each control unit j and specific treated unit i (Equation 3) - Changed time distance from |s - (T - T_treat/2)| to simple |t - s| where t is the specific treatment period (Equation 3) - Added _compute_observation_weights() for per-(i,t) weight matrices - Updated _loocv_score_obs_specific() to use observation-specific weights - Updated bootstrap and jackknife variance methods accordingly **New Methodology Tests:** - test_limiting_case_uniform_weights: λ_unit = λ_time = λ_nn = 0 gives TWFE-like - test_unit_weights_reduce_bias: unit weighting helps with heterogeneous controls - test_time_weights_reduce_bias: time weighting helps with trending data - test_factor_model_reduces_bias: nuclear norm helps with factor structure - test_paper_dgp_recovery: validates against paper's Table 2 simulation DGP **Performance Note:** The per-observation model fitting is more computationally intensive but provides the triple robustness property described in Theorem 5.1. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Create docs/api/trop.rst with full API documentation - Add TROP and TROPResults to docs/api/index.rst - Include algorithm description, tuning parameters, and usage examples - Reference paper (Athey, Imbens, Qu & Viviano 2025) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

- Add 'treat' column (unit-level ever-treated indicator) to generate_factor_dgp() - Update SDID calls to use 'treat' instead of 'treated' - TROP continues to use 'treated' (observation-level indicator) - Add clarifying comments explaining the difference The tutorial now correctly demonstrates TROP vs SDID comparison, showing TROP's advantage under factor confounding (bias 0.08 vs 1.31). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

igerber and others added 4 commits January 17, 2026 18:44

Add TROP paper and implementation analysis PDF

ea8b6ac

Include the Athey et al. (2025) TROP paper and a detailed analysis document comparing the reference implementation to the paper. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Remove TROP implementation analysis PDF

34d9e41

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

igerber and others added 3 commits January 18, 2026 10:13

igerber merged commit 57fd4dc into main Jan 18, 2026
4 checks passed

igerber deleted the claude/add-trop-estimator branch January 18, 2026 16:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add TROP estimator #72

Add TROP estimator #72

igerber commented Jan 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add TROP estimator #72

Add TROP estimator #72

Conversation

igerber commented Jan 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants