diff --git a/CHANGELOG.md b/CHANGELOG.md
index e93f4df..d5bb0a8 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -5,6 +5,24 @@ All notable changes to this project will be documented in this file.
 The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
 and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
 
+## [2.1.0] - 2026-01-17
+
+### Added
+- **Triply Robust Panel (TROP) estimator** implementing Athey, Imbens, Qu & Viviano (2025)
+  - `TROP` class combining three robustness components:
+    - Nuclear norm regularized factor model (removes unobserved confounders with a factor structure)
+    - Exponential distance-based unit weights
+    - Exponential time decay weights
+  - `TROPResults` dataclass with ATT, factor matrix, unit/time fixed effects, and individual treatment effects
+  - `trop()` convenience function for quick estimation
+  - Automatic tuning parameter selection (λ_time, λ_unit, λ_nn) via leave-one-out cross-validation (LOOCV)
+  - Bootstrap and jackknife variance estimation
+  - Full integration with existing infrastructure (exports in `__init__.py`, sklearn-compatible API)
+  - Tutorial notebook: `docs/tutorials/10_trop.ipynb`
+  - Comprehensive test suite: `tests/test_trop.py`
+
+**Reference**: Athey, S., Imbens, G. W., Qu, Z., & Viviano, D. (2025). "Triply Robust Panel Estimators." *Working Paper*. [arXiv:2508.21536](https://arxiv.org/abs/2508.21536)
+
 ## [2.0.3] - 2026-01-17
 
 ### Changed
@@ -392,6 +410,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 - `to_dict()` and `to_dataframe()` export methods
 - `is_significant` and `significance_stars` properties
 
+[2.1.0]: https://github.com/igerber/diff-diff/compare/v2.0.3...v2.1.0
 [2.0.3]: https://github.com/igerber/diff-diff/compare/v2.0.2...v2.0.3
 [2.0.2]: https://github.com/igerber/diff-diff/compare/v2.0.1...v2.0.2
 [2.0.1]: https://github.com/igerber/diff-diff/compare/v2.0.0...v2.0.1
diff --git a/CLAUDE.md b/CLAUDE.md
index 2f39e2a..9d7e785 100644
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -89,6 +89,14 @@ pytest tests/test_rust_backend.py -v
 - Regression adjustment, IPW, and doubly robust estimation methods
 - Proper covariate handling (unlike naive DDD implementations)
 
+- **`diff_diff/trop.py`** - Triply Robust Panel (TROP) estimator (v2.1.0):
+  - `TROP` - Athey, Imbens, Qu & Viviano (2025) estimator with factor model adjustment
+  - `TROPResults` - Results with ATT, factor matrix, unit/time fixed effects, and individual treatment effects
+  - `trop()` - Convenience function for quick estimation
+  - Three robustness components: nuclear norm factor adjustment, exponential unit weights, exponential time weights
+  - Automatic tuning parameter selection via leave-one-out cross-validation (LOOCV)
+  - Bootstrap and jackknife variance estimation
+
 - **`diff_diff/bacon.py`** - Goodman-Bacon decomposition for TWFE diagnostics:
   - `BaconDecomposition` - Decompose TWFE into weighted 2x2 comparisons (Goodman-Bacon 2021)
   - `BaconDecompositionResults` - Results with comparison weights and estimates by type
@@ -250,6 +258,7 @@ See `docs/performance-plan.md` for full optimization details and `docs/benchmark
   - `07_pretrends_power.ipynb` - Pre-trends power analysis (Roth 2022), MDV, power curves
   - `08_triple_diff.ipynb` - Triple Difference (DDD) estimation with proper covariate handling
   - `09_real_world_examples.ipynb` - Real-world data examples (Card-Krueger, Castle Doctrine, Divorce Laws)
+  - `10_trop.ipynb` - Triply Robust Panel (TROP) estimation with factor model adjustment
 
 ### Benchmarks
 
@@ -282,6 +291,7 @@ Tests mirror the source modules:
 - `tests/test_staggered.py` - Tests for CallawaySantAnna
 - `tests/test_sun_abraham.py` - Tests for SunAbraham interaction-weighted estimator
 - `tests/test_triple_diff.py` - Tests for Triple Difference (DDD) estimator
+- `tests/test_trop.py` - Tests for Triply Robust Panel (TROP) estimator
 - `tests/test_bacon.py` - Tests for Goodman-Bacon decomposition
 - `tests/test_linalg.py` - Tests for unified OLS backend, robust variance estimation, LinearRegression helper, and InferenceResult
 - `tests/test_utils.py` - Tests for parallel trends, robust SE, synthetic weights
diff --git a/README.md b/README.md
index a3faf69..dbb769e 100644
--- a/README.md
+++ b/README.md
@@ -73,6 +73,7 @@ Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1
 - **Staggered adoption**: Callaway-Sant'Anna (2021) and Sun-Abraham (2021) estimators for heterogeneous treatment timing
 - **Triple Difference (DDD)**: Ortiz-Villavicencio & Sant'Anna (2025) estimators with proper covariate handling
 - **Synthetic DiD**: Combined DiD with synthetic control for improved robustness
+- **Triply Robust Panel (TROP)**: Factor-adjusted DiD with exponential unit and time weights (Athey et al. 2025)
 - **Event study plots**: Publication-ready visualization of treatment effects
 - **Parallel trends testing**: Multiple methods including equivalence tests
 - **Goodman-Bacon decomposition**: Diagnose TWFE bias by decomposing into 2x2 comparisons
@@ -98,6 +99,7 @@ We provide Jupyter notebook tutorials in `docs/tutorials/`:
 | `07_pretrends_power.ipynb` | Pre-trends power analysis (Roth 2022), MDV, power curves |
 | `08_triple_diff.ipynb` | Triple Difference (DDD) estimation with proper covariate handling |
 | `09_real_world_examples.ipynb` | Real-world data examples (Card-Krueger, Castle Doctrine, Divorce Laws) |
+| `10_trop.ipynb` | Triply Robust Panel (TROP) estimation with factor model adjustment |
 
 ## Data Preparation
 
@@ -1115,6 +1117,179 @@ SyntheticDiD(
 )
 ```
 
+### Triply Robust Panel (TROP)
+
+TROP (Athey, Imbens, Qu & Viviano 2025) extends Synthetic DiD by adding interactive fixed effects (factor model) adjustment. It is particularly useful when unobserved time-varying confounders with a factor structure could bias standard DiD or SDID estimates.
+
+TROP combines three robustness components:
+1. **Nuclear norm regularized factor model**: Estimates interactive fixed effects L_it via soft-thresholding
+2. **Exponential distance-based unit weights**: ω_j = exp(-λ_unit × distance(j,i))
+3. **Exponential time decay weights**: θ_s = exp(-λ_time × |s-t|)
+
+Tuning parameters are selected via leave-one-out cross-validation (LOOCV).
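+
+As a toy illustration of the weighting scheme (a standalone sketch, not the library's internal API; the lambda values and distances below are made up), the two exponential weight families look like this:
+
+```python
+import numpy as np
+
+# Hypothetical tuning values and a target treatment period t = 5
+lambda_time, lambda_unit = 1.0, 0.5
+periods = np.arange(6)                                      # s = 0, ..., 5
+time_weights = np.exp(-lambda_time * np.abs(periods - 5))   # θ_s = exp(-λ_time × |s-t|)
+
+# d(j,i): RMSE of outcome differences between control j and treated unit i
+distances = np.array([0.2, 0.8, 1.5])                       # made-up distances for 3 controls
+unit_weights = np.exp(-lambda_unit * distances)             # ω_j = exp(-λ_unit × d(j,i))
+```
+
+Closer periods and more similar control units receive larger weights, and setting a λ to 0 yields uniform weights. In the estimator these weights are recomputed for every treated observation (Algorithm 2 in the paper).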
+ +```python +from diff_diff import TROP, trop + +# Fit TROP model with automatic tuning via LOOCV +trop_est = TROP( + lambda_time_grid=[0.0, 0.5, 1.0, 2.0], # Time decay grid + lambda_unit_grid=[0.0, 0.5, 1.0, 2.0], # Unit distance grid + lambda_nn_grid=[0.0, 0.1, 1.0], # Nuclear norm grid + n_bootstrap=200 +) +results = trop_est.fit( + panel_data, + outcome='gdp_growth', + treatment='treated', + unit='state', + time='year', + post_periods=[2015, 2016, 2017, 2018] +) + +# View results +results.print_summary() +print(f"ATT: {results.att:.3f} (SE: {results.se:.3f})") +print(f"Effective rank: {results.effective_rank:.2f}") + +# Selected tuning parameters +print(f"λ_time: {results.lambda_time:.2f}") +print(f"λ_unit: {results.lambda_unit:.2f}") +print(f"λ_nn: {results.lambda_nn:.2f}") + +# Examine unit effects +unit_effects = results.get_unit_effects_df() +print(unit_effects.head(10)) +``` + +Output: +``` +=========================================================================== + Triply Robust Panel (TROP) Estimation Results + Athey, Imbens, Qu & Viviano (2025) +=========================================================================== + +Observations: 500 +Treated units: 1 +Control units: 49 +Treated observations: 4 +Pre-treatment periods: 6 +Post-treatment periods: 4 + +--------------------------------------------------------------------------- + Tuning Parameters (selected via LOOCV) +--------------------------------------------------------------------------- +Lambda (time decay): 1.0000 +Lambda (unit distance): 0.5000 +Lambda (nuclear norm): 0.1000 +Effective rank: 2.35 +LOOCV score: 0.012345 +Variance method: bootstrap +Bootstrap replications: 200 + +--------------------------------------------------------------------------- +Parameter Estimate Std. Err. t-stat P>|t| +--------------------------------------------------------------------------- +ATT 2.5000 0.3892 6.424 0.0000 *** +--------------------------------------------------------------------------- + +95% Confidence Interval: [1.7372, 3.2628] + +Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1 +=========================================================================== +``` + +#### When to Use TROP Over Synthetic DiD + +Use TROP when you suspect **factor structure** in the data—unobserved confounders that affect outcomes differently across units and time: + +| Scenario | Use SDID | Use TROP | +|----------|----------|----------| +| Simple parallel trends | ✓ | ✓ | +| Unobserved factors (e.g., economic cycles) | May be biased | ✓ | +| Strong unit-time interactions | May be biased | ✓ | +| Low-dimensional confounding | ✓ | ✓ | + +**Example scenarios where TROP excels:** +- Regional economic shocks that affect states differently based on industry composition +- Global trends that impact countries differently based on their economic structure +- Common factors in financial data (market risk, interest rates, etc.) + +**How TROP works:** + +1. **Factor estimation**: Estimates interactive fixed effects L_it using nuclear norm regularization (encourages low-rank structure) +2. **Unit weights**: Exponential distance-based weighting ω_j = exp(-λ_unit × d(j,i)) where d(j,i) is the RMSE of outcome differences +3. **Time weights**: Exponential decay weighting θ_s = exp(-λ_time × |s-t|) based on proximity to treatment +4. 
**ATT computation**: τ = Y_it - α_i - β_t - L_it for treated observations + +```python +# Compare TROP vs SDID under factor confounding +from diff_diff import SyntheticDiD + +# Synthetic DiD (may be biased with factors) +sdid = SyntheticDiD() +sdid_results = sdid.fit(data, outcome='y', treatment='treated', + unit='unit', time='time', post_periods=[5,6,7]) + +# TROP (accounts for factors) +trop_est = TROP() # Uses default grids with LOOCV selection +trop_results = trop_est.fit(data, outcome='y', treatment='treated', + unit='unit', time='time', post_periods=[5,6,7]) + +print(f"SDID estimate: {sdid_results.att:.3f}") +print(f"TROP estimate: {trop_results.att:.3f}") +print(f"Effective rank: {trop_results.effective_rank:.2f}") +``` + +**Tuning parameter grids:** + +```python +# Custom tuning grids (searched via LOOCV) +trop = TROP( + lambda_time_grid=[0.0, 0.1, 0.5, 1.0, 2.0, 5.0], # Time decay + lambda_unit_grid=[0.0, 0.1, 0.5, 1.0, 2.0, 5.0], # Unit distance + lambda_nn_grid=[0.0, 0.01, 0.1, 1.0, 10.0] # Nuclear norm +) + +# Fixed tuning parameters (skip LOOCV search) +trop = TROP( + lambda_time_grid=[1.0], # Single value = fixed + lambda_unit_grid=[1.0], # Single value = fixed + lambda_nn_grid=[0.1] # Single value = fixed +) +``` + +**Parameters:** + +```python +TROP( + lambda_time_grid=None, # Time decay grid (default: [0, 0.1, 0.5, 1, 2, 5]) + lambda_unit_grid=None, # Unit distance grid (default: [0, 0.1, 0.5, 1, 2, 5]) + lambda_nn_grid=None, # Nuclear norm grid (default: [0, 0.01, 0.1, 1, 10]) + max_iter=100, # Max iterations for factor estimation + tol=1e-6, # Convergence tolerance + alpha=0.05, # Significance level + variance_method='bootstrap', # 'bootstrap' or 'jackknife' + n_bootstrap=200, # Bootstrap replications + seed=None # Random seed +) +``` + +**Convenience function:** + +```python +# One-liner estimation with default tuning grids +results = trop( + data, + outcome='y', + treatment='treated', + unit='unit', + time='time', + post_periods=[5, 6, 7], + n_bootstrap=200 +) +``` + ## Working with Results ### Export Results @@ -1680,6 +1855,74 @@ SyntheticDiD( | `get_unit_weights_df()` | Get unit weights as DataFrame | | `get_time_weights_df()` | Get time weights as DataFrame | +### TROP + +```python +TROP( + lambda_time_grid=None, # Time decay grid (default: [0, 0.1, 0.5, 1, 2, 5]) + lambda_unit_grid=None, # Unit distance grid (default: [0, 0.1, 0.5, 1, 2, 5]) + lambda_nn_grid=None, # Nuclear norm grid (default: [0, 0.01, 0.1, 1, 10]) + max_iter=100, # Max iterations for factor estimation + tol=1e-6, # Convergence tolerance + alpha=0.05, # Significance level for CIs + variance_method='bootstrap', # 'bootstrap' or 'jackknife' + n_bootstrap=200, # Bootstrap/jackknife iterations + seed=None # Random seed +) +``` + +**fit() Parameters:** + +| Parameter | Type | Description | +|-----------|------|-------------| +| `data` | DataFrame | Panel data | +| `outcome` | str | Outcome variable column name | +| `treatment` | str | Treatment indicator column (0/1) | +| `unit` | str | Unit identifier column | +| `time` | str | Time period column | +| `post_periods` | list | List of post-treatment period values | + +### TROPResults + +**Attributes:** + +| Attribute | Description | +|-----------|-------------| +| `att` | Average Treatment effect on the Treated | +| `se` | Standard error (bootstrap or jackknife) | +| `t_stat` | T-statistic | +| `p_value` | P-value | +| `conf_int` | Confidence interval | +| `n_obs` | Number of observations | +| `n_treated` | Number of treated units | +| `n_control` | 
Number of control units |
+| `n_treated_obs` | Number of treated unit-time observations |
+| `unit_effects` | Dict mapping unit IDs to fixed effects |
+| `time_effects` | Dict mapping periods to fixed effects |
+| `treatment_effects` | Dict mapping (unit, time) to individual effects |
+| `lambda_time` | Selected time decay parameter |
+| `lambda_unit` | Selected unit distance parameter |
+| `lambda_nn` | Selected nuclear norm parameter |
+| `factor_matrix` | Low-rank factor matrix L (n_periods x n_units) |
+| `effective_rank` | Effective rank of factor matrix |
+| `loocv_score` | LOOCV score for selected parameters |
+| `pre_periods` | List of pre-treatment periods |
+| `post_periods` | List of post-treatment periods |
+| `variance_method` | Variance estimation method |
+| `bootstrap_distribution` | Bootstrap distribution (if bootstrap) |
+
+**Methods:**
+
+| Method | Description |
+|--------|-------------|
+| `summary(alpha)` | Get formatted summary string |
+| `print_summary(alpha)` | Print summary to stdout |
+| `to_dict()` | Convert to dictionary |
+| `to_dataframe()` | Convert to pandas DataFrame |
+| `get_unit_effects_df()` | Get unit fixed effects as DataFrame |
+| `get_time_effects_df()` | Get time fixed effects as DataFrame |
+| `get_treatment_effects_df()` | Get individual treatment effects as DataFrame |
+
 ### SunAbraham
 
 ```python
@@ -2154,6 +2397,17 @@ This library implements methods from the following scholarly works:
 
 - **Arkhangelsky, D., Athey, S., Hirshberg, D. A., Imbens, G. W., & Wager, S. (2021).** "Synthetic Difference-in-Differences." *American Economic Review*, 111(12), 4088-4118. [https://doi.org/10.1257/aer.20190159](https://doi.org/10.1257/aer.20190159)
 
+### Triply Robust Panel (TROP)
+
+- **Athey, S., Imbens, G. W., Qu, Z., & Viviano, D. (2025).** "Triply Robust Panel Estimators." *Working Paper*. [https://arxiv.org/abs/2508.21536](https://arxiv.org/abs/2508.21536)
+
+  This paper introduces the TROP estimator, which combines three robustness components:
+  - **Factor model adjustment**: Low-rank factor structure, estimated via nuclear norm regularized SVD, removes unobserved confounders
+  - **Unit weights**: Exponential distance-based weighting toward comparable control units
+  - **Time weights**: Exponential time decay weighting toward informative pre-treatment periods
+
+  TROP is particularly useful when there are unobserved time-varying confounders with a factor structure that affect different units differently over time.
+
 ### Triple Difference (DDD)
 
 - **Ortiz-Villavicencio, M., & Sant'Anna, P. H. C. (2025).** "Better Understanding Triple Differences Estimators." *Working Paper*. 
[https://arxiv.org/abs/2505.09942](https://arxiv.org/abs/2505.09942) diff --git a/TROP-ref/2508.21536v2.pdf b/TROP-ref/2508.21536v2.pdf new file mode 100644 index 0000000..6d21e05 Binary files /dev/null and b/TROP-ref/2508.21536v2.pdf differ diff --git a/diff_diff/__init__.py b/diff_diff/__init__.py index f2cf888..8cb9065 100644 --- a/diff_diff/__init__.py +++ b/diff_diff/__init__.py @@ -100,6 +100,11 @@ TripleDifferenceResults, triple_difference, ) +from diff_diff.trop import ( + TROP, + TROPResults, + trop, +) from diff_diff.utils import ( WildBootstrapResults, check_parallel_trends, @@ -126,7 +131,7 @@ load_mpdta, ) -__version__ = "2.0.4" +__version__ = "2.1.0" __all__ = [ # Estimators "DifferenceInDifferences", @@ -136,6 +141,7 @@ "CallawaySantAnna", "SunAbraham", "TripleDifference", + "TROP", # Bacon Decomposition "BaconDecomposition", "BaconDecompositionResults", @@ -154,6 +160,8 @@ "SABootstrapResults", "TripleDifferenceResults", "triple_difference", + "TROPResults", + "trop", # Visualization "plot_event_study", "plot_group_effects", diff --git a/diff_diff/trop.py b/diff_diff/trop.py new file mode 100644 index 0000000..df71028 --- /dev/null +++ b/diff_diff/trop.py @@ -0,0 +1,1348 @@ +""" +Triply Robust Panel (TROP) estimator. + +Implements the TROP estimator from Athey, Imbens, Qu & Viviano (2025). +TROP combines three robustness components: +1. Nuclear norm regularized factor model (interactive fixed effects) +2. Exponential distance-based unit weights +3. Exponential time decay weights + +The estimator uses leave-one-out cross-validation for tuning parameter +selection and provides robust treatment effect estimates under factor +confounding. + +References +---------- +Athey, S., Imbens, G. W., Qu, Z., & Viviano, D. (2025). Triply Robust Panel +Estimators. *Working Paper*. https://arxiv.org/abs/2508.21536 +""" + +import warnings +from dataclasses import dataclass, field +from typing import Any, Dict, List, Optional, Tuple, Union + +import numpy as np +import pandas as pd +from scipy import stats + +from diff_diff.results import _get_significance_stars +from diff_diff.utils import compute_confidence_interval, compute_p_value + + +@dataclass +class TROPResults: + """ + Results from a Triply Robust Panel (TROP) estimation. + + TROP combines nuclear norm regularized factor estimation with + exponential distance-based unit weights and time decay weights. + + Attributes + ---------- + att : float + Average Treatment effect on the Treated (ATT). + se : float + Standard error of the ATT estimate. + t_stat : float + T-statistic for the ATT estimate. + p_value : float + P-value for the null hypothesis that ATT = 0. + conf_int : tuple[float, float] + Confidence interval for the ATT. + n_obs : int + Number of observations used in estimation. + n_treated : int + Number of treated units. + n_control : int + Number of control units. + n_treated_obs : int + Number of treated unit-time observations. + unit_effects : dict + Estimated unit fixed effects (alpha_i). + time_effects : dict + Estimated time fixed effects (beta_t). + treatment_effects : dict + Individual treatment effects for each treated (unit, time) pair. + lambda_time : float + Selected time weight decay parameter. + lambda_unit : float + Selected unit weight decay parameter. + lambda_nn : float + Selected nuclear norm regularization parameter. + factor_matrix : np.ndarray + Estimated low-rank factor matrix L (n_periods x n_units). + effective_rank : float + Effective rank of the factor matrix (sum of singular values / max). 
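+        Computed as ``sum(s) / s[0]``, where ``s`` are the singular values of L.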
+ loocv_score : float + Leave-one-out cross-validation score for selected parameters. + variance_method : str + Method used for variance estimation. + alpha : float + Significance level for confidence interval. + pre_periods : list + List of pre-treatment period identifiers. + post_periods : list + List of post-treatment period identifiers. + n_bootstrap : int, optional + Number of bootstrap replications (if bootstrap variance). + bootstrap_distribution : np.ndarray, optional + Bootstrap distribution of estimates. + """ + + att: float + se: float + t_stat: float + p_value: float + conf_int: Tuple[float, float] + n_obs: int + n_treated: int + n_control: int + n_treated_obs: int + unit_effects: Dict[Any, float] + time_effects: Dict[Any, float] + treatment_effects: Dict[Tuple[Any, Any], float] + lambda_time: float + lambda_unit: float + lambda_nn: float + factor_matrix: np.ndarray + effective_rank: float + loocv_score: float + variance_method: str + alpha: float = 0.05 + pre_periods: List[Any] = field(default_factory=list) + post_periods: List[Any] = field(default_factory=list) + n_bootstrap: Optional[int] = field(default=None) + bootstrap_distribution: Optional[np.ndarray] = field(default=None, repr=False) + + def __repr__(self) -> str: + """Concise string representation.""" + sig = _get_significance_stars(self.p_value) + return ( + f"TROPResults(ATT={self.att:.4f}{sig}, " + f"SE={self.se:.4f}, " + f"eff_rank={self.effective_rank:.1f}, " + f"p={self.p_value:.4f})" + ) + + def summary(self, alpha: Optional[float] = None) -> str: + """ + Generate a formatted summary of the estimation results. + + Parameters + ---------- + alpha : float, optional + Significance level for confidence intervals. Defaults to the + alpha used during estimation. + + Returns + ------- + str + Formatted summary table. + """ + alpha = alpha or self.alpha + conf_level = int((1 - alpha) * 100) + + lines = [ + "=" * 75, + "Triply Robust Panel (TROP) Estimation Results".center(75), + "Athey, Imbens, Qu & Viviano (2025)".center(75), + "=" * 75, + "", + f"{'Observations:':<25} {self.n_obs:>10}", + f"{'Treated units:':<25} {self.n_treated:>10}", + f"{'Control units:':<25} {self.n_control:>10}", + f"{'Treated observations:':<25} {self.n_treated_obs:>10}", + f"{'Pre-treatment periods:':<25} {len(self.pre_periods):>10}", + f"{'Post-treatment periods:':<25} {len(self.post_periods):>10}", + "", + "-" * 75, + "Tuning Parameters (selected via LOOCV)".center(75), + "-" * 75, + f"{'Lambda (time decay):':<25} {self.lambda_time:>10.4f}", + f"{'Lambda (unit distance):':<25} {self.lambda_unit:>10.4f}", + f"{'Lambda (nuclear norm):':<25} {self.lambda_nn:>10.4f}", + f"{'Effective rank:':<25} {self.effective_rank:>10.2f}", + f"{'LOOCV score:':<25} {self.loocv_score:>10.6f}", + ] + + # Variance method info + lines.append(f"{'Variance method:':<25} {self.variance_method:>10}") + if self.variance_method == "bootstrap" and self.n_bootstrap is not None: + lines.append(f"{'Bootstrap replications:':<25} {self.n_bootstrap:>10}") + + lines.extend([ + "", + "-" * 75, + f"{'Parameter':<15} {'Estimate':>12} {'Std. Err.':>12} " + f"{'t-stat':>10} {'P>|t|':>10} {'':>5}", + "-" * 75, + f"{'ATT':<15} {self.att:>12.4f} {self.se:>12.4f} " + f"{self.t_stat:>10.3f} {self.p_value:>10.4f} {self.significance_stars:>5}", + "-" * 75, + "", + f"{conf_level}% Confidence Interval: [{self.conf_int[0]:.4f}, {self.conf_int[1]:.4f}]", + ]) + + # Add significance codes + lines.extend([ + "", + "Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 
0.1", + "=" * 75, + ]) + + return "\n".join(lines) + + def print_summary(self, alpha: Optional[float] = None) -> None: + """Print the summary to stdout.""" + print(self.summary(alpha)) + + def to_dict(self) -> Dict[str, Any]: + """ + Convert results to a dictionary. + + Returns + ------- + Dict[str, Any] + Dictionary containing all estimation results. + """ + return { + "att": self.att, + "se": self.se, + "t_stat": self.t_stat, + "p_value": self.p_value, + "conf_int_lower": self.conf_int[0], + "conf_int_upper": self.conf_int[1], + "n_obs": self.n_obs, + "n_treated": self.n_treated, + "n_control": self.n_control, + "n_treated_obs": self.n_treated_obs, + "n_pre_periods": len(self.pre_periods), + "n_post_periods": len(self.post_periods), + "lambda_time": self.lambda_time, + "lambda_unit": self.lambda_unit, + "lambda_nn": self.lambda_nn, + "effective_rank": self.effective_rank, + "loocv_score": self.loocv_score, + "variance_method": self.variance_method, + } + + def to_dataframe(self) -> pd.DataFrame: + """ + Convert results to a pandas DataFrame. + + Returns + ------- + pd.DataFrame + DataFrame with estimation results. + """ + return pd.DataFrame([self.to_dict()]) + + def get_treatment_effects_df(self) -> pd.DataFrame: + """ + Get individual treatment effects as a DataFrame. + + Returns + ------- + pd.DataFrame + DataFrame with unit, time, and treatment effect columns. + """ + return pd.DataFrame([ + {"unit": unit, "time": time, "effect": effect} + for (unit, time), effect in self.treatment_effects.items() + ]) + + def get_unit_effects_df(self) -> pd.DataFrame: + """ + Get unit fixed effects as a DataFrame. + + Returns + ------- + pd.DataFrame + DataFrame with unit and effect columns. + """ + return pd.DataFrame([ + {"unit": unit, "effect": effect} + for unit, effect in self.unit_effects.items() + ]) + + def get_time_effects_df(self) -> pd.DataFrame: + """ + Get time fixed effects as a DataFrame. + + Returns + ------- + pd.DataFrame + DataFrame with time and effect columns. + """ + return pd.DataFrame([ + {"time": time, "effect": effect} + for time, effect in self.time_effects.items() + ]) + + @property + def is_significant(self) -> bool: + """Check if the ATT is statistically significant at the alpha level.""" + return bool(self.p_value < self.alpha) + + @property + def significance_stars(self) -> str: + """Return significance stars based on p-value.""" + return _get_significance_stars(self.p_value) + + +class TROP: + """ + Triply Robust Panel (TROP) estimator. + + Implements the exact methodology from Athey, Imbens, Qu & Viviano (2025). + TROP combines three robustness components: + + 1. **Nuclear norm regularized factor model**: Estimates interactive fixed + effects L_it via matrix completion with nuclear norm penalty ||L||_* + + 2. **Exponential distance-based unit weights**: ω_j = exp(-λ_unit × d(j,i)) + where d(j,i) is the RMSE of outcome differences between units + + 3. **Exponential time decay weights**: θ_s = exp(-λ_time × |s-t|) + weighting pre-treatment periods by proximity to treatment + + Tuning parameters (λ_time, λ_unit, λ_nn) are selected via leave-one-out + cross-validation on control observations. + + Parameters + ---------- + lambda_time_grid : list, optional + Grid of time weight decay parameters. Default: [0, 0.1, 0.5, 1, 2, 5]. + lambda_unit_grid : list, optional + Grid of unit weight decay parameters. Default: [0, 0.1, 0.5, 1, 2, 5]. + lambda_nn_grid : list, optional + Grid of nuclear norm regularization parameters. Default: [0, 0.01, 0.1, 1]. 
+ max_iter : int, default=100 + Maximum iterations for nuclear norm optimization. + tol : float, default=1e-6 + Convergence tolerance for optimization. + alpha : float, default=0.05 + Significance level for confidence intervals. + variance_method : str, default='bootstrap' + Method for variance estimation: 'bootstrap' or 'jackknife'. + n_bootstrap : int, default=200 + Number of replications for variance estimation. + seed : int, optional + Random seed for reproducibility. + + Attributes + ---------- + results_ : TROPResults + Estimation results after calling fit(). + is_fitted_ : bool + Whether the model has been fitted. + + Examples + -------- + >>> from diff_diff import TROP + >>> trop = TROP() + >>> results = trop.fit( + ... data, + ... outcome='outcome', + ... treatment='treated', + ... unit='unit', + ... time='period', + ... post_periods=[5, 6, 7, 8] + ... ) + >>> results.print_summary() + + References + ---------- + Athey, S., Imbens, G. W., Qu, Z., & Viviano, D. (2025). Triply Robust + Panel Estimators. *Working Paper*. https://arxiv.org/abs/2508.21536 + """ + + def __init__( + self, + lambda_time_grid: Optional[List[float]] = None, + lambda_unit_grid: Optional[List[float]] = None, + lambda_nn_grid: Optional[List[float]] = None, + max_iter: int = 100, + tol: float = 1e-6, + alpha: float = 0.05, + variance_method: str = 'bootstrap', + n_bootstrap: int = 200, + seed: Optional[int] = None, + ): + # Default grids from paper + self.lambda_time_grid = lambda_time_grid or [0.0, 0.1, 0.5, 1.0, 2.0, 5.0] + self.lambda_unit_grid = lambda_unit_grid or [0.0, 0.1, 0.5, 1.0, 2.0, 5.0] + self.lambda_nn_grid = lambda_nn_grid or [0.0, 0.01, 0.1, 1.0, 10.0] + + self.max_iter = max_iter + self.tol = tol + self.alpha = alpha + self.variance_method = variance_method + self.n_bootstrap = n_bootstrap + self.seed = seed + + # Validate parameters + valid_variance_methods = ("bootstrap", "jackknife") + if variance_method not in valid_variance_methods: + raise ValueError( + f"variance_method must be one of {valid_variance_methods}, " + f"got '{variance_method}'" + ) + + # Internal state + self.results_: Optional[TROPResults] = None + self.is_fitted_: bool = False + self._optimal_lambda: Optional[Tuple[float, float, float]] = None + + def fit( + self, + data: pd.DataFrame, + outcome: str, + treatment: str, + unit: str, + time: str, + post_periods: Optional[List[Any]] = None, + ) -> TROPResults: + """ + Fit the TROP model. + + Parameters + ---------- + data : pd.DataFrame + Panel data with observations for multiple units over multiple + time periods. + outcome : str + Name of the outcome variable column. + treatment : str + Name of the treatment indicator column (0/1). + Should be 1 for treated unit-time observations. + unit : str + Name of the unit identifier column. + time : str + Name of the time period column. + post_periods : list, optional + List of time period values that are post-treatment. + If None, infers from treatment indicator. + + Returns + ------- + TROPResults + Object containing the ATT estimate, standard error, + factor estimates, and tuning parameters. 
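+
+        Notes
+        -----
+        The fitted model is ``Y_it = alpha_i + beta_t + L_it + tau * D_it + eps_it``,
+        where ``L`` is the nuclear norm regularized interactive fixed effects
+        component. ``tau`` is estimated separately for each treated observation
+        (Algorithm 2 in the paper) and averaged to form the ATT.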
+ """ + # Validate inputs + required_cols = [outcome, treatment, unit, time] + missing = [c for c in required_cols if c not in data.columns] + if missing: + raise ValueError(f"Missing columns: {missing}") + + # Get unique units and periods + all_units = sorted(data[unit].unique()) + all_periods = sorted(data[time].unique()) + + n_units = len(all_units) + n_periods = len(all_periods) + + # Create mappings + unit_to_idx = {u: i for i, u in enumerate(all_units)} + period_to_idx = {p: i for i, p in enumerate(all_periods)} + idx_to_unit = {i: u for u, i in unit_to_idx.items()} + idx_to_period = {i: p for p, i in period_to_idx.items()} + + # Create outcome matrix Y (n_periods x n_units) and treatment matrix D + Y = np.full((n_periods, n_units), np.nan) + D = np.zeros((n_periods, n_units), dtype=int) + + for _, row in data.iterrows(): + i = unit_to_idx[row[unit]] + t = period_to_idx[row[time]] + Y[t, i] = row[outcome] + D[t, i] = int(row[treatment]) + + # Identify treated observations + treated_mask = D == 1 + n_treated_obs = np.sum(treated_mask) + + if n_treated_obs == 0: + raise ValueError("No treated observations found") + + # Identify treated and control units + unit_ever_treated = np.any(D == 1, axis=0) + treated_unit_idx = np.where(unit_ever_treated)[0] + control_unit_idx = np.where(~unit_ever_treated)[0] + + if len(control_unit_idx) == 0: + raise ValueError("No control units found") + + # Determine pre/post periods + if post_periods is None: + # Infer from first treatment time + first_treat_period = None + for t in range(n_periods): + if np.any(D[t, :] == 1): + first_treat_period = t + break + if first_treat_period is None: + raise ValueError("Could not infer post-treatment periods") + pre_period_idx = list(range(first_treat_period)) + post_period_idx = list(range(first_treat_period, n_periods)) + else: + post_period_idx = [period_to_idx[p] for p in post_periods if p in period_to_idx] + pre_period_idx = [i for i in range(n_periods) if i not in post_period_idx] + + if len(pre_period_idx) < 2: + raise ValueError("Need at least 2 pre-treatment periods") + + pre_periods_list = [idx_to_period[i] for i in pre_period_idx] + post_periods_list = [idx_to_period[i] for i in post_period_idx] + n_treated_periods = len(post_period_idx) + + # Step 1: Grid search with LOOCV for tuning parameters + best_lambda = None + best_score = np.inf + + # Control observations mask (for LOOCV) + control_mask = D == 0 + + for lambda_time in self.lambda_time_grid: + for lambda_unit in self.lambda_unit_grid: + for lambda_nn in self.lambda_nn_grid: + try: + score = self._loocv_score_obs_specific( + Y, D, control_mask, control_unit_idx, + lambda_time, lambda_unit, lambda_nn, + n_units, n_periods + ) + if score < best_score: + best_score = score + best_lambda = (lambda_time, lambda_unit, lambda_nn) + except (np.linalg.LinAlgError, ValueError): + continue + + if best_lambda is None: + warnings.warn( + "All tuning parameter combinations failed. 
Using defaults.", + UserWarning + ) + best_lambda = (1.0, 1.0, 0.1) + best_score = np.nan + + self._optimal_lambda = best_lambda + lambda_time, lambda_unit, lambda_nn = best_lambda + + # Step 2: Final estimation - per-observation model fitting following Algorithm 2 + # For each treated (i,t): compute observation-specific weights, fit model, compute τ̂_{it} + treatment_effects = {} + tau_values = [] + alpha_estimates = [] + beta_estimates = [] + L_estimates = [] + + # Get list of treated observations + treated_observations = [(t, i) for t in range(n_periods) for i in range(n_units) + if D[t, i] == 1] + + for t, i in treated_observations: + # Compute observation-specific weights for this (i, t) + weight_matrix = self._compute_observation_weights( + Y, D, i, t, lambda_time, lambda_unit, control_unit_idx, + n_units, n_periods + ) + + # Fit model with these weights + alpha_hat, beta_hat, L_hat = self._estimate_model( + Y, control_mask, weight_matrix, lambda_nn, + n_units, n_periods + ) + + # Compute treatment effect: τ̂_{it} = Y_{it} - α̂_i - β̂_t - L̂_{it} + tau_it = Y[t, i] - alpha_hat[i] - beta_hat[t] - L_hat[t, i] + + unit_id = idx_to_unit[i] + time_id = idx_to_period[t] + treatment_effects[(unit_id, time_id)] = tau_it + tau_values.append(tau_it) + + # Store for averaging + alpha_estimates.append(alpha_hat) + beta_estimates.append(beta_hat) + L_estimates.append(L_hat) + + # Average ATT + att = np.mean(tau_values) + + # Average parameter estimates for output (representative) + alpha_hat = np.mean(alpha_estimates, axis=0) if alpha_estimates else np.zeros(n_units) + beta_hat = np.mean(beta_estimates, axis=0) if beta_estimates else np.zeros(n_periods) + L_hat = np.mean(L_estimates, axis=0) if L_estimates else np.zeros((n_periods, n_units)) + + # Compute effective rank + _, s, _ = np.linalg.svd(L_hat, full_matrices=False) + if s[0] > 0: + effective_rank = np.sum(s) / s[0] + else: + effective_rank = 0.0 + + # Step 4: Variance estimation + if self.variance_method == "bootstrap": + se, bootstrap_dist = self._bootstrap_variance( + data, outcome, treatment, unit, time, post_periods_list, + best_lambda + ) + else: + se, bootstrap_dist = self._jackknife_variance( + Y, D, control_mask, control_unit_idx, best_lambda, + n_units, n_periods + ) + + # Compute test statistics + if se > 0: + t_stat = att / se + p_value = 2 * (1 - stats.t.cdf(abs(t_stat), df=max(1, n_treated_obs - 1))) + else: + t_stat = 0.0 + p_value = 1.0 + + conf_int = compute_confidence_interval(att, se, self.alpha) + + # Create results dictionaries + unit_effects_dict = {idx_to_unit[i]: alpha_hat[i] for i in range(n_units)} + time_effects_dict = {idx_to_period[t]: beta_hat[t] for t in range(n_periods)} + + # Store results + self.results_ = TROPResults( + att=att, + se=se, + t_stat=t_stat, + p_value=p_value, + conf_int=conf_int, + n_obs=len(data), + n_treated=len(treated_unit_idx), + n_control=len(control_unit_idx), + n_treated_obs=n_treated_obs, + unit_effects=unit_effects_dict, + time_effects=time_effects_dict, + treatment_effects=treatment_effects, + lambda_time=lambda_time, + lambda_unit=lambda_unit, + lambda_nn=lambda_nn, + factor_matrix=L_hat, + effective_rank=effective_rank, + loocv_score=best_score, + variance_method=self.variance_method, + alpha=self.alpha, + pre_periods=pre_periods_list, + post_periods=post_periods_list, + n_bootstrap=self.n_bootstrap if self.variance_method == "bootstrap" else None, + bootstrap_distribution=bootstrap_dist if len(bootstrap_dist) > 0 else None, + ) + + self.is_fitted_ = True + return self.results_ 
+ + def _compute_unit_distance_pairwise( + self, + Y: np.ndarray, + D: np.ndarray, + j: int, + i: int, + target_period: int, + ) -> float: + """ + Compute pairwise distance from control unit j to treated unit i. + + Following the paper's Equation 3 (page 7): + dist_unit_{-t}(j, i) = sqrt( + Σ_u 1{u≠t}(1-W_{iu})(1-W_{ju})(Y_{iu} - Y_{ju})² + / Σ_u 1{u≠t}(1-W_{iu})(1-W_{ju}) + ) + + This computes the RMSE between units j and i over periods where + both are untreated, excluding the target period t. + + Parameters + ---------- + Y : np.ndarray + Outcome matrix (n_periods x n_units). + D : np.ndarray + Treatment indicator matrix (n_periods x n_units). + j : int + Index of control unit. + i : int + Index of treated unit. + target_period : int + Target treatment period t (excluded from distance computation). + + Returns + ------- + float + Pairwise RMSE distance between units j and i. + """ + n_periods = Y.shape[0] + + sq_diffs = [] + for u in range(n_periods): + # Exclude target period and periods where either unit is treated + if u == target_period: + continue + # (1 - W_{iu})(1 - W_{ju}) means both must be untreated + if D[u, i] == 1 or D[u, j] == 1: + continue + if np.isnan(Y[u, i]) or np.isnan(Y[u, j]): + continue + + sq_diffs.append((Y[u, i] - Y[u, j]) ** 2) + + if len(sq_diffs) > 0: + return np.sqrt(np.mean(sq_diffs)) + else: + return np.inf + + def _compute_observation_weights( + self, + Y: np.ndarray, + D: np.ndarray, + i: int, + t: int, + lambda_time: float, + lambda_unit: float, + control_unit_idx: np.ndarray, + n_units: int, + n_periods: int, + ) -> np.ndarray: + """ + Compute observation-specific weight matrix for treated observation (i, t). + + Following the paper's Algorithm 2 (page 27): + - Time weights θ_s^{i,t} = exp(-λ_time × |t - s|) + - Unit weights ω_j^{i,t} = exp(-λ_unit × dist_unit_{-t}(j, i)) + + Parameters + ---------- + Y : np.ndarray + Outcome matrix (n_periods x n_units). + D : np.ndarray + Treatment indicator matrix (n_periods x n_units). + i : int + Treated unit index. + t : int + Treatment period index. + lambda_time : float + Time weight decay parameter. + lambda_unit : float + Unit weight decay parameter. + control_unit_idx : np.ndarray + Indices of control units. + n_units : int + Number of units. + n_periods : int + Number of periods. + + Returns + ------- + np.ndarray + Weight matrix (n_periods x n_units) for observation (i, t). + """ + # Time distance: |t - s| following paper's Equation 3 (page 7) + dist_time = np.array([abs(t - s) for s in range(n_periods)]) + time_weights = np.exp(-lambda_time * dist_time) + + # Unit distance: pairwise RMSE from each control j to treated i + unit_weights = np.zeros(n_units) + + if lambda_unit == 0: + # Uniform weights when lambda_unit = 0 + unit_weights[:] = 1.0 + else: + for j in control_unit_idx: + dist = self._compute_unit_distance_pairwise(Y, D, j, i, t) + if np.isinf(dist): + unit_weights[j] = 0.0 + else: + unit_weights[j] = np.exp(-lambda_unit * dist) + + # Treated unit i gets weight 1 (or could be omitted since we fit on controls) + # We include treated unit's own observation for model fitting + unit_weights[i] = 1.0 + + # Weight matrix: outer product (n_periods x n_units) + W = np.outer(time_weights, unit_weights) + + return W + + def _soft_threshold_svd( + self, + M: np.ndarray, + threshold: float, + ) -> np.ndarray: + """ + Apply soft-thresholding to singular values (proximal operator for nuclear norm). + + Parameters + ---------- + M : np.ndarray + Input matrix. + threshold : float + Soft-thresholding parameter. 
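+            All singular values are shrunk by this amount, with values at or
+            below it set to zero. Callers pass the nuclear norm penalty
+            ``lambda_nn`` here.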
+ + Returns + ------- + np.ndarray + Matrix with soft-thresholded singular values. + """ + if threshold <= 0: + return M + + # Handle NaN/Inf values in input + if not np.isfinite(M).all(): + M = np.nan_to_num(M, nan=0.0, posinf=0.0, neginf=0.0) + + try: + U, s, Vt = np.linalg.svd(M, full_matrices=False) + except np.linalg.LinAlgError: + # SVD failed, return zero matrix + return np.zeros_like(M) + + # Check for numerical issues in SVD output + if not (np.isfinite(U).all() and np.isfinite(s).all() and np.isfinite(Vt).all()): + # SVD produced non-finite values, return zero matrix + return np.zeros_like(M) + + s_thresh = np.maximum(s - threshold, 0) + + # Use truncated reconstruction with only non-zero singular values + nonzero_mask = s_thresh > 1e-10 + if not np.any(nonzero_mask): + return np.zeros_like(M) + + # Truncate to non-zero components for numerical stability + U_trunc = U[:, nonzero_mask] + s_trunc = s_thresh[nonzero_mask] + Vt_trunc = Vt[nonzero_mask, :] + + # Compute result, suppressing expected numerical warnings from + # ill-conditioned matrices during alternating minimization + with np.errstate(divide='ignore', over='ignore', invalid='ignore'): + result = (U_trunc * s_trunc) @ Vt_trunc + + # Replace any NaN/Inf in result with zeros + if not np.isfinite(result).all(): + result = np.nan_to_num(result, nan=0.0, posinf=0.0, neginf=0.0) + + return result + + def _estimate_model( + self, + Y: np.ndarray, + control_mask: np.ndarray, + weight_matrix: np.ndarray, + lambda_nn: float, + n_units: int, + n_periods: int, + exclude_obs: Optional[Tuple[int, int]] = None, + ) -> Tuple[np.ndarray, np.ndarray, np.ndarray]: + """ + Estimate the model: Y = α + β + L + τD + ε with nuclear norm penalty on L. + + Uses alternating minimization: + 1. Fix L, solve for α, β + 2. Fix α, β, solve for L via soft-thresholding + + Parameters + ---------- + Y : np.ndarray + Outcome matrix (n_periods x n_units). + control_mask : np.ndarray + Boolean mask for control observations. + weight_matrix : np.ndarray + Pre-computed global weight matrix (n_periods x n_units). + lambda_nn : float + Nuclear norm regularization parameter. + n_units : int + Number of units. + n_periods : int + Number of periods. + exclude_obs : tuple, optional + (t, i) observation to exclude (for LOOCV). + + Returns + ------- + tuple + (alpha, beta, L) estimated parameters. 
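+
+        Notes
+        -----
+        The L-update is the proximal step for the nuclear norm penalty: with
+        ``R = Y - alpha - beta`` (missing and excluded entries imputed with the
+        current ``L``), compute the SVD ``R = U diag(s) V^T`` and set
+        ``L = U diag(max(s - lambda_nn, 0)) V^T``.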
+ """ + W = weight_matrix + + # Mask for estimation (control obs only, excluding LOOCV obs if specified) + est_mask = control_mask.copy() + if exclude_obs is not None: + t_ex, i_ex = exclude_obs + est_mask[t_ex, i_ex] = False + + # Handle missing values + valid_mask = ~np.isnan(Y) & est_mask + + # Initialize + alpha = np.zeros(n_units) + beta = np.zeros(n_periods) + L = np.zeros((n_periods, n_units)) + + # Alternating minimization + for iteration in range(self.max_iter): + alpha_old = alpha.copy() + beta_old = beta.copy() + L_old = L.copy() + + # Step 1: Update α and β (weighted means) + R = Y - L # Residual without fixed effects + + # Weighted mean for alpha (unit effects) + for i in range(n_units): + mask_i = valid_mask[:, i] + if np.any(mask_i): + weights_i = W[mask_i, i] + # Handle case where weights sum to zero (unit not in weight computation) + weight_sum = np.sum(weights_i) + if weight_sum > 0: + alpha[i] = np.average(R[mask_i, i] - beta[mask_i], weights=weights_i) + else: + # Use unweighted mean for units with zero total weight + alpha[i] = np.mean(R[mask_i, i] - beta[mask_i]) + else: + alpha[i] = 0.0 + + # Weighted mean for beta (time effects) + for t in range(n_periods): + mask_t = valid_mask[t, :] + if np.any(mask_t): + weights_t = W[t, mask_t] + # Handle case where weights sum to zero + weight_sum = np.sum(weights_t) + if weight_sum > 0: + beta[t] = np.average(R[t, mask_t] - alpha[mask_t], weights=weights_t) + else: + # Use unweighted mean for periods with zero total weight + beta[t] = np.mean(R[t, mask_t] - alpha[mask_t]) + else: + beta[t] = 0.0 + + # Step 2: Update L with nuclear norm penalty + # L = soft_threshold(Y - α - β, λ_nn) + R_for_L = np.zeros((n_periods, n_units)) + for t in range(n_periods): + for i in range(n_units): + if valid_mask[t, i]: + R_for_L[t, i] = Y[t, i] - alpha[i] - beta[t] + else: + # Impute with current L + R_for_L[t, i] = L[t, i] + + L = self._soft_threshold_svd(R_for_L, lambda_nn) + + # Check convergence + alpha_diff = np.max(np.abs(alpha - alpha_old)) + beta_diff = np.max(np.abs(beta - beta_old)) + L_diff = np.max(np.abs(L - L_old)) + + if max(alpha_diff, beta_diff, L_diff) < self.tol: + break + + return alpha, beta, L + + def _loocv_score_obs_specific( + self, + Y: np.ndarray, + D: np.ndarray, + control_mask: np.ndarray, + control_unit_idx: np.ndarray, + lambda_time: float, + lambda_unit: float, + lambda_nn: float, + n_units: int, + n_periods: int, + ) -> float: + """ + Compute leave-one-out cross-validation score with observation-specific weights. + + Following the paper's Equation 5 (page 8): + Q(λ) = Σ_{j,s: D_js=0} [τ̂_js^loocv(λ)]² + + For each control observation (j, s), treat it as pseudo-treated, + compute observation-specific weights, fit model excluding (j, s), + and sum squared pseudo-treatment effects. + + Parameters + ---------- + Y : np.ndarray + Outcome matrix (n_periods x n_units). + D : np.ndarray + Treatment indicator matrix (n_periods x n_units). + control_mask : np.ndarray + Boolean mask for control observations. + control_unit_idx : np.ndarray + Indices of control units. + lambda_time : float + Time weight decay parameter. + lambda_unit : float + Unit weight decay parameter. + lambda_nn : float + Nuclear norm regularization parameter. + n_units : int + Number of units. + n_periods : int + Number of periods. + + Returns + ------- + float + LOOCV score (lower is better). 
+ """ + # Get all control observations + control_obs = [(t, i) for t in range(n_periods) for i in range(n_units) + if control_mask[t, i] and not np.isnan(Y[t, i])] + + # Subsample for computational tractability (as noted in paper's footnote) + rng = np.random.default_rng(self.seed) + max_loocv = min(100, len(control_obs)) + if len(control_obs) > max_loocv: + indices = rng.choice(len(control_obs), size=max_loocv, replace=False) + control_obs = [control_obs[idx] for idx in indices] + + tau_squared_sum = 0.0 + n_valid = 0 + + for t, i in control_obs: + try: + # Compute observation-specific weights for pseudo-treated (i, t) + weight_matrix = self._compute_observation_weights( + Y, D, i, t, lambda_time, lambda_unit, control_unit_idx, + n_units, n_periods + ) + + # Estimate model excluding observation (t, i) + alpha, beta, L = self._estimate_model( + Y, control_mask, weight_matrix, lambda_nn, + n_units, n_periods, exclude_obs=(t, i) + ) + + # Pseudo treatment effect + tau_ti = Y[t, i] - alpha[i] - beta[t] - L[t, i] + tau_squared_sum += tau_ti ** 2 + n_valid += 1 + + except (np.linalg.LinAlgError, ValueError): + continue + + if n_valid == 0: + return np.inf + + return tau_squared_sum / n_valid + + def _bootstrap_variance( + self, + data: pd.DataFrame, + outcome: str, + treatment: str, + unit: str, + time: str, + post_periods: List[Any], + optimal_lambda: Tuple[float, float, float], + ) -> Tuple[float, np.ndarray]: + """ + Compute bootstrap standard error using unit-level block bootstrap. + + Parameters + ---------- + data : pd.DataFrame + Original data. + outcome : str + Outcome column name. + treatment : str + Treatment column name. + unit : str + Unit column name. + time : str + Time column name. + post_periods : list + Post-treatment periods. + optimal_lambda : tuple + Optimal (lambda_time, lambda_unit, lambda_nn). + + Returns + ------- + tuple + (se, bootstrap_estimates). + """ + rng = np.random.default_rng(self.seed) + all_units = data[unit].unique() + n_units = len(all_units) + + bootstrap_estimates = [] + + for b in range(self.n_bootstrap): + # Sample units with replacement + sampled_units = rng.choice(all_units, size=n_units, replace=True) + + # Create bootstrap sample with unique unit IDs + boot_data = pd.concat([ + data[data[unit] == u].assign(**{unit: f"{u}_{idx}"}) + for idx, u in enumerate(sampled_units) + ], ignore_index=True) + + try: + # Fit with fixed lambda (skip LOOCV for speed) + att = self._fit_with_fixed_lambda( + boot_data, outcome, treatment, unit, time, + post_periods, optimal_lambda + ) + bootstrap_estimates.append(att) + except (ValueError, np.linalg.LinAlgError, KeyError): + continue + + bootstrap_estimates = np.array(bootstrap_estimates) + + if len(bootstrap_estimates) < 10: + warnings.warn( + f"Only {len(bootstrap_estimates)} bootstrap iterations succeeded. " + "Standard errors may be unreliable.", + UserWarning + ) + if len(bootstrap_estimates) == 0: + return 0.0, np.array([]) + + se = np.std(bootstrap_estimates, ddof=1) + return se, bootstrap_estimates + + def _jackknife_variance( + self, + Y: np.ndarray, + D: np.ndarray, + control_mask: np.ndarray, + control_unit_idx: np.ndarray, + optimal_lambda: Tuple[float, float, float], + n_units: int, + n_periods: int, + ) -> Tuple[float, np.ndarray]: + """ + Compute jackknife standard error (leave-one-unit-out). + + Uses observation-specific weights following Algorithm 2. + + Parameters + ---------- + Y : np.ndarray + Outcome matrix. + D : np.ndarray + Treatment matrix. 
+ control_mask : np.ndarray + Control observation mask. + control_unit_idx : np.ndarray + Indices of control units. + optimal_lambda : tuple + Optimal tuning parameters. + n_units : int + Number of units. + n_periods : int + Number of periods. + + Returns + ------- + tuple + (se, jackknife_estimates). + """ + lambda_time, lambda_unit, lambda_nn = optimal_lambda + jackknife_estimates = [] + + # Get treated unit indices + treated_unit_idx = np.where(np.any(D == 1, axis=0))[0] + + for leave_out in treated_unit_idx: + # Create mask excluding this unit + Y_jack = Y.copy() + D_jack = D.copy() + Y_jack[:, leave_out] = np.nan + D_jack[:, leave_out] = 0 + + control_mask_jack = D_jack == 0 + + # Get remaining treated observations + treated_obs_jack = [(t, i) for t in range(n_periods) for i in range(n_units) + if D_jack[t, i] == 1] + + if not treated_obs_jack: + continue + + try: + # Compute ATT using observation-specific weights (Algorithm 2) + tau_values = [] + for t, i in treated_obs_jack: + # Compute observation-specific weights for this (i, t) + weight_matrix = self._compute_observation_weights( + Y_jack, D_jack, i, t, lambda_time, lambda_unit, + control_unit_idx, n_units, n_periods + ) + + # Fit model with these weights + alpha, beta, L = self._estimate_model( + Y_jack, control_mask_jack, weight_matrix, lambda_nn, + n_units, n_periods + ) + + # Compute treatment effect + tau = Y_jack[t, i] - alpha[i] - beta[t] - L[t, i] + tau_values.append(tau) + + if tau_values: + jackknife_estimates.append(np.mean(tau_values)) + + except (np.linalg.LinAlgError, ValueError): + continue + + jackknife_estimates = np.array(jackknife_estimates) + + if len(jackknife_estimates) < 2: + return 0.0, jackknife_estimates + + # Jackknife SE formula + n = len(jackknife_estimates) + mean_est = np.mean(jackknife_estimates) + se = np.sqrt((n - 1) / n * np.sum((jackknife_estimates - mean_est) ** 2)) + + return se, jackknife_estimates + + def _fit_with_fixed_lambda( + self, + data: pd.DataFrame, + outcome: str, + treatment: str, + unit: str, + time: str, + post_periods: List[Any], + fixed_lambda: Tuple[float, float, float], + ) -> float: + """ + Fit model with fixed tuning parameters (for bootstrap). + + Uses observation-specific weights following Algorithm 2. + Returns only the ATT estimate. 
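+
+        Raises
+        ------
+        ValueError
+            If the sample contains no treated observations (possible in a
+            bootstrap draw).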
+ """ + lambda_time, lambda_unit, lambda_nn = fixed_lambda + + # Setup matrices + all_units = sorted(data[unit].unique()) + all_periods = sorted(data[time].unique()) + + n_units = len(all_units) + n_periods = len(all_periods) + + unit_to_idx = {u: i for i, u in enumerate(all_units)} + period_to_idx = {p: i for i, p in enumerate(all_periods)} + + Y = np.full((n_periods, n_units), np.nan) + D = np.zeros((n_periods, n_units), dtype=int) + + for _, row in data.iterrows(): + i = unit_to_idx[row[unit]] + t = period_to_idx[row[time]] + Y[t, i] = row[outcome] + D[t, i] = int(row[treatment]) + + control_mask = D == 0 + + # Get control unit indices + unit_ever_treated = np.any(D == 1, axis=0) + control_unit_idx = np.where(~unit_ever_treated)[0] + + # Get list of treated observations + treated_observations = [(t, i) for t in range(n_periods) for i in range(n_units) + if D[t, i] == 1] + + if not treated_observations: + raise ValueError("No treated observations") + + # Compute ATT using observation-specific weights (Algorithm 2) + tau_values = [] + for t, i in treated_observations: + # Compute observation-specific weights for this (i, t) + weight_matrix = self._compute_observation_weights( + Y, D, i, t, lambda_time, lambda_unit, control_unit_idx, + n_units, n_periods + ) + + # Fit model with these weights + alpha, beta, L = self._estimate_model( + Y, control_mask, weight_matrix, lambda_nn, + n_units, n_periods + ) + + # Compute treatment effect: τ̂_{it} = Y_{it} - α̂_i - β̂_t - L̂_{it} + tau = Y[t, i] - alpha[i] - beta[t] - L[t, i] + tau_values.append(tau) + + return np.mean(tau_values) + + def get_params(self) -> Dict[str, Any]: + """Get estimator parameters.""" + return { + "lambda_time_grid": self.lambda_time_grid, + "lambda_unit_grid": self.lambda_unit_grid, + "lambda_nn_grid": self.lambda_nn_grid, + "max_iter": self.max_iter, + "tol": self.tol, + "alpha": self.alpha, + "variance_method": self.variance_method, + "n_bootstrap": self.n_bootstrap, + "seed": self.seed, + } + + def set_params(self, **params) -> "TROP": + """Set estimator parameters.""" + for key, value in params.items(): + if hasattr(self, key): + setattr(self, key, value) + else: + raise ValueError(f"Unknown parameter: {key}") + return self + + +def trop( + data: pd.DataFrame, + outcome: str, + treatment: str, + unit: str, + time: str, + post_periods: Optional[List[Any]] = None, + **kwargs, +) -> TROPResults: + """ + Convenience function for TROP estimation. + + Parameters + ---------- + data : pd.DataFrame + Panel data. + outcome : str + Outcome variable column name. + treatment : str + Treatment indicator column name. + unit : str + Unit identifier column name. + time : str + Time period column name. + post_periods : list, optional + Post-treatment periods. + **kwargs + Additional arguments passed to TROP constructor. + + Returns + ------- + TROPResults + Estimation results. 
+ + Examples + -------- + >>> from diff_diff import trop + >>> results = trop(data, 'y', 'treated', 'unit', 'time', post_periods=[5,6,7]) + >>> print(f"ATT: {results.att:.3f}") + """ + estimator = TROP(**kwargs) + return estimator.fit(data, outcome, treatment, unit, time, post_periods) diff --git a/docs/api/index.rst b/docs/api/index.rst index 06f13fd..9f1c3a3 100644 --- a/docs/api/index.rst +++ b/docs/api/index.rst @@ -19,6 +19,7 @@ Core estimator classes for DiD analysis: diff_diff.CallawaySantAnna diff_diff.SunAbraham diff_diff.TripleDifference + diff_diff.TROP Results Classes --------------- @@ -39,6 +40,7 @@ Result containers returned by estimators: diff_diff.SunAbrahamResults diff_diff.SABootstrapResults diff_diff.TripleDifferenceResults + diff_diff.trop.TROPResults Visualization ------------- @@ -180,6 +182,7 @@ Detailed documentation by module: estimators staggered triple_diff + trop results visualization diagnostics diff --git a/docs/api/trop.rst b/docs/api/trop.rst new file mode 100644 index 0000000..3712a4e --- /dev/null +++ b/docs/api/trop.rst @@ -0,0 +1,193 @@ +Triply Robust Panel (TROP) +========================== + +Triply Robust Panel estimator for panel data with factor confounding. + +This module implements the methodology from Athey, Imbens, Qu & Viviano (2025), +which combines three robustness components: + +1. **Nuclear norm regularized factor model**: Estimates interactive fixed effects + via matrix completion with nuclear norm penalty ||L||_* + +2. **Exponential distance-based unit weights**: ω_j = exp(-λ_unit × d(j,i)) + where d(j,i) is the pairwise RMSE between units over pre-treatment periods + +3. **Exponential time decay weights**: θ_s = exp(-λ_time × |t-s|) + weighting periods by proximity to the specific treatment period t + +**When to use TROP:** + +- Suspected **factor structure** in the data (e.g., economic cycles, regional shocks) +- **Unobserved time-varying confounders** that affect units differently over time +- Standard parallel trends may be violated due to latent common factors +- Reasonably long pre-treatment period to estimate factors + +**Reference:** Athey, S., Imbens, G. W., Qu, Z., & Viviano, D. (2025). Triply Robust +Panel Estimators. *Working Paper*. `arXiv:2508.21536 `_ + +.. module:: diff_diff.trop + +TROP +---- + +Main estimator class for Triply Robust Panel estimation. + +.. autoclass:: diff_diff.TROP + :members: + :undoc-members: + :show-inheritance: + :inherited-members: + + .. rubric:: Methods + + .. autosummary:: + + ~TROP.fit + ~TROP.get_params + ~TROP.set_params + +TROPResults +----------- + +Results container for TROP estimation. + +.. autoclass:: diff_diff.trop.TROPResults + :members: + :undoc-members: + :show-inheritance: + + .. rubric:: Methods + + .. autosummary:: + + ~TROPResults.summary + ~TROPResults.print_summary + ~TROPResults.to_dict + ~TROPResults.to_dataframe + ~TROPResults.get_treatment_effects_df + ~TROPResults.get_unit_effects_df + ~TROPResults.get_time_effects_df + +Convenience Function +-------------------- + +.. autofunction:: diff_diff.trop + +Tuning Parameters +----------------- + +TROP uses leave-one-out cross-validation (LOOCV) to select three tuning parameters: + +.. 
list-table:: + :header-rows: 1 + :widths: 15 35 50 + + * - Parameter + - Description + - Effect + * - ``λ_time`` + - Time weight decay + - Higher values weight periods closer to treatment more heavily + * - ``λ_unit`` + - Unit distance decay + - Higher values weight similar control units more heavily + * - ``λ_nn`` + - Nuclear norm penalty + - Higher values encourage lower-rank factor structure + +Algorithm +--------- + +TROP follows Algorithm 2 from the paper: + +1. **Grid search with LOOCV**: For each (λ_time, λ_unit, λ_nn) combination, + compute cross-validation score by treating control observations as pseudo-treated + +2. **Per-observation estimation**: For each treated observation (i, t): + + a. Compute observation-specific weights θ^{i,t} and ω^{i,t} + b. Fit weighted model: Y = α + β + L + ε with nuclear norm penalty on L + c. Compute τ̂_{it} = Y_{it} - α̂_i - β̂_t - L̂_{it} + +3. **Average**: ATT = mean(τ̂_{it}) over all treated observations + +This structure provides the **triple robustness** property (Theorem 5.1): +the estimator is consistent if any one of the three components +(unit weights, time weights, factor model) is correctly specified. + +Example Usage +------------- + +Basic usage:: + + from diff_diff import TROP + + trop = TROP( + lambda_time_grid=[0.0, 0.5, 1.0, 2.0], + lambda_unit_grid=[0.0, 0.5, 1.0, 2.0], + lambda_nn_grid=[0.0, 0.1, 1.0], + n_bootstrap=200, + seed=42 + ) + + results = trop.fit( + data, + outcome='y', + treatment='treated', + unit='unit_id', + time='period', + post_periods=[10, 11, 12, 13, 14] + ) + results.print_summary() + +Quick estimation with convenience function:: + + from diff_diff import trop + + results = trop( + data, + outcome='y', + treatment='treated', + unit='unit_id', + time='period', + post_periods=[10, 11, 12, 13, 14], + n_bootstrap=200 + ) + +Examining factor structure:: + + # Get the estimated factor matrix + L = results.factor_matrix + print(f"Effective rank: {results.effective_rank:.2f}") + + # Individual treatment effects + effects_df = results.get_treatment_effects_df() + print(effects_df) + +Comparison with Synthetic DiD +----------------------------- + +TROP extends Synthetic DiD by adding factor model adjustment: + +.. list-table:: + :header-rows: 1 + :widths: 20 40 40 + + * - Feature + - Synthetic DiD + - TROP + * - Unit weights + - Constrained to sum to 1 + - Exponential distance-based + * - Time weights + - Constrained to sum to 1 + - Exponential time decay + * - Factor adjustment + - None + - Nuclear norm regularized L + * - Robustness + - Doubly robust + - Triply robust + +Use **SDID** when parallel trends is plausible. Use **TROP** when you suspect +factor confounding (regional shocks, economic cycles, latent factors). diff --git a/docs/tutorials/10_trop.ipynb b/docs/tutorials/10_trop.ipynb new file mode 100644 index 0000000..21fa5b5 --- /dev/null +++ b/docs/tutorials/10_trop.ipynb @@ -0,0 +1,571 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Triply Robust Panel (TROP) Estimator\n", + "\n", + "This notebook demonstrates the **Triply Robust Panel (TROP)** estimator (Athey, Imbens, Qu & Viviano, 2025), which combines three robustness components:\n", + "\n", + "1. **Nuclear Norm Regularized Factor Model**: Estimates interactive fixed effects via matrix completion with nuclear norm penalty\n", + "2. **Exponential Distance-Based Unit Weights**: ω_j = exp(-λ_unit × distance(j,i)) based on outcome similarity\n", + "3. 
**Exponential Time Decay Weights**: θ_s = exp(-λ_time × |s-t|) weighting by proximity to treatment\n", + "\n", + "TROP is particularly useful when:\n", + "- There may be unobserved time-varying confounders with factor structure\n", + "- Standard DiD or SDID may be biased due to latent factors\n", + "- You want robust inference under factor confounding\n", + "\n", + "We'll cover:\n", + "1. When to use TROP\n", + "2. Basic estimation with LOOCV tuning\n", + "3. Understanding tuning parameters\n", + "4. Examining factor structure\n", + "5. Comparing TROP vs SDID" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "import numpy as np\n", + "import pandas as pd\n", + "from diff_diff import TROP, trop, SyntheticDiD\n", + "\n", + "# For nicer plots (optional)\n", + "try:\n", + " import matplotlib.pyplot as plt\n", + " plt.style.use('seaborn-v0_8-whitegrid')\n", + " HAS_MATPLOTLIB = True\n", + "except ImportError:\n", + " HAS_MATPLOTLIB = False\n", + " print(\"matplotlib not installed - visualization examples will be skipped\")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 1. When to Use TROP\n", + "\n", + "Consider TROP when:\n", + "- You suspect **factor structure** in the data (e.g., economic cycles, regional shocks)\n", + "- **Unobserved confounders** affect units differently over time\n", + "- Standard parallel trends assumption may be violated due to common factors\n", + "- You have a **reasonably long pre-treatment period** to estimate factors\n", + "\n", + "The key difference from SDID is that TROP explicitly models and removes interactive fixed effects (factor contributions) before computing treatment effects." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": "def generate_factor_dgp(\n n_units=50,\n n_pre=10,\n n_post=5,\n n_treated=10,\n n_factors=2,\n treatment_effect=2.0,\n factor_strength=1.0,\n noise_std=0.5,\n seed=42\n):\n \"\"\"\n Generate panel data with known factor structure.\n \n DGP: Y_it = mu + alpha_i + beta_t + L_it + tau*D_it + eps_it\n \n where L_it = Lambda_i'F_t is the interactive fixed effects component.\n \n This creates a scenario where standard DiD/SDID may be biased,\n but TROP should recover the true treatment effect.\n \n Returns DataFrame with columns:\n - 'treated': observation-level indicator (1 if treated AND post-period) - for TROP\n - 'treat': unit-level ever-treated indicator (1 for all periods if unit is treated) - for SDID\n \"\"\"\n rng = np.random.default_rng(seed)\n \n n_control = n_units - n_treated\n n_periods = n_pre + n_post\n \n # Generate factors F: (n_periods, n_factors)\n F = rng.normal(0, 1, (n_periods, n_factors))\n \n # Generate loadings Lambda: (n_factors, n_units)\n # Make treated units have correlated loadings (creates confounding)\n Lambda = rng.normal(0, 1, (n_factors, n_units))\n Lambda[:, :n_treated] += 0.5 # Treated units have higher loadings\n \n # Unit fixed effects\n alpha = rng.normal(0, 1, n_units)\n alpha[:n_treated] += 1.0 # Treated units have higher intercept\n \n # Time fixed effects\n beta = np.linspace(0, 2, n_periods)\n \n # Generate outcomes\n data = []\n for i in range(n_units):\n is_treated = i < n_treated\n \n for t in range(n_periods):\n post = t >= n_pre\n \n y = 10.0 + alpha[i] + beta[t]\n y += factor_strength * (Lambda[:, i] @ F[t, :]) # L_it component\n \n if is_treated and post:\n y += treatment_effect\n \n y += rng.normal(0, noise_std)\n \n 
data.append({\n 'unit': i,\n 'period': t,\n 'outcome': y,\n 'treated': int(is_treated and post), # Observation-level (for TROP)\n 'treat': int(is_treated) # Unit-level ever-treated (for SDID)\n })\n \n return pd.DataFrame(data)\n\n\n# Generate data with factor structure\ntrue_att = 2.0\nn_factors = 2\nn_pre = 10\nn_post = 5\n\ndf = generate_factor_dgp(\n n_units=50,\n n_pre=n_pre,\n n_post=n_post,\n n_treated=10,\n n_factors=n_factors,\n treatment_effect=true_att,\n factor_strength=1.5, # Strong factor confounding\n noise_std=0.5,\n seed=42\n)\n\nprint(f\"Dataset: {len(df)} observations\")\nprint(f\"Treated units: 10\")\nprint(f\"Control units: 40\")\nprint(f\"Pre-treatment periods: {n_pre}\")\nprint(f\"Post-treatment periods: {n_post}\")\nprint(f\"True treatment effect: {true_att}\")\nprint(f\"True number of factors: {n_factors}\")" + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "if HAS_MATPLOTLIB:\n", + " # Visualize the data\n", + " fig, ax = plt.subplots(figsize=(12, 6))\n", + " \n", + " # Identify treated vs control units\n", + " treated_units = df.groupby('unit')['treated'].max()\n", + " control_unit_ids = treated_units[treated_units == 0].index[:20] # First 20 controls\n", + " treated_unit_ids = treated_units[treated_units == 1].index[:5] # First 5 treated\n", + " \n", + " # Plot control units (gray, thin lines)\n", + " for unit_id in control_unit_ids:\n", + " unit_data = df[df['unit'] == unit_id]\n", + " ax.plot(unit_data['period'], unit_data['outcome'], \n", + " color='gray', alpha=0.3, linewidth=0.5)\n", + " \n", + " # Plot treated units (colored, thick lines)\n", + " colors = plt.cm.Reds(np.linspace(0.4, 0.9, 5))\n", + " for i, unit_id in enumerate(treated_unit_ids):\n", + " unit_data = df[df['unit'] == unit_id]\n", + " ax.plot(unit_data['period'], unit_data['outcome'], \n", + " color=colors[i], linewidth=2, label=f'Treated {i+1}')\n", + " \n", + " # Mark treatment time\n", + " ax.axvline(x=n_pre - 0.5, color='black', linestyle='--', label='Treatment')\n", + " \n", + " ax.set_xlabel('Period')\n", + " ax.set_ylabel('Outcome')\n", + " ax.set_title('Panel Data with Factor Structure')\n", + " ax.legend(loc='upper left')\n", + " plt.tight_layout()\n", + " plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 2. Basic TROP Estimation\n", + "\n", + "TROP uses leave-one-out cross-validation (LOOCV) to select three tuning parameters:\n", + "- **λ_time**: Time weight decay (higher = focus on periods near treatment)\n", + "- **λ_unit**: Unit weight decay (higher = focus on similar units)\n", + "- **λ_nn**: Nuclear norm regularization (higher = lower rank factor model)\n", + "\n", + "By default, TROP searches over a grid of values for each parameter." 
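+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Before fitting, it can help to see what the exponential weights do. The next cell is a standalone illustration (not library code): it evaluates the time weights θ_s = exp(-λ_time × |s-t|) over the pre-treatment periods for a few values of λ_time, taking the first post-treatment period as t."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# Illustration only: how lambda_time shapes the exponential time weights\n",
+ "t_treat = n_pre  # first post-treatment period\n",
+ "pre = np.arange(n_pre)\n",
+ "for lam in [0.0, 0.5, 1.0, 2.0]:\n",
+ "    w = np.exp(-lam * np.abs(pre - t_treat))\n",
+ "    w = w / w.sum()  # normalize for comparison\n",
+ "    print(f\"lambda_time={lam:.1f}: weights on last 3 pre-periods = {w[-3:].round(3)}\")"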
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Fit TROP with automatic tuning via LOOCV\n", + "trop_est = TROP(\n", + " lambda_time_grid=[0.0, 0.5, 1.0, 2.0], # Time decay grid\n", + " lambda_unit_grid=[0.0, 0.5, 1.0, 2.0], # Unit distance grid \n", + " lambda_nn_grid=[0.0, 0.1, 1.0], # Nuclear norm grid\n", + " n_bootstrap=100, # Bootstrap replications for SE\n", + " seed=42\n", + ")\n", + "\n", + "post_periods = list(range(n_pre, n_pre + n_post))\n", + "\n", + "results = trop_est.fit(\n", + " df,\n", + " outcome='outcome',\n", + " treatment='treated',\n", + " unit='unit',\n", + " time='period',\n", + " post_periods=post_periods\n", + ")\n", + "\n", + "print(results.summary())" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Check the key results\n", + "print(f\"True ATT: {true_att:.4f}\")\n", + "print(f\"Estimated ATT: {results.att:.4f}\")\n", + "print(f\"Bias: {results.att - true_att:.4f}\")\n", + "print()\n", + "print(f\"Selected tuning parameters:\")\n", + "print(f\" λ_time: {results.lambda_time:.2f}\")\n", + "print(f\" λ_unit: {results.lambda_unit:.2f}\")\n", + "print(f\" λ_nn: {results.lambda_nn:.2f}\")\n", + "print(f\"\\nEffective rank of factor matrix: {results.effective_rank:.2f}\")\n", + "print(f\"True rank: {n_factors}\")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 3. Understanding the Tuning Parameters\n", + "\n", + "The three tuning parameters control different aspects of the estimation:\n", + "\n", + "### λ_time (Time Decay)\n", + "Controls how much weight to place on periods close to treatment:\n", + "- **λ_time = 0**: Equal weight to all pre-treatment periods\n", + "- **λ_time > 0**: More weight on recent pre-treatment periods\n", + "\n", + "### λ_unit (Unit Distance)\n", + "Controls how much weight to place on similar control units:\n", + "- **λ_unit = 0**: Equal weight to all control units\n", + "- **λ_unit > 0**: More weight on control units with similar pre-treatment trajectories\n", + "\n", + "### λ_nn (Nuclear Norm)\n", + "Controls the rank of the factor model:\n", + "- **λ_nn = 0**: No regularization (full rank)\n", + "- **λ_nn > 0**: Encourages low-rank factor structure" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Effect of different nuclear norm regularization levels\n", + "print(\"Effect of nuclear norm regularization (λ_nn):\")\n", + "print(\"=\"*65)\n", + "print(f\"{'λ_nn':>10} {'ATT':>12} {'Bias':>12} {'Eff. Rank':>15}\")\n", + "print(\"-\"*65)\n", + "\n", + "for lambda_nn in [0.0, 0.1, 1.0, 10.0]:\n", + " trop_fixed = TROP(\n", + " lambda_time_grid=[1.0], # Fixed\n", + " lambda_unit_grid=[1.0], # Fixed\n", + " lambda_nn_grid=[lambda_nn], # Vary this\n", + " n_bootstrap=20,\n", + " seed=42\n", + " )\n", + " \n", + " res = trop_fixed.fit(\n", + " df,\n", + " outcome='outcome',\n", + " treatment='treated',\n", + " unit='unit',\n", + " time='period',\n", + " post_periods=post_periods\n", + " )\n", + " \n", + " bias = res.att - true_att\n", + " print(f\"{lambda_nn:>10.1f} {res.att:>12.4f} {bias:>12.4f} {res.effective_rank:>15.2f}\")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 4. Examining the Factor Structure\n", + "\n", + "TROP estimates a low-rank factor matrix L that captures interactive fixed effects. We can examine this structure." 
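+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "metadata": {},
+ "source": [
+ "Aside: the nuclear norm penalty controls this rank by soft-thresholding singular values, replacing each s_k with max(s_k - λ_nn, 0). The next cell is a generic standalone illustration of that operation on a random matrix; it is a sketch of the mechanism, not the library's internal solver."
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "# Illustration only: soft-thresholding singular values lowers the rank\n",
+ "rng_demo = np.random.default_rng(0)\n",
+ "M = rng_demo.normal(size=(12, 30))  # generic full-rank matrix (rank 12)\n",
+ "_, s_demo, _ = np.linalg.svd(M, full_matrices=False)\n",
+ "for lam in [0.0, 2.5, 5.0]:\n",
+ "    s_shrunk = np.maximum(s_demo - lam, 0.0)\n",
+ "    print(f\"lambda_nn={lam}: rank after soft-thresholding = {int((s_shrunk > 0).sum())}\")"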
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Examine the factor matrix\n", + "L = results.factor_matrix\n", + "print(f\"Factor matrix shape: {L.shape} (periods x units)\")\n", + "print(f\"Effective rank: {results.effective_rank:.2f}\")\n", + "\n", + "# Compute singular values to see rank structure\n", + "U, s, Vt = np.linalg.svd(L, full_matrices=False)\n", + "print(f\"\\nSingular values (top 5): {s[:5].round(2)}\")\n", + "print(f\"Variance explained by top 2: {(s[:2]**2).sum() / (s**2).sum() * 100:.1f}%\")" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "if HAS_MATPLOTLIB:\n", + " fig, axes = plt.subplots(1, 2, figsize=(14, 5))\n", + " \n", + " # Scree plot of singular values\n", + " ax1 = axes[0]\n", + " ax1.bar(range(1, min(11, len(s)+1)), s[:10])\n", + " ax1.set_xlabel('Component')\n", + " ax1.set_ylabel('Singular Value')\n", + " ax1.set_title('Scree Plot of Factor Matrix')\n", + " ax1.axhline(y=0, color='gray', linestyle='-', linewidth=0.5)\n", + " \n", + " # Heatmap of factor matrix\n", + " ax2 = axes[1]\n", + " im = ax2.imshow(L, aspect='auto', cmap='RdBu_r', vmin=-2, vmax=2)\n", + " ax2.set_xlabel('Unit')\n", + " ax2.set_ylabel('Period')\n", + " ax2.set_title('Factor Matrix L (Interactive Fixed Effects)')\n", + " ax2.axhline(y=n_pre - 0.5, color='black', linestyle='--', linewidth=2)\n", + " plt.colorbar(im, ax=ax2, label='L_it')\n", + " \n", + " plt.tight_layout()\n", + " plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 5. Examining Unit and Time Effects\n", + "\n", + "TROP also estimates traditional unit and time fixed effects (α_i and β_t)." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Unit effects\n", + "unit_effects_df = results.get_unit_effects_df()\n", + "print(\"Unit effects (first 10):\")\n", + "print(unit_effects_df.head(10).to_string(index=False))" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Time effects\n", + "time_effects_df = results.get_time_effects_df()\n", + "print(\"Time effects:\")\n", + "print(time_effects_df.to_string(index=False))" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "if HAS_MATPLOTLIB:\n", + " fig, axes = plt.subplots(1, 2, figsize=(14, 5))\n", + " \n", + " # Unit effects\n", + " ax1 = axes[0]\n", + " ax1.bar(range(len(unit_effects_df)), unit_effects_df['effect'])\n", + " ax1.axvline(x=9.5, color='red', linestyle='--', label='Treated/Control boundary')\n", + " ax1.set_xlabel('Unit')\n", + " ax1.set_ylabel('Effect')\n", + " ax1.set_title('Unit Fixed Effects (α_i)')\n", + " ax1.legend()\n", + " \n", + " # Time effects\n", + " ax2 = axes[1]\n", + " ax2.plot(time_effects_df['time'], time_effects_df['effect'], 'o-', linewidth=2)\n", + " ax2.axvline(x=n_pre - 0.5, color='black', linestyle='--', label='Treatment')\n", + " ax2.set_xlabel('Period')\n", + " ax2.set_ylabel('Effect')\n", + " ax2.set_title('Time Fixed Effects (β_t)')\n", + " ax2.legend()\n", + " \n", + " plt.tight_layout()\n", + " plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 6. Comparing TROP vs SDID\n", + "\n", + "Let's compare TROP with Synthetic DiD to see the benefit of factor adjustment when the DGP has factor structure." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": "# SDID (no factor adjustment)\n# Note: SDID uses 'treat' (unit-level ever-treated indicator)\nsdid = SyntheticDiD(\n n_bootstrap=100,\n seed=42\n)\n\nsdid_results = sdid.fit(\n df,\n outcome='outcome',\n treatment='treat', # Unit-level ever-treated indicator\n unit='unit',\n time='period',\n post_periods=post_periods\n)\n\n# TROP (with factor adjustment)\n# Note: TROP uses 'treated' (observation-level treatment indicator)\ntrop_est2 = TROP(\n lambda_nn_grid=[0.0, 0.1, 1.0], # Allow factor estimation\n n_bootstrap=100,\n seed=42\n)\n\ntrop_results = trop_est2.fit(\n df,\n outcome='outcome',\n treatment='treated', # Observation-level indicator\n unit='unit',\n time='period',\n post_periods=post_periods\n)\n\nprint(\"Comparison: SDID vs TROP\")\nprint(\"=\"*60)\nprint(f\"True ATT: {true_att:.4f}\")\nprint()\nprint(f\"Synthetic DiD (no factor adjustment):\")\nprint(f\" ATT: {sdid_results.att:.4f}\")\nprint(f\" SE: {sdid_results.se:.4f}\")\nprint(f\" Bias: {sdid_results.att - true_att:.4f}\")\nprint()\nprint(f\"TROP (with factor adjustment):\")\nprint(f\" ATT: {trop_results.att:.4f}\")\nprint(f\" SE: {trop_results.se:.4f}\")\nprint(f\" Bias: {trop_results.att - true_att:.4f}\")\nprint(f\" Effective rank: {trop_results.effective_rank:.2f}\")" + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 7. Monte Carlo Comparison\n", + "\n", + "Let's run a small Monte Carlo simulation to compare TROP and SDID under the factor DGP." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": "# Monte Carlo comparison\nn_sims = 20\ntrop_estimates = []\nsdid_estimates = []\n\nprint(f\"Running {n_sims} simulations...\")\n\nfor sim in range(n_sims):\n # Generate new data (includes both 'treated' and 'treat' columns)\n sim_data = generate_factor_dgp(\n n_units=50,\n n_pre=10,\n n_post=5,\n n_treated=10,\n n_factors=2,\n treatment_effect=2.0,\n factor_strength=1.5,\n noise_std=0.5,\n seed=100 + sim\n )\n \n # TROP (uses observation-level 'treated')\n try:\n trop_m = TROP(\n lambda_time_grid=[1.0],\n lambda_unit_grid=[1.0],\n lambda_nn_grid=[0.1],\n n_bootstrap=10, \n seed=42 + sim\n )\n trop_res = trop_m.fit(\n sim_data,\n outcome='outcome',\n treatment='treated',\n unit='unit',\n time='period',\n post_periods=list(range(10, 15))\n )\n trop_estimates.append(trop_res.att)\n except Exception as e:\n print(f\"TROP failed on sim {sim}: {e}\")\n \n # SDID (uses unit-level 'treat')\n try:\n sdid_m = SyntheticDiD(n_bootstrap=10, seed=42 + sim)\n sdid_res = sdid_m.fit(\n sim_data,\n outcome='outcome',\n treatment='treat', # Unit-level ever-treated indicator\n unit='unit',\n time='period',\n post_periods=list(range(10, 15))\n )\n sdid_estimates.append(sdid_res.att)\n except Exception as e:\n print(f\"SDID failed on sim {sim}: {e}\")\n\nprint(f\"\\nMonte Carlo Results (True ATT = {true_att})\")\nprint(\"=\"*60)\nprint(f\"{'Estimator':<15} {'Mean':>12} {'Bias':>12} {'RMSE':>12}\")\nprint(\"-\"*60)\n\nif trop_estimates:\n trop_mean = np.mean(trop_estimates)\n trop_bias = trop_mean - true_att\n trop_rmse = np.sqrt(np.mean([(e - true_att)**2 for e in trop_estimates]))\n print(f\"{'TROP':<15} {trop_mean:>12.4f} {trop_bias:>12.4f} {trop_rmse:>12.4f}\")\n\nif sdid_estimates:\n sdid_mean = np.mean(sdid_estimates)\n sdid_bias = sdid_mean - true_att\n sdid_rmse = np.sqrt(np.mean([(e - true_att)**2 for e in sdid_estimates]))\n print(f\"{'SDID':<15} 
{sdid_mean:>12.4f} {sdid_bias:>12.4f} {sdid_rmse:>12.4f}\")" + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "if HAS_MATPLOTLIB and trop_estimates and sdid_estimates:\n", + " # Visualize Monte Carlo results\n", + " fig, ax = plt.subplots(figsize=(10, 6))\n", + " \n", + " ax.hist(sdid_estimates, bins=15, alpha=0.6, label='SDID', color='blue')\n", + " ax.hist(trop_estimates, bins=15, alpha=0.6, label='TROP', color='red')\n", + " ax.axvline(x=true_att, color='black', linewidth=2, linestyle='--', label=f'True ATT = {true_att}')\n", + " \n", + " ax.set_xlabel('Estimated ATT')\n", + " ax.set_ylabel('Frequency')\n", + " ax.set_title('Monte Carlo Distribution of Estimates')\n", + " ax.legend()\n", + " plt.tight_layout()\n", + " plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 8. Using the Convenience Function\n", + "\n", + "For quick estimation, you can use the `trop()` convenience function." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# One-liner estimation with default tuning grid\n", + "quick_results = trop(\n", + " df,\n", + " outcome='outcome',\n", + " treatment='treated',\n", + " unit='unit',\n", + " time='period',\n", + " post_periods=post_periods,\n", + " n_bootstrap=50,\n", + " seed=42\n", + ")\n", + "\n", + "print(f\"Quick estimation:\")\n", + "print(f\" ATT: {quick_results.att:.4f}\")\n", + "print(f\" SE: {quick_results.se:.4f}\")\n", + "print(f\" λ_time: {quick_results.lambda_time:.2f}\")\n", + "print(f\" λ_unit: {quick_results.lambda_unit:.2f}\")\n", + "print(f\" λ_nn: {quick_results.lambda_nn:.2f}\")\n", + "print(f\" Effective rank: {quick_results.effective_rank:.2f}\")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 9. Variance Estimation Methods\n", + "\n", + "TROP supports two methods for variance estimation:\n", + "- **Bootstrap** (default): Unit-level block bootstrap\n", + "- **Jackknife**: Leave-one-treated-unit-out" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Compare variance estimation methods\n", + "print(\"Variance estimation comparison:\")\n", + "print(\"=\"*50)\n", + "\n", + "for method in ['bootstrap', 'jackknife']:\n", + " trop_var = TROP(\n", + " lambda_time_grid=[1.0],\n", + " lambda_unit_grid=[1.0], \n", + " lambda_nn_grid=[0.1],\n", + " variance_method=method,\n", + " n_bootstrap=100,\n", + " seed=42\n", + " )\n", + " \n", + " res = trop_var.fit(\n", + " df,\n", + " outcome='outcome',\n", + " treatment='treated',\n", + " unit='unit',\n", + " time='period',\n", + " post_periods=post_periods\n", + " )\n", + " \n", + " print(f\"\\n{method.capitalize()}:\")\n", + " print(f\" ATT: {res.att:.4f}\")\n", + " print(f\" SE: {res.se:.4f}\")\n", + " print(f\" 95% CI: [{res.conf_int[0]:.4f}, {res.conf_int[1]:.4f}]\")" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## 10. Results Export\n", + "\n", + "TROP results can be easily exported to different formats." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Convert to dictionary\n", + "results_dict = results.to_dict()\n", + "print(\"Results as dictionary:\")\n", + "for key, value in results_dict.items():\n", + " if isinstance(value, float):\n", + " print(f\" {key}: {value:.4f}\")\n", + " else:\n", + " print(f\" {key}: {value}\")" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Convert to DataFrame\n", + "results_df = results.to_dataframe()\n", + "print(\"\\nResults as DataFrame:\")\n", + "print(results_df.T)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "# Individual treatment effects\n", + "treatment_effects_df = results.get_treatment_effects_df()\n", + "print(\"\\nIndividual treatment effects (first 10):\")\n", + "print(treatment_effects_df.head(10).to_string(index=False))" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Summary\n", + "\n", + "Key takeaways for TROP:\n", + "\n", + "1. **Best use cases**: Factor confounding, unobserved time-varying confounders with interactive effects\n", + "2. **Factor estimation**: Nuclear norm regularization with LOOCV for tuning\n", + "3. **Three tuning parameters**: λ_time, λ_unit, λ_nn selected automatically\n", + "4. **Unit weights**: Exponential distance-based weighting of control units\n", + "5. **Time weights**: Exponential decay weighting of pre-treatment periods\n", + "\n", + "**When to use TROP vs SDID**:\n", + "- Use **SDID** when parallel trends is plausible and factors are not a concern\n", + "- Use **TROP** when you suspect factor confounding (regional shocks, economic cycles, latent factors)\n", + "- Running both provides a useful robustness check\n", + "\n", + "**Reference**:\n", + "- Athey, S., Imbens, G. W., Qu, Z., & Viviano, D. (2025). Triply Robust Panel Estimators. *Working Paper*. https://arxiv.org/abs/2508.21536" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "name": "python", + "version": "3.11.0" + } + }, + "nbformat": 4, + "nbformat_minor": 4 +} \ No newline at end of file diff --git a/pyproject.toml b/pyproject.toml index d73637a..00a2caf 100644 --- a/pyproject.toml +++ b/pyproject.toml @@ -4,7 +4,7 @@ build-backend = "maturin" [project] name = "diff-diff" -version = "2.0.4" +version = "2.1.0" description = "A library for Difference-in-Differences causal inference analysis" readme = "README.md" license = "MIT" diff --git a/tests/test_trop.py b/tests/test_trop.py new file mode 100644 index 0000000..b4df5a3 --- /dev/null +++ b/tests/test_trop.py @@ -0,0 +1,962 @@ +"""Tests for Triply Robust Panel (TROP) estimator.""" + +import numpy as np +import pandas as pd +import pytest + +from diff_diff import SyntheticDiD +from diff_diff.trop import TROP, TROPResults, trop + + +def generate_factor_dgp( + n_units: int = 50, + n_pre: int = 10, + n_post: int = 5, + n_treated: int = 10, + n_factors: int = 2, + treatment_effect: float = 2.0, + factor_strength: float = 1.0, + noise_std: float = 0.5, + seed: int = 42, +) -> pd.DataFrame: + """ + Generate panel data with known factor structure. 
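+
+    Returns a long-format DataFrame with columns 'unit', 'period', 'outcome',
+    and an observation-level 'treated' indicator.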
+ + DGP: Y_it = mu + gamma_i + delta_t + Lambda_i'F_t + tau*D_it + eps_it + """ + rng = np.random.default_rng(seed) + + n_control = n_units - n_treated + n_periods = n_pre + n_post + + # Generate factors F: (n_periods, n_factors) + F = rng.normal(0, 1, (n_periods, n_factors)) + + # Generate loadings Lambda: (n_factors, n_units) + Lambda = rng.normal(0, 1, (n_factors, n_units)) + Lambda[:, :n_treated] += 0.5 + + # Unit fixed effects + gamma = rng.normal(0, 1, n_units) + gamma[:n_treated] += 1.0 + + # Time fixed effects + delta = np.linspace(0, 2, n_periods) + + # Generate outcomes + data = [] + for i in range(n_units): + is_treated = i < n_treated + + for t in range(n_periods): + period = t + post = t >= n_pre + + y = 10.0 + gamma[i] + delta[t] + y += factor_strength * (Lambda[:, i] @ F[t, :]) + + # Treatment effect only for treated units in post period + treatment_indicator = 1 if (is_treated and post) else 0 + if treatment_indicator: + y += treatment_effect + + y += rng.normal(0, noise_std) + + data.append({ + "unit": i, + "period": period, + "outcome": y, + "treated": treatment_indicator, + }) + + return pd.DataFrame(data) + + +@pytest.fixture +def factor_dgp_data(): + """Generate data with factor structure and known treatment effect.""" + return generate_factor_dgp( + n_units=30, + n_pre=8, + n_post=4, + n_treated=5, + n_factors=2, + treatment_effect=2.0, + factor_strength=1.0, + noise_std=0.5, + seed=42, + ) + + +@pytest.fixture +def simple_panel_data(): + """Generate simple panel data without factors.""" + rng = np.random.default_rng(123) + + n_units = 20 + n_treated = 5 + n_pre = 5 + n_post = 3 + true_att = 3.0 + + data = [] + for i in range(n_units): + is_treated = i < n_treated + for t in range(n_pre + n_post): + post = t >= n_pre + y = 10.0 + i * 0.1 + t * 0.5 + treatment_indicator = 1 if (is_treated and post) else 0 + if treatment_indicator: + y += true_att + y += rng.normal(0, 0.5) + data.append({ + "unit": i, + "period": t, + "outcome": y, + "treated": treatment_indicator, + }) + + return pd.DataFrame(data) + + +class TestTROP: + """Tests for TROP estimator.""" + + def test_basic_fit(self, simple_panel_data): + """Test basic model fitting.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + assert isinstance(results, TROPResults) + assert trop_est.is_fitted_ + assert results.n_obs == len(simple_panel_data) + assert results.n_control == 15 + assert results.n_treated == 5 + + def test_fit_with_factors(self, factor_dgp_data): + """Test fitting with factor structure.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1, 1.0], + n_bootstrap=20, + seed=42 + ) + post_periods = list(range(8, 12)) + results = trop_est.fit( + factor_dgp_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=post_periods, + ) + + assert isinstance(results, TROPResults) + assert results.effective_rank >= 0 + assert results.factor_matrix.shape == (12, 30) # n_periods x n_units + + def test_treatment_effect_recovery(self, factor_dgp_data): + """Test that TROP recovers treatment effect direction.""" + true_att = 2.0 + + trop_est = TROP( + lambda_time_grid=[0.0, 0.5, 1.0], + lambda_unit_grid=[0.0, 0.5, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=30, + seed=42 + 
) + post_periods = list(range(8, 12)) + results = trop_est.fit( + factor_dgp_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=post_periods, + ) + + # ATT should be positive (correct direction) + assert results.att > 0 + # Should be reasonably close to true value + assert abs(results.att - true_att) < 3.0 + + def test_tuning_parameter_selection(self, simple_panel_data): + """Test that LOOCV selects tuning parameters.""" + trop_est = TROP( + lambda_time_grid=[0.0, 0.5, 1.0, 2.0], + lambda_unit_grid=[0.0, 0.5, 1.0], + lambda_nn_grid=[0.0, 0.1, 1.0], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + # Check that lambda values are from the grid + assert results.lambda_time in trop_est.lambda_time_grid + assert results.lambda_unit in trop_est.lambda_unit_grid + assert results.lambda_nn in trop_est.lambda_nn_grid + + def test_bootstrap_variance(self, simple_panel_data): + """Test bootstrap variance estimation.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + variance_method="bootstrap", + n_bootstrap=30, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + assert results.se > 0 + assert results.variance_method == "bootstrap" + assert results.n_bootstrap == 30 + assert results.bootstrap_distribution is not None + + def test_jackknife_variance(self, simple_panel_data): + """Test jackknife variance estimation.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + variance_method="jackknife", + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + assert results.se >= 0 + assert results.variance_method == "jackknife" + + def test_confidence_interval(self, simple_panel_data): + """Test confidence interval properties.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + alpha=0.05, + n_bootstrap=30, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + lower, upper = results.conf_int + assert lower < results.att < upper + assert lower < upper + + def test_get_set_params(self): + """Test sklearn-compatible get_params and set_params.""" + trop_est = TROP(alpha=0.05) + + params = trop_est.get_params() + assert params["alpha"] == 0.05 + + trop_est.set_params(alpha=0.10) + assert trop_est.alpha == 0.10 + + def test_invalid_variance_method(self): + """Test error on invalid variance method.""" + with pytest.raises(ValueError): + TROP(variance_method="invalid") + + def test_missing_columns(self, simple_panel_data): + """Test error when column is missing.""" + trop_est = TROP( + lambda_time_grid=[0.0], + lambda_unit_grid=[0.0], + lambda_nn_grid=[0.0], + n_bootstrap=5 + ) + with pytest.raises(ValueError, match="Missing columns"): + trop_est.fit( + simple_panel_data, + outcome="nonexistent", + treatment="treated", + unit="unit", + time="period", + ) + + def test_no_treated_observations(self): + """Test error when no treated observations.""" + data = pd.DataFrame({ + "unit": [0, 0, 1, 1], + "period": 
[0, 1, 0, 1], + "outcome": [1, 2, 3, 4], + "treated": [0, 0, 0, 0], + }) + + trop_est = TROP( + lambda_time_grid=[0.0], + lambda_unit_grid=[0.0], + lambda_nn_grid=[0.0], + n_bootstrap=5 + ) + with pytest.raises(ValueError, match="No treated observations"): + trop_est.fit( + data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + ) + + def test_no_control_units(self): + """Test error when no control units.""" + data = pd.DataFrame({ + "unit": [0, 0, 1, 1], + "period": [0, 1, 0, 1], + "outcome": [1, 2, 3, 4], + "treated": [0, 1, 0, 1], # Both units become treated + }) + + trop_est = TROP( + lambda_time_grid=[0.0], + lambda_unit_grid=[0.0], + lambda_nn_grid=[0.0], + n_bootstrap=5 + ) + with pytest.raises(ValueError, match="No control units"): + trop_est.fit( + data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + ) + + +class TestTROPResults: + """Tests for TROPResults dataclass.""" + + def test_summary(self, simple_panel_data): + """Test that summary produces string output.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + summary = results.summary() + assert isinstance(summary, str) + assert "ATT" in summary + assert "TROP" in summary + assert "LOOCV" in summary + assert "Lambda" in summary + + def test_to_dict(self, simple_panel_data): + """Test conversion to dictionary.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + d = results.to_dict() + assert "att" in d + assert "se" in d + assert "lambda_time" in d + assert "lambda_unit" in d + assert "lambda_nn" in d + assert "effective_rank" in d + + def test_to_dataframe(self, simple_panel_data): + """Test conversion to DataFrame.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + df = results.to_dataframe() + assert isinstance(df, pd.DataFrame) + assert len(df) == 1 + assert "att" in df.columns + + def test_get_treatment_effects_df(self, simple_panel_data): + """Test getting treatment effects DataFrame.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + effects_df = results.get_treatment_effects_df() + assert isinstance(effects_df, pd.DataFrame) + assert "unit" in effects_df.columns + assert "time" in effects_df.columns + assert "effect" in effects_df.columns + assert len(effects_df) == results.n_treated_obs + + def test_get_unit_effects_df(self, simple_panel_data): + """Test getting unit effects DataFrame.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + 
outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + effects_df = results.get_unit_effects_df() + assert isinstance(effects_df, pd.DataFrame) + assert "unit" in effects_df.columns + assert "effect" in effects_df.columns + + def test_get_time_effects_df(self, simple_panel_data): + """Test getting time effects DataFrame.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + effects_df = results.get_time_effects_df() + assert isinstance(effects_df, pd.DataFrame) + assert "time" in effects_df.columns + assert "effect" in effects_df.columns + + def test_is_significant(self, simple_panel_data): + """Test significance property.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + alpha=0.05, + n_bootstrap=30, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + assert isinstance(results.is_significant, bool) + + def test_significance_stars(self, simple_panel_data): + """Test significance stars.""" + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=30, + seed=42 + ) + results = trop_est.fit( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + ) + + stars = results.significance_stars + assert stars in ["", ".", "*", "**", "***"] + + +class TestTROPvsSDID: + """Tests comparing TROP to SDID under different DGPs.""" + + def test_trop_handles_factor_dgp(self): + """Test that TROP works on factor DGP data.""" + data = generate_factor_dgp( + n_units=30, + n_pre=8, + n_post=4, + n_treated=5, + n_factors=2, + treatment_effect=2.0, + factor_strength=1.5, + noise_std=0.5, + seed=42, + ) + post_periods = list(range(8, 12)) + + # TROP should complete without error + trop_est = TROP( + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1, 1.0], + n_bootstrap=20, + seed=42 + ) + results = trop_est.fit( + data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=post_periods, + ) + + assert results.att != 0 + assert results.se >= 0 + + +class TestConvenienceFunction: + """Tests for trop() convenience function.""" + + def test_convenience_function(self, simple_panel_data): + """Test that convenience function works.""" + results = trop( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + lambda_time_grid=[0.0, 1.0], + lambda_unit_grid=[0.0, 1.0], + lambda_nn_grid=[0.0, 0.1], + n_bootstrap=10, + seed=42, + ) + + assert isinstance(results, TROPResults) + assert results.n_obs == len(simple_panel_data) + + def test_convenience_with_kwargs(self, simple_panel_data): + """Test convenience function with additional kwargs.""" + results = trop( + simple_panel_data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=[5, 6, 7], + lambda_time_grid=[0.0, 0.5, 1.0], + lambda_unit_grid=[0.0, 0.5], + lambda_nn_grid=[0.0, 0.1], + max_iter=50, + n_bootstrap=10, + seed=42, + ) + + assert isinstance(results, TROPResults) + + +class 
TestMethodologyVerification: + """Tests verifying TROP methodology matches paper specifications. + + These tests verify: + 1. Limiting cases match expected behavior + 2. Treatment effect recovery under paper's simulation DGP + 3. Observation-specific weighting produces expected results + """ + + def test_limiting_case_uniform_weights(self): + """ + Test limiting case: λ_unit = λ_time = 0, λ_nn = 0. + + With all lambdas at zero, TROP should use uniform weights and no + nuclear norm regularization, giving TWFE-like estimates. + """ + # Generate simple data with known treatment effect + rng = np.random.default_rng(42) + n_units = 15 + n_treated = 5 + n_pre = 5 + n_post = 3 + true_att = 3.0 + + data = [] + for i in range(n_units): + is_treated = i < n_treated + unit_fe = rng.normal(0, 0.5) + for t in range(n_pre + n_post): + post = t >= n_pre + time_fe = 0.2 * t + y = 10.0 + unit_fe + time_fe + treatment_indicator = 1 if (is_treated and post) else 0 + if treatment_indicator: + y += true_att + y += rng.normal(0, 0.3) + data.append({ + "unit": i, + "period": t, + "outcome": y, + "treated": treatment_indicator, + }) + + df = pd.DataFrame(data) + post_periods = list(range(n_pre, n_pre + n_post)) + + # TROP with uniform weights + trop_est = TROP( + lambda_time_grid=[0.0], + lambda_unit_grid=[0.0], + lambda_nn_grid=[0.0], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + df, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=post_periods, + ) + + # Should recover treatment effect within reasonable tolerance + assert abs(results.att - true_att) < 1.0, \ + f"ATT={results.att:.3f} should be close to true={true_att}" + # Check that uniform weights were selected + assert results.lambda_time == 0.0 + assert results.lambda_unit == 0.0 + assert results.lambda_nn == 0.0 + + def test_unit_weights_reduce_bias(self): + """ + Test that unit distance-based weights reduce bias when controls vary. + + When control units have varying similarity to treated units, using + distance-based unit weights should improve estimation. 
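+
+        In this DGP the treated units and the first five controls share a
+        similar intercept while the remaining controls sit at a different
+        level, so lambda_unit > 0 should down-weight the dissimilar group.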
+ """ + rng = np.random.default_rng(123) + n_units = 25 + n_treated = 5 + n_pre = 6 + n_post = 3 + true_att = 2.5 + + data = [] + # Create heterogeneous control units - some similar to treated, some different + for i in range(n_units): + is_treated = i < n_treated + # Treated units and first 5 controls are similar + if is_treated or i < n_treated + 5: + unit_fe = 5.0 + rng.normal(0, 0.3) + else: + # Remaining controls are dissimilar + unit_fe = 10.0 + rng.normal(0, 0.5) + + for t in range(n_pre + n_post): + post = t >= n_pre + time_fe = 0.2 * t + y = unit_fe + time_fe + treatment_indicator = 1 if (is_treated and post) else 0 + if treatment_indicator: + y += true_att + y += rng.normal(0, 0.3) + data.append({ + "unit": i, + "period": t, + "outcome": y, + "treated": treatment_indicator, + }) + + df = pd.DataFrame(data) + post_periods = list(range(n_pre, n_pre + n_post)) + + # TROP with unit weighting enabled + trop_est = TROP( + lambda_time_grid=[0.0], + lambda_unit_grid=[0.0, 1.0, 2.0], + lambda_nn_grid=[0.0], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + df, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=post_periods, + ) + + # Should recover treatment effect reasonably well + assert abs(results.att - true_att) < 1.5, \ + f"ATT={results.att:.3f} should be close to true={true_att}" + + def test_time_weights_reduce_bias(self): + """ + Test that time distance-based weights reduce bias with trending data. + + When pre-treatment outcomes are trending, weighting recent periods + more heavily should improve estimation. + """ + rng = np.random.default_rng(456) + n_units = 20 + n_treated = 5 + n_pre = 8 + n_post = 3 + true_att = 2.0 + + data = [] + for i in range(n_units): + is_treated = i < n_treated + unit_fe = rng.normal(0, 0.5) + + for t in range(n_pre + n_post): + post = t >= n_pre + # Time trend that accelerates near treatment + time_fe = 0.1 * t + 0.05 * (t ** 2 / n_pre) + y = 10.0 + unit_fe + time_fe + treatment_indicator = 1 if (is_treated and post) else 0 + if treatment_indicator: + y += true_att + y += rng.normal(0, 0.3) + data.append({ + "unit": i, + "period": t, + "outcome": y, + "treated": treatment_indicator, + }) + + df = pd.DataFrame(data) + post_periods = list(range(n_pre, n_pre + n_post)) + + # TROP with time weighting enabled + trop_est = TROP( + lambda_time_grid=[0.0, 0.5, 1.0], + lambda_unit_grid=[0.0], + lambda_nn_grid=[0.0], + n_bootstrap=10, + seed=42 + ) + results = trop_est.fit( + df, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=post_periods, + ) + + # Should recover treatment effect direction + assert results.att > 0, f"ATT={results.att:.3f} should be positive" + # Check that time weighting was considered + assert results.lambda_time in [0.0, 0.5, 1.0] + + def test_factor_model_reduces_bias(self): + """ + Test that nuclear norm regularization reduces bias with factor structure. + + Following paper's simulation: when true DGP has interactive fixed effects, + the factor model component should help recover the treatment effect. 
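+
+        The DGP below uses factor_strength=1.5 with noise_std=0.5, so the
+        interactive component is large relative to the idiosyncratic noise.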
+ """ + # Generate data with known factor structure + data = generate_factor_dgp( + n_units=40, + n_pre=10, + n_post=5, + n_treated=8, + n_factors=2, + treatment_effect=2.0, + factor_strength=1.5, # Strong factors + noise_std=0.5, + seed=789, + ) + post_periods = list(range(10, 15)) + + # TROP with nuclear norm regularization + trop_est = TROP( + lambda_time_grid=[0.0, 0.5], + lambda_unit_grid=[0.0, 0.5], + lambda_nn_grid=[0.0, 0.1, 1.0, 5.0], + n_bootstrap=20, + seed=42 + ) + results = trop_est.fit( + data, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=post_periods, + ) + + true_att = 2.0 + # With factor adjustment, should recover treatment effect + assert abs(results.att - true_att) < 2.0, \ + f"ATT={results.att:.3f} should be within 2.0 of true={true_att}" + # Factor matrix should capture some structure + assert results.effective_rank > 0, "Factor matrix should have positive rank" + + def test_paper_dgp_recovery(self): + """ + Test treatment effect recovery using paper's simulation DGP. + + Based on Table 2 (page 32) simulation settings: + - Factor model with 2 factors + - Treatment effect = 0 (null hypothesis) + - Should produce estimates centered around zero + + This is a methodological validation test. + """ + # Generate data similar to paper's simulation + rng = np.random.default_rng(2024) + n_units = 50 + n_treated = 10 + n_pre = 10 + n_post = 5 + n_factors = 2 + true_tau = 0.0 # Null treatment effect + + # Generate factors F: (n_periods, n_factors) + F = rng.normal(0, 1, (n_pre + n_post, n_factors)) + + # Generate loadings Lambda: (n_factors, n_units) + Lambda = rng.normal(0, 1, (n_factors, n_units)) + # Treated units have different loadings (selection on unobservables) + Lambda[:, :n_treated] += 0.5 + + # Unit fixed effects + gamma = rng.normal(0, 1, n_units) + gamma[:n_treated] += 1.0 # Selection on levels + + # Time fixed effects (linear trend) + delta = np.linspace(0, 2, n_pre + n_post) + + data = [] + for i in range(n_units): + is_treated = i < n_treated + for t in range(n_pre + n_post): + post = t >= n_pre + # Y = mu + gamma_i + delta_t + Lambda_i'F_t + tau*D + eps + y = 10.0 + gamma[i] + delta[t] + y += Lambda[:, i] @ F[t, :] # Factor component + treatment_indicator = 1 if (is_treated and post) else 0 + if treatment_indicator: + y += true_tau + y += rng.normal(0, 0.5) # Idiosyncratic noise + + data.append({ + "unit": i, + "period": t, + "outcome": y, + "treated": treatment_indicator, + }) + + df = pd.DataFrame(data) + post_periods = list(range(n_pre, n_pre + n_post)) + + # TROP estimation + trop_est = TROP( + lambda_time_grid=[0.0, 0.5, 1.0], + lambda_unit_grid=[0.0, 0.5, 1.0], + lambda_nn_grid=[0.0, 0.1, 1.0], + n_bootstrap=30, + seed=42 + ) + results = trop_est.fit( + df, + outcome="outcome", + treatment="treated", + unit="unit", + time="period", + post_periods=post_periods, + ) + + # Under null hypothesis, ATT should be close to zero + # Allow for estimation error (this is a finite sample) + assert abs(results.att) < 2.0, \ + f"ATT={results.att:.3f} should be close to true={true_tau} under null" + # Check that factor model was used + assert results.effective_rank >= 0
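+
+        # Note: we deliberately do not assert that the bootstrap CI covers
+        # zero; coverage holds on average across simulation draws, not in
+        # every single finite-sample realization.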