Skip to content

Conversation

@okiner-3
Copy link
Collaborator

Add a tutorial with Oregon Health Insurance Experiment.

@okiner-3 okiner-3 self-assigned this Oct 14, 2025
@okiner-3 okiner-3 requested a review from TomeHirata October 14, 2025 08:00
@TomeHirata
Copy link
Collaborator

TomeHirata commented Oct 16, 2025

@okiner-3 Thank you so much for the PR! Sorry if it was not clear in the ticket description, but we want to focus on the local distributional treatment effects (LDTE) and LPTE since incomplete compliance was observed in the experiment. In the Oregon dataset, the treatment column represents the treatment assignment, the numhh_list column is the strata, and the ohp_all_ever_inperson column indicates the actual treatment received. We can keep covariates and outcomes as they are. Could you please take a look at https://cyberagentailab.github.io/python-dte-adjustment/api/local.html and revise the content? It is valuable to compare ITT and LDTE, so we can keep the current analysis as it is, but let's also include the LDTE/LPTE results. Let me know if you need further clarification.

@okiner-3
Copy link
Collaborator Author

@TomeHirata
I understood. I will take care of it!

@TomeHirata TomeHirata requested a review from Copilot October 21, 2025 11:16
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR adds a comprehensive tutorial demonstrating the use of Local Distribution Treatment Effects (LDTE) analysis with the Oregon Health Insurance Experiment dataset. The tutorial showcases how to handle non-compliance scenarios using instrumental variable approaches when not all participants assigned to treatment actually enrolled.

Key changes:

  • New comprehensive tutorial file analyzing emergency department costs and visits using local distribution treatment effects
  • Tutorial demonstrates both simple and ML-adjusted estimators for handling non-compliance
  • Includes stratified analysis by household registration patterns to examine treatment effect heterogeneity

Reviewed Changes

Copilot reviewed 2 out of 11 changed files in this pull request and generated 4 comments.

File Description
docs/source/tutorials/oregon.rst Comprehensive tutorial implementing LDTE analysis for the Oregon Health Insurance Experiment with non-compliance handling
docs/source/tutorials.rst Added reference to the new Oregon tutorial in the documentation index

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

@TomeHirata
Copy link
Collaborator

Hi, @okiner-3. How's the progress so far? Feel free to let me know if you have any questions!

@okiner-3 okiner-3 requested a review from TomeHirata December 21, 2025 12:08
@okiner-3
Copy link
Collaborator Author

@TomeHirata
I'm sorry it took so long.
The general and abstract considerations have been grounded in the current data and results.
Additionally, duplicate sections and code that were accidentally included have been removed.


**2. Covariate Adjustment Effects and Confidence Intervals**

The confidence intervals remain wide for both estimators, though ML adjustment shows slightly more consistent patterns in the moderate cost range. The limited precision suggests: (1) substantial heterogeneity in treatment effects within cost bins, (2) limited predictive power of covariates for specific cost levels, or (3) relatively small sample sizes within individual bins.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did we observe no precision gain?

- **"Signed self up" stratum**: Confidence intervals remain wide but manageable for both estimators, showing similar patterns to the overall population.
- **"Signed self up + others" stratum**:

- Extreme estimation instability, particularly for ML-adjusted estimator
Copy link
Collaborator

@TomeHirata TomeHirata Dec 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The ML estimation is a bit odd, have you tried using different ml models or fold number?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changing the ML estimation model stabilized the confidence intervals.


Stratified analysis uncovers dramatic treatment effect heterogeneity: single-person households ("signed self up") show moderate effects (LDTE ≈ -0.18 to -0.20), while multi-person households ("signed self up + others") exhibit 3-4x larger effects (LDTE ≈ -0.55). This suggests household structure is a critical moderator—insurance enables care-seeking for multiple family members when households include dependents.

**4. Limited Efficiency Gains from ML Adjustment**
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should figure out the way to increase the efficiency gain instead of listing its difficulty here.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No implementation errors were found.
Some exploration was conducted by adding other features, but no improvement was observed.

@okiner-3 okiner-3 requested a review from TomeHirata January 21, 2026 09:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants