
Make xgboost optional in synthetic dataset generation#872

Merged
jeongyoonlee merged 4 commits into uber:master from Si-ra-kri:fix-optional-xgboost
Feb 20, 2026

Conversation

@Si-ra-kri
Contributor

This change makes xgboost an optional dependency for synthetic dataset generation. When xgboost is not installed, LinearRegression-based
learners continue to work without import errors.
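The usual pattern for this kind of change is a guarded import with a `None` sentinel. The sketch below shows an assumed form of that pattern (the actual module layout in causalml may differ; `available_models` is a hypothetical helper for illustration):

```python
# Optional-dependency pattern: import xgboost inside try/except so its
# absence does not break importing this module.
try:
    from xgboost import XGBRegressor
except ImportError:
    XGBRegressor = None  # sentinel checked before use

try:
    from sklearn.linear_model import LinearRegression
except ImportError:
    class LinearRegression:  # stand-in so this sketch runs without sklearn
        pass


def available_models():
    """Return (model_class, label) pairs, skipping xgboost when absent."""
    models = [(LinearRegression, "LR")]
    if XGBRegressor is not None:
        models.append((XGBRegressor, "XGB"))
    return models
```

With this guard in place, environments without xgboost simply get the LinearRegression-only list instead of an `ImportError` at import time.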

@CLAassistant

CLAassistant commented Feb 7, 2026

CLA assistant check
All committers have signed the CLA.

@Si-ra-kri
Contributor Author

I’ve fixed the indentation issue that caused the CI failure.
The latest commit resolves the error. Please let me know if you’d like me to trigger CI again.

@Si-ra-kri
Contributor Author

Hi! Just a gentle follow-up - all CI checks are passing now.
Please let me know if any changes are needed from my side.

@jeongyoonlee
Collaborator

Code review

Found 1 issue:

  1. Indentation bug in get_synthetic_preds silently drops S, T, and X learner predictions — only R Learner results are ever computed.

models = [(LinearRegression, "LR")] is correctly inside the for base_learner, label_l loop (line 76), but if XGBRegressor is not None: (line 77) and for model, label_m in models: (line 80) are dedented one level — placing them outside the outer loop. After all four base-learner iterations complete, base_learner and label_l hold only the last values (BaseRRegressor / "R"), so only "R Learner (LR)" (and "R Learner (XGB)" if installed) are added to preds_dict. The S, T, and X learner predictions are silently skipped.

for base_learner, label_l in zip(
    [BaseSRegressor, BaseTRegressor, BaseXRegressor, BaseRRegressor],
    ["S", "T", "X", "R"],
):
    models = [(LinearRegression, "LR")]
if XGBRegressor is not None:
    models.append((XGBRegressor, "XGB"))
for model, label_m in models:
    learner = base_learner(model())
    model_name = "{} Learner ({})".format(label_l, label_m)
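The underlying behavior is standard Python scoping: a for-loop's variables outlive the loop and keep the values from the final iteration, so dedented code that was meant to run inside the loop runs once, after it, seeing only the last bindings. A minimal illustration (the names here are hypothetical):

```python
# Python for-loop variables persist after the loop ends, holding their
# final values. Code accidentally dedented out of the loop body runs
# once, seeing only the last iteration's bindings.
labels = []
for base, label in [("S", "s"), ("T", "t"), ("X", "x"), ("R", "r")]:
    pass  # loop body

# Outside the loop, `label` still exists and equals the last value:
labels.append(label)
print(labels)  # ['r'] -- only the final iteration is visible
```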

get_synthetic_preds_holdout does not have this bug — all three lines sit correctly inside the outer loop there:

for base_learner, label_l in zip(
    [BaseSRegressor, BaseTRegressor, BaseXRegressor, BaseRRegressor],
    ["S", "T", "X", "R"],
):
    models = [(LinearRegression, "LR")]
    if XGBRegressor is not None:
        models.append((XGBRegressor, "XGB"))
    for model, label_m in models:
        # RLearner will need to fit on the p_hat
        if label_l != "R":
            learner = base_learner(model())

The fix is to indent lines 77–81 of get_synthetic_preds by four additional spaces so they match the structure in get_synthetic_preds_holdout.
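To make the effect of the fix concrete, here is a self-contained sketch that substitutes plain strings for the regressor classes (hypothetical stand-ins, not the causalml code) and simulates xgboost being absent:

```python
# Self-contained sketch: with the corrected indentation, every base
# learner contributes an entry to preds_dict. Strings stand in for the
# real regressor classes.
XGBRegressor = None  # simulate xgboost not being installed

preds_dict = {}
for base_learner, label_l in zip(
    ["BaseS", "BaseT", "BaseX", "BaseR"], ["S", "T", "X", "R"]
):
    models = [("LinearRegression", "LR")]
    if XGBRegressor is not None:
        models.append(("XGBRegressor", "XGB"))
    for model, label_m in models:
        model_name = "{} Learner ({})".format(label_l, label_m)
        preds_dict[model_name] = (base_learner, model)

print(sorted(preds_dict))
# ['R Learner (LR)', 'S Learner (LR)', 'T Learner (LR)', 'X Learner (LR)']
# With the buggy dedent, only 'R Learner (LR)' would appear here.
```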

🤖 Generated with Claude Code


@Si-ra-kri
Contributor Author

Thanks for catching that, indentation has been fixed and pushed.
Please let me know if anything else needs adjustment.

@jeongyoonlee
Collaborator


LGTM

@jeongyoonlee jeongyoonlee merged commit b4c76dd into uber:master Feb 20, 2026
7 checks passed