Scenario Analysis¶

Methods for configuring scenario analysis and convergence failure tolerance.

set_scenario_configs()¶

MCPower.set_scenario_configs(configs_dict)

Set custom scenario configurations for robustness analysis.

Scenario analysis (enabled via scenarios=True in find_power or find_sample_size) runs the simulation under multiple assumption-violation profiles. The defaults define an "optimistic" baseline plus "realistic" and "doomer" scenarios; use this method to override the defaults or add custom scenarios.

Provided configs are merged with the defaults: existing scenario keys are updated, new keys are added.

Parameters: configs_dict (Dict) – Mapping of scenario names to configuration dicts. Each configuration may include keys such as "heterogeneity", "heteroskedasticity", "correlation_noise_sd", and "distribution_change_prob". See DEFAULT_SCENARIO_CONFIG in mcpower.core.scenarios for the full list of keys.
Returns: The model instance, for method chaining.
Return type: self
Raises: TypeError – If configs_dict is not a dictionary.
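
The merge behaviour described above can be illustrated with plain dictionaries. This is a minimal sketch, not the library's implementation: `merge_scenario_configs` and `DEFAULTS` are hypothetical names, the defaults shown are only a subset taken from the tables on this page, and the per-key update within an existing scenario is an assumption based on the wording "existing scenario keys are updated".

```python
import copy

# Illustrative subset of the defaults, copied from the tables on this page.
# The real defaults live in DEFAULT_SCENARIO_CONFIG and contain more keys.
DEFAULTS = {
    "realistic": {"heterogeneity": 0.2, "heteroskedasticity": 0.15},
    "doomer": {"heterogeneity": 0.4, "heteroskedasticity": 0.35},
}


def merge_scenario_configs(defaults, overrides):
    """Hypothetical helper: merge user overrides into the default scenarios."""
    if not isinstance(overrides, dict):
        raise TypeError("configs_dict must be a dictionary")
    merged = copy.deepcopy(defaults)  # leave the defaults untouched
    for name, config in overrides.items():
        # Existing scenario: only the supplied keys are updated.
        # New scenario: an entry is created holding the supplied keys.
        merged.setdefault(name, {}).update(config)
    return merged


merged = merge_scenario_configs(
    DEFAULTS,
    {"realistic": {"heterogeneity": 0.3}, "extreme": {"heterogeneity": 0.6}},
)
print(merged["realistic"])  # heterogeneity overridden, heteroskedasticity kept
print(sorted(merged))       # the two defaults plus the new "extreme" scenario
```

The sketch copies the defaults before merging so that repeated calls do not accumulate changes; whether the library does the same is not stated here.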

Default Scenarios¶

Three scenarios are defined by default:

Optimistic – The user’s exact settings with no perturbations (all values zero/normal).
Realistic – Moderate assumption violations.
Doomer – Severe assumption violations.

Configuration Keys – General (All Model Types)¶

Key	Type	Default (Realistic)	Default (Doomer)	Description
`heterogeneity`	`float`	`0.2`	`0.4`	Effect size heterogeneity. SD of per-simulation effect multiplier.
`heteroskedasticity`	`float`	`0.15`	`0.35`	Correlation between predicted values and error SD.
`correlation_noise_sd`	`float`	`0.15`	`0.30`	SD of noise added to predictor correlations.
`distribution_change_prob`	`float`	`0.5`	`0.8`	Probability that a variable’s distribution is swapped.
`new_distributions`	`list`	`["right_skewed", "left_skewed", "uniform"]`	same	Replacement distributions for swapped variables.
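
To make the `heterogeneity` key concrete, the sketch below draws one effect multiplier per simulation. It is an illustration under stated assumptions rather than the library's code: `draw_effects` is a hypothetical name, and the multiplier is assumed to be normally distributed with mean 1 and SD equal to `heterogeneity`.

```python
import random
import statistics


def draw_effects(base_effect, heterogeneity, n_simulations, seed=0):
    """Hypothetical sketch: one perturbed effect size per simulation."""
    rng = random.Random(seed)
    # Assumption: multiplier ~ Normal(mean=1, sd=heterogeneity).
    return [
        base_effect * rng.gauss(1.0, heterogeneity)
        for _ in range(n_simulations)
    ]


# Realistic default (0.2) applied to a nominal effect of 0.5.
effects = draw_effects(base_effect=0.5, heterogeneity=0.2, n_simulations=10_000)
print(round(statistics.mean(effects), 2))   # close to the nominal 0.5
print(round(statistics.stdev(effects), 2))  # close to 0.5 * 0.2 = 0.1
```

Under this assumption the average effect stays at its nominal value, but individual simulations see effects that are noticeably weaker or stronger than specified.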

Configuration Keys – Residual Perturbation (All Model Types)¶

Key	Type	Default (Realistic)	Default (Doomer)	Description
`residual_dists`	`list[str]`	`["heavy_tailed", "skewed"]`	same	Pool of non-normal distributions. One is picked at random for each simulation.
`residual_change_prob`	`float`	`0.5`	`0.8`	Per-simulation probability that residuals use a non-normal distribution.
`residual_df`	`int`	`8`	`5`	Degrees of freedom for heavy-tailed (t) or skewed (chi-squared) residuals.
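
The interplay between `residual_change_prob` and `residual_dists` can be sketched as a two-step draw per simulation. `pick_residual_dist` is a hypothetical helper, and the uniform choice among the pool is an assumption; the table only says one distribution is picked at random.

```python
import random
from collections import Counter


def pick_residual_dist(rng, residual_dists, residual_change_prob):
    """Hypothetical sketch: choose the residual distribution for one simulation."""
    # Step 1: decide whether this simulation departs from normal residuals.
    if rng.random() < residual_change_prob:
        # Step 2: pick one replacement from the pool (assumed uniform).
        return rng.choice(residual_dists)
    return "normal"


rng = random.Random(42)
counts = Counter(
    pick_residual_dist(rng, ["heavy_tailed", "skewed"], 0.5)
    for _ in range(10_000)
)
print(counts)  # roughly half "normal", the rest split between the two alternatives
```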

Configuration Keys – Mixed Model (Cluster Models Only)¶

These keys are only consumed when cluster specifications are present. They are ignored for OLS analyses.

Key	Type	Default (Realistic)	Default (Doomer)	Description
`icc_noise_sd`	`float`	`0.15`	`0.30`	SD of multiplicative noise on the ICC.
`random_effect_dist`	`str`	`"heavy_tailed"`	`"heavy_tailed"`	Distribution of random effects (`"normal"`, `"heavy_tailed"`, or `"skewed"`).
`random_effect_df`	`int`	`10`	`3`	Degrees of freedom when `random_effect_dist` is not `"normal"`.
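
As a rough illustration of `icc_noise_sd`, the sketch below perturbs a nominal ICC with multiplicative noise. The formula `icc * (1 + noise)`, the normal noise distribution, and the clipping bounds are all assumptions made for this example; `perturb_icc` is a hypothetical name.

```python
import random


def perturb_icc(icc, icc_noise_sd, rng):
    """Hypothetical sketch: apply multiplicative noise to a nominal ICC."""
    # Assumption: noise ~ Normal(0, icc_noise_sd), applied multiplicatively.
    noisy = icc * (1.0 + rng.gauss(0.0, icc_noise_sd))
    # Keep the result a valid ICC; these bounds are an assumption.
    return min(max(noisy, 0.0), 0.99)


rng = random.Random(1)
realistic = [perturb_icc(0.2, 0.15, rng) for _ in range(5)]  # realistic default
doomer = [perturb_icc(0.2, 0.30, rng) for _ in range(5)]     # doomer default
print([round(v, 3) for v in realistic])
print([round(v, 3) for v in doomer])
```

With a nominal ICC of 0.2, the doomer setting spreads the per-simulation ICC about twice as widely as the realistic one.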

Examples¶

Override realistic scenario parameters:

from mcpower import MCPower

model = MCPower("outcome = treatment + motivation")
model.set_simulations(400)
model.set_variable_type("treatment=binary")
model.set_effects("treatment=0.5, motivation=0.3")

model.set_scenario_configs({
    "realistic": {
        "heterogeneity": 0.3,
        "heteroskedasticity": 0.15,
    },
})

model.find_sample_size(
    target_test="treatment",
    from_size=50, to_size=300, by=30,
    scenarios=True,
)

Override mixed model parameters:

model = MCPower("y ~ treatment + (1|school)")
model.set_simulations(400)
model.set_cluster("school", ICC=0.2, n_clusters=20)
model.set_effects("treatment=0.5")

model.set_scenario_configs({
    "realistic": {
        "icc_noise_sd": 0.10,
        "random_effect_dist": "normal",
    },
    "doomer": {
        "icc_noise_sd": 0.40,
        "random_effect_df": 2,
        "residual_df": 3,
    },
})

model.find_power(sample_size=1000, scenarios=True)

Add a custom scenario. Keys that are not specified inherit the default (optimistic) values:

model.set_scenario_configs({
    "extreme": {
        "heterogeneity": 0.6,
        "heteroskedasticity": 0.4,
        "correlation_noise_sd": 0.5,
        "distribution_change_prob": 0.8,
    },
})

model.find_power(sample_size=200, scenarios=True)
# Runs: optimistic, realistic, doomer, AND extreme

Run only specific scenarios:

model.find_power(sample_size=200, scenarios=["optimistic", "extreme"])
# Runs only: optimistic and extreme (skips realistic and doomer)

Unknown names raise a ValueError listing the available scenarios.

set_max_failed_simulations()¶

MCPower.set_max_failed_simulations(percentage)

Set the maximum acceptable proportion of failed simulations.

When a simulation iteration fails (e.g. due to convergence issues in mixed models), it is discarded. If the failure rate exceeds this threshold, the analysis raises an error rather than returning unreliable results.

Parameters: percentage (float) – Maximum failure rate as a proportion (0–1). Default is 0.03 (3%). For mixed models with small samples or high ICC, consider raising to 0.10.
Returns: The model instance, for method chaining.
Return type: self
Raises: ValueError – If percentage is not between 0 and 1.
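
The threshold check amounts to simple arithmetic: with 400 simulations and the default of 0.03, up to 12 failures are tolerated. The sketch below is a hypothetical illustration of that logic; `check_failure_rate`, the use of RuntimeError, and the exact message format are assumptions, not the library's implementation.

```python
def check_failure_rate(n_failed, n_simulations, max_failed=0.03):
    """Hypothetical sketch of the failure-rate check described above."""
    if not 0 <= max_failed <= 1:
        raise ValueError("percentage must be between 0 and 1")
    failure_rate = n_failed / n_simulations
    # Only a rate strictly above the threshold aborts the analysis.
    if failure_rate > max_failed:
        raise RuntimeError(
            f"Too many failed simulations: {failure_rate:.1%} > {max_failed:.1%}"
        )
    return failure_rate


print(check_failure_rate(12, 400))        # 0.03, exactly at the default limit
print(check_failure_rate(13, 400, 0.10))  # 0.0325, accepted once the limit is raised
```

With the default threshold, the thirteenth failure out of 400 simulations would abort the run, which is why models prone to convergence problems often need a higher limit.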

Common Thresholds¶

Threshold	Meaning	Use Case
`0.03`	3% (default)	Random intercept models with adequate sample sizes
`0.10`	10%	Random intercept models with smaller samples or higher ICC
`0.20`	20%	Random slope models
`0.30`	30%	Complex nested models or random slopes with small clusters

Examples¶

from mcpower import MCPower

# Random intercept model -- default threshold is usually fine
model = MCPower("satisfaction ~ treatment + (1|school)")
model.set_simulations(400)
model.set_cluster("school", ICC=0.2, n_clusters=20)
model.set_effects("treatment=0.5")
model.find_power(sample_size=1000)

# Random slopes -- relax the threshold
model = MCPower("y ~ x1 + (1 + x1|school)")
model.set_simulations(400)
model.set_cluster("school", ICC=0.2, n_clusters=20,
                   random_slopes=["x1"], slope_variance=0.1)
model.set_effects("x1=0.5")
model.set_max_failed_simulations(0.20)
model.find_power(sample_size=1000)

# Nested design -- more tolerance needed
model = MCPower("score ~ treatment + (1|school/classroom)")
model.set_simulations(400)
model.set_cluster("school", ICC=0.15, n_clusters=10)
model.set_cluster("classroom", ICC=0.10, n_per_parent=3)
model.set_effects("treatment=0.5")
model.set_max_failed_simulations(0.30)
model.find_power(sample_size=1500)

When Failures Are Expected¶

Convergence failures are more common when:

Small cluster sizes – Having fewer than 10 observations per cluster increases failure rates.
High ICC – Strong within-cluster correlation (ICC > 0.4) makes estimation harder.
Random slopes – More parameters to estimate per cluster.
Nested effects – Multiple variance components to estimate simultaneously.
Scenario analysis (doomer) – Perturbed distributions and ICC noise amplify convergence difficulty.
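
A quick way to check a design against the cluster-size guideline in the list above is to divide the sample size by the number of lowest-level clusters. `observations_per_cluster` is a hypothetical helper that assumes balanced clusters; it is not part of MCPower.

```python
def observations_per_cluster(sample_size, n_clusters, n_per_parent=1):
    """Hypothetical helper: average observations in each lowest-level cluster."""
    # For nested designs, each top-level cluster contains n_per_parent sub-clusters.
    lowest_level_clusters = n_clusters * n_per_parent
    return sample_size / lowest_level_clusters


# Random intercept example: 1000 observations across 20 schools.
print(observations_per_cluster(1000, 20))                  # 50.0
# Nested example: 10 schools x 3 classrooms each = 30 classrooms.
print(observations_per_cluster(1500, 10, n_per_parent=3))  # 50.0
# A design that falls under the 10-per-cluster guideline.
print(observations_per_cluster(150, 20) < 10)              # True
```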

Diagnosing High Failure Rates¶

If you encounter the error “Too many failed simulations”, consider:

Increasing the sample size – With a fixed number of clusters, this gives more observations per cluster, which improves convergence.
Increasing n_clusters – More clusters provide more information for estimating the between-cluster variance.
Reducing ICC – If plausible for your research context.
Relaxing the threshold – As a last resort, if you understand the trade-off.

For standard OLS (linear regression) models, convergence failures do not occur. This method has no practical effect on OLS analyses.
