Power Analysis

You want to design an experiment. You have a target equivalence boundary in mind, a rough idea of your measurement noise, and a minimum power you need to hit. The question is: how many replicates do you need?

The power analysis module answers that question by simulation. It generates synthetic data where every feature is truly equivalent, then tests how many the TOST pipeline correctly recovers as equivalent. By sweeping across equivalence boundaries, replicate counts, and CV levels, you get a map of statistical power before you run a single real sample.

In [1]:

Copied!

import numpy as np

from questvar import run_power_analysis
import numpy as np

from questvar import run_power_analysis

Configure the sweep¶

Five parameters define the design space.

eq_boundaries lists the equivalence thresholds to test. Wider boundaries make equivalence easier to declare. Narrow boundaries need more data. Typical values range from 0.1 to 1.0 log2 fold change.

n_reps_list lists the replicate counts per condition. Power increases with more replicates but with diminishing returns. Three replicates is a bare minimum across most omics. Ten is generous.

cv_mean_list lists the mean CV levels to simulate. CV is the ratio of standard deviation to mean intensity. Lower CV means cleaner data. Proteomics data often runs between 0.15 and 0.35. Transcriptomics tends lower.

n_prts controls how many features are simulated per Monte Carlo iteration. More features give more precise power estimates. 5,000 is a good balance of speed and accuracy.

n_iterations controls how many Monte Carlo iterations run per design point. More iterations tighten the confidence bands. 10 is fast but noisy. 100 is stable.

target_power sets the power level the design search aims for. 0.80 is standard across biomedical research.

random_seed makes the simulation deterministic. Same seed, same results. This is critical for reproducibility.

In [2]:

Copied!





results = run_power_analysis(
    eq_boundaries=np.array([0.1, 0.3, 0.5, 0.7, 1.0]),
    n_reps_list=[3, 5, 10],
    cv_mean_list=[0.15, 0.25],
    n_prts=5000,
    n_iterations=10,
    target_power=0.80,
    random_seed=42,
    n_jobs=1,
)
results = run_power_analysis(
    eq_boundaries=np.array([0.1, 0.3, 0.5, 0.7, 1.0]),
    n_reps_list=[3, 5, 10],
    cv_mean_list=[0.15, 0.25],
    n_prts=5000,
    n_iterations=10,
    target_power=0.80,
    random_seed=42,
    n_jobs=1,
)

Read the summary¶

The summary shows the design grid dimensions, convergence diagnostics, and the search results. It tells you whether any tested design meets your target power.

In [3]:

Copied!

print(results.summary())
print(results.summary())

Power Analysis Results
========================================
  Design points:      42
  Monte Carlo runs:   420
  Convergence:        31 converged, 11 not converged
  Runtime (s):        4.92
  Design ranges:
    cv_mean: 2 points  value=0.150..0.250  SEI=0.000..0.003  Power=0.203..0.250  Feasible=0/2
    cv_mean_n_reps: 6 points  value=0.150..0.250  SEI=0.000..0.127  Power=0.203..0.327  Feasible=0/6
    cv_thr: 1 points  value=1.000  SEI=0.003  Power=0.203  Feasible=0/1
    eq_thr: 5 points  value=0.100..1.000  SEI=0.003..0.867  Power=0.203..1.000  Feasible=2/5
    eq_thr_cv_mean: 10 points  value=0.100..1.000  SEI=0.000..0.867  Power=0.203..1.000  Feasible=3/10
    eq_thr_n_reps: 15 points  value=0.100..1.000  SEI=0.003..0.996  Power=0.203..1.000  Feasible=9/15
    n_reps: 3 points  value=3.000..10.000  SEI=0.003..0.127  Power=0.203..0.327  Feasible=0/3
  Recommended designs:
    n_reps: no feasible design  reason=no tested value met the requested target power
    eq_thr: no feasible design  reason=feasible solution found
    cv_mean: no feasible design  reason=no tested value met the requested target power
    cv_thr: no feasible design  reason=no tested value met the requested target power

Three sections matter.

"Design ranges" groups the results by parameter. Each group shows the range of values tested, the SEI (Stable Equivalence Index) range, the power range, and how many designs in that group are feasible (meet the target power).

"Recommended designs" shows the minimal design found for each axis. If no design meets the target, it tells you why. The most common reason: the tested values did not reach the target power, which means you need more replicates or a wider equivalence boundary.

"Convergence" tells you whether the SEI estimates are stable at the current iteration count. SEI coefficient of variation below 0.10 is a reasonable threshold.

Inspect the design table¶

The design_table() method pivots the results so you can read power across two axes at once. This is useful for finding the cheapest design that meets your target.

In [4]:

Copied!

pivot = results.design_table(row_axis="eq_thr", col_axis="n_reps", metric="power")
print(pivot)
pivot = results.design_table(row_axis="eq_thr", col_axis="n_reps", metric="power")
print(pivot)

shape: (5, 4)
┌────────┬──────────┬──────────┬──────────┐
│ eq_thr ┆ 3        ┆ 5        ┆ 10       │
│ ---    ┆ ---      ┆ ---      ┆ ---      │
│ f64    ┆ f64      ┆ f64      ┆ f64      │
╞════════╪══════════╪══════════╪══════════╡
│ 0.1    ┆ 0.203342 ┆ 0.242545 ┆ 0.327113 │
│ 0.3    ┆ 0.366876 ┆ 0.622805 ┆ 0.879217 │
│ 0.5    ┆ 0.670837 ┆ 0.934987 ┆ 1.0      │
│ 0.7    ┆ 0.892839 ┆ 1.0      ┆ 1.0      │
│ 1.0    ┆ 1.0      ┆ 1.0      ┆ 1.0      │
└────────┴──────────┴──────────┴──────────┘

Each cell shows power for one combination of equivalence boundary and replicate count. Values near 1.0 mean the design reliably recovers equivalent features. Values near 0.20 mean the design is barely better than random.

Read across a row to see how power changes with more replicates at a fixed boundary. Read down a column to see how power changes with a wider boundary at a fixed replicate count.

Find the optimal design¶

The optimal_design() method returns the cheapest design that meets the target power for a given axis. It searches the design grid for the minimal n_reps, minimal eq_thr, or maximal cv_mean that achieves the target.

In [5]:

Copied!





best = results.optimal_design("n_reps")
if best:
    print(f"Optimal n_reps: {best}")
else:
    print("No design reached target power in the tested range.")
best = results.optimal_design("n_reps")
if best:
    print(f"Optimal n_reps: {best}")
else:
    print("No design reached target power in the tested range.")

Optimal n_reps: {'search_for': 'n_reps', 'objective': 'smallest replicate count meeting target power', 'direction': 'min', 'target_power': 0.8, 'target_sei': 0.8, 'solution_value': None, 'solution_found': False, 'reason': 'no tested value met the requested target power', 'feasible_min': None, 'feasible_max': None, 'monotone_axis': True, 'monotonicity_direction': 'nondecreasing', 'nearest_infeasible_value': None, 'solution_power': None, 'fixed_parameters': {'n_reps': 3, 'eq_thr': 0.1, 'cv_mean': 0.15, 'cv_thr': 1.0}}

The result tells you the recommended value, the power achieved, and the fixed parameter settings used for that search axis. If no design is feasible, you need to widen the search range or lower the target.

Visualize the power profile¶

The power profile figure shows power on the y-axis against equivalence boundary on the x-axis. Each line is a different replicate count. The shaded band shows the 90 percent quantile range across Monte Carlo iterations. The dashed horizontal line marks the target power.

In [6]:

Copied!

import matplotlib.pyplot as plt

fig = results.plot(ci_method="quantile", ci=0.90)
plt.close(fig)
fig
import matplotlib.pyplot as plt

fig = results.plot(ci_method="quantile", ci=0.90)
plt.close(fig)
fig

Out[6]:

No description has been provided for this image

The figure helps you pick the right combination of eq_thr and n_reps for your experiment. If the line for n_reps=5 crosses the target power at eq_thr=0.5, then five replicates with a 0.5 equivalence boundary meet your design goal.

From here you can adjust the sweep parameters, tighten the CV assumptions, or run a larger simulation with more iterations for tighter confidence bands.