Quick Start

You have two conditions, replicate measurements, and a feature table. You need a decision on each feature: equivalent, different, or inconclusive. A non-significant difference test does not mean two groups are equivalent. QuEStVar adds an equivalence test so that case is explicit instead of implied.

This tutorial runs through the full workflow on realistic proteomics data. 10,000 features, 10 replicates per condition, missing values, a known mixture of equivalent, differential, and high-noise features.

In [1]:

Copied!





from contextlib import suppress
from pathlib import Path

import polars as pl

from questvar import QuestVar

_candidates = [Path("data/demo_realistic.tsv"), Path.cwd() / "data" / "demo_realistic.tsv"]
with suppress(NameError):
    _candidates.append(
        Path(__file__).resolve().parent.parent.parent / "data" / "demo_realistic.tsv"
    )
data_path = next((p for p in _candidates if p.exists()), _candidates[0])
df = pl.read_csv(data_path, separator="\t", null_values=["", "NA", "NaN"])
print(f"Features: {df.shape[0]:,}, columns: {df.shape[1]}")
print(f"Missing values: {df.null_count().sum_horizontal().item():,}")
from contextlib import suppress
from pathlib import Path

import polars as pl

from questvar import QuestVar

_candidates = [Path("data/demo_realistic.tsv"), Path.cwd() / "data" / "demo_realistic.tsv"]
with suppress(NameError):
    _candidates.append(
        Path(__file__).resolve().parent.parent.parent / "data" / "demo_realistic.tsv"
    )
data_path = next((p for p in _candidates if p.exists()), _candidates[0])
df = pl.read_csv(data_path, separator="\t", null_values=["", "NA", "NaN"])
print(f"Features: {df.shape[0]:,}, columns: {df.shape[1]}")
print(f"Missing values: {df.null_count().sum_horizontal().item():,}")

Features: 10,000, columns: 21
Missing values: 25,234

Configure the analysis¶

Four thresholds control the decision boundary.

cv_thr sets the CV filter. Features with CV above this in either condition are excluded before testing. A value of 1.0 keeps everything. Tighten it to 0.3 for low-noise features only.

eq_thr defines the equivalence window in log2 fold change units. A feature with |log2FC| below this boundary and a significant TOST result is called equivalent. Start with 0.5.

df_thr defines the difference boundary. Features with |log2FC| above this and a significant t-test are called differential. Must be larger than eq_thr. The gap between them is the zone where neither test is decisive.

p_thr is the adjusted p-value cutoff for both tests. Default 0.05.

allow_missing controls whether features with missing values get CV-filtered or tested. True means CV is computed on available replicates per feature. False means any missing value produces a NaN CV and the feature is excluded. The demo data has missing values, so set this to True.

correction controls multiple testing correction. BH-FDR is the default. Other options: bonferroni, holm, hochberg, BY, qvalue, or none.

In [2]:

Copied!





qv = QuestVar(
    cv_thr=1.0,
    eq_thr=0.5,
    df_thr=1.0,
    p_thr=0.05,
    correction="fdr",
    allow_missing=True,
)
qv = QuestVar(
    cv_thr=1.0,
    eq_thr=0.5,
    df_thr=1.0,
    p_thr=0.05,
    correction="fdr",
    allow_missing=True,
)

Run the test¶

cond_1 and cond_2 are lists of column names. Each must have at least two replicates. Paired testing is available with is_paired=True.

In [3]:

Copied!

cond_1 = [f"c1_{i}" for i in range(10)]
cond_2 = [f"c2_{i}" for i in range(10)]
results = qv.test(df, cond_1=cond_1, cond_2=cond_2)
cond_1 = [f"c1_{i}" for i in range(10)]
cond_2 = [f"c2_{i}" for i in range(10)]
results = qv.test(df, cond_1=cond_1, cond_2=cond_2)

Read the summary¶

The summary tells you what happened in one block. Out of 10,000 input features, some were excluded by the CV filter, some were tested, and each tested feature got a status.

In [4]:

Copied!

print(results.summary())
print(results.summary())

QuEStVar  ['c1_0', 'c1_1', 'c1_2', 'c1_3', 'c1_4', 'c1_5', 'c1_6', 'c1_7', 'c1_8', 'c1_9'] vs ['c2_0', 'c2_1', 'c2_2', 'c2_3', 'c2_4', 'c2_5', 'c2_6', 'c2_7', 'c2_8', 'c2_9']
  Input features:      10000
  Excluded by CV:      907
  Tested:              9093
  Equivalent  (+1):     1660  (18.3%)
  Differential (-1):    2004  (22.0%)
  Not significant (0):  5429  (59.7%)
  Thresholds:  eq=0.5  df=1.0  cv=1.0  p=0.05
  Correction:  fdr

Three statuses appear.

Equivalent (+1) means the effect falls inside the equivalence boundary and the TOST p-value is significant. These features are stable across conditions. Your summary tells you the exact count.

Differential (-1) means the effect exceeds the difference boundary and the t-test p-value is significant. These features change between conditions.

Not significant (0) means neither test was decisive. The effect falls between eq_thr and df_thr, or the p-value is above the cutoff. The bulk of the data usually lands here. This is normal.

Excluded features failed the CV filter. They had too much variance within one condition to produce a reliable test. The info sidecar tracks why each feature was excluded.

Visualize with the Antler plot¶

The Antler plot is the main diagnostic figure. The y-axis shows signed -log10 adjusted p-value. Equivalence results appear above zero. Difference results appear below zero. The x-axis is log2 fold change.

Blue dashed lines mark the equivalence boundary. Red dotted lines mark the difference boundary. Features in the upper band between the blue lines are equivalent. Features in the lower band outside the red lines are differential. Everything else is inconclusive.

In [5]:

Copied!

import matplotlib.pyplot as plt

fig = results.plot(cond_1_label="Control", cond_2_label="Treatment")
plt.close(fig)
fig
import matplotlib.pyplot as plt

fig = results.plot(cond_1_label="Control", cond_2_label="Treatment")
plt.close(fig)
fig

Out[5]:

No description has been provided for this image

Save and reload¶

Results save to parquet with two sidecar files: one for the CV filter info table, one for metadata (config, condition labels). Loading them back gives you the same object with the same methods.

In [6]:

Copied!

results.save("tmp/quick_start_results.parquet")
reloaded = type(results).load("tmp/quick_start_results.parquet")
print(f"Reloaded: {len(reloaded.data)} features, {len(reloaded.info)} total")
results.save("tmp/quick_start_results.parquet")
reloaded = type(results).load("tmp/quick_start_results.parquet")
print(f"Reloaded: {len(reloaded.data)} features, {len(reloaded.info)} total")

---------------------------------------------------------------------------
FileNotFoundError                         Traceback (most recent call last)
Cell In[6], line 1
----> 1 results.save("tmp/quick_start_results.parquet")
      2 reloaded = type(results).load("tmp/quick_start_results.parquet")
      3 print(f"Reloaded: {len(reloaded.data)} features, {len(reloaded.info)} total")

File ~/work/QuEStVar/QuEStVar/src/questvar/_api.py:531, in TestResults.save(self, path)
    529 stem = Path(path).with_suffix("")
    530 if suffix == ".parquet":
--> 531     self.data.write_parquet(path)
    532     self.info.write_parquet(f"{stem}.info.parquet")
    533 elif suffix == ".csv":

File ~/work/QuEStVar/QuEStVar/.venv/lib/python3.12/site-packages/polars/dataframe/frame.py:4351, in DataFrame.write_parquet(self, file, compression, compression_level, statistics, row_group_size, data_page_size, use_pyarrow, pyarrow_options, partition_by, partition_chunk_size_bytes, storage_options, credential_provider, retries, metadata, arrow_schema, mkdir)
   4347             )
   4348 
   4349         from polars.lazyframe.opt_flags import QueryOptFlags
   4350 
-> 4351         self.lazy().sink_parquet(
   4352             target,
   4353             compression=compression,
   4354             compression_level=compression_level,

File ~/work/QuEStVar/QuEStVar/.venv/lib/python3.12/site-packages/polars/lazyframe/frame.py:2982, in LazyFrame.sink_parquet(***failed resolving arguments***)
   2980     ldf_py = ldf_py.with_optimizations(optimizations._pyoptflags)
   2981     ldf = LazyFrame._from_pyldf(ldf_py)
-> 2982     ldf.collect(engine=engine)
   2983     return None
   2984 return LazyFrame._from_pyldf(ldf_py)

File ~/work/QuEStVar/QuEStVar/.venv/lib/python3.12/site-packages/polars/_utils/deprecation.py:97, in deprecate_streaming_parameter.<locals>.decorate.<locals>.wrapper(*args, **kwargs)
     93         kwargs["engine"] = "in-memory"
     95     del kwargs["streaming"]
---> 97 return function(*args, **kwargs)

File ~/work/QuEStVar/QuEStVar/.venv/lib/python3.12/site-packages/polars/lazyframe/opt_flags.py:343, in forward_old_opt_flags.<locals>.decorate.<locals>.wrapper(*args, **kwargs)
    340         optflags = cb(optflags, kwargs.pop(key))  # type: ignore[no-untyped-call,unused-ignore]
    342 kwargs["optimizations"] = optflags
--> 343 return function(*args, **kwargs)

File ~/work/QuEStVar/QuEStVar/.venv/lib/python3.12/site-packages/polars/lazyframe/frame.py:2510, in LazyFrame.collect(self, type_coercion, predicate_pushdown, projection_pushdown, simplify_expression, slice_pushdown, comm_subplan_elim, comm_subexpr_elim, cluster_with_columns, collapse_joins, no_optimization, engine, background, optimizations, **_kwargs)
   2508 # Only for testing purposes
   2509 callback = _kwargs.get("post_opt_callback", callback)
-> 2510 return wrap_df(ldf.collect(engine, callback))

FileNotFoundError: No such file or directory (os error 2): tmp/quick_start_results.parquet

From here you can adjust thresholds, try different correction methods, run a power analysis to pick an equivalence boundary, or move to a multi-comparison workflow with compare_all_pairs.