The same checklist used by epidemiologists to catch fatal design flaws before they reach peer review. Covers RCTs, cohort, case-control, and quasi-experimental designs.
Enter your email to unlock the full checklist instantly.
No spam. Just research methodology resources.
✓ Unlocked! Scroll down.
Before touching data or running a single test.
Population, Intervention/Exposure, Comparator, Outcome: each must be unambiguous.
ATE, ATT, CATE, or marginal effect? This determines your entire analysis strategy.
Are you making a causal claim? If yes, you need an identification strategy. If no, don't use causal language.
Even for observational studies: what RCT would you run if you could? This exposes design gaps.
Data availability, sample size, timeline, ethics approval, budget. Kill unfeasible designs early.
The architecture of your study.
RCT, cohort, case-control, cross-sectional, quasi-experimental. And why this one over the alternatives.
When does follow-up begin? Misalignment = immortal time bias.
Consistency assumption: does the same label mean the same intervention? "Statin use" vs "atorvastatin 40mg for ≥6 months" are different studies.
Active comparator vs. no treatment vs. standard of care: each answers a different question.
List confounders, mediators, colliders. If you haven't drawn a DAG, you don't know what to adjust for.
Every covariate stratum has both treated and untreated. Violations break propensity-based methods.
Name the confounders you can't measure. Plan E-value, bias analysis, or bounding approach.
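The E-value itself is one line of arithmetic. A sketch using VanderWeele and Ding's formula for a risk ratio:

```python
import math

def e_value(rr):
    """E-value for a point estimate on the risk-ratio scale: the minimum
    strength of association an unmeasured confounder would need with both
    exposure and outcome to fully explain away the observed estimate."""
    if rr < 1:
        rr = 1 / rr  # protective estimates are flipped first
    return rr + math.sqrt(rr * (rr - 1))

print(round(e_value(2.0), 2))  # 3.41
```

An observed RR of 2.0 would require an unmeasured confounder associated with both exposure and outcome at RR ≈ 3.4 to reduce it to null.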
Garbage in, garbage out. Prevent it methodically.
ICD codes? Self-report? Lab values? What's the sensitivity/specificity of your outcome definition?
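If you have sensitivity and specificity estimates for the outcome definition, an observed prevalence can be corrected with the Rogan-Gladen estimator. A minimal sketch (the input numbers are made up for illustration):

```python
def rogan_gladen(p_obs, sensitivity, specificity):
    """Correct an observed prevalence for imperfect outcome measurement
    (Rogan-Gladen estimator), clamped to the [0, 1] range."""
    p = (p_obs + specificity - 1) / (sensitivity + specificity - 1)
    return min(1.0, max(0.0, p))

# 12% observed prevalence under an ICD-code definition with
# assumed sensitivity 0.90 and specificity 0.95:
print(round(rogan_gladen(0.12, 0.90, 0.95), 3))
```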
Detection bias: knowing the outcome shouldn't change how you measure the exposure (and vice versa).
MCAR, MAR, MNAR? This determines whether multiple imputation, IPW, or sensitivity analysis is appropriate.
Based on clinically meaningful effect size, not "what we can detect with our data."
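For two proportions, the normal-approximation sample-size formula is short enough to sanity-check by hand. A sketch assuming a two-sided test (the 10% vs 5% event rates below are illustrative):

```python
import math
from statistics import NormalDist

def n_per_arm(p1, p2, alpha=0.05, power=0.80):
    """Sample size per arm for comparing two independent proportions,
    normal approximation. p1 and p2 should come from the clinically
    meaningful effect size, not from what the data happen to show."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)
    z_beta = z.inv_cdf(power)
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil((z_alpha + z_beta) ** 2 * variance / (p1 - p2) ** 2)

print(n_per_arm(0.10, 0.05))  # per-arm n to detect 10% vs 5% at 80% power
```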
Long enough to observe outcome? Short enough to maintain retention? Time horizon matches the question.
Claims ≠ clinical reality. EHR ≠ complete medical history. Be explicit about what your data can and cannot capture.
Statistics serve the design, not the other way around.
Why this method? How does it handle the specific threats you identified? Don't default to logistic regression.
If patients can die before the event, Kaplan-Meier overestimates cumulative incidence. Use CIF or cause-specific hazards.
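The gap is easy to demonstrate on a toy cohort: the Aalen-Johansen estimator gives the cumulative incidence function, while the Kaplan-Meier complement (treating competing events as censoring) inflates it. A sketch in plain Python, with made-up event times and no censoring:

```python
def km_complement(times, causes, cause=1):
    """1 - Kaplan-Meier for one cause, treating competing events as
    censored. Overestimates cumulative incidence under competing risks."""
    s = 1.0
    for t in sorted(set(times)):
        n = sum(1 for ti in times if ti >= t)  # at risk just before t
        d = sum(1 for ti, ci in zip(times, causes) if ti == t and ci == cause)
        if n > 0:
            s *= 1 - d / n
    return 1 - s

def aalen_johansen_cif(times, causes, cause=1):
    """Nonparametric cumulative incidence function that accounts for
    competing risks (cause 0 = censored)."""
    s_all = 1.0  # all-cause survival just before the current time
    cif = 0.0
    for t in sorted(set(times)):
        n = sum(1 for ti in times if ti >= t)
        if n == 0:
            continue
        d_cause = sum(1 for ti, ci in zip(times, causes) if ti == t and ci == cause)
        d_all = sum(1 for ti, ci in zip(times, causes) if ti == t and ci != 0)
        cif += s_all * d_cause / n
        s_all *= 1 - d_all / n
    return cif

# Toy cohort alternating the event of interest (1) with a competing event (2):
times = [1, 2, 3, 4, 5, 6]
causes = [1, 2, 1, 2, 1, 2]
print(km_complement(times, causes))     # inflated (≈ 0.69)
print(aalen_johansen_cif(times, causes))  # true share with cause-1 events (0.50)
```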
At minimum: different confounder sets, alternate outcome definitions, subgroup analyses, E-value for unmeasured confounding.
Pre-specify primary outcome. Secondary outcomes are hypothesis-generating. Don't p-hack; it shows.
Subgroup analysis ≠ stratified adjustment. Are you looking for who benefits differently, or trying to remove bias?
CONSORT, STROBE, RECORD, PRISMA: choose before writing, not after. Reviewers check.
What could still go wrong?
Selection bias, information bias, confounding: specific to YOUR design, not a generic list.
Who does your finding apply to? Single-center academic hospital ≠ community practice.
What will Reviewer 2 attack? Prepare defenses now. If you can't defend the design, redesign it.
SchemaForge runs this entire analysis on your research aim in under 20 minutes. Power calculations, DAGs, and reviewer rebuttals included.
Try It Free →