Reproducible Research Experiment Prompts¶

Six prompts for researchers using FCC as an experimental platform. They emphasize reproducibility, FAIR data principles, and transparent methods reporting. Each prompt assumes the reader has a specific hypothesis and a dataset in hand.

Personas Used¶

Persona ID	Full Name	Category	Role in Prompts
RSN	Reproducibility Sentinel	open_science	Determinism, versioning, seeds
FDS	FAIR Data Steward	open_science	Findable, Accessible, Interoperable, Reusable
CSL	Citizen Science Liaison	open_science	Community engagement, lay summaries
ESC	Experiment Scientist	open_science	Hypothesis design, statistics

Prompt 1: Designing a Reproducible Simulation Study¶

Audience: Scientific Difficulty: advanced Personas: ESC, RSN

Context¶

A researcher wants to compare two prompt strategies for a diagnostic-assist persona.

Prompt¶

ESC drafts; RSN reviews.

Find: state the hypothesis formally (H0, H1, effect size of interest),
identify the dependent variable (CLEAR+ Efficacy by default), and
enumerate the confounders that must be controlled.

Create: write a preregistration document with (a) sample size
calculation, (b) random seed plan (master seed + derivations), (c)
analysis plan (primary test, secondary tests, corrections), (d)
stopping rule.

Critique: RSN to identify any experimenter degrees of freedom that
could inflate the false-positive rate, and propose guardrails
(blinded analysis, held-out test set, audit hooks in the event bus).

Expected Output¶

Preregistration document
Seed and guardrail plan

Prompt 2: FAIR Data Audit¶

Audience: Scientific Difficulty: intermediate Personas: FDS

Prompt¶

Audit the attached dataset against the 15 FAIR sub-principles. For
each sub-principle, return a score in {0, 1, 2} with a one-sentence
rationale and a concrete remediation step where the score is below 2.
Produce a summary heatmap description usable in a README.

Expected Output¶

Sub-principle scorecard
Remediation list
Heatmap description

Variations¶

Compare two datasets and highlight the larger FAIR gap

Prompt 3: Methods Section Grounded in R.I.S.C.E.A.R.¶

Audience: Scientific Difficulty: intermediate Personas: ESC, RSN

Prompt¶

Write a Methods section (600-800 words) for a journal submission
that describes a simulation experiment using three FCC personas.
Structure the section so every claim cites either (a) a specific
R.I.S.C.E.A.R. slot in the persona YAML, (b) a benchmark run ID,
or (c) a seeded code commit hash. Finish with an "Availability
Statement" paragraph.

Expected Output¶

Methods draft
Availability statement

Prompt 4: Replication Package Structure¶

Audience: Scientific Difficulty: advanced Personas: RSN, FDS

Prompt¶

Design the directory layout of a replication package that someone
can clone and run end-to-end in < 30 minutes on a laptop with the
mock simulation engine. Include:
- `data/` (inputs, with sha256 manifest)
- `code/` (FCC version pin, environment lockfile)
- `results/` (committed outputs, trace hashes)
- `docs/` (method note, lay summary)
- `Makefile` targets: `make env`, `make run`, `make verify`

Provide the Makefile content and the exact command a reviewer
should run to confirm byte-for-byte reproduction of `results/`.

Expected Output¶

Directory layout
Makefile
Verify command

Prompt 5: Community Engagement and Lay Summary¶

Audience: Scientific Difficulty: intermediate Personas: CSL, FDS

Prompt¶

Turn a technical preregistration into a 400-word lay summary suitable
for a citizen-science partner organization. Avoid jargon; use one
concrete analogy from daily life; invite two specific forms of
participation (data contribution and interpretation review). Include
a short FAQ covering "why this matters," "who funds it," and "how
findings will be shared."

Expected Output¶

Lay summary
FAQ

Prompt 6: Publishing to Open Repositories¶

Audience: Scientific Difficulty: intermediate Personas: FDS, RSN

Prompt¶

Draft a publication plan that places the replication package on
Zenodo with a DOI, the preprint on a suitable archive, and the
persona YAMLs on a protocol registry. Specify:
- License choices (code vs data vs prose)
- Metadata per target (Zenodo JSON, preprint cover letter)
- Embargo plan, if any
- A README badge list that links all artifacts

Critique the plan for any single point of failure in long-term
preservation.

Expected Output¶

Publication plan
Preservation critique

Reproducible Research Experiment Prompts¶

Personas Used¶

Prompt 1: Designing a Reproducible Simulation Study¶

Context¶

Prompt¶

Expected Output¶

Prompt 2: FAIR Data Audit¶

Prompt¶

Expected Output¶

Variations¶

Prompt 3: Methods Section Grounded in R.I.S.C.E.A.R.¶

Prompt¶

Expected Output¶

Prompt 4: Replication Package Structure¶

Prompt¶

Expected Output¶

Prompt 5: Community Engagement and Lay Summary¶

Prompt¶

Expected Output¶

Prompt 6: Publishing to Open Repositories¶

Prompt¶

Expected Output¶

See Also¶