bio-logic
Evaluate scientific rigor, methods, biases, and evidence quality for claims, papers, and study designs.
SKILL.md
| Name | bio-logic |
| Description | Evaluate scientific rigor, methods, biases, and evidence quality for claims, papers, and study designs. |
name: bio-logic description: Evaluate scientific rigor, methods, biases, and evidence quality for claims, papers, and study designs.
Bio-Logic: Scientific Reasoning Evaluation
Use structured frameworks to evaluate scientific claims, methodology, and evidence strength.
Instructions
- Identify the task (claim assessment, paper critique, study design review, project interpretation, or hypothesis revision).
- For project work, maintain a hypothesis register with at least 5 distinct active hypotheses until the project is no longer exploratory.
- After each major intermediate result, reflect on what changed, revise hypothesis status, and identify the next discriminating check.
- When findings need context, pair this reasoning with a literature-search skill such as
/polars-dovmed, then revise hypotheses against the literature. - Apply the relevant checklist below.
- Structure output using the provided format.
Project Hypothesis Loop
Use this loop for omics projects, unexpected results, exploratory analyses, and any request that asks what results mean.
- Initialize: create at least 5 working hypotheses. Include biological mechanisms, technical artifacts, null explanations, sampling/batch effects, and annotation/database artifacts where relevant.
- Build an analysis playbook: search the literature for the inferred organism, virus group, data type, or closest lineage. Summarize typical analyses, comparison baselines, markers/features, plots, and outlier criteria used by scientists in that literature.
- Reflect: for every major intermediate result or QC gate, state the observation, QC status, strongest interpretation, remaining alternatives, and what evidence would separate them.
- Contextualize: compare findings against the playbook and run additional literature searches for central or unexpected findings. Use broad synonym-aware queries and cite DOI/PMCID when available.
- Revise: update each hypothesis as supported, weakened, ruled out, or unresolved. Keep ruled-out hypotheses visible with the evidence that changed their status.
- Replace: if fewer than 5 active hypotheses remain during exploratory work, add plausible replacements or explicitly state why no additional plausible alternatives exist.
Critique Checklist
Use relevant sections based on the review scope. Skip items not applicable to the study type.
## Methodology
- [ ] Design matches research question (causal claim → RCT needed)
- [ ] Sample size justified (power analysis reported)
- [ ] Randomization/blinding implemented where feasible
- [ ] Confounders identified and controlled
- [ ] Measurements validated and reliable
## Statistics
- [ ] Tests appropriate for data type
- [ ] Assumptions checked
- [ ] Multiple comparisons corrected
- [ ] Effect sizes + CIs reported (not just p-values)
- [ ] Missing data handled appropriately
## Interpretation
- [ ] Conclusions match evidence strength
- [ ] Limitations acknowledged
- [ ] Causal claims only from experimental designs
- [ ] No cherry-picking or overgeneralization
## Red Flags
- [ ] P-values clustered just below .05
- [ ] Outcomes differ from registration
- [ ] Correlation presented as causation
- [ ] Subgroups analyzed without preregistration
Claim Assessment
- Identify claim type (causal, associational, descriptive).
- Match evidence to claim type.
- Check logical connection between data and conclusion.
- Ensure confidence matches evidence strength.
Claim strength ladder:
| Language | Requires |
|---|---|
| "Proves" / "Demonstrates" | Strong experimental evidence |
| "Suggests" / "Indicates" | Observational with controlled confounds |
| "Associated with" | Observational, no causal claim |
| "May" / "Might" | Preliminary or hypothesis-generating |
Output Format
## Summary
[1-2 sentences: What was studied and main finding]
## Strengths
- [Specific methodological strengths]
## Concerns
### Critical (threaten main conclusions)
- [Issue + why it matters]
### Important (affect interpretation)
- [Issue + why it matters]
### Minor (worth noting)
- [Issue]
## Evidence Rating
[GRADE level: High/Moderate/Low/Very Low with justification]
## Bottom Line
[What can/cannot be concluded from this evidence]
Project Reasoning Output Format
## Current Result
[Observed intermediate/final result and QC status]
## Hypothesis Register
| Rank | Hypothesis | Status | Evidence For | Evidence Against | Next Discriminating Check |
|------|------------|--------|--------------|------------------|---------------------------|
| 1 | [Hypothesis] | supported/weakened/ruled out/unresolved | [Evidence] | [Evidence] | [Test] |
| 2 | [Hypothesis] | ... | ... | ... | ... |
| 3 | [Hypothesis] | ... | ... | ... | ... |
| 4 | [Hypothesis] | ... | ... | ... | ... |
| 5 | [Hypothesis] | ... | ... | ... | ... |
## Literature Context
- [Paper or search result, DOI/PMCID if available]: supports/contradicts/narrows [hypothesis] because [reason].
## Analysis Playbook
- [Literature-derived analysis/comparison/outlier check and why it applies]
## Revision
[What changed since the previous register and why]
## Next Step
[Most informative analysis, control, or experiment]
Quick Reference
| Task | Action |
|---|---|
| Review a paper | Use the Critique Checklist below |
| Evaluate a claim | Use Claim Assessment steps |
| Assess evidence strength | See references/evidence.md |
| Identify biases | See references/biases.md |
| Statistical errors | See references/stats.md |
| Logical fallacies | See references/fallacies.md |
| Study design review | See references/design.md |
Input Requirements
- Manuscript or study summary
- Claim statements to evaluate
- Methods and results (as available)
Output
- Structured critique with strengths, concerns, and evidence rating
- Clear recommendation on claim strength and limitations
Quality Gates
- Claim type matches evidence type
- Key methodological risks are identified
- Evidence rating is justified with concrete issues
- Exploratory project work includes at least 5 active hypotheses or explains why fewer are plausible
- Discovery work includes a literature-derived analysis playbook before selecting analyses
- Intermediate results update hypothesis status before the next workflow step
- Central or unexpected findings are checked against the literature before final interpretation
Examples
Example 1: Quick Paper Critique
Input: "Review this RCT on vitamin D and depression"
Output:
## Summary
RCT of 200 adults testing vitamin D supplementation for depression over 12 weeks.
## Strengths
- Randomized, double-blind, placebo-controlled
- Validated depression scale (PHQ-9)
- Adequate power analysis
## Concerns
### Critical
- 35% dropout, differential by group (attrition bias)
- ITT analysis not performed
### Important
- Single-site limits generalizability
## Evidence Rating
Moderate (downgraded from high due to attrition bias)
## Bottom Line
Suggestive but not conclusive due to differential attrition.
Example 2: Claim Assessment
Input: "This study proves that coffee prevents Alzheimer's"
Assessment: If based on observational data, this is a correlation→causation error. Appropriate language: "Coffee consumption is associated with lower Alzheimer's risk."
Troubleshooting
Issue: Insufficient methods details Solution: Request missing design/statistics information before rating evidence.
Issue: Conflicting results across studies Solution: Report uncertainty and suggest stronger study designs for resolution.
Related Skills
/ai-scientist-evaluator— rubric-scored evaluation of AI scientist outputs/manuscript-review-council— multi-reviewer critique of peer-reviewed manuscripts/proposal-review— structured critique of funding proposals