%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#f0eee6', 'primaryTextColor': '#141413', 'primaryBorderColor': '#d1cfc5', 'lineColor': '#d97757', 'secondaryColor': '#faf9f5', 'tertiaryColor': '#e6e3da', 'edgeLabelBackground': '#faf9f5', 'clusterBkg': '#f0eee6', 'clusterBorder': '#d1cfc5'}}}%%
flowchart LR
D(Discovery<br/>Literature + Data):::phase1 --> S(Strategy<br/>Identification Design):::phase2
S --> E(Execution<br/>Code + Paper):::phase3
E --> R(Review<br/>Simulated Peer Review):::phase4
R -->|score >= 95| SUB(Submission<br/>Package + Gate):::phase5
classDef phase1 fill:#f0eee6,stroke:#788c5d,color:#141413,stroke-width:2px
classDef phase2 fill:#f0eee6,stroke:#4d7c5a,color:#141413,stroke-width:2px
classDef phase3 fill:#f0eee6,stroke:#d97757,color:#141413,stroke-width:2px
classDef phase4 fill:#f0eee6,stroke:#b85c3e,color:#141413,stroke-width:2px
classDef phase5 fill:#f0eee6,stroke:#87867f,color:#141413,stroke-width:2px
User Guide
Common workflows and what to expect
A Typical Workflow
Every interaction follows the same loop: you describe, agents execute, critics score, you approve. Claude plans the work, dispatches specialized agents, and presents results for your sign-off. For complex tasks, it asks 3–5 clarifying questions before committing to a plan.
You can enter at any stage. Each phase produces structured output saved to quality_reports/.
Your First Review
Simulated peer review catches fatal problems before real referees do. One command dispatches two independent reviewers calibrated to your target journal.
What you type: /review --peer JHR paper/main.tex
What happens behind the scenes: The editor does a desk review — checks scope, assigns each referee a disposition (Credibility, Policy, Skeptic, etc.) and pet peeves that shape their review personality. Two blind referees then review independently: a domain-referee (subject expertise) and a methods-referee (empirical methods). Neither sees the other’s report.
What you see: An editorial decision that classifies every concern as FATAL, ADDRESSABLE, or TASTE:
# Editorial Decision --- Journal of Human Resources
**Decision:** Major Revisions
## Referee 1 (domain-referee) --- Disposition: Policy
- ADDRESSABLE: Missing discussion of policy implementation costs (Section 6)
- ADDRESSABLE: Literature review omits Garcia & Santos (2024) on same population
- TASTE: Prefer confidence intervals over p-values
## Referee 2 (methods-referee) --- Disposition: Credibility
- FATAL: Pre-trends test shows significant coefficient at t-2 (Figure 3)
- ADDRESSABLE: Clustering at state level with 12 states --- use wild bootstrap
- ADDRESSABLE: No sensitivity analysis (Rambachan & Roth 2023)
## Action Items
- MUST: Address pre-trends concern (fatal if unresolved)
- MUST: Add wild bootstrap standard errors
- SHOULD: Add sensitivity bounds
- SHOULD: Cite Garcia & Santos (2024)What to do next: Work through MUST items first. Use /revise to route each comment to the right agent — new analysis goes to the coder, clarifications go to the writer, disagreements get flagged for your review. For more on the peer review architecture (dispositions, pet peeves, journal calibration), see Agents.
Your First Draft
What you type: /write intro
What happens behind the scenes: Claude reads your paper type (reduced-form, structural, descriptive, or theory+empirics), your strategy memo, and any existing sections. The writer drafts using paragraph-level argument moves — each paragraph has one job:
Paragraph 1 [MOTIVATION]: Big question --- why this matters for policy
Paragraph 2 [GAP]: What the literature hasn't answered
Paragraph 3 [THIS PAPER]: "We estimate..." --- result preview with magnitude
Paragraph 4 [IDENTIFICATION]: Source of variation, one-sentence design
Paragraph 5 [PREVIEW]: Key finding with effect size
Paragraph 6 [ROADMAP]: Section-by-section guide
A cleanup pass then strips 24 known AI writing patterns (hedging, filler phrases, over-signposting). The argument structure adapts to your paper type — reduced-form leads with identification and result, structural leads with the model and counterfactual, descriptive leads with the data innovation and key fact.
What you see: A LaTeX section file saved to paper/sections/intro.tex, plus a claim-source map linking every numerical claim to a specific script and output file.
What to do next: Review for accuracy and voice. If you have prior papers, run /write style-guide [dir] first to calibrate the writer to your voice. For all available sections and flags, see the Command Reference.
Running the Full Pipeline
What you type: /new-project [topic]
What happens behind the scenes: The pipeline runs through all five phases, pausing for your approval at each transition:
- Discovery — librarian and explorer run in parallel, searching for literature and datasets. Their critics review coverage and feasibility. You see an annotated bibliography and ranked dataset list.
- Strategy — the strategist classifies your paper type and designs the identification strategy. The strategist-critic runs a 4-phase adversarial review. You see a strategy memo with assumptions, pseudo-code, and a robustness plan.
- Execution — the coder translates the strategy memo into numbered scripts; the writer drafts the manuscript from actual results. Both are reviewed by their paired critics. You see scripts, tables, figures, and LaTeX sections.
- Review — the editor dispatches two blind referees. You see an editorial decision with classified concerns and action items.
- Submission — the verifier runs a 10-check audit on the replication package. The final gate enforces score >= 95 with all components >= 80.
What you see: At each gate, a summary with scores and the next decision point. The pipeline never advances without your approval.
What to do next: Read the strategy memo carefully — it becomes the contract the coder implements. Review referee reports as you would real ones. For details on any phase, see the Command Reference.
Revising After Referee Reports
What you type: /revise [referee-report.pdf]
What happens behind the scenes: Each referee comment is classified into one of five categories and routed to the appropriate agent:
| Classification | Routed to | Example |
|---|---|---|
| NEW ANALYSIS | Coder (flagged for your approval first) | “Control for X” or “Show results by subgroup” |
| CLARIFICATION | Writer | “Explain your exclusion restriction more clearly” |
| REWRITE | Writer | “Restructure Section 4” |
| DISAGREE | You (mandatory review) | “Your identification assumption is violated” |
| MINOR | Writer | Typos, formatting, citation additions |
What you see: A tracking document with counts and priorities, plus a draft point-by-point LaTeX response letter. DISAGREE items are always flagged — Claude never autonomously pushes back on referees.
What to do next: Review DISAGREE items and decide your response. Approve NEW ANALYSIS tasks before they execute. Run /review --peer --r2 to simulate the next referee round with the same dispositions.
Creating a Talk
A talk is not a paper read aloud — it is a narrative with a different arc. The storyteller agent builds format-aware slide decks that respect this distinction.
What you type: /talk create seminar
What happens behind the scenes: The storyteller reads your paper and builds a slide deck in the specified format (seminar: 25–35 slides, 30–45 min). It applies format-specific narrative arcs — reduced-form talks lead with the policy question, structural talks build the model step by step, descriptive talks open with the data innovation. The storyteller-critic reviews for overflow, content fidelity, and narrative flow.
What you see: A Quarto RevealJS .qmd file in paper/quarto/ with speaker notes on every slide, plus a critic review report. Talk scores are advisory — they do not block commits.
What to do next: Review the slides against your paper. Run /talk audit to catch text overflow and sizing issues. Run /talk compile to build the HTML presentation. For Beamer LaTeX output, add --beamer — that produces a .tex file in paper/talks/ instead. For all format options, see the Command Reference.
Session Guards
Two commands protect you from expensive mistakes. Both are session-scoped — they reset when the conversation ends.
/freeze paper/ locks a directory from edits for the rest of the session. Use it when you are reviewing code but jotting notes about the paper — the system will refuse to modify frozen paths, preventing accidental overwrites while your attention is elsewhere.
/careful activates confirmation prompts before every file write. Use it before working on a critical branch, during final pre-submission edits, or any time the cost of a wrong edit exceeds the cost of an extra confirmation step.
Neither command persists across sessions. Start a new conversation and all guards are cleared.
Quality Gates
Three thresholds govern what actions the system allows:
| Gate | Score required | What it means |
|---|---|---|
| Commit | >= 80 | Work can be committed to the repository |
| PR | >= 90 | Work can be opened as a pull request |
| Submission | >= 95 + all components >= 80 | Work can be submitted to a journal |
Scores are weighted aggregates of individual critic scores. Each critic starts at 100 and deducts for issues found. The system manages this automatically — you will rarely think about it unless a score drops below 80, which blocks commits until the flagged issues are fixed.
Recovering from a low score: Run /review --all to get a per-component breakdown. Fix critical issues first (they carry the largest deductions). Re-run the relevant review to update the score. For the full scoring formula and component weights, see Rules & Invariants.
Troubleshooting
LaTeX Won’t Compile
Symptom: latexmk or xelatex errors, missing packages.
- Check you have XeLaTeX and latexmk installed:
which xelatex latexmk paper/latexmkrcconfigures TEXINPUTS and BIBINPUTS automatically- Missing package? Install via TeX Live:
tlmgr install [package] - On Overleaf: set compiler to XeLaTeX via Menu > Compiler (Overleaf reads
latexmkrcautomatically)
Hooks Not Firing
Symptom: No file protection or context survival.
- Check hooks are configured in
.claude/settings.json - Ensure Python 3 is available:
which python3 - Check hook file permissions:
ls -la .claude/hooks/ - Hook blocking an edit you want to make? See Hooks & Automation for how to customize protected patterns.
Claude Ignores Rules
Symptom: Claude doesn’t follow conventions in .claude/rules/.
- Rules use
paths:frontmatter — check the path matches your files - Too many rules? Claude follows ~150 instructions reliably. Consolidate.
- Try: “Read
.claude/rules/[rule].mdand follow it for this task”
Context Lost After Compaction
Symptom: Claude forgets what you were working on.
- Point Claude to the plan: “Read
quality_reports/plans/[latest].md” - Check session log: “Read
quality_reports/session_logs/[latest].md” - The
post-compact-restore.pyhook should print recovery info automatically. See Hooks & Automation for details.
Quality Score Too Low
Symptom: Score stuck below 80, can’t commit.
- Run
/review --allto get detailed issue breakdown - Fix critical issues first (they cost the most points)
- Check per-component scores — one weak component can drag down the aggregate
Upgrading
Run /tools upgrade to upgrade your project to the latest clo-author version. It replaces .claude/ and templates/, preserving your paper, scripts, data, bibliography, and CLAUDE.md.
Your files are safe. The upgrade only touches .claude/ (the infrastructure) and templates/. Your paper, scripts, data, bibliography, and all research content are never modified or deleted.
Manual alternative: Download the latest release, delete your .claude/ directory, and copy the new one in. Your CLAUDE.md stays yours — the infrastructure reads whatever paths you have there.