
New Project Setup Checklist

Complete checklist for creating a new project workspace in the Docxology Template. Lessons are framed around folder patterns and the stable exemplar projects/template_code_project/.

For a copy-paste LLM scaffold anchored on that exemplar, see new-project-one-shot-prompt.md. Other active layouts under projects/ are listed in _generated/active_projects.md. For archived reference trees (not run by default), see projects_archive/.

Key Principle: A project is auto-discovered if projects/<name>/manuscript/config.yaml exists. No infrastructure changes needed.

3-Directory Lifecycle: Projects live in one of three directories: projects/ (active, rendered by ./run.sh), projects_in_progress/ (WIP, not auto-discovered), or projects_archive/ (completed/paused, not auto-discovered). Move projects freely between directories; only what is in projects/ is rendered.


Troubleshooting

Project Not Discovered

Symptom: Project doesn't appear in ./run.sh menu

Cause: Missing manuscript/config.yaml

Solution: Create projects/<name>/manuscript/config.yaml with required fields

Test Import Errors

Symptom: ModuleNotFoundError: No module named '<package>'

Cause: Missing or incorrect tests/conftest.py

Solution:

```python
import os
import sys
os.environ.setdefault("MPLBACKEND", "Agg")
ROOT = os.path.abspath(os.path.join(os.path.dirname(__file__), ".."))
SRC = os.path.join(ROOT, "src")
if SRC not in sys.path:
    sys.path.insert(0, SRC)
```

Stage 4 Fails Silently

Symptom: Pipeline summary shows `4 stages, <2s` — the Analysis stage fails immediately

Cause: Project-specific packages missing from root venv

Diagnosis & Fix:

```bash
# Check which packages are missing
.venv/bin/python -c "import scipy"  # Replace with your package

# Add the missing packages to the root pyproject.toml dependencies, then:
uv sync
```

Config Warning Spam

Symptom: WARNING: Unknown config key 'X' on every run

Cause: Non-standard keys in config.yaml

Solution: Nest project-specific keys under a single project_config: key (see Pitfall 8 for alternatives)
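A sketch of the nesting, assuming the loader ignores everything below a single unknown key (the inner key names are examples borrowed from Pitfall 8, not a fixed schema):

```yaml
paper:
  title: "Your Paper Title"

# One non-standard top-level key instead of six: one warning at most
project_config:
  search:
    max_results: 50
  subfield_keywords: ["active inference"]
```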


1. Directory Scaffold

```mermaid
flowchart TB
    P[/projects/&lt;name&gt;//]
    P --> M[/manuscript/]
    P --> SC[/scripts/]
    P --> SRC[/src/&lt;name&gt;/]
    P --> T[/tests/]
    P --> DATA[/data/<br/>Input datasets · optional/]
    P --> OUT[/output/<br/>Pipeline-generated artifacts/]
    P --> META[AGENTS.md · README.md · pyproject.toml]

    M --> M_F[config.yaml ⚠ required ·<br/>01_abstract.md · 02_introduction.md · ... ·<br/>AGENTS.md · README.md]
    SC --> SC_F[&lt;analysis_scripts&gt;.py ·<br/>AGENTS.md · README.md]
    SRC --> SRC_F[__init__.py · &lt;domain_modules&gt;.py ·<br/>AGENTS.md · README.md]
    T --> T_F[conftest.py ⚠ required · sys.path setup ·<br/>test_&lt;module&gt;.py · AGENTS.md · README.md]
    OUT --> OUT_F[/figures/ · /logs//]

    classDef d fill:#0f172a,stroke:#0f172a,color:#fff
    classDef pkg fill:#1e3a8a,stroke:#0f172a,color:#fff
    classDef f fill:#0f766e,stroke:#0f172a,color:#fff
    class P d
    class M,SC,SRC,T,DATA,OUT pkg
    class M_F,SC_F,SRC_F,T_F,OUT_F,META f
```

2. Critical Setup Files

tests/conftest.py — Required

Without this file, pytest cannot import your src/ modules.

```python
"""Pytest configuration for <project_name> tests."""

import os
import sys

# Force headless backend for matplotlib in tests
os.environ.setdefault("MPLBACKEND", "Agg")

# Add src/ to path so we can import project modules
ROOT = os.path.abspath(os.path.join(os.path.dirname(__file__), ".."))
SRC = os.path.join(ROOT, "src")
if SRC not in sys.path:
    sys.path.insert(0, SRC)
```

Lesson learned: Omitting conftest.py causes ModuleNotFoundError: No module named '<package>' when running tests via pytest or the pipeline's 01_run_tests.py. The pipeline's test runner adds src/ via PYTHONPATH, but pytest's collection phase happens before that can take effect for direct pytest invocations.

pyproject.toml — Minimum Viable

```toml
[project]
name = "<project-name>"
version = "1.0.0"
description = "Short description"
readme = "README.md"
requires-python = ">=3.12"
dependencies = [
    "matplotlib>=3.10.0",
    # Add your project-specific deps here
]

[tool.pytest.ini_options]
testpaths = ["tests"]
python_files = "test_*.py"

[tool.coverage.run]
source = ["src/<package_name>"]
```

manuscript/config.yaml — Minimum Viable

```yaml
paper:
  title: "Your Paper Title"
  version: "1.0"
  date: "2026-03-07"

authors:
  - name: "Author Name"
    corresponding: true

testing:
  max_test_failures: 0
  max_infra_test_failures: 3
  max_project_test_failures: 0
```

3. Common Pitfalls and Solutions

Pitfall 1: functools.partial objects lack __name__

Symptom: AttributeError: 'functools.partial' object has no attribute '__name__'

Where it hits: infrastructure/scientific/stability.py, infrastructure/scientific/benchmarking.py — when passing functools.partial (created by make_quadratic_problem) to check_numerical_stability or benchmark_function.

Fix pattern:

```python
# ❌ Breaks on functools.partial
function_name = func.__name__

# ✅ Works on any callable
function_name = getattr(func, "__name__",
    getattr(getattr(func, "func", None), "__name__", repr(func)))
```

Lesson: Always use getattr chains when accessing __name__ on callable parameters.
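A runnable sketch of the chain (the function names here are illustrative, not from the repo):

```python
import functools

def quadratic(x, a):
    return a * x * x

def safe_name(func):
    # Try __name__, then the wrapped callable's __name__, then repr()
    return getattr(func, "__name__",
        getattr(getattr(func, "func", None), "__name__", repr(func)))

prob = functools.partial(quadratic, a=2.0)
print(safe_name(quadratic))  # quadratic
print(safe_name(prob))       # quadratic, recovered via prob.func
```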

Pitfall 2: Undefined module-level constants

Symptom: NameError: name 'project_root' is not defined

Where it hits: Any script that uses project_root in functions but only defines it inside if __name__ == "__main__":.

Fix: Define module-level constants at the top of the file:

```python
from pathlib import Path

project_root = Path(__file__).resolve().parent.parent
```

Pitfall 3: Missing MPLBACKEND in tests

Symptom: Tests hang or crash when matplotlib tries to open a display window.

Fix: Set in conftest.py:

```python
os.environ.setdefault("MPLBACKEND", "Agg")
```

Pitfall 4: Broken imports in pipeline scripts

Symptom: Pipeline stage fails immediately with ImportError.

Example: 02_run_analysis.py imported format_error_with_suggestions from infrastructure.core.logging.logging_utils, but this symbol was never defined.

Prevention:

  • Always test pipeline stage scripts standalone: uv run python scripts/02_run_analysis.py --project <name>
  • Check that all imports resolve before committing.
  • Use __all__ in __init__.py to make the public API explicit.

Pitfall 5: Emoji glyphs missing from matplotlib fonts

Symptom: `UserWarning: Glyph XXXXX (\N{CLIPBOARD}) missing from font(s) DejaVu Sans`

Fix: Replace emoji characters with text labels in matplotlib figures. DejaVu Sans does not include emoji glyphs.
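One defensive sketch, assuming plain-ASCII labels are acceptable (the label text is made up):

```python
# Drop non-ASCII glyphs (emoji) from a figure label before handing it to matplotlib
label = "📋 Results"
safe = label.encode("ascii", "ignore").decode().strip() or "untitled"
print(safe)
```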

Pitfall 6: Project-specific packages absent from root venv → silent Stage 4 failure

Symptom: ❌ project_name: 4 stages, 7.7s — Stage 4 (Analysis) fails in under 1 second. No import error appears in the console because it is swallowed by subprocess capture.

Root cause: Your project's pyproject.toml lists packages (scipy, pandas, wordcloud, rdflib, scikit-learn, networkx, requests) but the project has no local .venv/. Analysis scripts therefore run under the root .venv, which lacks those packages.

Rule: If projects/<name>/.venv does not exist, every package in projects/<name>/pyproject.toml#dependencies must also be in the root pyproject.toml.

Fix:

```toml
# Root pyproject.toml — add all project-specific packages here
[project]
dependencies = [
  "numpy>=1.22",
  "pyyaml>=6.0",
  "matplotlib>=3.7",
  # project-specific requirements (no local .venv)
  "scipy>=1.10.0",
  "pandas>=2.0.0",
  "networkx>=3.0",
  "requests>=2.31.0",
  "rdflib>=7.0.0",
  "wordcloud>=1.9.0",
  "scikit-learn>=1.3.0",
]
```

```bash
uv sync   # installs newly declared packages
```

Diagnosis:

```bash
# Check whether root venv has the packages:
.venv/bin/python -c "import scipy, pandas, wordcloud, rdflib, sklearn, networkx" 2>&1

# Run the analysis script directly to see the actual error:
.venv/bin/python projects/<name>/scripts/01_*.py
```

Pitfall 7: matplotlib in optional dependency group, not core

Symptom: ModuleNotFoundError: No module named 'matplotlib' in analysis scripts, even though it's in pyproject.toml.

Root cause: matplotlib was listed under [project.optional-dependencies] dashboard instead of [project.dependencies]. uv sync (default) does not install optional groups.

Fix: Move matplotlib to core:

```toml
[project]
dependencies = [
  "matplotlib>=3.7",  # ← core, not [project.optional-dependencies]
]
```

Pitfall 8: Unknown keys in config.yaml fire warnings on every run

Symptom: Pipeline prints 6+ WARNING: Unknown config key 'X' in .../config.yaml lines on every test and setup stage.

Root cause: The infrastructure's config loader validates keys against a known schema. Project-specific keys (e.g., search, knowledge_graph, pipeline_stages, llm_extraction, hypothesis_definitions, subfield_keywords) that are not in the shared schema trigger warnings.

Fix options:

  1. Remove non-standard top-level keys from config.yaml (preferred for warnings hygiene).
  2. Store project-specific config in a separate file (e.g., project_config.yaml) read directly by project scripts.
  3. Register the key in the infrastructure config schema if it truly belongs there.

Example: `subfield_keywords` warning with suggestion:

```text
WARNING: Unknown config key 'subfield_keywords' in .../config.yaml — did you mean 'keywords'?
```

This is non-fatal, but noisy across every pipeline stage.


4. Thin Orchestrator Rules for scripts/

| Rule | Description |
| --- | --- |
| No domain logic | Import ALL logic from `src/<package>/` |
| Configuration-driven | Read from `config.yaml` or env vars |
| Stateless | No persistent state between invocations |
| Logged | Use `get_logger(__name__)` for all output |
| PROJECT_DIR-aware | Read `os.environ.get("PROJECT_DIR", ...)` for path resolution |
| MPLBACKEND=Agg | Always set headless matplotlib |
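A skeleton honoring all six rules (a sketch: stdlib logging stands in for the repo's get_logger, the PROJECT_DIR fallback to the current directory is an assumption, and the src/ import appears as a comment because the package name is project-specific):

```python
"""scripts/02_run_analysis.py -- thin orchestrator sketch."""
import logging
import os

os.environ.setdefault("MPLBACKEND", "Agg")  # headless matplotlib, always

# PROJECT_DIR-aware path resolution; cwd fallback is this sketch's assumption
PROJECT_DIR = os.environ.get("PROJECT_DIR", os.getcwd())

logger = logging.getLogger(__name__)  # stand-in for get_logger(__name__)

def main() -> None:
    # No domain logic here; import it all from src/<package>/, e.g.:
    # from <package>.analysis import run_analysis
    logger.info("analysis starting in %s", PROJECT_DIR)
    print(f"PROJECT_DIR={PROJECT_DIR}")

if __name__ == "__main__":
    main()
```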

5. Documentation Duality Checklist

Every directory must have:

| File | Audience | Content |
| --- | --- | --- |
| README.md | Humans | Purpose, quick start, directory table |
| AGENTS.md | AI agents | API tables, dependency graphs, patterns |

Lesson: Missing AGENTS.md files cause AI agents to repeatedly re-discover project structure instead of reading cached knowledge.


6. Test Suite Requirements

| Requirement | Standard |
| --- | --- |
| Coverage threshold | ≥90% for project code |
| Zero-Mock policy | No `unittest.mock`, no `MagicMock`, no `@patch` |
| Markers | `@pytest.mark.requires_ollama` for service-dependent tests |
| Timeouts | 60s+ for integration tests |
| Path computation | `REPO_ROOT = Path(__file__).resolve().parent.parent.parent.parent` |
| Assertions | Use minimum-count checks (≥N) for forward compatibility |
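A sketch of the last two rows together (the figures list is fabricated for illustration):

```python
from pathlib import Path

# Path computation: tests sit four levels below the repo root in this template
REPO_ROOT = Path(__file__).resolve().parent.parent.parent.parent

# Minimum-count assertion: stays green when the pipeline later adds figures
figures = ["fig_a.png", "fig_b.png", "fig_c.png"]
assert len(figures) >= 3, f"expected at least 3 figures, got {len(figures)}"
print("ok")
```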

7. Pipeline Integration Verification

After creating your project, verify each pipeline stage:

```bash
# 1. Tests pass
uv run python scripts/01_run_tests.py --project <name>

# 2. Analysis scripts execute
uv run python scripts/02_run_analysis.py --project <name>

# 3. Full pipeline
./run.sh  # Select your project, then option 9

# 4. With steganography
./secure_run.sh --project <name>
```

Lesson: Always test each stage independently before running the full pipeline. A failure in Stage 4 (Analysis) will mask issues in later stages.