Thin, modular wrapper over the external amalgkit CLI with a plan/execute workflow.
- Main Documentation Index - Complete documentation map
- API Reference - Complete function documentation
- Workflow Guide - Workflow planning and execution
- Step Documentation - All 11 step guides
- Function Index - Quick function lookup
flowchart LR
AworkflowConfig[Workflow Config] --> BplanWorkflow[plan_workflow]
B --> CorderedSteps[Ordered Steps]
C --> DexecuteWorkflow[execute_workflow]
D --> EamalgkitSubcommands[amalgkit subcommands]
E --> Flogs+Manifest[Logs + Manifest]
See: amalgkit/README.md, workflow.md, CONFIGURATION.md.
Advanced: step runners in metainformant.rna.engine.workflow_steps provide a stable call surface per subcommand.
See: Step Documentation
Notes
- Ensure
amalgkitis installed and on PATH; verify usingcheck_cli_available execute_workflowwrites per-step logs towork_dir/logsand a JSONL manifest towork_dir/amalgkit.manifest.jsonl- To parameterize steps by species/tissues and output layout, see Configuration Guide
These tools provide automation and precision beyond the standard amalgkit capabilities:
- Tissue Patching: Metadata correction system (
tissue_patches.yaml). - Ortholog Generation: Automated cross-species mapping.
- Streaming Orchestration: High-efficiency ENA download/quant pipeline.
- Troubleshooting & Hacks: Solutions for common bottlenecks.
Meta-analysis integrates RNA-seq data across studies to detect robust gene expression patterns with improved power. A typical flow:
- Data collection and QC: gather SRA/ENA records, assess read quality (e.g., FastQC), trim adapters (e.g., Trimmomatic/fastp)
- Alignment/quantification: map or quasi-map reads (e.g., HISAT2/STAR or Salmon/RSEM)
- Normalization and batch handling: account for depth/length and correct study-level effects (e.g., ComBat, limma removeBatchEffect)
- Differential expression: within-study DE (e.g., DESeq2, edgeR)
- Meta-analysis of DE: combine evidence across studies (e.g., metaRNASeq; p-value or effect-size aggregation)
- Functional interpretation: GO/pathway enrichment
The metainformant.rna wrappers organize these concerns into reproducible steps under a single working directory (default under output/).
- Discovery, planning, selection:
metadata→integrate→config→select - Acquisition:
getfastq(downloads raw FASTQ intowork_dir/fastqby convention) - Quantification:
quant(e.g., Salmon) producing per-sample abundances underwork_dir/quant - Aggregation:
mergeyielding a merged table such aswork_dir/merged_abundance.tsv - Normalization:
cstmm(count scaling) underwork_dir/cstmm - Curation and QC:
curate,csca(sample clustering/assessment),sanity(final checks) - Downstream DE/meta-analysis: export merged counts/TPMs to R/Python tools (e.g., DESeq2/edgeR/metaRNASeq) for study-level DE and cross-study meta-analysis
See workflow.md for planning/execution and Step Documentation for per-step details and expected artifacts.
- GETTING_STARTED.md - Installation and setup
- README.md - Complete documentation index
- workflow.md - Workflow planning and execution
- Step Documentation - Individual step documentation
- CONFIGURATION.md - Configuration management
- API.md - Complete API reference
- ORCHESTRATION.md - Orchestrator overview
- metaRNASeq: RNA-seq meta-analysis methodology (arXiv)
- metaTP: automated metatranscriptomic pipeline with Snakemake (BMC Bioinformatics)
- SMAGEXP: Galaxy suite unifying metaMA/metaRNASeq (arXiv)
- Flimma: federated limma-voom for privacy-preserving DE (arXiv)