███████╗████████╗ █████╗ ██████╗ ██╗ ██╗███████╗ ██████╗ ██████╗ ██████╗ ███████╗
██╔════╝╚══██╔══╝██╔══██╗██╔══██╗██║ ██║██╔════╝██╔════╝██╔═══██╗██╔══██╗██╔════╝
███████╗ ██║ ███████║██████╔╝███████║███████╗██║ ██║ ██║██████╔╝█████╗
╚════██║ ██║ ██╔══██║██╔═══╝ ██╔══██║╚════██║██║ ██║ ██║██╔═══╝ ██╔══╝
███████║ ██║ ██║ ██║██║ ██║ ██║███████║╚██████╗╚██████╔╝██║ ███████╗
╚══════╝ ╚═╝ ╚═╝ ╚═╝╚═╝ ╚═╝ ╚═╝╚══════╝ ╚═════╝ ╚═════╝ ╚═╝ ╚══════╝A species-optimized computational pipeline for rapid, accessible Staphylococcus aureus genotyping and surveillance
NB: Use -c bbeckley-hub anaconda channel do not use -c bioconda, fixing missing database issues-waiting PR merge on the updated recipe!
Two ways to use StaphScope:
🖥️ Command-line tool for high-throughput, local analysis
🌐 StaphScope Web for non-bioinformaticians – https://staphscope.dpdns.org
- 🎯 Overview
- ✨ Key Features
- 🌐 StaphScope Web Platform
- ⚡ Quick Start (CLI)
- 🔧 Installation (CLI)
- 🐳 Staphscope Docker Usage
- 🔗 Integrated External Tools & Dependencies
- 🚀 Usage Guide (CLI)
- 📁 Output Structure
- 🔍 Analytical Modules
- 📈 Performance Benchmarks
- 🔬 Validation & Accuracy
- 🆚 Tool Comparison
- 🤖 AI Integration Guide
- 🔮 Future Development
- ❓ Frequently Asked Questions
- 🐛 Troubleshooting
- 📚 Citation
- 🙏 Acknowledgements
- 👥 Authors & Contact
- 📄 License
- 📚 Third-Party Tool Citations
StaphScope is an automated, locally-executable computational pipeline designed specifically for comprehensive Staphylococcus aureus genomic surveillance. It addresses the critical bottleneck in MRSA (Methicillin-Resistant S. aureus) research by integrating six essential genotyping methods into a single, cohesive workflow.
- Fragmented Bioinformatics: Traditional MRSA analysis requires 5+ separate tools with conflicting dependencies
- Resource Barriers: Web-based services need constant internet and raise data privacy concerns
- Time Constraints: Generalist platforms take hours; outbreaks need answers in minutes
- Interpretation Challenges: Raw data without epidemiological context limits actionable insights
StaphScope delivers:
- ✅ Single-command installation via Conda
- ✅ 10-14 minute complete analysis (24 samples, 16 cores)
- ✅ 100% local execution with data privacy
- ✅ Intelligent resource management using Python's psutil library
- ✅ Interactive HTML reports with epidemiological context
- ✅ Automated MRSA/MSSA classification with confidence scoring
- ✅ Web-based interface for non-bioinformaticians
Perfect for: Clinical labs, outbreak investigations, research studies, and public health surveillance.
| Module | 🎯 Purpose | 📊 Key Outputs | ⚡ Speed |
|---|---|---|---|
| FASTA QC | Comprehensive quality control (N50/N70/N90, GC%, contig stats) | HTML, TSV, JSON reports with visual summaries | <30 sec |
| MLST Typing | Phylogenetic classification via 7 housekeeping genes | ST, CC, allele profiles, epidemiological context | <1 min |
| spa Typing | Hypervariable region analysis of protein A gene | spa type, repeat patterns, alignment metrics | <1 min |
| SCCmec Typing | Methicillin resistance cassette characterization | SCCmec type (I-XIII), mec/ccr complexes, confidence scores | 1-2 min |
| AMR Profiling | Comprehensive resistance gene detection (AMRFinderPlus) | 5,000+ AMR genes, risk categorization, cross-sample patterns | 2-3 min |
| ABRicate Screening | Multi-database virulence/plasmid detection (9 databases) | Plasmid replicons, virulence factors, clinical flags | 3-4 min |
| Visualization Suite | Publication-ready graphics using seaborn, plotly, matplotlib | 14+ graph types in PDF, PNG, SVG, interactive HTML | 1-2 min |
| Lineage Database | Global epidemiological context | 44 major lineages, geographical distribution, outbreak potential | Instant |
- Automated MRSA Classification: Based on concurrent mecA/mecC + SCCmec detection
- Clinical Gene Flagging: Automatic highlighting of PVL, enterotoxins, van genes
- Risk Assessment: Categorizes genes as 'Critical Risk' (e.g., mecA, vanA) or 'High Risk'
- Cross-Genome Pattern Discovery: Summarizes gene frequencies across entire sample sets
- Curated Lineage Database: 44 major lineages with HA-MRSA, CA-MRSA, LA-MRSA classifications
- 8-10× faster than Bactopia for S. aureus-specific analyses
- Linear scaling with sample numbers (R² = 0.931)
- Dynamic resource allocation using Python psutil
- Low memory footprint: Runs on 4GB RAM, scales to HPC clusters
See a complete interactive report generated by StaphScope:
The report includes AMR and virulence gene tables, filter buttons, combination tables, and FASTA QC metrics.
For researchers and clinicians who prefer a graphical interface, StaphScope Web provides all the power of the command-line tool in an easy-to-use web application.
- ✅ Drag-and-drop file upload (single, multiple, or ZIP archives)
- ✅ Module selection – choose which analyses to run
- ✅ Real-time progress monitoring with live logs
- ✅ Beautiful HTML reports with interactive visualizations
- ✅ Download all results as a single ZIP file
- ✅ Responsive design – works on desktop and tablet
- ✅ No installation required – works in any modern browser
- Backend: Flask (Python web framework)
- Task Queue: Celery with Redis broker
- Bioinformatics Engine: StaphScope CLI (via Conda)
- Frontend: Bootstrap 5, JavaScript
- Deployment: Gunicorn + Nginx
🌐 Try StaphScope Web today: https://staphscope.dpdns.org
📦 Web Repository: https://github.com/bbeckley-hub/staphscope-web
Note: The web version limits uploads to 10 files per job for fair resource usage. For larger datasets, please use the command-line tool.
Note: Currently hosted on personal infrastructure; availability may vary as we work toward sustainable 24/7 hosting.
# Method 1: Conda (Recommended)
conda create -n staphscope -c conda-forge -c bbeckley-hub staphscope -y
conda activate staphscope
# Method 2: Mamba (Faster installation)
mamba create -n staphscope -c conda-forge -c bioconda -c bbeckley-hub staphscope -y
mamba activate staphscope
# Method 3: From source
git clone https://github.com/bbeckley-hub/staphscope-typing-tool.git
cd staphscope-typing-tool
conda env create -f environment.yml
conda activate staphscope
pip install -e .# Single genome
staphscope -i genome.fasta -o results/
# Batch processing (24 genomes)
staphscope -i "*.fna" -o batch_results --threads 16
# Complete in ~14 minutes! 🎉usage: staphscope [-h] -i INPUT -o OUTPUT [-t THREADS] [--skip-amr]
[--skip-abricate] [--skip-mlst] [--skip-spa] [--skip-sccmec]
[--skip-lineage] [--skip-comprehensive]
StaphScope: Complete S. aureus Typing Pipeline
optional arguments:
-h, --help show this help message and exit
-i INPUT, --input INPUT
Input FASTA file(s) - can use glob patterns like
"*.fna" or "*.fasta"
-o OUTPUT, --output OUTPUT
Output directory for all results
-t THREADS, --threads THREADS
Number of threads (default: 2)
--skip-amr Skip AMR analysis (AMRfinderPlus)
--skip-abricate Skip ABRicate analysis
--skip-mlst Skip MLST analysis
--skip-spa Skip spa typing analysis
--skip-sccmec Skip SCCmec analysis
--skip-lineage Skip lineage reference generation
--skip-comprehensive Skip comprehensive report generation (MLST + spa +
SCCmec)
Examples:
staphscope -i genome.fna -o results/
staphscope -i "*.fna" -o batch_results --threads 8
staphscope -i "*.fasta" -o analysis --threads 16 --skip-lineage
staphscope -i "genome*.fa" -o results/ --threads 4 --skip-comprehensive
Supported FASTA formats: .fna, .fasta, .fa, .fn
Analysis Modules:
• MLST (Multi-Locus Sequence Typing)
• spa typing (Staphylococcal Protein A)
• SCCmec typing (Methicillin Resistance Cassette)
• AMR profiling (Antimicrobial Resistance)
• ABRicate (Comprehensive resistance/Plasmid/virulence)
• Lineage reference database
• Comprehensive report (MLST + spa + SCCmec summary)
Output: Comprehensive results for all analyses in organized directories
Please run abricate --setupdb for recent gene annotations!!!
⭐ Star us on GitHub if you find this tool useful!
Transforming fragmented genomic data into coherent biological narratives 🧬✨
| Resource | Minimum | Recommended | Production |
|---|---|---|---|
| CPU Cores | 2 | 8+ | 16+ |
| RAM | 4 GB | 8 GB | 16 GB |
| Storage | 2 GB | 10 GB | 50 GB+ |
| OS | Linux, macOS, WSL2 | Linux | Linux Cluster |
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash Miniconda3-latest-Linux-x86_64.sh
source ~/.bashrc# Add channels in correct order
conda config --add channels conda-forge
conda config --add channels bioconda
conda config --add channels bbeckley-hub
# Create and activate environment
conda create -n staphscope python=3.9 staphscope -y
conda activate staphscope
# Verify installation
staphscope --helpabricate --setupdb# Pull the latest image
docker pull bbeckleyhub/staphscope:latest
# Test installation
docker run --rm bbeckleyhub/staphscope:latest --help
# Analyze your data
docker run --rm \
-v $(pwd)/genomes:/data/input \
-v $(pwd)/results:/data/output \
bbeckleyhub/staphscope:latest \
-i "*.fasta" -o /data/output -t 4
# Outputs
sudo chown -R $USER:$USER ./output
On HPC clusters that support Singularity/Apptainer, you can run StaphScope without sudo and output files will be owned by your user automatically.
Important: StaphScope writes temporary files inside its own installation directory (e.g.,
/opt/staphscope/...). Singularity mounts containers as read‑only by default, so you must add the--writable-tmpfsflag to allow these writes. The flag creates an ephemeral, writable overlay in memory – no permanent changes are made to the container.
singularity pull staphscope.sif docker://bbeckleyhub/staphscope:latest
singularity run --writable-tmpfs -B $(pwd):/data staphscope.sif -i "/data/*.fasta" -o /data/outputIf you encounter TLS timeouts or other network errors (common on some HPCs), convert an existing Docker image to a Singularity SIF file on a machine with Docker, then transfer the .sif file to the HPC.
Step 1 – on a machine with Docker (e.g., your laptop):
docker pull bbeckleyhub/staphscope:latest
docker save bbeckleyhub/staphscope:latest -o staphscope.tar
singularity build staphscope.sif docker-archive://staphscope.tarNow copy staphscope.sif to your HPC home or project directory (e.g., using scp).
Step 2 – on the HPC (no sudo needed):
singularity run --writable-tmpfs -B $(pwd):/data staphscope.sif -i "/data/*.fasta" -o /data/output| Flag | Purpose |
|---|---|
--writable-tmpfs |
Creates a temporary writable overlay – required for StaphScope to write intermediate files to /opt/... |
-B $(pwd):/data |
Binds your current directory to /data inside the container (input files are read from here, output is written here) |
-i "/data/*.fasta" |
Input pattern – use quotes to prevent shell expansion on the host |
-o /data/output |
Output directory (will appear as ./output on your host) |
You can use any StaphScope flag, e.g.:
singularity run --writable-tmpfs -B $(pwd):/data staphscope.sif \
-i "/data/*.fasta" -o /data/output --threads 8 --skip-amrAfter a successful run, you will see output indicating each module completed. All result files in ./output will be owned by your HPC user – no sudo chown needed.
StaphScope integrates several powerful open-source tools and databases. These are not bundled directly in this repository. Instead, they are automatically installed as dependencies via Conda (as defined in environment.yml). The MIT license that applies to the StaphScope pipeline code does not cover these external tools. Each tool is used under the terms of its own license, and we gratefully acknowledge their authors.
| Tool/Database | Purpose | Source | License |
|---|---|---|---|
| MLST | Multi-locus sequence typing | tseemann/mlst | GPL v2 |
| ABRicate | Mass screening for resistance/virulence | tseemann/abricate | GPL v2 |
| AMRFinderPlus | Antimicrobial resistance gene detection | ncbi/amr | Public Domain |
| SCCmecFinder | SCCmec typing | genomicepidemiology/Sccmecfinder | Apache-2.0 |
| spa typing | spa gene typing | spa.ridom.de | Free for academic use |
| PubMedST | MLST allele database | pubmlst.org | Open access for research |
# Single genome
staphscope -i genome.fasta -o results/
# Batch processing with wildcards
staphscope -i "*.fna" -o results_2025 --threads 8
# Skip specific modules
staphscope -i sample.fna -o results --skip-spa --skip-lineage- Accepted:
.fna,.fasta,.fa,.fn - Required: Assembled genomes (contigs or complete)
- Batch patterns:
*.fasta,sample_*.fna, etc.
# Daily surveillance of 12 isolates
staphscope -i "daily_isolates/*.fasta" -o /mnt/shared/surveillance/$(date +%Y%m%d) --threads 12
# Complete in ~8 minutes# Urgent investigation (8 suspected cases)
staphscope -i "outbreak/*.fasta" -o /tmp/urgent_analysis --skip-lineage
# Results in ~4 minutesStaphScope generates a comprehensive, organized output directory:
Staphscope/
├── abricate_results/ # Multi-database screening (9 DBs)
├── amr_results/ # AMR gene profiling (AMRFinder+)
├── mlst_results/ # MLST typing
├── sccmec_results/ # SCCmec typing
├── spa_results/ # spa typing
├── lineage_results/ # Phylogenetic lineage
├── qc_results/ # FASTA quality control
├── visualization_results/ # Publication-ready plots
└── Staphscope_final_report/ # Consolidated reports (HTML/JSON/TSV)
Each module contains:
- Per-sample directories with raw outputs
- Summary files (TSV/JSON) for cross-sample analysis
- Interactive HTML reports for visualization
- Master reports combining all results
- Metrics: N50/N70/N90, L50/L70/L90, GC content, total length, contig count
- Outputs: HTML reports with histograms, TSV/JSON for downstream analysis
- Database: PubMedST S. aureus
- Method: BLAST-based allele calling
- Output: ST, CC, 7-gene profile, epidemiological context
- Database: Ridom spa repeat database
- Method: BLAST against repeat sequences
- Output: spa type, repeat pattern, alignment metrics
- Method: Hierarchical two-method system (gene-based + k-mer homology)
- Output: SCCmec type (I-XIII), confidence scores, mec/ccr complexes
- Subtyping: Types IV and V community-associated cassettes
- Tool: NCBI-AMRFinderPlus v4.2.4
- Coverage: 5,000+ AMR genes
- Risk Assessment: Critical Risk (mecA, vanA, cfr), High Risk (erm, tetM)
- Databases: VFDB, ResFinder, CARD, PlasmidFinder, MegaRes, NCBI, ARG-ANNOT, ECOH, EcoLi_VF
- Thresholds: ≥80% identity and coverage
- Clinical Flags: PVL, enterotoxins, van genes
- Libraries: seaborn, plotly, matplotlib
- Plot Types: Box plots, violin plots, bar charts, heatmaps, correlation matrices, pie charts, line graphs
- Formats: PNG, SVG, PDF, interactive HTML
- Content: 44 major S. aureus lineages (18 HA-MRSA, 19 CA-MRSA, 7 LA-MRSA)
- Metadata: Geographical distribution, clinical significance, outbreak potential
| System | Samples | Time | Speed vs Bactopia |
|---|---|---|---|
| Laptop (2 cores, 8GB) | 1 | 2m 33s | 5× faster |
| Laptop (2 cores, 8GB) | 24 | 28m 17s | 6× faster |
| Workstation (16 cores, 16GB) | 1 | 1m 31s | 8× faster |
| Workstation (16 cores, 16GB) | 24 | 14m 34s | 10× faster |
| Workstation (16 cores, 16GB) | 100 | ~60m | 12× faster |
- Memory Usage: 2-4 GB typical, scales linearly
- Storage: ~100 MB per sample
- CPU: Dynamic allocation via psutil
100% concordance with gold-standard reference genomes:
| Reference Strain | Expected Type | StaphScope Result |
|---|---|---|
| USA300 | ST8–t008–IV(2B) | ✅ ST8–t008–IV(2B) |
| N315 | ST5–t002–II(2A) | ✅ ST5–t002–II(2A) |
| MRSA252 | ST36–t018–II(2A) | ✅ ST36–t018–II(2A) |
| TW20 | ST239–t037–III(3A) | ✅ ST239–t037–III(3A) |
| NCTC8325 | ST8–t211–None | ✅ ST8–t211–Not Assigned |
- MRSA: 21 isolates (87.5%)
- MSSA: 3 isolates (12.5%)
- Dominant STs: ST5 (9), ST8 (5), ST22 (2)
- Critical Genes: mecA (21), mecC (1), fosB (20)
- PVL: 7 isolates (29.2%), all ST8/ST59
- Plasmids: 14/24 genomes (58.3%) with plasmid replicons
| Feature | StaphScope | Bactopia | Nullarbor | Mykrobe |
|---|---|---|---|---|
| Analysis Focus | 🎯 S. aureus-optimized | Multi-species | Multi-species | Multi-species |
| Input Format | Assembled genomes | Raw reads | Raw reads | Raw reads |
| Installation | Single Conda package | Complex (Nextflow+Docker) | Conda + DB downloads | Single Conda |
| Execution | Local CLI + Web GUI | Local/Cluster | Local | CLI + Web GUI |
| Parallelization | Auto-resource detection | Pipeline-level | Sample-level | Single-threaded |
| MRSA Features | Integrated classification + lineage DB | General typing | General typing | Resistance only |
| Critical Gene Flagging | ✅ mecA, PVL, van genes | ❌ | ❌ | ❌ |
| Resource Needs | Low-moderate (2+ GB) | High (HPC recommended) | High (Cluster) | Low-moderate |
| Web Interface | ✅ StaphScope Web | ❌ | ❌ | ✅ Mykrobe web |
StaphScope generates comprehensive HTML reports that are perfect for AI analysis. Here's how to use AI tools to get more from your data.
- Install any AI browser extension (ChatGPT, Claude, Gemini)
- Open your report:
staphscope_ultimate_report.html - Select text in any section (AMR Genes, MLST Analysis, etc.)
- Right-click → Ask AI with your question
For MLST Analysis:
- "What is the clinical significance of ST5 vs ST8?"
- "Which samples are MRSA and what ST are they?"
For AMR Genes:
- "Explain the mecA gene and its importance"
- "Which samples have multiple resistance genes?"
- "What treatment implications do these genes have?"
For Virulence Factors:
- "Which samples carry PVL toxin?"
- "Are there any high-risk virulence combinations?"
For Pattern Discovery:
- "Are there correlations between ST and specific genes?"
- "Identify any concerning patterns in this dataset"
- Provide context: "I'm analyzing S. aureus genomics data..."
- Be specific: Instead of "tell me about this", ask "what does SCCmec type IV indicate?"
- Ask for interpretations: "What are the clinical implications of these findings?"
- Request summaries: "Summarize the resistance profile of sample XYZ"
StaphScope reports are structured with clear tables and organized data that AI can easily understand. Each gene is shown with all genomes that contain it, making pattern analysis straightforward.
"AI provides powerful insights but always verify critical findings with domain experts."
# Planned machine learning module
staphscope --ml-predict --input results.json --model outbreak_risk
# Raw read support (in development)
staphscope --raw-reads sample_R1.fastq sample_R2.fastq --assembler shovill- Outbreak Prediction: Identify emerging patterns and transmission networks
- Phenotype Inference: Predict virulence, transmissibility from genotype
- Risk Scoring: Automated risk assessment for clinical isolates
- Anomaly Detection: Flag novel or unexpected genetic combinations
- Raw Read Support: Direct FASTQ analysis with integrated assembly (Snippy)
- Real-Time Updates: Live database synchronization
- Plugin System: Community-contributed analysis modules
- Database Contributions: User-submitted lineage updates
- Translation Support: Help translate the interface
Q: Is StaphScope free to use?
A: Yes! StaphScope is open-source under the MIT License. Free for academic, clinical, and commercial use.
Q: What makes StaphScope different from other tools?
A: StaphScope is S. aureus-optimized, integrates 6 analysis types in one workflow, runs 8-10× faster than generalist tools, and includes a curated global lineage database.
Q: Can I use StaphScope for clinical diagnosis?
A: StaphScope is a research tool. While highly accurate, results should be validated with orthogonal methods for clinical decision-making.
Q: Which version should I use – CLI or Web?
A: Use the Web version for convenience, small batches (≤10 files), and if you prefer a graphical interface. Use the CLI version for large batches (100+ genomes), integration into pipelines, or when working with sensitive data locally.
Q: Why only assembled genomes? When will raw read support be added?
A: We focused first on assembled genomes for speed and simplicity. Raw read support is our #1 priority for 2026 development.
Q: How often are databases updated?
A: We have planned sequential releases when database updates are needed. The lineage database is manually curated every 6 months. Users can run abricate --setupdb anytime.
Q: Can I run StaphScope on Windows?
A: Yes, via WSL2 (Windows Subsystem for Linux). Native Windows support is planned.
Q: How do I handle very large batches (1000+ genomes)?
A: Use the CLI with glob patterns and appropriate threading. StaphScope scales linearly.
Q: What does "Not Assigned" mean for SCCmec typing?
A: This indicates insufficient evidence for cassette classification—usually MSSA or novel SCCmec types.
Q: How is MRSA status determined?
A: MRSA = positive for both SCCmec element AND mecA or mecC gene. MSSA = lacks either criterion.
Q: Are virulence factors from other species filtered out?
A: Yes! The ABRicate module uses S. aureus-optimized thresholds and databases to minimize cross-species false positives.
# Issue: Database errors
# Solution:
abricate --setupdb
# Issue: Missing dependencies
# Solution:
conda remove staphscope
conda clean --all
conda install -c bbeckley-hub staphscope # Fresh install
# Issue: Web version not loading
# Solution: Check internet connection or try a different browser.
# The service may be temporarily down; check GitHub for updates.- Check existing issues: GitHub Issues
- Search closed issues: Many problems already solved
- Create new issue: Include:
- Full error message
staphscope --version- Conda environment list (
conda list) - Example command that failed
- Email support: brownbeckley94@gmail.com (response within 48 hours)
If you use StaphScope in your research, please cite:
Beckley, B., Amarh, V. (2026). StaphScope: a species‑optimized computational pipeline for rapid and accessible Staphylococcus aureus genotyping and surveillance. BMC Genomics, 27:123.
DOI: 10.1186/s12864-026-12609-x
@article{beckley2026staphscope,
title={StaphScope: a species‑optimized computational pipeline for rapid and accessible Staphylococcus aureus genotyping and surveillance},
author={Beckley, Brown and Amarh, Vincent},
journal={BMC Genomics},
volume={27},
pages={123},
year={2026},
doi={10.1186/s12864-026-12609-x}
}@software{staphscope2026,
author = {Brown Beckley},
title = {StaphScope: A species-optimized computational pipeline for Staphylococcus aureus genotyping},
year = {2026},
publisher = {GitHub},
url = {https://github.com/bbeckley-hub/staphscope-typing-tool}
}Please also cite the essential tools that make StaphScope possible (see BibTeX in the repository).
StaphScope stands on the shoulders of giants. We are deeply grateful to:
- Torsten Seemann for MLST, ABRicate, and countless foundational tools
- NCBI team for AMRFinderPlus
- CGE team for SCCmecFinder and database curation
- PubMedST, Ridom, CARD, VFDB for essential databases
- Python community for Biopython, pandas, plotly, seaborn, matplotlib
- Early adopters and beta testers for invaluable feedback
- Peer reviewers & Editorial Team @BMC GENOMICS for their constructive feedback, which significantly strengthened this tool and it manuscript.
"If we ever meet in person, the drinks are on me!" – Brown Beckley
Brown Beckley (Primary Developer)
- University of Ghana Medical School
- 📧 brownbeckley94@gmail.com
- 🐙 GitHub: bbeckley-hub
- LinkedIn: @brownbeckley
- 📞 +233 508820617
Amarh Vincent (Co-Author)
- University of Ghana Medical School
We welcome collaborations on:
- MRSA epidemiology studies
- Clinical validation projects
- Bioinformatics tool development
- Global surveillance initiatives
- Public health applications
The StaphScope pipeline code (the workflow engine, report generation, HTML templates, and Python modules written by the authors) is licensed under the MIT License – see the LICENSE file for details.
The web interface is also open-source and available under the MIT License in its separate repository.
StaphScope executes several external bioinformatics tools, which are installed as Conda dependencies. Each tool is the property of its respective developers and is used under its own license:
| Tool | License |
|---|---|
mlst (Torsten Seemann) |
GPL v2 |
ABRicate (Torsten Seemann) |
GPL v2 |
AMRFinderPlus (NCBI) |
Public Domain |
SCCmecFinder (CGE) |
Apache-2.0 |
spa typing (Ridom) |
Free for academic use |
By using StaphScope, you agree to comply with the licenses of these third-party tools.
StaphScope integrates several powerful open-source tools and databases. If you use StaphScope in your research, please also cite the following essential tools:
@software{seemann_mlst_2018,
author = {Seemann, T.},
title = {MLST: Scan contig files against traditional PubMLST typing schemes},
year = {2018},
publisher = {GitHub},
url = {https://github.com/tseemann/mlst}
}@software{seemann_abricate_2018,
author = {Seemann, T.},
title = {ABRicate: Mass screening of contigs for antimicrobial resistance and virulence genes},
year = {2018},
publisher = {GitHub},
url = {https://github.com/tseemann/abricate}
}@article{feldgarden_amrfinderplus_2019,
author = {Feldgarden, M. et al.},
title = {AMRFinderPlus and the Reference Gene Catalog facilitate examination of the genomic links among antimicrobial resistance, stress response, and virulence},
journal = {Scientific Reports},
volume = {11},
pages = {12728},
year = {2019},
doi = {10.1038/s41598-021-91456-0}
}@article{kaya_sccmecfinder_2018,
author = {Kaya, H. et al.},
title = {SCCmecFinder, a Web-Based Tool for Typing of Staphylococcal Cassette Chromosome mec in Staphylococcus aureus Using Whole-Genome Sequence Data},
journal = {mSphere},
volume = {3},
number = {1},
pages = {e00612-17},
year = {2018},
doi = {10.1128/mSphere.00612-17}
}@article{mellmann_spa_typing_2005,
author = {Mellmann, A. et al.},
title = {Evidenzbasierte Hygienemassnahmen mittels spa-Typisierung bei MRSA-Häufungen im Krankenhaus},
journal = {Deutsche Medizinische Wochenschrift},
volume = {130},
number = {22},
pages = {1364-1368},
year = {2005},
doi = {10.1055/s-2005-868351},
note = {Database: https://spa.ridom.de}
}@article{alcock_card_2023,
author = {Alcock, B. P. et al.},
title = {CARD 2023: expanded curation, support for machine learning, and resistome prediction at the Comprehensive Antibiotic Resistance Database},
journal = {Nucleic Acids Research},
volume = {51},
number = {D1},
pages = {D690-D699},
year = {2023},
doi = {10.1093/nar/gkac920}
}@article{bortolaia_resfinder_2020,
author = {Bortolaia, V. et al.},
title = {ResFinder 4.0 for predictions of phenotypes from genotypes},
journal = {Journal of Antimicrobial Chemotherapy},
volume = {75},
number = {12},
pages = {3491-3500},
year = {2020},
doi = {10.1093/jac/dkaa345}
}@article{gupta_argannot_2014,
author = {Gupta, S. K. et al.},
title = {ARG-ANNOT, a new bioinformatic tool to discover antibiotic resistance genes in bacterial genomes},
journal = {Antimicrobial Agents and Chemotherapy},
volume = {58},
number = {1},
pages = {212-220},
year = {2014},
doi = {10.1128/AAC.01310-13}
}@article{chen_vfdb_2016,
author = {Chen, L. et al.},
title = {VFDB 2016: hierarchical and refined dataset for big data analysis—10 years on},
journal = {Nucleic Acids Research},
volume = {44},
number = {D1},
pages = {D694-D697},
year = {2016},
doi = {10.1093/nar/gkv1239}
}@article{carattoli_plasmidfinder_2014,
author = {Carattoli, A. et al.},
title = {In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing},
journal = {Antimicrobial Agents and Chemotherapy},
volume = {58},
number = {7},
pages = {3895-3903},
year = {2014},
doi = {10.1128/AAC.02412-14}
}@article{joensen_ecoh_2015,
author = {Joensen, K. G. et al.},
title = {Rapid and easy in silico serotyping of Escherichia coli isolates by use of whole-genome sequencing data},
journal = {Journal of Clinical Microbiology},
volume = {53},
number = {8},
pages = {2410-2426},
year = {2015},
doi = {10.1128/JCM.00008-15}
}@article{bonin_megares_2023,
author = {Bonin, N. et al.},
title = {MEGARes and AMR++, v3.0: an updated comprehensive database of antimicrobial resistance determinants and an improved software pipeline for classification using high-throughput sequencing},
journal = {Nucleic Acids Research},
volume = {51},
number = {D1},
pages = {D744-D752},
year = {2023},
doi = {10.1093/nar/gkac1047}
}When citing StaphScope in your publications, please include the main StaphScope citation along with citations for the specific tools and databases you used:
"Genomic analysis was performed using StaphScope [Beckley & Amarh, 2026], which integrates MLST [Seemann, 2018], ABRicate [Seemann, 2018], AMRFinderPlus [Feldgarden et al., 2019], and SCCmecFinder [Kaya et al., 2018] for comprehensive S. aureus characterization. Antimicrobial resistance genes were identified using the CARD [Alcock et al., 2023] and ResFinder [Bortolaia et al., 2020] databases."
| Choose Your Platform | |
|---|---|
| 🖥️ Command Line | For high-throughput, local analysis |
| 🌐 StaphScope Web | For non-bioinformaticians – https://staphscope.dpdns.org |
From days to minutes. From fragmented to integrated. From data to insights.
StaphScope: Precision surveillance for the antibiotic resistance era.
⭐ If you find this tool useful, please star the repository! ⭐
Join the Fight Against Antimicrobial Resistance
Antimicrobial resistance (AMR) represents one of the most significant global health threats of our time. We invite researchers, clinicians, and public health professionals to collaborate with us in expanding and validating our database, sharing regional epidemiological data, and advancing AMR surveillance.
Together, we can enhance global AMR monitoring and develop more effective treatment strategies.