A major update to the OpenProblems framework, switching from a Python-based framework to a Viash + Nextflow-based framework. This update features the same concepts as the previous version, but with a new implementation that is more flexible, scalable, and maintainable.
Most relevant parts of the overall structure:
-
src/tasks: Benchmarking tasks:batch_integration: Batch integrationdenoising: Denoisingdimensionality_reduction: Dimensionality reductionmatch_modalities: Match modalitiespredict_modality: Predict modalityspatial_decomposition: Spatial decompositionspatially_variable_genes: Spatially variable genes
-
src/datasets: Components for creating common datasets. Loaders:cellxgene_census: Query cells from a CellxGene Censusopenproblems_neurips2021_bmmc: Fetch a dataset from the OpenProblems NeurIPS2021 competitionopenproblems_neurips2022_pbmc: Fetch a dataset from the OpenProblems NeurIPS2022 competitionopenproblems_v1: Fetch a legacy OpenProblems v1 datasetopenproblems_v1_multimodal: Fetch a legacy OpenProblems v1 multimodal datasettenx_vision: Fetch a and convert 10x Visium datasetzenodo_spatial: Fetch and process an Anndata file containing DBiT seq, MERFISH, seqFISH, Slide-seq v2, STARmap, and Stereo-seq data from Zenodo.zenodo_spatial_slidetags: Download a compressed file containing gene expression matrix and spatial locations from zenodo.
-
src/common: Common components used by all tasks.check_dataset_schema: Check whether an h5ad dataset adheres to a dataset schemacheck_yaml_schema: Check whether a YAML adheres to a JSON schemacomp_tests: Reusable component unit testscreate_component: Create a component Viash component.create_task_readme: Create a README for an OpenProblems task.extract_metadata: Extract the.unsmetadata from an h5ad file.helper_functions: Commonly used helper functions in Python or in R,process_task_results: Process the raw tasks results (containing raw logs, unprocessed component configs, and various metrics) into nicely formatted task results.schemas: JSON schemas for YAML files in the repositorysync_test_resources: Synchronise the test resources from s3 to resources_test
For more information related to the structure of this repository, see the documentation.