Skip to content

Latest commit

 

History

History
107 lines (72 loc) · 9.69 KB

File metadata and controls

107 lines (72 loc) · 9.69 KB
graph LR
    Database_Management["Database Management"]
    Variant_Data_Ingestion["Variant Data Ingestion"]
    Variant_Data_Retrieval["Variant Data Retrieval"]
    Variant_Data_Formatting["Variant Data Formatting"]
    External_Data_Handling["External Data Handling"]
    Testing_Framework["Testing Framework"]
    Variant_Data_Ingestion -- "uses" --> Database_Management
    Variant_Data_Ingestion -- "uses" --> Variant_Data_Formatting
    Variant_Data_Retrieval -- "executes queries via" --> Database_Management
    Variant_Data_Retrieval -- "formats query inputs with" --> Variant_Data_Formatting
    Database_Management -- "manages file downloads through" --> External_Data_Handling
    Testing_Framework -- "tests" --> Database_Management
    Testing_Framework -- "tests" --> Variant_Data_Ingestion
    Testing_Framework -- "tests" --> Variant_Data_Retrieval
    Testing_Framework -- "tests" --> Variant_Data_Formatting
    click Database_Management href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/gnomAD_DB/Database Management.md" "Details"
    click Variant_Data_Ingestion href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/gnomAD_DB/Variant Data Ingestion.md" "Details"
    click Variant_Data_Retrieval href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/gnomAD_DB/Variant Data Retrieval.md" "Details"
    click Variant_Data_Formatting href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/gnomAD_DB/Variant Data Formatting.md" "Details"
    click External_Data_Handling href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/gnomAD_DB/External Data Handling.md" "Details"
    click Testing_Framework href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/gnomAD_DB/Testing Framework.md" "Details"
Loading

CodeBoardingDemoContact

Component Details

This graph represents the architecture of the gnomAD_DB project, which is designed to manage and query gnomAD variant data. The main flow involves handling external gnomAD data files, ingesting them into an SQLite database, and providing various interfaces for retrieving variant information. A dedicated testing framework ensures the integrity and functionality of all core components.

Database Management

This component encapsulates the core SQLite database functionalities. It is responsible for establishing and managing database connections, creating the necessary table structure, and executing direct SQL queries. It serves as the low-level interface for all database interactions.

Related Classes/Methods:

Variant Data Ingestion

This component is responsible for processing and inserting variant data into the gnomAD database. It includes methods for sanitizing input dataframes to ensure they conform to the expected format before storage.

Related Classes/Methods:

Variant Data Retrieval

This component provides various interfaces for querying variant information from the gnomAD database. It supports querying by DataFrame, genomic interval, or a string representation of a variant, and can handle parallel processing for large queries.

Related Classes/Methods:

Variant Data Formatting

This component provides helper functions for parsing and packing variant data from various input formats (e.g., string, pandas Series) into a consistent structure suitable for database operations. It also handles gnomAD version parsing and validation.

Related Classes/Methods:

External Data Handling

This component manages the downloading and unzipping of gnomAD data files from external URLs. It includes utilities for displaying download progress and handling file extraction.

Related Classes/Methods:

Testing Framework

This component contains the test suite for the gnomAD_DB project. It sets up a test database instance, loads sample data, and performs various assertions to verify the correctness of data ingestion, retrieval, and formatting functionalities.

Related Classes/Methods: