graph LR
ConfGF_Runner["ConfGF Runner"]
Distance_Score_Model["Distance Score Model"]
Molecular_Data_Processing["Molecular Data Processing"]
Conformation_Generation_Evaluation["Conformation Generation & Evaluation"]
General_Utilities["General Utilities"]
ConfGF_Runner -- "orchestrates" --> Distance_Score_Model
ConfGF_Runner -- "manages" --> Molecular_Data_Processing
ConfGF_Runner -- "utilizes" --> Conformation_Generation_Evaluation
ConfGF_Runner -- "uses" --> General_Utilities
Distance_Score_Model -- "processes data from" --> Molecular_Data_Processing
Distance_Score_Model -- "leverages" --> General_Utilities
Molecular_Data_Processing -- "prepares data for" --> Distance_Score_Model
Molecular_Data_Processing -- "uses" --> General_Utilities
Conformation_Generation_Evaluation -- "is used by" --> ConfGF_Runner
Conformation_Generation_Evaluation -- "relies on" --> General_Utilities
click ConfGF_Runner href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main//ConfGF/ConfGF Runner.md" "Details"
click Distance_Score_Model href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main//ConfGF/Distance Score Model.md" "Details"
click Molecular_Data_Processing href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main//ConfGF/Molecular Data Processing.md" "Details"
click Conformation_Generation_Evaluation href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main//ConfGF/Conformation Generation & Evaluation.md" "Details"
click General_Utilities href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main//ConfGF/General Utilities.md" "Details"
The ConfGF project implements a framework for generating molecular conformations using a score-based generative model. The main flow involves training a Distance Score Model to learn the distribution of inter-atomic distances, which is then used by the ConfGF Runner to generate 3D molecular structures through Langevin Dynamics. Molecular data is prepared and transformed by the Molecular Data Processing component, and the quality of generated conformations is assessed by the Conformation Generation & Evaluation module. Various General Utilities support these core functionalities.
The central orchestration component responsible for managing the entire ConfGF pipeline, including training the distance score model, evaluating its performance, and generating molecular conformations using Langevin Dynamics. It coordinates interactions between data handling, model execution, and evaluation modules.
Related Classes/Methods:
ConfGF.confgf.runner.default_runner.DefaultRunner(18:395)ConfGF.confgf.runner.default_runner.DefaultRunner.train(110:183)ConfGF.confgf.runner.default_runner.DefaultRunner.evaluate(79:107)ConfGF.confgf.runner.default_runner.DefaultRunner.distance_Langevin_Dynamics(195:218)ConfGF.confgf.runner.default_runner.DefaultRunner.position_Langevin_Dynamics(222:258)
Implements the core graph neural network model, DistanceScoreMatch, which learns to predict scores related to inter-atomic distances. This model is central to the ConfGF framework, utilizing graph convolutions and transformations to process molecular graphs and output distance-based scores.
Related Classes/Methods:
ConfGF.confgf.models.scorenet.DistanceScoreMatch(12:200)ConfGF.confgf.models.scorenet.DistanceScoreMatch.extend_graph(48:91)ConfGF.confgf.models.scorenet.DistanceScoreMatch.get_score(103:122)ConfGF.confgf.models.scorenet.DistanceScoreMatch.forward(124:200)ConfGF.confgf.layers.gin.GraphIsomorphismNetwork(64:130)ConfGF.confgf.layers.gin.GINEConv(14:61)ConfGF.confgf.layers.common.MultiLayerPerceptron(47:94)
Handles the entire lifecycle of molecular data, from loading raw SMILES strings or RDKit molecules to transforming them into graph representations suitable for the neural network. This includes managing datasets, adding higher-order edges, and incorporating chemical properties.
Related Classes/Methods:
ConfGF.confgf.dataset.dataset.smiles_to_data(77:124)ConfGF.confgf.dataset.dataset.rdmol_to_data(27:75)ConfGF.confgf.dataset.dataset.GEOMDataset(407:439)ConfGF.confgf.dataset.dataset.GEOMDataset_PackedConf(444:492)ConfGF.confgf.utils.transforms.AddHigherOrderEdges(8:57)ConfGF.confgf.utils.transforms.AddEdgeLength(59:67)ConfGF.confgf.utils.chem.mol_to_smiles(75:76)ConfGF.confgf.utils.chem.get_atom_symbol(71:72)
Provides functionalities for generating 3D molecular conformations from distance matrices using distance geometry and for evaluating the quality of these generated conformations. It includes metrics like RMSD and MMD to assess structural accuracy.
Related Classes/Methods:
ConfGF.confgf.utils.distgeom.Embed3D(39:57)ConfGF.confgf.utils.distgeom.embed_3D(3:36)ConfGF.confgf.utils.distgeom.get_d_from_pos(59:60)ConfGF.confgf.utils.evaluation.get_rmsd_confusion_matrix(11:32)ConfGF.confgf.utils.evaluation.evaluate_conf(35:38)ConfGF.confgf.utils.evaluation.compute_mmd(162:180)ConfGF.confgf.utils.chem.set_rdmol_positions(49:57)
A collection of foundational helper functions that support various operations across the ConfGF project. This includes general PyTorch utilities for learning rate scheduling and tensor manipulation, as well as common readout functions for graph neural networks.
Related Classes/Methods: