Skip to content

Latest commit

 

History

History
171 lines (141 loc) · 22.9 KB

File metadata and controls

171 lines (141 loc) · 22.9 KB
graph LR
    Core_Data_Management["Core Data Management"]
    Data_Preparation_Annotation["Data Preparation & Annotation"]
    Model_Core_Metrics["Model Core & Metrics"]
    Data_Tensorization["Data Tensorization"]
    Output_Conversion_Services["Output & Conversion Services"]
    Output_Conversion_Services -- "orchestrates" --> Data_Tensorization
    Output_Conversion_Services -- "uses" --> Data_Tensorization
    Output_Conversion_Services -- "manages" --> Model_Core_Metrics
    Output_Conversion_Services -- "orchestrates" --> Core_Data_Management
    Output_Conversion_Services -- "processes data from" --> Core_Data_Management
    Output_Conversion_Services -- "leverages" --> Data_Preparation_Annotation
    Data_Tensorization -- "prepares data for" --> Model_Core_Metrics
    Data_Tensorization -- "utilizes" --> Data_Preparation_Annotation
    Data_Tensorization -- "depends on" --> Data_Preparation_Annotation
    Data_Tensorization -- "uses" --> Core_Data_Management
    Model_Core_Metrics -- "uses" --> Core_Data_Management
    Model_Core_Metrics -- "evaluates" --> Data_Preparation_Annotation
    Model_Core_Metrics -- "integrates with" --> Data_Preparation_Annotation
    click Core_Data_Management href "./Core Data Management.md" "Details"
    click Data_Preparation_Annotation href "./Data Preparation & Annotation.md" "Details"
    click Model_Core_Metrics href "./Model Core & Metrics.md" "Details"
    click Data_Tensorization href "./Data Tensorization.md" "Details"
    click Output_Conversion_Services href "./Output & Conversion Services.md" "Details"
Loading

CodeBoardingDemoContact

Component Details

CoSpred is a software system designed for predicting mass spectrometry spectra of peptides. It processes raw input data, performs peptide annotation and data tensorization, utilizes a deep learning model for prediction, and converts the results into various standardized output formats, while also providing a service API for external access.

Core Data Management

Manages data input/output operations (reading/writing HDF5, Arrow, CSV) and provides fundamental utility functions for data validation, sequence manipulation, and array reshaping.

Related Classes/Methods:

Data Preparation & Annotation

Responsible for sanitizing, transforming, and annotating raw input data, including normalization, masking, peptide fragment annotation with m/z values, and matching operations.

Related Classes/Methods:

Model Core & Metrics

Manages the lifecycle of the prediction model, including loading pre-trained weights and custom layers, and provides functionalities for computing various performance metrics to evaluate prediction quality.

Related Classes/Methods:

Data Tensorization

Converts processed data into numerical tensors, preparing it for input into the neural network model by encoding sequences, precursor charges, and applying m/z values.

Related Classes/Methods:

Output & Conversion Services

Provides the core prediction service API, orchestrating the data flow from input to output, and facilitates the conversion of prediction results into various standardized formats (e.g., DIANN, MSP, MGF, CSV, MaxQuant).

Related Classes/Methods: