Objective: Define the project scope and select a single target disease for gene-disease association prediction.
Starting point: General project idea of predicting gene-disease associations using OpenTargets and STRING database data. Please check the "Machine Learning Focus - Gene-Disease Association Prediction" post for details of the initial data being used by the ML teams.
Tasks:
Select one specific disease for prediction:
Research disease categories in OpenTargets
Analyze data availability and quality for different diseases
Choose one disease based on criteria such as data abundance, research interest, and potential impact
Document selection criteria and rationale
Analyze data characteristics for the chosen disease:
Investigate the number of known gene associations
Assess the quality and completeness of available data
Identify any unique features or challenges associated with the selected disease
Define project scope:
Specify the exact prediction task (e.g., binary classification of gene-disease associations)
Outline the potential impact and applications of accurate predictions for the chosen disease
Create project documentation:
Write a detailed project objective
Justify the choice of the target disease
Specify evaluation criteria for the prediction task
Document relevant data sources (OpenTargets and STRING)
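To compare candidate diseases on data abundance, one possible starting point is a per-disease count of known gene associations. The sketch below is only an illustration: the rows, the IDs (`DISEASE_A`, `GENE_1`, ...), the `summarize_diseases` helper, and the 0.5 score cutoff are all hypothetical, and real rows would have to come from an OpenTargets export whose actual column names may differ.

```python
from collections import defaultdict

# Toy rows: (disease_id, gene_id, association_score). Hypothetical values
# for illustration only; real rows would come from an OpenTargets export.
ROWS = [
    ("DISEASE_A", "GENE_1", 0.82),
    ("DISEASE_A", "GENE_2", 0.35),
    ("DISEASE_A", "GENE_3", 0.61),
    ("DISEASE_B", "GENE_1", 0.12),
    ("DISEASE_B", "GENE_4", 0.74),
    ("DISEASE_C", "GENE_5", 0.05),
]


def summarize_diseases(rows, min_score=0.5):
    """Count distinct genes per disease, overall and at or above min_score."""
    all_genes = defaultdict(set)
    strong_genes = defaultdict(set)
    for disease, gene, score in rows:
        all_genes[disease].add(gene)
        if score >= min_score:
            strong_genes[disease].add(gene)
    summary = [
        {"disease": d, "n_genes": len(genes), "n_strong": len(strong_genes[d])}
        for d, genes in all_genes.items()
    ]
    # Rank by strong associations first, then total genes, then ID for a stable order.
    summary.sort(key=lambda r: (-r["n_strong"], -r["n_genes"], r["disease"]))
    return summary


for row in summarize_diseases(ROWS):
    print(row)
```

A table like this only covers the "data abundance" criterion; research interest and potential impact still need a written rationale.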
Expected outcome:
Selection of one target disease for prediction, with detailed justification
Analysis of data characteristics for the chosen disease
Clear project scope and objectives
Detailed project documentation including rationale for disease selection, objectives, and relevant data sources
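When writing up evaluation criteria for a binary classification task, it may help to state the metrics concretely. The sketch below assumes labels of 1 for a known association and 0 otherwise, plus a model score per gene; the function names, the 0.5 threshold, and the sample values are hypothetical. Precision at k is included because known associations are typically a small fraction of all genes, so the quality of the top-ranked candidates is often more informative than plain accuracy.

```python
def precision_recall(y_true, y_score, threshold=0.5):
    """Precision and recall for binary labels at a score threshold."""
    pairs = list(zip(y_true, y_score))
    tp = sum(1 for t, s in pairs if t == 1 and s >= threshold)
    fp = sum(1 for t, s in pairs if t == 0 and s >= threshold)
    fn = sum(1 for t, s in pairs if t == 1 and s < threshold)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def precision_at_k(y_true, y_score, k):
    """Fraction of known associations among the k highest-scored genes."""
    ranked = sorted(zip(y_score, y_true), key=lambda pair: -pair[0])
    return sum(label for _, label in ranked[:k]) / k


# Hypothetical labels (1 = known association) and model scores for six genes.
y_true = [1, 0, 1, 0, 0, 1]
y_score = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]

precision, recall = precision_recall(y_true, y_score)
print(f"precision={precision:.2f} recall={recall:.2f}")
print(f"precision@2={precision_at_k(y_true, y_score, 2):.2f}")
```

Whichever metrics are chosen, the documentation should also say how negative examples are defined, since an unlabeled gene is not necessarily a true negative.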
Are you available to meet with Anya this weekend to discuss your findings? If you have a preference, please let us know below (pick all time slots that may work):
Sat (08/03) morning Pacific
Sat afternoon Pacific
Sat evening Pacific
Sun morning Pacific
Sun afternoon Pacific
Sun evening Pacific
Please make sure to come prepared. We will use OpenTargets and STRING for bulk data. OMIM does not permit scraping and requires a license for API access, but we can still get useful targeted data from it.
The meeting is open; all are welcome to attend.