PROJECT // Colin Gerber

Clinical Data Visualization

Data Processing for the Project

To support a better search interaction for the clinical trials data, we decided to focus on three fundamental building blocks first: deduplicating the institutions and facilities in the database, creating a research author graph, and implementing a thesaurus that maps MeSH terms to common terms.

Natural Language Processing for the Project

To further improve search, we decided to make two primary improvements to the data in the database: better MeSH tagging of the trials and more category tags for the trials' eligibility criteria.

Data Processing for the Project

Sponsor and facility deduplication

Researcher and publication links

Medical thesaurus

Natural Language Processing for the Project

MeSH tagging of trials

Extracting structured eligibility criteria