Skip to content

edhou20/Medical-Texts-NLP-Clustering-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Medical-Texts-NLP-Clustering-

Collection of 30 medical papers, coded in Python to extract title and abstract, vectorize documents based on 2 NLP models Word2Vec and Doc2Vec, implement dimensionality reduction, determine optimal set of clusters, and cluster via personally-coded unsupervised learning