Skip to content

Its Python based Topic Modelling project which identifies hidden topics from the given data using LDA(Latent Dirichlet Allocation)

Notifications You must be signed in to change notification settings

Kaustubh-Tambe/BBC-News_Topic-Modelling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

BBC-News_Topic-Modelling

project indentify the topics from the BBC news data.Its a python based Project and uses LDA(Latent Dirichlet Allocation) Algorithm for Topic Modelling. Data Contains various news feeds from BBC News which are then filtered and processed using various Python Packages such as NLTK, Genim and Spacy. However rather than using ntlk.stopwords i have use different file(extended_stopwords.txt), which contains extra set of words for removing stopwords from my data in preprocessing step. Further Details are mentione as comment or text in code .pynb file.

About

Its Python based Topic Modelling project which identifies hidden topics from the given data using LDA(Latent Dirichlet Allocation)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published