
Gokultcr/NLP-20newsgroup-data

NLP-20newsgroup-data

Exploring different NLP operations on the 20 Newsgroups dataset.

The data contains approximately 20,000 newsgroup documents spread across 20 online newsgroups.

The 20 different newsgroups are:

  • alt.atheism
  • comp.graphics
  • comp.os.ms-windows.misc
  • comp.sys.ibm.pc.hardware
  • comp.sys.mac.hardware
  • comp.windows.x
  • misc.forsale
  • rec.autos
  • rec.motorcycles
  • rec.sport.baseball
  • rec.sport.hockey
  • sci.crypt
  • sci.electronics
  • sci.med
  • sci.space
  • soc.religion.christian
  • talk.politics.guns
  • talk.politics.mideast
  • talk.politics.misc
  • talk.religion.misc
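
The newsgroup names are dot-separated, so the part before the first dot gives a coarse topic family (comp, rec, sci, and so on). As a minimal sketch, the snippet below groups the 20 names by that prefix. The helper name `group_by_top_level` and the constant `NEWSGROUPS` are hypothetical, introduced only for illustration, and are not part of this repository.

```python
from collections import defaultdict

# The 20 newsgroup names listed above.
NEWSGROUPS = [
    "alt.atheism",
    "comp.graphics",
    "comp.os.ms-windows.misc",
    "comp.sys.ibm.pc.hardware",
    "comp.sys.mac.hardware",
    "comp.windows.x",
    "misc.forsale",
    "rec.autos",
    "rec.motorcycles",
    "rec.sport.baseball",
    "rec.sport.hockey",
    "sci.crypt",
    "sci.electronics",
    "sci.med",
    "sci.space",
    "soc.religion.christian",
    "talk.politics.guns",
    "talk.politics.mideast",
    "talk.politics.misc",
    "talk.religion.misc",
]


def group_by_top_level(names):
    """Map each top-level prefix (text before the first dot) to its newsgroups."""
    groups = defaultdict(list)
    for name in names:
        prefix = name.split(".", 1)[0]
        groups[prefix].append(name)
    return dict(groups)


if __name__ == "__main__":
    # Print how many newsgroups fall under each top-level prefix.
    for prefix, members in group_by_top_level(NEWSGROUPS).items():
        print(f"{prefix}: {len(members)}")
```

Grouping like this can be handy when picking a subset of related (or deliberately unrelated) categories to experiment with.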

Libraries

  • numpy
  • matplotlib
  • sklearn
  • seaborn
  • nltk
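
One possible way to get started with these libraries is sketched below. It loads a subset of the dataset through scikit-learn's `fetch_20newsgroups` and fits a TF-IDF plus Naive Bayes classifier. This is an assumption about a reasonable first experiment, not necessarily what this repository does. The function names `build_text_classifier` and `load_newsgroups` are hypothetical, and the choice of categories is arbitrary.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline


def build_text_classifier():
    """Return an unfitted TF-IDF + multinomial Naive Bayes pipeline."""
    return make_pipeline(TfidfVectorizer(stop_words="english"), MultinomialNB())


def load_newsgroups(categories):
    """Fetch training posts for the given categories as (texts, label names).

    Assumption: network access is available, since scikit-learn downloads
    the dataset on the first call and caches it afterwards.
    """
    from sklearn.datasets import fetch_20newsgroups

    train = fetch_20newsgroups(
        subset="train",
        categories=categories,
        # Strip metadata that would otherwise make the task artificially easy.
        remove=("headers", "footers", "quotes"),
    )
    labels = [train.target_names[i] for i in train.target]
    return train.data, labels


if __name__ == "__main__":
    docs, labels = load_newsgroups(["sci.space", "rec.autos"])
    model = build_text_classifier().fit(docs, labels)
    print(model.predict(["The rocket reached orbit after launch"]))
```

Keeping the pipeline construction separate from data loading makes it easy to swap in another vectorizer or classifier without touching the download step.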
