Skip to content
#

bag-of-words

Here are 596 public repositories matching this topic...

Sentiment analysis on the IMDB dataset using Bag of Words models (Unigram, Bigram, Trigram, Bigram with TF-IDF) and Sequence to Sequence models (one-hot vectors, word embeddings, pretrained embeddings like GloVe, and transformers with positional embeddings).

  • Updated Jun 4, 2024
  • Jupyter Notebook

In this project, we aim to analyze hotel reviews to determine the underlying sentiment expressed by customers. Our goal is to differentiate between positive and negative reviews using Natural Language Processing (NLP) techniques and machine learning algorithms.

  • Updated May 28, 2024
  • Jupyter Notebook

Using text analytics to understand cultural patterns in philosophical texts. Exploring gender, author, region, and time-period differences, and extracting key philosophical concepts.

  • Updated May 28, 2024
  • Jupyter Notebook

The projects here demonstrate how a textual corpus is prepared for analysis, preprocessing steps for computational text mining and extraction of business insights. Concepts such as feature representation using bag of words and TF-IDF are demonstrated, clustering and supervised machine learning algorithms like regression and others are used on a DTM

  • Updated May 9, 2024

The author implemented support vector machine for sentiments analysis and applied two feature extractions, Bag-of-Words (CountVectorizer) and TF-IDF (TfidfVectorizer), after which the results for both methods were analysed. The accuracy obtained for both methods were (BoW = 87%) and (TF-IDF = 86%).

  • Updated Apr 26, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the bag-of-words topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the bag-of-words topic, visit your repo's landing page and select "manage topics."

Learn more