Improve Accuracy of The Model #1

Open
meuzgebre opened this issue Aug 23, 2022 · 3 comments

Comments

@meuzgebre

I ran your code. The naive_bayes.GaussianNB estimator you implemented gives an accuracy of 0.6. It is a suitable estimator for text data. However, since you converted the text values into numeric form, you can use other estimators such as LogisticRegression or a linear SVM for better accuracy. I added a LogisticRegression model to your code without any other modification and got around 0.79 accuracy.
In addition, you can improve the accuracy by:

  • Removing NaN and null values from the dataset.
  • For feature extraction, using only headline, category and article as your data, as the rest of the columns are not necessary.
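
The suggestions above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the column names `headline`, `article` and `category` come from the comment, but treating `category` as the label, using `TfidfVectorizer` for the text-to-numeric step, and the toy rows themselves are all assumptions made for the example. The scores on toy data say nothing about the 0.6 and 0.79 figures reported above.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import LinearSVC

# Assumed layout: two text columns plus a label column.
TEXT_COLUMNS = ["headline", "article"]
LABEL_COLUMN = "category"


def prepare(df):
    """Keep only the relevant columns and drop rows with NaN/null values."""
    df = df[TEXT_COLUMNS + [LABEL_COLUMN]].dropna()
    text = df["headline"] + " " + df["article"]
    return text, df[LABEL_COLUMN]


def compare(train_df, test_df):
    """Fit three estimators on the same TF-IDF features and return accuracies."""
    train_text, y_train = prepare(train_df)
    test_text, y_test = prepare(test_df)

    # TF-IDF is an assumed stand-in for the original text-to-numeric step.
    vectorizer = TfidfVectorizer()
    X_train = vectorizer.fit_transform(train_text)
    X_test = vectorizer.transform(test_text)

    scores = {}
    # GaussianNB requires dense arrays; the linear models accept sparse input.
    nb = GaussianNB().fit(X_train.toarray(), y_train)
    scores["GaussianNB"] = nb.score(X_test.toarray(), y_test)

    logreg = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    scores["LogisticRegression"] = logreg.score(X_test, y_test)

    svm = LinearSVC().fit(X_train, y_train)
    scores["LinearSVC"] = svm.score(X_test, y_test)
    return scores


# Hypothetical toy rows, including missing values and an unused column.
train_df = pd.DataFrame({
    "headline": ["team wins cup", "bank raises rates", None,
                 "striker scores twice", "market falls sharply"],
    "article": ["the team won the final match",
                "the central bank raised interest rates",
                "row with a missing headline",
                "the striker scored two goals",
                None],
    "category": ["sport", "business", "sport", "sport", "business"],
    "views": [10, 20, 30, 40, 50],
})
test_df = pd.DataFrame({
    "headline": ["cup final tonight", "rates rise again"],
    "article": ["the team plays the final match",
                "the bank raised interest rates again"],
    "category": ["sport", "business"],
    "views": [5, 6],
})

scores = compare(train_df, test_df)
print(scores)
```

Fitting all three estimators on identical features keeps the comparison fair: only the model changes, which matches the "without any other modification" setup described above.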
@IsraelAbebe
Owner

IsraelAbebe commented Aug 23, 2022

Good job @meuzgebre, we released this dataset so that many people would work towards improving this accuracy. We would like to see the result you mentioned become the SOTA algorithm for this dataset here.

If you have a writeup and updated code, we are happy to mention it in this readme.

@meuzgebre
Author

Hey @IsraelAbebe, check out my pull requests.

@IsraelAbebe
Owner

@meuzgebre, can you send a pull request to the new branch I created for you? I would like to put it there and edit the readme.
