Skip to content

TheRaphael0000/anime_wordclouds

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Information

Feel free to create pull requests, but do not commit subtitles !

To create a visualization :

  1. Extracts the subtitles using FFMPEG to the VTT format, due to obvious copyright problems, they can't be on the repository.
  2. Preprocess the image using a graphical tool to create a mask.
    • Black: Word cloud space
    • White: Kept as is from the image
    • Grey value: Discarded from the visualization
  3. From this mask and the words obtained from the subtitles, the script uses nltk to remove stop words, wordcloud to create a visualization and a bit of numpy image math's.

List

  1. Cowboy Bebop
  2. Neon Genesis Evangelion
  3. Darling in the Franxx
  4. Mirai Nikki
  5. Death Note
  6. Steins;Gate
  7. One-Punch Man

Cowboy Bebop

Data used:

Reddit posts: r/dataisbeautiful / r/cowboybebop

Neon Genesis Evangelion

Data used:

Reddit posts : r/dataisbeautiful r/evangelion

Darling in the Franxx

Data used:

Mirai Nikki

Data used:

Death Note

Data used:

Steins;Gate

Data used:

One-Punch Man

Data used: