Skip to content

CraigKelly/yangling

Repository files navigation

yangling

Help for wrangling the yelp academic dataset

Important: this is a pretty hacky collection of scripts used for some very specific research purposes. They might be helpful, but you probably don't want to treat these as a general solution for anything :)

Using and Dependencies

You need:

  • dmk (which requires Go unless you download a binary - see below)
  • Python 3
  • unidecode

You should install Go and Python 3 the "normal" way for your OS (and why didn't you already have them installed?!)

dmk can be installed with: go get -u github.com/CraigKelly/dmk

Note: if you don't want to install Go, you can just download the dmk executable from https://github.com/CraigKelly/dmk/tree/master/dist. Just look in the folder matching your platform.

You can install unidecode locallly: pip3 install --user unidecode

Once you have everything working, just running dmk should be sufficient. The scripts assume that you've extracted the yelp academic dataset to this directory.

Licensing

Code in this repository is licensed under the MIT license (see LICENSE).

About

Help for wrangling the yelp academic dataset

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published