Skip to content

Beers' characteristics crawler for drizly.com. Written in Python.

License

Notifications You must be signed in to change notification settings

mawippel/drizly-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Drizly Web Crawler

Beers' characteristics Web Crawler. Extracting from Drizly website.

Intro

Its main goal is to retrieve the beers characteristics, given a certain beer style. It crawls all the beers in all pages until it reaches the end.

Setup

# Create python venv
python3.7 -m venv .venv

# Linux
source .venv/bin/activate

# Windows
.venv\Scripts\activate

pip install -r requirements.txt

Running the Crawler

To run the crawler you must pass the category's endpoint as an argument to the python script. An example is shown below:

python .\drizly_crawler.py /beer/ale/ipa/c15

In this example, the seed of the crawler will be the https://drizly.com/beer/ale/ipa/c15 page. When the end of the page is reached, it jumps to the next page and all the crawling process runs again, until all the pages for this beer style are crawled.

Buy Me A Coffee

About

Beers' characteristics crawler for drizly.com. Written in Python.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages