zip-crawl

Introduction

This script crawls job postings on ZipRecruiter.
Each page of search results contains approximately 20 job postings.
In testing, searches for java and python each returned 25 pages and no more.

Getting started

Step 1: Clone repository to local machine

git clone https://github.com/shao-shuai/zip-crawl.git

Step 2: Create Python virtual environment

python3 -m venv venv             # create virtualenv
source venv/bin/activate         # activate virtualenv
pip install -r requirements.txt  # install requirements

Step 3: Run script

zip_crawler.py takes two arguments: the first is the keyword to search for on ZipRecruiter (e.g., java), and the second is the number of pages to crawl (an integer from 1 to 25).
```
python3 zip_crawler.py java 10
```
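
For illustration, the two arguments could be validated along the following lines. This is only a sketch and not the code in zip_crawler.py: the function names `parse_args` and `estimated_postings` and the constants are hypothetical, and the limits (25 pages, roughly 20 postings per page) are taken from the introduction above.

```python
import argparse

MAX_PAGES = 25          # the introduction reports at most 25 pages per search
POSTINGS_PER_PAGE = 20  # approximate figure from the introduction


def parse_args(argv):
    """Parse a keyword and a page count, rejecting page counts outside 1-25."""
    parser = argparse.ArgumentParser(description="Hypothetical argument parser")
    parser.add_argument("keyword", help="search keyword, e.g. java")
    parser.add_argument("pages", type=int, help="number of pages to crawl (1-25)")
    args = parser.parse_args(argv)
    if not 1 <= args.pages <= MAX_PAGES:
        # parser.error prints a usage message and exits with status 2
        parser.error(f"pages must be between 1 and {MAX_PAGES}")
    return args


def estimated_postings(pages):
    """Rough estimate of how many postings a crawl of `pages` pages yields."""
    return pages * POSTINGS_PER_PAGE


if __name__ == "__main__":
    args = parse_args(["java", "10"])
    print(args.keyword, args.pages, estimated_postings(args.pages))
```

With the example command above (java, 10 pages), this estimate comes to about 200 postings.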

zip_crawler_url.py - example

python3 zip_crawler_url.py 'https://www.ziprecruiter.com/c/CyberCoders/Job/Sr.-Backend-Engineer-Python-&-AWS-Azure-GCP-100-REMOTE!/-in-Redwood-City,CA?jid=DQ10301beeda6fa0a85bd8b3abddec6a60&job_id=1dc2e1fe2f8dc47041fcda53050882f4'
Please input saving path: ./output
Please input filename (with file extension): test.txt
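
The saving path and filename from the prompts above have to be combined into one output file. A minimal sketch of that step, assuming the folder should be created when it does not exist yet (the helper `build_output_path` is hypothetical and is not taken from zip_crawler_url.py):

```python
import tempfile
from pathlib import Path


def build_output_path(saving_path, filename):
    """Join the saving path and filename, creating the folder if needed."""
    folder = Path(saving_path)
    folder.mkdir(parents=True, exist_ok=True)
    return folder / filename


# Usage: write placeholder text into a temporary folder instead of ./output.
with tempfile.TemporaryDirectory() as tmp:
    target = build_output_path(Path(tmp) / "output", "test.txt")
    target.write_text("job description goes here\n", encoding="utf-8")
    print(target.name)
```

Note that the URL in the command above is wrapped in single quotes because it contains `&` and `!`, which the shell would otherwise interpret.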

zip_crawler_minputs.py - example

python3 zip_crawler_minputs.py 
Please input a keyword: python 
Please input a folder name: python
Please input job location: 
Please input nubmer of postings to crawl:

python3 zip_crawler_minputs.py 
Please input a keyword: java
Please input a folder name: test
Please input job location: 
Please input nubmer of postings to crawl: 9

python3 zip_crawler_minputs.py 
Please input a keyword: java
Please input a folder name: java
Please input job location: san francisco
Please input the nubmer of postings to crawl: 3
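
Because each results page holds roughly 20 postings, a requested number of postings maps onto a page count. The sketch below shows that arithmetic under two assumptions that are not confirmed by the script itself: an empty answer means "crawl everything available", and the 25-page limit from the introduction applies. The function name `pages_needed` is hypothetical.

```python
import math

POSTINGS_PER_PAGE = 20  # approximate, per the introduction
MAX_PAGES = 25          # observed upper bound, per the introduction


def pages_needed(postings_answer):
    """Translate the answer to the 'number of postings' prompt into pages.

    Assumption: an empty answer means all available pages are crawled.
    """
    text = postings_answer.strip()
    if not text:
        return MAX_PAGES
    count = int(text)
    if count < 1:
        raise ValueError("number of postings must be at least 1")
    return min(math.ceil(count / POSTINGS_PER_PAGE), MAX_PAGES)


print(pages_needed("9"))   # 1
print(pages_needed("45"))  # 3
print(pages_needed(""))    # 25
```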
