Prize Scraper for polla.cl

This script scrapes prize information from polla.cl, updates the corresponding values in a Google Sheet, and logs the number of cells updated.

Libraries Used

bs4 (Beautiful Soup): Used to parse the HTML source code of the website.
selenium: Used to interact with the website and retrieve the HTML source code.
googleapiclient: Used to interact with the Google Sheets API.
google.oauth2.service_account: Used to authenticate the script with the Google Sheets API.
os: Used to retrieve an environment variable.
sys: Used to exit the script if necessary.
time: Used to introduce a sleep period in case of temporary errors.
logging: Used for improved error reporting and logging.
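
As a rough illustration of how selenium and bs4 divide the work, the sketch below fetches the rendered page with headless Chrome and then parses it with Beautiful Soup. It is not the code in main.py: the function names, the CSS selector argument, and the assumption that prize amounts appear as strings such as "$1.200.000.000" (dots as thousands separators) are all hypothetical.

```python
import re


def fetch_page_source(url: str) -> str:
    """Load a page in headless Chrome and return its rendered HTML."""
    # Imported lazily so the parsing helpers below work without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    options.add_argument("--headless")  # no visible browser window
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        return driver.page_source
    finally:
        driver.quit()  # always release the browser, even on errors


def extract_prize_texts(html: str, css_selector: str) -> list[str]:
    """Return the stripped text of every element matching css_selector.

    The selector is a placeholder; the real one depends on the page markup.
    """
    from bs4 import BeautifulSoup

    soup = BeautifulSoup(html, "html.parser")
    return [node.get_text(strip=True) for node in soup.select(css_selector)]


def parse_amount(text: str) -> int:
    """Convert a string such as '$1.200.000.000' to the integer 1200000000.

    Assumes every non-digit character is formatting and can be dropped.
    """
    digits = re.sub(r"\D", "", text)
    if not digits:
        raise ValueError(f"no digits found in {text!r}")
    return int(digits)
```

A caller would chain the three helpers: fetch the HTML, extract the matching text nodes, then convert each one with parse_amount before writing it to the sheet.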

Usage

Install the required libraries listed in requirements.txt.
Download and install a ChromeDriver version that matches your installed Chrome.
Create a service account JSON file and save it as service-account.json in the same directory as the script.
Get the Google Sheets spreadsheet ID and update the SPREADSHEET_ID variable in the script.
Set the CREDENTIALS environment variable to the contents of your service account JSON file.
Run the script: python main.py
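
The credential and spreadsheet steps above could be wired together roughly as follows. This is a minimal sketch under stated assumptions, not the code in main.py: the helper names, the scope list, and the "RAW" input option are illustrative choices, and the spreadsheet ID and range passed to update_range are whatever your own sheet uses.

```python
import json
import os

# Assumed scope: read/write access to Google Sheets.
SCOPES = ["https://www.googleapis.com/auth/spreadsheets"]


def load_service_account_info(env_var: str = "CREDENTIALS") -> dict:
    """Parse the service account JSON stored in an environment variable."""
    raw = os.environ.get(env_var)
    if not raw:
        raise RuntimeError(f"environment variable {env_var} is not set")
    return json.loads(raw)


def build_sheets_service(info: dict):
    """Create a Google Sheets API client from service account info."""
    # Imported lazily so the helpers around it work without the Google libraries.
    from google.oauth2 import service_account
    from googleapiclient.discovery import build

    credentials = service_account.Credentials.from_service_account_info(
        info, scopes=SCOPES
    )
    return build("sheets", "v4", credentials=credentials)


def update_range(service, spreadsheet_id: str, range_name: str, rows: list) -> int:
    """Write rows to the given range and return the number of cells updated."""
    result = (
        service.spreadsheets()
        .values()
        .update(
            spreadsheetId=spreadsheet_id,
            range=range_name,
            valueInputOption="RAW",
            body={"values": rows},
        )
        .execute()
    )
    return result.get("updatedCells", 0)
```

The return value of update_range is the count that the script would log as the number of cells updated.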

Notes

The script is set up to use a specific Google Sheet and range. This can be adjusted in the code as necessary.
The script is meant to run after every lottery draw, which takes place three times a week. A GitHub Actions workflow with a cron schedule runs it at a set time so that the spreadsheet stays up to date.
The script is set up to run in headless mode, meaning that the Chrome window will not be visible. This can be adjusted in the code as necessary.
Detailed error messages are logged to app.log using Python's built-in logging module for improved error reporting and debugging.
The script catches exceptions raised during web scraping and Google Sheets update operations and logs them instead of failing with an unhandled error.
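
The logging and temporary-error handling described in these notes might look something like the sketch below. It is an assumption about the general pattern rather than the actual implementation: the function names, the retry count, and the delay are hypothetical, and only the app.log file name comes from the notes above.

```python
import logging
import time

logger = logging.getLogger(__name__)


def configure_logging(path: str = "app.log") -> None:
    """Send log records to a file with timestamps and severity levels."""
    logging.basicConfig(
        filename=path,
        level=logging.INFO,
        format="%(asctime)s %(levelname)s %(message)s",
    )


def run_with_retries(task, attempts: int = 3, delay_seconds: float = 5.0):
    """Call task() until it succeeds or the attempts run out.

    Each failure is logged with its traceback; the last failure is re-raised
    so the caller can decide whether to exit.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return task()
        except Exception:
            logger.exception("attempt %d of %d failed", attempt, attempts)
            if attempt == attempts:
                raise
            time.sleep(delay_seconds)  # pause before retrying a temporary error
```

A scraping or sheet-update step can then be passed in as a callable, for example run_with_retries(lambda: scrape_prizes()), where scrape_prizes stands for whatever function performs that step.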
