Skip to content

web scraping of jofogas.hu Budapest real estate ads as a showcase project

License

Notifications You must be signed in to change notification settings

hornlaszlomark/jofogas_scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

🏡 Web scraping project of jofogas.hu 🏠

As part of my application, I'd like to present some of my skills to the recruiter(s) of Casafari in a showcase project.

Goal:

Challenges:

  • creating the scraping logic
  • dealing with missing data and outliers

Python libraries:

  • BeautifulSoup
  • requests
  • pandas

Selenium is slower compared to the ones I used for this project. I'm going replicate the whole thing in Scrapy later.

TODO:

  • adding user-agent
  • scraping the amount of pictures are uploaded
  • scraping the real estate agency's name and contact
  • scraping the contents of the real estate pages (category, type of heating, floor, view, etc.)
  • async/await introduction for parallel processing (reducing wait time)
  • production ready implementation

About

web scraping of jofogas.hu Budapest real estate ads as a showcase project

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages