Skip to content

Scrapes images from Chinese website and recreates it as a gallery flask webpage by Dork Zeegler

Notifications You must be signed in to change notification settings

boonecabaldev/scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spider Test

5-19

Added img_file_path to entites. Ready for flask.

5-18

  • Fixed 302 image download problem with code:
USER_AGENT = 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36'
DEFAULT_REQUEST_HEADERS = {
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8',
    'Accept-Language': 'en',
}
DEFAULT_REQUEST_HEADERS = {
    'Referer': 'https://newtrade6699.x.yupoo.com/albums'
}

DOWNLOAD_DELAY = 0.25    # 250 ms of delay
HTTPCACHE_ENABLED = True

COOKIES_ELABLED = True

Next

  1. add img_file_path to HatComponent and hatleaf
  2. make flask website

5-14

  • MySpider.start_requests: clear db
  • Added one-to-many relation between HatComponent and HatLeaf
  • Separated out items from models
  • Just confirmed we are saving nodes and leafs in db
  • I might have fixed the problem of adding relations between tables

About

Scrapes images from Chinese website and recreates it as a gallery flask webpage by Dork Zeegler

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published