Skip to content

Latest commit

 

History

History
41 lines (29 loc) · 1.09 KB

README.md

File metadata and controls

41 lines (29 loc) · 1.09 KB

Spider Test

5-19

Added img_file_path to entites. Ready for flask.

5-18

  • Fixed 302 image download problem with code:
USER_AGENT = 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36'
DEFAULT_REQUEST_HEADERS = {
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8',
    'Accept-Language': 'en',
}
DEFAULT_REQUEST_HEADERS = {
    'Referer': 'https://newtrade6699.x.yupoo.com/albums'
}

DOWNLOAD_DELAY = 0.25    # 250 ms of delay
HTTPCACHE_ENABLED = True

COOKIES_ELABLED = True

Next

  1. add img_file_path to HatComponent and hatleaf
  2. make flask website

5-14

  • MySpider.start_requests: clear db
  • Added one-to-many relation between HatComponent and HatLeaf
  • Separated out items from models
  • Just confirmed we are saving nodes and leafs in db
  • I might have fixed the problem of adding relations between tables