Skip to content

2125-jht/spider

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spider

spider.py:主程序

爬取网页:https://movie.douban.com/top250?start= ,点击下一页时网址会附加上一段(有规律),遍历网址时用得到

temp.html:用于观察网页html形式,方便总结出用什么样的正则表达式去匹配想要的信息

Releases

No releases published

Packages

No packages published