Commit Graph

10 Commits

Author SHA1 Message Date
Ruben
41c3aa822b
Update output.txt 2018-04-02 22:12:20 +08:00
Your Name
86f6f51763 Merge branch 'master' of github.com:lixiang0/baike-spider 2018-04-02 22:04:43 +08:00
Your Name
328ea2657b add page extract 2018-04-02 22:01:08 +08:00
Your Name
1f123e5758 define custom thread number,modified saved path 2018-03-23 11:45:06 +08:00
ruben
122a696e09 1.添加10秒超时;2.检查文件名是否合法;3.修复中止程序时无法保存爬取状态的问题 2018-01-18 20:33:54 +08:00
ruben
4035aa2625 1.检查webpages路径,没有则创建
2.解决终止多线程爬取时不能保存爬取状态的问题
2018-01-17 01:09:52 +08:00
ruben
d75df2faf8 update 2018-01-16 21:09:24 +08:00
Ruben
fe4de1191e
Update README.md 2018-01-16 21:02:56 +08:00
ruben
9d52f16f2c 4线程爬取 2018-01-16 21:00:36 +08:00
ruben
4e496c11c6 init 2018-01-15 15:56:26 +08:00