Ruben
|
41c3aa822b
|
Update output.txt
|
2018-04-02 22:12:20 +08:00 |
|
Your Name
|
86f6f51763
|
Merge branch 'master' of github.com:lixiang0/baike-spider
|
2018-04-02 22:04:43 +08:00 |
|
Your Name
|
328ea2657b
|
add page extract
|
2018-04-02 22:01:08 +08:00 |
|
Your Name
|
1f123e5758
|
define custom thread number,modified saved path
|
2018-03-23 11:45:06 +08:00 |
|
ruben
|
122a696e09
|
1.添加10秒超时;2.检查文件名是否合法;3.修复中止程序时无法保存爬取状态的问题
|
2018-01-18 20:33:54 +08:00 |
|
ruben
|
4035aa2625
|
1.检查webpages路径,没有则创建
2.解决终止多线程爬取时不能保存爬取状态的问题
|
2018-01-17 01:09:52 +08:00 |
|
ruben
|
d75df2faf8
|
update
|
2018-01-16 21:09:24 +08:00 |
|
Ruben
|
fe4de1191e
|
Update README.md
|
2018-01-16 21:02:56 +08:00 |
|
ruben
|
9d52f16f2c
|
4线程爬取
|
2018-01-16 21:00:36 +08:00 |
|
ruben
|
4e496c11c6
|
init
|
2018-01-15 15:56:26 +08:00 |
|