枫雪 发表于 2013-6-3 17:06:27

【求助】关于采集分类

为什么我在使用【多页网址获取】的时候,采集分页中间会跳过,总共27页。
起始网址:http://www.******.com/list/list1.html
采集分页的时候结果如下:
http://www.******.com/list/list1.html
http://www.******.com/list/list1_2.html
http://www.******.com/list/list1_3.html
http://www.******.com/list/list1_4.html
http://www.******.com/list/list1_5.html
http://www.******.com/list/list1_6.html
http://www.******.com/list/list1_7.html
http://www.******.com/list/list1_8.html
http://www.******.com/list/list1_27.html
http://www.******.com/list/list1_26.html
http://www.******.com/list/list1_20.html
http://www.******.com/list/list1_21.html
http://www.******.com/list/list1_22.html
http://www.******.com/list/list1_23.html
http://www.******.com/list/list1_24.html
http://www.******.com/list/list1_25.html

为什么没中间的几页?

如果把起始网址换成:http://www.******.com/list/list1_9.html
采集分页的时候结果如下:
http://www.******.com/list/list1_9.html
http://www.******.com/list/list1.html
http://www.******.com/list/list1_8.html
http://www.******.com/list/list1_5.html
http://www.******.com/list/list1_6.html
http://www.******.com/list/list1_7.html
http://www.******.com/list/list1_10.html
http://www.******.com/list/list1_11.html
http://www.******.com/list/list1_12.html
http://www.******.com/list/list1_27.html
http://www.******.com/list/list1_26.html
http://www.******.com/list/list1_20.html
http://www.******.com/list/list1_21.html
http://www.******.com/list/list1_22.html
http://www.******.com/list/list1_23.html
http://www.******.com/list/list1_24.html
http://www.******.com/list/list1_25.html

还是少页数

lmj243 发表于 2013-6-3 22:27:14

分布获取的规则不对,

www.bccpw.com 发表于 2013-6-5 09:51:14

不错不错,是个好工具

wxl08 发表于 2013-6-5 10:09:23

列表分页规则没有获取正确,原理是上下页模式的,从当前页仅获取下一页的连接地址,以此类推

jinyaowei 发表于 2013-6-15 23:38:52

来学习了沙发呵呵!
页: [1]
查看完整版本: 【求助】关于采集分类