【求助】关于采集分类
为什么我在使用【多页网址获取】的时候,采集分页中间会跳过,总共27页。起始网址:http://www.******.com/list/list1.html
采集分页的时候结果如下:
http://www.******.com/list/list1.html
http://www.******.com/list/list1_2.html
http://www.******.com/list/list1_3.html
http://www.******.com/list/list1_4.html
http://www.******.com/list/list1_5.html
http://www.******.com/list/list1_6.html
http://www.******.com/list/list1_7.html
http://www.******.com/list/list1_8.html
http://www.******.com/list/list1_27.html
http://www.******.com/list/list1_26.html
http://www.******.com/list/list1_20.html
http://www.******.com/list/list1_21.html
http://www.******.com/list/list1_22.html
http://www.******.com/list/list1_23.html
http://www.******.com/list/list1_24.html
http://www.******.com/list/list1_25.html
为什么没中间的几页?
如果把起始网址换成:http://www.******.com/list/list1_9.html
采集分页的时候结果如下:
http://www.******.com/list/list1_9.html
http://www.******.com/list/list1.html
http://www.******.com/list/list1_8.html
http://www.******.com/list/list1_5.html
http://www.******.com/list/list1_6.html
http://www.******.com/list/list1_7.html
http://www.******.com/list/list1_10.html
http://www.******.com/list/list1_11.html
http://www.******.com/list/list1_12.html
http://www.******.com/list/list1_27.html
http://www.******.com/list/list1_26.html
http://www.******.com/list/list1_20.html
http://www.******.com/list/list1_21.html
http://www.******.com/list/list1_22.html
http://www.******.com/list/list1_23.html
http://www.******.com/list/list1_24.html
http://www.******.com/list/list1_25.html
还是少页数
分布获取的规则不对, 不错不错,是个好工具 列表分页规则没有获取正确,原理是上下页模式的,从当前页仅获取下一页的连接地址,以此类推 来学习了沙发呵呵!
页:
[1]