Problem with JS pagination across many pages
The URLs to scrape are:
http://www.968115.cn/search.action?bean.code=ckxmn&bean.whereList=ID&bean.whereList=402880b33e361984013e3619992e0008&bean.whereList=
and URL 2:
http://www.968115.cn/search.action?bean.code=ckxmn&bean.whereList=ID&bean.whereList=402880b33e307405013e30744e3f0009&bean.whereList=
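There are many more pages like these! The "Services provided" (提供服务) section on each page is paginated with JS, but watching each page in Fiddler2 shows that bean.code is different every time, which means every table would need its own POST rule. If there are several hundred pages like this, how can I scrape them all with a single rule?!

Scrape it using POST. Set the string that changes to a random value.

303718 posted on 2014-10-10 13:53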
Scrape it using POST. Set the string that changes to a random value.
Thanks for your reply. For example, the Raw Streams of two similar lists in the same section are, respectively:
bean.code=jg01n&bean.pageNo=2&bean.whereList%5B0%5D=ORG_ID&bean.whereList%5B0%5D=402880b33e361984013e3619992e0008&bean.whereList%5B0%5D=&bean.whereList%5B1%5D=SERVICE_NEW_TYPE1&bean.whereList%5B1%5D=&bean.whereList%5B1%5D=&bean.whereList%5B2%5D=ID&bean.whereList%5B2%5D=&bean.whereList%5B2%5D=&bean.whereList%5B5%5D=SERVICE_NAME&bean.whereList%5B5%5D=&bean.whereList%5B5%5D=&orgType=2&orgType=2&orgType=2&orgType=2&orgType=2&ss=
and
bean.code=jg01n&bean.pageNo=2&bean.whereList%5B0%5D=ORG_ID&bean.whereList%5B0%5D=402880b33e307405013e30744d640001&bean.whereList%5B0%5D=&bean.whereList%5B1%5D=SERVICE_NEW_TYPE1&bean.whereList%5B1%5D=&bean.whereList%5B1%5D=&bean.whereList%5B2%5D=ID&bean.whereList%5B2%5D=&bean.whereList%5B2%5D=&bean.whereList%5B5%5D=SERVICE_NAME&bean.whereList%5B5%5D=&bean.whereList%5B5%5D=&orgType=2&orgType=2&orgType=2&orgType=2&orgType=2&ss=
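Long POST bodies like the two above are hard to compare by eye, so it may help to diff them field by field. Below is a minimal Python sketch, assuming both bodies contain the same fields in the same order; the function name `differing_fields` is made up for illustration and is not part of any scraping tool.

```python
from urllib.parse import parse_qsl


def differing_fields(body_a, body_b):
    """Return (position, key, value_a, value_b) for every field that differs.

    Assumes both bodies list the same fields in the same order; if one body
    is longer, the extra trailing fields are ignored by zip().
    """
    # keep_blank_values=True keeps empty fields such as "ss=" in the result.
    pairs_a = parse_qsl(body_a, keep_blank_values=True)
    pairs_b = parse_qsl(body_b, keep_blank_values=True)

    differences = []
    for position, ((key_a, value_a), (key_b, value_b)) in enumerate(zip(pairs_a, pairs_b)):
        if (key_a, value_a) != (key_b, value_b):
            differences.append((position, key_a, value_a, value_b))
    return differences
```

Fed the two Raw Streams quoted above, this should report a single difference: the second `bean.whereList[0]` field, that is, the value that follows ORG_ID. In these two captures bean.code and every other field are identical.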
I set the POST rule to:
bean.code=jg01n&bean.pageNo=[分页]&bean.whereList%5B0%5D=ORG_ID&bean.whereList%5B0%5D=&bean.whereList%5B0%5D=&bean.whereList%5B1%5D=SERVICE_NEW_TYPE1&bean.whereList%5B1%5D=&bean.whereList%5B1%5D=&bean.whereList%5B2%5D=ID&bean.whereList%5B2%5D=&bean.whereList%5B2%5D=&bean.whereList%5B5%5D=SERVICE_NAME&bean.whereList%5B5%5D=&bean.whereList%5B5%5D=&orgType=2&orgType=2&orgType=2&orgType=2&orgType=2&ss=
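Why is it that after adding the random value, even the pagination of the original single POST page can no longer be found? (The string of digits I set as the POST random value shares the same first few characters here, but on some later pages of the site those differ too, so the whole string should be treated as random.)

This kind of POST data is so messy that it sometimes makes my eyes blur.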
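One possible explanation, which is only a guess from the two captures: the server probably uses the value after ORG_ID to decide which organization's service list to return, so a random or blank value would match nothing. If so, the body would need to be rebuilt with a real ID for each organization and each page number. Below is a minimal Python sketch of that idea; `build_post_body` is a made-up helper name, the field layout is copied from the captures above, and how to collect the list of real IDs is left open.

```python
from urllib.parse import urlencode


def build_post_body(org_id, page_no):
    """Rebuild the captured POST body for one organization ID and page number.

    Assumption: only the ORG_ID value and bean.pageNo change between requests,
    as in the two Raw Streams quoted in this thread.
    """
    fields = [("bean.code", "jg01n"), ("bean.pageNo", str(page_no))]

    # Each filter appears three times in the capture: column name, value, blank.
    filters = [
        (0, "ORG_ID", org_id),
        (1, "SERVICE_NEW_TYPE1", ""),
        (2, "ID", ""),
        (5, "SERVICE_NAME", ""),
    ]
    for index, column, value in filters:
        key = f"bean.whereList[{index}]"
        fields += [(key, column), (key, value), (key, "")]

    # The capture repeats orgType=2 five times and ends with an empty ss field.
    fields += [("orgType", "2")] * 5
    fields.append(("ss", ""))

    # urlencode percent-encodes the brackets as %5B and %5D, as in the capture.
    return urlencode(fields)


# Example: bodies for pages 1 to 3 of the two organization IDs seen above.
org_ids = [
    "402880b33e361984013e3619992e0008",
    "402880b33e307405013e30744d640001",
]
bodies = [build_post_body(org_id, page) for org_id in org_ids for page in range(1, 4)]
```

Each string in `bodies` could then be sent as the POST data to whatever URL Fiddler2 shows for the pagination request. Whether the server also checks cookies or other headers is not something the captures here can tell.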