sandbord 发表于 2010-3-31 14:30:18

从没见过的内容分页问题问题,请高手指点

本帖最后由 sandbord 于 2010-3-31 14:35 编辑

采集页:http://www.lady8844.com/shoushen/ywyl/2010-03-21/1269183597d379157.html需要采集其分页,我设为
分页连接样式:<a href='[参数]'>
分页网址:http://www.lady8844.com/shoushen/ywyl/2010-03-21/[参数1]
如图:
可以采集到分页,可是问题是分页中的/2010-03-21/本应是一个变量,我这样设置采这篇文章是可以,但是并不具有通用性,而如果要通用,则似乎分页网址应该设这个时间为参数,如此,分页网址却比样式中的参数,从而导致采集不成功。
问题:请教各路高手,象这样的参数数量前后不一的问题,应该如何解决?
谢谢!!

都市乞丐 发表于 2010-3-31 17:46:25

让他自己识别就行了, 不需要自定义的

wuxiguacom 发表于 2010-3-31 18:25:33

采用 上下页/上N页下N页模式 就可以解决了

chengdushaogg 发表于 2010-3-31 18:41:00

以下是加密内容,就是下载站的,下载地址必须经过他的站打开才能下载,复制到浏览器就不行,求解决


var OX_0ba4f39a = '';
OX_0ba4f39a += "<"+"a href=\'http://ads.supplyframe.com/openads/www/delivery/ck.php?oaparams=2__bannerid=7481__zoneid=92__source=Electronic+Components%7CSemiconductors+and+Integrated+Circuits%7CAmplifier+and+Linear+ICs%7COperational+Amplifiers__cb=f699207d31__oadest=http%3A%2F%2Fwww.supplyframe.com%2Ftrackingservlet%2Ftrack%2F%3Faction%3DadClick%26value1%3DElectronic+Components%7CSemiconductors+and+Integrated+Circuits%7CAmplifier+and+Linear+ICs%7COperational+Amplifiers%26value2%3D7481%26value3%3D3%26zone%3D92%26url%3Dhttp%253A%252F%252Fwww.supplyframe.com\' target=\'_blank\'><"+"img src=\'http://ads.supplyframe.com/images/3989e3299e15cf6670d03f94249d32ce.gif\' width=\'728\' height=\'120\' alt=\'\' title=\'\' border=\'0\' /><"+"/a><"+"div id=\'beacon_f699207d31\' style=\'position: absolute; left: 0px; top: 0px; visibility: hidden;\'><"+"img src=\'http://ads.supplyframe.com/openads/www/delivery/lg.php?bannerid=7481&amp;campaignid=1031&amp;zoneid=92&amp;source=Electronic Components|Semiconductors and Integrated Circuits|Amplifier and Linear ICs|Operational Amplifiers&amp;loc=http%3A%2F%2Fwww.baidu.com%2F&amp;cb=f699207d31\' width=\'0\' height=\'0\' alt=\'\' style=\'width: 0px; height: 0px;\' /><"+"/div><"+"script type=\"text/javascript\">\n";
OX_0ba4f39a += "var gaJsHost = ((\"https:\" == document.location.protocol) ? \"https://ssl.\" :\n";
OX_0ba4f39a += "\"http://www.\");\n";
OX_0ba4f39a += "document.write(unescape(\"%3Cscript src=\'\" + gaJsHost +\n";
OX_0ba4f39a += "\"google-analytics.com/ga.js\' type=\'text/javascript\'%3E%3C/script%3E\"));\n";
OX_0ba4f39a += "<"+"/script>\n";
OX_0ba4f39a += "<"+"script type=\"text/javascript\">\n";
OX_0ba4f39a += "var pageTracker;\n";
OX_0ba4f39a += "setTimeout(\'startGA();\', 500);\n";
OX_0ba4f39a += "function startGA()\n";
OX_0ba4f39a += "{\n";
OX_0ba4f39a += "pageTracker = _gat._getTracker(\"UA-81436-2\");\n";
OX_0ba4f39a += "pageTracker._initData();\n";
OX_0ba4f39a += "pageTracker._trackPageview();\n";
OX_0ba4f39a += "}\n";
OX_0ba4f39a += "<"+"/script>\n";
OX_0ba4f39a += "<"+"script type=\"text/javascript\">\n";
OX_0ba4f39a += "_qoptions={\n";
OX_0ba4f39a += "qacct:\"p-16me67-PXyTDw\"\n";
OX_0ba4f39a += "};\n";
OX_0ba4f39a += "<"+"/script>\n";
OX_0ba4f39a += "<"+"script type=\"text/javascript\"\n";
OX_0ba4f39a += "src=\"http://edge.quantserve.com/quant.js\"><"+"/script>\n";
OX_0ba4f39a += "<"+"noscript>\n";
OX_0ba4f39a += "<"+"a href=\"http://www.quantcast.com/p-16me67-PXyTDw\" target=\"_blank\"><"+"img\n";
OX_0ba4f39a += "src=\"http://pixel.quantserve.com/pixel/p-16me67-PXyTDw.gif\" style=\"display:\n";
OX_0ba4f39a += "none;\" border=\"0\" height=\"1\" width=\"1\" alt=\"Quantcast\"/><"+"/a>\n";
OX_0ba4f39a += "<"+"/noscript><"+"img src=\'http://www.supplyframe.com/trackingservlet/impression/?action=adImpression&amp;value1=Electronic Components|Semiconductors and Integrated Circuits|Amplifier and Linear ICs|Operational Amplifiers&amp;value2=7481&amp;value3=3&amp;zone=92\' width=\'0\' height=\'0\' alt=\'\' />\n";
document.write(OX_0ba4f39a);

wensrrr 发表于 2010-3-31 19:27:10

{:4_197:}来看看`学习下``

sandbord 发表于 2010-3-31 21:51:49

非常感谢都市乞丐和wuxiguacom两位朋友的指点,{:2_141:},一语点破,俺终于成功了……
页: [1]
查看完整版本: 从没见过的内容分页问题问题,请高手指点