eyht 发表于 2008-10-22 14:53:01

这种结构的页面如何采集?

<a class="mypager" title="转到第3页" href="javascript:__doPostBack('right_1$AspNetPager1','3')"></a><span style="width:1px;"></span><a class="mypager" title="转到第4页" href="javascript:__doPostBack('right_1$AspNetPager1','4')"></a><span style="width:1px;"></span><a class="mypager" title="转到第5页" href="javascript:__doPostBack('right_1$AspNetPager1','5')"></a><span style="width:1px;"></span><a class="mypager" title="转到第6页" href="javascript:__doPostBack('right_1$AspNetPager1','6')"></a>

需要采集网址的页面是这种结构:
href="javascript:__doPostBack('right_1$AspNetPager1','3')"

怎么采集所有页面的网址呢?

冲锋火车头 发表于 2008-10-22 14:56:11

js的呀 不会。。。

沦陷今生 发表于 2008-10-22 15:10:59

可以采集,获取POST数据

eyht 发表于 2008-10-22 16:05:16

抓取到是这样的代码:
__EVENTTARGET=right_1%24AspNetPager1&__EVENTARGUMENT=4&__VIEWSTATE...................%24chengname=&Left_10%24chengmi=&right_1%24AspNetPager1_input=3
有两个变量咋办?
如红色所示,前者是当前的页码,后者是前页的页码
都用[分页]不行呀,采集到的都是首页的网址

[ 本帖最后由 eyht 于 2008-10-22 16:57 编辑 ]

chenfy 发表于 2008-10-22 16:57:10

找到分页的地方就能采集。。

eyht 发表于 2008-10-22 16:58:29

找到了,如5楼所示,有两个页面地址,如何处理呢?

[ 本帖最后由 eyht 于 2008-10-22 17:28 编辑 ]

eyht 发表于 2008-10-22 21:34:27

:Q :Q :Q

eyht 发表于 2008-10-23 08:49:56

js这样的如何办?
post抓取到是这样的代码:
__EVENTTARGET=right_1%24AspNetPager1&__EVENTARGUMENT=4&__VIEWSTATE...................%24chengname=&Left_10%24chengmi=&right_1%24AspNetPager1_input=3
有两个变量咋办?
如红色所示,前者是当前的页码,后者是前页的页码
都用[分页]不行呀,采集到的都是首页的网址

chenfy 发表于 2008-10-23 09:24:45

用当前的就可以了。不用其它页面的

eyht 发表于 2008-10-24 13:28:01

好象还是不行
全部代码如下,帮我看看该如何post吧,谢谢
__EVENTTARGET=right_1%24AspNetPager1&__EVENTARGUMENT=3&__VIEWSTATE=%2FwEPDwUKMTMyNDg5OTE1OQ9kFgICAQ9kFgICAQ9kFioCBg8WAh4HVmlzaWJsZWhkAgcPFgIfAGhkAggPFgIfAGhkAgoPFgIfAGhkAgwPFgIfAGhkAg0PFgIfAGhkAg4PFgIfAGhkAg8PFgIfAGhkAhAPFgIfAGhkAhEPZBYCZg9kFgJmD2QWAmYPDxYCHgRUZXh0BQYzMzM4MzRkZAISDxYCHwBoZAITDxYCHwBoZAIUDxYCHwBoZAIWDxYCHwBoZAIXDxYCHwBoZAIYDxYCHwBoZAIZDxYCHwBoZAIaDxYCHwBoZAIbDxYCHwBoZAIcDxYCHwBoFgJmD2QWAmYPZBYCZg8PFgIfAQUGMzMzODM0ZGQCHQ9kFgQCAQ88KwALAQAPFgoeDERhdGFLZXlGaWVsZAUCaWQeCERhdGFLZXlzFggC4ccCAoDsAQK80wECuNMBArfTAQKYwAECy74BAsO9AR4LXyFJdGVtQ291bnQCCB4JUGFnZUNvdW50AgEeFV8hRGF0YVNvdXJjZUl0ZW1Db3VudAIIZBYCZg9kFhACAQ9kFgJmD2QWBmYPFQUFNDE5NTMyU3R1ZGllcyBpbiBTdXJmYWNlIFNjaWVuY2UgYW5kIENhdGFseXNpc%2BaWsOWinjXljbcCODEBMNIB44CQU1NTQy0xNDjjgJFNZXNvcG9yb3VzIENyeXN0YWxzIGFuZCBSZWxhdGVkIE5hbm8tc3RydWN0dXJlZCBNYXRlcmlhbHMgaHR0cDovL3d3dy5jaGVtai5jbi92aWV3dGhyZWFkLnBocD90aWQ9NDAwNyZleHRyYT1wYWdlJTNEMSAg44CQU1NTQy0xNDTjgJFDaGFyYWN0ZXJpemF0aW9uIG9mIFBvcm91cyBTb2xpZHMgVkkgaHR0cDovL3d3dy5jaGVtai5jbi92aWV3Li4uZAIBDxUBBTQxOTUzZAICDxUBBTQxOTUzZAICD2QWAmYPZBYGZg8VBQUzMDIwOCMyMDA2LTIwMDflgqzljJbnsbvmnJ%2FliIpJRuWAvOWvueavlAM0MTcBMZICQURWIENBVEFMIOeUsTIwMDblubTnmoQxMS4yNeS4i%2BmZjeiHszIwMDflubTnmoQ3LjY2N%2B%2B8jC0zLjU4MyAgQ0FUQUwgUkVWIOeUsTIwMDblubTnmoQ5LjIyMuS4i%2BmZjeiHszIwMDflubTnmoQ2LjMzM%2B%2B8jC0yLjg4OSAgQURWIFNZTlRIIENBVEFMIOeUsTIwMDblubTnmoQ0Ljc2MuWinuWKoOiHszIwMDflubTnmoQ0Ljk3N%2B%2B8jCArMC4yMTUgIEogQ0FUQUwg55SxMjAwNuW5tOeahDQuNTMz5aKe5Yqg6IezMjAwN%2BW5tOeahDQuNzM377yMICswLjIwNCAgQVBQTCBDQVRBTCBCLS4uLmQCAQ8VAQUzMDIwOGQCAg8VAQUzMDIwOGQCAw9kFgJmD2QWBmYPFQUFMjcwNjg2ODYz6K6h5YiS5paw5p2Q5paZ5oqA5pyv6aKG5Z%2Bf6K%2B%2B6aKY5YKs5YyW55u45YWz6YOo5YiGAzQ5MwEx1AM4NjPorqHliJLmlrDmnZDmlpnmioDmnK%2Fpoobln58yMDA45bm05bqm5LiT6aKY6K%2B%2B6aKY55Sz6K%2B35oyH5Y2XJmxkcXVvOyDljYHkuIDkupQmcmRxdW875pyf6Ze077yM5L6d5o2u44CK5Zu95a625Lit6ZW%2F5pyf56eR5a2m5ZKM5oqA5pyv5Y%2BR5bGV6KeE5YiS57qy6KaB77yIMjAwNi0yMDIw77yJ44CL44CB44CK5Zu95a62JmxkcXVvO%2BWNgeS4gOS6lCZyZHF1bzvnp5HlrabmioDmnK%2Flj5HlsZXop4TliJLjgIvlkozjgIo4NjPorqHliJImbGRxdW875Y2B5LiA5LqUJnJkcXVvO%2BWPkeWxlee6siDopoHjgIvvvIw4NjPorqHliJLmlrDmnZDmlpnmioDmnK%2Fpoobln5%2Flm7Tnu5XotYTmupDjgIHog73mupDjgIHliLbpgKDkuJrjgIHkv6Hmga%2FjgIHnjq%2FlooPjgIHkurrlj6PkuI7lgaXlurfnrYnlm73msJHnu4%2FmtY7lkoznpL7kvJrlj5HlsZXnmoTlhbPplK7mioDmnK%2Fpoobln5%2Flr7nmlrDmnZDmlpnnmoTph43lpKfpnIAuLi5kAgEPFQEFMjcwNjhkAgIPFQEFMjcwNjhkAgQPZBYCZg9kFgZmDxUFBTI3MDY0MENoYW5naW5nIHRoZSBmYWNlIG9mIGEgd2F0ZXIgc3BsaXR0aW5nIGNhdGFseXN0IAMzMDIBMMoBQ2hhbmdpbmcgdGhlIGZhY2Ugb2YgYSB3YXRlciBzcGxpdHRpbmcgY2F0YWx5c3RBdXN0cmFsaWFuIGNoZW1pc3RzIGhhdmUgZ3Jvd24gY3J5c3RhbHMgb2YgdGhlIHdhdGVyLXNwbGl0dGluZyBjYXRhbHlzdCB0aXRhbml1bSBkaW94aWRlIHRoYXQgYXJlIG1hbnkgdGltZXMgbW9yZSByZWFjdGl2ZSB0aGFuIHVzdWFsLiAgQW5hdGFzZSxvbmUgb2YgdC4uLmQCAQ8VAQUyNzA2NGQCAg8VAQUyNzA2NGQCBQ9kFgJmD2QWBmYPFQUFMjcwNjNI6KGo6Z2i5byC6LSo57uT5YWJ5YKs5YyW5YmC5Yi25aSH5Y%2BK5YW25YWJ5YKs5YyW5Yi25rCi56CU56m25Y%2BW5b6X6L%2Bb5bGVAzMwNQEwogPooajpnaLlvILotKjnu5PlhYnlgqzljJbliYLliLblpIflj4rlhbblhYnlgqzljJbliLbmsKLnoJTnqbblj5blvpfov5vlsZXlpKfov57ljJbnianmiYDmnY7ngb%2Flm6LpmJ%2Fnu6flnKjlhYnlgqzljJbliYLooajpnaLlvILnm7jnu5Plj4og5YW25YWJ5YKs5YyW5oCn6IO955qE56CU56m25bel5L2cKEFuZ2V3LiBDaGVtLiBJbnQuIEVkLjIwMDjvvIxET0k6IDAuMTAwMi9hbmllLjIwMDcwNDc4OCnlj5blvpfph43opoHov5vlsZXnmoTln7rnoYDkuIrvvIzmnIDov5Hlj4jlnKjooajpnaLlvILotKjnu5PlhYnlgqzljJbliYLliLblpIflj4rlhbblhYnlgqzljJbliLbmsKLnoJTnqbbkuK3lj5blvpfph43opoHov5vlsZXvvIznoJTnqbbnu5Pmnpzlj5HooajlnKggSi4gQW0uIENoZW0uIFNvYy4oMjAwOO%2B8jERPSTogMTAuMTAyMS9qLi4uZAIBDxUBBTI3MDYzZAICDxUBBTI3MDYzZAIGD2QWAmYPZBYGZg8VBQUyNDYwMCpDYXRhbHlzdCBtaW1pY3MgbmF0dXJlJ3MgbWV0aGFuZSBveGlkYXRpb24DMzc1ATDKAUNhdGFseXN0IG1pbWljcyBuYXR1cmUncyBtZXRoYW5lIG94aWRhdGlvblNjaWVudGlzdHMgaW4gRnJhbmNlIGhhdmUgZGV2ZWxvcGVkIHRoZSBmaXJzdCBtaWxkLCBlbnp5bWUtaW5zcGlyZWQgbWV0aG9kIHRvIGNvbnZlcnQgbWV0aGFuZSB0byBpbmR1c3RyaWFsbHkgdmFsdWFibGUgcHJvZHVjdHMuQWxleGFuZGVyU29yb2tpbiBhbmQgY29sbGVhZ3UuLi5kAgEPFQEFMjQ2MDBkAgIPFQEFMjQ2MDBkAgcPZBYCZg9kFgZmDxUFBTI0Mzk1M01vcmUgRWZmaWNpZW50IEZ1ZWwgQ2VsbHMsIFRoYW5rcyBUbyBBIE5ldyBDYXRhbHlzdAMzODMBMMoBTW9yZSBFZmZpY2llbnQgRnVlbCBDZWxscywgVGhhbmtzIFRvIEEgTmV3IENhdGFseXN0TWV0aGFub2wgZnVlbCBjZWxscyBhcmUgYW4gZWZmaWNpZW50IGFuZCBzdXN0YWluYWJsZSBhbHRlcm5hdGl2ZSB0b2Zvc3NpbCBmdWVscywgYnV0IHRoZXkgYXJlIHN0aWxsIG5vdCBlY29ub21pY2FsbHkgdmlhYmxlLiBOZXZlcnRoZWxlc3MsZm9yIGhpcyBQaC4uLmQCAQ8VAQUyNDM5NWQCAg8VAQUyNDM5NWQCCA9kFgJmD2QWBmYPFQUFMjQyNTkz5YWw5bee5YyW54mp5omA5LiJ5oiQ5p6c6I6355SY6IKD55yB56eR5a2m5oqA5pyv5aWWAzI4OQEwrgLlhbDlt57ljJbnianmiYDkuInmiJDmnpzojrfnlJjogoPnnIHnp5HlrabmioDmnK%2FlpZYgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAo5p%2Bl55yL5Y6f5paHKS8vDQogICAgICAgICAgICAg5YWw5bee5YyW5a2m54mp55CG56CU56m25omAICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICDlnKgg6L%2BR5pel5Li%2B6KGM55qEMjAwN%2BW5tOW6pueUmOiCg%2BecgeenkeWtpuaKgOacr%2BWlluWKseWkp%2BS8muS4ii4uLmQCAQ8VAQUyNDI1OWQCAg8VAQUyNDI1OWQCAw8PFgQeC1JlY29yZGNvdW50AiweDkN1c3RvbUluZm9UZXh0BVXmgLvmlbDvvJo8Yj40NDwvYj4g5oC76aG15pWw77yaPGI%2BNjwvYj4g5b2T5YmN6aG177yaPGZvbnQgY29sb3I9InJlZCI%2BPGI%2BMTwvYj48L2ZvbnQ%2BZGRkaL%2F3bmQh59FToXFXu79GLCxGl%2Bw%3D&Left_10%24chengname=&Left_10%24chengmi=&right_1%24AspNetPager1_input=2
页: [1] 2
查看完整版本: 这种结构的页面如何采集?