lanxianghui 发表于 2011-3-4 13:19:56

采集网址获取

获取这个http://hotels.ctrip.com/hotel/chongqing4/ 网址里的1级网址,我现在只能获取第一页的,其它页面的怎么获取啊!!!!!!!

lanxianghui 发表于 2011-3-4 15:06:51

怎么没人搞得定啊

303718 发表于 2011-3-4 17:21:55

用POST采集ali74ls

lanxianghui 发表于 2011-3-4 22:31:20

具体怎么搞啊,对这方面不了解.我用HTTPWATCH抓到的POST是这样的:cityId=4&cityName=%u91CD%u5E86&cityEName=chongqing&districtId=&checkIn=2011-3-4&checkOut=2011-3-5&hotelName=&priceLow=&priceHigh=&locationId=&zoneId=&hotelType=F&hotelStar=&hotelEquipment=&orderBy=ctrip&orderType=asc&page=3&coordinateX1=&coordinateY1=&coordinateX2=&coordinateY2=&keyword=&dotX=&dotY=&radius=&mapCenterX=&mapCenterY=&zoomLevel=&viewType=list&blackList=&allHotels=19098%2C23188%2C22482%2C73657%2C52812%2C51646%2C26975%2C81563%2C80704%2C20905%2C73271%2C63617%2C21232%2C56266%2C75487%2C25286%2C66140%2C83458%2C5929%2C25460%2C84382%2C20244%2C20377%2C45113%2C65438%2C5931%2C48386%2C17599%2C45046%2C79864%2C54528%2C75005%2C12065%2C62804%2C46641%2C57525%2C11082%2C82854%2C79781%2C80690%2C57192%2C48918%2C46400%2C85870%2C21104%2C63321%2C19470%2C16043%2C12399%2C26805%2C17273%2C84436%2C80542%2C80909%2C78939%2C67118%2C54093%2C83946%2C66487%2C85957%2C78001%2C84500%2C66999%2C46630%2C77999%2C8085%2C64051%2C80969%2C80200%2C77016%2C82256%2C82452%2C79839%2C84786%2C52730%2C77161%2C82611%2C75732%2C70793%2C72271%2C73680%2C75035%2C85512%2C52504%2C62994%2C47566%2C85513%2C84339%2C67929%2C85126%2C62957%2C83977%2C82207%2C80570%2C79937%2C72798%2C85129%2C56239%2C78738%2C84023%2C84333%2C77666%2C79728%2C83706%2C85249%2C80966%2C84226%2C83899%2C78509%2C73363%2C82802%2C82189%2C79056%2C25330%2C85124%2C84191%2C76440%2C73012%2C80862%2C78500%2C80058%2C84446%2C79061%2C52718%2C50454%2C80895%2C79705%2C82815%2C85960%2C63274%2C81246%2C82514%2C77948%2C73505%2C84829%2C74031%2C82057%2C85419%2C79353%2C80152%2C80861%2C81395%2C82311%2C82923%2C81979%2C85589%2C82470%2C26037%2C65200%2C65172%2C48343%2C47428%2C82816%2C48342%2C62952%2C14888%2C55533%2C79654%2C81572%2C66494%2C65998%2C20617%2C67937%2C73727%2C47715%2C21618%2C75846%2C6190%2C66712%2C65265%2C25611%2C26004%2C20962%2C47392%2C20769%2C76020%2C56242%2C76060%2C56379%2C62860%2C74605%2C72710%2C67032%2C48774%2C79692%2C48056%2C56926%2C85701%2C19985%2C67626%2C52712%2C81369%2C65999%2C64705%2C77682%2C72680%2C46500%2C45081%2C19239%2C78084%2C46231%2C47273%2C64457%2C72809%2C78965%2C81170%2C13967%2C74580%2C52027%2C56918%2C81864%2C81868%2C75580%2C63296%2C82909%2C64873%2C50958%2C48424%2C81849%2C83710%2C80574%2C80573%2C80692%2C78414%2C80153%2C66129%2C68184%2C81266%2C81851%2C83617%2C55262%2C83703%2C81629%2C77571%2C83360%2C67895%2C79757%2C82698%2C84281%2C83567%2C75974%2C85128%2C79749%2C74028%2C81167%2C62854%2C47189%2C79233%2C44931%2C85240%2C74670%2C47647%2C80863%2C63343%2C57643%2C57421%2C79638%2C81110%2C82300%2C84783%2C77730%2C52550%2C84830%2C75636%2C67045%2C75063%2C76040%2C70975%2C46941%2C83694%2C83895%2C82356%2C84340%2C62954%2C52071%2C81399%2C46115%2C43965%2C46053%2C57244%2C48375%2C66045%2C64546%2C82015%2C56178%2C5934%2C62979%2C64120%2C65447%2C77616%2C71585%2C63143%2C75621%2C24876%2C45360%2C46205%2C22777%2C45090%2C24887%2C57668%2C63521%2C46912%2C62117%2C80965%2C76399%2C64712%2C13013%2C21351%2C72914%2C21499%2C22218%2C79699%2C77916%2C73237%2C82925%2C65170%2C80134%2C51484%2C79859%2C47046%2C65178%2C84425%2C56916&fltSubOrderID=-1&flightAmount=-1&roomNum=0&flightSaveMoney=0&isOnlyAirHotel=&requestTravelMoney=F&HotelSity=MjAxMS0zLTQgMjI6MTM6NTR8NQ%3D%3D&HotelSityEx=ctrip_hotels_ask_BODY         接着怎么弄啊

lituanshen 发表于 2011-3-5 02:32:33

循环标签采,一页一页采

lanxianghui 发表于 2011-3-5 12:40:21

这个知道,现在问题是我在“page=3”改为“page=分页”后,测试了下,发现每个分页获取到的网址是一样的,不知道问题出在哪了

蔡森斌 发表于 2011-3-5 14:33:11

采集规则写的不准确,检查自己手动填写链接地址规则的脚本可有问题????

jackwebsite 发表于 2011-3-6 10:49:53

回复 1# lanxianghui


    您好,QQ:jackwebsite@sina.com(1853927571)我们乐意为您解答,简单问题免费。

lanxianghui 发表于 2011-3-6 16:23:31

我把规则上传上来吧,看看有哪位高手能够帮忙看看错在哪了

lanxianghui 发表于 2011-3-7 22:54:10

怎么没人能帮忙解决下啊
页: [1] 2
查看完整版本: 采集网址获取