页面没有静态化怎么采集
比如就像这个页面 页面没有静态化http://www.cheminfo.gov.cn/zxzx/page_info.aspx?id=319998&Tname=gnschq&c=30文章的内容部分是用个实现的,根本没有文章主体内容信息
<script>
var divIdTmp = "#_divXX";
$.ajax({
type: "POST",
url: chemInfo.searchPostUrl,
data: "swichParam=xx&id=319998&Tname=gnschq",
success: function(msg){
msg = msg.replace(new RegExp("<","gi"),'<')
.replace(new RegExp(">","gi"),'>')
.replace(new RegExp("&nbsp;","gi")," ")
.replace(new RegExp("&trade;","gi"),escape("?"));
//.substring(0,200)+'...';
//.replace(new RegExp("&nbsp;","gi")," ");//.substring(0,200)+'...';
$(divIdTmp).html(msg);
},
beforeSend:function(){
$(divIdTmp).html(chemInfo.ImgLoading);
},
error: function(msg)
{
$(divIdTmp).html("提取数据错误!");
//alert(msg);
}
});
</script> POST AJAX,不可以直接采集的,只能通过接口实现 高难度的了
还没到那个境界
页:
[1]