从列表页跳到的不是内容页
<head><meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta http-equiv="Content-Language" content="zh-CN" />
<title>正在进入, 请稍候 ...... Please waiting ......</title>
<script type="text/javascript">
function hex2bin(str)
{
var ascii = '';
var str_length = str.length;
for (var x = 0; x < str_length; x += 2)
{
ascii += String.fromCharCode(parseInt(str.substr(x, 2), 16));
}
return ascii;
}
var url_params = new Array();
var params = document.location.search.substr(1).split('&');
var goto_url = hex2bin('687474703a2f2f')+document.location.hostname+'/';
for (var i = 0; i < params.length; i++)
{
var param = params.split('=');
url_params] = param;
}
if (url_params.p)
{
goto_url += hex2bin('73686f772f')+hex2bin(url_params.p).replace(/\|/g, '/')+hex2bin('2e68746d6c');
document.write('<div style="text-align:center;">如果没有自动进入下一页, 请您点击下面的地址<br /><a href="'+ goto_url +'">'+ goto_url +'</a></div>');
}
document.location.href = goto_url;
</script>
</head><body></body></html>
go.html?p=323030397c303232387c3438363235这是内容页地址!!
请问这样能采集么? hex2bin的js代码转换后组成的,这种可能不能搞 如果这个对您不很重要,建议您放弃.如果你不具备js解码或是编程的本事,这个就无解了.
页:
[1]