这么简单的一个页面,竟然半天采集不到?
<html><head>
<title></title>
<meta http-equiv="Content-Type" content="text/html; charset=gb2312">
<link rel="stylesheet" href="style.css" type="text/css">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<table width="600" border="0" cellspacing="0" cellpadding="3" align="center" height="47">
<tr>
<td background="images/bg_top.gif">
<div align="right"><br>
<img src="images/print.gif" width="20" height="21" align="absmiddle">
打印本页面 </div>
</td>
</tr>
</table>
<br>
<table cellpadding="0" cellspacing="0" border="0" width="600" align="center">
<tr>
<td valign="top">
<table cellpadding="0" cellspacing="0" border="0" height="22">
<tr>
<td rowspan="3"height="22" valign="bottom" width="12"><img src="images/grey_spacer.gif" alt="" width="12" height="1"></td>
<td rowspan="3"width="1" height="22"><img src="images/grey_spacer.gif" alt="" height="22" width="1"></td>
<td valign="top" height="1"><img src="images/grey_spacer.gif" alt="" height="1" width="100%"></td>
<td rowspan="3"width="1" height="22"><img src="images/grey_spacer.gif" alt="" height="22" width="1"></td>
<td rowspan="3"valign="bottom" width="5" height="22"><img src="images/grey_spacer.gif" alt="" height="1" width="5" /></td>
<td rowspan="3"valign="bottom" width="100%" height="22"><img src="images/grey_spacer.gif" alt="" width="100%" height="1"></td>
</tr>
<tr>
<td height="20" nowrap align="center" class="tabSelected">企业基本信息</td>
</tr>
<tr>
<td height="1" bgcolor="#ffffff"></td>
</tr>
</table>
</td>
</tr>
</table>
<table cellpadding="10" cellspacing="0" border="0" width="600" align="center">
<tr>
<td style="border-left: 1px solid #cccccc; border-right: 1px solid #cccccc; border-bottom: 1px solid #cccccc">
<table width="100%" border="0" cellspacing="0" cellpadding="3">
<tr>
<td width="15%">执照注册号:</td>
<td width="34%"><font face="Verdana">5227322211316</font></td>
<td width="13%">地址/住所:</td>
<td width="38%">
三合镇中山路 </td>
</tr>
<tr>
<td width="15%">企业名称:</td>
<td width="34%">三都县海丰矿业开发有限公司</td>
<td width="13%">企业类别:</td>
<td width="38%">
</td>
</tr>
<tr>
<td width="15%">法定代表人:</td>
<td width="34%">庄辉海</td>
<td width="13%">企业状态:</td>
<td width="38%">
开业 </td>
</tr>
</table>
</td>
</tr>
</table>
<table width="600" border="0" cellspacing="0" cellpadding="0" align="center">
<tr>
<td>
<IFRAME name=info width=600 height=260 marginwidth="0" marginheight="0"frameborder="0" scrolling="0"
src="info1.php?nbxh=5227325000001476&zch=5227322211316" allowTransparency="true"></IFRAME>
</td>
</tr>
</table>
<br>
<table width="600" border="0" cellspacing="0" cellpadding="0" align="center">
<tr>
<td bgcolor="#666666" height=1></td>
</tr>
<tr>
<td bgcolor="#E7E7E7" height="30">
<table width="596" border="0" cellspacing="0" cellpadding="2" align="center">
<tr>
<td width="21%"><img src="gs.gif" width="30" height="36"></td>
<td width="79%">
<div align="right"><font face="Verdana, Arial, Helvetica, sans-serif">Copyright
2004, gzgs.gov.cn</font>版权所有</div>
</td>
</tr>
</table>
</td>
</tr>
<tr>
<td bgcolor="#999999" height=1></td>
</tr>
</table>
</body>
</html>
===============================
想采里面的企业名称,标签开始字符串:
<td width="15%">企业名称:</td>
<td width="34%">
结束字符串:
</td>
<td width="13%">企业类别:</td>
<td width="38%">
这样设置采集不到?为什么呀为什么? 标签开始字符串:<td width="15%">企业名称:</td>(*)<td width="34%">
结束字符串:</td>
注意删除多余了前空格或者后空格 承接各类网站建设.域名注册.虚拟主机.网站信息采集.网站推广 网站优化 网站维护 联系QQ:8823513 标签开始字符串:企业名称:(*)
结束字符串:
注意删除多余了前空格或者后空格
都市乞丐 发表于 2010-2-5 10:19 http://bbs.locoy.com/images/common/back.gif
恩,用你的方法可以了。非常感谢,春节快乐
页:
[1]