|
我用的是3.2 SP5版本,,为什么采集到的数据有大量分行符(<br>)
采集页代码:
-
- <div class="moviename">
- <span class="moviecname">叶一茜</span> </div>
- <span id="Middle_ac" class="left_btn"></span>
- </div>
- <div id="votestar" class="m_votestar"></div>
- <div class="m_info_list">
- <ul class="m_movie_list">
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,3,17749)" onMouseOver="sp_show(event,3)" onMouseOut="ClosePan()">出生日期:</span>
- <span class="m_c">1984年 <a href="/cmd/samebirthday.aspx?birthday=1984%E5%B9%B49%E6%9C%883%E6%97%A5" style="margin:-5px;">9月3日</a>
- <a href="/cmd/showconstellation.aspx?cons=%E5%A4%84%E5%A5%B3">
- <img src="/images/cons/cons6.gif" alt="处女座" border="0" class="cons"/></a>
- </span>
- </li>
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,4,17749)" onMouseOver="sp_show(event,4)" onMouseOut="ClosePan()">出生地点:</span>
- <span class="m_c">福建省南平市政和县</span>
- </li>
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,5,17749)" onMouseOver="sp_show(event,5)" onMouseOut="ClosePan()">地区:</span>
- <span class="m_c"><a href="/cmd/combschstar.aspx?g=&c=1&con=&s=1" target="_blank">中国大陆</a></span>
- </li>
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,6,17749)" onMouseOver="sp_show(event,6)" onMouseOut="ClosePan()">血型:</span>
- <span class="m_c">A 型</span>
- </li>
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,7,17749)" onMouseOver="sp_show(event,7)" onMouseOut="ClosePan()">身高:</span>
- <span class="m_c">170 厘米</span>
- </li>
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,8,17749)" onMouseOver="sp_show(event,8)" onMouseOut="ClosePan()">体重:</span>
- <span class="m_c">50 公斤</span>
- </li>
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,10,17749)" onMouseOver="sp_show(event,10)" onMouseOut="ClosePan()">婚姻状况:</span>
- <span class="m_c">已婚</span>
- </li>
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,11,17749)" onMouseOver="sp_show(event,11)" onMouseOut="ClosePan()">家庭成员:</span>
- <span class="m_c">丈夫:田亮</span>
- </li>
- <li>
- <span class="m_t pointer" onClick="sp_edit(event,9,17749)" onMouseOver="sp_show(event,9)" onMouseOut="ClosePan()">别名昵称:</span>
- <span class="m_c">叶茜(曾用名),</span>
- </li>
- <li>
- <span class="m_t">明星博客:</span>
- <span class="m_c">http://blog.sina.com.cn/yeyiqian</span>
- </li>
- <li>
- <span class="m_t">投票:</span>
- <span class="m_c" id="viewvote"></span>
- </li>
复制代码
采集到的是这样的...
【简单介绍】:
出生日期:
1984年 9月3日
出生地点:
福建省南平市政和县
地区:
中国大陆
血型:
A 型
身高:
170 厘米
体重:
50 公斤
婚姻状况:
已婚
家庭成员:
丈夫:田亮
别名昵称:
叶茜(曾用名),
明星博客:
http://blog.sina.com.cn/yeyiqian
我也把所有的HTML标签排除都选上了,都没有解决,,,.
谁有办法帮帮我?谢谢 |
|