akira2010 发表于 2015-11-24 23:26:22

采集行记录的奇怪问题

各位大侠:
      碰上以下比较奇怪的问题:想提取这个网站的机构信息
view-source:http://resource.stockstar.com/DataCenter/PrivateData/GetOrgHold.aspx?stockcode=000651&edate=2015-06-30&otype=18
列表里面有5条记录

序号截止日股票代码股票简称机构名称机构持股(万股)占流通A股比例相关功能
12015-06-30000651格力电器MORGAN STANLEY & CO. INTERNATIONAL PLC.2747.480.92%行情 资讯 股吧
22015-06-30000651格力电器YALE UNIVERSITY3158.181.06%行情 资讯 股吧
32015-06-30000651格力电器CITIGROUP GLOBAL MARKETS LIMITED2884.530.97%行情 资讯 股吧
42015-06-30000651格力电器UBSAG3502.961.17%行情 资讯 股吧
52015-06-30000651格力电器MERRILL LYNCH INTERNATIONAL3718.231.25%行情


源页面代码是:

                                <table cellspacing="1" cellpadding="0" width="100%" bgcolor="#e0e0e0" style="border-width:0px;">
        <tr class="td1">
                <td>序号</td><td>截止日</td><td>股票代码</td><td>股票简称</td><td>机构名称</td><td>机构持股(万股)</td><td>占流通A股比例</td><td>相关功能</td>
        </tr><tr class="td3">
                <td>1</td><td>2015-06-30</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">000651</a></td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">格力电器</a></td><td align="left">MORGAN STANLEY & CO. INTERNATIONAL PLC.</td><td align="right">2747.48</td><td align="right">0.92%</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">行情</a> <a target="_blank" href="http://news.stockstar.com/info/dstock.aspx?code=000651">资讯</a> <a target="_blank" href="http://bar.stockstar.com/redir1.asp?code=000651">股吧</a></td>
        </tr><tr class="td4">
                <td>2</td><td>2015-06-30</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">000651</a></td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">格力电器</a></td><td align="left">YALE UNIVERSITY</td><td align="right">3158.18</td><td align="right">1.06%</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">行情</a> <a target="_blank" href="http://news.stockstar.com/info/dstock.aspx?code=000651">资讯</a> <a target="_blank" href="http://bar.stockstar.com/redir1.asp?code=000651">股吧</a></td>
        </tr><tr class="td3">
                <td>3</td><td>2015-06-30</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">000651</a></td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">格力电器</a></td><td align="left">CITIGROUP GLOBAL MARKETS LIMITED</td><td align="right">2884.53</td><td align="right">0.97%</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">行情</a> <a target="_blank" href="http://news.stockstar.com/info/dstock.aspx?code=000651">资讯</a> <a target="_blank" href="http://bar.stockstar.com/redir1.asp?code=000651">股吧</a></td>
        </tr><tr class="td4">
                <td>4</td><td>2015-06-30</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">000651</a></td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">格力电器</a></td><td align="left">UBSAG</td><td align="right">3502.96</td><td align="right">1.17%</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">行情</a> <a target="_blank" href="http://news.stockstar.com/info/dstock.aspx?code=000651">资讯</a> <a target="_blank" href="http://bar.stockstar.com/redir1.asp?code=000651">股吧</a></td>
        </tr><tr class="td3">
                <td>5</td><td>2015-06-30</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">000651</a></td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">格力电器</a></td><td align="left">MERRILL LYNCH INTERNATIONAL</td><td align="right">3718.23</td><td align="right">1.25%</td><td><a target="_blank" href="http://stock.quote.stockstar.com/000651.shtml">行情</a> <a target="_blank" href="http://news.stockstar.com/info/dstock.aspx?code=000651">资讯</a> <a target="_blank" href="http://bar.stockstar.com/redir1.asp?code=000651">股吧</a></td>
        </tr>
</table>
                                        <div class="more_page"><span id="lblRowsCount">共5条</span>&nbsp;第<span id="lblCurrentPage">1</span>页/共<span id="lblPageCount">1</span>页&nbsp;<a href="javascript:pGo(1)">第一页</a>&nbsp;<a href="javascript:pGo(1)">上一页</a>&nbsp;<a href="javascript:pGo(1)">下一页</a>&nbsp;<a href="javascript:pGo(1)">最后一页</a>&nbsp;<input name="i" type="text" id="txtPages" onKeyPress="f()" style="width:20px;" />&nbsp;<input type="submit" name="btnGo" value="" id="btnGo" class="searchinput2" style="border-style:None;" onclick="javascript:pGo(document.getElementById('txtPages').value)" /></div>
                                </div>
                                <form id="fmAllPostAction" method="post">
                         <input type="hidden" id="ahidden_stockcode" name="hidden_stockcode" value="000651" />
                         <input type="hidden" id="ahidden_enddate" name="hidden_enddate" value="2015-06-30" />                        
                         <input type="hidden" id="ahidden_page" name="hidden_page" value="1" />





我使用标签截取 <td align="left">   结尾是</td> 在浏览器代码模式下可以截取,但是火车头里只能截取3条记录,有2条截取不到,例如第一个MORGEN的就不行,不知道啥原因?

imfly 发表于 2015-11-25 18:07:51

我试了下是可以的~点击循环采集~试下重启采集器~
如还有问题咨询下官方客服800019423~
页: [1]
查看完整版本: 采集行记录的奇怪问题