maxec 发表于 2012-6-21 18:12:55

已解决--表格内容 分标签 循环采集的问题

本帖最后由 maxec 于 2012-6-28 11:33 编辑

表格很规范,但每一个单元格没有有区别的起始标志,使用循环采集试了,结果不正确。

希望实现表格内容的循环采集,每一行是一条数据,标签就是表头。(即和表格是一样的)

典型网页URL:
http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/products/index.html

如何实现?请各位大侠指点?
十分感谢!

表格截图


——————
备注:为了描述更准确,把标题修改了一下
——————
感谢zhudi :问题已解决。

关键思路,使用前后截取,各个标签独立,灵活使用(*)参数

比如设置第一个单元格标签采集规则
开始字符串
   <tr><td><a href="http://(*).analog.com(*)target="_blank">
结束字符串
   </a></td>

第二个单元格标签开始字符串

<tr><td><a href="http://(*).analog.com(*)target="_blank">(*)</a></td><td>

第三个开始字符串

<tr><td><a href="http://(*).analog.com(*)target="_blank">(*)</a></td><td>(*)</td><td>

以此类推即可

附上任务可供遇上同类问题的“火车零件”们参考

maxec 发表于 2012-6-21 18:14:19

本帖最后由 maxec 于 2012-6-21 18:14 编辑

放假了,也要等答案{:soso_e118:} 自己顶!

maxec 发表于 2012-6-25 10:39:06

放假回来,继续等

zhudi 发表于 2012-6-25 12:05:59

可以的,楼主再尝试尝试。。

leejunji 发表于 2012-6-25 12:25:31

】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad8476&amp;locale=zh" target="_blank">AD8476</a></td><td>&nbsp;
</td><td>
-
</td><td>5MHz</td><td>1</td><td>300&micro;A</td><td>10V/&micro;s</td><td>-120dBc</td><td>-130dBc</td><td>10KHz</td><td>Yes</td><td>8-Lead MSOP</td><td>
$<sort_value>1.99</sort_value></td>
███████████████第2条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4940-1/products/product.html" target="_blank">ADA4940-1</a></td><td>&nbsp;
</td><td>
-
</td><td>260MHz</td><td>1</td><td>1.18mA</td><td>95V/&micro;s</td><td>-123dBc</td><td>-126dBc</td><td>50KHz</td><td>Yes</td><td>16-Lead LFCSP</td><td>
$<sort_value>1.59</sort_value></td>
███████████████第3条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ada4940-2&amp;locale=zh" target="_blank">ADA4940-2</a></td><td>&nbsp;
</td><td>
-
</td><td>260MHz</td><td>1</td><td>1.25mA</td><td>95V/&micro;s</td><td>-123dBc</td><td>-126dBc</td><td>50KHz</td><td>Yes</td><td>24-Lead LFCSP</td><td>
$<sort_value>2.59</sort_value></td>
███████████████第4条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=adl5565&amp;locale=zh" target="_blank">ADL5565</a></td><td>&nbsp;
</td><td>
-
</td><td>6GHz</td><td>2</td><td>80mA</td><td>11KV/&micro;s</td><td>-108dBc</td><td>-109dBc</td><td>100MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>3.68</sort_value></td>
███████████████第5条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad8475&amp;locale=zh" target="_blank">AD8475</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad8475&amp;locale=zh#prod_cross_sell" target="_blank">
YES
</a></td><td>150MHz</td><td>0.4</td><td>3mA</td><td>50V/&micro;s</td><td>-110dBc</td><td>-108dBc</td><td>100KHz</td><td>Yes</td><td>10-Lead uSOIC, 16-Lead LFCSP</td><td>
$<sort_value>1.99</sort_value></td>
███████████████第6条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4930-1/products/product.html" target="_blank">ADA4930-1</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>1.35GHz</td><td>1</td><td>34mA</td><td>3.4KV/&micro;s</td><td>-104dBc</td><td>-101dBc</td><td>10MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>3.79</sort_value></td>
███████████████第7条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4930-2/products/product.html" target="_blank">ADA4930-2</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>1.35GHz</td><td>1</td><td>34mA</td><td>-</td><td>-104dBc</td><td>-101dBc</td><td>10MHz</td><td>-</td><td>24-Lead LFCSP</td><td>
$<sort_value>6.29</sort_value></td>
███████████████第8条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4960-1/products/product.html" target="_blank">ADA4960-1</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>5GHz</td><td>2</td><td>60mA</td><td>8.7KV/&micro;s</td><td>-73dBc</td><td>-72dBc</td><td>1GHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>6.95</sort_value></td>
███████████████第9条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4950-1/products/product.html" target="_blank">ADA4950-1</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>750MHz</td><td>1</td><td>9.5mA</td><td>2.9KV/&micro;s</td><td>-98dBc</td><td>-99dBc</td><td>20MHz</td><td>-</td><td>16-Lead LFCSP, 24-Lead LFCSP</td><td>
$<sort_value>2.99</sort_value></td>
███████████████第10条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4950-2/products/product.html" target="_blank">ADA4950-2</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>750MHz</td><td>1</td><td>9.5mA</td><td>2.9KV/&micro;s</td><td>-98dBc</td><td>-99dBc</td><td>20MHz</td><td>-</td><td>16-Lead LFCSP, 24-Lead LFCSP</td><td>
$<sort_value>5.29</sort_value></td>
███████████████第11条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=adl5561&amp;locale=zh" target="_blank">ADL5561</a></td><td>&nbsp;
</td><td>
-
</td><td>2.9GHz</td><td>2</td><td>40mA</td><td>9.8KV/&micro;s</td><td>-96dBc</td><td>-87dBc</td><td>70MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>3.68</sort_value></td>
███████████████第12条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=adl5562&amp;locale=zh" target="_blank">ADL5562</a></td><td>&nbsp;
</td><td><a href="http://search.analog.com/search/ProductSearch.aspx?query=adl5562&amp;locale=zh#prod_cross_sell" target="_blank">
YES
</a></td><td>3.3GHz</td><td>2</td><td>80mA</td><td>9.8KV/&micro;s</td><td>-104dBc</td><td>-87dBc</td><td>140MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>3.68</sort_value></td>
███████████████第13条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad8275&amp;locale=zh" target="_blank">AD8275</a></td><td>&nbsp;
</td><td>
-
</td><td>15MHz</td><td>0.2</td><td>1.9mA</td><td>25V/&micro;s</td><td>-100dBc</td><td>-100dBc</td><td>100KHz</td><td>Yes</td><td>8-Lead MSOP</td><td>
$<sort_value>1.6</sort_value></td>
███████████████第14条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4927-1/products/product.html" target="_blank">ADA4927-1</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>2.3GHz</td><td>1</td><td>22.1mA</td><td>5KV/&micro;s</td><td>-87dBc</td><td>-89dBc</td><td>100MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>3.79</sort_value></td>
███████████████第15条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4927-2/products/product.html" target="_blank">ADA4927-2</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>2.3GHz</td><td>1</td><td>22.1mA</td><td>5KV/&micro;s</td><td>-87dBc</td><td>-89dBc</td><td>100MHz</td><td>-</td><td>24-Lead LFCSP</td><td>
$<sort_value>6.29</sort_value></td>
███████████████第16条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4932-1/products/product.html" target="_blank">ADA4932-1</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4932-1/products/product.html#prod_cross_sell" target="_blank">
YES
</a></td><td>560MHz</td><td>1</td><td>9.6mA</td><td>410V/&micro;s</td><td>-104dBc</td><td>-120dBc</td><td>10MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>2.95</sort_value></td>
███████████████第17条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4932-2/products/product.html" target="_blank">ADA4932-2</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>560MHz</td><td>1</td><td>9.6mA</td><td>410V/&micro;s</td><td>-100dBc</td><td>-120dBc</td><td>10MHz</td><td>-</td><td>24-Lead LFCSP</td><td>
$<sort_value>5.29</sort_value></td>
███████████████第18条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4939-1/products/product.html" target="_blank">ADA4939-1</a></td><td>&nbsp;
</td><td>
-
</td><td>1.4GHz</td><td>2</td><td>37.7mA</td><td>6.8KV/&micro;s</td><td>-83dBc</td><td>-97dBc</td><td>70MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>3.79</sort_value></td>
███████████████第19条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4939-2/products/product.html" target="_blank">ADA4939-2</a></td><td>&nbsp;
</td><td>
-
</td><td>1.4GHz</td><td>2</td><td>37.7mA</td><td>6.8KV/&micro;s</td><td>-77dBc</td><td>-91dBc</td><td>70MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>5.69</sort_value></td>
███████████████第20条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4937-2/products/product.html" target="_blank">ADA4937-2</a></td><td>&nbsp;
</td><td>
-
</td><td>1.9GHz</td><td>1</td><td>39.5mA</td><td>6KV/&micro;s</td><td>-77dBc</td><td>-84dBc</td><td>100MHz</td><td>-</td><td>16-Lead LFCSP, 24-Lead LFCSP</td><td>
$<sort_value>5.69</sort_value></td>
███████████████第21条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4938-1/products/product.html" target="_blank">ADA4938-1</a></td><td>&nbsp;
</td><td>
-
</td><td>1GHz</td><td>1</td><td>37mA</td><td>4.7KV/&micro;s</td><td>-82dBc</td><td>-82dBc</td><td>-</td><td>-</td><td>16-Lead LFCSP, 24-Lead LFCSP</td><td>
$<sort_value>3.79</sort_value></td>
███████████████第22条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4938-2/products/product.html" target="_blank">ADA4938-2</a></td><td>&nbsp;
</td><td>
-
</td><td>1GHz</td><td>1</td><td>37mA</td><td>4.7KV/&micro;s</td><td>-82dBc</td><td>-82dBc</td><td>50MHz</td><td>-</td><td>16-Lead LFCSP, 24-Lead LFCSP</td><td>
$<sort_value>5.69</sort_value></td>
███████████████第23条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4937-1/products/product.html" target="_blank">ADA4937-1</a></td><td>&nbsp;
</td><td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4937-1/products/product.html#prod_cross_sell" target="_blank">
YES
</a></td><td>1.9GHz</td><td>1</td><td>39.5mA</td><td>6KV/&micro;s</td><td>-77dBc</td><td>-84dBc</td><td>100MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>3.79</sort_value></td>
███████████████第24条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4941-1/products/product.html" target="_blank">ADA4941-1</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4941-1/products/product.html#prod_cross_sell" target="_blank">
YES
</a></td><td>30MHz</td><td>2</td><td>2.3mA</td><td>22V/&micro;s</td><td>-112dBc</td><td>-110dBc</td><td>100KHz</td><td>Yes</td><td>8-Lead LFCSP, 8-Lead SOIC</td><td>
$<sort_value>2.42</sort_value></td>
███████████████第25条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad8352&amp;locale=zh" target="_blank">AD8352</a></td><td>&nbsp;
</td><td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad8352&amp;locale=zh#prod_cross_sell" target="_blank">
YES
</a></td><td>2.2GHz</td><td>2</td><td>37mA</td><td>11KV/&micro;s</td><td>-81dBc</td><td>-87dBc</td><td>190MHz</td><td>-</td><td>16-Lead LFCSP</td><td>
$<sort_value>3.53</sort_value></td>
███████████████第26条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ada4922-1/products/product.html" target="_blank">ADA4922-1</a></td><td>&nbsp;
</td><td>
-
</td><td>38MHz</td><td>2</td><td>9.4mA</td><td>730V/&micro;s</td><td>116dBc</td><td>109dBc</td><td>10KHz</td><td>-</td><td>-</td><td>
$<sort_value>3.63</sort_value></td>
███████████████第27条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ad8137/products/product.html" target="_blank">AD8137</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>110MHz</td><td>1</td><td>3.6mA</td><td>450V/&micro;s</td><td>-100dBc</td><td>-105dBc</td><td>5MHz</td><td>-</td><td>8-Lead SOIC</td><td>
$<sort_value>1.1</sort_value></td>
███████████████第28条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ad8139/products/product.html" target="_blank">AD8139</a></td><td>&nbsp;
</td><td>
-
</td><td>410MHz</td><td>1</td><td>24.5mA</td><td>800V/&micro;s</td><td>-90dBc</td><td>-105dBc</td><td>5MHz</td><td>-</td><td>8-Lead CSP, 8-Lead SOIC</td><td>
$<sort_value>3.75</sort_value></td>
███████████████第29条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad8351&amp;locale=zh" target="_blank">AD8351</a></td><td>&nbsp;
</td><td>
-
</td><td>2.2GHz</td><td>1</td><td>28mA</td><td>13KV/&micro;s</td><td>-95dBc</td><td>-93dBc</td><td>70MHz</td><td>-</td><td>10-Lead uSOIC</td><td>
$<sort_value>2.68</sort_value></td>
███████████████第30条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ad8132/products/product.html" target="_blank">AD8132</a></td><td><span title="中文数据手册" class="checkMark">✓</span><br />&nbsp;
</td><td>
-
</td><td>350MHz</td><td>1</td><td>10.7mA</td><td>1.2KV/&micro;s</td><td>-100dBc</td><td>-99dBc</td><td>5MHz</td><td>-</td><td>8-Lead MSOP, 8-Lead SOIC</td><td>
$<sort_value>1.67</sort_value></td>
███████████████第31条记录████████████████
【内容】:<td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ad8138/products/product.html" target="_blank">AD8138</a></td><td>&nbsp;
</td><td><a href="http://www.analog.com/zh/high-speed-op-amps/high-speed-differential-amplifiers/ad8138/products/product.html#prod_cross_sell" target="_blank">
YES
</a></td><td>320MHz</td><td>1</td><td>20mA</td><td>1.5KV/&micro;s</td><td>-94dBc</td><td>-114dBc</td><td>5MHz</td><td>-</td><td>8-Lead CSP, 8-Lead SOIC</td><td>
$<sort_value>3.75</sort_value></td>
███████████████第32条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad8350&amp;locale=zh" target="_blank">AD8350</a></td><td>&nbsp;
</td><td>
-
</td><td>900MHz</td><td>5.6</td><td>28mA</td><td>2KV/&micro;s</td><td>-66dBc</td><td>-65dBc</td><td>50MHz</td><td>-</td><td>8-Lead SOIC/8-Lead uSOIC</td><td>
$<sort_value>2.52</sort_value></td>
███████████████第33条记录████████████████
【内容】:<td><a href="http://search.analog.com/search/ProductSearch.aspx?query=ad6630&amp;locale=zh" target="_blank">AD6630</a></td><td>&nbsp;
</td><td>
-
</td><td>700MHz</td><td>15.8</td><td>30mA</td><td>3.7KV/&micro;s</td><td>-</td><td>-</td><td>-</td><td>-</td><td>16-Lead SOIC</td><td>
$<sort_value>4.18</sort_value></td>
███████████████第34条记录████████████████

leejunji 发表于 2012-6-25 12:25:50

这样的循环可以吗?

leejunji 发表于 2012-6-25 12:26:13

复制太多了发太多了

maxec 发表于 2012-6-25 15:16:03

本帖最后由 maxec 于 2012-6-25 15:16 编辑

先谢谢 leejunji 的回复。
我想要的结果关键是:每一条记录里有多个标签把表格内容分开。

即:在每一条记录中按照表头有多个标签,比如:
【Part# 】:ADA4940-1
【PDF】: -
【Circuit】:        260MHz11.18mA95V/μs
【-3 dB BW        】:---
【Minimum Gain (Acl)】:6       
【Supply Current        】:
【Slew Rate (V/us)        】:--
【Distortion (2nd)        】:--
【Distortion (3rd)        】:--
【Test Frequency (MHz)        】:--

请leejunji 再试试,可不可以,多谢!

maxec 发表于 2012-6-25 15:20:18

zhudi 发表于 2012-6-25 12:05 static/image/common/back.gif
可以的,楼主再尝试尝试。。

没成功,请指教。

如果支持所谓的完整匹配,就好了。就是比如每10对 <td> < /td> 一个完整循环,这样就不会乱了。 火车好像不是这个模式。使用了循环匹配后,不同标签都会采集到相同的内容。

maxec 发表于 2012-6-25 17:44:09

这个问题在行业网站信息采集中很有代表性,所以不能沉下去哦。
希望高手指点。

页: [1] 2
查看完整版本: 已解决--表格内容 分标签 循环采集的问题