BUG:采集英文文章,会自动去掉词之间的空格.
采集英文文章,会自动去掉词之间的空格.[ 本帖最后由 liao365 于 2006-2-22 15:30 编辑 ] 应该是你规则没设置好,我刚才采了个英文网,一切正常,没有出现你说的现象。
帮帮我!!!!
试了多次,你能再帮下我吗?是不是哪里过滤掉"空格",连标题内的空格都没有了,入库的是风讯文章系统
采集的地址:http://www.bioon.com/biology/international/78185.shtml
原代码是
<P>
<P><FONT face=Arial><FONT size=1>Human genetic diseases that resemble accelerated aging provide<SUP> </SUP>useful models for gerontologists. They combine known single-gene<SUP> </SUP>mutations with deficits in selected tissues that are reminiscent<SUP> </SUP>of changes seen during normal aging. Here, we describe recent<SUP> </SUP>progress toward linking molecular and cellular changes with<SUP> </SUP>the phenotype seen in two of these disorders. One in particular,<SUP> </SUP>Werner syndrome, provides evidence to support the hypothesis<SUP> </SUP>that the senescence of somatic cells may be a causal agent of<SUP> </SUP>normal aging.<SUP> </SUP></FONT></FONT>
采集入库后的代码:
<P>Humangeneticdiseasesthatresembleacceleratedagingprovide<SUP></SUP>usefulmodelsforgerontologists.Theycombineknownsingle-gene<SUP></SUP>mutationswithdeficitsinselectedtissuesthatarereminiscent<SUP></SUP>ofchangesseenduringnormalaging.Here,wedescriberecent<SUP></SUP>progresstowardlinkingmolecularandcellularchangeswith<SUP></SUP>thephenotypeseenintwoofthesedisorders.Oneinparticular,<SUP></SUP>Wernersyndrome,providesevidencetosupportthehypothesis<SUP></SUP>thatthesenescenceofsomaticcellsmaybeacausalagentof<SUP></SUP>normalaging.<SUP></SUP></FONT></FONT> 采集的测试页,没发现异常,但入库后,就发现这样了, 我刚才试采了你这个站,发英文帖正常
[ 本帖最后由 netdream 于 2006-2-20 22:17 编辑 ] netdream MM,把你的规则和网址私下发点给我吧,行咩?
呵呵~~~! 呵呵,我做的规则和网址集,一般很少有人喜欢的,因为我做的是我想要的,汗........... 发一点来看看嘛 奇怪你的钱怎么增加这么快呀?是不是。。。。。。·#¥¥¥¥%¥% 原帖由 netdream 于 2006-2-20 22:37 发表
呵呵,我做的规则和网址集,一般很少有人喜欢的,因为我做的是我想要的,汗...........
我有好多的,可以交换呀,我现在没多少时间去做采集规则