2.0测试报告(嘿嘿,测试到很晚哦)
今天试用了2.0的暂时发现以下这些问题:1、编码设定问题。由于我采集的一些网站采集链接的时候可以按默认编码,但是采集回来的网址中有些带有中文,这就需要设定utf的编码。按现在的情况如果设定了utf编码网址便采集不到,反过来采集不到内容。所以编码还是需要分开设定。
中文网址的问题之前我反映过,带中文的网址采集回来的内容中有很大的部分中文字符成了乱码或者根本采集不到内容。如果我把编码换成utf的就可以采集到内容(不过采集的内容还是不全或者有乱码,最终还是需要解决中文网址的问题),从目前的版本看中文网址的问题没解决。
2、循环问题,在2.0好像没有循环次数设定功能。如果在采集论坛的时候需要采集其中的2个回帖那就需要设定3次循环了,但目前2.0的好像没这个功能?
3、分页设定问题,这个2.0和1.21的都没有,不知道默认是采集多少分页?
4、在采集网址的时候不能设定“+”这个条件,比如有写网址中有*****+****我要把他排除掉,排除的条件是“网址中不含 + ”好像不可用。
5、采集链接没显示数目,在测试的时候都不知道采集回来的网址有多少只能手工计算。既然是测试就应该知道我采集回来的网址的数目是否于目标源相同,否则一定是规则不完善或者根本就是错误的。这个功能在1.2的有,怎么到了2.0就没了呢?
6、发布的时候总是出错提示如下
See the end of this message for details on invoking
just-in-time (JIT) debugging instead of this dialog box.
************** Exception Text **************
System.Data.OleDb.OleDbException: 未指定的错误
at System.Data.OleDb.OleDbConnection.ProcessResults(Int32 hr)
at System.Data.OleDb.OleDbConnection.InitializeProvider()
at System.Data.OleDb.OleDbConnection.Open()
at LocoySpiderV2.DB.OleDBOperator.Open()
at LocoySpiderV2.frmTabList.spiderAddressTimer_Tick(Object sender, EventArgs e)
at System.Windows.Forms.Timer.OnTick(EventArgs e)
at System.Windows.Forms.Timer.Callback(IntPtr hWnd, Int32 msg, IntPtr idEvent, IntPtr dwTime)
************** Loaded Assemblies **************
mscorlib
Assembly Version: 1.0.5000.0
Win32 Version: 1.1.4322.573
CodeBase: file:///c:/windows/microsoft.net/framework/v1.1.4322/mscorlib.dll
----------------------------------------
LocoySpiderV2
Assembly Version: 2.0.2362.37225
Win32 Version: 2.0.2362.37225
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/LocoySpiderV2.exe
----------------------------------------
System.Windows.Forms
Assembly Version: 1.0.5000.0
Win32 Version: 1.1.4322.573
CodeBase: file:///c:/windows/assembly/gac/system.windows.forms/1.0.5000.0__b77a5c561934e089/system.windows.forms.dll
----------------------------------------
System
Assembly Version: 1.0.5000.0
Win32 Version: 1.1.4322.573
CodeBase: file:///c:/windows/assembly/gac/system/1.0.5000.0__b77a5c561934e089/system.dll
----------------------------------------
DevExpress.Utils3
Assembly Version: 3.2.1.0
Win32 Version: 3.2.1.0
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/DevExpress.Utils3.DLL
----------------------------------------
System.Drawing
Assembly Version: 1.0.5000.0
Win32 Version: 1.1.4322.573
CodeBase: file:///c:/windows/assembly/gac/system.drawing/1.0.5000.0__b03f5f7f11d50a3a/system.drawing.dll
----------------------------------------
DevExpress.XtraEditors3
Assembly Version: 3.2.1.0
Win32 Version: 3.2.1.0
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/DevExpress.XtraEditors3.DLL
----------------------------------------
DevExpress.XtraBars3
Assembly Version: 3.7.1.0
Win32 Version: 3.7.1.0
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/DevExpress.XtraBars3.DLL
----------------------------------------
DevExpress.XtraTreeList3
Assembly Version: 1.11.1.0
Win32 Version: 1.11.1.0
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/DevExpress.XtraTreeList3.DLL
----------------------------------------
AxInterop.SHDocVw
Assembly Version: 1.1.0.0
Win32 Version: 1.1.0.0
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/AxInterop.SHDocVw.DLL
----------------------------------------
System.Xml
Assembly Version: 1.0.5000.0
Win32 Version: 1.1.4322.573
CodeBase: file:///c:/windows/assembly/gac/system.xml/1.0.5000.0__b77a5c561934e089/system.xml.dll
----------------------------------------
System.Data
Assembly Version: 1.0.5000.0
Win32 Version: 1.1.4322.573
CodeBase: file:///c:/windows/assembly/gac/system.data/1.0.5000.0__b77a5c561934e089/system.data.dll
----------------------------------------
Interop.SHDocVw
Assembly Version: 1.1.0.0
Win32 Version: 1.1.0.0
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/Interop.SHDocVw.DLL
----------------------------------------
Accessibility
Assembly Version: 1.0.5000.0
Win32 Version: 1.1.4322.573
CodeBase: file:///c:/windows/assembly/gac/accessibility/1.0.5000.0__b03f5f7f11d50a3a/accessibility.dll
----------------------------------------
DevExpress.Data3
Assembly Version: 3.2.1.0
Win32 Version: 3.2.1.0
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/DevExpress.Data3.DLL
----------------------------------------
Microsoft.mshtml
Assembly Version: 7.0.3300.0
Win32 Version: 7.0.3300.0
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/Microsoft.mshtml.DLL
----------------------------------------
XMLconfig
Assembly Version: 1.0.2163.27571
Win32 Version: 1.0.2163.27571
CodeBase: file:///C:/Documents%20and%20Settings/wt/桌面/LocoySpiderV2Alpha/XMLconfig.DLL
----------------------------------------
Microsoft.VisualBasic
Assembly Version: 7.0.5000.0
Win32 Version: 7.10.3052.4
CodeBase: file:///c:/windows/assembly/gac/microsoft.visualbasic/7.0.5000.0__b03f5f7f11d50a3a/microsoft.visualbasic.dll
----------------------------------------
System.Web
Assembly Version: 1.0.5000.0
Win32 Version: 1.1.4322.573
CodeBase: file:///c:/windows/assembly/gac/system.web/1.0.5000.0__b03f5f7f11d50a3a/system.web.dll
----------------------------------------
************** JIT Debugging **************
To enable just in time (JIT) debugging, the config file for this
application or machine (machine.config) must have the
jitDebugging value set in the system.windows.forms section.
The application must also be compiled with debugging
enabled.
For example:
<configuration>
<system.windows.forms jitDebugging="true" />
</configuration>
When JIT debugging is enabled, any unhandled exception
will be sent to the JIT debugger registered on the machine
rather than being handled by this dialog.
暂时就这些了,以后发现了再补上。
下面顺便看下我用循环采集测试的结果, 1.21的循环分页采集功能更强
1.21 3月份的版本 ( 很经典的哦,谁要的拷贝回去;P)
;P
【标题】: 汤加丽写真
【内容】: http://www.yabuli.net/meinvxiezhen/UploadFiles_9359/200605/20060516094011227.jpg
http://www.woaitu.com/meinv/tjl/029.jpg
http://www.yabuli.net/meinvxiezhen/UploadFiles_9359/200605/20060516094127990.jpg
http://www.woaitu.com/meinv/tjl/028.jpg
http://www.365zn.com/mrl/pic/mrpic2006in1944.jpg
http://www.woaitu.com/meinv/tjl/073.jpg
http://www.tupianba.com/hot/tangjiali/tangjiali_01.jpg
http://www.woaitu.com/meinv/tjl/046.jpg
http://www.kkpic.com/data/200510/28ea461197ffe55ab66a3c6ce1d693d4_m.jpg
http://www.woaitu.com/meinv/tjl/010.jpg
http://tu.ywzc.net/rtys/tangjiali/24.jpg
http://tu.ywzc.net/rtys/tangjiali/27.jpg
http://www.woaitu.com/meinv/tjl/058.jpg
http://www.woaitu.com/meinv/tjl/056.jpg
http://www.ezhuzi.com/star/admin/images/upfile/200612510319.jpg
http://www.woaitu.com/meinv/tjl/077.jpg<BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P>http://www.woaitu.com/meinv/tjl/038.jpg
http://www.xianwang.com/tangjiali/20060511153030_56855.jpg
http://www.9bian.com/mm/UploadFiles_3432/200605/2006516115534453.jpg
http://pic.patchsky.com/infoPic/pic17/pic17692c1.jpg
http://www.xianwang.com/tangjiali/20060511153030_47566.jpg
http://www.toto.cc/infoPic/pic54/pic54472c1.jpg
http://www.woaitu.com/meinv/tjl/079.jpg
http://www.toto.cc/infoPic/pic54/pic54463c1.jpg
http://www.babycom.cn/jianfei/jianfei/nvmingxing/tangjiali/tangjiali.jpg
http://tu.ywzc.net/rtys/tangjiali/32.jpg
http://www.9bian.com/mm/UploadFiles_3432/200605/2006516115534131.jpg
http://www.18yihou.com/infoPic/pic51/pic51524c1.jpg
http://tu.ywzc.net/rtys/tangjiali/28.jpg
http://tu.ywzc.net/rtys/tangjiali/22.jpg
http://www.18yihou.com/infoPic/pic51/pic51520c1.jpg
http://down.veryol.com/uploadfiles/image/10004/TXT-2006425223929360.jpg<BR><P></P>http://www.toto.cc/infoPic/pic54/pic54470c1.jpg
http://www.toto.cc/infoPic/pic54/pic54465c1.jpg
http://www.xianwang.com/tangjiali/20060511153030_53104.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416090752609.jpg
http://www.toto.cc/infoPic/pic54/pic54468c1.jpg
http://www.xianwang.com/tangjiali/20060511153030_91313.jpg
http://www.toto.cc/infoPic/pic54/pic54467c1.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416090747341.jpg
http://www.xianwang.com/tangjiali/20060511153030_90486.jpg
http://www.xianwang.com/tangjiali/20060511153030_59117.jpg
http://www.xianwang.com/tangjiali/20060511153030_87209.jpg
http://www.xianwang.com/tangjiali/20060511153030_93598.jpg
http://www.xianwang.com/tangjiali/20060511153030_11236.jpg
http://www.channeleat.com/Files/ImgPic/2006-4/29/v42913.jpg
http://www.i-power.com.cn/image20010518/8655.jpg
http://www.wsxq.com/nanchang/zixun/Upload/newsimg/2005121235342734.jpg<BR><P></P>http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416091227699.jpg
http://www.xianwang.com/tangjiali/20060511153030_22775.jpg
http://pic.northeast.cn/0/00/12/16/121658_984637.jpg
http://www.woaitu.com/meinv/tjl/074.jpg
http://www.xianwang.com/tangjiali/20060511153030_20319.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416091225196.jpg
http://www.xianwang.com/tangjiali/20060511153030_27626.jpg
http://pic.patchsky.com/infoPic/pic17/pic17697c1.jpg
http://www.9bian.com/mm/UploadFiles_3432/200605/2006516115532881.jpg
http://img4.pcpop.com/PicImages/480x480/0/143/000143549.jpg
http://news.driverchina.com/Files/BeyondPic/2005-11/14/080.jpg
http://www.9bian.com/mm/UploadFiles_3432/200605/2006516115533566.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416090751590.jpg
http://www.taomn.com/pic/bimg/albums/photos/1/0/1/101705/1127469264.jpg
http://down.veryol.com/uploadfiles/image/10004/TXT-2006425223930112.jpg
http://ent.tom.com/images/houwei/tjl060125/x/3.jpg<BR><P></P>http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416090750918.jpg
http://tu.ywzc.net/rtys/tangjiali/13.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416090752691.jpg
http://down.veryol.com/uploadfiles/image/10004/TXT-2006425223930784.jpg
http://www.kunde58.com/power/Article/UploadFiles/200508/20050816081709865.jpg
http://down.veryol.com/uploadfiles/image/10004/TXT-2006425223929797.jpg
http://www.18yihou.com/infoPic/pic27/pic27122c1.jpg
http://down.veryol.com/uploadfiles/image/10004/TXT-2006425223929371.jpg
http://www.toto.cc/infoPic/pic54/pic54469c1.jpg
http://www.18yihou.com/infoPic/pic27/pic27130c1.jpg
http://ent.tom.com/images/houwei/tjl060125/x/2.jpg
http://img4.pcpop.com/PicImages/480x480/0/143/000143588.jpg
http://www.18yihou.com/infoPic/pic27/pic27120c1.jpg
http://www.18yihou.com/infoPic/pic27/pic27121c1.jpg
http://photocdn.sohu.com/20060515/Img243244360.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416090751754.jpg<BR><P></P>http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416090751233.jpg
http://news.driverchina.com/Files/BeyondPic/2005-11/14/057.jpg
http://www.kunde58.com/power/Article/UploadFiles/200508/20050816081703381.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416090752166.jpg
http://www.kunde58.com/power/Article/UploadFiles/200508/20050816081709693.jpg
http://news.driverchina.com/Files/BeyondPic/2005-11/14/055.jpg
http://www.kunde58.com/power/Article/UploadFiles/200512/20051223141918927.jpg
http://news.driverchina.com/Files/BeyondPic/2005-11/14/004.jpg
http://ent.tom.com/images/houwei/tjl060125/x/1.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416091227474.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416091227323.jpg
http://tu.ywzc.net/rtys/tangjiali/15.jpg
http://news.driverchina.com/Files/BeyondPic/2005-11/14/059.jpg
http://news.driverchina.com/Files/BeyondPic/2005-11/14/030.jpg
http://news.driverchina.com/Files/BeyondPic/2005-11/14/075.jpg
http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416091230527.jpg<BR><P></P>http://www.qdsssh.com/tpgl/UploadFiles/200504/20050416091229924.jpg
http://www.der8.cn/uploadfile/user/2005-11/29/2005112915184793637.jpg
http://tu.ywzc.net/rtys/tangjiali/17.jpg
http://fs4.139.com/2/2030/lqq899/photo/20065281621800_500.jpg
http://ad.tom.com/search/pic/templates/l_s_mlf10.jpg
http://www.kkpic.com/data/200510/bd669c55ab39cd48cdcd3123c1a9f845_m.jpg
http://www.kkpic.com/data/200510/6cafcaf8d74a5d4352999e5d4c08e780_m.jpg
http://news.driverchina.com/Files/BeyondPic/2005-11/14/047.jpg
http://www.kkpic.com/data/200510/1a0ca5d52c2ed6f8b08be5985d30efa6_m.jpg
http://www.lxsh.com.cn/Article/UploadFiles/200505/20050528113026120.jpg
http://club.vgogo.com/uploadfile/200512311134860116.jpg
http://ent.tom.com/images/houwei/tjl060125/x/4.jpg
http://www.der8.cn/uploadfile/user/2005-11/29/2005112914273422650.jpg
http://www.kunde58.com/power/Article/UploadFiles/200508/20050816081708563.jpg
http://img4.pcpop.com/PicImages/480x480/0/147/000147345.jpg
http://fs4.139.com/2/2045/fqjk126/photo/2006525151624201_500.jpg<BR><P></P>http://www.kkpic.com/data/200510/2d866fc9ef802723e2cac1cd6d98ecfa_m.jpg
http://club.vgogo.com/uploadfile/200512311134934596.jpg
http://www.12638.com/news/UploadFiles_7329/200604/20060409190104281.jpg
http://www.kunde58.com/power/Article/UploadFiles/200506/20050616155633647.jpg
http://club.vgogo.com/uploadfile/200512311114862656.jpg
http://www.kunde58.com/power/Article/UploadFiles/200506/20050616155633129.jpg
http://www.der8.cn/uploadfile/user/2005-11/29/2005112914273460136.jpg
http://club.vgogo.com/uploadfile/200512311134934746.jpg
http://www.12638.com/news/UploadFiles_7329/200604/20060409190342996.jpg
http://shop-cnkeyword.web11.bootchina.com/netstar/img/tangjiali/1new.jpg
http://www.der8.cn/uploadfile/user/2005-11/29/2005112915254491135.jpg
http://www.der8.cn/uploadfile/user/2005-11/29/2005112915254429861.jpg
http://www.babycom.cn/jianfei/jianfei/nvmingxing/tangjiali/汤加丽1.jpg
http://img4.pcpop.com/PicImages/480x480/0/147/000147347.jpg
http://www.abcd5.com/tangjialifiles/20050715rt11s.jpg
http://www.abcd5.com/tangjialifiles/20050715rt10s.jpg<BR><P></P>http://www.abcd5.com/tangjialifiles/TN_WW_CSA_BodyArt_TangJiaLi_033.jpg
http://www.der8.cn/uploadfile/user/2005-11/29/2005112915254348580.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101241501.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101242190.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101237473.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101241499.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101242192.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101241536.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101237349.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101241543.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101240246.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101237471.jpg
http://www.5fat.com/jianfei/nvmingxing/tangjiali/汤加丽.jpg
http://club.vgogo.com/uploadfile/200512311114734394.jpg
http://www.der8.cn/uploadfile/user/2005-11/29/2005112915184791565.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101241508.jpg<BR><P></P>http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101242197.jpg
http://img4.pcpop.com/PicImages/480x480/0/143/000143547.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101234592.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101237347.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101242060.jpg
http://club.vgogo.com/uploadfile/200512311134983552.jpg
http://www.der8.cn/uploadfile/user/2005-11/29/2005112914273495705.jpg
http://star.loudi.tv/loudi/20051012/537/sina1.com_2005101240244.jpg
http://img4.pcpop.com/PicImages/480x480/0/143/000143548.jpg
http://img4.pcpop.com/PicImages/480x480/0/143/000143570.jpg
http://www.dabaoku.net/liuxing/wangluo/tangjiali2/030bd.jpg
http://www.36588.com.cn/imagelib/11/40/image/汤加丽精美写真39.jpg
http://club.vgogo.com/uploadfile/200512311114757567.jpg
http://www.dabaoku.net/liuxing/wangluo/tangjiali2/044br.jpg
http://www.soart.cn/Article/UploadFiles/200512/20051213122720992.jpg
http://www.kkpic.com/data/200510/a0ee27292688ab100203dbcc5e369cf4_m.jpg
2.0内侧版,条件一模一样但是采集回来的结果差很多,我测试了好几遍结果都一样。
【标题】: 汤加丽写真
【内容】: http://www.woaitu.com/meinv/tjl/038.jpg
http://www.xianwang.com/tangjiali/20060511153030_56855.jpg
http://www.9bian.com/mm/UploadFiles_3432/200605/2006516115534453.jpg
http://pic.patchsky.com/infoPic/pic17/pic17692c1.jpg
http://www.xianwang.com/tangjiali/20060511153030_47566.jpg
http://www.toto.cc/infoPic/pic54/pic54472c1.jpg
http://www.woaitu.com/meinv/tjl/079.jpg
http://www.toto.cc/infoPic/pic54/pic54463c1.jpg
http://www.babycom.cn/jianfei/jianfei/nvmingxing/tangjiali/tangjiali.jpg
http://tu.ywzc.net/rtys/tangjiali/32.jpg
http://www.9bian.com/mm/UploadFiles_3432/200605/2006516115534131.jpg
http://www.18yihou.com/infoPic/pic51/pic51524c1.jpg
http://tu.ywzc.net/rtys/tangjiali/28.jpg
http://tu.ywzc.net/rtys/tangjiali/22.jpg
http://www.18yihou.com/infoPic/pic51/pic51520c1.jpg
http://down.veryol.com/uploadfiles/image/10004/TXT-2006425223929360.jpg<BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P><BR><P></P>
[ 本帖最后由 insun 于 2006-7-11 02:30 编辑 ] 我最希望看到的是中文网址的问题能得到解决,但是比较遗憾目前问题还在;另外没办法在采集的内容插入自定义内容。本以为可以通过自定义标签来添加,不过发布的时候只有显示内容。不知道能不能通过别的办法实现?
[ 本帖最后由 insun 于 2006-7-11 03:06 编辑 ] 非常详细的测试报告,我会对照着进行完善,谢谢insun 你的努力! 老大客气了,发现问题了我会再次提交。 补充一个问题,今天在采集测试的时候发现了一个问题:在内容发布的时候是不是一次读取所有频道网站的网址?我记得我发布测试的那个目标站点的网址只有100多条,但是2.0却获取了800多个文章地址。如果是对比的需要也不应该所有的频道都拿来对比的,对比历史应该是单个频道自己比对才对,这样的速度才会快。另外一个问题上面说过了,就是一发布内容就出错,我把两个的图放在一起,看下图:
http://www.imagekafe.com/files/da995.gif
另外建议论坛开启贴图功能,要不就借助网站的外部工具,看下图,如果需要我可以提供模版,dz的模版:http://www.imagekafe.com/files/7140f.gif 辅助工具的“传值加解密”这个功能不能用,提示:
Applacation has generated an exception that could not bu handled
Process id=0xc14(3092), Thread id=0xc8c (3212)
Click OK to terminate the application
Click CANCEL to debug the application
html的内置标签排除不干净,我把所有的html标签都选择了,可是还是出现了,需要手动设置
看下面采集测试,我已经选定了所有的html标签了:
【标题】: Stroke gives woman foreign accent - FreeWebspace.net Community
【内容】: <div style="margin:20px; margin-top:5px; ">
<div cla ="smallfont" style="margin-bottom:2px">Quote:
<div>Originally Posted by <strong> C News</strong>
<div style="font-style:italic">A Geordie woman has a arently developed foreign accents after waking up following a stroke.
Source: http://news. c.co.uk/1/hi/england/tyne/5144300.stm
另外一点是,一对标签应该是可以一起排除的,但是一起排除的话会把内容也给排除了:lol比如说: <strong></strong>这一对标签按理讲我应该可以用<(*)strong>直接排除,但是不行,非得写两个
[ 本帖最后由 insun 于 2006-7-12 07:25 编辑 ] 采集网址填写的排除规则没用。看我发布的那个天天bt的规则,里面我已经写了内容网址不包括_sort但是每个分类采集回来的都有个包含这个的网址。这个问题比较常遇到,今天才记起来 解决了一个问题了,我知道为什么会提示“未指定错误”。这个是由于我自己在修改好发布方式的时候保存了,但是目标站却没保存。也就是虽然发布方式是保存了,但是目标站点却没选定发布方式,所以会出现“未指定的错误”。
我想问个问题:在“规则类型”中的网址包括图片和影音的吗?因为之前我用1.21的采集内容选定“只采集网址”这样采集不到图片的地址。不知道2.0的版本是否包括呢?
页:
[1]