This is what I scraped. Why is it coming out garbled?
【标题】:Ħ¶ûׯ԰֮ľ֮ËéƬÊÕ¼¯¹¥ÂÔ【内容】:Ê×ÏÈ£¬µÇÈë Ħ¶ûׯ԰£¬Áì ÉÏÀ­Ä·£¨Ñ§Íê±äÉíСÊ÷Ãç¼¼ÄÜ£©£¬µ½´ïÀ­Ä·ÊÀ½çľ×岿Â侫Áé֮ɭÆÜÏ¢µØ£¬ ´ò°ÜÊ÷¾«£¨µ½´ïÄ¿µÄµØ¼°Ñ§Ï°¼¼ÄÜ»¹Óдò°ÜÊ÷¾«·½·¨£º http://www.61mole.com/moerzhuanyuangm/1470.html £© »ñµÃËéƬ ÕâÊÇÂÌÊ÷ËéƬ£¬¿ÉÒÔÕÒľϵÇõ³¤ Æ´ºÏÊ״οɶһ»Ä¾Ö®ÊÖÕÈ Æä</div>
<div class="content">
¡¡¡¡<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-INDENT: 21pt; mso-char-indent-count: 2.0"><font size="3"><span style="FONT-FAMILY: ËÎÌå; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'">Ê×ÏÈ£¬µÇÈë<font color="#000000">Ħ¶ûׯ԰£¬Áì</font>ÉÏÀ­Ä·£¨Ñ§Íê±äÉíСÊ÷Ãç¼¼ÄÜ£©£¬µ½´ïÀ­Ä·ÊÀ½çľ×岿Â侫Áé֮ɭÆÜÏ¢µØ£¬´ò°ÜÊ÷¾«£¨µ½´ïÄ¿µÄµØ¼°Ñ§Ï°¼¼ÄÜ»¹Óдò°ÜÊ÷¾«·½·¨£º</span><span lang="EN-US"><font face="Times New Roman">http://www.61mole.com/moerzhuanyuangm/1470.html</font></span><span style="FONT-FAMILY: ËÎÌå; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'">£©</span></font></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-INDENT: 21pt; mso-char-indent-count: 2.0"><span style="FONT-FAMILY: ËÎÌå; mso-ascii-font-family: 'Times New Roman'; mso-hansi-font-family: 'Times New Roman'"><font size="3">»ñµÃËéƬ

In config.ini, change htmldecode to true, then restart the collector and try again.
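For anyone wondering where this kind of garbling usually comes from: the sample above looks like Chinese text whose bytes were read with the wrong character set, most likely a GBK/GB2312 page decoded as a Western encoding. That is an assumption based on how the output looks, not something confirmed about the collector or about what htmldecode does internally. A minimal Python sketch of the effect, with a made-up sample string and hypothetical helper names:

```python
# Illustration only: reproduce and undo "GBK read as Latin-1" garbling.
# The sample string and both helper names are invented for this sketch;
# nothing here reflects how the collector itself is implemented.


def garble(text: str) -> str:
    """Encode text as GBK, then wrongly decode those bytes as Latin-1."""
    return text.encode("gbk").decode("latin-1")


def repair(garbled: str) -> str:
    """Undo garble(): recover the raw bytes, then decode them as GBK."""
    return garbled.encode("latin-1").decode("gbk")


original = "首先,登入摩尔庄园"
broken = garble(original)

# The garbled form differs from the original, and the round trip restores it.
print(broken != original)
print(repair(broken) == original)
```

The round trip only works when every byte survived the wrong decode. If the text has already been saved or re-encoded again after being garbled, some bytes may be lost, so fixing the setting and scraping the page again is the safer route.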