BT中的垃圾数据(rubbish data)和幕后故事

Update:
诺微的森林 – 骑驴看唱本,隔墙有耳
这篇文章对”间谍服务器”分析得很到位,很值得一看。

这几天在通过BT下载一部新近的电视剧《罗马(Rome)》,使用的Client是Windows下的BitComet。刚开始还算正常,从今天下载的第6集开始就发现了很奇怪的现象,几个小时前就提示已完成80%,可是现在还是提示完成度80%,中间一直有十几KB的流量。看了一下统计数据,真是不看不知道,一看吓一跳:

Total Download: 986.04 MB (754.16 MB rubbish data dropped)

怎么会有这么多的垃圾数据(rubbish data)呀?以前也没遇到过这种情况?用Google搜了一下,在这个帖子中间找到了答案:

Grees Oct 18 2005, 12:32 PM

Rubbish data is data that your bittorrent client has discarded.
When certain data for your file failed the hash check, it will be discarded as rubbish data.
Or when your client requests a certain data piece from a user, but before that piece arrive, your client already download it from a second user, then that piece will become rubbish.

垃圾数据就是你的BT客户端所丢弃的数据。
当收取到的数据未能通过hash检查时,就会被作为垃圾数据所丢弃。或者当你的客户端向某一用户请求数据,但在该数据块未抵达前,客户端已经从另一个用户处取得了该数据块,那么之前的数据块就将成为垃圾数据。

Since you are downloading from alot of users at once, the chance that one of them has a bad piece of data is very possible. The movie/game industry also spreading false file of popular game/movies which contain bad data.

因为你的下载总是在很多用户间同时进行的,所以垃圾数据的发生比率还是很高的。电影/游戏厂商也会对一些热门下载故意散播含有坏块的假数据。

Newer bittorrent clients have the option to block users which are sending you alot of rubbish data, client like Bitlord does this automatically.
It also help if you download from certain trackers, some are more reliable then others, for instance i never have rubbish data when downloading from bt-gm.

较新的BT客户端,如Bitlord已经包含选项,能够自动的屏蔽向你发送了大量垃圾数据的用户。
注意选择可信赖的发布站(tracker),也是一个防治垃圾数据比较好的方法。

最让我吃惊的还是下面接下来的一条回复:

prodaytrader Oct 18 2005, 09:37 PM

I think you might also find that you are being sent rubish data from companies like Meta Data. I have never had to deal with this before a week ago. I started a HBO series download called Rome about 2 weeks ago and found that I had downloaded about 5 times the amount of data then what the files called for. It was taking several days when I started to get curious about what was going on. I browsed through the peer lists and found dozens of ip’s that were very similar in nature and many times being different by only 1 digit. This is what made me realize I was being fed BS data or rubbish data. At any rate I downloaded something called Peer Guardian 2 and it started to block IP’s that were known to be hazerdous to our downloads. The downloads finished inside 20 minutes once PG started blocking those ips. No fuss no muss, I just make sure and turn PG on while downloading with BitComet and my download rates are fast with little to no rubbish data.

我想你也许已经发现了一些公司如Meta Data会向你发送垃圾数据。一周前这样的情况是从来没有发生过的。当我两周前开始下载HBO的热门剧集《罗马》的时候,我发现了我下载了大约5倍于正常数据的垃圾数据。过了好些天我才意识到这个奇怪的现象。我查看了一下用户列表,发现几十个非常近似的IP(通常只差最后一位)。这使我意识到我也许被人暗算了。我立刻开始使用一个Peer Guardian 2软件,它能屏蔽那些捣乱的IP。在PG起作用后,我只用了20分钟就完成了下载。所以别着急,用了PG后,我的BitComet下载恢复了正常,没有或者很少有垃圾数据。

In case you are unaware, Riaa and the MPAA are starting to enlist companies like Metadata to be seeders on bittorrent. They are able to create seeds that are complete fakes and will mix in their seeds with the others so that your downloads never finish. They can also become peers to acompish the same objective. Their mission is to make your downloads take forever and log your ip for later use. Do yourselve a favor and find a way to block ips like this.

也许你还没意识到,RIAA(Recording Industry Association of America, 美国唱片工业协会)和MPAA(Motion Picture Association of America, 美国电影协会)已经开始赞助一些公司如Meta Data来为BT做种子。他们通过创建假种子,并混在正常的种子中间使你永远下载不完。他们也会伪装成其它的用户。他们的目的就是延长你的下载时间,并记录下你的IP地址。想想办法屏蔽这些IP吧。

牛,这招 以暴治暴 米国人居然也能想得出来。居然还有Meta Data这样的公司通过这样的手段来赚钱,真晕。上面提到的就是我现在下载的东西,不过我下载的是内嵌中文字幕的real格式的,看来米国人也找了国人合作,ydy这样的大站要加小心了。

这让我联想起,之前一直是在虫窝贵宾ftp下载电影的,最近他们突然停掉了ftp服务,并贴出了如下的通知:

[重要]关闭贵宾区1,2,3

由于最近网络严打盗版,N多论坛已经停止FTP的下载,甚至很多都已经直接关闭论坛。所以,虫窝的也将暂停关闭。具体恢复时间视情况决定。
请诸位谅解!

小道消息 说是要严打到明年6月30号,等国务院提交关于中国加入世界知识产权组织互联网条约(WIPO Internet Treaties)的相关文件 这次行动也是迄今为止我国对网络侵权盗版行为规模最大、力度最强的一次专项治理。

嗯,让暴风雨来得更猛烈些吧。

参考:
HBO 開始主動妨礙 BT 下載
HBO Attacking BitTorrent Permalink
PeerGuardian 小组负责管理钱和服务器的那家伙被 Fake Server 和 FBI Server 的人用金钱贿赂
PeerGuardian 教程

4 Responses to “BT中的垃圾数据(rubbish data)和幕后故事”

  1. s5s5 Says:

    BT里原来有垃圾啊,嘿~我第一次听说~

  2. 一为空间 Says:

    用了很久BT了,一直没注意这个问题,在BITCOMET里面好象没有这个情况呢???

  3. windix Says:

    我也是刚刚才遇到,不过相信这个情况会越来越多的,特别是国外的一些资源,看看PeerGuardian那个越来越大的IP过滤规则文件就应该明白了。

  4. 黄德峻 Says:

    这些rubbish data会不会伤害硬碟?