“Anansi”的意思、由来-中文百科全书

概念

Anansi是一个利用网络连接计算机来探索世界网络资源的研究项目。原则上我们希望基于准确性和性能在分布式网络爬虫上做一个评估，经过考虑，BOINC是我们最终的选择。在这样的一个系统中包括准确性，稳定性，适应性和性能等将被测量。

Anansi，客户返回的唯一的URI被抓取与URI的HTTP状态代码，联营公司，indcating它的空房情况。只有计划http本身可以达到公众的URI将被抓取。没有E - mail地址，文字内容或用户，密码将被收集。它是一个非CPU密集型的项目，这是试图在客户端上，以减少CPU负载。机器人排斥和一些网页的内容，如联想信息正在收集和抓取过程中使用BOINC的志愿者，但他们都将返回到Anansi服务器。

Anansi收集的数据（URI）来将用于地图，减少引擎，计算每个URI的重点。当务之急是建立后入度，出度和Anansi服务器创建时间戳。Anansi服务器考虑，重新计划，保持continuely工作的系统

原文：In Anansi, clients returned only URIs been crawled associate with URI's http status code that indcating availability of it. Only URIs with scheme http itself that can be reached by the public will be crawled. No E-mail address, words content or user, password will be collected. It is an non-cpu-intensive project, which is trying to reduce CPU loads on the client. Associative information such as robots exclusion and some page contents are being collected and used by BOINC Volunteers during crawling, but none of them will be returned to Anansi server.

The data(URIs) collected by Anansi will be used by a Map-reduce engine that calculates priorities for each URI. The priority is established upon In-degree, out-degree and timestamp created by Anansi Server. Anansi server take it into consideration for revisit plans, which maintains a continuely working system.

词条	Anansi
释义	概念运作概念 Anansi是一个利用网络连接计算机来探索世界网络资源的研究项目。原则上我们希望基于准确性和性能在分布式网络爬虫上做一个评估，经过考虑，BOINC是我们最终的选择。在这样的一个系统中包括准确性，稳定性，适应性和性能等将被测量。运作 Anansi，客户返回的唯一的URI被抓取与URI的HTTP状态代码，联营公司，indcating它的空房情况。只有计划http本身可以达到公众的URI将被抓取。没有E - mail地址，文字内容或用户，密码将被收集。它是一个非CPU密集型的项目，这是试图在客户端上，以减少CPU负载。机器人排斥和一些网页的内容，如联想信息正在收集和抓取过程中使用BOINC的志愿者，但他们都将返回到Anansi服务器。 Anansi收集的数据（URI）来将用于地图，减少引擎，计算每个URI的重点。当务之急是建立后入度，出度和Anansi服务器创建时间戳。Anansi服务器考虑，重新计划，保持continuely工作的系统原文：In Anansi, clients returned only URIs been crawled associate with URI's http status code that indcating availability of it. Only URIs with scheme http itself that can be reached by the public will be crawled. No E-mail address, words content or user, password will be collected. It is an non-cpu-intensive project, which is trying to reduce CPU loads on the client. Associative information such as robots exclusion and some page contents are being collected and used by BOINC Volunteers during crawling, but none of them will be returned to Anansi server. The data(URIs) collected by Anansi will be used by a Map-reduce engine that calculates priorities for each URI. The priority is established upon In-degree, out-degree and timestamp created by Anansi Server. Anansi server take it into consideration for revisit plans, which maintains a continuely working system.
随便看	中国食品卫生杂志中国食品协会中国食品信息网中国食品学报中国食品药品安全专项整治稽核组中国食品药品监管理论与法制实践中国食品饮料网中国食品营销网中国食品有限公司中国食品杂志社中国食品展览会中国食品招商网中国食品质量安全联盟中国食品质量报中国食神-烹饪大师刘敬贤中国食俗中国食卫网中国食文化中国食文化研究会中国食物中国食物成表中国食物成分表中国食物成分表(2004第2册) 中国食物与营养中国食用菌百科《局域网布线(第二版)》《局域网技术与组网工程高职高专》《局外人看科学(知识与社会译丛)》《局部塑身(魅力・美女人小秘方)》《局部麻醉剂》《局部麻醉剂》《局长的晚宴》《屁股就像冲锋枪打出的靶子》《层峰命令》《居住区与住宅规划设计实用全书》《居室养花窍门(时尚家庭生活丛书)》《居室摆饰台装修图集》《居室照明与布线(布居一格丛书)》《居室盆花》《居室空间利用(布居一格丛书)》《居室绿饰》《居室色彩搭配》《居室阳台绿色环境艺术》《居家男人》《居所》《居酒屋幽灵》《居韵》《屈原赋注》《屈子章句》《屈骚指掌》

概念

运作