FAQ Search Today's Posts Mark Forums Read
» Video Reviews

» Linux Archive

Linux-archive is a website aiming to archive linux email lists and to make them easily accessible for linux users/developers.


» Sponsor

» Partners

» Sponsor

Go Back   Linux Archive > Debian > Debian Development

 
 
LinkBack Thread Tools
 
Old 11-22-2007, 06:15 PM
Luca Brivio
 
Default Bug#452422: RFP: yacy -- distributed web crawler and search engine

Package: wnpp
Severity: wishlist
X-Debbugs-CC: debian-devel@lists.debian.org

* Package name : yacy
Version : 0.55
Upstream Author : Michael Christen <mc@example.com>
* URL : http://yacy.net
* License : GPL
Programming Lang: Java
Description : distributed web crawler and search engine

YaCy is a scalable personal web crawler and web search engine. One YaCy
installation can organize more than 10 million documents, but YaCy can
operate search clusters of unlimited size.

YaCy has a peer-to-peer web index exchange interface and it does not need a
central server. Web crawls can be done collaborative with all other YaCy
peers. Resulting indexes are organized in a distributed hash table, and
search requests are pointed efficiently to specific, index-hosting peers.

YaCy can not only index texts from various file formats but also from
different media contents. A search result shows interesting text, image,
audio and video content with direct links to OGG, MP3, and video files.

Because YaCy is fully distributed, search results cannot be completely
censored, only filtered by single peer owners. However, in a privatly
operated search network the software provides a strong functionality to
control the content of the search cluster. In a public search network, a user
is anonymous because there is no central point where all search requests can
be stored.

YaCy has a large number of users running their own peer to create a
independent and open search engine. The standard YaCy release is configured
in such a way that the software joins this public network. The software has a
number of community function like a co-operative bookmark system, a news,
blog and built-in wiki system.

--
Luca Brivio



--
To UNSUBSCRIBE, email to debian-devel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
 

Thread Tools




All times are GMT. The time now is 02:29 AM.

VBulletin, Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO ©2007, Crawlability, Inc.
Copyright 2007 - 2008, www.linux-archive.org