Download List

Projeto Descrição

Yioop! is a PHP search engine. Yioop! can be configured as either a general purpose search engine for the whole Web or it can be configured to provide search results for a set of URLs or domains. Yioop can crawl pages or can directly index archives such as ARC and WARC. It supports indexing several file formats such as HTML, Atom, PDF, DOC, PPT, RTF, RSS, XML, SVG, PNG, JPG, BMP, GIF, and sitemaps. The Yioop! crawler can be deployed on one or many machines. It supports having one or more to crawl scheduler processes, as well as multiple fetchers and mirrors. Crawling respects robots.txt including Crawl-delay. Yioop! crawls are stored in a Web archive format that is easy to move around. Crawling can be done on one machine and the results deployed elsewhere. Yioop! supports mixing of crawls. Yioop! comes with a search front end that can be localized as desired using a GUI. This GUI supports RTL languages. Management of crawls can also be done using this GUI. Yioop! can be configured in a straightforward manner to make use of file caching or memcache if available.

System Requirements

System requirement is not defined
Information regarding Project Releases and Project Resources. Note that the information here is a quote from Freecode.com page, and the downloads themselves may not be hosted on OSDN.

2011-01-28 16:35 Back to release list
0.66

Esta versão oferece suporte preliminar para arquivar o rastreamento de arco wiki mídia, e os arquivos do diretório aberto RDF. Ele permite que re-rastreamentos de Yioop criado anteriormente! WebArchives. Também torna mais fácil adicionar derivações para outros idiomas além do Inglês (para o qual já existe um derivado). Finalmente, ele corrige vários bugs e melhora no índice do grupo pelo iterador.
Tags: Minor
This version provides preliminary support for archive crawling of arc, media wiki, and open directory RDF files. It allows re-crawls of previously created Yioop! WebArchives. It also makes it easier to add stemmers for languages other than English (for which there is already a stemmer). Finally, it fixes several bugs in indexing and improves the group by iterator.

Project Resources