Displaying 1 to 6 from 6 results
Grub
Grub Next Generation is distributed web crawling system (clients/servers) which helps to build and maintain index of the Web. It is client-server architecture where client crawls the web and updates the server. The peer-to-peer grubclient software crawls during computer idle time.
Open Search Server
Open Search Server is both a modern crawler and search engine and a suite of high-powered full text search algorithms. Built using the best open source technologies like lucene, zkoss, tomcat, poi, tagsoup. Open Search Server is a stable, high-performance piece of software.
Crawwwler - Open source large scale web crawler
This project is still in its absolute infancy. craWWWler will be a large scale web crawler written in C++ (no MFC). It currently has a very basic plugin architecture controlled by a purposely thin manager. The manager, however, is designed to be more like an ignition switch, occasional pump, and emergency shutdown. The manager is responsible for allowing one or mores plugins to subscribe to the output of other plugins. In this way, the plugins do not have to pass large amounts of data to other p
Iwebcrawler - iCrawler is web based crawler system
iCrawler is a web based crawler system which enable some features like multithreading. iCrawler is also extensible crawler which will support adding any features to it. Build Enviroment: C# Programming Language MS-SQL DataBase ASP.NET
Opese - openSE is a open source search engine for study
OpenSE is a general Chinese search engine implemented in C++ on linux. It consists of four basic modules: crawler, index, query server, and query cgi. This search engine provides web pages query service according to some key words, or a query string given by users. The search results responding to users consists of some item lists. Each item consists of a title, a extract, a url link, and a snapshot link of a web page containing query words given by users. Users can click the snapshot link to se
Spiderscraper - scrape search engine results
returns anchor links from search engine results