Sphinix

It is a standalone search engine, meant to provide fast, size-efficient and relevant fulltext search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages. Currently built-in data sources support fetching data either via direct connection to MySQL or PostgreSQL, or using XML pipe mechanism (a pipe to indexer in special XML-based format which Sphinx recognizes). Official APIs for PHP, Python, Java, Ruby, pure C are included in Sphinx distribution.

Its feature includes:

  • high indexing speed (upto 10 MB/sec on modern CPUs);
  • high search speed (avg query is under 0.1 sec on 2-4 GB text collections);
  • high scalability (upto 100 GB of text, upto 100 M documents on a single CPU);
  • provides good relevance ranking through combination of phrase proximity ranking and statistical (BM25) ranking;
  • provides distributed searching capabilities;
  • provides document exceprts generation;
  • provides searching from within MySQL through pluggable storage engine;
  • supports boolean, phrase, and word proximity queries;
  • supports multiple full-text fields per document (upto 32 by default);
  • supports multiple additional attributes per document (ie. groups, timestamps, etc);
  • supports stopwords;
  • supports both single-byte encodings and UTF-8;
  • supports English stemming, Russian stemming, and Soundex for morphology;
  • supports MySQL natively (MyISAM and InnoDB tables are both supported);
  • supports PostgreSQL natively.



http://www.sphinxsearch.com/

Bookmark and Share          4239



comments powered by Disqus


Related Products

Solr

Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g., Word, PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites.

Read more

Saaral-soft-search-spider - WebSpider that can also search text in spidereded web links

Search SpiderSaaral Soft Search Spider is created using perl with GTK+ as a front end. It is basically a Web Spider, Which can find links from given seed site also search in the found web pages. In simple terms, It works like a web search engine without indexing any data. saaral-soft-search-spider is also a good example for Perl GTK2 usage and Using GTK2 widgets inside perl threads. How to Run?To Run Windows BinarySearchSpider is also available as standalone executable for windows OS (Windows XP

Read more

Ssearch - PHP5 Spidering Search Library

Introduction to sSearchThe initial 's' in sSearch stands for 'simple' -- this library is intended to be a simple search library for small- to medium-size web sites where it is not feasible to install search engine software. It is written entirely in object-oriented PHP5 and by default uses a basic MySQL back-end (although an improved MySQL-based engine is in development). FeaturessSearch currently supports: Index single page Index page and follow links ('spidering') recursively to a custom depth

Read more

Pyopensearch - Python example application that provides Open Search suggestions and AJAX'y web f

Example Python application that implements basic OpenSearch 1.1 features along with html/javascript for web browser so that regular html forms can make use of OpenSearch suggestion/autocomplete/search-as-you-type support. Really 3 different pieces: Full Text Search tool (via Whoosh) OpenSearch server html form autosuggest via jQuery and the OpenSearch server See PyOpenSearchExamples for CherryPy demo application using Whoosh (a Full Text Search engine) and serving JSON for auto suggest. Example

Read more

Searchgui - graphical user interface for proteomics identification search engines

SearchGUINews What is SearchGUI? Download Using SearchGUI In Other Tools From the Command Line Database Help User Defined Modifications Result Analysis Troubleshooting SearchGUI Publication: Vaudel et al: Proteomics 2011;11(5):996-9. If you use SearchGUI as part of a paper, please include the reference above. PeptideShaker: To visualize and analyze the search results we recommend the use of PeptideShaker. OMMSA Parser and X!Tandem Parser: For developer access to the search results we recommend t

Read more

Peptide-shaker - interpretation of proteomics identifications from multiple search engines

PeptideShakerNews Introduction Download Read Me Troubleshooting Important: Keep in mind that PeptideShaker is still in development. So we will release frequent updates with new features and bug fixes. If you find a bug or have ideas for new features, please let us know. Either by setting up an Issue or by sending an e-mail to the PeptideShaker Google Group. (Click on a figure to see the full size version) NewsMay 10. 2012: PeptideShaker v0.17.0 is now available: Added a Protein Fractions tab. Ad

Read more

Lucene

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.

Read more

Jumper - Collaborative search engine in PHP

Jumper 2.0 is a collaborative community search platform that revolutionizes search by crowdsourcing knowledge management powered by a shared bookmarking engine. It is easily and quickly deployed into a community of practice that benefits users with complex and specialized search requirements. Jumper delivers universal search of any databases, flat files, fileshares, content systems, web pages, blogs and wikis, even people - through one simple search box.

Read more

Constellio - Enterprise Search engine

Constellio Open Source Enterprise Search is based on Apache Solr and using Google Search Appliances connectors architecture, it allows, with a single click, to find all relevant content in your organization (Web, email, ECM, CRM etc.).

Read more

Open Search Server

Open Search Server is both a modern crawler and search engine and a suite of high-powered full text search algorithms. Built using the best open source technologies like lucene, zkoss, tomcat, poi, tagsoup. Open Search Server is a stable, high-performance piece of software.

Read more

Related Tags
Browse projects by tags.

We have collection of more than 400,000 open source products ranging from Enterprise product to small libraries in all platforms. We aggregate information from all open source repositories. Search and find the best for your needs.



Follow feeds Follow bestopensource on Twitter Follow bestopensource on Facebook


Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.

Do you provide Consulting, Training, Support for any open source products. Register your business

Tag Cloud >>