Displaying 1 to 10 from 177 results
Ganglia - scalable distributed monitoring system
Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. It leverages widely used technologies such as XML for data representation, XDR for compact, portable data transport, and RRDtool for data storage and visualization.
Carrot2 - Search Results Clustering Engine
Carrot2 is an Open Source Search Results Clustering Engine. It could cluster the search results from various sources and generates small collection of documents. Carrot2 offers ready-to-use components for fetching search results from various sources including YahooAPI, GoogleAPI, Bing API, eTools Meta Search, Lucene, SOLR, Google Desktop and more.
Hadoop Common
Apache Hadoop is a framework for running applications on large clusters built of commodity hardware. Hadoop common supports other Hadoop subprojects
MySQL
The MySQL database is the world's most popular open source database.
Tungsten Replicator - Data Replication Engine for MySQL
Tungsten Replicator is a high performance, data replication engine for MySQL. It offers a set of features like Global transaction IDs to support failover, Flexible transaction filtering, Extensible transaction metadata, Sharding, Multiple replication services per process and lot more.
H2 Database
H2 database is very fast, open source database engine. It supports SQL and JDBC standards.
Hazelcast - In-Memory Data Grid for Java
Hazelcast is a clustering and highly scalable data distribution platform for Java. It supports Distributed implementations of java.util.{Queue, Set, List, Map}, java.util.concurrency.locks.Lock, java.util.concurrent.ExecutorService, Distributed Indexing and Query support, Dynamic scaling, partitioning with backups, fail-over, Web-based cluster monitoring tool and lot more.
Cascading - Data Processing Workflows on Hadoop
Cascading is a Data Processing API, Process Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on an Apache Hadoop cluster. It is a thin Java library and API that sits on top of Hadoop's MapReduce layer and is executed from the command line like any other Hadoop application.
JPPF - Parallelize computationally intensive tasks and execute them on a Grid
JPPF enables applications with large processing power requirements to be run on any number of computers, in order to dramatically reduce their processing time. This is done by splitting an application into smaller parts that can be executed simultaneously on different machines.
3dcluster - Cluster management tool for managing clusters built using openMosix and Linux
The software aims to provide complete control for a user who manages clusters.