Displaying 1 to 10 from 111 results
S-Space - A scalable software library for semantic spaces
The S-Space Package is a collection of algorithms for building Semantic Spaces as well as a highly-scalable library for designing new distributional semantics algorithms. Distributional algorithms process text corpora and represent the semantic for words as high dimensional feature vectors.
Semantic Vectors - Creating and Searching Semantic Vector using Lucene
The Semantic Vectors package uses a Random Projection algorithm, a form of automatic semantic analysis. Other methods supported by the package include Latent Semantic Analysis (LSA) and Reflective Random Indexing. Latent Semantic Analysis (LSA) is a theory and method for extracting and representing the contextual-usage meaning of words by statistical computations applied to a large corpus of text. This library is used in semantic analysis and text mining.
OpenCog - Framework to build Artificial Intelligence Programs
The OpenCog Framework is a platform to build and share artificial intelligence programs. It includes components for procedural and declarative knowledge representation (AtomSpace), task scheduling (CogServer), AI algorithm containers (MindAgents), connectors to instant messaging and virtual world systems, and other components. MindAgents and other add-ons explore a wide variety of AI techniques including evolutionary program learning (MOSES), natural language processing, and others.
Gate - General Architecture for Text Engineering
GATE excels at text analysis of all shapes and sizes. It provides support for diverse language processing tasks such as parsers, morphology, tagging, Information Retrieval tools, Information Extraction components for various languages, and many others. It provides support to measure, evaluate, model and persist the data structure. It could analyze text or speech. It has built-in support for machine learning and also adds support for different implementation of machine learning via plugin.
Language Detection - Language Detection Library in Java
This is a language detection library implemented in plain Java. It detects language of a text using naive Bayesian filter. It is 99% over precision for 53 languages.
CLD - Language Detector library ported from Chrome browser
This is a straight port from the CLD (Compact Language Detector) library embedded in Google's Chromium browser. The library detects the language from provided UTF8 text (plain text or HTML). It is implemented in C++, with very basic Python bindings.
OpenPipe - Document Pipeline
OpenPipe is an open source scalable platform for manipulating a stream of documents. A pipeline is an ordered set of steps / operations performed on a document to convert from its raw form to something ready to be put into the index.
The operations performed on documents include language detection, field manipulation, POS tagging, entity extraction or submitting the document to a search engine.
Aconcorde - English and Arabic concordancer
A linguistics tool for generating concordance from electronic texts/corpora. aConCorde has a bias towards Arabic language processing. This tool is written in Java.
Anota - High-Performance High-Volume Layered Corpora Annotation
Our framework aims to simplify the integration of independently developed NLP tools, while providing an easy-to-use programming environment, and transparent handling of distributed computing problems, such as fault tolerance and task scheduling, when executing the NLP tools in parallel. Moreover, NLP systems built on top of the framework should are language-independent and produce layered annotations.
Ai-ucla - Artificial Intelligence code and info from AI UCLA
This group fosters interest in and helps support research about artificial intelligence (natural language processing, machine learning, artificial life, etc.) at UCLA. We sponsor individuals' research and deepen our own understanding of artificial intelligence as well as push the research forward including doing group projects.