Browsing projects by Tag(s)

The amount and diversity of information is growing exponentially, mainly in the area of unstructured data such as emails, text files, blogs and images. Poor data accessibility, poor integration of user rights and the lack of semantic metadata are constraining factors for building next-generation enterprise search and other document-centric applications. Missing standards result in proprietary solutions with high short- and long-term costs. SMILA is an extensible framework for building search solutions to access unstructured information in the enterprise. Besides providing essential infrastructure components and services, SMILA also delivers ready-to-use add-on components, such as connectors to the most relevant data sources. Using the framework as their basis will enable developers to concentrate on the ...

0
 
  0 reviews  |  2 users  |  296,774 lines of code  |  6 current contributors  |  Analyzed 10 days ago
 
 

conStruct is a middleware layer that allows structured data (RDF) and associated vocabularies (ontologies) to "drive" tailored tools and data displays within Drupal. The basic conStruct module provides CRUD (create, read, update, delete), search, browse, and some import and export capabilities for structured datasets. conStruct connects to the underlying structured (RDF) data via the separately available structWSF Web services framework. structWSF is a RESTful Web services layer that also allows multiple conStruct and Drupal installations to share and collaborate on structured data with one another, governed by user access rights and privileges to registered datasets. Collaboration networks can also be established directly with distributed structWSF servers.

5.0
 
  0 reviews  |  2 users  |  15,968 lines of code  |  0 current contributors  |  Analyzed over 1 year ago
 
 

The Semantic Vectors package creates semantic vector indexes by applying a Random Projection algorithm to term-document matrices created using Apache Lucene. The package was created as part of a project by the University of Pittsburgh Office of Technology Management to explore the potential for automatically matching related concepts in the technology management domain, e.g., mapping new technologies to potentially interested licensors. This project can be found at http://real.hsls.pitt.edu.

The package creates a WordSpace model of the kind developed by Stanford University's Infomap Project and other researchers during the 1990s and early 2000s. Such models are designed to represent words and documents in terms of underlying concepts, and as such can be used for many semantic (concept-aware) matching tasks such as automatic thesaurus generation, knowledge representation, and concept matching.

The Semantic Vectors package uses a Random Projection algorithm, a form of automatic semantic analysis similar to Latent Semantic Analysis (LSA) and its variants such as Probabilistic Latent Semantic Analysis (PLSA). However, unlike other methods, Random Projection does not rely on computationally intensive matrix decomposition algorithms such as Singular Value Decomposition (SVD). This makes Random Projection a much more scalable technique in practice. Our application of Random Projection to Natural Language Processing (NLP) is descended from Pentti Kanerva's work on Sparse Distributed Memory; in semantic analysis and text mining, this method has also been called Random Indexing. A growing number of researchers have applied Random Projection to NLP tasks, demonstrating (i) semantic performance comparable with other forms of Latent Semantic Analysis and (ii) significant computational performance advantages in creating and maintaining models.

Documentation: Java API documentation is at http://semanticvectors.googlecode.com/svn/trunk/doc/index.html. Installation help can be found in the InstallationInstructions. Help on using SemanticVectors for DocumentSearch is also available, as is a page with links to more RelatedResearch. The package requires Apache Ant and Apache Lucene to be installed, and the Lucene classes must be available in your CLASSPATH.

User Group: Issues and bugs can be posted using the project's Issues tab. More general questions and discussions may be posted at the group webpage, http://groups.google.com/group/semanticvectors.

Originally written by Dominic Widdows, in collaboration with Kathleen Ferraro and the University of Pittsburgh. The project is now maintained and extended by a small group of developers, as listed in the SemanticVectors AUTHORS file.

Projects Using Semantic Vectors: We're starting a list of ProjectsUsingSemanticVectors. We're aware of a few more that we'll try to add in due course: please visit this page and leave comments if you know of any.
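
The Random Indexing idea described in the Semantic Vectors entry above can be illustrated with a small, self-contained sketch. This is not the Semantic Vectors API (the package itself is written in Java and builds on Lucene indexes); the function names, the vector dimension and the number of non-zero entries below are all hypothetical choices made for illustration. Each document receives a sparse random vector of +1/-1 entries, and each term vector is the sum of the random vectors of the documents in which the term occurs, so no matrix decomposition such as SVD is needed.

```python
import math
import random
from collections import defaultdict


def elemental_vector(dim, nonzeros, rng):
    """Sparse ternary random vector: `nonzeros` entries are +1 or -1, the rest 0."""
    vec = [0.0] * dim
    positions = rng.sample(range(dim), nonzeros)
    for i, pos in enumerate(positions):
        # Alternate signs so that half the non-zero entries are +1, half -1.
        vec[pos] = 1.0 if i % 2 == 0 else -1.0
    return vec


def build_term_vectors(docs, dim=512, nonzeros=8, rng_seed=0):
    """Accumulate, for each term, the random vectors of the documents containing it.

    Illustrative assumption: documents are plain strings tokenized on whitespace.
    A term that occurs several times in a document adds that document's
    vector once per occurrence.
    """
    rng = random.Random(rng_seed)
    term_vectors = defaultdict(lambda: [0.0] * dim)
    for doc in docs:
        doc_vec = elemental_vector(dim, nonzeros, rng)
        for term in doc.lower().split():
            target = term_vectors[term]
            for i, value in enumerate(doc_vec):
                target[i] += value
    return dict(term_vectors)


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    docs = [
        "patent license technology",
        "patent license transfer",
        "banana fruit",
        "banana smoothie",
    ]
    vectors = build_term_vectors(docs)
    print("patent vs license:", cosine(vectors["patent"], vectors["license"]))
    print("patent vs banana: ", cosine(vectors["patent"], vectors["banana"]))
```

In this toy corpus, "patent" and "license" occur in exactly the same documents, so their vectors coincide, whereas "banana" shares no documents with them and its vector is close to orthogonal. Real indexes would add tokenization, term weighting and far larger corpora.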

0
 
  0 reviews  |  1 user  |  10,918 lines of code  |  5 current contributors  |  Analyzed 3 days ago
 
 

Semsearch is a keyword-based semantic search engine that aims to hide the complexity of semantic search and make it suitable for naive users who are not necessarily familiar with the problem domain or with the specified query language. Semsearch adopts a layered architecture comprising five layers: i) a Google-like query interface, which provides a straightforward way of specifying queries using multiple keywords; ii) a text search layer, which locates the keywords in the underlying domain ontology and semantic data repositories; iii) a query layer, which translates user queries into formal queries; iv) a formal query layer, which retrieves results from the semantic data repositories; and v) a semantic data layer, which contains ontologies and semantic metadata of the problem domain. This work was (partly) funded by the X-Media project (www.x-media-project.org), sponsored by the European Commission as part of the Information Society Technologies (IST) programme under EC grant number IST-FP6-026978.

0
 
  0 reviews  |  0 users  |  0 current contributors  |  Analyzed 5 days ago
 
 

This is a student project. It is implemented in Java, aimed at end users, and intended to develop a semantic search engine.

0
 
  0 reviews  |  0 users  |  0 current contributors  |  Analyzed about 2 years ago
 
 

This work was co-funded by the X-Media project (www.x-media-project.org), sponsored by the European Commission as part of the Information Society Technologies (IST) program under EC grant number IST-FP6-026978. It supports exploration, search and analysis of semantic data. Data are elements of RDF/OWL ontologies, which are manually added to the system using an annotation tool or extracted from documents and web pages. For search, a combination of semantic search and classical index-based search is also supported.

0
 
  0 reviews  |  0 users  |  1,416,655 lines of code  |  0 current contributors  |  Analyzed 1 day ago
 
 

A complete end-to-end NLP document analysis and search engine solution for English, based on existing technologies such as link grammar, relex and HyperGraphDB.

0
 
  0 reviews  |  0 users  |  27,562 lines of code  |  1 current contributor  |  Analyzed 10 days ago
 
 

Framework for qualifying knowledge acquisition and inquiry by/over/from the semantic web.

0
 
  0 reviews  |  0 users  |  18,871 lines of code  |  0 current contributors  |  Analyzed over 1 year ago
 
 

Open Sahara is an open source framework for text mining, developed by Talking Trends. Open Sahara provides scalable functionality for harvesting and annotating content, natural language processing, semantic indexing, storage and searching.

0
 
  0 reviews  |  0 users  |  56,818 lines of code  |  3 current contributors  |  Analyzed 5 days ago
 
 

LazyCat is, in a word, a knowledge and document management solution for small and medium-sized organizations. It is designed around the following principles: ease of use, effective sharing and authoring of knowledge and documents, flexible access control, and scalability in a clustered environment. By leveraging leading open source projects such as JBpm, Jackrabbit, BIRT, Spring, Acegi, Hibernate, EHCache, FCKEditor, eXtremeComponents (JMesa), ClickStreams and OpenSSL, it aims to build a stable and extensible infrastructure for digital information resource management, virtual organization modelling, a role-based access control framework, and a facility for executing and monitoring business processes. On top of this infrastructure, a friendly web UI provides key functions including full-text and semantic search, wiki-like content creation and organization, and process-driven collaboration and auditing. So far, however, we are still researching and evaluating these open source projects in order to produce designs that accomplish this goal. We welcome guidance and help from all over the world!

0
 
  0 reviews  |  0 users  |  0 current contributors  |  Analyzed 6 days ago
 
 
 
 

Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License. Ohloh® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.