GATE - A General Architecture for Text Engineering
In my previous article, Combine Crunch and Lucene for Efficient Web Page Indexing, I mentioned that I used Crunch and Lucene in one of my projects. The project actually aims to build a semantic...
View ArticleProtégé - Open Source Ontology Editor and Knowledge Acquisition System
Protégé is a free, open source ontology editor and knowledge-base framework. I used it together with GATE, Crunch , Lucene,and other tools to create my knowedge-based system with automated ontology...
View ArticleJava - Build Your Semantic Web Application using Jena
In my previous articles, I talked about GATE, Protégé, Crunch and Lucene. I used all these tools together with Jena to build a semantic web application. If you have no idea what semantic web is, here...
View ArticleLoad DMOZ RDF Structure and Content RDF
The DMOZ Open Directory Project is the largest human edited directory of web. As part of my research area, I need to load the structure and content RDF into MySQL database. At first I was trying to use...
View ArticleBuild Domain Knowledge by Extracting Keywords from DMOZ
Download Source This is a experiment I am currently doing, extracting keywords from categories in DMOZ to see how accurate it is to be used for web page categorization. From my previous post, I load...
View ArticleJava - Open Source Social Networking Applications
Here is the link to a list of open source social networking applications in Java. http://www.manageability.org/blog/stuff/java-open-source-social-network This is an intesting area that I am currently...
View ArticleNatural Language Processing using OpenNLP Tools
I used tools from OpenNLP as part of my research on natural language processing and semantic web OpenNLP is an organizational center for open source projects related to natural language processing. It...
View ArticleCalais - Annotates Content with Rich Semantic Metadata
As quoted from the website, Calais seeks to help make all the worlds content more accessible, interoperable and valuable via the automated generation of rich semantic metadata, the incorporation of...
View ArticleNatural Language Toolkit
NTLK is a set of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for...
View Article
More Pages to Explore .....