Difference between revisions of "Text Mining Resources"
From irefindex
PaulBoddie (talk | contribs) (New page: Some notes on open source text mining resources: * "The Text Mining Tool Evaluation project will describe the process of text mining, identify non-proprietary software that can search blo...) |
PaulBoddie (talk | contribs) (Added FreeLing and NLTK.) |
||
Line 7: | Line 7: | ||
** [http://opennlp.sourceforge.net/projects.html OpenNLP projects] | ** [http://opennlp.sourceforge.net/projects.html OpenNLP projects] | ||
** See also [http://opennlp.sourceforge.net/links.html OpenNLP links] for other resources. | ** See also [http://opennlp.sourceforge.net/links.html OpenNLP links] for other resources. | ||
+ | * [http://www.lsi.upc.es/~nlp/freeling/ FreeLing] - ''written in C++ with features from tokenisation through to part-of-speech tagging, word sense disambiguation'' | ||
+ | * [http://www.nltk.org/ NLTK] - ''written in Python with a wide range of natural language processing features'' |
Revision as of 10:50, 9 October 2009
Some notes on open source text mining resources:
- "The Text Mining Tool Evaluation project will describe the process of text mining, identify non-proprietary software that can search blocks of text to identify reports relevant to the cancer registry, and provide information to state cancer registries regarding different tools available and a comparison of the functionality provided by each tool." Evaluation of Open Source Text Mining Tools for Cancer Surveillance (HTML version from the Google cache)
- "U-Compare is an integrated text mining/natural language processing system based on the UIMA Framework." U-Compare: share and compare tools with UIMA
- "The BioNLP Unstructured Information Management Architecture (UIMA) Component Repository provides UIMA wrappers for novel and well-known 3rd-party NLP tools used in biomedical text prosessing, such as tokenizers, parsers, named entity taggers, and tools for evaluation." BioNLP UIMA Component Respository
- "OpenNLP is an organizational center for open source projects related to natural language processing." OpenNLP
- OpenNLP projects
- See also OpenNLP links for other resources.
- FreeLing - written in C++ with features from tokenisation through to part-of-speech tagging, word sense disambiguation
- NLTK - written in Python with a wide range of natural language processing features