This guide provides information about available text mining resources and tools and whether or not the Libraries subscription databases support content mining.
Aggregates information about digital research tools for scholarly use. DiRT makes it easy to find and compare resources available for text mining and data visualization (among others).
Designed to allow researchers to easily harvest full text documents from all participating publishers regardless of their business model (e.g. open access, subscription). Provides step-by-step instructions
A collection of tools to document classification, sequence tagging, and topic modeling. There is also an add-on toolkit (Graphical Models in MALLET) for visualization.
This collection of text analysis tools hosted by the University of Alberta providing XML, HTML, and plain text analysis. Upload documents to extract common words, determine colocates, separate HTML tags, and extract XML tagged information.
An easy to use and free text analysis tool. Upload text and Voyant will automatically determine word frequencies and colocates and display them graphically
An application for the close reading and scholarly analysis of deeply tagged texts.
WordHoard contains the entire canon of Early Greek epic in the original and in translation, as well as all of Chaucer, Shakespeare, and Spenser.
A collection of text analysis tools targeted at humanities scholars that includes side-by-side comparison, grammatical search, and document/sentence/word-set features.
An environment for digital humanities computational work can be time-consuming and difficult. DHBox addresses this problem by streamlining installation processes and providing a digital humanities laboratory in the cloud through simple sign-in via a web browser.
It comes pre-equipped with IPython, RStudio, Omeka, and NLTK.
Created by Harvard. A tool for visualizing trends in repositories of digitized texts. Uses metadata and books collected by the Open Library. It at once describes the contents of the library as a whole in a useful and intuitive way.