Check out our new site!
Our text analysis tools are now freely available at www.linguisticanalysistools.org. Links on this page point to pages on that site.
Also see my GitHub page for Python packages and other scripts (I am updating the GitHub page to include code for all the tools listed below).
CLA is a simple but powerful text analysis tool. One can use CLA to analyze texts using very large custom dictionaries. In addition to words, custom dictionaries can include n-grams and wildcards.
Click here to learn more
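As a rough illustration of dictionary-based analysis with n-grams and wildcards, here is a minimal Python sketch. It is not CLA's code: the function names, the toy dictionary, and the wildcard rule (`*` stands for any run of word characters) are assumptions made for this example, and CLA's own matching rules may differ.

```python
import re

def compile_entry(entry):
    """Turn a dictionary entry such as 'happ*' or 'in order to' into a regex."""
    token_patterns = []
    for token in entry.lower().split():
        # Escape the token, then let '*' stand for any run of word characters
        # (an assumed wildcard rule, not necessarily CLA's).
        token_patterns.append(re.escape(token).replace(r"\*", r"\w*"))
    # Tokens of an n-gram must appear in order, separated by whitespace.
    return re.compile(r"\b" + r"\s+".join(token_patterns) + r"\b")

def count_categories(text, dictionary):
    """Count how often each category's entries occur in the text."""
    text = text.lower()
    return {
        category: sum(len(compile_entry(e).findall(text)) for e in entries)
        for category, entries in dictionary.items()
    }

# Hypothetical toy dictionary: category name -> list of entries.
toy_dictionary = {
    "positive": ["happ*", "good"],
    "purpose": ["in order to"],
}

result = count_categories(
    "She was happy, happier in order to feel good.", toy_dictionary
)
print(result)  # {'positive': 3, 'purpose': 1}
```

In this sketch, entries that overlap (for example `happ*` and `happy` in the same category) would each be counted, so a real dictionary would need a rule for resolving such cases.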
CRAT is an easy-to-use tool that includes over 700 indices related to lexical sophistication, cohesion, and source text/summary text overlap. CRAT is particularly well suited for exploring writing quality as it relates to summary writing.
Click here to learn more
SEANCE is an easy-to-use tool that includes 254 core indices and 20 component indices based on recent advances in sentiment analysis. In addition to the core indices, SEANCE supports a number of customized indices, including filtering for particular parts of speech and controlling for instances of negation.
Click here to learn more
SiNLP is a simple tool that allows users to analyze texts with regard to the number of words, number of types, TTR, letters per word, number of paragraphs, number of sentences, and number of words per sentence for each text. In addition, users can analyze texts with regard to their own custom dictionaries.
Click here to learn more
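The counts listed for SiNLP can be approximated with a few lines of Python. The sketch below is not SiNLP's code: the function name and the tokenization and sentence-splitting rules are simplifying assumptions, so its numbers may differ from the tool's output on real texts.

```python
import re

def basic_text_counts(text):
    """Compute simple descriptive counts for one text (illustrative only)."""
    # Paragraphs: blocks separated by at least one blank line (assumption).
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    # Sentences: naive split after '.', '!' or '?' followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    # Words: lowercase letter runs, optionally with one internal apostrophe.
    words = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    types = set(words)  # unique word forms
    n_words = len(words)
    n_letters = sum(len(w.replace("'", "")) for w in words)
    return {
        "n_words": n_words,
        "n_types": len(types),
        # TTR (type-token ratio) = number of types / number of words.
        "ttr": len(types) / n_words if n_words else 0.0,
        "letters_per_word": n_letters / n_words if n_words else 0.0,
        "n_paragraphs": len(paragraphs),
        "n_sentences": len(sentences),
        "words_per_sentence": n_words / len(sentences) if sentences else 0.0,
    }

sample = "The cat sat. The cat ran!\n\nA dog barked."
counts = basic_text_counts(sample)
print(counts["n_words"], counts["n_types"], counts["n_sentences"])  # 9 7 3
```

Because TTR falls as texts get longer, values from this kind of count are best compared across texts of similar length.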
TAACO is an easy-to-use tool that calculates 150 indices of both local and global cohesion, including a number of type-token ratio indices, adjacent overlap indices, and connectives indices.
Click here to learn more
TAALED is an analysis tool designed to calculate a wide variety of lexical diversity indices. Homographs are disambiguated using part of speech tags, and indices are calculated using lemma forms. Indices can also be calculated using all lemmas, content lemmas, or function lemmas.
Click here to learn more
TAALES is a tool that measures over 400 classic and new indices of lexical sophistication, and includes indices related to a wide range of sub-constructs. Included are indices for both single words and n-grams. Starting with version 2.2, TAALES also provides comprehensive index diagnostics.
Click here to learn more
TAASSC is an advanced syntactic analysis tool that measures fine-grained indices of clausal and phrasal complexity, classic indices of syntactic complexity, and frequency-based verb argument construction indices.
Click here to learn more