Tools

Text analysis & Labelling

AntConc is a freeware corpus analysis toolkit for concordancing and text analysis.

Callimachus is a regest of Greek and Latin Papyri (and Coptic papyri containing Greek words).

CorpusSearch 2 supports corpus linguistics research. It is useful both for the construction of syntactically annotated (parsed) corpora and for searching them. Running CorpusSearch on an appropriately annotated corpus a user can automatically: find and count lexical and syntactic configurations of any complexity, correct systematic errors,code the linguistic features of corpus sentences for later statistical analysis.

ediarum (ed)  is a solution consisting of several software components that allows scientists to edit transcriptions of manuscripts and prints in TEI-compliant XML, to provide them with a text and subject apparatus as well as registers and to publish them on the web and in print.

EpiDoc provides guidelines and tools for encoding scholarly and educational editions of ancient documents. It uses a subset of the Text Encoding Initiative's standard for the representation of texts in digital form and was developed initially for the publication of digital editions of ancient inscriptions. Its domain has expanded to include the publication of papyri and manuscripts. More

Hypothesis an online tool for annotating the web.

Lexos a web-based tool to help you explore your favorite corpus of digitized texts.

Lyneal a web-based tool to help you explore your corpus and to explain linguistic phenomena.

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modelling, information extraction, and other machine learning applications to text. Topic models are useful for analyzing large collections of unlabeled text and the MALLET topic modelling package is used frequently in digital humanities textual analysis.

oXygen suite of XML authoring, editing, and development tools.

Recogito is an online platform for collaborative document annotation. Recogito provides a personal workspace where you can upload, collect and organize your source materials - texts, images and tabular data - and collaborate in their annotation and interpretation.

Roma  is a tool for working with TEI customizations.

Tapas (TEI) provide TEI publishing and repository services at low cost to those who lack institutional resources: faculty, students, librarians, archivists, teachers, and anyone else with TEI data who wants to store, share, and publish it.

TEI (Text Encoding Initiative). Tools for creating, editing, transforming, and publishing TEI documents and schemas using the P5 Guidelines TEI.

TEIGarage  TEIGarage is a webservice and RESTful service to transform, convert and validate various formats, focussing on the TEI format. TEIGarage is based on the proven OxGarage.

TEITOK  is a web-based platform for viewing, creating, and editing corpora with both rich textual mark-up and linguistic annotation.

TextGrid services and tools to create, manage and edit your XML-based research data.

Transkribus  is a platform for the text recognition, image analysis and structure recognition of historical documents.

Voyant a web-based reading and analysis environment for digital texts.

XML Copy editor is a free software that allows editing XML and its associated technologies.