...

Text Analysis
for Oriental Languages

Discover our lemmatization and POS-tagging engine for Oriental and Classical Armenian, Old Georgian and Syriac.

Specific text analysis for Oriental and Classical Armenian, Old Georgian, Syriac, Ancient Greek.

Massive annotation of corpora

Calfa takes charge of the processing of your corpus, even if it contains billions of forms to be analysed.

Contextual annotation

Our IA engine backed by Calfa Dictionary can analyse words in context for a precise and safe data annotation.

Scalable and multilevel solutions

Configure the engine to use specific tag sets, on different levels : lemma, part of speech, morphological.

Our partners

...

GREgORI Project

Softwares, linguistic data and tagged corpus for ancient GREek and ORIental languages.

...

Eastern Armenian National Corpus

EANC is a comprehensive linguistic database of annotated texts in Standard Eastern Armenian (SEA).

...

Digilib

Digital Library of Armenian Literature of the American University of Armenia.

...

Arak29

Text and Concordance of the 1895 Bible in Classical Armenian, with word-by-word grammatical parsing and English gloss.

Try the analysis engine

Send us texts elements to get a demonstration of the analysis on your documents

Contact us for a demo

Learn more

Chahan Vidal-Gorène and Bastien Kindt, Lemmatization and POS-tagging process by using joint learning approach. Experimental results on Classical Armenian, Old Georgian, and Syriac. In Proceedings of LT4HALA 2020 - 1st Workshop on Language Technologies for Historical and Ancient Languages, pp. 22–27, Marseille, France, May 2020. European Language Resources Association (ELRA).
PDF BibTeX

Chahan Vidal-Gorène, Victoria Khurshudyan, and Anaïd Donabédian. Recycling and Comparing Morphological Annotation Models for Armenian Diachronic-Variational Corpus Processing. in Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, pp. 90-101, Barcelona, Spain, Dec. 2020. International Committee on Computational Linguistics (ICCL).
PDF BibTeX