Home

Awesome

ParTUT

ParTUT is a morpho-syntactically annotated collection of Italian/French/English parallel sentences, which includes texts from different sources and representing different genres and domains, released in several formats. See also http://www.di.unito.it/~tutreeb/treebanks.html .

ParTUT comprises approximately 167,000 tokens, with an average amount of 2,100 sentences per language. The texts of the collection currently available were gathered from a large number of sources and domains:

Since release 2.0, ParTUT is also available in the Universal Dependencies format (see here for English, here for French, and here for Italian).

References

If you use the resource, please cite: