L'Agence nationale de la recherche Des projets pour la science

Investissements d'avenirProjets financés


Outils et Ressources pour un Traitement Optimisé des LANGues (ORTOLANG)


Action : Equipements d'excellence


N° de convention : 11-EQPX-0032

Informations générales

  • Référence projet : 11-EQPX-0032
  • RST : Etienne PETITJEAN
  • Etablissement Coordinateur : CNRS Centre Est (Vandoeuvre)
  • Région du projet : Grand Est
  • Discipline : 6 - SHS
  • Aide allouée : 2 600 000 €
  • Date de début du projet : 01/09/2012
  • Date de fin du projet : 31/12/2019
  • Site web du projet : https://www.ortolang.fr
  • Mots clés : plateforme; mutualisation; ressources; outils; français; langues de france

Résumé du projet

ORTOLANG (Open Resources and Tools for Language: https://www.ortolang.fr) is a French infrastructure implemented in the framework of the “Programme d’Investissement d’Avenir” (PIA) funded by the “Investissements d’Avenir” French Government program. The main purpose of ORTOLANG is to offer a repository of linguistic data (corpora, lexicons, dictionaries, etc.) and tools on languages and their processing readily available and well-documented which: enable analysis research on analysis, modelling and automatic treatment of our language to reach the best international level by way of  a real pooling of resources; ease the use and transfer of resources and tools created by public laboratories towards industrial partners, in particular towards SME which cannot often develop such resources and tools for language treatment due to the creation costs; promote the French language and local languages of France by sharing the knowledge acquired by public laboratories. The ORTOLANG platform is intended to be a mutualisation infrastructure for the management, long-term preservation and dissemination of language resources such as corpora, lexicons, terminologies and language processing tools. This project also aims to generalise and ensure the perpetuation of the efforts made by digital resources centres on language, CNRTL (CentredeRessourcesTextuellesetLexicales, www.cnrtl.fr) and the SLDR (Speech and Language Data Repository, http://sldr.org), setup by the CNRS. It has the ambition to be used as a French node and major support for France’s commitment within the CLARIN infrastructure (Common Language Resources and Technology Infrastructure: www.clarin.eu) of which we have been partners from the very beginning. It is also used as a technical language platform of written, oral language forms, to support the coordination actions of the TGIR HumaNum (http://www.huma-num.fr/).    

(L'auteur de ce résumé est le coordinateur du projet, qui est responsable du contenu de ce résumé. L'ANR décline par conséquent toute responsabilité quant à son contenu.)