The AINA Project, Artificial Intelligence, and Language Technologies

Authors
Marta Villegas Montserrat, Barcelona Supercomputing Center
https://orcid.org/0000-0003-0711-0029

DOI: 10.2436/20.2503.01.189

Keywords: AI, language technologies, Catalan, NLP

Abstract
Natural Language Processing (NLP) is one of the most relevant areas of AI. Although most large language models are now multilingual, there is still a substantial gap between what these models can do in English and what they can do in other languages. The AINA project therefore aims to develop the infrastructure needed to make the inclusion of Catalan in AI applications both appealing and feasible. This article presents the objectives of the project and explains its main characteristics.

Author Biography
Marta Villegas Montserrat, Barcelona Supercomputing Center
Marta Villegas has worked as a researcher in the field of natural language processing for more than 25 years. She currently heads the Language Technologies Unit at the Barcelona Supercomputing Center - Centro Nacional de Supercomputación, where she leads the work on developing language models. The Unit recently compiled the largest Spanish and Catalan corpus ever created and has developed reference transformer models that have had a major impact in both academia and industry. She coordinates the AINA project and is responsible for several national and European projects.

Published
2023-06-29

How to Cite
Villegas Montserrat, M. (2023). The AINA Project, Artificial Intelligence, and Language Technologies. Terminàlia, 1(27). Retrieved from https://revistes.iec.cat/index.php/Terminalia/article/view/150579

Issue
No. 27: June 2023

Section
Topic: The Terminology of the Artificial Intelligence

License
Authors registered on the OJS platform must read the copyright assignment terms and fill in the corresponding acceptance box.
The intellectual property of articles belongs to the respective authors.
On submitting articles for publication to the journal Terminàlia, authors accept the following terms:
Authors assign to SCATERM (a subsidiary of Institut d'Estudis Catalans) the rights of reproduction, communication to the public and distribution of the articles submitted for publication to Terminàlia.
Authors answer to SCATERM for the authorship and originality of submitted articles.
Authors are responsible for obtaining permission for the reproduction of all graphic material included in articles.
SCATERM declines all liability for the possible infringement of intellectual property rights by authors.
The contents published in the journal, unless otherwise stated in the text or in the graphic material, are subject to a Creative Commons Attribution-NonCommercial-NoDerivs (by-nc-nd) 3.0 Spain licence, the complete text of which may be found at https://creativecommons.org/licenses/by-nc-nd/3.0/es/deed.en.
Consequently, the general public is authorised to reproduce, distribute and communicate the work, provided that its authorship and the body publishing it are acknowledged, and that no commercial use and no derivative works are made of it.
The journal is not responsible for the ideas and opinions expressed by the authors of the published articles.