
Aya
About
The Aya family of large language models (LLMs), developed by Cohere for AI, marks a significant stride in multilingual artificial intelligence. This initiative is focused on bridging cultural and linguistic gaps worldwide by expanding AI's language capabilities. The Aya models, ranging in size and functionality, include Aya-101, which supports 101 languages and delivers superior performance compared to models like mT0 and BLOOMZ in various tests 412. Trained on datasets such as xP3x and the Aya collection, these models exemplify excellence in instruction-following tasks. Additionally, the Aya 23 series, available in both 8B and 35B parameters, focuses on 23 languages, enhancing multilingual conversations through robust instruction-fine-tuning 18. The Aya project embodies an open-science approach, engaging thousands of researchers globally to push the boundaries of multilingual LLMs 5.