š“ó §ó ¢ó ·ó ¬ó ³ó 暬š§ Curating a Welsh-English Translation Dataset for language models
Modern large language models have strong multilingual capabilities, but languages with a small digital footprintĀ on the internet, like Welsh, remain underserved. This matters: as AI becomes more prevalent, poor support for theseĀ ālow-resourceāĀ languagesĀ createsĀ a digital divide with minority communities andĀ risks reducingĀ the languageāsĀ active use