Building a Balochi Language Dataset for NLP Applications

From the blog of Alex Strick van Linschoten
I’m building some language models and utilities for the Balochi language. (Read previous posts in this series for the full context.) Even though Balochi has an estimated 8-10 million speakers, it certainly falls into the ‘low-resource’ category. Many (most?) things that you’d take for granted when working with...