The Common Voice project is Mozilla’s initiative to make speech recognition software more open, accessible, and inclusive. You see, in order to create usable voice technology, an extremely large amount of voice data is required. Most of the data used by large companies isn’t available to the majority of people, and we think that stifles innovation. So we are gathering voices from all over the world in the hope to build the biggest open-source voice dataset around. This data can then be used make open-source speech recognition that is as good as any commercial products in the market today.
Until now though, we have only managed to gather voices in English. But we believe that technology should serve everyone, not just majorities. That is why we are working to launch Common Voice in as many new languages as possible. And for that we need your help!
Would you like to help us create open-source voice technology for your language(s)? The first step is gathering voice data in your language(s). But in order to do this, we need thousands of sentences for people to read. So we are asking you to submit as many sentences as possible in your language(s) so future contributors can read them on the Common Voice website.
Still interested?Get involved!