We believe in a world where the default AI path for developers is trustworthy, safe, and open
Fine-tune a speech recognition model for your voice
This blueprint enables you to create your own Speech-to-Text dataset and model, optimizing performance for your specific language and use case. Everything can run locally - even on your laptop, ensuring your data stays private. You can fine-tune a model using your own data or leverage the Common Voice dataset, a community-led project from Mozilla that supports a wide range of languages. To see the full list of supported languages, visit the CommonVoice website.
Trusted open source tools used for this Blueprint

Use HF Transformers to fine-tune the ASR model, and HF Hub to load Common Voice.
Insights into our motivations and key technical decisions throughout the development process.
See examples of extended blueprints unlocking new capabilities and adjusted configurations enabling tailored solutions—or try it yourself.