How can I build a speech-to-text model for my own voice?
I found instructions for building this kind of custom model on Azure:
Prepare data for Custom Speech
However, I would like to either fine-tune a model or use transfer learning on Google Colaboratory or in Docker.
In that case, what machine learning framework do you recommend using?
If you know of any GitHub repos or articles on this problem, could you share them with me?
Topic finetuning transfer-learning speech-to-text training nlp
Category Data Science