How can I build a speech-to-text model for my own voice?

I found instructions for building this kind of custom model on Azure:

Prepare data for Custom Speech

However, I would like to either fine-tune an existing model or apply transfer learning, running on Google Colaboratory or in Docker.

In that case, what machine learning framework do you recommend using?

If you know of any GitHub repos or articles that address this problem, could you share them with me?
