Sound Classification for Multiple Classes for English Letters

I have recorded audio files for the English letters, each file includes 26 letters. I have split each letter into a separate audio file. Now I want to put similar audio letters into one folder. I can do it manually but it will take time. Is there a classifier method to this?

Topic audio-recognition multiclass-classification classification

Category Data Science


If in each file the letters are spoken separately, with silence in between, and always in the same order (A,B...,Z) then one can try to automate finding each section and its label. Use a Voice Activity Detection (VAD) module to detect each spoken character. Then assign A to the first voiced area, B to the next etc. If you have very clean and uniform audio, then it may work to use volume instead of voice activity detector.

Note that this process is a bit vulnerable to missed sections, because then all the subsequent labels will be shifted. It is also vulnerable to speakers saying characters out of order, or repeated.

An alternative would be to use a Speech Recognition system. That should be able to handle arbitrary order of utterances.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.