Audio Classification with Counter
I'm trying to create a model that can identify one particular sound, and every time it hears that sound, it increases a counter by 1. So for example, if it hears a specific bird chirping ten times, the counter should display the number 10.
I'm looking for a bit of guidance here as to how to go about this. I know that I will need to use audio classification and for my data, I only have .wav files of that one particular sound since I recorded them with my iPhone. Currently, I'm just preprocessing the data and doing some EDA with librosa in Python.
Here's my questions:
1. Is there a way to train a model to only identify one particular sound based on the data I have? (and if it hears anything different, it's classified as 0/no)
2. How would I go about combining the audio classification model (given that it works) with a counter that can capture a real time sound, process it and display a number if its that particular sound?
I don't really know where to start when I think about these questions. Any help regarding what libraries I may need, terms I should research or know regarding sound/audio data or just your own personal opinion on how you would tackle this problem would be much appreciated.
Topic audio-recognition deep-learning neural-network classification python
Category Data Science