Wednesday, 20 April 2016

Capturing live audio from the browser and sending it to Python for speech recognition

I am trying to create a Django application that captures audio from the browser and passes it to my speech recognition script in Python, much like Google Now does: you click a button and start speaking, the audio is sent to the Python script, and the script converts it to text and returns the result. To be clear, I am not looking for a solution that waits until the entire speech has been recorded before it starts converting. The program should convert the speech to text in real time, as the user is speaking. I have not been able to figure out a way to achieve this.

I am trying to use pocketsphinx for the speech-to-text conversion. Please also suggest any better speech-to-text converters for Python that can provide this functionality.
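
To make the goal concrete, here is a rough sketch of the server-side loop I have in mind. It is only an illustration under assumptions, not a working solution: it assumes the browser sends raw 16-bit mono PCM in chunks of arbitrary size (for example over a WebSocket), and the names `reframe`, `stream_transcripts`, `feed`, `partial` and `FRAME_BYTES` are hypothetical. The recognizer is passed in as two plain callables so the sketch does not depend on any particular library. With pocketsphinx, I assume `feed` and `partial` would wrap the decoder's raw-audio processing and current-hypothesis calls.

```python
from typing import Callable, Iterable, Iterator, Optional

# Assumed frame size: 1024 samples of 16-bit mono PCM (hypothetical choice).
FRAME_BYTES = 2048


def reframe(chunks: Iterable[bytes], frame_bytes: int = FRAME_BYTES) -> Iterator[bytes]:
    """Regroup arbitrarily sized byte chunks into fixed-size frames."""
    buffer = bytearray()
    for chunk in chunks:
        buffer.extend(chunk)
        # Emit as many complete frames as the buffer currently holds.
        while len(buffer) >= frame_bytes:
            yield bytes(buffer[:frame_bytes])
            del buffer[:frame_bytes]
    if buffer:
        # Flush whatever is left once the stream ends (may be a short frame).
        yield bytes(buffer)


def stream_transcripts(
    chunks: Iterable[bytes],
    feed: Callable[[bytes], None],
    partial: Callable[[], Optional[str]],
) -> Iterator[str]:
    """Feed frames to a recognizer and yield each new partial transcript.

    `feed` and `partial` are stand-ins for a real recognizer: `feed` takes
    one audio frame, `partial` returns the best text so far (or None).
    """
    last = None
    for frame in reframe(chunks):
        feed(frame)
        text = partial()
        # Only report a hypothesis when it is non-empty and has changed.
        if text and text != last:
            last = text
            yield text
```

In a Django setup, `chunks` would be whatever iterable delivers the incoming audio messages, and each yielded string would be pushed back to the browser as soon as it is available, instead of waiting for the recording to finish.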



