How to get the engine to successfully recognize a phrase from a voice file (wav/mp3/etc..)?
For example, if I'll have a voice file and a written text of the context of the same file, so to make it recognize the written words in the voice file.
I tried to play around with the SpeechRecognitionEngine, but without success so far.
I'll appreciate ideas, since this is my first time dealing with Speech Recognition techniques.
I've seen examples of speech-to-text using dictionaries, but I'm not sure how it can be useful here. I was thinking of maybe converting the all voice file to text, and then simply look for the specific phrase in that text, but I don't think it's the right way. Doesn't seem to make sense to convert for example 5hrs voice to text.... or maybe to use the specific phrase as a "dictionary" and to look for this item in the voice file.
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…