|Thread title||Replies||Last modified|
|But how to actually use Simon for voice recognition?||7||20:47, 9 April 2015|
I have spent several days with Simon. Very very cool. Have installed it, trained it, downloaded the Firefox scenario, installed the Mouseless browsing addon for Firefox, read the manual. Very comprehensive preparation. Well done.
But how do I actually use it for voice recognition?
It's running with the main page (Scenarios, Training, Acoustic Model, Recognition) open. The VU meter is fluttering. "Volume is correct" is checked with a green check mark. "Connected and activated" is shown, and "Finished" is at 100%.
So, now what? How to actually use it?
Yes, thank you, but I did not find anything about actually using it, just how to set it up. Maybe I don't have the right manual - the downloadable PDF?
I've just gone through the link you gave, which is what I have. I see how to make recordings, I've done that. I see how to train it, I've done that. Like I said, I've done all the precursors, but cannot find anything that says, for example:
In order to control Firefox click on xxxx, say yyy etc.
In order to convert speech to text, start up kate and click zzz
Easy when you know how, I'm sure. But I don't see anything that shows how.
You do not have permission to edit this page, for the following reason:
You can view and copy the source of this page.
Simon developer here.
You are right in saying that you need a recognizer (3rd party application). I suspect that you installed the "unstable" package from the OBS? I just looked at it and the package is quite broken (*at least* a few missing dependencies). I reported the issue(s) to the maintainer of the package.
From what I can see, you should still be able to get PocketSphinx to run, if you manually install it after you install Simon (again, this should be a dependency). If you can live with a static base model (no training), you need neither the HTK nor SphinxTrain. PocketSphinx (for SPHINX models) and Julius (for HTK models) will suffice.
If you have problems compiling the HTK or SphinxTrain, please notify the appropriate maintainers. This is sadly out of my control. FWIW, Nickolay, the maintainer of SPHINX is extremely responsive and very helpful. You can find him on the IRC channel #cmusphinx on Freenode (nickname: nshm).
As for the "what to do after installation" bit. If you have installed some scenarios and a fitting base model (look at the tags: "[EN/H4W]..." scenarios need the "[EN/H4W] ..." base model, etc.), Simon should auto-activate right after completing the wizard (EN/H4W requires PocketSphinx). From there on you can say the commands that are defined in the scenarios you downloaded. The scenario description usually contains a list. E.g.: http://kde-files.org/content/show.php/%5BEN%2BH4W%5D+Firefox?content=156100
Please note that at the moment, there is no free dictation ("speech to text"), just commands.
Best regards, Peter