But how to actually use Simon for voice recognition?

Hi Alavarre,

Simon developer here.

You are right in saying that you need a recognizer (3rd party application). I suspect that you installed the "unstable" package from the OBS? I just looked at it and the package is quite broken (*at least* a few missing dependencies). I reported the issue(s) to the maintainer of the package.

From what I can see, you should still be able to get PocketSphinx to run, if you manually install it after you install Simon (again, this should be a dependency). If you can live with a static base model (no training), you need neither the HTK nor SphinxTrain. PocketSphinx (for SPHINX models) and Julius (for HTK models) will suffice.
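As a rough illustration of that rule, here is a minimal sketch in Python. It is not part of Simon: the function name `required_tools` is made up, and pairing SphinxTrain with SPHINX models and the HTK with HTK models for training is my reading of the paragraph above.

```python
def required_tools(model_type, training=False):
    """Return the third-party tools needed for a given base model type.

    Hypothetical helper for illustration only; Simon does not ship this.
    """
    # Recognizers that suffice for a static base model (no training).
    recognizers = {"SPHINX": "PocketSphinx", "HTK": "Julius"}
    # Extra toolkits assumed to be needed only when training the model.
    trainers = {"SPHINX": "SphinxTrain", "HTK": "HTK"}

    key = model_type.upper()
    if key not in recognizers:
        raise ValueError(f"unknown model type: {model_type!r}")

    tools = [recognizers[key]]
    if training:
        tools.append(trainers[key])
    return tools


if __name__ == "__main__":
    print(required_tools("SPHINX"))
    print(required_tools("HTK", training=True))
```

With a static SPHINX base model the list contains only PocketSphinx, which matches the setup described above.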

If you have problems compiling the HTK or SphinxTrain, please notify the appropriate maintainers. This is sadly out of my control. FWIW, Nickolay, the maintainer of SPHINX, is extremely responsive and very helpful. You can find him on the IRC channel #cmusphinx on Freenode (nickname: nshm).

As for the "what to do after installation" bit: if you have installed some scenarios and a fitting base model (look at the tags: "[EN/H4W]..." scenarios need the "[EN/H4W] ..." base model, etc.), Simon should auto-activate right after you complete the wizard (EN/H4W requires PocketSphinx). From then on you can say the commands that are defined in the scenarios you downloaded. The scenario description usually contains a list of them. E.g.: http://kde-files.org/content/show.php/%5BEN%2BH4W%5D+Firefox?content=156100
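The tag rule can be sketched in a few lines of Python. This is only an illustration, not something Simon provides: the helper names and the example scenario and model names are hypothetical, and the comparison is a plain string match on the leading bracketed tag.

```python
import re

# Matches a leading bracketed tag such as "[EN/H4W]".
TAG_PATTERN = re.compile(r"^\s*\[([^\]]+)\]")


def leading_tag(name):
    """Return the text inside the leading [...] tag, or None if there is none."""
    match = TAG_PATTERN.match(name)
    return match.group(1).strip() if match else None


def is_compatible(scenario_name, base_model_name):
    """A scenario fits a base model when both carry the same leading tag."""
    scenario_tag = leading_tag(scenario_name)
    return scenario_tag is not None and scenario_tag == leading_tag(base_model_name)


if __name__ == "__main__":
    # Hypothetical names, used only to exercise the check.
    print(is_compatible("[EN/H4W] Firefox", "[EN/H4W] Base model"))
    print(is_compatible("[EN/H4W] Firefox", "[DE/XYZ] Base model"))
```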

Please note that at the moment, there is no free dictation ("speech to text"), just commands.

Best regards, Peter

14:44, 30 June 2013

Peter hello, thank you for the comprehensive reply.

I shall work through all these elements and report back.

Kind regards, Andy

19:21, 30 June 2013

Peter hello. It is now almost two years later, and I have the same problem.

Back then, you said:

    Please note that at the moment, there is no free dictation ("speech to text"), just commands.

Has anything changed?

I have Dragon Dictates, but that requires loading WinX, which I never use except for niche apps. I haven't booted it in at least two months. Linux Format flogged your product again this month, but if it can't do speech to text then it doesn't do what I want.

I sympathise with the reasons it was created for voice commands, but I need speech to text.

Kind regards, Andy Alavarre@gmail.com

20:47, 9 April 2015