Author |
Message |
stemsee
Joined: 27 Jun 2013 Posts: 2543 Location: In The Way
|
Posted: Fri 27 Feb 2015, 08:14 Post subject:
Working on speech recognition Subject description: source files acquired |
|
I have managed to obtain the source code for a speech recognition/assistant program directly from its German programmer. When I have it working I will post something.
|
|
|
 |
amigo
Joined: 02 Apr 2007 Posts: 2647
|
Posted: Fri 27 Feb 2015, 09:11 Post subject:
|
|
Pant, pant... at least give us the name so we can be dreaming about it meanwhile...
|
|
|
 |
stemsee
Joined: 27 Jun 2013 Posts: 2543 Location: In The Way
|
Posted: Fri 27 Feb 2015, 14:28 Post subject:
|
|
It is called Simon ... I have had it sitting here for a few months. It depends on KDE, which is why I waited so long; I was going to build a KDE-only pup. EmSeev1 had KDE and LH64 has kde.sfs, so maybe I will use that to build it.
Code: | Please compile Simon with the provided build.sh script.
There are the following dependencies.
Required build-time dependency:
1. KDE libraries and workspace (4.8 or newer)
Optional build-time dependencies:
1. KDE Pim Libs . Enables the optional command plugin "Akonadi" that allows
Simon to trigger commands at certain times and to use Simon dialogs as
calendar reminders.
2. LibSampleRate (http://www.mega-nerd.com/SRC/). If you build Simon with LibSampleRate support, it can
automatically resample audio in- and output for devices that don't support the
required configuration.
3. qaccessibilityclient (http://projects.kde.org/projects/playground/accessibility/libkdeaccessibilityclient). Enables the experimental ATSPI integration.
4. Sphinxbase, PocketSphinx, SphinxTrain (http://cmusphinx.sf.net). Together, they enable SPHINX support in Simon
(model compilation and recognition).
5. OpenCV (http://opencv.org). Enables the face detection and lip detection context plugins.
Optional runtime dependencies:
Julius (http://julius.sourceforge.jp/en_index.php) and the HTK
(http://htk.eng.cam.ac.uk) enable the use of HTK models when installed.
Please refer to the manual for more information.
|
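For anyone wanting to see which of the optional build-time libraries above are already installed before running build.sh, here is a rough sketch in Python. It is not part of Simon. It assumes `pkg-config` is available, and the module names in `OPTIONAL_MODULES` (`samplerate`, `sphinxbase`, `pocketsphinx`, `opencv`) are my assumptions about how a distro registers these libraries; they may differ on your system. The helper names are made up for illustration.

```python
import shutil
import subprocess
from typing import Callable, Dict

# Assumed pkg-config module names for some optional dependencies in the
# README above. These are guesses and may differ between distros/versions.
OPTIONAL_MODULES: Dict[str, str] = {
    "LibSampleRate": "samplerate",
    "Sphinxbase": "sphinxbase",
    "PocketSphinx": "pocketsphinx",
    "OpenCV": "opencv",
}


def pkg_config_exists(module: str) -> bool:
    """Return True if pkg-config reports that `module` is installed."""
    if shutil.which("pkg-config") is None:
        return False  # pkg-config itself is missing, so nothing can be found
    result = subprocess.run(["pkg-config", "--exists", module])
    return result.returncode == 0


def check_optional(probe: Callable[[str], bool] = pkg_config_exists) -> Dict[str, bool]:
    """Map each optional dependency name to whether the probe found it."""
    return {name: probe(module) for name, module in OPTIONAL_MODULES.items()}
```

Calling `check_optional()` returns a dict such as `{"LibSampleRate": True, ...}`; anything marked `False` just means the matching Simon feature would probably be left out of the build, not that the build fails.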
|
|
|
 |
rufwoof

Joined: 24 Feb 2014 Posts: 3672
|
Posted: Fri 27 Feb 2015, 15:02 Post subject:
|
|
There are some demos (for Simon) on YouTube if you search around. I watched one only a week or so ago. I like how it divides the screen into numbered regions: you say a number and it subdivides that region into numbered sections, and so on, until you've zoomed in on the area, icon, or whatever you had in mind and can verbally activate it. OK for limited use I guess, but I suspect that would be less acceptable if you were doing lots of clicking around.
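To make the idea concrete, here is a minimal sketch of that kind of numbered-grid zooming in Python. It is only my guess at the general technique, not Simon's actual code: the 3x3 grid, the row-major numbering starting at 1, and the function names are all assumptions.

```python
from typing import List, Tuple

Rect = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


def subdivide(rect: Rect, rows: int = 3, cols: int = 3) -> List[Rect]:
    """Split rect into rows*cols cells, listed left to right, top to bottom."""
    x, y, w, h = rect
    cells = []
    for r in range(rows):
        for c in range(cols):
            # Integer edges computed from the parent so cells tile it exactly.
            left = x + c * w // cols
            right = x + (c + 1) * w // cols
            top = y + r * h // rows
            bottom = y + (r + 1) * h // rows
            cells.append((left, top, right - left, bottom - top))
    return cells


def zoom(screen: Rect, spoken: List[int], rows: int = 3, cols: int = 3) -> Rect:
    """Follow a sequence of spoken cell numbers (1-based) into the screen."""
    rect = screen
    for number in spoken:
        cells = subdivide(rect, rows, cols)
        if not 1 <= number <= len(cells):
            raise ValueError(f"no region numbered {number}")
        rect = cells[number - 1]
    return rect


def centre(rect: Rect) -> Tuple[int, int]:
    """Point where a click would land once the region is small enough."""
    x, y, w, h = rect
    return (x + w // 2, y + h // 2)
```

With a 900x900 screen, saying "five" then "one" gives `zoom((0, 0, 900, 900), [5, 1])`, which narrows to the 100x100 cell at (300, 300). Each spoken number shrinks the target by a factor of nine, so a few numbers reach a single icon, but every click costs several utterances.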
|
|
|
 |
ETP

Joined: 19 Oct 2010 Posts: 1173 Location: UK
|
Posted: Fri 27 Feb 2015, 15:28 Post subject:
Re: Working on speech recognition |
|
Hi stemsee,
You may wish to check out LiSpeak, which appears to be under development.
It is based on Palaver, which I used in a butchered form in Obedient Pup.
https://github.com/BmanDesignsCanada/LiSpeak/
_________________ Regards ETP
Kennels
|
|
|
 |
amigo
Joined: 02 Apr 2007 Posts: 2647
|
Posted: Sat 28 Feb 2015, 05:05 Post subject:
|
|
Okay, nothing new. Simon has been around quite a while and, as mentioned, centers on KDE, so it is naturally heavy. Palaver looked good, although it is pretty heavy since it uses Python. I think it has been replaced by a new project.
Simon makes a good case study because it is pretty functional and well thought out. But, to me, only for ideas.
|
|
|
 |
|