All times are UTC - 4
flite_hts_engine: at last, good quality Puppy speech
mcewanw

Joined: 16 Aug 2007
Posts: 2346
Location: New Zealand

Posted: Fri 27 Nov 2009, 10:59    Subject: 0.91.mce02.pet uploaded
Subtitle: stdout and wav version
 

Changes: Fixed scripts to allow wav output.

This version of flite_hts_engine writes either to a wav file or to stdout (which allows a direct pipe to aplay; no fifo required).

_________________
Non enim propter gloriam, diuicias aut honores pugnamus set propter libertatem solummodo quam Nemo bonus nisi simul cum vita amittit.
amigo

Joined: 02 Apr 2007
Posts: 2247

Posted: Fri 27 Nov 2009, 11:59

Great work William, no problem about the diff. I appreciate the fine work that you have done in the past and on this.

Text has to be 'vetted' for use by flite and other t2s programs as they don't always know what to do. Text with formatting like:
##########
==========
underscores and other special chars will not be read pleasantly at all.
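
A minimal sketch of that kind of vetting, assuming plain text on stdin. The name vet_text and the exact set of characters it strips are illustrative choices for this example, not something that ships with flite:

```shell
#!/bin/sh
# vet_text: hypothetical helper that cleans text before it reaches a
# text-to-speech program. Reads stdin, writes the cleaned text to stdout.
vet_text() {
    # 1. replace runs of decoration characters (# = _ * ~ |) with a space
    # 2. squeeze runs of whitespace down to a single space
    # 3. trim a leading and a trailing space
    # 4. drop lines that end up empty
    sed -e 's/[#=_*~|]\{1,\}/ /g' \
        -e 's/[[:space:]]\{1,\}/ /g' \
        -e 's/^ //' -e 's/ $//' \
        -e '/^$/d'
}
```

If flitet takes the text as its first argument, as in the examples elsewhere in this thread, it could be used as: flitet "$(vet_text < notes.txt)" | aplay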
mcewanw

Joined: 16 Aug 2007
Posts: 2346
Location: New Zealand

Posted: Fri 27 Nov 2009, 12:07

flite_hts_engine sometimes gets oddly muddled. Try these two simple examples. On my machine at least, the second example flops...

flitet "Here is the new thread" | aplay

flitet "Here is the new thread and" | aplay

A long text always contains such difficulties here and there, which flite_hts_engine makes a mess of. Otherwise it is great; if only it were consistent in that sense!

amigo

Joined: 02 Apr 2007
Posts: 2247

Posted: Fri 27 Nov 2009, 13:09

I've been looking at my flite sources again. I had forgotten that I had found a file called speak.c on the net a long time ago, which someone had written to read a file line-by-line using flite. Actually, the program links in the flite libs so it is standalone. Anyway, it may be a good candidate to do the same with flite_hts_engine.

I'll post it tomorrow after working out (again) the details of how to use it. I had built flite and speak using 'diet' or uClibc, so the build recipe needs to be re-worked for using glibc.

I also was looking at the full sources for flite -especially the built-in sound server/client bits. It might not be too hard to build this into flite_hts and thereby reduce some of the latency of starting aplay each time.
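
Until something like speak.c is adapted, the line-by-line idea can be roughed out in shell. This is only a sketch: speak_lines is a made-up name, and the SPEAK_CMD and PLAY_CMD defaults assume the flitet "text" | aplay usage shown earlier in this thread.

```shell
#!/bin/sh
# speak_lines: hypothetical helper that speaks a text file one line at a time.
# SPEAK_CMD and PLAY_CMD are assumptions based on the flitet | aplay examples
# in this thread; override them if your setup differs.
SPEAK_CMD=${SPEAK_CMD:-flitet}
PLAY_CMD=${PLAY_CMD:-aplay}

speak_lines() {
    # Read the file named in $1, or stdin when no file is given.
    # IFS= and -r keep each line exactly as written; the || test also
    # handles a last line that has no trailing newline.
    while IFS= read -r line || [ -n "$line" ]; do
        # Skip lines that are empty or contain only whitespace.
        case $line in
            *[![:space:]]*) ;;
            *) continue ;;
        esac
        # </dev/null stops the speech command from eating the loop's input.
        "$SPEAK_CMD" "$line" < /dev/null | "$PLAY_CMD"
    done < "${1:-/dev/stdin}"
}
```

Example: speak_lines story.txt. Note that this still starts aplay once per line, so it keeps each chunk short but does nothing about the start-up latency mentioned above.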
Trobin

Joined: 18 Aug 2005
Posts: 907
Location: BC Canada

Posted: Sun 28 Mar 2010, 20:25

Any way to attach this to emacspeak?
_________________
http://speakpup.blogspot.com
jemimah


Joined: 26 Aug 2009
Posts: 4309
Location: Tampa, FL

Posted: Fri 30 Apr 2010, 23:47

Here is a gtkdialog frontend for flite. It takes an argument for the filename too, so you can use it with RoxRightClicks.
Attachment: flite-speak.gz (546 bytes, downloaded 381 times)
abushcrafter


Joined: 30 Oct 2009
Posts: 1447
Location: England

Posted: Tue 02 Nov 2010, 10:01

There's a new version out. Could you package it up please? Could you also slow down the speed at which it talks? With the current package, I find it stumbles on a 1 MB text file. Is that a buffer size issue? If so, could you give the new version (if you package it up) a larger buffer, please? Thanks for the work so far.

I have attached a bug fix of jemimah's gtkdialog frontend for flite.
Attachment: flite-speak-0.0.2.bz2 (829 bytes, downloaded 267 times)

_________________
adobe flash is rubbish!
My Quote:"Humans are stupid, though some are clever but stupid." http://www.dependent.de/media/audio/mp3/System_Syn_Heres_to_You.zip http://www.systemsyn.com/
amigo

Joined: 02 Apr 2007
Posts: 2247

Posted: Tue 02 Nov 2010, 13:50

Yes, there is a new version of flite available. It now comes with three or four voices which you can choose from using command-line options. Yeah! The female voice seems to be the most natural sounding.
jemimah


Joined: 26 Aug 2009
Posts: 4309
Location: Tampa, FL

Posted: Tue 02 Nov 2010, 15:21

I don't think flite-hts-engine has been updated - the regular flite is much larger.
amigo

Joined: 02 Apr 2007
Posts: 2247

Posted: Wed 03 Nov 2010, 11:12

You are correct jemimah, flite-hts-engine has not been updated. The flite API has changed, so flite-hts-engine may or may not be adapted to the new API. flite-hts-engine has much better voice quality, but it garbles some texts and can only read fairly short texts.

I've just found something new called svox-pico, which is used on the Android platform. There is a pico2wave utility (if you use the Ubuntu sources) which provides several very nice voices to choose from. The problem is that it only outputs a *.wav file. Maybe one of us can figure out how to get it to use stdout, or some audio lib to output sound directly...
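
One stopgap, sketched here rather than tested against every build, is to hide the wav file behind a small wrapper that writes to a temporary file and plays it straight away. pico_say is a made-up name; the sketch assumes pico2wave's -w (output file) option and that aplay is installed.

```shell
#!/bin/sh
# pico_say: hypothetical wrapper that hides pico2wave's file-only output.
# Assumes pico2wave (with its -w option) and aplay are on the PATH.
pico_say() {
    # Use a private temporary directory so the output file can carry a
    # .wav name, which pico2wave is assumed here to require.
    tmpdir=$(mktemp -d) || return 1
    wav="$tmpdir/speech.wav"

    # All arguments are joined into one string and spoken as a single text.
    if pico2wave -w "$wav" "$*"; then
        aplay -q "$wav"
        status=$?
    else
        status=1
    fi

    # Clean up whether or not synthesis or playback succeeded.
    rm -rf "$tmpdir"
    return "$status"
}
```

Example: pico_say "Hello from Puppy". Replacing the aplay line with cat "$wav" would put the wav data on stdout instead, for piping into something else.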

I'm still searching for something light and usable...
mcewanw

Joined: 16 Aug 2007
Posts: 2346
Location: New Zealand

Posted: Fri 05 Sep 2014, 09:59    Subject: New version of flite_hts_engine uploaded

flite_hts_engine is a reasonably good quality Text To Speech (TTS) synthesiser in an incredibly small package considering it includes the voice data (approx 1.6MB download).

A new version (deb package for DebianDog and dotpet for Puppy), which seems to have fixed the problem of occasionally garbled text, can be downloaded, for now at least, from the first post of this thread.

William

greengeek

Joined: 20 Jul 2010
Posts: 2549
Location: New Zealand

Posted: Fri 05 Sep 2014, 15:58

Thanks William - I've just tried the dotpet and it works well on my slacko 5.6 derivative. Seems to respond well to commas and full stops too. Here is my sample:
Code:
# flitet "hello world, I am going to the bathroom. I may be some time" | aplay

(I am always searching for TTS and STT methods for a voice controlled pup that I need to make some improvements on...)

EDIT : It makes an interesting difference if I add a hyphen after a vowel:
Code:
# flitet "hello world, I am going to the ba-throom. I may be some time" | aplay


EDIT2 : It also seems to have the ability to change the inflection of the last word in the sentence. There is a slight difference in the ending of these two phrases:
Code:
#  flitet "Where is president kennedies brain" | aplay
and
Code:
#  flitet "Where, is president kennedies brain" | aplay
mcewanw

Joined: 16 Aug 2007
Posts: 2346
Location: New Zealand

Posted: Fri 05 Sep 2014, 18:58    Subject: Mage Platform for Performative Speech Synthesis

Thanks, greengeek - you post interesting examples and observations regarding the subtleties of using flite_hts_engine.

As I mentioned in my first thread post, you may also find the project http://mage.numediart.org/ of interest. Mage is easy to compile (in DebianDog at least - it uses cmake rather than make during compilation) and is nicely described in a number of videos on its website. I have compiled Mage on my computer and am working on it just now. I hope to provide a useful dotpet/deb for that too eventually, but it is early days, so I don't know if I will succeed or if too much work is involved. The Mage videos are worth a look though, since they show future possibilities arising from some of the research going on out there.

William

EDIT: The required build system proved too large for the resources my machine has... Also the HTS code in Mage doesn't seem to be used or immediately usable with a realtime TTS like flite, as far as I could determine. Best I managed was to use its HTS batch binary to create a wav file from a provided .lab text file. Wasn't a particularly interesting exercise; flite_hts_engine appears to be more appropriate for practical use in Puppy as things are at the moment.

Flash
Official Dog Handler


Joined: 04 May 2005
Posts: 11068
Location: Arizona USA

Posted: Fri 05 Sep 2014, 23:34

Sounds a lot better if you put hyphens like this:
Code:
# flitet "hel-lo world, I am going to the bath-room. I may be some time" | aplay
sunburnt


Joined: 08 Jun 2005
Posts: 5030
Location: Arizona, U.S.A.

Posted: Wed 10 Sep 2014, 01:03

Very nice William, this is something to inspire most anyone.

I'm thinking BaCon may be able to help if the output is a problem.