Infovox iVox - 1.1.2natural voices in language of choice for use in any app |
|
||||||||||||||||
|
|||||||||||||||||
Feedback Summary:
| Version 1.1.2: | |||||
| Overall Rating: | Features: | Support: | |||
| Ease of Use: | Quality / Stability: | Price: | |||
Key to Types of Feedback:
Reviews
Troubleshooting
Usage Tips
Developer Notes
Commentary
Featured Reviews
Better than Cepstral voices, but still wanting... - Version: 2.0b, 9/18/2009 12:24AM PST
zunipus
Most Recent Replies: View All 1 Replies
- Waiting for Cepstral update ... (german voice)
Decent voices with some caveats 



- Version: 1.1.2, 4/28/2008 09:18PM PST
(1 of 1 users found this comment useful)
TimA
Acapella group gives you a generous length of time to evaluate the voices -- 30 days of unrestricted use. If you haven't decided whether you can live with the voices after that amount of time, you probably weren't using them in the first place. If, however, you find that you miss the voices after the trial period ends, that's a sure sign that you should purchase the voices.
The American voice pack comes with several voices. However, only the HQ voices are of any real quality. The HD voices sound more robotic than even the low-quality voices that come with the Mac. There are five HQ voices.
However, narrowing them down a bit further, the child voices, Kenny and Nelly, are just high-pitched versions of other voices. That leaves you with three truly high-quality voices, Heather, Laura, and Ryan.
At $99 for the voice pack, that puts the best voices at about $33 each (no you can’t purchase them individually), $13 more than the individual Cepstral voices. The best voice of the bunch, Ryan, however, sounds excellent (aside from a few ideosyncrasies).
The worst trait of the Infovox Ivox voices is that they pronounce some punctuation marks, such as asterisks and dashes. This is especially annoying in e-mails when these characters are used as separators such as:
*********************************************************************
When the voices get to such strings, they insist on stupidly saying, “asterisk asterisk asterisk asterisk asterisk asterisk asterisk asterisk asterisk” ad nauseum.
Apple's voices have not read punctuation marks like these for over a decade. Come on Acapella, what were you thinking? Get with the program! This is ridiculous when a natural sounding voice suddenly starts reading “asterisk asterisk asterisk” or “dash dash dash dash dash dash dash dash dash dash” etc.This is pointless and unnatural. And annoying.
The American voice pack comes with several voices. However, only the HQ voices are of any real quality. The HD voices sound more robotic than even the low-quality voices that come with the Mac. There are five HQ voices.
However, narrowing them down a bit further, the child voices, Kenny and Nelly, are just high-pitched versions of other voices. That leaves you with three truly high-quality voices, Heather, Laura, and Ryan.
At $99 for the voice pack, that puts the best voices at about $33 each (no you can’t purchase them individually), $13 more than the individual Cepstral voices. The best voice of the bunch, Ryan, however, sounds excellent (aside from a few ideosyncrasies).
The worst trait of the Infovox Ivox voices is that they pronounce some punctuation marks, such as asterisks and dashes. This is especially annoying in e-mails when these characters are used as separators such as:
*********************************************************************
When the voices get to such strings, they insist on stupidly saying, “asterisk asterisk asterisk asterisk asterisk asterisk asterisk asterisk asterisk” ad nauseum.
Apple's voices have not read punctuation marks like these for over a decade. Come on Acapella, what were you thinking? Get with the program! This is ridiculous when a natural sounding voice suddenly starts reading “asterisk asterisk asterisk” or “dash dash dash dash dash dash dash dash dash dash” etc.This is pointless and unnatural. And annoying.
Superb 



- Version: 1.0, 1/15/2007 02:26PM PST
(2 of 2 users found this comment useful)
rickybuchanan
My Mac needed the OS reinstalled recently for hardware reasons, so I unexpectedly got to hear the built in Tiger voices again. The difference in quality was much more than I remembered - these Infovox iVox voices are a hundred times better to listen to, and much easier to understand.
I highly recommend them to anyone who uses the OS X text-to-speech engine.
I highly recommend them to anyone who uses the OS X text-to-speech engine.
When a text-to-speech voice comes to a punctuation mark, you want it to act accordingly like a rational human being. If a sentence ends in a period, the voice speaks in a 'straight' tone. If a sentence ends in an exclamation mark, you want the voice to speak more emphatically. If a sentence ends in a question mark, you want the voice to rise at the end in a questioning tone. All of Apple's voices do this.
Cepstral voices treat exclamation marks and question marks as periods. It sounds very odd and dysfunctional. This is why I avoid their voices.
Acapela voices, which are used in Infovox, are just as nice sounding as Cepstrals with the added feature that at least they (usually) comprehend question marks and speak accordingly. (Results will vary, so test voices before using them). But again, sadly, they have no comprehension of exclamation marks, which of course sounds again odd. Bleh. But at least they're a step up from Cepstral.
Then there is the noise artifact problem. Apple's recent Alex voice still has them. You hear odd little noises that have nothing to do with speech. The Bruce voice, in one version of Mac OS X, used to make occasional farting noises! Vicki remains the queen of Apple voices for lack of artifacts. But sadly the Leopard and Snow Leopard speech engine ruins her cadence to make her sound manic, despite the fact that nothing whatever changed in her actual speech code.
Cepstral and Acapela voices can have similar noise artifact problems. Be sure to check each voice carefully before buying it or deploying it. One voice I with which I am most pleased is the recent Peter British voice from Acapela. I find it easy to understand with very few noise artifacts or speech inflection oddities.
It's strange after all these years of voice technology, [The original 1984 Mac had text-to-speech built in], that it's still 'buyer (or user) beware'. No doubt human error inflicts ever inflicts itself into the process.