PDA

View Full Version : Software Automated Mouth



Rick Ethridge
January 24th, 2005, 10:13 AM
I found a disk image copy of this program on the internet. It's in a SIT archive. Just reply if you're interested in a copy.

EvanK
January 24th, 2005, 05:40 PM
Thanks Rick -- I found it here: http://homepage.mac.com/vectronic/appleii/sam.html -- is that the same page you're looking at?

Terry Yager
January 24th, 2005, 06:15 PM
Does it do speech-to-text, or just text-to-speech? I don't have any Apple to try it on, but I'm curious as to how it compares to more modern software. I just spent 2.5 hours yesterday installing & training Dragon Point & Speak. What a disappointment! It's speech recognizer really sux!

--T

carlsson
January 25th, 2005, 01:15 AM
S.A.M. as I know it is a software speech synthesizer, i.e. it will "speak" what you write to it, but no attempt at recording or recognizing. There were other hard/software solutions for various computers that tried to do that with various success.

S.A.M. was able for the Apple II, Atari 8-bit and Commodore 64; maybe even IBM PC(jr).

Terry Yager
January 25th, 2005, 10:55 AM
Yeah, I used to enjoy playing around with the speech synthesizer on my TI-99/4A. It was really quite advanced for it's time.

--T

carlsson
January 25th, 2005, 11:06 PM
Speak and Spell, wasn't it? I think it uses a special chip that generates the speech, also available for other computers, rather than being software generated by the computer's sound chip.

vic user
January 26th, 2005, 04:25 AM
the latest computer collector newsletter has an article on text to speech:


>> WELCOME TO THE COMPUTER COLLECTOR NEWSLETTER

>> W: http://news.computercollector.com E: news@computercollector.com

>> Vol. 4, Issue 4: Jan. 24, 2005: News/opinion, tidbits, classifieds

****************************************
NEWS & OPINION

The history of computer text-to-speech synthesis
by Evan Koblentz

Does your computer talk? Or rather, does it talk any better than it
could approximately 25 years ago?

That's right: we're "talking" about Software Automated Mouth, better
known as SAM, developed mostly by Mark Burton in 1979. The company,
SoftVoice, still exists today at http://www.text2speech.com. The
story of how Steve Jobs used SAM to make the Macintosh computer
"introduce itself" in 1984 is detailed at this very long web address:
http://www.folklore.org/StoryView.py?project=Macintosh&story=Intro_Dem
o.txt&topic=Marketing&sortOrder=Sort+by+Date&detail=medium&showcomment
s=1 (copy and paste the link because it will break across lines) but
we prefer http://homepage.mac.com/vectronic/appleii/sam.html
where you can actually download the stuffed software for Apple II
computers! (Soon we're acquiring a //c and looking forward to getting
this great memory from the past.) There is also a Wikipedia entry at
http://en.wikipedia.org/wiki/Software_Automatic_Mouth although we
don't particularly trust the accuracy of Wikipedia entries.

SAM is very neat, but we wondered: what came before it? What is the
real history of computer text-to-speech synthesis? In short, what was
the first machine -- computer or not -- to speak for itself? A-
Googling we went in search of some answers.

We found the web site http://www.ling.su.se/staff/hartmut/kemplne.htm
where Prof. Hartmut Traunmüller reports: "The first attempts to
produce human speech by machine were made in the 2nd half of the 18th
century. Ch. G. Kratzenstein, professor of physiology in Copenhagen,
previously in Halle and Petersburg, succeeded in producing vowels
using resonance tubes connected to organ pipes (1773)."

But there is a difference between the first attempts at something and
the first actual success. The professor continues: "[Wolfgang] Von
Kempelen's machine was the first that allowed to produce not only some
speech sounds, but also whole words and short sentences," described in
a paper by Von Kempelen in 1791.

Kempelen, as many computer history buffs already know, is more famous
for building a supposedly automated chess-playing machine, described
in great detail by writer Tom Standage in his 2002 book, "The Turk".
That chess machine turned out to be a fraud -- a short human was
always hidden inside -- but the speaking machine was real. Standage
explains how telegraph pioneer Charles Wheatstone built a copy of the
machine in 1863 and demonstrated it to a young Alexander Graham Bell.
(Visit http://www.ling.su.se/staff/hartmut/farkas.htm for many more
web sites and biographical references about Von Kempelen.)

Jump back to 1979: Texas Instruments was one of a few companies
selling handheld language translator devices. TI engineers
understood, however, that merely reading a foreign language was
useless if you didn't know how to pronounce the word. So they built
speech synthesis into the product using off-the-shelf technology from
their own parts bin: anyone remember the "Speak & Spell" toy? TI used
the toy chip while Burton and his colleagues were working on software
solutions! There are some fascinating specifications and other
details at http://www.datamath.org/Speech/LanguageTutor.htm. (I used
to own one of these devices, but gave it to VCF chief Sellam Ismail.
Unfortunately I did not record any audio clips from it.)

So again, we ask: does your computer talk any better than it could
approximately 25 years ago? Share your early text-to-speech tales
with us at news@computercollector.com.

[Note: we requested an interview with Mark Barton. We'll post the
results of that interview on the CCN web site when available.]

---------------------------------------

Terry Yager
January 26th, 2005, 09:22 AM
Speak and Spell, wasn't it? I think it uses a special chip that generates the speech, also available for other computers, rather than being software generated by the computer's sound chip.

Speak & Spell was quite a different animal, marketed as an educational toy for children, but it probably utilized the same chip, a TI proprietary design. The speech synthizer was an add-on for the TI99, which jacked into the expansion port on the side. It had a built-in vocabulary of several hundred words, plus a good number of built-in phonomes that could be programmed to pronounce other words not in the vocabulary. Funny thing is tho, IIRC, you couldn't program it directly from TI BASIC. You needed to have another cartridge, the Terminal Emulator to program the phonomes with. There were also a few games cartridges released which utilized the speech synthizer to speak with.

--T

EvanK
January 26th, 2005, 06:39 PM
Just to clear this up... my inspiration for writing about SAM, etc. in this week's Computer Collector Newsletter was actually a conversation with Rick a few days ago... we were talking along with several other people in the weekly chat at http://www.geocities.com/c64friends/ ... the topic happened to turn to synthesis and I brought up SAM, then Rick said he'd posted the relevant link here for me (and others) to get to afterwards.

Anyway, to Vic and everyone else: I hope you found the article educational and entertaining. I'm always looking for new story ideas if anyone has some (and looking for writers too... you try putting out a newsletter virtually alone each week!) :)

carlsson
January 27th, 2005, 07:50 AM
Gee, 1773.. if one produced a machine that could make sounds similar to human speech, I would suppose you got accused of witchcraft, or maybe that was a century earlier.

vic user
February 1st, 2005, 04:19 AM
Anyway, to Vic and everyone else: I hope you found the article educational and entertaining. I'm always looking for new story ideas if anyone has some (and looking for writers too... you try putting out a newsletter virtually alone each week!)


_________________
Evan Koblentz, editor
Computer Collector Newsletter

http://news.computercollector.com

news@computercollector.com

Holy smokes, you do most of that newsletter yourself?

Although nothing ever beats getting a physical newsletter in the mail, I love getting the CCN in my virtual mail box.

Thanks for all your work!

Chris

mjmahon
July 16th, 2005, 04:12 PM
There were two different versions of S.A.M.--one all done with software throught the 1-bit speaker port of the Apple II, and one using an 8-bit DAC card with an audio amplifier.

Both used the standard phoneme synthesis algorithm in the public domain.

The tricky part was making reasonably good speech by generating a pulse-width modulated stream of pulses to the speaker. (I know, I built one of these myself. ;-)

The TI chip was based on linear predictive coding (LPC), that permits pretty intelligible speech with very low data rates. Unfortunately, it takes a lot of analysis to derive the LPC coefficients for a particular utterance, so the "dictionaries" for the chip were limited.

The most common way the TI chip was used with computers was with a "dictionalry" of phonemes and the standard phoneme synthesis algorithm, making it sound about like any of the other phoneme synthesizers (like the SC-01 or the SSI 263).

Street Electronics' Echo II was an Apple speech synthesis card based on the TI chip, and it had both phoneme synthesis (unlimited robotic vocabulary) and LPC-synthesized whole words, which were much higher quality but with a limited vocabulary.

Terry Yager
July 16th, 2005, 04:55 PM
I recently picked up a Super Speak & Spell at the thrift store, but I wasn't too impressed. It does come off as being yet another cheap-ass toy (very limited capabilities).
OTOH, I have been playing for the past few days with the latest addition to my vintage computer collection; an Epson HX-20 with an Adaptive Communication Systems RealVoice Expansion Unit. Not too shabby, for 20-year-old technology, especially when compared to state-of-the-art for today. Check this link for an interactive demo of ATTs latest offering:

http://www.research.att.com/projects/tts/demo.html

One of the local BBSs where I used to hang out was an Assistive Technology site, and a lot of hard-of-hearing people in Michigan were still using RealVoice at the time the board went down for good (Y2K issues) on Dec 31, 1999, and probably still are.

WARNING: Shameless and self serving plug ahead!

If anyone reading this would like one of thier own to play with, see my current eBay auctions:

http://cgi.ebay.com/ws/eBayISAPI.dll?ViewItem&item=5220376184&rd=1&sspagename=STRK%3AMESE%3AIT&rd=1

/plug

--T

mjmahon
July 16th, 2005, 06:08 PM
The Epson unit is almost certainly using the same chip as the Speak'n'Spell. The only difference is the coefficient data being sent to it.

The Speak'n'Spell had a low price target and a relatively large vocabulary, so quality is limited (though still very understandable and better than most phoneme synthesis systems).

When the chip is fed by a computer, more data can be supplied, producing the pleasant female voice well-known to Echo II fans. ;-)

Terry Yager
July 16th, 2005, 06:47 PM
The RealVoice is powered by the same 6301-type CPU that (x2) is at the heart of the HX-20 itself, so it is essentially a computer in it own right (probably running in "slave" mode to the HX-20's master. I dont see anything in the way of TI chips in it, just a whole bunch of ROM chips & some others with identification sanded off.

--T

Terry Yager
July 17th, 2005, 01:18 PM
For comparison to the SOTA ATT demo above, I've posted a sample of the 20-year old TTS from the RealVoice unit:

http://webpages.charter.net/shent449/comphell/ayb.wav

--T

80sFreak
July 17th, 2005, 05:47 PM
http://webpages.charter.net/tyger89/default.htm/ayb.wav


This no workie... :(

Cheers,

Bryan

Terry Yager
July 19th, 2005, 02:41 PM
Try this one:

http://webpages.charter.net/shent449/comphell/ayb.wav

--T

80sFreak
July 19th, 2005, 03:41 PM
http://webpages.charter.net/shent449/comphell/ayb.wav


That be workin'! :D

Cheers,

Bryan

carlsson
July 23rd, 2005, 04:44 AM
I'm not trying to be witty or disparaging, but the female voice sounds like a Japanese woman speaking English. :wink:

http://www.anders.sfks.se/mp3/allyourbase.mid

Terry Yager
July 23rd, 2005, 09:13 AM
I'm not trying to be witty or disparaging, but the female voice sounds like a Japanese woman speaking English. :wink:

http://www.anders.sfks.se/mp3/allyourbase.mid

Could be, Epson is a Japanese company...

--T

Geo3
November 11th, 2006, 06:51 AM
Not sure what this thread is about.
But for the AppleII/IIgs. They have the Echo and Ufonic boards that did speech from the software.
For the IIc there was a Cricket plug in like Echo.
Corvox Hardware/Software used to be used for speech recognition. Also speech if I remember correctly.

Smooth Talker was a software program for the IIgs that would do sppech to text as you typed. Male or female voice.

I am sure there were others as well.

Take Care

ahm
November 11th, 2006, 07:26 AM
Not sure what this thread is about.

Software Automatic Mouth (http://en.wikipedia.org/wiki/Software_Automatic_Mouth), aka SAM.
Woo, there's an Atari version (http://www.retrobits.net/atari/sam.shtml)!

carlsson
November 11th, 2006, 02:04 PM
I scanned in an advertisment from COMPUTE!

http://www.cbm.sfks.se/pics/sam-ad.jpg (640 px wide, so I didn't inline it)


HERE'S S.A.M.!
PUT HIM IN YOUR DISK DRIVE AND YOUR COMPUTER TALKS.

S.A.M. is the Software Automatic Mouth (tm), by Don't Ask Software - a complete speech synthesizer on a disk. S.A.M. does what hardware speech devices do, and more - without the high price.

* Natural-sounding speech
* Variable pitch, speed, inflection
* English text-to-speech conversion
* Easy to use in your programs

FOR COMMODORE 64, ATARI, APPLE II+, IIe COMPUTERS

From your software dealer. Sug. retail prices: C64, Atari versions $59.95.
Apple version includes d/a card: $124.95.

Programmed by Mark Barton. (c) 1982 Don't Ask.
Atari, C64 conversions by Robert Freedman, Mac Lindsay respectively. (c) 1982, 1983 Don't Ask.
Commodore 64, Atari, Apple II+ and IIe are trademarks of Commodore Business Machines Inc., Atari Inc., and Apple Computer Inc., respectively.