Story highlights
Susan Bennett says she is the voice of the original U.S. version of Siri on Apple's iPhone
Apple won't comment, but other sources -- including an audio forensic expert -- confirm this
Recordings from 2005 were used for Siri; hearing herself six years later was a surprise
How CNN's Jessica Ravitz, who had never used Siri, found Bennett is also shocking
For the past two years, sheâs been a pocket and purse accessory to millions of Americans. Sheâs starred alongside Samuel L. Jackson and Zooey Deschanel. Sheâs provided weather forecasts and restaurant tips, been mocked as useless and answered absurd questions about what sheâs wearing.
She is Siri, Appleâs voice-activated virtual âassistantâ introduced to the masses with the iPhone 4S on October 4, 2011.
Behind this groundbreaking technology there is a real woman. While the ever-secretive Apple has never identified her, all signs indicate that the original voice of Siri in the United States is a voiceover actor who laid down recordings for a client eight years ago. She had no idea sheâd someday be speaking to more than 100 million people through a not-yet-invented phone.
Her name is Susan Bennett and she lives in suburban Atlanta.
Apple wonât confirm it. But Bennett says she is Siri. Professionals who know her voice, have worked with her and represent her legally say she is Siri. And an audio-forensics expert with 30 years of experience has studied both voices and says he is â100%â certain the two are the same.
Bennett, who wonât divulge her age, fell into voice work by accident in the 1970s. Today, she can be heard worldwide. She speaks up in commercials and on countless phone systems. She spells out directions from GPS devices and addresses travelers in Delta airport terminals.
Until now, itâs been a career thatâs afforded her anonymity.
But a new Apple mobile operating system, iOS 7, with new Siri voices means that Bennettâs reign as the American Siri is slowly coming to an end. At the same time, tech-news site The Verge posted a video last month, âHow Siri found its voice,â that led some viewers to believe that Allison Dufty, the featured voiceover talent, was Siri. A horrified Dufty scrambled in response, writing on her website that she is âabsolutely, positively NOT the voice of Siri,â but not before some bloggers had bought into the hype.
And there sat Bennett, holding onto her secret, laughing and watching it all. For so long sheâd been goaded by others, including her son and husband, to come forward. Her Siri counterparts in the UK and Australia had revealed their identities, after all.
So why not her? It was her question to wrestle with, and finally she found her answer.
âI really had to weigh the importance of it for me personally. I wasnât sure that I wanted that notoriety, and I also wasnât sure where I stood legally. And so, consequently, I was very conservative about it for a long time,â she said. âAnd then this Verge video came out ⦠And it seemed like everyone was clamoring to find out who the real voice behind Siri is, and so I thought, well, you know, what the heck? This is the time.â
The Siri surprise
The story of how Bennett became this iconic voice began in 2005. ScanSoft, a software company, was looking for a voice for a new project. It reached out to GM Voices, a suburban Atlanta company that had established a niche recording voices for automated voice technologies. Bennett, a trusted talent who had done lots of work with GM Voices, was one of the options presented. ScanSoft liked what it heard, and in June 2005 Bennett signed a contract offering her voice for recordings that would be used in a database to construct speech.
For four hours a day, every day, in July 2005, Bennett holed up in her home recording booth. Hour after hour, she read nonsensical phrases and sentences so that the âubergeeksâ â as she affectionately calls them; they leave her awestruck â could work their magic by pulling out vowels, consonants, syllables and diphthongs, and playing with her pitch and speed.
These snippets were then synthesized in a process called concatenation that builds words, sentences, paragraphs. And that is how voices like hers find their way into GPS and telephone systems.
âThere are some people that just can read hour upon hour upon hour, and itâs not a problem. For me, I get extremely bored ⦠So I just take breaks. Thatâs one of the reasons why Siri might sometimes sound like she has a bit of an attitude,â Bennett said with a laugh. âThose sounds might have been recorded the last 15 minutes of those four hours.â
But Bennett never knew exactly how her voice would be used. She assumed it would be employed in company phone systems, but beyond that didnât think much about it. She was paid by the hour â she wonât say how much â and moved on to the next gig.
The surprise came in October 2011 after Apple released its iPhone 4S, the first to feature Siri. Bennett didnât have the phone herself, but people who knew her voice did.
âA colleague e-mailed me [about Siri] and said, âHey, weâve been playing around with this new Apple phone. Isnât this you?ââ
Bennett went to her computer, pulled up Appleâs site and listened to video clips announcing Siri. The voice was unmistakably hers.
âOh, I knew,â she said. âItâs obviously me. Itâs my voice.â
Seeking proof
It certainly does sound like Bennett. But proving who supplied the voice of Siri isnât easy. Itâs not like Steve Jobs sent Bennett a thank-you note, or a certificate to hang on her wall.
There are others who vouch for her. But the tech world â and specifically the text-to-speech, or TTS, space â is a complicated business, one thatâs shrouded in secrecy and entangled in a web of nondisclosure agreements.
Bennett is not bound by such restrictions, which is why sheâs talking. But the industry has a vested interest in keeping their voices anonymous.
âThe companies are competing to create the best-sounding and functioning systems. Their concern is driving revenues,â said Marcus Graham, CEO of GM Voices. âTalking about the voice talent, from their perspective, is likely seen as a distraction.â
Bennettâs attorney, Steve Sidman, canât breach attorney-client privilege to share documents and contracts, but since he began representing Bennett in 2012 heâs been intensely aware of her connection to Siri.
âIâve engaged in substantial negotiations â multiple, months-long negotiations â with parties along the economic food chain, so to speak, that involved her rendering services as the voice of Siri,â he told CNN. âItâs as simple as that.â
And then thereâs Graham, of GM Voices, a man who has built a career around providing voiceover talent for interactive voice technologies.
Graham wonât divulge details about any deals he made back in 2005. But he has worked with Bennett for 25 years, has recorded âliterally millions of words with Susanâ and has installed her voice with clients across the globe. He knows her voice as well as anyone, and he doesnât hesitate when asked if she and Siri are the same.
âMost female voices are kind of thin, but sheâs got a rich, full voice,â he said. âYes, sheâs the voice of Siri. ⦠Sheâs definitely the voice.â
A â100% matchâ
In October 2005, a few months after Bennett made those recordings, ScanSoft bought and took on the name of Nuance Communications. Nuance is the company widely accepted to have provided to Apple the technology behind Siri.
When CNN contacted Nuance to try and confirm Bennettâs identity as a voice of Siri, a Nuance spokeswoman said, âAs a company, we donât comment on Apple.â
Apple, too, declined to comment.
So CNN took the investigation one step further by hiring an audio forensics expert to compare Bennettâs voice with Siriâs.
Ed Primeau, of Rochester Hills, Michigan, has been doing this work for three decades. Heâs testified in courts, analyzed âhundreds, if not thousandsâ of recordings and is a member of the American Board of Recorded Evidence. He spent four hours studying our âknown voiceâ â in this case Siri â with the unknown voice of Bennett.
âI believe, and Iâve lived this for 30 years, no two voices are the same,â he said, after finishing his analysis of the Siri voice and Bennettâs. âThey are identical â a 100% match.â
To reach his conclusion Primeau created back-to-back comparison files, lifted and listened to consonants and reviewed deliveries. He took the hiss off the Siri sound, created in recording from a phone, and dropped it into Bennettâs file.
After studying Bennettâs normal speaking voice, he was about 70% certain of the match. But once he had audio of her saying the same words as Siri, he knew his work was done. Even so, he said he asked a colleague for a second opinion.
âI understand the importance of accuracy,â Primeau said. âRest assured: Itâs 100% Susan.â
How CNN got this story
This isnât the sort of story Iâd naturally go after. Technology is far from my beat. In fact, the first time I ever spoke to Siri was on my work phone â the kind thatâs plugged into a wall jack and has a tangled cord attached to the handset.
Bennett was a voiceover artist I was interviewing for a CNN special project on the worldâs busiest airport â Hartsfield-Jackson Atlanta International â scheduled to come out next month. I was tracking down the airportâs voices, and she, a voice of Delta terminals, was one of them.
In the course of our phone conversation, I asked her to rattle off some jobs sheâs had over the years. She gave me a quick and general rundown and then added that sheâs done a lot of IVR work.
âIVR?â I asked.
âInteractive voice response,â she answered. âThe sort of thing you hear on a companyâs phone system.â
For reasons I canât explain â I was still struggling to understand my first iPhone â I blurted out, âHey, are you Siri?â
She gasped. And then I gasped.
âOh my God,â I said. âYouâre totally Siri, arenât you?â
What followed was a short, panicked flurry of non-denials and non-confirmations, and a promise from me that I wouldnât do or say a thing.
That was months ago. About two weeks ago, after the confusion over the Verge video, Bennett reached out to me. She was ready to speak as herself and set the record straight.
âMy career as a machineâ
As a child, Bennettâs favorite toy was a play phone-operator system, a big red block with a receiver and lines she could patch in to help imaginary callers make their connections.
Years later, while singing jingles, she was tapped to be the radio and TV voice of First National Bankâs âTillie the All-Time Teller,â the first ATM machine. Though that was about 40 years ago, she can â and does â still break seamlessly into the high-pitched song.
âI began my career as a machine many years ago,â Bennett said. âIâm sure that you hear my voice at some point every day.â
But the way she is heard was a surprise even to her.
Music and singing had always been a part of Bennettâs life. At Brown University, she sang in a jazz band and also with another group at the Berklee School of Music. After graduating, she toured as a backup singer with Burt Bacharach and Roy Orbison. Today, she and husband Rick Hinkle â a guitarist, composer and sound engineer â still play in a band, mostly at private events.
She fell into voiceover work by chance in the 1970s when she walked into Atlantaâs Doppler Studios for a jingle job and the voiceover talent was a no-show. The studio owner looked around and said, âSusan, come over here. You donât have an accent. Go ahead and read this.â
She did, and a new career path was born.
Bennett wasnât always accent-free, though. She was born in Vermont and grew up all over New England. Her voice â dropped Rs and all â was âSNLâ-skit ready. Can she imagine Siri as a New Englander? âNeva! Neva!â
A stint in upstate New York helped her lose the accent. By the time she arrived in Atlanta in 1972, with her first husband, former NHL player Curt Bennett of the Atlanta Flames, she was ready to fight off the Southern twang. She fell in love with Atlanta and, after that marriage ended, stayed.
Even though her voice can be heard everywhere, sheâs enjoyed being out of the spotlight.
âYou have a certain anonymity which can be very advantageous,â she said. âPeople donât judge you by how you look ⦠Thatâs been kind of freeing in a lot of ways.â
âPart of historyâ
Bennett works in a sound-proof recording booth in her home, a tin of lozenges at the ready. Her voice is transmitted to the world, while she â if she so chooses â sits in her jammies, or more likely her Zumba clothes. Auditions are done by e-mail. She can grocery shop and go unrecognized.
Itâs not as though her natural speaking voice, heard out of context in the produce aisle, sparks reactions.
So the idea of coming out as the voice of Siri was one she pushed aside. It probably wouldnât have even occurred to her if not for the goading of others, including her 36-year-old son â whom she, and he, jokingly refers to as âSon of Siri.â
âHer voice has been everywhere throughout my life. Iâd call my bank while I was in college in Colorado, and it was my mom telling me I had $4,â said Cameron Bennett, a photographer in Los Angeles.
He first found out she was the voice of Siri while watching an iPhone 4S commercial on TV. There, on the screen, was director Martin Scorsese talking to his mother. When Cameron bought the phone himself, she began barking at him through its GPS feature, prompting him to yell, âMom, stop!â
âSheâs part of history,â he said. âIt was funny trying to explain to her how big it was. She uses her cell phone for 8% of what it can do.â
When Bennett upgraded her phone and first talked to ⦠well, herself, she says she was a little horrified. It was weird, to say the least. But she was blown away, she said, to play a part in such a technological feat.
Being the voice of Siri, though, doesnât mean sheâs immune to the sorts of frustrations others sometimes have with the technology.
âBut I never yell at her â very bad karma,â Bennett said. That said, she knows not everyone is as gracious: âYes, I worry about how many times I get cursed every day.â
Now, though, with iOS 7 she is passing the telephonic torch to a new Siri. Bennett would be lying if she said she wasnât a bit disappointed, but in her field of work sheâs learned to expect evolution â and even revolution.
As technology improves, and the concatenation process becomes less robotic and more human, Bennett thinks anything will be possible.
âI really see a time when youâll probably be able to put your own voice on your phone and have your own voice talk back to you,â she said. âWhich Iâm used to, but maybe you arenât.â