Using Apple Books AI Based Digital Narration Tool For Audiobooks

In self-publishing two paperback and hardback novels with Amazon Kindle Direct Publishing, some asked if I’d ever get audiobooks created. I always laughed, as it was way beyond my budget, as they typically cost anywhere from $6,000 to $10,000 to produce. Of course, I really did want to create an audiobook.

I even dabbled with the idea of recording it myself in a padded closet, but just didn’t want to deal with listening to my voice or investing in the proper microphone. Plus, my house is always loud with kids and cats, slamming doors and cabinets. I’d have to record at 3AM nightly to get it done.

We used to rent Harry Potter audiobooks on CD from the Abington Township Library. My son loved to listen to them on long travels. They were fantastic, as different talents represented unique characters for the dialogue parts. It really brought the novels to life.

Fiverr and ElevenLabs To The Rescue?Be Home By Dinner Audiobook

As I explored Fiverr.com, I realized some freelancers could produce the novel for about $5,000, but it was still too much. Also, the work of creating a character list of how to pronounce the many names and settings seemed challenging to do over a Zoom meeting.

ElevenLabs.io, which I’m really fond of, was a potential solution. But it would require that I get the $330 / month plan due to the word count. It also required writing out unique words and have them spelled out. For example, a pizzeria called Rosario’s would need to be re-typed to Rose-air-ee-ohs so that the AI tool could understand the correct way to enunciate the word.

A Free Solution?

For ebook syndication, I use Draft2Digital, which I recently discovered is an approved partner of Apple Books, and allows authors to generate audiobooks using Apple Books AI digital narration for FREE! All that you have to do is pick out the ideal voice to represent your book and let the AI tool spend a few weeks with your ebook. It’s currently restricted to categories of romance, fiction, mystery and thriller, or science fiction and fantasy.

When I received notification that Be Home By Dinner was published on Apple Books and selling for $4.99, I smiled because I knew that that there was no way in hell that the AI tool could possibly have nailed the dozens of character names, locations, 1980s pop culture references and more. I questioned how did it make a leap of faith in determining pronunciations.

On initial listen, I enjoyed the character voice that I had chosen. He was suitable for suspense, which was the genre of the novel. But… the “audiobooks without the overhead” definitely had its fair share of issues.

What Fell Apart With The AI Produced AudiobookCharacter Names

I knew this would happen. But some character names, like Kova (the antagonist) took on a different pronunciation at different parts of the book. Sometimes it was Koo-Vah. Other times it was Kah-Vah. And sometimes it was the correct Koe-Vah. The name kept morphing, which I found odd since it’s a simple 4-letter name.

Author Name

Yes, even my name was butchered. Instead of stating Franke with a silent “E”, they included a hard “E”. I felt like I was back in high school during role call. So, I now have a different sounding last name.

What I Miss About A Human NarratorMouth Noises

Yes, it sounds weird and gross, but I missed sounds of the human element. Fake breath noises are not part of the equation yet, let alone lip smacking or air sneaking through teeth. The AI voice is so dry and sterile, with a clockwork tempo. At times you want to rattle the robot and have it take a shot of whiskey to loosen up.

Be Home By Dinner Audio BookAmbient Sounds

The AI voice is precise with perfect audio levels. But I miss the sounds of the room, like pages being turned or a glass of water being put down on a wooden table. The impurities of recordings are often the most endearing. Sting accidentally sitting on the piano during “Roxanne” for example. The Police kept that in the recording. I remember listening to a lot of the Beat Generation authors do readings and hearing the cigarette exhalations and ice cubes tinkling in glasses, or uproarious laughter of someone nearby.

Lively Dialogue

When I read a novel, I create a dialogue voice for each character. It just happens naturally to help break up the reading. With AI narration, the voice adjusts a bit with a conversation between two people, but it sounds like a screenplay read by someone vaguely interested in auditioning for a part in a film adaptation. The emphasis is not as strong, especially for highly emotional scenes of distress.

Correctly Pronounced Words

For heteronyms, the AI tool seemed to work based on a coin toss. For example, the word “tearing” was supposed to be pronounced like “eyes tearing up”, but it was pronounced like “tearing up a piece of paper”. The correct context was picked up by the AI tool sometimes, but not always.

Onomatopoeia is a train wreck. For example, the “psss psss psss” cat call sounds resulted in the narration spelling out each instance of these phrases. I laughed hard on that one.

Regardless of issues, I’m excited for the opportunity have an audiobook at the ready for Be Home By Dinner. Download a copy of it from Apple Books.

Get started with your own Apple Books audio book.

The post Using Apple Books AI Based Digital Narration Tool For Audiobooks appeared first on Carl Franke.

 •  0 comments  •  flag
Share on Twitter
Published on April 12, 2024 19:57
No comments have been added yet.