NATAN FISCHER
← Back to Blog
Published on 2026-06-12

The Reference Track: Why Sending Audio Examples Changes Everything

A reference track audio examples voice over changes everything about your Spanish session. Learn why sending examples gets better results faster.

The Reference Track: Why Sending Audio Examples Changes Everything

A reference track eliminates 80% of the confusion in a voice over session before anyone opens their mouth. That's the conclusion after 20+ years of recording for brands like Google, Ford, and Netflix. The brief that includes an audio example β€” even a rough one β€” produces better results in fewer takes than the brief stuffed with adjectives and creative direction.

Here's why: words fail when describing sound. "Warm but authoritative" means something different to every person who reads it. "Conversational but professional" can describe voices that sound nothing alike. But when you send me a 30-second clip of a voice you like? Now we're speaking the same language.

The Adjective Problem

Creative briefs love adjectives. Confident. Approachable. Modern. Energetic but not too much. Relatable. Authentic.

I've seen these words in hundreds of briefs. And I can tell you exactly how useful they are: barely. A study from the Audio Branding Academy found that the same adjective applied to voice characteristics was interpreted differently by 67% of respondents across different cultural backgrounds. "Friendly" to someone in Chicago doesn't sound like "friendly" to someone in Mexico City.

The voice over artist isn't a mind reader. We're professionals trained to interpret scripts and deliver them with precision. But precision requires specificity, and adjectives aren't specific. They're feelings dressed up as instructions.

Why Reference Tracks Work Where Words Fail

When you send an audio reference track, several things happen simultaneously. First, the artist hears the actual tone you want β€” the pacing, the energy level, the emotional register. Second, they can hear what you don't want by understanding what you chose not to send. Third, and this is underrated, they can identify if what you're asking for even matches the script you've written.

I've had clients send reference tracks that were completely wrong for their script. Not wrong in a subjective taste way β€” wrong in a "this energetic retail voice cannot physically deliver a 45-second script in 30 seconds" way. The reference track revealed a mismatch that would have taken three rounds of revisions to discover otherwise.

According to a 2023 survey by the Global Voice Acting Academy, productions that included audio references reported 40% fewer revision requests than those using text-only briefs. That's not a small efficiency gain. That's the difference between delivering same-day and pushing into next week.

What Makes a Good Reference Track

It doesn't have to be perfect.

Send me a competitor's ad you like. Send me a voice from a documentary that has the right gravitas. Send me a podcast intro that captures the conversational feel you want. Send me a YouTube video where someone nails the tone, even if the production quality is terrible. I'm not listening to the microphone quality β€” I'm listening to the interpretation.

Have you ever tried to explain to someone what cilantro tastes like? You can use words like "fresh" and "herbal" and "bright," but nothing works as well as handing them a leaf and saying "this." Reference tracks are the cilantro leaf of voice over direction.

What doesn't work: sending your last campaign's voice over as a reference when you're specifically trying to change direction. Or sending five completely different references and asking for "something in between." Or sending a reference in English when you need Spanish delivery.

The Spanish-Specific Problem

Reference tracks matter even more for Spanish voice over sessions, especially when the client doesn't speak Spanish. You might think a "warm Colombian accent" is what you want because your friend from BogotΓ‘ has a pleasant voice. But sending me an actual example of Colombian voice over work lets me hear whether you want the specific regional markers or just the general warmth β€” which neutral Spanish could deliver without alienating audiences from other countries.

I've had clients send references in Spanish from Spain thinking it would communicate sophistication to Latin American audiences. The reference track let me redirect before we recorded a single take. Latin Americans don't hear Spanish accents as sophisticated β€” they often hear them as odd or even comical. The reference revealed the misalignment before it became a problem.

The Music Reference Matters Too

Here's something most briefs forget: the music track matters as much as voice references. If you know what music will accompany the voice over, send it. According to research published in the Journal of the Audio Engineering Society, voice over artists who record against the actual background music deliver takes that sync better with the final mix 73% of the time compared to dry recordings.

The music sets the emotional container. A script that seems to call for high energy might actually need restraint if the music underneath is already doing the heavy lifting. A somber script might need more warmth if it's going over sparse piano. The voice over artist reads these signals and adjusts, but only if they can hear them.

This is why I recommend clients send whatever scratch track or temp music they're considering. Even if it changes in post, it gives me the emotional landscape I'm recording into.

What Happens Without a Reference

The session takes longer. (Which, by the way, costs more if you're on a session rate rather than a flat fee.)

Without a reference, we start with the first take β€” which is often the best interpretation of the script as written. Then the direction starts: "Can you make it warmer?" I adjust. "Actually, maybe a bit more authoritative?" I adjust again. "Hmm, something between those two?" Now we're in territory where the client is discovering what they want through trial and error, using my voice as the testing instrument.

This isn't a complaint. Part of my job is helping clients discover what they need. But it's inefficient compared to starting with "sound like this example, adapted to our script." The process that takes 45 minutes without a reference can take 15 minutes with one.

The Reference You Already Have

Sometimes the best reference is the English version of the spot you're localizing. If your English campaign already exists and you're bringing it to Spanish, send me that English audio. I can hear the pacing, the emotional beats, where the emphasis lands. Then I can translate that interpretation into Spanish while accounting for the language being 30% longer β€” which, as I've written about before, means the script itself probably needs trimming.

The English reference isn't about mimicking an English-language sound. It's about understanding what the creative team approved, what worked, and what the brand wants to preserve in the Spanish version.

When You Don't Have a Reference

If you genuinely have nothing β€” no examples you like, no previous work in the same vein, no competitor spots that capture the tone β€” that's fine. It happens, especially with new brands or new campaign directions.

In those cases, I'll record 2-3 variants in the first pass. Different energy levels, different emotional registers. You'll hear options and recognize which direction feels right when you hear it, even if you couldn't articulate it beforehand. This is slower than working from a reference, but it's honest. Some clients discover what they want only by hearing what they don't want.

The 2-3 variants approach is what agencies and production houses do constantly. It's also why working directly with an experienced voice over professional beats casting on platforms β€” you get nuanced options from someone who understands how to vary delivery meaningfully, instead of 100 auditions from people gaming an algorithm.

Send the Example

That's really all I'm asking. Before your next Spanish voice over session, find an audio example that captures something β€” anything β€” about what you want. A tone. A pace. An energy level. A specific voice you've heard elsewhere.

Attach it to the brief with a note saying "something like this" or "the energy of this, but less intense" or "this pacing, different accent." You don't need to be precise. You just need to give me something to listen to instead of something to interpret from adjectives.

The session goes faster. The results get closer to what you imagined. And neither of us spends 20 minutes trying to figure out what "approachable but sophisticated" means to you specifically.

Need a Spanish voice over for your next project? Get in touch and I'll get back to you within the hour.

Get in touch

ShareXLinkedInFacebook

Related articles