Guide · Plugins

How voice changers actually work

A voice changer can turn you into a robot, a stranger, or a deep movie-trailer narrator. Underneath all of it sit a couple of simple ideas. Once you get those, every voice effect you have ever heard makes sense, and you know exactly what to reach for.

What it actually does

A voice changer is software that transforms the sound of a voice while keeping the words clear, so the listener hears a different voice saying the same thing.

It is not lip-sync trickery, and it is not cloning someone else's voice from a sample. That is a different animal. A voice changer takes the voice you already have and reshapes how it sounds. Same words, same timing, different character coming out the other end.

Think of the voice as raw clay. The changer does not write new sentences for you, it just remolds the shape of what you say into something higher, lower, rougher, or flat-out cartoonish. The meaning stays put. The sound is what moves.

Pitch and formant, the two that matter

Pitch is how high or low a voice sounds. Formant is the resonance that makes a voice sound like a particular body and throat. Moving them together is what makes a changed voice believable.

Pitch is the easy one. Slide it up and you get higher, slide it down and you get deeper. Most people stop there, and that is exactly why most cheap voice effects sound fake. Move pitch on its own and you get the chipmunk-at-the-wrong-speed sound, the one that screams "this is a filter."

Formant is the part nobody talks about and the part that does the heavy lifting. It is the resonance of a voice, the quality that tells your ear this came out of a big chest or a small one. The first time I cranked my pitch up on a call I sounded like a startled gerbil, but the second I brought the formant along with it, I turned into a believable higher voice that was clearly a real person, just not me. That pairing is the whole trick.

Live, or after the fact

Real-time voice changing transforms a voice live as you speak. Studio voice changing applies the effect to a recording after it is captured. Same processing, different moment.

Real time is the one you want for anything happening now: a live call, a stream, gaming, a Discord channel. Your voice goes in, the changed voice comes out instantly, and nobody on the other end ever hears the original.

Studio is the after-the-fact version. You record first, then drop the effect onto the clip while you edit a video or clean up a podcast. Nothing has to happen in the moment, so you can take your time and fine-tune until it is perfect. The processing under the hood is the same idea either way. The only real difference is whether it runs in the moment or on the recording.

The effects people actually reach for

Strip away the names and most voice effects fall into three buckets:

  • Character voices. Robot, monster, alien, ghost, chipmunk, deep trailer narrator. These lean on big pitch and formant moves plus a bit of texture to sell the character.
  • Through a device. A voice pushed through a telephone, an old radio, or a megaphone. Here the trick is not changing who is talking, it is changing what they sound like they are talking through.
  • Gentle leans. Nudging a voice a little more feminine or more masculine without going cartoonish. This is the subtle end, and it lives or dies on getting the formant right, not just the pitch.

Once you know it is mostly pitch, formant, and a little texture under the hood, you stop seeing a giant menu of mystery effects and start seeing a few knobs arranged a hundred different ways.

Hiding a voice is its own job

There is one use that is less about fun and more about protecting someone. Anonymizing a voice means reshaping it so you cannot tell who is speaking, while keeping every word clear. It uses the same building blocks, but the goal flips: instead of building a fun character, you are erasing an identity.

It is a real, useful thing and it has its own quirks, so it gets its own guide: how to anonymize a voice.

What makes a good one

A good voice changer gives you real control over pitch and formant, runs at low latency if you need it live, works inside the apps you already use, and does its processing on your own machine.

Low latency matters the second you go live. If there is a noticeable delay between you talking and the changed voice coming out, calls and streams fall apart. For studio work after the fact, latency does not matter at all, so do not pay for it if you do not need it.

The other thing to check is where your audio goes. A tool that processes right there on your computer keeps your voice on your computer. One that ships your audio off to some server to do the work is doing more with your voice than you might want. For a tool that is partly about privacy, that is not a small detail.

We built one. It is called Incognito.

Incognito is a voice changer for Mac that loads right inside the editing and audio apps you already use. Real control over pitch, formant, character voices, classic device looks, and a built-in tool for hiding a voice. It does its work on your own machine. Free trial, no time limit on testing it.

Get Incognito

Free trial available.

It looks like magic until you learn the two knobs, and then it is just two knobs.