Bernard Henusse in a lively discussion with a student over French materials in a well-lit room.

Why Tu and Tout Sound Identical to You

A specific French problem

Why Tu and Tout Sound Identical to You

The most common French phonetic confusion for English speakers, what’s actually happening in your brain, and why mouth-shape tutorials don’t fix it.

By Bernard Henusse, founder of Real French. Kerfiac, Brittany. Teaching spoken French through phonetic ear training since 2008.

Eleven-minute read. About 2,277 words.

In this article

A 30-second test

Listen to these two French words. Are they the same word, or different?

Tap each one. Listen carefully. Then keep reading.

Word A is tu (you). Word B is tout (everything). Two words, two meanings. To French speakers, they sound nothing alike.

Can you hear the difference between tu and tout?

Most English speakers can’t, on the first attempt. If you played the two clips above and they sounded like the same word said twice, or close enough that you couldn’t reliably tell them apart, you are not bad at French. You have a specific, well-documented perceptual problem, and it has nothing to do with effort, attention, or vocabulary.

The vowel in tu is the French phoneme /y/. The vowel in tout is /u/. Two distinct sounds in French. Two completely different words. They are what linguists call a minimal pair, two words that differ by a single phoneme, and they appear in ordinary French sentences hundreds of times a day. Tu as tout vu. You have seen everything. If you can’t hear the difference, you can’t parse the sentence.

English doesn’t use this distinction. English has one high rounded vowel category, the /u/ in too or shoe, and your brain assigns both French sounds to that single English bucket. Tu sounds like tout because your perceptual system never built a separate category for /y/. Until it does, no amount of vocabulary, conjugation, or conversation practice will fix the gap.

What’s actually different between /y/ and /u/?

Tongue position. Both vowels round the lips. Both are high vowels (the tongue is raised toward the roof of the mouth). The difference is whether the tongue is at the front of the mouth or at the back.

For /u/ (the vowel in tout), the tongue sits at the back of the mouth, the way it does in English too. Your lips round. The sound comes out low and dark.

For /y/ (the vowel in tu), the tongue sits at the front of the mouth, the way it does in English see, but your lips round into the position they take for /u/. Front-tongue, rounded-lips. The sound comes out brighter, higher, almost like a strangled ee. English doesn’t combine those two articulatory features anywhere in its phoneme inventory, which is why your tongue and brain don’t naturally produce it.

There are a hundred YouTube videos that explain this exact mechanical instruction. We have an article you could read about it. None of them, on their own, fix the problem. Here’s why.

“You can’t reliably produce a sound your brain doesn’t perceptually distinguish. Production follows perception, not the other way around.”

Why doesn’t “round your lips and say ee” work?

Because you can’t reliably produce a sound your brain doesn’t perceptually distinguish. Production follows perception, not the other way around. If your auditory system files /y/ and /u/ in the same bucket, you have no internal target to aim at.

Picture trying to throw a dart at a board you can’t see. Someone tells you the bullseye is twenty centimetres up and ten to the left. You throw, miss, and ask if you got close. They say “not quite.” You adjust by feel, throw again, miss again. After a hundred throws you might land near the centre by accident, but you can’t learn to aim, because nothing in your sensory feedback tells you which direction to correct.

That is what mouth-shape tutorials do for the /y/ sound. They hand you instructions about the motor side of the action, but they don’t fix the perceptual side. You produce something, your tutor says it’s almost right, you have no way to know what almost-right sounds like in your own head, so you can’t calibrate. After two years of well-meant lessons, your /y/ is still your English /u/ wearing different lipstick.

What’s actually happening in your brain?

Your auditory system is doing exactly what evolution designed it to do. It’s saving you cognitive effort by sorting incoming sound into the small set of categories that mattered for the language you grew up in. The technical name for this is categorical perception, and it has been studied for decades. Patricia Kuhl at the University of Washington is one of the major figures in the research.

Babies are born able to distinguish every sound contrast used in every language on earth. Around age seven or eight, the brain finalizes the phoneme inventory of the native language and stops perceptually distinguishing the rest. The unused contrasts collapse into the nearest available category. From that point forward, your brain is no longer hearing the contrast; it’s pattern-matching to the buckets it built in childhood.

For English speakers learning French, this means a handful of French phonemes that English doesn’t use are now invisible. /y/ is one. /ø/ (the vowel in peu) is another. The three nasal vowels in un, on, and en blur into a single approximate sound. We teach 14 distinct French vowel sounds at Real French. English speakers reliably hear about six of them without training. The other eight are inaudible to your auditory system, which throws them away as duplicates of sounds it already knows.

A linguistically curious aside. The same phenomenon is why Japanese speakers struggle with English /r/ and /l/ (Japanese has a single liquid phoneme that English splits into two), and why English speakers struggle with the four Mandarin tones (English doesn’t use pitch contour to mark word identity). Categorical perception is universal across human languages. The specific contrasts you can’t hear depend entirely on which one your brain locked in first.

Why can’t a native French speaker easily teach you to hear /y/?

Because they never had to learn the contrast. They were born into it. The /y/ vs /u/ distinction was wired into their auditory system before they were two. They can produce both sounds correctly, hear the difference effortlessly, and tell you when your version is wrong. They cannot show you how to retrain a perceptual system, because they never had to retrain theirs.

This is no criticism of native-speaker tutors. They are excellent at correcting your output, expanding your vocabulary, conducting French conversation, and modelling natural speech. Most language learning needs exactly this. But ear retraining is a different discipline. It requires someone trained in second-language phonetics who has either taught the contrast hundreds of times to anglophone learners and watched the perceptual system rebuild, or has gone through it themselves as a learner of a foreign phoneme inventory. That kind of training is rare. Most italki and Preply tutors don’t have it.

It’s also why apps like Duolingo can’t fix the problem. The app plays /y/, plays /u/, asks you to identify which is which, you tap a button, and the app tells you whether you guessed right. Guessing right is not hearing right. After a hundred trials your accuracy may improve, but what improved was your guessing strategy. The perceptual category never split.

Curious which of the 14 French sounds your ear is filtering out?

The same diagnostic Bernard runs with every student on day one. Eight pairs of French words. Three minutes, free, no commitment.

What does retraining your ear for /y/ actually look like?

Hours of corrected production with a trained ear listening, in person, sustained over consecutive days. The work is unglamorous and old-fashioned and there is no software shortcut.

In our home in Brittany, the sequence with a new student typically goes like this. We sit across a small table. I produce tu, then tout, several times each. The student listens. Then they produce. I tell them which one I just heard, irrespective of which one they intended. We do this for forty minutes, then we move to the related minimal pair du / doux. Then to su / sous. Then to vu / vous. Eight or nine pairs through the same contrast in a single morning.

By lunch, most students start hearing the difference. By the end of the first day, they hear it consistently. By the end of three days they hear it without thinking, and they can produce it without rehearsing in their head. The window is open. Once the perceptual category has split in their brain, no further training is required for that contrast. The other thirteen French sounds are sitting next to it, waiting to be addressed the same way.

This is what the 14 Sounds Experience is built around. Three days, eighteen hours of one-on-one work, in our home, all meals and accommodation included, from €1,900. The same method underlies the longer programmes on the pricing page, and the broader pedagogy that combines all three pillars (the 14 Sounds, oral comprehension, and oral interaction) lives on the method page.

If tu and tout still sound like the same word and you’ve been studying French for years, this is the thing you’ve been missing. Not effort, not vocabulary, not motivation. The perceptual ground floor.

Common questions

How long does it take to learn to hear /y/ vs /u/?

Most students start hearing the contrast within the first three to four hours of focused one-on-one work. Consistent perception, where you don’t have to concentrate, follows by the end of the first day. Permanent automatic perception, where the contrast is wired in for good, settles in over the following two days and the weeks of casual exposure that follow. The brain consolidates fastest when the initial training is intensive (hours per day for several consecutive days), then is allowed to ride on natural exposure afterwards.

Can I learn this online or do I have to come to France?

Online phonetic theory courses are useful preparation. They explain mechanically what /y/ requires and give you the vocabulary to talk about the problem. They cannot retrain your perception on their own. The work that splits the perceptual category requires a trained ear listening to your specific output in real time, catching the moment your tongue slides back and your /y/ collapses into your familiar /u/. That feedback loop has to run for hours, sustained, with someone in the room with you. Through a screen, with the latency and audio compression involved, it’s genuinely difficult to do at the precision level required.

I can pronounce tu correctly when I think about it. Do I still need ear training?

Probably yes. There’s a difference between conscious production and automatic perception. Many learners can produce a passable /y/ when they slow down and focus on the mouth shape, then slip back into /u/ five seconds later in conversation when their attention moves to vocabulary or grammar. The ear training rebuilds the perceptual category, which means the right sound becomes automatic in production too. If you have to think about it, the category isn’t fully there yet.

What other French sounds do English speakers struggle with?

The /y/ vs /u/ contrast is the most common one to surface, but it’s one of about eight that English speakers reliably miss. The others include /ø/ (the vowel in peu), /œ/ (the vowel in brun), the three nasal vowels in un, on, and en (which most anglophones flatten into a single nasalised sound), the open/close mid-vowel pair /e/ vs /ε/ (which distinguishes pêcheur from pécheur, fisherman from sinner), and the open/close /o/ vs /ɔ/ pair. Our flagship 14 Sounds page walks through the full set with audio examples.

Is this the same problem as the Japanese r/l issue?

Yes. Same cognitive mechanism, different phonemes. Japanese has a single liquid consonant where English has two (/r/ and /l/), so Japanese speakers learning English famously struggle to distinguish rice from lice. The neurological process is identical to what happens to English speakers with French /y/ and /u/. It is also the same reason Mandarin speakers can struggle with English voiced/voiceless contrasts, and English speakers can struggle with the four Mandarin tones. The brain locks in the inventory of distinctions it needs around age seven, and the unused contrasts go quiet. Retraining is possible, but it takes deliberate phonetic work, not exposure alone.

I’m in my fifties. Am I too old to retrain my ear?

No. The neuroplasticity literature is clear that the relevant categorical-perception circuits remain rebuildable in adulthood given the right input. Our oldest student to date came at seventy-three and rebuilt the /y/ perception within the same three-day window as a thirty-year-old. What changes with age isn’t the underlying capacity, it’s the amount of childhood-installed interference your brain has to work past. The training is the same. The intensity required is also the same. Three days of focused work resets the perceptual ground floor at any age.

Related reading

Why Can’t I Hear French?

If you’ve studied French for years and still can’t follow a conversation, the problem isn’t effort or vocabulary. Your ear was never trained.

Eleven-minute read.

Why Your Teacher Never Fixed Your Pronunciation

Years of French lessons and your accent has barely moved? It’s almost never the teachers. It’s the format.

Nine-minute read.

Why Do French People Switch to English When You Speak French?

It is not rudeness or your accent. French people switch to English because their brain is working too hard to parse blurry vowels.

Eleven-minute read.

Last reviewed by Bernard Henusse on 2026-04-26.

Bernard Henusse, founder of Real French

Bernard Henusse · Founder, Real French

Bernard Henusse founded Real French in Brittany in 2008. He has trained 400+ adult students from 30+ countries in phonetic ear training, including diplomats, professors, journalists, and Nobel laureate Eric Kandel.

Read more about Bernard →

If something here rang true

Take the three-minute diagnostic to find out which of the 14 French sounds you’re missing. Or book a free fifteen-minute call with Bernard to talk through where you actually are with French and what would close the gap fastest.

400+ students from 30+ countries · 18 years of teaching · Kerfiac, Brittany.