TOEIC Part 3 Conversations: 39 Questions, 13 Dialogues, and the 8-Second Preview That Saves Your Score
You hear the narrator say "Questions 32 through 34 refer to the following conversation." A woman starts talking about a coffee machine that isn't working. Forty seconds later, she and a service rep have discussed the model, the warranty, and a pickup time — and you realize you've been listening so hard that you never actually read the three printed questions. You mark three best guesses, and you never get to hear that conversation again.
This is how most candidates lose points on Part 3. The audio isn't too fast. The vocabulary isn't too hard. Part 3 isn't really a listening test — it's a listening-plus-reading-plus-predicting test, and candidates who treat it as pure listening leak points on every set.
Part 3 is also the single largest section on the TOEIC Listening test. Thirty-nine questions out of 100 — nearly 40% of your Listening score — hang on how you handle thirteen short conversations. If you target 400+ on Listening, Part 3 is where that score is built.
What Part 3 Actually Looks Like
Part 3 sits between Part 2 (Question-Response) and Part 4 (Short Talks), and it is the first Listening section where you deal with multi-question sets instead of single-item questions.
| Feature | Part 3 Detail |
|---|---|
| Conversations | 13 |
| Questions per conversation | 3 |
| Total questions | 39 |
| Speakers per conversation | 2 or 3 |
| Times audio is played | Once only |
| Where questions appear | Printed in the test book |
| Where options appear | Printed in the test book |
| Where the conversation appears | Audio only (no transcript) |
| Accents | American, British, Canadian, Australian |
| Typical conversation length | ~30-45 seconds |
| Section position | After Part 2, before Part 4 |
Three details deserve special attention. The conversations are audio-only — never printed, gone the instant the narrator moves on. The questions and options are printed — every stem and every (A)(B)(C)(D) option is visible in your test book before the audio starts, and this is the single biggest lever you have. You hear each conversation exactly once — no replays, no pausing. If your attention lapses for eight seconds, you've probably lost one of the three questions.
Why 3-Question Sets Change Everything
In Part 2, you heard a single question and picked one of three responses — self-contained, one shot. In Part 3, every set of audio feeds three questions that almost always test three different points in the conversation. The reason for the call might be Q32. A detail mentioned ten seconds later might be Q33. Something offered near the end might be Q34.
If you walk into the audio without knowing what to listen for, you are relying on your memory to hold the whole conversation for three separate retrievals. That doesn't work. After forty seconds of listening with no goals, most of the specific information is already fading.
Candidates who score well on Part 3 do the opposite: they read the questions first, form a mental listening agenda, and let the audio confirm predictions they've already set up. The audio becomes a targeted search, not a memory contest.
The Five Question Types in Part 3
Every Part 3 question falls into one of five types. Recognizing the type from the question stem tells you exactly where in the conversation the answer is likely to appear.
1. Main Topic / Purpose
These target the reason the conversation is happening — "What is the conversation mainly about?" / "Why is the woman calling?" / "Where most likely are the speakers?"
The answer almost always comes from the first 5-10 seconds. The opening line usually announces the reason for the call, the location, or the topic. In the ETS sample where a woman calls about a broken coffee machine, Q32 ("Why is the woman calling?") is answered by her first sentence: "I'm calling because our office coffee machine has stopped working." If you miss the first two lines, your odds on the main-topic question drop dramatically.
2. Detail Questions
These target a specific fact — "What does the man ask the woman about?" / "When will the order be delivered?" / "How many units does the customer want?"
Detail questions usually come from the middle of the conversation. The information is stated once, often buried in less-important context. In the coffee-machine conversation, Q33 asks "What does the man ask the woman about?" and is answered when the service rep asks about the model number. One sentence in a 40-second conversation is all you get.
3. Action / Offer Questions
These target what a speaker will do, offers to do, or asks the other person to do — "What does the man offer to do?" / "What will the speakers most likely do next?"
The answer almost always comes from the final third of the conversation. Offers and next steps appear at the resolution stage. In the coffee-machine sample, Q34 ("What does the man offer to do?") is answered near the end when the service rep offers to send a technician. The cue word "offer" in the stem tells you to listen for modal phrases like "I can," "I'll send," "Would you like me to."
4. Inference / Implied-Meaning Questions
These are the hardest type in Part 3. They test whether you understand why a speaker said something — "What does the woman mean when she says, '__________'?" / "What does the man imply about the project?"
The quoted phrase is printed in the stem, so you know which line to listen for extra carefully. But the answer is never the literal meaning — it is the function of the line. If a speaker says "I've been with this company for twenty years" in response to whether they can handle a difficult client, the implied meaning is "I have plenty of experience; I can handle it." Train yourself to ask: "Why did the speaker bother to say that?"
5. Graphic Integration Questions
Some Part 3 sets include a small visual — a table, chart, schedule, floor plan, map, invoice, or organizational chart — printed next to the questions. One of the three questions will reference the graphic directly: "Look at the graphic. What size screen will the man order?" These require combining information from the audio with the printed visual, and deserve their own section.
Graphic Integration Questions: A Deep Dive
About 2-3 of the 13 Part 3 conversations include a graphic. These items are among the most predictable questions on the entire test — once you understand how they are designed.
How the Design Works
Graphic integration questions follow a consistent pattern: the speakers never name the answer directly. Instead, they mention a constraint — a price limit, a time limit, a department name, a floor number — and you cross-reference that constraint with the graphic.
Consider the ETS sample with an office-equipment order. The graphic is a small table:
| Screen Size | System Price |
|---|---|
| 13 inches | $850 |
| 15 inches | $950 |
| 17 inches | $1,100 |
| 19 inches | $1,300 |
Q41 ("What does the woman ask the man to do?") and Q42 ("What problem does the man mention?") are answered from the audio alone. Q43 reads "Look at the graphic. What size screen will the man order?"
The key sentence in the audio is roughly: "I want the largest screen we can get, but I have a maximum budget of $1,000 per system." Neither speaker says "15 inches" or "17 inches." You apply the constraint ($1,000 max, largest possible) to the table: 17-inch at $1,100 is over budget; 15-inch at $950 is the largest that fits. Answer: 15 inches.
Common Graphic Types
Any of these can appear in Part 3: price tables, meeting schedules, floor plans, invoices and receipts, organizational charts, maps, email subject headers, coupons and discount tables, flight or train schedules, weekly calendars.
The Graphic-Preview Habit
Because graphic items are so predictable, your preview window needs to include the graphic itself. When you see a table of sizes and prices, ask yourself: what constraint would make one of these rows the answer? Usually it will be a budget, a size limit, a time preference, or a named person — and you can almost always predict which column or row the conversation will target before the audio starts.
The 8-Second Preview Window
The single most powerful strategy on Part 3 is exploiting the pause between conversations. The narrator announces "Questions 32 through 34 refer to the following conversation" and a short audio beat passes before the conversation begins. Between sets, the narrator also reads each question aloud after the conversation ends, creating another 8-10 seconds per question where you can read ahead.
Practically this gives you roughly 8 seconds before the conversation to scan the three question stems, plus 8-10 seconds between each question after the audio, during which the narrator is reading the question you've already answered. Used well, these windows mean you arrive at every new conversation already knowing what to listen for.
How to Use the Preview Window
With eight seconds you can't read every word of every option. Train this sequence:
- Read the three question stems first. Ignore the options at this stage. You want to know the three listening goals.
- Tag each question mentally — "main topic" / "detail about X" / "inference on the quoted line." The type tells you where in the audio the answer will appear.
- Scan options only if time remains. Noun-heavy options (places, products, times) give you specific words to listen for; verb-heavy options tell you what kind of action to expect.
- If a graphic is present, scan it too. Identify what data is in it and guess what constraint could drive the question.
Done well, this preview converts Part 3 from a listening-comprehension task into a confirmation task. The audio just confirms which predicted answer is correct.
When You Fall Behind
The worst Part 3 mistake is to let one set leak into the next. If you missed Q34, do not burn the pause window trying to recover it. Mark a best guess, move on, and protect your preview window for Q35-37. The rule: always preview the next set, even at the cost of abandoning the last one.
3-Speaker Conversations: Tracking Who Said What
Since the 2016 format update, TOEIC Part 3 includes conversations with three speakers. You will typically see 2-3 of these among the 13 conversations. They are harder than 2-speaker exchanges for one reason: speaker identification is now a question in itself.
The Speaker-Tracking Challenge
In a two-speaker conversation, every line is obviously the woman's or the man's, so stems like "What does the woman offer to do?" are trivial to scope. In a three-speaker conversation, you might have two men and one woman. Stems become "What does the second man suggest?" / "What does the woman ask Richard to do?" / "Why does one of the men apologize?" — forcing you to track three voices, remember which named person said what, and map that to a question that may reference speakers by gender, role, or name.
Strategies for 3-Speaker Sets
Listen for names in the opening lines. Three-speaker conversations almost always begin with speakers addressing each other by name. "Hi, Richard. This is Karen, and I've got Marcus here from IT on the line." If you capture those names in the first five seconds, you can mentally tag each voice before the discussion starts.
Distinguish voices by pitch and accent. Even when speakers don't use names, a 3-speaker set pairs voices that are audibly distinct — one American man with one British man, or one woman with two differently-pitched men. Lock onto timbre as quickly as you lock onto content.
Use the question stems to narrow focus. If Q35 asks specifically about "the second man," the woman's contributions are less critical to that question. Use the printed stems to allocate listening attention during the audio.
Common Distractor Patterns
Part 3 options are crafted to catch candidates who listen at the word level instead of the meaning level. Three trap patterns dominate.
Trap 1: Extracted-Phrase Traps
The option uses a word that was literally said in the audio but misrepresents its meaning. If a speaker says "We won't be able to finish before Friday," a trap option might read "The speakers will finish on Friday" — "Friday" matches, but polarity is inverted. A word appearing in both audio and option is not evidence the option is correct; always verify the full meaning matches.
Trap 2: Wrong-Speaker Attribution
The option describes something that genuinely happened — but attributes it to the wrong speaker. If the question asks "What does the woman offer?" and the man is the one who offered to send a technician, an option saying "Send a technician" is a trap unless the woman also offered it. Always verify the speaker named in the stem is the one who performed the action.
Trap 3: Time-Reference Swaps
The option shifts an action's time reference. If a speaker says "I have ordered the supplies," an option reading "The man will order the supplies" flips a completed past action into a future plan. When options differ by verb tense or time adverbials ("yesterday" vs. "tomorrow," "has already" vs. "will soon"), the time reference is almost certainly where the test is hiding the distinction.
Trap 4: Over-Generalization
The audio describes one specific case ("the northeast branch office"); the option generalizes to all cases ("all branch offices"). If the audio qualifies something with "some," "occasionally," "in one case," the correct option carries the same qualifier — over-generalization traps inflate the scope of what was actually said.
Part 3 Is Where Listening Scores Are Won or Lost
Thirty-nine questions is almost 40% of the Listening test. If you get 30 of 39 Part 3 items right instead of 20, that's ten extra questions — a meaningful bump that moves candidates from the 350-range into the 450+ range.
The mechanics are in your favor. Part 3 conversations are short (under a minute). Topics are workplace-routine and within everyday vocabulary. The question types are a small, predictable set. Options are printed in your test book. And you get a preview window before every set.
What separates 400+ scorers from 300-range scorers on Part 3 is almost never raw listening ability — it's process discipline. Previewing, tagging question types, scanning graphics, tracking speakers, and refusing to dwell on missed items. These habits are learnable, and every practice conversation is a chance to reinforce them.
Plan to spend more Listening practice time on Part 3 than on any other section. Drill at least one full 13-conversation block per session, and after each block, classify every missed item: preview failure, speaker-tracking failure, distractor trap, or genuine listening gap? Different diagnoses need different fixes, and Part 3 improvement comes from matching the right fix to the right leak.
Ready to practice Part 3 with the real format? ExamRift offers full TOEIC Listening mock sets that mirror ETS pacing — 13 conversations per block, authentic graphic-integration items, 3-speaker dialogues with named participants, and the same American/British/Canadian/Australian accent rotation you'll hear on test day. Every conversation comes with transcript, distractor-pattern breakdowns, and question-type tagging so you can see exactly where your points leaked. Start your first free Part 3 block today and find out whether the 8-second preview habit is already second nature or the next thing you need to build.