TOEIC Part 2 Question-Response: 25 Questions, Three Spoken Options, and Why It's Harder Than Part 1

TOEIC Part 2 Question-Response: 25 Questions, Three Spoken Options, and Why It's Harder Than Part 1

You heard the question clearly. You heard option (A), then (B), then (C). And somewhere between the question and (B), your brain went blank. Which one answered the question? Was it the one that repeated "fax"? You bubble (B) and move on, already two seconds behind.

Part 2 of TOEIC Listening looks trivial on paper — one question, three short replies, pick the best. Yet candidates routinely lose more points on Part 2 than on Part 1, and strong readers sometimes find it the hardest section on the Listening test.

The reason is simple: Part 2 is the only part of the test where nothing is printed except the answer bubbles A/B/C. You hear the stimulus once, you hear the three choices once, and you answer. If your ear drifts for two seconds, the evidence is gone.

What Part 2 Actually Looks Like

Part 2 is the second section of TOEIC Listening and contains 25 items. For each item, you hear a spoken question or statement followed by three spoken responses labeled (A), (B), and (C). Your job is to pick the response that best answers what you heard.

Feature Part 2 Question-Response
Items 25
Section time ~9 minutes (within the ~45-minute Listening block)
Stimulus One spoken question or statement
Options Three spoken responses (A, B, C)
What is printed Only the answer bubbles — A, B, C
Scoring 1 point each, machine-scored
Accents American, British, Canadian, Australian
Replays None — you hear everything once

Notice the critical line: only the answer bubbles are printed. No question stem, no option text, no visual anchor. This single fact explains why Part 2 behaves so differently from every other section on the test.

Why Part 2 Is Deceptively Hard

On Part 1 (Photographs), the image gives you a running check: does this sentence describe what I see? On Part 3 (Conversations) and Part 4 (Talks), printed questions and options let you re-read, eliminate distractors, and keep context visible.

Part 2 strips all of that away. You cannot re-read because there is nothing to read. You cannot compare options side by side because they arrive in sequence, and by the time (C) finishes, (A) is already fading. You cannot even confirm the question stem — there is no printed version.

This means Part 2 rewards a different skill set: working memory, reflex recognition of the question word, real-time elimination, and trust in ear-level evidence with no visual fallback.

The good news is that Part 2 has structure. The questions fall into five predictable types, the wrong answers fall into three predictable families, and both are trainable.

The Five Question Types

Almost every Part 2 item belongs to one of five categories. Classifying the stimulus in the first two seconds is the single biggest accuracy boost on this section.

1. WH-Questions (Who, What, When, Where, Why, How)

WH-questions ask for specific information. The correct response provides it — often directly, sometimes obliquely.

Q: "Where's the new fax machine?" (A) Next to the water fountain. ← correct (B) I'll send a fax tomorrow. (C) By Wednesday.

Where locks in the response shape: a location. (A) gives a location. (B) repeats "fax" but describes an action. (C) gives a time, which would answer when, not where.

Indirect WH-responses also appear. "Where is the meeting room?" might be answered with "It's the first room on the right" — not a formal address, but clearly a location. Your ear is looking for answer category, not a grammatical frame.

"I don't know" responses are valid too. A WH-question expecting a time or place can legitimately be answered with "I'll check" or "Let me find out." Acknowledging uncertainty is a normal response, and the test treats it as correct when the other two options are clearly wrong.

2. Yes/No Questions

Questions starting with do, does, did, is, are, can, will, have invite a yes-or-no answer — but on TOEIC, the ideal response often skips yes/no entirely.

Q: "Martin, are you driving to the client meeting?" (A) Oh, would you like a ride? ← correct (B) Nice to meet you, too. (C) I thought it went well!

A literal answer would be "Yes, I am" or "No, I'm taking the train." But (A) is correct because it answers the implied follow-up ("Can we coordinate?") without wasting words on yes/no. This is how native speakers actually talk, and TOEIC reflects that.

Train yourself to accept responses that imply the yes/no through follow-up content. "Oh, would you like a ride?" only makes sense if Martin is driving — so the response implicitly confirms yes.

3. Alternative / Choice Questions

Choice questions use or to offer two options. The correct response names one choice (or occasionally rejects both).

Q: "Is the report due today or tomorrow?" Correct shape: "Tomorrow morning." / "I think it's Friday, actually." / "Let me check the email."

Wrong shape: "Yes, it is." A yes/no answer to an or question is almost always a distractor. Choice questions are usually the easiest type to classify, but they are also where tired test-takers slip — the yes/no distractor is a tempting default when your ear is fatigued.

4. Tag Questions

Tag questions end with a short appended clause: aren't you?, isn't it?, didn't he?, right?. They ask for confirmation or denial of a statement the speaker already believes.

Q: "You're coming to the meeting, aren't you?" Correct shape: "Yes, at two o'clock." / "Actually, I have a conflict." / "I wouldn't miss it."

Tags can arrive quickly and quietly — often reduced to "right?" or swallowed in the rhythm. If your ear misses the tag, you may mis-classify the stimulus as a statement and miss the fact that it wants confirmation.

5. Indirect Requests and Statements

This category confuses the most test-takers, because the stimulus is not a question at all. It is a statement or announcement that invites a reply.

Statement: "Mariko announced that she's retiring in April." (A) How many did you count? (B) I'm not tired at all. (C) Right, she's been here twenty-five years. ← correct

Nothing in the stimulus ends with a question mark. It is simply news. The correct response acknowledges the news and adds a natural follow-up. (B) is a sound-based distractor (tired/retiring). (A) is off-topic. (C) is what a coworker would actually say.

Indirect statements also include invitations, offers, complaints, and observations. The moment a statement arrives, shift your listening from "what's the answer?" to "what would a colleague naturally say in reply?"

The Three Distractor Families

Part 2 wrong answers are not random. They are engineered around three specific traps, and once you recognize the pattern, the distractors become audible in real time.

Family 1: Same-Word Repetition

The wrong option repeats a word from the stimulus. Your brain hears the familiar word and assumes relevance. It is not.

Q: "Where's the new fax machine?" (B) I'll send a fax tomorrow.

"Fax" appears in both, but the option describes sending a fax (an action), not the location of the machine. The repetition is bait. A correct Part 2 response often contains no words from the stimulus at all.

Training habit: when you hear a repeated word, raise a mental flag and treat the option as guilty until proven innocent.

Family 2: Similar-Sound Distractors

The wrong option contains a word that sounds like a word in the stimulus but has no related meaning.

  • "copy" in the question → "coffee" in the option
  • "retiring" in the statement → "tired" in the option
  • "violin" in the question → "volume" in the option
  • "expense" → "expect"
  • "fair" → "fare" → "fear"

Q: "How well does Thomas play the violin?" (A) Sure, I really like it. (B) Oh, he's a professional. ← correct (C) I'll turn down the volume.

(C) echoes "violin" with "volume" — similar sound, related topic, but no answer to "how well." The test exploits the brain's tendency to latch onto phonetic similarity as a proxy for relevance.

Training habit: when two words share a syllable but live in different semantic worlds, it is almost always a trap.

Family 3: Wrong-Tense / Wrong-Category Answers

The option provides the right kind of information for the wrong question. A question about completion gets answered as if it were about a future plan. A question about place gets answered with a time.

Q: "Where's the new fax machine?" (C) By Wednesday.

"By Wednesday" would answer "When is the new fax machine arriving?" But the question asked where. The option is well-formed English attached to the wrong question.

Wrong-tense traps are especially common: past-tense question ("Did you finish?") gets a future-tense option ("I'll start tomorrow"); present-continuous ("Are they meeting now?") gets a past-tense option ("They met yesterday").

Training habit: after each correct answer in practice, state the category match out loud — "asked for a location, got a location." This keeps category-matching at the front of your listening.

When the Best Answer Doesn't Start With Yes or No

This is the single most common point of failure on Part 2, so it deserves its own section.

On Part 2, a yes/no question is often best answered by an option that never says yes or no. The correct response acts as if the yes/no is obvious and moves straight to a follow-up.

"Would you like a ride?" only makes sense if Martin is driving. The response confirms yes by implication and does something useful with that information (offers a ride) in the same breath.

Other common patterns of implicit yes/no:

  • Q: "Have you finished the report?" → A: "I'll send it by five." (implicit yes/soon)
  • Q: "Is the projector working?" → A: "IT just fixed it." (implicit yes)
  • Q: "Do we have more envelopes?" → A: "I'll check the supply closet." (implicit "let me find out")
  • Q: "Can you attend the Thursday meeting?" → A: "What time does it start?" (implicit "possibly, need more info")

If you are mentally filtering for "yes" or "no" at the start of each option, you will reject the correct answer on a significant number of items.

How to Prepare: Reflex Listening, Not Academic Listening

Part 2 rewards a different kind of listening than most classroom instruction builds. Academic listening — the kind you do for TOEFL lectures — trains you to hold long structures, take notes, and reason about argument flow. Part 2 asks for the opposite: rapid, shallow, reflex-level pattern matching.

Shadowing and Repeat Practice

Shadowing means repeating what you hear, out loud, at near-real-time speed. For Part 2, shadow the stimulus — not the options. Your goal is to internalize the rhythm of TOEIC question stems so your ear parses them without conscious effort.

Play a stimulus, repeat it at full speed, check the transcript, move on. Do 20-30 items per session. After a few weeks, question stems arrive pre-parsed: your brain will know within a syllable whether you heard a WH-question, a yes/no, or a statement.

The WH-Anchor Habit

Lock the question word into working memory within the first two syllables. Where → location. When → time. Who → person. How → manner or quantity. Why → reason. The question word arrives at the very start and tells you exactly what the response must look like.

Then, as the three options play, run category matches. "Next to the water fountain" — location, matches where, candidate. "By Wednesday" — time, rejects. "I'll send a fax tomorrow" — action, rejects.

This single habit, practiced until automatic, accounts for a large share of Part 2 accuracy gains.

Training on Reductions and Natural Rhythm

TOEIC audio uses natural workplace English, which means reductions: gonna, wanna, didja, whaddya, wouldja, lemme. Test-takers who only studied textbook English often mishear these entirely. "Are you going to finish?" becomes "Ya gonna finish?" "Did you send it?" becomes "Didja send it?"

Expose your ear to natural-speed English daily — podcasts, interviews, workplace YouTube channels — for rhythm, not content. A fast "didja" should tell your ear "past-tense yes/no question" before the verb arrives.

One-Pass Practice, No Replays

Resist the urge to rewind. The real test gives you one pass. If you allow multiple listens in practice, you train a skill the test will never reward. Answer on the first pass, mark uncertain items, and review with the transcript only after the full set is done. This is uncomfortable, and it is the exact pressure you will feel on test day.

Bringing It All Together

Part 2 looks like the simplest section on TOEIC Listening, and that is exactly why it eats points. Twenty-five fast items, no visual support, no replays. The candidates who score well are not the ones who understand every word — they are the ones who:

  • Classify the stimulus (WH, yes/no, choice, tag, statement) in the first two seconds
  • Lock the response category into working memory before the options play
  • Recognize the three distractor families (same-word, similar-sound, wrong-tense) in real time
  • Accept that the correct answer often shares no vocabulary with the stimulus
  • Accept that yes/no questions often have no yes/no answer

These are trained habits, built through volume — hundreds of Part 2 items, listened to once each, with structured review afterward.

On ExamRift, TOEIC Part 2 practice gives you 25-item sets with the full mix of question types, randomized across American, British, Canadian, and Australian accents. Each item comes with a transcript, a classification tag (WH / yes-no / choice / tag / statement), the distractor family for each wrong option, and functional-phrase supplements that expose you to indirect-response patterns. You practice one pass at a time — no replays during the timed set — and only after submission do you get transcripts and explanations, so the training matches test conditions exactly.

Part 2 is a reflex-listening section. Build the reflex, and the 25 points stop being a guessing game.


Ready to train your ear on TOEIC Part 2 reflexes? Start practicing on ExamRift and convert one of the test's trickiest sections into one of your most consistent scoring blocks.