TOEIC Part 4 Talks: 30 Questions Across Six Workplace Genres That High-Scorers Recognize Instantly

You've handled Part 1's photo descriptions, survived Part 2's rapid question-response exchanges, and tracked the back-and-forth of Part 3 conversations. Then the voice on the recording shifts — one speaker, no interruptions, 30 to 60 seconds of uninterrupted English, and three questions waiting on the page. Welcome to Part 4.

Part 4 is the final section of TOEIC Listening, and it is statistically where perfect-Listening candidates separate from mid-scorers. It is not harder than Part 3 in any particular sentence — vocabulary and grammar stay in the workplace register you have been training for all along. The difficulty is structural. Each talk runs longer than a Part 3 conversation. There is no second speaker to give you a rest beat or repeat key information. Your attention has to hold steady for a full minute at a time, ten times in a row, at the tail end of a 45-minute section when concentration is already thinning.

The good news is that Part 4 is also the most predictable section of the entire TOEIC. Every talk you will ever hear falls into one of six workplace genres, each with a recognizable opening and a predictable question shape. Once you learn to identify the genre in the first ten seconds, the rest of Part 4 becomes an exercise in confirming what you already expect to hear.

The Format at a Glance

Part 4 is laid out on a strict 10 × 3 grid. Ten single-speaker talks, three questions per talk, thirty questions total.

Feature	Part 4 specifics
Number of talks	10
Questions per talk	3
Total questions	30 (questions 71-100)
Talk length	~30-60 seconds
Speakers per talk	1
Audio plays	Once only
Question stems & options	Printed in the test book
Accents	American, British, Canadian, Australian
Graphic-integration items	Yes (tables, maps, schedules — similar to Part 3)

The talk is spoken once. The three questions that follow are then read aloud, but the stems and options are already printed on your page, so you can — and should — be reading ahead while the talk is playing. There is no preparation timer before each talk; the recording rolls straight from the previous item into a brief directional cue, then into the next talk.

Six Workplace Genres You Will Hear

Every Part 4 talk you will ever encounter belongs to one of these six families. Learning their opening cues is the single most efficient thing you can do for this section.

1. Phone and Voicemail Messages

The speaker introduces themselves by name and company, states why they are calling, and often ends with a request or an offer. Typical openers: "Hello, Mr. Lee, this is Thomas from BKS Auto Shop calling with some information about your car repair…" or "Hi, Sandra, it's Michael from Finance — just following up on the invoice you sent over…"

Expected question shape:

Why is the speaker calling?
What does the speaker say about [the repair / the order / the schedule]?
What does the speaker offer to do / what does the speaker ask the listener to do?

2. Announcements (Workplace, Store, Airport, Stadium)

A single voice addressing a group — employees in a meeting room, shoppers in a department store, passengers at a gate, spectators in an arena. Openers tend to be direct: "Attention all passengers on flight 247 to Toronto…" or "Good morning, everyone. Before we begin this morning's meeting…"

Expected question shape:

Where is the announcement being made?
Who is the intended audience?
What are listeners instructed to do next?

3. Broadcasts and News Reports

Traffic, weather, business headlines, or local-interest stories delivered in a clean broadcast register. Openers frequently name the station and the segment: "Good afternoon, this is Janet Chen with your midday business update on KBRB…" The speaker is a professional presenter, not a character in a workplace scenario.

Expected question shape:

What is the main topic of the broadcast?
What is mentioned about [the company / the event / the weather]?
What does the speaker recommend listeners do?

4. Advertisements and Commercials

A product or service is pitched with a clear feature list and a call to action. Openers hook fast: "Tired of spending hours every week on your lawn? Let GreenLeaf Care take care of everything for you…" The speaker is a copywriter's voice — warm, persuasive, slightly elevated.

Expected question shape:

What is being advertised?
What feature does the speaker emphasize?
What are listeners encouraged to do?

5. Tours and Orientation Scripts

A guide walking a group through a location or a new-hire onboarding session. Openers orient the audience: "Welcome to the Riverside Heritage Museum. Before we begin today's tour…" or "I'd like to welcome all of our new associates. Over the next hour I'll walk you through…"

Expected question shape:

Where is the tour taking place?
What will the group do next?
What does the speaker remind or advise listeners about?

6. Podcasts and Interview Excerpts

A host introducing a guest, or a guest speaking about their area of expertise. These skew slightly longer and more content-dense. Openers usually include a guest introduction: "My guest today is Dr. Amelia Okafor, who has spent the past decade studying workplace productivity…"

Expected question shape:

Who is the guest / what is their area of expertise?
What is the main topic being discussed?
What does the speaker / guest claim or recommend?

How Genre Recognition Saves Time

The moment you identify which of the six genres you are in, you activate a set of expectations about what information will matter. In a phone message you start listening for the caller's name and company and the reason for the call. In an advertisement you listen for the product and the call to action. In a broadcast you listen for the topic and the affected parties.

Consider the Thomas-from-BKS-Auto-Shop example. The first fifteen words — "Hello Mr. Lee, this is Thomas from BKS Auto Shop calling with some information about your car repair" — tell you four things before the content even begins:

It is a phone message (genre: voicemail)
The speaker is Thomas at BKS Auto Shop (speaker identification)
The listener is Mr. Lee (listener identification)
The topic is a car repair (main purpose)

You already have a strong hypothesis about the three questions you are about to face: something about the repair status, something about when Mr. Lee can pick up the car, and something about what Thomas is offering or asking. Sure enough, the actual questions in this ETS sample are:

Q71 — What does the speaker say about the repair? Correct answer: It has been finished early.
Q72 — When can the listener pick up his car? Correct answer: Tomorrow.
Q73 — What does the speaker offer to do? Correct answer: Arrange a ride.

The question shapes are exactly the voicemail-genre defaults. Your job in the remaining 45 seconds of audio was just to fill in the specific details.

This is the mental shift that raises a 420 Listening score to a 480+: you stop listening as a passive receiver and start listening as a checklist-runner who already knows roughly what questions are coming.

Question Types You Will See in Part 4

Although the genres differ, the question types themselves are a small, stable set. They mirror Part 3 with one or two Part-4-specific tilts.

Main Topic / Purpose

"What is the purpose of the announcement?" / "What is the broadcast mainly about?" / "Why is the speaker calling?"

The answer is almost always established in the first two sentences of the talk. If you missed the opening, you have forfeited this question.

Detail

"What does the speaker say about [a specific thing]?" / "According to the speaker, what will happen on Monday?"

The audio states the fact explicitly; your job is to hold the detail long enough to match it to one of the four options. Detail questions are the most common type in Part 4 and reward good notetaking.

Speaker / Listener Identification

"Who most likely is the speaker?" / "Who is the intended audience?" / "Where is the announcement being made?"

These are inference-light questions that ask you to classify the situation. Genre recognition does most of the work: once you know you are in a store announcement, the speaker is probably a manager and the listeners are probably shoppers or staff.

Action, Offer, and Recommendation

"What does the speaker offer to do?" / "What does the speaker recommend?" / "What are listeners asked to do?"

These almost always point at the final 10-15 seconds of the talk. Closing sentences like "If you have any questions, please call the front desk…" or "We apologize and will issue a refund to your account…" are the answer zone.

Inference and Implication

"What can be inferred about the company?" / "What does the speaker imply about the product?"

These are less common in Part 4 than in Part 3 but do appear, especially in podcast and broadcast genres. The inference required is always modest — one small logical step from a stated fact.

Graphic Integration

"Look at the graphic. Which location will the listeners visit next?"

Part 4 includes a handful of graphic-integration items, usually in tour, announcement, or orientation genres. The audio gives you one piece of information (a time, a feature, a name) and you combine it with the graphic (a schedule, map, or table) to identify the correct cell.

Sustained Attention for 30-60 Seconds

Part 4 talks run roughly twice as long as Part 3 conversations, with no second speaker to give you a rest beat. That extra length is the single biggest reason candidates lose points here. Attention drifts, a ten-second segment gets processed as background noise, and the detail question for that segment becomes a guess.

Three habits help you hold attention across the full talk.

Read the three questions before the audio starts. Between items you have a short window while the directions are being read. Skim the next block of three questions and absorb the general shape. "Something about a repair, something about timing, something the speaker offers." That's enough to prime your attention.

Anchor each question to a talk region. Main-topic questions point at the opening. Detail questions point at the middle. Action or offer questions point at the closing. If you know you are looking for an offer, you can let your attention relax slightly in the middle and sharpen again as the talk winds down.

Treat the first ten seconds as sacred. The genre, speaker, listener, and purpose are almost always established in the opening. If you are still recovering from the previous item when the new talk begins, you will spend the rest of it working uphill.

A practical drill: listen to Part 4 samples with the transcript hidden and state aloud the genre and three likely question types within fifteen seconds of the talk beginning. Then check your predictions against the actual questions on the page.

Common Distractors in Part 4

ETS builds distractors out of the same small set of patterns. Knowing the patterns lets you eliminate wrong answers mechanically.

Wrong-listener / wrong-speaker swap. The talk mentions both speaker and listener actions; a wrong option attaches the speaker's action to the listener or vice versa. If the question asks "What does the speaker offer to do?", the right answer is something the speaker will do, not something the listener is asked to do.

Time-frame swap. Many talks mention both a past event and a future plan ("we apologize for last week's delay" and "your package will arrive on Thursday"). Distractors trap on the wrong time frame.

Scope confusion. In broadcasts and podcasts especially, the speaker states a general industry fact alongside a specific fact about one company. A wrong option applies the general fact to the specific entity (or the reverse).

Surface-word lure. A distractor repeats a striking content word from the audio but attaches it to the wrong action or agent. Match the proposition, not the words.

Plausible-but-unsaid. On inference items, a distractor describes something realistic and consistent with the talk but never actually stated. TOEIC inference stays close to the stated facts; if the option requires extra context, it is almost certainly wrong.

A Compact Part 4 Playbook

Put everything above together and a Part 4 run looks like this:

During the between-item window, skim the three printed question stems.
In the first 10 seconds of the talk, identify the genre, the speaker, the listener, and the purpose.
Through the middle of the talk, log concrete details — names, dates, numbers, actions — in your mental scratchpad.
In the final 10 seconds, sharpen for the action, offer, or recommendation that usually closes the talk.
As the three questions are read aloud, match each to its expected talk region (opening / middle / closing) and commit to your answers.
Move on immediately. If a question feels unresolved, mark a best guess and reset your attention — the next talk begins within seconds, and the opening of that talk is more valuable than another moment of regret.

Part 4 rewards candidates who come in with a mental template and spend the audio confirming details, not candidates who listen from a blank slate and try to reconstruct the meaning afterward. The template comes from practice: enough repetitions that the six genres feel like old friends, and the question shapes arrive before the stems do.

When that moment clicks, the final 30 questions of TOEIC Listening stop feeling like a marathon and start feeling like a structured walkthrough you have done a dozen times before. That is the state 480+ scorers are in when they put their pencils down.

Ready to drill Part 4 by genre, with full transcripts, study notes, and AI-scored practice across all six workplace talk types? ExamRift offers full TOEIC L&R mock exams with genre-tagged Part 4 items, so you can practice phone messages, announcements, broadcasts, advertisements, tours, and podcasts as separate skill tracks — and track your accuracy across main-topic, detail, and action questions until Part 4 becomes your strongest section.