framework · metrics
Fishbone diagram: how to use it in a PM interview
Best for: RCA questions where the cause is cross-functional, gradual, or politically contested
The fishbone diagram is a divergent tool. Its job is to expand possible cause space before you commit to a branch, not to find the root cause itself. Kaoru Ishikawa developed it at Kawasaki shipyards in 1943 (first published 1968), and the original six categories, called the 6Ms, were built for manufacturing: Man, Machine, Material, Method, Measurement, Mother Nature. None of those map cleanly to software PM work. Candidates who recite manufacturing 6Ms in a product interview signal that they memorized the label without translating the logic.
The correct pairing is fishbone first, Five Whys second. Fishbone diverges: what might be causing this? Five Whys converges: what is causing this, specifically? Use them in sequence, not as alternatives.
When to reach for fishbone instead of going straight to Five Whys
Fishbone is the right opening move when:
- The drop is cross-functional and no single team clearly owns the failure.
- The metric declined gradually rather than at a single point (no clean incident trace).
- There is political pressure toward one team’s explanation and you need a neutral structure.
- You cannot yet rule out multiple independent contributing factors.
Go straight to Five Whys when: a specific system alert fired, a single deploy is the obvious suspect, or you are in a post-mortem with logs in front of you. Spending time building a fishbone when the cause is already isolated is theater, not analysis.
The six PM-world bone categories
The classic 6Ms need to be replaced with software and product vocabulary. The six bones that cover most PM RCA scenarios:
- Users and behavior: cohort shift, seasonality, traffic source change, notification fatigue, onboarding friction, demand shift.
- Funnel and process: step-level drop, form errors, payment method failure, backend timeout that looks like user abandonment.
- Technology and infrastructure: latency regression, CDN issue, third-party script failure, API error rate spike, mobile OS change.
- Product and design: recent UI deploy, A/B variant leak, mobile viewport bug, copy change, feature flag misconfiguration.
- Data and measurement: tracking breakage, attribution change, timezone artifact, sampling error, definition change.
- External and market: competitor promotion, payment processor outage, app store change, regulatory action, macro seasonality.
In 2026, add a seventh bone for any AI product: Model and AI system. A metric drop can originate from a model version rollout, retrieval corpus staleness, prompt regression after a safety update, eval score decline, or latency SLA breach causing timeouts that register as engagement drops. Candidates who run a six-bone analysis on an AI product are using an older mental model. The correct question to add at the start: “Did any model, prompt, or retrieval configuration change in the 72 hours before the drop?”
How to build the diagram in an interview
You do not need a whiteboard. The verbal protocol is the same as the visual one.
Step 1: Scope the head precisely. The fish head is the problem statement. Scoring starts here. “DAU dropped” is not a head. “iOS DAU dropped 12% week-over-week starting Tuesday; Android and web flat” is a head. Confirm the scope with your interviewer before building anything else.
Step 2: Name all the bones before populating any of them. Say: “Let me map the cause categories first so I’m not anchoring on one team’s explanation.” Then list all six (or seven) bones by name. This signals MECE thinking: each bone is a mutually exclusive cause category, and together they cover the plausible cause space.
Step 3: Populate each bone with two or three specific hypotheses. Be concrete. Not “a bug” under Technology, but “latency regression on the payment API” or “iOS 19 changed how the location dialog fires.”
Step 4: Prioritize one branch to investigate first and say why. The strongest candidates name what data they would pull to validate or falsify the highest-probability branch, and who owns that data.
Worked example: checkout conversion dropped 11%
Head: Mobile web checkout conversion dropped 11% week-over-week starting Wednesday. Desktop flat. No change in traffic volume.
Bones and hypotheses:
- Users and behavior: cohort mix shift toward lower-intent browsers, mobile session length shortened.
- Funnel and process: step-level exit spike at payment form, guest checkout path broken.
- Technology and infrastructure: third-party payment script latency up, CDN miss rate on mobile assets, Wednesday deploy window overlap.
- Product and design: new mobile UI deployed Tuesday night, A/B variant leaked to wrong segment, mobile viewport truncating the CTA.
- Data and measurement: mobile tracking pixel failing on iOS 19 safari, conversion definition changed to exclude partial checkouts.
- External and market: payment processor outage for one card type, competitor launched a promotion Wednesday.
Prioritization: The Wednesday deploy is the fastest hypothesis to validate because there is a clear timestamp and an owner. Pull step-level funnel data to confirm where the exit is, then check deploy logs. If the Technology or Product bone is confirmed as the location, apply Five Whys from there: why did users drop at the payment form? Because the third-party script is timing out. Why? Because the CDN config change in Tuesday’s deploy increased asset load time past the script timeout threshold. Why did that reach production? No performance budget check in CI.
Strong vs. weak answer
strong
"Before I diagnose, let me scope the head precisely: checkout conversion dropped 11% on mobile web, desktop flat, started Wednesday. I want to be sure we're measuring the same thing before I build a hypothesis. Now let me map the cause categories so I'm not anchoring on one team. I'll build the spine first: Users and Behavior, Funnel and Process, Technology and Infrastructure, Product and Design, Data and Measurement, External and Market. For an AI product I'd add Model and AI System, but this is a checkout flow so six bones cover it. Under each: [names two to three specific hypotheses per bone]. I'd start with Technology and Product because of the Wednesday deploy: that's the fastest to falsify with step-level funnel data and deploy logs. Once I confirm which bone is live, I'd switch to Five Whys to drill the chain. The two-stage move is: fishbone to scope cause space, Five Whys to find the systemic fix."
weak
"DAU dropped, it's probably a backend issue or a recent push. I'd check the deploy log and ask engineering." This fails four ways: it skips problem scoping entirely (12% on iOS only versus all platforms is a different problem); it jumps to a single hypothesis before exploring cause space; it outsources diagnosis to engineering rather than demonstrating PM ownership; and it has no data validation plan across categories. The interviewer stops trusting the candidate's analytical rigor before the second sentence.
The 2026 viability bone
In 2026, a root cause analysis on an AI product sometimes lands here: “the inference cost per session makes this feature structurally unprofitable at current pricing.” That is a viable root cause requiring a business decision, not an engineering ticket. For AI products, add a Viability and Economics branch to your Model and AI System bone: cost-per-query, token budget overage, model pricing tier, unit economics mismatch. A PM who surfaces this in an RCA question at an AI-first company is demonstrating exactly the viable-and-lovable lens that interviewers at Anthropic, OpenAI, and Perplexity are looking for. Feasibility is free in 2026; the interesting root causes are economic and experiential.
Use it, do not recite it
Announcing “I’ll use the fishbone framework” and then listing three hypotheses under one bone is not a fishbone. The value is in populating all the bones before picking one: that is the structural guarantee that you haven’t pre-anchored. If you skip bones because “it’s probably X,” you’ve abandoned the tool’s only purpose. Build the spine, populate it honestly, then prioritize with data.