framework · metrics
AARRR framework for growth: the quantitative mechanics PMs need to know
Best for: Growth PM interviews, root-cause analysis questions, metrics deep-dives, and diagnosing a flat or declining growth number
AARRR is only useful when you go beneath the stage labels into the quantitative mechanics that reveal whether growth is structurally healthy or acquisition-dependent. That distinction decides interviews: a PM who recites five stages and generic metrics clears a definitions bar; a PM who connects the growth accounting identity to the retention stage and cites a Quick Ratio benchmark clears the growth-PM bar.
The growth accounting identity
The foundational equation is:
MAU(t) = retained(t) + new(t) + resurrected(t)
As a delta:
delta MAU = new + resurrected - churned
This decomposition, popularized by Facebook’s growth team around 2012, shows that headline growth is the net of three separate levers. A product growing 12% month-over-month could be doing it with 90% gross retention (healthy, compounding) or 40% gross retention flooded by new-user acquisition (fragile, spend-dependent). Same headline, completely different business. Amplitude’s canonical App A vs. App B example makes this concrete: App A has a Quick Ratio of 1 to 1.5 and 40% gross retention; App B has a Quick Ratio above 4 and 90% gross retention. Both report 12% monthly growth. App B survives a 30% acquisition budget cut. App A collapses.
The Quick Ratio
Quick Ratio = (new users + resurrected users) / churned users
Benchmarks by product category:
- Above 4: strong for consumer mobile apps. Growth is not dependent on acquisition alone.
- 2 to 4: solid for B2B SaaS, where churn is structurally lower and sales cycles longer.
- Below 1: the product is shrinking even with positive new-user inflow. No acquisition spend fixes this.
Resurrection rate is the underused component. A resurrection rate above 15% (resurrected users divided by previously churned users) for a consumer app signals strong brand recall or a product-market fit recovery worth investigating. Dormant users returning unprompted are giving you a signal most PMs miss.
The five stages as diagnostic buckets
The stages matter as buckets for locating the leak, not as an ordered checklist to optimize simultaneously. Every product has one binding constraint at any given time.
Acquisition. New users entering the funnel. Track by channel: CAC, organic vs. paid split, first-touch attribution. In 2026, the cost of building has collapsed (feasibility is free), but user acquisition cost is still real and rising in competitive AI categories. Organic and viral acquisition are durable; paid is a faucet that stops when the budget does.
Activation. New users reaching a moment where the product delivers genuine value. The activation event is product-specific. Facebook’s canonical example: 7 friends in 10 days, Chamath Palihapitiya’s metric, predicted long-term retention better than any other early behavioral signal. If fewer than 20% of signups reach your defined activation event, the funnel is broken before it starts.
Retention. Activated users returning over time. This is where the growth accounting identity lives. Track D1, D7, D30. A D30 curve that flattens rather than continuing to slope downward is the leading indicator of product-market fit. If D30 is still declining at day 60, no acquisition budget solves it.
Revenue. The business model working at user scale. LTV:CAC above 3:1 is the standard viability floor. Below 1:1, unit economics are broken. For freemium: free-to-paid conversion rate and time-to-conversion. For B2B: expansion revenue within accounts as a proxy for product-led referral.
Referral. Users sourcing other users. Viral coefficient K = (average invites sent per user) x (invite conversion rate). K above 1 means exponential self-sustaining growth; K below 1 means acquisition spend must permanently supplement. Most consumer products run between 0.2 and 0.7. Anything above 0.5 paired with healthy D30 retention is a meaningful compounding signal.
Worked RCA: DAU is flat, where is the leak?
This is the interview scenario where AARRR pays off if you know the mechanics.
Step 1: decompose the delta. Pull new, retained, and resurrected counts for the trailing 30 days. If delta MAU is flat, at least one of three things is true: new users are down, churned users are up, or both. Identify which before doing anything else.
Step 2: isolate by stage. If new users are healthy but MAU is flat, churn is absorbing the inflow. Go to the retention stage. If new users are declining and churn is stable, the acquisition or activation funnel is breaking. Segment by channel or cohort to find where the drop-off started.
Step 3: cohort the retention problem. A product-wide churn number masks cohort-level signals. A recent cohort churning faster than older cohorts indicates a product change (new onboarding, feature removal, UX regression). A cross-cohort churn increase indicates an external factor (seasonality, competitor, pricing change).
Step 4: name the lever and the guardrail. The interviewer wants a hypothesis and a test. “I believe activation is the constraint because D1 retention dropped three points in the last two cohorts. I would A/B test a guided first session against the current blank canvas. My success metric is D7 retention on the treatment cohort, not D1, because D1 can be inflated by novelty.”
AARRR vs. RARRA: the decision rule
RARRA reorders the stages to: Retention, Activation, Referral, Revenue, Acquisition. It is the correct ordering when retention is not yet solved. The logic: fixing a leaky bucket before filling it is more capital-efficient than the reverse.
Use AARRR framing when the product has solved retention (D30 is healthy, curve has flattened) and the question is how to grow the top of the funnel or optimize revenue.
Use RARRA framing when the product is early, retention is unclear or declining, or the question is root-cause analysis on a flat or declining growth number. In practice, most growth questions in PM interviews are RARRA-shaped problems presented as AARRR questions. Naming this distinction signals you know when a framework applies, not just how it works.
AARRR vs. growth loops
AARRR is a linear funnel. It shows you the stages and the drop-off between them. Growth loops (referral feeding acquisition, usage data improving the model improving retention) are circular and compounding. The funnel framing is the right diagnostic tool; the loop framing is the right architectural goal.
In a strong interview answer, use AARRR to find the leak, then shift to loop framing to explain what durably healthy growth looks like: “I’d use AARRR to identify that activation is the bottleneck. Once activation is fixed, I’d ask whether the product has a self-reinforcing loop. Does activation lead to user-generated content, which drives organic acquisition, which feeds more data into the model? If yes, improving activation compounds. If no, growth is linear and acquisition cost is a permanent tax.”
The 2026 reframe: activation is now the hardest stage
In 2026, feasibility is free. Any team can ship a working AI product in a weekend. User acquisition cost is still real, but the harder constraint is activation: users now encounter 40-plus AI tools and churn from any given one within the first session if the value is not immediate and consequential.
For AI and agent products, activation means something structurally different. The aha moment is not completing an onboarding checklist; it is the user trusting the agent with a consequential task and having that task completed well. That moment is qualitative and often delayed. A user who tries a coding assistant, gets a mediocre first completion, and closes the tab has technically “activated” by the old definition but has not reached the moment that predicts retention.
Retention in AI products follows the same logic. D30 curve flattening is still the target, but the mechanism is habit formation around a workflow, not feature engagement. An async agent running in the background has no explicit session to measure. Retention becomes: did the agent complete tasks in the trailing 30 days? Was the user in the loop? Did output quality hold? The SQL model for this is event logs keyed on task completion, not session starts.
Interviewers at AI-native companies are now explicitly asking which AARRR stage fails first for agent products. The answer is almost always Activation (users try the agent, do not give it a consequential enough first task) or Retention (users are impressed by session one but do not build a workflow around it). Candidates who invoke the viable/lovable distinction add a layer: the product may be technically impressive and even usable, but if it does not fit into where and how users actually work, Retention will not hold regardless of how strong the demo looked.
What separates a strong answer from a great one
Strong: knows the Quick Ratio formula, can locate the bottleneck stage, names product-specific metrics rather than generic ones.
Great: connects the growth accounting identity to the Quick Ratio, identifies whether the product is in an AARRR or RARRA phase, acknowledges the funnel vs. loop distinction, and names one limitation of the framework relevant to the product at hand (for example, async agents lack session-level activation events, so standard instrumentation needs rethinking).
The interviewer signal is whether you use the framework as a diagnostic to reach a judgment, or recite it as a definition. AARRR is the map. The bottleneck is the destination.
See also: AARRR pirate metrics for stage-by-stage metrics and interview delivery, North Star Metric for anchoring AARRR to the business goal, and growth PM role for how growth-focused companies evaluate this skillset.