Home/Cohorts
Beginner guide

Cohort analysis, without the jargon

Cohort analysis answers one practical question: after people arrive, do they come back, buy again, and pay back the cost of acquiring them? This page teaches the concept from zero, using the same kind of retention table teams use in product analytics, CRM, fintech, and e-commerce.

The idea in one sentence

A cohort is a group of people who start from the same point. Cohort analysis follows that group over time instead of mixing everyone into one average.

A cohort is a group, not a metric.

Start with a school-class analogy

Imagine every student who started first grade in September 2020. That class is a cohort. You can check how many reached grade 5, grade 9, or graduation. In a product, the same logic applies: everyone who signed up in March, placed a first order in April, or installed the app during one campaign can be followed month after month.

Why averages are dangerous

Averages mix old loyal customers with new fragile customers. Revenue can grow while new users are getting worse, simply because you are buying more traffic. Cohorts separate the story: January users behaved like this, February users behaved like that, paid-search users behaved differently from referrals.

The five words you need before reading the table

Cohort

The starting group. Example: users who made their first purchase in March.

Birth event

The event that puts someone into the cohort: signup, install, first purchase, loan approval, subscription start.

Cohort age

How much time has passed since the birth event. M0 is the starting month, M1 is one month later.

Return event

The action that proves the user came back. It should create value: purchase, transfer, lesson completed, booking, paid renewal.

Retention

The share of the original cohort that did the return event in a later period.

Revenue cohort

The same table, but with money instead of activity: revenue, gross margin, LTV, payback.

The simplest formula

If 1,000 users signed up in January and 420 of them made a purchase in February, January M1 purchase retention is 42%.

M1 retention = 420 returning users / 1,000 users in the original cohort = 42%

Read the cohort table slowly

The table below is fictional, but the shape is real. Each row is a monthly cohort. Each column is the age of that cohort. Empty cells on the right do not mean zero; they mean that not enough time has passed yet.

CohortSizeMonths since signup
M0M1M2M3M4M5M6
Jan1,200100%62%48%42%39%37%36%
Feb1,450100%64%50%44%40%38%
Mar1,380100%66%52%46%42%
Apr1,620100%68%54%48%
May1,750100%70%56%
Jun1,900100%71%
Healthy pattern: M1 retention is around 65-70%, and the oldest cohort still has meaningful activity at M6. The curve falls at first, then flattens. That plateau is the base of loyal users.

How to read it

1

Read one row left to right

This shows one cohort aging over time. January starts at 100%, then you see how much of January survives after one, two, three months.

2

Read one column top to bottom

This compares cohorts at the same age. If March M1 is worse than January M1, something changed in acquisition, onboarding, product quality, pricing, or seasonality.

3

Do not compare old age with young age

January M6 and June M1 answer different questions. Compare M1 with M1, M3 with M3, and so on.

4

Remember the blank triangle

Newer cohorts have not lived long enough to have M4, M5, or M6 data. A blank cell is unknown, not bad.

The same table as a curve

If you average each column, you get a retention curve. A healthy curve drops early and then flattens. A weak curve keeps falling toward zero.

0%25%50%75%100%M0M1M2M3M4M5M6
Average retention across available cohorts in the table above

Cohort, segment, funnel: do not mix them up

These tools answer different questions. Teams get confused when they use the word "cohort" for every group of users.

ToolWhat it groupsQuestionExample
SegmentUsers with a shared traitWho are they?Users in Almaty on iOS
FunnelA sequence of stepsWhere do they drop?Visit -> signup -> purchase
CohortA group followed through timeDo they stay or pay back?March first-purchase users after 3 months

Retention is not only activity. It is also money.

A channel can look cheap on CAC and still be bad if customers do not repeat, do not repay, or churn before payback. That is why mature teams build both retention cohorts and revenue cohorts.

ChannelCACM1 gross marginM3 cumulative marginPayback
Discount TikTok$18$7$10No payback
SEO$45$12$58Month 3
Referral$28$20$76Month 2

How to build the first cohort report

Do not start with a huge dashboard. Start with one business question and one clean table.

  1. Pick the question: Are new users returning? Are paid users profitable? Which channel creates repeat buyers?
  2. Pick the birth event. Use a real start point: first purchase, account opened, subscription started. Avoid vague events like page view.
  3. Pick the cohort period. For consumer apps use days or weeks; for e-commerce and subscriptions use months.
  4. Pick the return or value event. For fintech it might be transfer completed; for e-commerce, second order; for EdTech, lesson completed.
  5. Add source and product cuts only after the base table works: channel, campaign, city, platform, first category, language.
  6. Turn the table into a decision: cut a channel, repair onboarding, change CRM timing, fix product friction, or keep scaling.

Local examples for KZ and CIS teams

Fintech

Do not stop at account opening. Build cohorts by first approved product, then track first transaction, repeat transaction, card funding, loan repayment, and margin.

E-commerce

Black Friday and Kaspi-style promos can create huge first-order cohorts with weak repeat behavior. Separate discount cohorts from normal-price cohorts.

Delivery and marketplaces

Track the second and third order. A cheap first order can be noise if people only came for a promo code.

EdTech

A signup cohort is too weak. Track payment, first lesson, first homework, week-two activity, completion, refund, and referral.

Common beginner mistakes

Using "app open" as the return event

Opening the app is often too weak. Use the action that proves value: transfer, order, booking, lesson, renewal.

Changing definitions every month

If January cohort uses signup and February cohort uses first purchase, the comparison is broken.

Reading tiny cohorts too confidently

A 40-person cohort can move by 10-20 percentage points because of a few people. Pool small cohorts or mark the result as directional.

Mixing acquisition quality with product quality

A bad cohort may come from worse traffic, a broken onboarding flow, seasonality, pricing, support delays, or a product change. Segment before blaming one team.

Looking only at retention, not margin

A cohort that returns because of heavy discounts can look active and still lose money.

Treating incomplete cells as zero

The newest cohorts have not had time to reach M4 or M6. Leave those cells blank until the period exists.

Watch a beginner video

The page above is enough to understand the mechanics. The video is useful if you want to see the same idea explained verbally before you build a table yourself.

Open video on YouTube

FAQ

Is cohort analysis only for product teams?

No. Marketing uses cohorts to judge channel quality, CRM uses cohorts to plan retention, finance uses revenue cohorts for payback, and product uses cohorts to find activation problems.

What period should I use: day, week, or month?

Use the natural rhythm of the product. Daily works for mobile apps and games. Weekly works for content, marketplaces, and onboarding. Monthly works for e-commerce, subscriptions, and banking products.

How many users do I need?

There is no universal minimum, but very small cohorts are noisy. If one user changes the result by several percentage points, pool cohorts or treat the result as a signal, not proof.

Can GA4, Amplitude, or Mixpanel build this?

Yes. GA4 has Cohort exploration, and product analytics tools such as Amplitude and Mixpanel are built around event-based retention analysis. The hard part is not the tool; it is choosing clean events.

Useful references

Need a cohort table tied to real CRM revenue, ad spend, and product events? Get in touch.