/knowledge/recommender-systems
Recommender Systems
The quiet algorithms that decide what you see next — the films, the products, the posts. The core idea is elegant, the maths borrows straight from dimensionality reduction, and the failure modes shape what billions of people pay attention to.
- Studied
- Recommender SystemsAdvanced · ranking & personalisation
- When
- ML coursework
- Applied in
- Ranking & prioritisation
- Read / Refreshed
- ~15 min read2026-06-26
A surprising amount of modern life is shaped by recommender systems: the films a streaming service surfaces, the products a shop suggests, the posts a feed shows you next. They're the engines of personalisation, and the idea at their core is genuinely elegant — predict how much a particular person will like a particular thing, from the patterns of what everyone has liked before, and rank accordingly.
It's worth knowing properly for two reasons. The mechanics tie beautifully back to dimensionality reduction — the central technique is the same latent-factor idea — and the failure modes (filter bubbles, feedback loops) are some of the most socially consequential in all of applied ML. This page is the landscape: the two great strategies, the matrix-factorisation engine, and what goes wrong.
01
Predicting what you'll like
Frame the problem as a giant, mostly-empty table: rows are users, columns are items, and each cell is how much that user likes that item — a rating, a click, a purchase. The catch is that the table is overwhelmingly sparse: any one person has interacted with a tiny fraction of all items. The recommender's job is to fill in the blanks — predict the missing cells — and then recommend the items it predicts you'll rate highest.
There are two fundamentally different ways to do that, and the difference is where they look for signal: at the items themselves, or at the crowd of other users.
02
Content-based filtering: more like what you liked
Content-based filtering looks at the items. It builds a profile of what you like from the features of things you've liked before, then recommends items with similar features. Liked several hard sci-fi films? Here's another tagged sci-fi. The item features can be explicit (genre, author, price) or learned text embeddings of descriptions, and "similar" is the same vector-distance idea from the NLP page.
Its strengths and weaknesses are two sides of one coin: it can recommend brand-new items the moment they're catalogued (it only needs their features), and it never needs other users — but it's trapped in your existing tastes. It can only ever suggest more of the same, never the delightful out-of-left-field find. For that, you need the crowd.
03
Collaborative filtering: the wisdom of the crowd
Collaborative filtering ignores item features entirely and uses only the pattern of interactions — the wisdom of the crowd. The intuition: "people who agreed with you in the past will agree with you in the future." Two flavours:
- User-user — find people whose tastes resemble yours, and recommend what they liked that you haven't seen.
- Item-item — find items that tend to be liked by the same people, and recommend items similar (in that co-liking sense) to ones you rated highly. ("Customers who bought this also bought…") This is what powers most large-scale systems, because item-item relationships are more stable than user tastes.
The magic of collaborative filtering is that it needs no knowledge of what the items actually are — only who interacted with what. That's also its weakness, which the next section's technique addresses, and the cold-start problem after it exposes.
04
Matrix factorisation: the latent-factor engine
The breakthrough that powered modern collaborative filtering — and famously won the Netflix Prize — is matrix factorisation. The idea: approximate the giant sparse user-item rating matrix as the product of two much smaller, dense matrices:
Each user becomes a short vector of latent factors, and so does each item. A predicted rating is just the dot product of a user's vector and an item's vector. The factors are learned, not labelled — but they often correspond to interpretable dimensions ("how much sci-fi", "how light-hearted"), and a user's score on a factor times an item's score on the same factor captures their match.
This is the same low-rank, latent-dimension idea as PCA — compressing a huge matrix into a few meaningful dimensions — which is why the linear-algebra foundations pay off here directly. It also gracefully handles sparsity: you only fit on the cells you do observe, and the factorisation generalises to the rest.
05
The cold-start problem
Collaborative filtering's great weakness is the cold-start problem: it needs interaction history, and a brand-new user or item has none. You can't recommend to someone you know nothing about, and you can't surface a just-added item nobody has touched. It's the chicken-and-egg at the heart of every new recommender.
The standard fix is to go hybrid: lean on content-based methods (which only need features) for new users and items, then shift to collaborative filtering as interaction history accumulates. Most production systems are hybrids precisely for this reason — each method covers the other's blind spot.
06
Measuring success: it's about ranking
Evaluating recommenders is subtler than a single accuracy number, because what matters is the order of what you show, not a precise rating. The metrics are ranking-focused: precision@k (of the top k recommendations, how many were relevant?) and NDCG (which also rewards putting the best items highest). You care about the top of the list — nobody scrolls to recommendation 200.
07
Bubbles & feedback loops
Recommenders don't just predict behaviour — they shape it, and that creates problems bigger than any accuracy metric:
- Popularity bias — the crowd's favourites get recommended most, so they get interacted with most, so they get recommended even more. The rich get richer, and niche items stay invisible.
- Filter bubbles / echo chambers — by showing you ever-more of what you already engage with, the system narrows your world, which is benign for films and corrosive for news and opinion.
- Feedback loops — the model's recommendations become the data it next trains on, so it learns from its own influence and can spiral. It's the reward-shaping problem in another guise: optimise raw engagement and you may amplify exactly the sensational or addictive content that maximises clicks, regardless of whether it's good for anyone.
08
Where it shows up in my work
09
Refresh in 60 seconds
The content-vs-collaborative split, matrix-factorisation/cold-start framing, and filter-bubble / feedback-loop cautions reflect current recommender-systems references alongside coursework.