Gad, they sure like to say "BM25" over and over again. That's a near worthless a... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		throwaway81523 17 days ago \| parent \| context \| favorite \| on: It's 2026, Just Use Postgres Gad, they sure like to say "BM25" over and over again. That's a near worthless approach to result ranking. Doing any halfway ok job requires much more tuned and/or more powerful approaches.

cpursley 17 days ago | [–]

It's common to do a hybrid of BM25 with other fuzzy search or pgvector.

storus 17 days ago | | [–]

BM25 is quite bad and needs to be retrained for each corpus anew. SPLADEv2 is much better and there are even better sparse embeddings these days.

throwaway7783 17 days ago | [–]

Can you please elaborate why?

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact