Show HN: Infini-News – 1.36B news articles from Common Crawl, queryable in ms
By ruggsea · 2026-07-01 · 5 points · 1 comment
https://cs2.uni-graz.at/blog/infini-news/
Infini-News is ten years of CC-NEWS (the news subset of Common Crawl), cleaned, enriched, and turned into a full-text index, so you can count any keyword or phrase across 1.36B articles in sub-second time (OK, right now it may take a few seconds, depending on circumstances), without downloading anyth…
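
For readers wondering how counting a phrase can avoid scanning the text: the post does not describe how the Infini-News index is built, so the following is only a toy sketch of one common approach, a positional inverted index, in Python. The function names `build_index` and `count_phrase` and the sample documents are made up for illustration and are not part of the project.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercased token to {doc_id: [positions]} (toy whitespace tokenizer)."""
    index = defaultdict(dict)
    for doc_id, text in enumerate(docs):
        for pos, token in enumerate(text.lower().split()):
            index[token].setdefault(doc_id, []).append(pos)
    return index


def count_phrase(index, phrase):
    """Count occurrences of a keyword or multi-word phrase across all documents."""
    tokens = phrase.lower().split()
    if not tokens:
        return 0
    total = 0
    # Start from every position of the first token, then check that each
    # following token appears at the next consecutive position in the same doc.
    for doc_id, starts in index.get(tokens[0], {}).items():
        for start in starts:
            if all(
                start + k in index.get(tok, {}).get(doc_id, ())
                for k, tok in enumerate(tokens[1:], start=1)
            ):
                total += 1
    return total


if __name__ == "__main__":
    # Hypothetical sample documents, not real CC-NEWS data.
    docs = [
        "Climate change dominates the news",
        "News about climate change and climate policy",
        "Sports news",
    ]
    idx = build_index(docs)
    print(count_phrase(idx, "climate change"))
    print(count_phrase(idx, "news"))
```

The index is built once, and each query only touches the posting lists of the tokens in the phrase. At the scale of 1.36B articles a real system would need compressed, disk-backed structures rather than Python dictionaries.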