congoe 6 hours ago

Very excited about this! I used CedarDB a while back for my bioinformatics research when Clickhouse started running into OOMs, and the difference was astonishing (Some details: The main query was a join of 100 million rows against 30.000 rows, then aggregating statistics for the entire table. Clickhouse used up well over 64 GiB of RAM + 100 GiB of Swap and then crashed, while Cedar used only 15 GiBs and finished quickly).

Ever since attending the advanced database lecture of Thomas Neumann at TU Munich, who wrote Umbra, the academic predecessor of CedarDB, I've been looking forward to everyone getting their hands on it!

I believe Cedar will enable tons of projects and companies to simplify their data stack because it can basically handle everything at once (until reaching very significant scale).