Lesser known features of ClickHouse

  • Here's one not mentioned in the article (currently an experimental feature gated by a toggle): ClickHouse can consume a Postgres logical replication stream. Instead of paying for Stitch/Fivetran/etc, you can get OLTP->OLAP replication for free, with no additional software. It's not as flexible as the commercial stuff, and has several sharp edges, but still it's very useful and just an amazing thing to be built-in.

  • In clickhouse-local example, you can omit the --structure argument and it will continue to work with automatic schema inference.

  • I recommend to write a follow-up with the topics:

    1. clickhouse-git-import tool, what is it and how to use it. 2. ClickHouse as a monitoring agent with the system.asynchronous_log table. 3. Hosting static MergeTree tables on any web server.

  • We have a use case for wide columnar data, used for mostly performance analytics. There are many types of events that share same columns, but mostly total unique columns are 20x of a typical events columns. My use case is filtering by some boolean logic and aggregating them. I use Elasticsearch with a mapping that does not tokenize text fields, (all strings are keyword type) and it works very well. I can add nodes as I want and adjust shards with ease with spot instances and make it cost effective.

    I have not found a way/tool to replace this. Many of the tools fail at dynamic data with cardinality. Wanted to use Clickhouse like this, adding columns as they are discovered but it did not go well, but it has been a while. Also adding replicas is not as easy as Elasticsearch.

    Does anyone have a similar use case implemented with Clickhouse when data is not known before hand and unfolds as time goes by?

  • A very good article. Can anyone recommend more like this to learn more about clickhouse?

  • Kind of related question: we really like clkckhouse but want a grafana alternative or at least an alternative to the vertamedia adapter. The adapter burns way more resources then clickhouse itself.

  • I don't know what click house is, but that personal site is really awesome.

  • The best feature of Clickhouse is replacing entire data engineering and analytics teams and their “modern data platform” with one person and one application.