Cloudflare R2 Data Catalog: Managed Apache Iceberg tables with zero egress fees

  • I vaguely remember reading comments here that said you can get rate limited on R2 without warning if egress is too high. Was that true and is that still true? What is the limit if so?

    I tried looking for that thread again and I only found the exact opposite comment from the Cloudflare founder:

    >Not abuse. Thanks for being a customer. Bandwidth at scale is effectively free.[0]

    I distinctly remember such a thread though.

    Edit: I did find these, but neither is what I remember:

    https://news.ycombinator.com/item?id=42263554

    https://news.ycombinator.com/item?id=33337183

    [0] https://news.ycombinator.com/item?id=38124676

  • This post also introduces Iceberg pretty nicely. Details on Class A vs Class B operations are here[0].

    What kind of latency/throughput are people getting from R2? Does it benefit from parallelism in the same way S3 does?

    [0]: https://developers.cloudflare.com/r2/pricing/#class-a-operat...

  • Woo, this is cool! I hope they start hosting public datasets like Google does for BigQuery, such as (wink wink) the Hacker News archive.

  • Honestly don't understand how Cloudflare thinks this is a higher priority than versioning, replication of buckets, or even geo distribution of objects.