Crunch compatibility

Supported platforms and file formats for Granica Crunch.

Granica Crunch supports a range of platforms, file formats, and cloud providers. This page details what's compatible today and what's coming soon.

Supported cloud providers

  • Amazon Web Services (AWS) — Amazon S3
  • Google Cloud Platform (GCP) — Google Cloud Storage

Supported file formats

  • Apache Parquet: full support for all Parquet versions and compression codecs (Snappy, gzip, zstd, LZO, LZ4, Brotli)
  • Additional columnar formats coming soon

Runtime Crunch compatibility

Runtime Crunch works with any application or system that writes data through the open-source Apache Parquet writer. This includes:

  • Apache Spark (all versions)
  • Apache Flink
  • Trino / Presto
  • Custom applications using PyArrow, parquet-mr, or other standard Parquet libraries

Background Crunch compatibility

Background Crunch works with specific qualified platforms that support the file swap mechanism:

  • Apache Spark on Amazon EMR, Google Dataproc, and self-managed clusters
  • Trino — query-level compatibility
  • Databricks — support coming soon
  • Apache Hive — metastore-aware swap support
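
A pipeline that wants to gate on the list above could encode it as a small lookup. The following is only a sketch: the platform keys, the status strings, and the function name `background_crunch_status` are hypothetical labels chosen for illustration and are not part of any Granica API. The statuses themselves are transcribed from the list above.

```python
# Statuses transcribed from the Background Crunch list on this page.
# The keys are hypothetical identifiers, not names used by Crunch itself.
BACKGROUND_CRUNCH_STATUS = {
    "spark-emr": "supported",
    "spark-dataproc": "supported",
    "spark-self-managed": "supported",
    "trino": "query-level compatibility",
    "hive": "metastore-aware swap support",
    "databricks": "coming soon",
}


def background_crunch_status(platform):
    """Return the documented status for a platform key, or "not listed"."""
    key = platform.strip().lower()
    return BACKGROUND_CRUNCH_STATUS.get(key, "not listed")


if __name__ == "__main__":
    for name in ("spark-emr", "Databricks", "snowflake"):
        print(name, "->", background_crunch_status(name))
```

Returning "not listed" rather than raising keeps the check safe to call for platforms this page does not mention.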

Table formats

  • Apache Iceberg — Full support
  • Delta Lake — Full support
  • Apache Hive — Full support

Crunch maintains full data integrity and format compliance. Crunched files remain valid Parquet files that any standards-compliant reader can process without modification.
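
As a quick sanity check of the framing that every unencrypted Parquet file shares, the sketch below uses only the Python standard library to confirm that a file starts and ends with the `PAR1` magic bytes and declares a plausible footer length. The helper name `looks_like_parquet` is hypothetical and not part of Crunch. The check is far weaker than opening the file with a real Parquet reader, and it does not cover files written with an encrypted footer.

```python
import os
import struct

PARQUET_MAGIC = b"PAR1"  # 4-byte marker at both ends of an unencrypted Parquet file


def looks_like_parquet(path):
    """Return True if the file has Parquet's magic bytes and a sane footer length.

    Hypothetical helper for illustration; it does not parse the footer metadata.
    """
    size = os.path.getsize(path)
    # Smallest possible layout: leading magic + 4-byte footer length + trailing magic.
    if size < 12:
        return False
    with open(path, "rb") as f:
        head = f.read(4)
        f.seek(-8, os.SEEK_END)
        tail = f.read(8)
    # The last 8 bytes are a little-endian footer length followed by the magic.
    (footer_len,) = struct.unpack("<I", tail[:4])
    return (
        head == PARQUET_MAGIC
        and tail[4:] == PARQUET_MAGIC
        and 0 < footer_len <= size - 12
    )
```

Running the same check on a file before and after it is crunched is one inexpensive way to spot a truncated or partially written object before handing it to a full reader.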
