How reads work

Understand how reads remain unchanged.

Crunch is not in the read path, it only works in the background. Crunch simply swaps the original files with smaller, optimized versions. The underlying mechanism of the swap operation is customizable and our product experts can advise you based on your specific environment. Once the data is swapped, your Spark and Trino-based applications will then begin reading the newly reduced (i.e. ingested) files normally. Because the physical files are now smaller you may even see your query speeds and data loading speeds increase, dependent on your query patterns.

See also