GH-533: Add ALP (Adaptive Lossless floating-Point) encoding specification by prtkgaur · Pull Request #557 · apache/parquet-format

prtkgaur · 2026-03-11T04:03:57Z

Add the encoding specification for ALP (encoding value 10) to Encodings.md. ALP compresses FLOAT and DOUBLE columns by converting values to integers via decimal scaling, then applying Frame of Reference encoding and bit-packing. Values that cannot be losslessly round-tripped are stored as exceptions.

Closes [Proposal] Add ALP encoding support in parquet file format #533

See rendered preview here: https://github.com/prtkgaur/parquet-format/blob/alpEncoding/Encodings.md#adaptive-lossless-floating-point-alp--10

The spec covers:

Page layout: 7-byte header, offset array, compressed vectors
Vector format: AlpInfo, ForInfo, packed values, exception data
Encoding math: two-step multiplication for cross-language consistency
Parameter selection, exception detection, and decoding steps

Based on the paper "ALP: Adaptive Lossless floating-Point Compression" (Afroozeh and Boncz, SIGMOD 2024). Wire format matches the C++ Arrow and Java parquet-java implementations.

Rationale for this change

What changes are included in this PR?

Do these changes have PoC implementations?

Add the encoding specification for ALP (encoding value 10) to Encodings.md. ALP compresses FLOAT and DOUBLE columns by converting values to integers via decimal scaling, then applying Frame of Reference encoding and bit-packing. Values that cannot be losslessly round-tripped are stored as exceptions. The spec covers: - Page layout: 7-byte header, offset array, compressed vectors - Vector format: AlpInfo, ForInfo, packed values, exception data - Encoding math: two-step multiplication for cross-language consistency - Parameter selection, exception detection, and decoding steps Based on the paper "ALP: Adaptive Lossless floating-Point Compression" (Afroozeh and Boncz, SIGMOD 2024). Wire format matches the C++ Arrow and Java parquet-java implementations.

alamb changed the title ~~Add ALP (Adaptive Lossless floating-Point) encoding specification~~ GH-533: Add ALP (Adaptive Lossless floating-Point) encoding specification Mar 11, 2026

This was referenced Mar 11, 2026

GH-533: Adaptive Lossless Floating-Point (ALP) Encoding #548

Closed

[WIp] Alp encoding support apache/arrow-rs#9372

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GH-533: Add ALP (Adaptive Lossless floating-Point) encoding specification#557

GH-533: Add ALP (Adaptive Lossless floating-Point) encoding specification#557
prtkgaur wants to merge 1 commit intoapache:masterfrom
prtkgaur:alpEncoding

prtkgaur commented Mar 11, 2026 •

edited by alamb

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

prtkgaur commented Mar 11, 2026 • edited by alamb Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rationale for this change

What changes are included in this PR?

Do these changes have PoC implementations?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

prtkgaur commented Mar 11, 2026 •

edited by alamb

Loading