Postgres Analytics Accelerator (PGAA) v1.6

Postgres Analytics Accelerator (PGAA) is a high-performance extension that enables Postgres to query large-scale data stored in open table and file formats such as Delta Lake, Apache Iceberg, and Parquet. By offloading heavy analytical queries to a vectorized execution engine, PGAA bridges the gap between operational databases and data lakes.

Get started

  • Compatibility: Check supported PostgreSQL versions, operating systems, and other requirements.

  • Architecture: Understand the core architecture and how the vectorized engine works.

  • Core concepts: Learn the fundamental principles of vectorized execution, data lake integration, and DirectScan.

  • Quickstart guide: Install PGAA, create a storage location, and read a table from our sample benchmark datasets.

Using PGAA

Performance & optimization

Reference

  • Configuration parameters: The behavior of the PGAA extension is governed by Grand Unified Configuration (GUC) variables. These parameters allow you to switch executors, enable performance optimizations, and manage security credentials.

  • Functions: PGAA introduces a suite of SQL functions for administrative tasks, such as mapping new tables, monitoring storage health, and launching maintenance background jobs.

  • Table options: When mapping or creating analytical tables, specific options allow you to define how data is read from or written to your object store.

  • Data types: PGAA maps native Postgres data types to optimized columnar formats in the data lake.

  • Datasets: Access pre-configured schemas and data loading instructions for analytical datasets that you can use to establish a performance baseline.