<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[PeerDB Blog]]></title><description><![CDATA[At PeerDB, we are building a fast, simple and the most cost effective way to stream data from Postgres to Data Warehouses, Queues and Storage engines.]]></description><link>https://blog.peerdb.io</link><image><url>https://cdn.hashnode.com/res/hashnode/image/upload/v1689020945974/Qh_S844-Q.png</url><title>PeerDB Blog</title><link>https://blog.peerdb.io</link></image><generator>RSS for Node</generator><lastBuildDate>Sun, 12 Apr 2026 05:03:43 GMT</lastBuildDate><atom:link href="https://blog.peerdb.io/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><item><title><![CDATA[Postgres CDC connector for ClickPipes is now in Private Preview]]></title><description><![CDATA[Today, we’re excited to announce the private preview of the Postgres Change Data Capture (CDC) connector in ClickPipes! This enables customers to replicate their Postgres databases to ClickHouse Cloud in just a few clicks and leverage ClickHouse for ...]]></description><link>https://blog.peerdb.io/postgres-cdc-connector-for-clickpipes-is-now-in-private-preview</link><guid isPermaLink="true">https://blog.peerdb.io/postgres-cdc-connector-for-clickpipes-is-now-in-private-preview</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[ClickHouse]]></category><category><![CDATA[streaming]]></category><category><![CDATA[replication]]></category><category><![CDATA[migration]]></category><category><![CDATA[Databases]]></category><category><![CDATA[Open Source]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Mon, 25 Nov 2024 15:49:22 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1732548180251/b709b7ed-6935-456b-a800-c665124d4785.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Today, we’re excited to announce the private preview of the Postgres Change Data Capture (CDC) connector in ClickPipes! This enables customers to replicate their Postgres databases to ClickHouse Cloud in just a few clicks and leverage ClickHouse for blazing-fast analytics. You can use this connector for both continuous replication and one-time migrations use cases from Postgres.</p>
<p>The experience is natively integrated into ClickHouse Cloud through ClickPipes, the integration engine designed to simplify moving massive volumes of data to ClickHouse. This eliminates the need for external ETL tools, which are often expensive, slow, and don’t scale for Postgres.</p>
<p><strong>👉You can sign up to the private preview by following this</strong> <a target="_blank" href="https://clickhouse.com/cloud/clickpipes/postgres-cdc-connector"><strong>link</strong></a><strong>.</strong></p>
<p>Just a reminder, ClickHouse <a target="_blank" href="https://clickhouse.com/blog/clickhouse-welcomes-peerdb-adding-the-fastest-postgres-cdc-to-the-fastest-olap-database">joined forces</a> with  PeerDB, a leading Change Data Capture (CDC) provider for Postgres, a few months ago. PeerDB already supports multiple enterprise-grade workloads and has helped replicate petabytes of data from Postgres to ClickHouse. Over the past few months, the team has worked hard to natively integrate PeerDB into ClickHouse Cloud. This announcement marks the first release of this integration, enabling users to seamlessly move data from Postgres to ClickHouse.</p>
<p>The Postgres CDC connector was built in close collaboration with several customers and design partners who are already running production-grade workloads. Here are a few customer testimonials:</p>
<p><em>“PeerDB has been a game-changer for us, effortlessly migrating tens of terabytes from our Postgres warehouse into ClickHouse and keeping millions of daily orders synced with just seconds of latency. We're really excited about PeerDB's native integration into ClickHouse Cloud via ClickPipes and all of the opportunities it opens up for us.” -</em> <strong><em>SpotOn</em></strong></p>
<p><em>“We already reduced our Postgres to ClickHouse snapshot times from 10+ hours down to 15 minutes with PeerDB. Combining ClickHouse’s powerful analytics natively with PeerDB’s real-time data capture capabilities will greatly simplify our data processing workflows. This integration will enable us to build analytical applications faster, giving us a competitive edge in the market.”</em> <strong><em>- Vueling</em></strong></p>
<p>Without further ado, here is a demo of the Postgres CDC connector in ClickPipes:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.youtube.com/watch?v=fHuFSmafYUo">https://www.youtube.com/watch?v=fHuFSmafYUo</a></div>
<p> </p>
<h2 id="heading-postgres-clickhouse-a-powerful-data-stack">Postgres + ClickHouse, a powerful data stack</h2>
<p>Using ClickHouse and PostgreSQL through a seamless CDC integration creates a powerful data stack by combining PostgreSQL's robust transactional capabilities with ClickHouse's high-performance analytics. CDC ensures real-time synchronization, allowing ClickHouse to handle fast queries on massive datasets without burdening PostgreSQL. This integration delivers real-time insights and scalable analytics, making it an ideal solution for modern, data-driven workflows. Below are a few main advantages of this architecture:</p>
<ol>
<li><p><strong>Full workload isolation:</strong> You can continue building your OLTP application on Postgres and your OLAP application on ClickHouse, with complete workload isolation—analytics will not affect your transactional workload.</p>
</li>
<li><p><strong>No compromises on features:</strong> It also allows you to build your applications using the full capabilities and features (e.g., SQL coverage, performance, etc.) of both Postgres and ClickHouse, each optimized for a specific workload.</p>
</li>
</ol>
<p>We believe customers derive the most value in solving real-world data problems by leveraging purpose-built databases like Postgres and ClickHouse as they were designed, with full flexibility, rather than relying on alternatives that retrofit one database engine into another, compromising the full feature set of each. We are observing a clear <a target="_blank" href="https://x.com/kiwicopple/status/1851638636590035054">trend</a> towards the Postgres + ClickHouse architecture among real-world customers.</p>
<h2 id="heading-key-benefits">Key Benefits</h2>
<p>The Postgres CDC connector in ClickPipes is purpose-built for Postgres and ClickHouse, ensuring a fast, simple, and cost-effective replication experience. Here are some key benefits for customers:</p>
<h3 id="heading-blazing-fast-performance">Blazing Fast Performance</h3>
<p>With features like parallel snapshotting, you can achieve 10x faster initial loads, transferring terabytes of data in hours instead of days, and experience replication latency as low as a few seconds for continuous replication (CDC).</p>
<h3 id="heading-super-simple">Super Simple</h3>
<p>You can start replicating your Postgres databases to ClickHouse in just a few clicks and minutes. Simply add your Postgres database as a source, select the specific tables/columns you want to replicate, and you're ready to go.</p>
<h3 id="heading-postgres-and-clickhouse-native-features">Postgres and ClickHouse native features</h3>
<p>This connector supports native Postgres features such as replication of schema changes, partitioned tables, built-in monitoring and alerting for replication slot size, and support for complex data types such as JSONB and ARRAYs, among others.</p>
<p>On the ClickHouse side, it supports features such as selecting specialized table engines, configuring custom order keys, choosing nullable columns, and so on during the replication process.</p>
<h3 id="heading-enterprise-grade-security">Enterprise-grade security</h3>
<p>At ClickHouse, security is a top priority, even before performance and features. We’ve extended the same level of security to the Postgres CDC connector in ClickPipes. It includes features such as SSH tunneling and Private Link to securely connect to your Postgres databases. Data in transit is fully encrypted using SSL.</p>
<h3 id="heading-no-vendor-lock-in">No vendor lock-in</h3>
<p>The Postgres CDC connector is powered by PeerDB, which is fully open source (<a target="_blank" href="https://github.com/PeerDB-io/peerdb/">https://github.com/PeerDB-io/peerdb/</a>). With the exception of the UI, we have ensured that all components are directly extended from the PeerDB open-source project. This underscores our commitment to open source and ensures there is no vendor lock-in for our customers.</p>
<h2 id="heading-how-to-sign-up-for-private-preview">How to sign up for Private Preview?</h2>
<p><a target="_blank" href="https://clickhouse.com/cloud/clickpipes/postgres-cdc-connector"><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXc5_VSt7G0koZg4z_phXYKqAGDfD3ZXmilHT8FeDX4JT_ifOLC6o4XashmFynYUIAJ92KQy1B5tEQs9Wlmg3ErYpUM3713dkHGPxYLn5KdhFBycHD1m9N0u4nRL-uUC6Lr6oM9w?key=qnroIGmQjh8ZytQ0G3msRaWj" alt /></a></p>
<p>You can sign up for the private preview by filling out the form on <a target="_blank" href="https://clickhouse.com/cloud/clickpipes/postgres-cdc-connector">this page</a>. Our team will reach out to you within a day and closely collaborate with you to provide early access. The Private Preview is completely free of charge. This is a great opportunity for you to get firsthand experience with the native Postgres integration in ClickHouse Cloud and directly influence the roadmap. Looking forward to having you onboard!</p>
]]></content:encoded></item><item><title><![CDATA[Postgres to ClickHouse: Data Modeling Tips]]></title><description><![CDATA[Last month, we acquired PeerDB, a company that specializes in Postgres CDC. PeerDB makes it fast and simple to replicate data from Postgres to ClickHouse. A common question from PeerDB users is how to model their data in ClickHouse after the replicat...]]></description><link>https://blog.peerdb.io/postgres-to-clickhouse-data-modeling-tips</link><guid isPermaLink="true">https://blog.peerdb.io/postgres-to-clickhouse-data-modeling-tips</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[ClickHouse]]></category><category><![CDATA[data-modeling]]></category><category><![CDATA[replication]]></category><category><![CDATA[Databases]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Wed, 28 Aug 2024 15:32:53 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1724803156847/66f1221b-7730-4fe0-a109-2b14c4fe23fe.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Last month, we <a target="_blank" href="https://clickhouse.com/blog/clickhouse-welcomes-peerdb-adding-the-fastest-postgres-cdc-to-the-fastest-olap-database">acquired PeerDB</a>, a company that specializes in Postgres CDC. <a target="_blank" href="https://www.peerdb.io/">PeerDB</a> makes it fast and simple to replicate data from <a target="_blank" href="https://www.postgresql.org/">Postgres</a> to <a target="_blank" href="https://clickhouse.com/">ClickHouse</a>. A common question from PeerDB users is how to model their data in ClickHouse after the replication process to maximize the benefits of ClickHouse.</p>
<p>This question arises because ClickHouse and Postgres differ in data modeling, as each is a <strong>purpose-built database</strong> highly optimized for its specific workload: Postgres is a transactional (OLTP) database, while ClickHouse is an analytical (OLAP) columnar database. This guide walks you through essential data modeling concepts in ClickHouse for users coming from the Postgres world. Note that this is part 1 of a blog series, with more to come in the future.</p>
<h2 id="heading-replacingmergetree-table-engine">ReplacingMergeTree table engine</h2>
<p>PeerDB maps PostgreSQL tables to ClickHouse using the <a target="_blank" href="https://clickhouse.com/docs/en/engines/table-engines/mergetree-family/replacingmergetree">ReplacingMergeTree</a> engine. ClickHouse performs best with append-only workloads and <a target="_blank" href="https://clickhouse.com/docs/en/guides/developer/mutations">does not recommend</a> frequent UPDATEs. This is where ReplacingMergeTree is particularly powerful.</p>
<p><code>ReplacingMergeTree</code> supports workloads that involve both data ingestion and modifications. Each table is append-only, with user updates ingested as versioned INSERTs. The ReplacingMergeTree engine manages deduplication (merging) of rows in the background. This is one of the key factors that enables ClickHouse to deliver exceptional real-time ingestion performance.</p>
<p>In PeerDB, both INSERTs and UPDATEs from Postgres are captured as new rows with different versions (using <code>_peerdb_version</code>) in ClickHouse. The <code>ReplacingMergeTree</code> table engine periodically handles deduplication in the background using the Ordering Key (ORDER BY columns), retaining only the row with the latest <code>_peerdb_version</code>. DELETEs from PostgreSQL are propagated as new rows marked as deleted (using the <code>_peerdb_is_deleted</code> column). The snippet below shows the target table definition for the <code>public_goals</code> table in ClickHouse.</p>
<pre><code class="lang-sql">clickhouse-cloud :) <span class="hljs-keyword">SHOW</span> <span class="hljs-keyword">CREATE</span> <span class="hljs-keyword">TABLE</span> public_goals;
<span class="hljs-keyword">CREATE</span> <span class="hljs-keyword">TABLE</span> peerdb.public_goals
(
    <span class="hljs-string">`id`</span> Int64,
    <span class="hljs-string">`owned_user_id`</span> <span class="hljs-keyword">String</span>,
    <span class="hljs-string">`goal_title`</span> <span class="hljs-keyword">String</span>,
    <span class="hljs-string">`goal_data`</span> <span class="hljs-keyword">String</span>,
    <span class="hljs-string">`enabled`</span> <span class="hljs-built_in">Bool</span>,
    <span class="hljs-string">`ts`</span> DateTime64(<span class="hljs-number">6</span>),
    <span class="hljs-string">`_peerdb_synced_at`</span> DateTime64(<span class="hljs-number">9</span>) <span class="hljs-keyword">DEFAULT</span> <span class="hljs-keyword">now</span>(),
    <span class="hljs-string">`_peerdb_is_deleted`</span> <span class="hljs-built_in">Int8</span>,
    <span class="hljs-string">`_peerdb_version`</span> Int64
)
<span class="hljs-keyword">ENGINE</span> = SharedReplacingMergeTree
(<span class="hljs-string">'/clickhouse/tables/{uuid}/{shard}'</span>, <span class="hljs-string">'{replica}'</span>, _peerdb_version)
PRIMARY <span class="hljs-keyword">KEY</span> <span class="hljs-keyword">id</span>
<span class="hljs-keyword">ORDER</span> <span class="hljs-keyword">BY</span> <span class="hljs-keyword">id</span>
<span class="hljs-keyword">SETTINGS</span> index_granularity = <span class="hljs-number">8192</span>
</code></pre>
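<p>For illustration, until a background merge happens, a plain query for a single key can return several versions of the same row. The sketch below assumes a hypothetical row with <code>id = 42</code> that was inserted and then updated once; the values are made up:</p>
<pre><code class="lang-sql">SELECT id, goal_title, enabled, _peerdb_version, _peerdb_is_deleted
FROM peerdb.public_goals
WHERE id = 42
ORDER BY _peerdb_version;

-- id | goal_title | enabled | _peerdb_version | _peerdb_is_deleted
-- 42 | Run 5k     | true    |               1 |                  0
-- 42 | Run 10k    | true    |               2 |                  0
</code></pre>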
<h2 id="heading-you-might-still-see-duplicates-for-rowshow-should-you-handle-them">You might still see duplicates for rows—how should you handle them?</h2>
<p>ReplacingMergeTree clears out duplicates asynchronously in the background but doesn't guarantee the absence of duplicates. So, when you query the data, you might still see duplicates for the same row or primary key but with different versions. This is expected. To remove duplicates, you have a couple of approaches:</p>
<h3 id="heading-use-final-in-your-queries">Use FINAL in your queries</h3>
<p>ClickHouse has a unique modifier called <a target="_blank" href="https://clickhouse.com/docs/en/sql-reference/statements/select/from#final-modifier">FINAL</a>, which performs de-duplication (merging of rows) at query time. This de-duplication occurs after filtering (WHERE clause) but before aggregations (GROUP BY).</p>
<p>A historical concern has been that FINAL can slow down query performance. While it does impact query performance to some extent, recent releases of ClickHouse have introduced <a target="_blank" href="https://github.com/ClickHouse/ClickHouse/issues/11722">significant improvements</a> to enhance FINAL query performance. So, don’t hesitate to use the FINAL clause and evaluate how your queries perform. Below is an example of how to use the FINAL clause:</p>
<pre><code class="lang-sql"><span class="hljs-keyword">SELECT</span> owner_user_id, <span class="hljs-keyword">COUNT</span>(*) <span class="hljs-keyword">FROM</span> goals <span class="hljs-keyword">FINAL</span> 
<span class="hljs-keyword">WHERE</span> enabled = <span class="hljs-literal">true</span> <span class="hljs-keyword">GROUP</span> <span class="hljs-keyword">BY</span> owner_user_id;
</code></pre>
<h3 id="heading-use-argmax-to-deduplicate-rows-at-query-time">Use argMax to deduplicate rows at query time</h3>
<p>In ClickHouse, <a target="_blank" href="https://clickhouse.com/docs/en/sql-reference/aggregate-functions/reference/argmax">argMax</a> is a powerful function for deduplicating rows dynamically during query execution. This is particularly useful when you need to retain the most recent or relevant record based on a versioning or timestamp column.</p>
<p>For instance, if you're working with a table like <code>peerdb.public_goals</code>, where <code>id</code> is the primary key and <code>_peerdb_version</code> tracks versions, you can use argMax to select the row with the highest <code>_peerdb_version</code> for each <code>id</code>. This approach allows you to efficiently remove duplicates without altering the underlying data. You can then run your aggregations as a subquery over this deduplicated result set for further analysis. The query below is an example of using argMax:</p>
<pre><code class="lang-sql"><span class="hljs-keyword">SELECT</span>
    owned_user_id,
    <span class="hljs-keyword">COUNT</span>(*) <span class="hljs-keyword">AS</span> active_goals_count,
    <span class="hljs-keyword">MAX</span>(ts) <span class="hljs-keyword">AS</span> latest_goal_time
<span class="hljs-keyword">FROM</span>
(
    <span class="hljs-keyword">SELECT</span>
        <span class="hljs-keyword">id</span>,
        argMax(owned_user_id, _peerdb_version) <span class="hljs-keyword">AS</span> owned_user_id,
        argMax(goal_title, _peerdb_version) <span class="hljs-keyword">AS</span> goal_title,
        argMax(goal_data, _peerdb_version) <span class="hljs-keyword">AS</span> goal_data,
        argMax(enabled, _peerdb_version) <span class="hljs-keyword">AS</span> enabled,
        argMax(ts, _peerdb_version) <span class="hljs-keyword">AS</span> ts,
        argMax(_peerdb_synced_at, _peerdb_version) <span class="hljs-keyword">AS</span> _peerdb_synced_at,
        argMax(_peerdb_is_deleted, _peerdb_version) <span class="hljs-keyword">AS</span> _peerdb_is_deleted,
        <span class="hljs-keyword">max</span>(_peerdb_version) <span class="hljs-keyword">AS</span> _peerdb_version
    <span class="hljs-keyword">FROM</span> peerdb.public_goals
    <span class="hljs-keyword">WHERE</span> enabled = <span class="hljs-literal">true</span>
    <span class="hljs-keyword">GROUP</span> <span class="hljs-keyword">BY</span> <span class="hljs-keyword">id</span>
) <span class="hljs-keyword">AS</span> deduplicated_goals
<span class="hljs-keyword">GROUP</span> <span class="hljs-keyword">BY</span> owned_user_id;
</code></pre>
<h3 id="heading-use-window-functions">Use WINDOW FUNCTIONS</h3>
<p>You can use ClickHouse's <a target="_blank" href="https://clickhouse.com/docs/en/sql-reference/window-functions">window functions</a> to achieve similar deduplication by selecting the row with the highest <code>_peerdb_version</code> within each id partition. Here's an example:</p>
<pre><code class="lang-sql"><span class="hljs-keyword">SELECT</span>
    owned_user_id,
    <span class="hljs-keyword">COUNT</span>(*) <span class="hljs-keyword">AS</span> active_goals_count,
    <span class="hljs-keyword">MAX</span>(ts) <span class="hljs-keyword">AS</span> latest_goal_time
<span class="hljs-keyword">FROM</span>
(
    <span class="hljs-keyword">SELECT</span>
        *,
        ROW_NUMBER() <span class="hljs-keyword">OVER</span> (<span class="hljs-keyword">PARTITION</span> <span class="hljs-keyword">BY</span> <span class="hljs-keyword">id</span> <span class="hljs-keyword">ORDER</span> <span class="hljs-keyword">BY</span> _peerdb_version <span class="hljs-keyword">DESC</span>) <span class="hljs-keyword">AS</span> rn
    <span class="hljs-keyword">FROM</span> peerdb.public_goals
    <span class="hljs-keyword">WHERE</span> enabled = <span class="hljs-literal">true</span>
) <span class="hljs-keyword">AS</span> ranked_goals
<span class="hljs-keyword">WHERE</span> rn = <span class="hljs-number">1</span>
<span class="hljs-keyword">GROUP</span> <span class="hljs-keyword">BY</span> owned_user_id;
</code></pre>
<h3 id="heading-use-views-to-simplify-deduplication">Use Views to simplify deduplication</h3>
<p>Encapsulate deduplication in a <a target="_blank" href="https://clickhouse.com/docs/en/sql-reference/statements/create/view">view</a> to make it simple for BI tools to query the most up-to-date data. For example, use a window function in the view to keep only the latest version of each row:</p>
<pre><code class="lang-sql"><span class="hljs-keyword">CREATE</span> <span class="hljs-keyword">VIEW</span> goals <span class="hljs-keyword">AS</span>
<span class="hljs-keyword">SELECT</span> * <span class="hljs-keyword">FROM</span>
(
    <span class="hljs-keyword">SELECT</span>
        *,
        ROW_NUMBER() <span class="hljs-keyword">OVER</span> (<span class="hljs-keyword">PARTITION</span> <span class="hljs-keyword">BY</span> <span class="hljs-keyword">id</span> <span class="hljs-keyword">ORDER</span> <span class="hljs-keyword">BY</span> _peerdb_version <span class="hljs-keyword">DESC</span>) <span class="hljs-keyword">AS</span> rn
    <span class="hljs-keyword">FROM</span> peerdb.public_goals
    <span class="hljs-keyword">WHERE</span> enabled = <span class="hljs-literal">true</span>
) <span class="hljs-keyword">WHERE</span> rn = <span class="hljs-number">1</span>;
</code></pre>
<pre><code class="lang-sql">
<span class="hljs-keyword">SELECT</span>
    owned_user_id,
    <span class="hljs-keyword">COUNT</span>(*) <span class="hljs-keyword">AS</span> active_goals_count,
    <span class="hljs-keyword">MAX</span>(ts) <span class="hljs-keyword">AS</span> latest_goal_time
<span class="hljs-keyword">FROM</span> goals
<span class="hljs-keyword">GROUP</span> <span class="hljs-keyword">BY</span> owned_user_id;
</code></pre>
<h2 id="heading-nullable-columns">Nullable Columns</h2>
<p>If you're coming from the Postgres world, one surprising aspect of ClickHouse is that it doesn’t store NULL values for columns unless you explicitly wrap the column types in <a target="_blank" href="https://clickhouse.com/docs/en/sql-reference/data-types/nullable"><code>Nullable</code></a>. For example, instead of storing NULL for dates, ClickHouse stores <code>1970-01-01</code> as the default value, which might be unexpected. This behavior is due to the fact that storing NULLs can <a target="_blank" href="https://clickhouse.com/docs/en/sql-reference/data-types/nullable">impact</a> query performance in ClickHouse, as it’s a columnar database. Hence, ClickHouse requires users to explicitly define <code>Nullable</code> types.</p>
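<p>A minimal sketch of the difference (the table and column names here are hypothetical):</p>
<pre><code class="lang-sql">CREATE TABLE events_plain    (id Int64, closed_at DateTime)           ENGINE = MergeTree ORDER BY id;
CREATE TABLE events_nullable (id Int64, closed_at Nullable(DateTime)) ENGINE = MergeTree ORDER BY id;

-- Omitting closed_at on insert stores the type's default in the plain table,
-- but a real NULL in the Nullable one.
INSERT INTO events_plain    (id) VALUES (1); -- closed_at = 1970-01-01 00:00:00
INSERT INTO events_nullable (id) VALUES (1); -- closed_at = NULL
</code></pre>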
<p>In PeerDB, we’ve introduced a setting called <code>PEERDB_NULLABLE</code>, which, when set to <code>true</code>, automatically detects nullable columns in Postgres and marks them as <code>Nullable</code> in ClickHouse during the replication process. This means you don’t need to manually define <code>Nullable</code> types during replication. You can read more about this feature in the following <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/2001">PR</a>.</p>
<h2 id="heading-data-types"><strong>Data Types</strong></h2>
<p>ClickHouse offers a wide variety of data types, ranging from numbers, text, timestamps, dates, and arrays to the recently introduced <a target="_blank" href="https://github.com/ClickHouse/ClickHouse/issues/54864">JSON</a> type. Many of the data types in Postgres can be natively stored in ClickHouse without much modification.</p>
<p>As a reference, here is <a target="_blank" href="https://docs.peerdb.io/datatypes/datatype-matrix">the data type matrix</a> we use at PeerDB when replicating data from Postgres to ClickHouse.</p>
<h2 id="heading-the-ordering-key">The Ordering Key</h2>
<h3 id="heading-what-is-an-ordering-key">What is an Ordering Key?</h3>
<p>Choosing the right ordering key is crucial for query performance in ClickHouse. Defined by the <code>ORDER BY</code> clause when creating a table, the ordering key functions similarly to an index in Postgres but is optimized for analytics. Unlike Postgres, which uses a B-tree index with entries pointing to each row, ClickHouse uses Sparse Indexing:</p>
<ol>
<li><p><strong>Data is sorted based on Ordering Key:</strong> The ordering key ensures that data on disk is sorted according to the specified columns. This allows for better <a target="_blank" href="https://clickhouse.com/docs/en/data-compression/compression-in-clickhouse">compression</a>, as correlated values are stored together.</p>
</li>
<li><p><strong>Ordering Key also creates a sparse index:</strong> The ordering key also creates a sparse index, storing only ranges of columns, with each entry pointing to a group of sorted rows. This keeps the index small, allowing ClickHouse to quickly identify relevant groups of rows using a binary search and execute queries efficiently. You can read more about this <a target="_blank" href="https://clickhouse.com/docs/en/migrations/postgresql/designing-schemas#primary-ordering-keys-in-clickhouse">here</a>.</p>
</li>
</ol>
<p>You can think of ordering keys as similar to <a target="_blank" href="https://www.postgresql.org/docs/current/indexes-types.html#INDEXES-TYPES-BRIN">BRIN</a> indexes in Postgres, but in ClickHouse, the data is automatically sorted based on the ordering key via asynchronous merging of parts, so you don’t need to handle sorting during data ingestion.</p>
<h3 id="heading-choosing-an-appropriate-ordering-key">Choosing an appropriate Ordering Key</h3>
<p>When selecting an ordering key, base your choice on the columns most frequently used in your query filters. <strong>Prioritize columns that are commonly used in WHERE clauses, and order them in ascending sequence of cardinality</strong>—starting with columns that have the fewest distinct values. This approach optimizes data compression and query performance. For a deeper understanding of this topic, refer to the detailed guide <a target="_blank" href="https://clickhouse.com/docs/en/data-modeling/schema-design#choosing-an-ordering-key">here</a>.</p>
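<p>As an illustrative sketch (the table and columns here are hypothetical), a table that is mostly filtered by tenant and date could order its key from lowest to highest cardinality:</p>
<pre><code class="lang-sql">CREATE TABLE page_views
(
    tenant_id  UInt32,
    event_date Date,
    user_id    UInt64,
    url        String
)
ENGINE = MergeTree
-- tenant_id has the fewest distinct values, user_id the most
ORDER BY (tenant_id, event_date, user_id);
</code></pre>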
<h3 id="heading-primary-key-vs-ordering-key"><strong>PRIMARY KEY vs Ordering Key</strong></h3>
<p>If you observe the table definition of <code>public_goals</code>, it has a <code>PRIMARY KEY</code>. You might be wondering how the <code>PRIMARY KEY</code> differs from the Ordering Key. Let us understand how they differ:</p>
<ol>
<li><p><code>PRIMARY KEY</code>, if specified, defines the columns in the sparse index, while the columns in the <code>ORDER BY</code> clause determine how the data is sorted on disk. The <code>ORDER BY</code> columns are also used by <code>ReplacingMergeTree</code> for deduplicating data.</p>
</li>
<li><p>If the <code>PRIMARY KEY</code> isn't specified, the Ordering Key automatically becomes the <code>PRIMARY KEY</code> and defines the columns in the sparse index.</p>
</li>
</ol>
<p><strong>NOTE:</strong> Columns in the <code>PRIMARY KEY</code> must always form a prefix of the Ordering Key. This ensures that the index aligns with the physical data order, maximizing query performance by minimizing unnecessary data scans.</p>
<p><strong>An example where</strong> <code>PRIMARY KEY</code> <strong>could differ from Ordering Key</strong></p>
<p>An example where you might have different <code>PRIMARY KEY</code> and <code>ORDER BY</code> columns is when your queries are primarily filtered on <code>customer_id</code> rather than <code>id</code>. In this case, you can define the <code>PRIMARY KEY</code> on just <code>customer_id</code> and the <code>ORDER BY</code> on <code>customer_id, id</code>. This approach ensures a smaller, more efficient sparse index for querying, while data deduplication occurs on <code>id</code>, ensuring no data is lost.</p>
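<p>A minimal sketch of that layout (the table and columns are hypothetical):</p>
<pre><code class="lang-sql">CREATE TABLE public_orders
(
    id              Int64,
    customer_id     Int64,
    amount          Decimal(18, 2),
    _peerdb_version Int64
)
ENGINE = ReplacingMergeTree(_peerdb_version)
-- sparse index on customer_id only; deduplication still happens on (customer_id, id)
PRIMARY KEY customer_id
ORDER BY (customer_id, id);
</code></pre>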
<p><strong>NOTE:</strong> Unlike in Postgres, where the <code>PRIMARY KEY</code> is a B-tree index that guarantees uniqueness, in ClickHouse, it does not ensure uniqueness. Instead, it defines the columns that should be part of the sparse index.</p>
<h3 id="heading-modifying-the-ordering-key">Modifying the Ordering Key</h3>
<p>Choosing the right <a target="_blank" href="https://clickhouse.com/docs/en/migrations/postgresql/designing-schemas#primary-ordering-keys-in-clickhouse">ordering key</a> is crucial for query performance in ClickHouse, as it acts as an index when querying data. By default, PeerDB uses the PostgreSQL <code>PRIMARY KEY</code> to define the ordering key in ClickHouse tables, but you can change it using the following methods:</p>
<h3 id="heading-use-materialized-views">Use materialized views</h3>
<p>You can use materialized views to create a new table with a different ordering key suitable for your workload. Include the primary key columns at the end of the ordering key to ensure proper deduplication, as ReplacingMergeTree uses the ORDER BY clause for deduplication, and including the primary key ensures that no data is lost.</p>
<pre><code class="lang-sql"><span class="hljs-keyword">CREATE</span> <span class="hljs-keyword">MATERIALIZED</span> <span class="hljs-keyword">VIEW</span> goals_mv
<span class="hljs-keyword">ENGINE</span> = ReplacingMergeTree(_peerdb_version)
<span class="hljs-keyword">ORDER</span> <span class="hljs-keyword">BY</span> (enabled, ts, <span class="hljs-keyword">id</span>)  POPULATE <span class="hljs-keyword">AS</span>
<span class="hljs-keyword">SELECT</span> * <span class="hljs-keyword">FROM</span> peerdb.public_goals;
</code></pre>
<p><strong>NOTE:</strong> After creating the materialized view, be sure to follow the steps described in the previous section on handling duplicates to ensure proper deduplication during query time.</p>
<h3 id="heading-predefine-target-tables-with-the-desired-ordering-key">Predefine target tables with the desired Ordering Key</h3>
<p>To change the ordering key, you can predefine new tables with your desired Ordering Key and then swap them with the existing tables. Here's how you can do it:</p>
<p><strong>1. Create a Dummy Mirror:</strong> Create a dummy mirror in PeerDB to generate the default tables with the correct metadata columns and data types.</p>
<p><strong>2. Create a New Table with the Desired Ordering Key:</strong> Use the table created by PeerDB to define a new table with your desired ordering key. Include the primary key columns at the end of the ordering key to ensure proper deduplication. Here is an example:</p>
<pre><code class="lang-sql"><span class="hljs-keyword">CREATE</span> <span class="hljs-keyword">TABLE</span> public_events_new <span class="hljs-keyword">AS</span> public_events
<span class="hljs-keyword">ENGINE</span> = ReplacingMergeTree(_peerdb_version)
<span class="hljs-keyword">ORDER</span> <span class="hljs-keyword">BY</span> (user_id,<span class="hljs-keyword">id</span>);
</code></pre>
<p><strong>3. Drop the Old Table:</strong></p>
<pre><code class="lang-sql"><span class="hljs-keyword">DROP</span> <span class="hljs-keyword">TABLE</span> public_events;
</code></pre>
<p><strong>4. Rename the New Table:</strong> Rename the new table to the original table name.</p>
<pre><code class="lang-sql"><span class="hljs-keyword">RENAME</span> <span class="hljs-keyword">TABLE</span> public_events_new <span class="hljs-keyword">TO</span> public_events;
</code></pre>
<p><strong>5. Start MIRROR to Point to the New Table:</strong> Configure the mirror to point to the actual table. PeerDB uses <code>CREATE TABLE IF NOT EXISTS</code> behind the scenes and continues to ingest data into the new table.</p>
<h2 id="heading-handling-deletes">Handling DELETEs</h2>
<p>As mentioned, DELETEs from PostgreSQL are propagated as new rows marked as deleted (using the <code>_peerdb_is_deleted</code> column). To exclude deleted rows from your queries, you can create row-level policies in ClickHouse based on the <code>_peerdb_is_deleted</code> column. Here’s an example:</p>
<pre><code class="lang-sql"><span class="hljs-keyword">CREATE</span> <span class="hljs-keyword">ROW</span> <span class="hljs-keyword">POLICY</span> policy_name <span class="hljs-keyword">ON</span> table_name
<span class="hljs-keyword">FOR</span> <span class="hljs-keyword">SELECT</span> <span class="hljs-keyword">USING</span> _peerdb_is_deleted = <span class="hljs-number">0</span>;
</code></pre>
<p>This policy ensures that only rows where <code>_peerdb_is_deleted</code> is 0 are visible when querying the table.</p>
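<p>Alternatively, the delete filter can be folded directly into a deduplicating view (similar to the one shown earlier), so BI tools only ever see live, latest-version rows. A sketch, with an arbitrary view name:</p>
<pre><code class="lang-sql">CREATE VIEW goals_live AS
SELECT * FROM
(
    SELECT
        *,
        ROW_NUMBER() OVER (PARTITION BY id ORDER BY _peerdb_version DESC) AS rn
    FROM peerdb.public_goals
) WHERE rn = 1 AND _peerdb_is_deleted = 0;
</code></pre>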
<h2 id="heading-conclusion">Conclusion</h2>
<p>I hope you enjoyed reading the blog. I aimed to cover the most common data-modeling challenges you might encounter when migrating from PostgreSQL to ClickHouse. In the next blog, I plan to dive into more advanced topics, such as joins, writing efficient SQL queries, and so on. If you want to give PeerDB and ClickHouse a try to start replicating data from Postgres to ClickHouse, please check out the links below or reach out to us directly!</p>
<ol>
<li><p><a target="_blank" href="https://clickhouse.com/docs/en/cloud-quick-start">Try ClickHouse Cloud for Free</a></p>
</li>
<li><p><a target="_blank" href="https://auth.peerdb.cloud/signup">Try PeerDB Cloud for Free</a></p>
</li>
<li><p><a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-clickhouse">Docs on Postgres to ClickHouse Replication</a></p>
</li>
<li><p><a target="_blank" href="https://www.peerdb.io/sign-up">Talk to the PeerDB team directly</a></p>
</li>
</ol>
]]></content:encoded></item><item><title><![CDATA[Enhancing Postgres to ClickHouse replication using PeerDB]]></title><description><![CDATA[Providing a fast and simple way to replicate data from Postgres to ClickHouse has been a top priority for us over the past few months. Last month, we acquired PeerDB, a company that specializes in Postgres CDC. We're actively integrating PeerDB into ...]]></description><link>https://blog.peerdb.io/enhancing-postgres-to-clickhouse-replication-using-peerdb</link><guid isPermaLink="true">https://blog.peerdb.io/enhancing-postgres-to-clickhouse-replication-using-peerdb</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[ClickHouse]]></category><category><![CDATA[migration]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[ETL]]></category><category><![CDATA[Databases]]></category><category><![CDATA[change data capture]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Wed, 14 Aug 2024 17:03:22 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1723654703773/5eed5a50-6d06-41c8-8b47-443be68be636.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Providing a fast and simple way to replicate data from <a target="_blank" href="https://www.postgresql.org/">Postgres</a> to ClickHouse has been a top priority for us over the past few months. Last month, we <a target="_blank" href="https://clickhouse.com/blog/clickhouse-welcomes-peerdb-adding-the-fastest-postgres-cdc-to-the-fastest-olap-database">acquired</a> <a target="_blank" href="https://www.peerdb.io/">PeerDB</a>, a company that specializes in Postgres CDC. We're actively integrating PeerDB into <a target="_blank" href="https://clickhouse.com/cloud/clickpipes">ClickPipes</a> to add Postgres as a source connector. Meanwhile, <a target="_blank" href="https://www.peerdb.io/">PeerDB</a> is the recommended solution for moving data from Postgres to ClickHouse.</p>
<p>In the past few months, the PeerDB team had the opportunity to work with multiple ClickHouse customers, helping them replicate billions of rows and terabytes of data from Postgres to ClickHouse. In this blog, we will take a deep dive into some of the top features that were released recently to make the replication experience rock-solid. These features focus on enhancing the speed, stability, and security of replication from Postgres to ClickHouse.</p>
<h2 id="heading-efficiently-flush-the-replication-slot">Efficiently flush the replication slot</h2>
<p>PeerDB uses Postgres Logical Replication Slots to implement Change Data Capture (CDC). Logical Replication Slots provide a stream of INSERTs, UPDATEs, and DELETEs occurring in the Postgres database. It is recommended to <a target="_blank" href="https://blog.peerdb.io/overcoming-pitfalls-of-postgres-logical-decoding#heading-always-consume-the-replication-slot">always consume the replication slot</a>. If the replication slot isn't consumed continuously, WAL files can accumulate, posing a risk of crashing the Postgres database.</p>
<p>To ensure that the logical replication slot is always consumed, we implemented a <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/1780">feature</a> to always read the replication slot and flush the changes to an internal stage (S3). An asynchronous process then consumes the changes from S3 and applies them to ClickHouse. Flushing the changes to the internal stage also ensures that the replication slot is consumed even when the target (ClickHouse) is down.</p>
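<p>If you want to monitor this yourself on the Postgres side, a query along these lines (using the standard <code>pg_replication_slots</code> catalog view) shows how much WAL each logical slot is currently retaining:</p>
<pre><code class="lang-sql">SELECT slot_name,
       active,
       pg_size_pretty(pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn)) AS retained_wal
FROM pg_replication_slots
WHERE slot_type = 'logical';
</code></pre>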
<h2 id="heading-better-memory-handling-on-clickhouse">Better memory handling on ClickHouse</h2>
<p>While replicating data from Postgres to ClickHouse, customers occasionally ran into memory-related issues on ClickHouse. This was more common when customers were on a free trial of ClickHouse and provisioned an instance with fewer resources (RAM and compute). PeerDB writes rows in batches to ClickHouse via <code>INSERT</code> queries and <code>INSERT SELECT</code> queries. We were seeing 2 types of issues:</p>
<ol>
<li><p>Some queries were failing because they were consuming more memory than allocated on the ClickHouse server.</p>
</li>
<li><p>Some queries would be killed by <a target="_blank" href="https://clickhouse.com/docs/en/operations/settings/memory-overcommit">ClickHouse's overcommit tracker.</a></p>
</li>
</ol>
<p>We attempted to thoroughly understand the various <a target="_blank" href="https://clickhouse.com/docs/en/operations/settings/settings">database settings</a> ClickHouse provides that influence memory utilization. Based on this, we modified the following settings (a sketch of how such settings can be applied to a query follows the list):</p>
<ol>
<li><p><a target="_blank" href="https://clickhouse.com/docs/en/operations/settings/settings#setting-max_block_size"><code>max_block_size</code></a>: This is useful for our <code>INSERT SELECT</code> queries, where this setting determines how many blocks are loaded by the <code>SELECT</code> and inserted. We reduced this with the hope that more blocks would reduce memory spikes when our queries are executed.</p>
</li>
<li><p><a target="_blank" href="https://clickhouse.com/docs/en/operations/settings/settings#max_insert_block_size"><code>max_insert_block_size</code></a>: Similar to <code>max_block_size</code> except this applies to our <code>INSERT</code> queries. We reduced this for the same reason as above.</p>
</li>
<li><p><a target="_blank" href="https://clickhouse.com/docs/en/operations/settings/settings#max_threads"><code>max_threads</code></a>: This setting controls the number of threads used for processing queries on ClickHouse. According to the documentation, the lower this number, the less memory is consumed. Therefore, we reduced this parameter.</p>
</li>
<li><p><a target="_blank" href="https://clickhouse.com/docs/en/operations/settings/memory-overcommit#user-overcommit-tracker"><code>memory_overcommit_ratio_denominator</code></a>: This is related to the overcommit tracker mentioned earlier. We disabled this setting for our queries by setting it to 0.</p>
</li>
<li><p><a target="_blank" href="https://clickhouse.com/docs/en/integrations/go#connection-settings-1"><code>dial_timeout</code></a>: Sometimes queries were taking longer than 1 minute, so we <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/1772">increased the <code>dial_timeout</code></a> to a higher value.</p>
</li>
</ol>
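<p>For illustration, settings like these can be applied per session or per query in ClickHouse. The values and the <code>staging_goals</code> table below are placeholders, not the exact numbers PeerDB uses:</p>
<pre><code class="lang-sql">-- Session-level overrides (placeholder values)
SET max_block_size = 16384;
SET max_insert_block_size = 16384;
SET max_threads = 2;
SET memory_overcommit_ratio_denominator = 0;

-- Or scoped to a single INSERT SELECT
INSERT INTO peerdb.public_goals
SELECT * FROM staging_goals
SETTINGS max_insert_block_size = 16384, max_threads = 2;
</code></pre>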
<p>These changes drastically reduced memory-related issues on smaller ClickHouse clusters. We are actively working with the core team to further fine-tune ClickHouse-specific settings. Additionally, we are working on a <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/1770">feature</a> that improves the handling of large datasets by breaking them into manageable parts for more efficient processing and storage.</p>
<h2 id="heading-row-level-transformations">Row-level transformations</h2>
<p>A few months ago, PeerDB shipped <a target="_blank" href="https://blog.peerdb.io/row-level-transformations-in-postgres-cdc-using-lua">Lua-based row-level transformations</a> while replicating data from Postgres to Queues such as Kafka. We have now extended this feature to ClickHouse. With this feature, customers can write simple Lua scripts to perform row-level transformations, enabling use cases such as masking PII data, generating columns, and more. Below is a quick demo of this feature to mask PII columns while replicating data from Postgres to ClickHouse:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/9d966c60f92e4eabb5e09d65fc9a3907?sid=8c9759a7-6d33-4b30-bc19-ecdaea6f6281">https://www.loom.com/share/9d966c60f92e4eabb5e09d65fc9a3907?sid=8c9759a7-6d33-4b30-bc19-ecdaea6f6281</a></div>
<p> </p>
<h2 id="heading-improved-security-on-peerdb-cloud">Improved security on PeerDB Cloud</h2>
<p>At PeerDB, safeguarding data replication from Postgres to ClickHouse is crucial. To enhance security, we have implemented several key measures around AWS S3, which we use for internally staging data before pushing it to ClickHouse.</p>
<h3 id="heading-temporary-credentials-with-iam-roles">Temporary credentials with IAM roles</h3>
<p>One significant enhancement is the use of AWS S3 buckets with strict access controls. Instead of traditional, long-lived user-generated access keys, which pose a higher risk of compromise, we use IAM roles to generate temporary credentials. These credentials are automatically rotated by AWS, ensuring they are always up-to-date and valid for only short periods, thus minimizing the risk of unauthorized access.</p>
<p>Additionally, with the introduction of the AWS_SESSION_TOKEN parameter in ClickHouse version 24.3.1, our security practices have been further strengthened. This update allows the use of short-lived credentials, aligning with our approach to secure data replication.</p>
<h3 id="heading-attribute-based-access-control-abac">Attribute Based Access Control (ABAC)</h3>
<p>In a multi-tenant environment, managing access to S3 buckets poses several challenges, such as ensuring tenant isolation, preventing unauthorized access, and minimizing role proliferation. To address these issues, we employ <strong>Attribute Based Access Control</strong> (ABAC). ABAC allows us to define dynamic, fine-grained access policies based on user roles, resource tags, and environmental variables. This method not only provides enhanced security but also improves scalability by eliminating the need for creating numerous roles. By using ABAC, we ensure that only authorized components can access sensitive data, maintaining a secure and manageable system.</p>
<h2 id="heading-conclusion">Conclusion</h2>
<p>Hope you enjoyed reading the blog. PeerDB has spent multiple cycles hardening the Postgres CDC experience for ClickHouse and now supports multiple customers replicating billions of records in real time from Postgres to ClickHouse. If you want to give PeerDB and ClickHouse a try, please check out the links below or reach out to us directly!</p>
<ol>
<li><p><a target="_blank" href="https://auth.peerdb.cloud/signup">Try PeerDB Cloud for Free</a></p>
</li>
<li><p><a target="_blank" href="https://clickhouse.com/docs/en/cloud-quick-start">Try ClickHouse Cloud for Free</a></p>
</li>
<li><p><a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-clickhouse">Docs on Postgres to ClickHouse Replication</a></p>
</li>
<li><p><a target="_blank" href="https://www.peerdb.io/sign-up">Talk to the PeerDB team directly</a></p>
</li>
</ol>
<h2 id="heading-references">References</h2>
<p>This blog is a replica of the original blog, which can be found <a target="_blank" href="https://clickhouse.com/blog/enhancing-postgres-to-clickhouse-replication-using-peerdb">here</a>.</p>
]]></content:encoded></item><item><title><![CDATA[ClickHouse acquires PeerDB for native Postgres CDC integration]]></title><description><![CDATA[We are thrilled to join forces with ClickHouse to make it seamless for customers to move data from their Postgres databases to ClickHouse and power real-time analytics and data warehousing use cases.
We released the ClickHouse target connector for Po...]]></description><link>https://blog.peerdb.io/clickhouse-acquires-peerdb-for-native-postgres-cdc-integration</link><guid isPermaLink="true">https://blog.peerdb.io/clickhouse-acquires-peerdb-for-native-postgres-cdc-integration</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[ClickHouse]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[acquisition]]></category><category><![CDATA[Databases]]></category><category><![CDATA[postgres]]></category><category><![CDATA[replication]]></category><category><![CDATA[ETL]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Tue, 30 Jul 2024 13:53:42 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1722211525420/3247b7ce-862d-4421-ae67-812b9d52d781.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We are thrilled to join forces with ClickHouse to make it seamless for customers to move data from their Postgres databases to ClickHouse and power real-time analytics and data warehousing use cases.</p>
<p>We <a target="_blank" href="https://blog.peerdb.io/postgres-to-clickhouse-real-time-replication-using-peerdb">released</a> the ClickHouse target connector for Postgres Change Data Capture (CDC) earlier this year. Since then, ClickHouse has become the fastest-growing connector in terms of usage, surpassing other targets such as Snowflake and BigQuery. With this acquisition, we will be powering the Postgres CDC connector for <a target="_blank" href="https://clickhouse.com/cloud/clickpipes">ClickPipes</a>, the native integration engine that helps customers move data into ClickHouse.</p>
<h1 id="heading-our-thesis-behind-the-acquisition">Our thesis behind the acquisition</h1>
<p>A few months ago, when the prospect of an acquisition came up, we debated whether it was the right move for PeerDB. After much consideration, we decided that it was the best move. It all came down to 3 main reasons:</p>
<ol>
<li><p><strong>Amplifying Customer Value -</strong> ClickHouse was the fastest-growing target connector for Postgres CDC. We observed this firsthand at PeerDB, where we helped multiple customers move billions of rows from Postgres to ClickHouse. This aligns with the ClickHouse community's <a target="_blank" href="https://x.com/tbragin/status/1794052668601852068">feedback</a> on Postgres CDC. This acquisition will accelerate PeerDB's reach and make it accessible to thousands of ClickHouse customers, generating significant value.</p>
</li>
<li><p><strong>Postgres and ClickHouse: A Match Made in Heaven -</strong> Postgres is becoming the de facto operational (OLTP) database, while ClickHouse is the fastest analytical database on the planet. Both originate from the same ethos of open source and have a strong presence in the community. We believe that for customers using Postgres as their default OLTP database, ClickHouse is the natural OLAP counterpart. This is already reflected by prominent users like <a target="_blank" href="https://about.gitlab.com/blog/2022/04/29/two-sizes-fit-most-postgresql-and-clickhouse/">GitLab</a>, <a target="_blank" href="https://clickhouse.com/blog/langchain-why-we-choose-clickhouse-to-power-langchain">LangChain</a>, <a target="_blank" href="https://blog.cloudflare.com/http-analytics-for-6m-requests-per-second-using-clickhouse">Cloudflare</a> and <a target="_blank" href="https://tech.instacart.com/real-time-fraud-detection-with-yoda-and-clickhouse-bd08e9dbe3f4">Instacart</a>, who run both databases in conjunction. This acquisition would bridge this gap further, making it easier for customers to use Postgres and ClickHouse together.</p>
</li>
<li><p><strong>We love the ClickHouse team</strong> - We have been closely collaborating with the ClickHouse team for the past several months. Their customer-first approach, strong emphasis on product quality, and growth mindset align with what PeerDB stands for. We felt that if we both joined forces, we could build something magical for customers.</p>
</li>
</ol>
<h1 id="heading-what-does-this-mean-for-existing-and-future-customers">What does this mean for existing and future customers?</h1>
<p>As part of this acquisition, a few important product decisions were made to ensure that the PeerDB community continues to thrive and that existing customers are not affected.</p>
<ol>
<li><p>PeerDB will remain <a target="_blank" href="https://github.com/PeerDB-io/peerdb">free and open</a> under the same ELv2 license.</p>
</li>
<li><p><a target="_blank" href="https://github.com/PeerDB-io/peerdb-enterprise">PeerDB Enterprise</a> offering that comes with production-grade Helm charts is being made free and open under the same ELv2 license. This means anyone can now run production-grade PeerDB workloads in a self-managed way for free!</p>
</li>
<li><p>The end-of-life (EOL) of PeerDB for existing paid customers using non-ClickHouse Cloud connectors will be <strong>one year</strong> from now, i.e. July 30th, 2025. Customers will receive the same support and SLAs as promised in their contracts, and we will assist them with the transition plan.</p>
</li>
<li><p>Until PeerDB is fully integrated into ClickPipes, we will be supporting PeerDB Cloud, the fully managed offering of PeerDB. For new customers from now on, <strong>PeerDB Cloud will only support ClickHouse Cloud as the target connector for Postgres CDC.</strong> If customers want to use other target connectors, they can use the free and open self-managed PeerDB options.</p>
</li>
</ol>
<h2 id="heading-peerdb-cloud-for-blazing-fast-postgres-cdc-to-clickhouse-cloud">PeerDB Cloud for blazing fast Postgres CDC to ClickHouse Cloud</h2>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/3efd88baae4c44c091a4afc9af699f2a">https://www.loom.com/share/3efd88baae4c44c091a4afc9af699f2a</a></div>
<p> </p>
<p>Until PeerDB is fully integrated into ClickPipes, customers can use PeerDB Cloud to replicate data from Postgres to ClickHouse Cloud. Over the last few months, we have dedicated <a target="_blank" href="https://github.com/PeerDB-io/peerdb/releases">multiple cycles</a> to enhancing speed, tightening security, and adding features to provide an enterprise-grade Postgres CDC experience for ClickHouse. You can follow the links below to get started:</p>
<ol>
<li><p><a target="_blank" href="https://app.peerdb.cloud/">Start Trial of PeerDB Cloud</a></p>
</li>
<li><p><a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-clickhouse">Postgres to ClickHouse CDC</a></p>
</li>
</ol>
<h2 id="heading-thank-you">Thank you!</h2>
<p>We would like to thank all of our customers and community for your constant support from our early days. Your trust and feedback have been instrumental in helping PeerDB get to where it is today. We are grateful for everything you've done, and the above product decisions are a small token of our gratitude. If you have any questions, please don't hesitate to reach out to us at <a target="_blank" href="mailto:founders@peerdb.io">founders@peerdb.io</a> or send a direct <a target="_blank" href="https://join.slack.com/t/peerdb-public/shared_invite/zt-1wo9jydev-EXInbMtCtpAKFFWdi7QvLQ">Slack</a> message. Thank you again!</p>
<h2 id="heading-references">References</h2>
<p>For more information on the acquisition, you can follow the below links:</p>
<ol>
<li><p><a target="_blank" href="https://clickhouse.com/blog/clickhouse-welcomes-peerdb-adding-the-fastest-postgres-cdc-to-the-fastest-olap-database">Blog post by ClickHouse</a></p>
</li>
<li><p><a target="_blank" href="https://clickhouse.com/blog/clickhouse-acquires-peerdb-to-boost-real-time-analytics-with-postgres-cdc-integration">Official Press Release</a></p>
</li>
</ol>
]]></content:encoded></item><item><title><![CDATA[PeerDB is now SOC 2 Type 2 Compliant]]></title><description><![CDATA[At PeerDB, security has always been a top priority. Our customers trust us with their critical data, and we are dedicated to upholding the highest standards of data protection and security. We are excited to announce that PeerDB has achieved SOC 2 Ty...]]></description><link>https://blog.peerdb.io/peerdb-is-now-soc-2-type-2-compliant</link><guid isPermaLink="true">https://blog.peerdb.io/peerdb-is-now-soc-2-type-2-compliant</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Security]]></category><category><![CDATA[ETL]]></category><category><![CDATA[Databases]]></category><category><![CDATA[compliance ]]></category><category><![CDATA[SOC 2 Type 2]]></category><category><![CDATA[privacy]]></category><category><![CDATA[Data security]]></category><category><![CDATA[data privacy]]></category><dc:creator><![CDATA[Kunal Gupta]]></dc:creator><pubDate>Wed, 19 Jun 2024 06:15:25 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1718777522218/537b239b-bc00-4f37-bf6b-0aa29a6edece.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>At <a target="_blank" href="https://www.peerdb.io/">PeerDB</a>, security has always been a top priority. Our customers trust us with their critical data, and we are dedicated to upholding the highest standards of data protection and security. We are excited to announce that PeerDB has achieved SOC 2 Type II compliance, demonstrating our unwavering commitment to maintaining a secure and reliable platform. <a target="_blank" href="https://trust.peerdb.io/resources?s=t6nuewj8c7b5p948qzhth&amp;name=soc-2-type-ii">Our SOC 2 report is now available in the Trust Center for viewing</a>.</p>
<h1 id="heading-what-is-soc-2"><strong>What is SOC 2?</strong></h1>
<p>SOC 2, or System and Organization Controls 2, is a framework governed by the American Institute of Certified Public Accountants (AICPA). It is designed to assess the controls and processes involved in storing, processing, and protecting customer data. SOC 2 reports focus on five Trust Services Criteria (TSC): Security, Availability, Processing Integrity, Confidentiality, and Privacy. Every SOC 2 report must cover Security, but organizations can choose to include additional criteria relevant to their operations.</p>
<h1 id="heading-soc-2-type-ii"><strong>SOC 2 Type II</strong></h1>
<p>A <a target="_blank" href="https://trust.peerdb.io/resources?s=t6nuewj8c7b5p948qzhth&amp;name=soc-2-type-ii">SOC 2 Type II report</a> goes beyond evaluating the design and implementation of controls at a single point in time. It assesses the operating effectiveness of these controls over a defined period, typically three months to a year. Achieving SOC 2 Type II compliance means that PeerDB has not only designed appropriate security controls but also maintained their effectiveness over time.</p>
<h1 id="heading-the-journey-to-soc-2-compliance"><strong>The Journey to SOC 2 Compliance</strong></h1>
<p>Our path to SOC 2 compliance was meticulous and comprehensive. Here’s a look at the steps we took:</p>
<ol>
<li><p><strong>Policy Crafting</strong>: Documenting all policies, procedures, and operational controls.</p>
</li>
<li><p><strong>Risk Assessment</strong>: Conducting thorough evaluations of our systems to identify and mitigate potential vulnerabilities.</p>
</li>
<li><p><strong>Vendor Management</strong>: Conducting thorough evaluations of our third-party vendors to ensure they meet our stringent security standards. We partnered with <a target="_blank" href="https://advantage-partners.com/">Advantage Partners</a>, who served as our auditor, to ensure that all vendors were compliant with our security requirements.</p>
</li>
<li><p><strong>Evidence Gathering</strong>: Collecting extensive evidence to demonstrate compliance with required controls. We partnered with <a target="_blank" href="https://www.vanta.com/">Vanta</a> to streamline this process, leveraging their platform to automate evidence collection and monitoring.</p>
</li>
</ol>
<h1 id="heading-why-soc-2-compliance-matters"><strong>Why SOC 2 Compliance Matters?</strong></h1>
<p>Achieving SOC 2 compliance is about more than just meeting regulatory requirements; it’s about building trust with our clients and partners. It underscores our commitment to maintaining the highest level of security and reliability.</p>
<h2 id="heading-benefits-for-our-customers"><strong>Benefits for Our Customers</strong></h2>
<ul>
<li><p><strong>Enhanced Security</strong>: SOC 2 compliance guarantees robust protection for your data, including advanced encryption and strict access controls.</p>
</li>
<li><p><strong>Transparency and Control</strong>: <a target="_blank" href="https://trust.peerdb.io/">Our Trust Center</a> provides detailed information about our security practices, giving you the assurance and control you need over your data.</p>
</li>
<li><p><strong>Ongoing Improvement</strong>: Our dedication to security doesn’t stop here. We continuously evaluate and enhance our measures to stay ahead of emerging threats.</p>
</li>
</ul>
<h1 id="heading-looking-ahead"><strong>Looking Ahead</strong></h1>
<p>While we celebrate this achievement, we are also focused on the future. We will continue to pursue additional certifications and audits to further validate our commitment to security excellence.</p>
<h2 id="heading-empowering-your-business-with-peerdb-cloud"><strong>Empowering Your Business with PeerDB Cloud</strong></h2>
<p><a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a> offers a secure and scalable platform for all your Postgres Data Movement needs. With SOC 2 compliance at its core, <a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a> ensures that your data is protected in a robust and reliable environment. Additionally, <a target="_blank" href="https://blog.peerdb.io/peerdb-is-gdpr-compliant">we are already GDPR compliant</a>, representing cementing our dedication to data protection and privacy and creating a secure and trusted environment for all our clients.<br /><a target="_blank" href="https://docs.peerdb.io/peerdb-cloud/cloud-security">Our docs also summarize our Cloud Security posture</a> and provide a high level overview of what PeerDB Cloud offers in terms of data protection, compliance measures, and security best practices.</p>
<h2 id="heading-trust-and-assurance-at-peerdb">Trust and Assurance at PeerDB</h2>
<p>At PeerDB, we are dedicated to being a reliable partner in your digital journey. For more information on our SOC 2 compliance efforts, or any other security-related inquiries, please visit <a target="_blank" href="https://trust.peerdb.io/">our Trust Center</a>.</p>
<p>Thank you for being part of this journey with us. We look forward to continuing to provide secure and trusted solutions for all your data movement needs.</p>
]]></content:encoded></item><item><title><![CDATA[Overcoming Pitfalls of Postgres Logical Decoding]]></title><description><![CDATA[At PeerDB, we are building a fast and simple way to replicate data from Postgres to data warehouses like Snowflake, ClickHouse etc. and queues such as Kafka, Redpanda etc. We implement Postgres Change Data Capture (CDC) to reliably replicate changes ...]]></description><link>https://blog.peerdb.io/overcoming-pitfalls-of-postgres-logical-decoding</link><guid isPermaLink="true">https://blog.peerdb.io/overcoming-pitfalls-of-postgres-logical-decoding</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[change data capture]]></category><category><![CDATA[replication]]></category><category><![CDATA[ETL]]></category><category><![CDATA[data-engineering]]></category><category><![CDATA[Databases]]></category><category><![CDATA[postgres]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Thu, 13 Jun 2024 20:00:37 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1718058082886/df7594ab-78dd-4c44-9af0-2edafd050f99.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>At <a target="_blank" href="http://peerdb.io">PeerDB</a>, we are building a fast and simple way to replicate data from Postgres to data warehouses like Snowflake, ClickHouse etc. and queues such as Kafka, Redpanda etc. We implement <a target="_blank" href="https://blog.peerdb.io/peerdb-streams-simple-native-postgres-change-data-capture">Postgres Change Data Capture (CDC)</a> to reliably replicate changes from Postgres to other data stores. Postgres <a target="_blank" href="https://www.postgresql.org/docs/current/logicaldecoding-explanation.html">Logical Decoding</a> is a building block of Postgres CDC. It enables users to stream changes on Postgres as a sequence of logical operations like INSERTs, UPDATEs, and DELETEs.</p>
<p><a target="_blank" href="https://www.pgedge.com/blog/logical-replication-evolution-in-chronological-order-clustering-solution-built-around-logical-replication">Logical Decoding</a> has evolved quite a bit in the past few years in Postgres. However, there are a few quirks that users need to overcome. In this blog, we will summarize common issues and learnings from over 20 customers replicating more than 300 TB of data per month with logical decoding.</p>
<h1 id="heading-beware-of-replication-slot-growth-how-to-monitor-it">Beware of replication slot growth – how to monitor it?</h1>
<p>A <a target="_blank" href="https://www.postgresql.org/docs/current/logicaldecoding-explanation.html#LOGICALDECODING-REPLICATION-SLOTS">logical replication slot</a> captures changes in the Postgres Write-Ahead Log (WAL) and streams them in a human-readable format to the client. A common issue with logical decoding is unexpected replication slot growth, which can risk filling up storage and causing server crashes. Slot growth mostly occurs when the consumer application (a.k.a. client) that reads changes from a replication slot lags or halts. The consumer application can lag for various reasons, including not consuming the slot appropriately and the high throughput on the Postgres database, combined with logical decoding being single-threaded. More on this topic in the next section.</p>
<p>You can monitor replication slot growth using the below queries:</p>
<pre><code class="lang-sql"><span class="hljs-comment">/* the below query should always return "true"
indicating slot is always getting consumed. */</span>
<span class="hljs-keyword">SELECT</span> slot_name,active <span class="hljs-keyword">FROM</span> pg_replication_slots ;

<span class="hljs-comment">/* monitor the size of the slot using the below query */</span>
<span class="hljs-keyword">SELECT</span> slot_name, 
 pg_size_pretty(pg_wal_lsn_diff(pg_current_wal_lsn(),restart_lsn)) 
 <span class="hljs-keyword">AS</span> replication_lag_bytes 
 <span class="hljs-keyword">FROM</span> pg_replication_slots;
</code></pre>
<p><strong>NOTE:</strong> Just for reference, we open-sourced the <a target="_blank" href="https://blog.peerdb.io/pg-slot-notify-monitor-postgres-slot-growth-in-slack">PG Slot Notify bot</a>, which helps you monitor replication slot size. <a target="_blank" href="https://github.com/PeerDB-io/pgslot-notify-bot">pgslot-notify-bot</a> helps monitor PostgreSQL replication slots by sending alerts once the size threshold is reached.</p>
<h1 id="heading-tips-for-keeping-replication-slot-growth-in-check">Tips for keeping replication slot growth in check</h1>
<h2 id="heading-always-consume-the-replication-slot">Always consume the replication slot</h2>
<p>Logical decoding is a single-threaded process, whereas Postgres allows multiple concurrent connections/threads to ingest data. This means that if the client doesn't consume the replication slot fast enough, the slot can quickly grow.</p>
<p>The first step toward efficiency is to ensure that the client always consumes the replication slot and maximizes resource utilization. Intermittent reading of the slot with constant disconnections can slow down logical decoding. Periodic reconnections can lead to other inefficiencies, as logical decoding may need to restart from the beginning of the WAL instead of continuing the stream.</p>
<p>At PeerDB, we implemented this optimization. We ensure that the replication slot is always consumed and flushed to <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/1780">internal stages such as S3</a>.</p>
<h2 id="heading-beware-of-long-running-transactions">Beware of long-running transactions</h2>
<p>Long-running transactions can lead to WAL buildup. Since WAL is sequential, Postgres cannot flush the WAL until the long transaction completes, even as other transactions are being consumed. This can result in an increased slot size and slow down logical decoding. For each transaction being decoded, changes from long-running transactions that overlap with the current transaction must also be decoded again and again.</p>
<h3 id="heading-configure-statementtimeout-and-idleintransactionsessiontimeout-to-avoid-long-running-transactions"><strong>Configure</strong> <code>statement_timeout</code> <strong>and</strong> <code>idle_in_transaction_session_timeout</code> to avoid long running transactions</h3>
<p>Long-running transactions can occur either due to active queries running for a long time or stale transactions that were never committed. To avoid these scenarios, you should configure <code>statement_timeout</code>, which terminates queries that run longer than expected, and <code>idle_in_transaction_session_timeout</code>, which terminates stale transactions.</p>
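<p>As a minimal sketch (the values below are placeholders to tune for your workload, and both settings can also be applied per role or per database instead of cluster-wide):</p>
<pre><code class="lang-sql">-- Cap how long any single statement may run.
ALTER SYSTEM SET statement_timeout = '1h';

-- Terminate sessions that sit idle inside an open transaction.
ALTER SYSTEM SET idle_in_transaction_session_timeout = '10min';

-- Reload the configuration so the new defaults take effect.
SELECT pg_reload_conf();
</code></pre>
<p>Note that a cluster-wide <code>statement_timeout</code> also cancels legitimate long-running queries, so many teams prefer setting it on specific roles or databases.</p>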
<h2 id="heading-use-logical-replication-protocols">Use logical replication protocols</h2>
<p>The replication protocol command (<code>START_REPLICATION</code>) supports different versions of streaming, controlled via the <code>proto_version</code> parameter. The default <code>proto_version</code> (v1) allows clients to consume changes only from committed transactions. <code>proto_version</code> v2 allows clients to consume changes from in-flight transactions, improving performance by letting them process changes immediately without waiting for the COMMIT. However, it is the client's responsibility to handle transaction semantics. At PeerDB, we are working on a <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/1712">feature</a> that supports <code>proto_version</code> v2.</p>
<p>This changes decoding from an O(N^2) operation to an O(N) operation and also helps address scenarios with long-running transactions. This <a target="_blank" href="https://blog.peerdb.io/exploring-versions-of-the-postgres-logical-replication-protocol">blog</a> provides a deep dive into how replication slot growth is affected with v2.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1706568527289/9278eb76-9e32-4916-8b12-0641636edd3a.png?auto=compress,format&amp;format=webp" alt /></p>
<p><strong>Reference:</strong> <a target="_blank" href="https://blog.peerdb.io/exploring-versions-of-the-postgres-logical-replication-protocol">https://blog.peerdb.io/exploring-versions-of-the-postgres-logical-replication-protocol</a></p>
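<p>For reference, this is roughly what consuming a slot with protocol version 2 looks like when using the <code>pgoutput</code> plugin. <code>START_REPLICATION</code> must be issued over a replication connection, and the slot and publication names below are placeholders:</p>
<pre><code class="lang-sql">-- Run on a replication connection (e.g. psql "dbname=postgres replication=database").
-- proto_version '2' together with streaming 'on' lets the client receive changes
-- from large in-flight transactions before they COMMIT (Postgres 14+).
START_REPLICATION SLOT peerdb_slot LOGICAL 0/0 (
    proto_version '2',
    publication_names 'peerdb_publication',
    streaming 'on'
);
</code></pre>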
<h2 id="heading-no-activity-can-lead-to-replication-slot-growth">No activity can lead to replication slot growth.</h2>
<p>It is common to see replication slots grow in size in dev/test/QA databases during periods of inactivity. In such scenarios, the WAL continues to grow due to maintenance processes like VACUUMs. To avoid this and ensure that the slot is consistently consumed by the client, you can follow one of the approaches below:</p>
<ol>
<li><p><strong>Include a heartbeat table</strong> that continuously gets updated in your replication pipeline. This ensures that the slot keeps moving. More details on this approach can be found <a target="_blank" href="https://docs.peerdb.io/bestpractices/heartbeat">here</a>.</p>
</li>
<li><p><strong>Use</strong> <a target="_blank" href="https://pgpedia.info/p/pg_logical_emit_message.html"><code>pg_logical_emit_message</code></a> to periodically emit a message in the WAL and ensure that this message is consumed by the client by confirming the LSN (Log Sequence Number) of the message. A sketch of both approaches is shown after this list.</p>
</li>
</ol>
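<p>A rough sketch of both approaches is below; the table, slot, and message prefix names are placeholders, and the statements would typically be run on a schedule by cron, <code>pg_cron</code>, or the replication tool itself:</p>
<pre><code class="lang-sql">-- Approach 1: a tiny heartbeat table that is included in the publication and
-- updated periodically, so the slot always has fresh changes to consume.
CREATE TABLE IF NOT EXISTS peerdb_heartbeat (id int PRIMARY KEY, beat timestamptz);
INSERT INTO peerdb_heartbeat VALUES (1, now())
ON CONFLICT (id) DO UPDATE SET beat = now();

-- Approach 2: emit a non-transactional logical message directly into the WAL;
-- the client then confirms the returned LSN to advance the slot.
SELECT pg_logical_emit_message(false, 'peerdb_heartbeat', now()::text);
</code></pre>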
<h2 id="heading-use-table-filtering-when-creating-publications">Use table filtering when creating PUBLICATIONs</h2>
<p>If you are replicating changes from only a few tables, ensure that you create a PUBLICATION that includes just those tables. Postgres efficiently persists changes for only those tables in the replication slot. This helps reduce the size of the replication slot and improves logical decoding performance.</p>
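<p>For example, instead of <code>FOR ALL TABLES</code>, list only the tables you replicate (the names below are placeholders):</p>
<pre><code class="lang-sql">-- Publication restricted to the tables being replicated.
CREATE PUBLICATION peerdb_publication FOR TABLE public.orders, public.customers;

-- Tables can be added later as the pipeline grows.
ALTER PUBLICATION peerdb_publication ADD TABLE public.invoices;
</code></pre>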
<h2 id="heading-some-useful-postgres-configs">Some useful Postgres configs</h2>
<h3 id="heading-maxslotwalkeepsize"><strong>max_slot_wal_keep_size</strong></h3>
<p>To keep your logical replication slots from consuming excessive disk space, set the <a target="_blank" href="https://postgresqlco.nf/doc/en/param/max_slot_wal_keep_size/"><code>max_slot_wal_keep_size</code></a> parameter. This config limits the amount of WAL (Write-Ahead Log) data a replication slot can retain. Choose a size that suits your environment to ensure old WAL files are removed when the limit is reached, preventing disk space issues.</p>
<h3 id="heading-logicaldecodingworkmem"><strong>logical_decoding_work_mem</strong></h3>
<p>To control memory usage during logical decoding, adjust the <a target="_blank" href="https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-LOGICAL-DECODING-WORK-MEM"><code>logical_decoding_work_mem</code></a> parameter. This setting allocates a specific amount of memory for the decoding process of each replication slot. Set a value that balances memory use and performance according to your system's capacity and workload. You can consider increasing <code>logical_decoding_work_mem</code> if you observe IO as the <code>wait_event</code> for the <code>START_REPLICATION</code> process. More details on tuning <code>logical_decoding_work_mem</code> can be found <a target="_blank" href="https://www.enterprisedb.com/postgres-tutorials/postgres-13-logicaldecodingworkmem-and-how-it-saves-your-server-going-out-memory">here</a>.</p>
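<p>Both settings can be changed without a restart; the values below are illustrative starting points rather than recommendations:</p>
<pre><code class="lang-sql">-- Cap how much WAL a slot may retain (Postgres 13+). Once the limit is exceeded,
-- the slot is invalidated instead of filling up the disk, so size it generously.
ALTER SYSTEM SET max_slot_wal_keep_size = '50GB';

-- Give logical decoding more memory before it spills decoded changes to disk.
ALTER SYSTEM SET logical_decoding_work_mem = '256MB';

SELECT pg_reload_conf();
</code></pre>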
<h1 id="heading-supporting-ddl-changes">Supporting DDL Changes</h1>
<p>One of the most common and well-known issues with logical decoding is that it doesn't capture schema changes such as adding or dropping columns, changing data types, adding new tables, and so on.</p>
<p>An approach that clients could follow is to leverage Relation and Type messages that logical decoding provides. Whenever columns are added or dropped, Postgres sends a Relation (<strong>'R'</strong>) message with the new schema, preceding the new row. Clients can perform a diff with the old schema to identify the new or dropped columns. A similar approach can be followed for supporting changing data types, where Postgres sends a Type (<strong>'T'</strong>) message.</p>
<p>Within PeerDB, we <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/480">implemented</a> the above intricate approach to support automatic schema changes, such as adding or dropping columns.</p>
<h1 id="heading-toast-columns-need-replica-identity-full">TOAST columns need REPLICA IDENTITY FULL</h1>
<p>Logical decoding doesn't capture the values of TOAST columns (large column values that Postgres stores out of line) that haven't been changed in an UPDATE operation. You need to enable <code>REPLICA IDENTITY FULL</code> for a table to capture the values of unchanged TOAST columns.</p>
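<p>Enabling it is a one-line change per table (the table name below is a placeholder):</p>
<pre><code class="lang-sql">-- Log all old column values, including unchanged TOASTed ones, for UPDATEs and DELETEs.
ALTER TABLE public.orders REPLICA IDENTITY FULL;

-- Verify the setting: 'f' = full, 'd' = default (primary key).
SELECT relname, relreplident FROM pg_class WHERE relname = 'orders';
</code></pre>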
<p><a target="_blank" href="https://xata.io/blog/replica-identity-full-performance">Here</a> is a useful blog that talks about the impact of setting <code>REPLICA IDENTITY FULL</code> on your Postgres database. <strong>TL;DR:</strong><code>REPLICA IDENTITY FULL</code> might be fine for tables with primary keys or from Postgres 16, where indexes can be used on the subscriber side for searching the rows.</p>
<p>In PeerDB, for certain targets, we <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/111/">implemented</a> a method to replicate unchanged TOAST columns without requiring <code>REPLICA IDENTITY FULL</code>.</p>
<h1 id="heading-logical-decoding-doesnt-support-generated-columns">Logical decoding doesn’t support generated columns</h1>
<p>One limitation of Postgres logical decoding is that it doesn’t support generated columns. Generated columns are columns whose values are automatically computed from other columns in the table using a specified expression. Values of these columns appear as <code>NULL</code> for clients consuming the logical replication slots. A few workarounds include:</p>
<ol>
<li><p><strong>In-flight transformations:</strong> Compute the value of generated columns while the data is in transit by performing transformations. PeerDB supports <a target="_blank" href="https://blog.peerdb.io/row-level-transformations-in-postgres-cdc-using-lua">row-level transformations</a> out-of-the-box to enable such generated column use cases.</p>
</li>
<li><p><strong>Extract, Load, and Transform (ELT):</strong> Another approach we've seen customers follow is to perform transformations once the data reaches the target. Customers often use transformation tools such as <a target="_blank" href="https://www.getdbt.com/lp/free-account?utm_medium=paid-search&amp;utm_source=google&amp;utm_campaign=q2-2024_us-brand_cv&amp;utm_content=cloud-account_kw-dbt-ex___&amp;utm_term=all_na_us&amp;utm_term=dbt&amp;utm_campaign=q2-2024_us-brand_cv&amp;utm_source=adwords&amp;utm_medium=ppc&amp;hsa_acc=8253637521&amp;hsa_cam=20002625512&amp;hsa_grp=147774946229&amp;hsa_ad=660676532053&amp;hsa_src=g&amp;hsa_tgt=kwd-95889999&amp;hsa_kw=dbt&amp;hsa_mt=e&amp;hsa_net=adwords&amp;hsa_ver=3&amp;gad_source=1&amp;gclid=Cj0KCQjwpZWzBhC0ARIsACvjWROh2AYD3CWBPyhCNudSTux9nFLIZVa_IB0k9HkRhyQ_b3NEJ3kSxKYaAmwtEALw_wcB">DBT</a> or <a target="_blank" href="https://coalesce.io/">Coalesce</a>.</p>
</li>
</ol>
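<p>To make the limitation concrete, here is a small hypothetical table with a generated column; on the consumer side of logical decoding, <code>total</code> arrives as <code>NULL</code>, which is what the two workarounds above compensate for:</p>
<pre><code class="lang-sql">CREATE TABLE public.line_items (
    id       bigint PRIMARY KEY,
    price    numeric NOT NULL,
    quantity int NOT NULL,
    -- computed by Postgres, but not emitted through logical decoding
    total    numeric GENERATED ALWAYS AS (price * quantity) STORED
);
</code></pre>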
<h1 id="heading-logical-replication-slots-dont-persist-on-postgres-upgrades">Logical Replication Slots Don't Persist on Postgres Upgrades</h1>
<p>Upgrading PostgreSQL versions presents a challenge because logical replication slots do not persist through upgrades. However, it's possible to manage upgrades without full resyncs by recreating the replication slot during maintenance.</p>
<p>The process is as follows:</p>
<ol>
<li><p><strong>Enter Maintenance Mode</strong>: Place the database in maintenance mode to prevent data changes.</p>
</li>
<li><p><strong>Upgrade PostgreSQL</strong>: Perform the PostgreSQL version upgrade.</p>
</li>
<li><p><strong>Recreate Replication Slot</strong>: Recreate the logical replication slot.</p>
</li>
<li><p><strong>Exit Maintenance Mode</strong>: Resume normal operations.</p>
</li>
</ol>
<p>This method ensures minimal disruption and avoids the need for a complete resync of the data.</p>
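<p>For step 3, recreating the slot is a single call once the upgraded server is up; the slot and plugin names below are placeholders and should match whatever your replication pipeline expects:</p>
<pre><code class="lang-sql">-- Recreate the logical replication slot after the upgrade, before resuming writes.
SELECT pg_create_logical_replication_slot('peerdb_slot', 'pgoutput');

-- Verify it exists and note its starting LSN.
SELECT slot_name, plugin, restart_lsn FROM pg_replication_slots;
</code></pre>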
<h2 id="heading-starting-postgres-17-replication-slots-are-persisted-on-upgrades"><strong>Starting Postgres 17 replication slots are persisted on upgrades</strong></h2>
<p>Starting with PostgreSQL 17, logical replication slots will persist through version upgrades. This improvement will apply to future upgrades, such as from version 17 to 18, simplifying the upgrade process significantly.</p>
<h1 id="heading-logical-replication-slot-dont-persist-failovers">Logical Replication Slot don't persist Failovers</h1>
<p>Another issue with logical decoding is that replication slots don't persist during a failover, i.e., when a standby becomes primary. One potential solution is to implement retry logic in your clients to recreate the replication slot post-failover. However, this approach is not fully reliable and can incur data loss, as it is not trivial to recreate the slot immediately after failover, before new data is ingested.</p>
<h2 id="heading-postgres-17-will-support-failover-slots">Postgres 17 will support failover slots</h2>
<p>The good news is that PostgreSQL 17 will support failover slots, allowing replication slots to persist automatically through failovers. This enhancement simplifies the failover process, ensures data reliability, and reduces manual intervention, resulting in more robust and resilient replication handling.</p>
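<p>Based on our reading of the Postgres 17 documentation, a slot can be marked for failover at creation time and kept in sync on the standby roughly as follows (the values are illustrative, and the standby must already be connected to the primary via a physical slot):</p>
<pre><code class="lang-sql">-- On the primary: create the slot with failover enabled
-- (the fifth argument to pg_create_logical_replication_slot in Postgres 17).
SELECT pg_create_logical_replication_slot('peerdb_slot', 'pgoutput', false, false, true);

-- On the standby: synchronize failover-enabled slots from the primary.
ALTER SYSTEM SET sync_replication_slots = on;
ALTER SYSTEM SET hot_standby_feedback = on;
SELECT pg_reload_conf();
</code></pre>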
<h1 id="heading-conclusion">Conclusion</h1>
<p>At PeerDB, we are building a replication tool with a laser focus on Postgres. To provide the fastest and most reliable Postgres replication experience, we delve deeply into understanding Postgres logical decoding. This blog summarizes our efforts over the past year. Many of the challenges discussed above have already been addressed in our product, and for those that haven't, we work closely with our customers to find and implement workarounds. We hope you enjoyed reading the blog! If you want to give PeerDB a try, these links should prove useful :)</p>
<ol>
<li><p><a target="_blank" href="https://github.com/PeerDB-io/peerdb">PeerDB's Github repo</a></p>
</li>
<li><p><a target="_blank" href="https://docs.peerdb.io/quickstart">Quickstart</a></p>
</li>
<li><p><a target="_blank" href="https://www.peerdb.io/sign-up">Directly reach out to us!</a></p>
</li>
<li><p><a target="_blank" href="https://join.slack.com/t/peerdb-public/shared_invite/zt-1wo9jydev-EXInbMtCtpAKFFWdi7QvLQ">Join PeerDB's Slack community</a></p>
</li>
<li><p><a target="_blank" href="https://docs.peerdb.io/introduction">PeerDB docs</a></p>
</li>
</ol>
]]></content:encoded></item><item><title><![CDATA[Postgres to Elasticsearch Real time Replication using PeerDB]]></title><description><![CDATA[Today, PeerDB is pleased to announce that our target connector for Elasticsearch is now in beta. Elasticsearch is a popular search engine system underpinned by a distributed document database, and we have been seeing a lot of use cases for Elasticsea...]]></description><link>https://blog.peerdb.io/postgres-to-elasticsearch-real-time-replication-using-peerdb</link><guid isPermaLink="true">https://blog.peerdb.io/postgres-to-elasticsearch-real-time-replication-using-peerdb</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[elasticsearch]]></category><category><![CDATA[Databases]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[replication]]></category><dc:creator><![CDATA[Kevin Biju]]></dc:creator><pubDate>Thu, 09 May 2024 14:53:56 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1715256386299/ca42c778-87b9-4acb-8024-2f180cf432de.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Today, <a target="_blank" href="https://peerdb.io">PeerDB</a> is pleased to announce that our target connector for Elasticsearch is now in beta. Elasticsearch is a popular search engine system underpinned by a distributed document database, and we have been seeing a lot of use cases for Elasticsearch in our customers' data stacks. This is our first connector for a document database, and we are excited to bring PeerDB's performance, reliability and value to users looking to move data from Postgres to Elasticsearch.</p>
<p>This post explains some of the use cases that are enabled by Postgres to Elasticsearch replication, followed by a quick demo showcasing the high performance and low latency of Postgres to Elasticsearch replication using PeerDB. Finally, we go through a high-level overview of the architecture of the connector.</p>
<h2 id="heading-postgres-to-elasticsearch-replication-use-cases">Postgres to Elasticsearch Replication Use cases</h2>
<p>Some common use cases for Postgres to Elasticsearch replication via CDC or query replication are:</p>
<ol>
<li><p><strong>Efficient search for large ingest volumes</strong>: Elasticsearch's bread and butter use case is as a search engine operating efficiently even on humongous volumes of data. From full-text and weighted search to even complex semantic searches using built-in NLP models, Elasticsearch is very flexible and tunable. It is commonly used for ingesting and indexing large volumes of logs, and even as a backing engine for searching large websites and internal knowledge bases.</p>
</li>
<li><p><strong>Denormalizing data to documents:</strong> Data models are often stored in Postgres in a highly normalized form, which is great for transactional integrity but bad for complex queries where you may have to use joins or CTEs. Elasticsearch, being a document database, prefers storing data in a denormalized form. Using PeerDB's query replication capabilities, you are able to periodically transform your data into a denormalized form, which makes it more efficient for querying by downstream consumers. Some processing can also be done using an Elasticsearch <a target="_blank" href="https://www.elastic.co/guide/en/elasticsearch/reference/current/ingest.html">ingest pipeline</a>.</p>
</li>
</ol>
<h2 id="heading-low-latency-replication-from-postgres-to-elasticsearch-using-peerdb">Low latency replication from Postgres to Elasticsearch using PeerDB</h2>
<p>In this section, I'll walk through a quick demonstration of Postgres to Elasticsearch replication using PeerDB in <strong>Change Data Capture (CDC) mode</strong>. Using PeerDB for replication from Postgres to Elasticsearch offers a few benefits, the primary ones being <a target="_blank" href="https://blog.peerdb.io/how-can-we-make-pgdump-and-pgrestore-5-times-faster#heading-parallel-snapshotting-to-make-pgdump-amp-pgrestore-multi-threaded-per-table">blazing-fast initial loads</a> and <a target="_blank" href="https://blog.peerdb.io/exploring-versions-of-the-postgres-logical-replication-protocol">sub-minute latencies by constantly reading the slot</a>. PeerDB is able to offer these by being laser-focused on Postgres replication.</p>
<h3 id="heading-postgres-setup">Postgres Setup</h3>
<p>You can use any Postgres database in the cloud or on-prem. For simplicity, I'm using a Postgres cluster running locally in a Docker container for this demo. We create a table named <code>oss1</code> with a continuous ingest of 1000 rows per second using a multi-valued insert statement.</p>
<pre><code class="lang-sql">postgres=<span class="hljs-comment"># CREATE TABLE oss1 (</span>
           id INT PRIMARY KEY GENERATED BY DEFAULT AS IDENTITY,
           c1 INT,
           c2 INT,
           t TEXT,
           updated_at TIMESTAMP <span class="hljs-keyword">WITH</span> <span class="hljs-built_in">TIME</span> ZONE <span class="hljs-keyword">DEFAULT</span> <span class="hljs-keyword">now</span>()
         );
<span class="hljs-keyword">CREATE</span> <span class="hljs-keyword">TABLE</span>
postgres=<span class="hljs-comment"># INSERT INTO oss1 (c1, c2, t)</span>
<span class="hljs-keyword">SELECT</span>
    generate_series <span class="hljs-keyword">AS</span> c1,
    generate_series * <span class="hljs-number">2</span> <span class="hljs-keyword">AS</span> c2,
    <span class="hljs-string">'text_'</span> || generate_series <span class="hljs-keyword">AS</span> t
<span class="hljs-keyword">FROM</span> 
    generate_series(<span class="hljs-number">1</span>, <span class="hljs-number">1000</span>); 
<span class="hljs-comment"># to run the INSERT once per second    </span>
postgres=<span class="hljs-comment"># \watch 1</span>
<span class="hljs-keyword">INSERT</span> <span class="hljs-number">0</span> <span class="hljs-number">1000</span>
<span class="hljs-keyword">INSERT</span> <span class="hljs-number">0</span> <span class="hljs-number">1000</span>
<span class="hljs-keyword">INSERT</span> <span class="hljs-number">0</span> <span class="hljs-number">1000</span>
</code></pre>
<h3 id="heading-elasticsearch-setup">Elasticsearch Setup</h3>
<p>You can set up an Elasticsearch instance using its <a target="_blank" href="https://github.com/deviantony/docker-elk">Docker compose setup</a> on-prem or on a cloud VM. Alternatively, you can use <a target="_blank" href="https://www.elastic.co/cloud">Elasticsearch Cloud</a>. For this demo, I am using the Docker compose setup running locally.</p>
<h3 id="heading-peerdb-setup">PeerDB Setup</h3>
<p>You can use <a target="_blank" href="https://github.com/PeerDB-io/peerdb">PeerDB Open Source</a> or <a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a> to deploy a PeerDB instance. For the scope of this demo, I'm deploying PeerDB open source locally via Docker compose.</p>
<h3 id="heading-create-peers-and-mirror-for-postgres-to-elasticsearch-replication">Create Peers and Mirror for Postgres to Elasticsearch Replication</h3>
<p>In the PeerDB world, peers refer to either source or target data stores. You can use PeerDB's UI to create the <a target="_blank" href="https://docs.peerdb.io/connect/rds_postgres#create-rds-postgres-peer-in-peerdb">Postgres</a> and the <a target="_blank" href="https://docs.peerdb.io/connect/elasticsearch">Elasticsearch</a> peers. A mirror is then created between a source peer and a destination peer for data replication. You can use PeerDB's UI to create a MIRROR for replicating data from Postgres to Elasticsearch.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1715116855623/35169594-b925-4c90-bdfd-45f6cd6a863b.png" alt class="image--center mx-auto" /></p>
<p>I have created a <a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-clickhouse"><strong>Change Data Capture (CDC) based MIRROR</strong></a> that replicates data using Postgres' <a target="_blank" href="https://www.postgresql.org/docs/current/wal-intro.html">Write-Ahead Log (WAL)</a> and <a target="_blank" href="https://www.postgresql.org/docs/current/logicaldecoding-explanation.html#LOGICALDECODING-EXPLANATION-LOG-DEC">Logical Decoding</a>. It involves two steps:</p>
<ol>
<li><p><strong>An initial load</strong> that takes a fully consistent snapshot of existing data in Postgres and copies it to Elasticsearch; Through PeerDB's <a target="_blank" href="https://blog.peerdb.io/parallelized-initial-load-for-cdc-based-streaming-from-postgres#heading-parallelized-initial-snapshot-for-cdc-based-streaming">parallel snapshotting</a>, you can expect significantly faster initial loads. We've seen terabytes of data moved in hours vs days.</p>
</li>
<li><p><strong>Change Data Capture (CDC):</strong> Once the initial load is completed, PeerDB constantly reads changes in Postgres through the logical replication slot and replicates those changes to Elasticsearch. Thanks to our streaming architecture, expect data latency in the range of seconds for a continuously running mirror to Elasticsearch.</p>
</li>
</ol>
<p>The initial load should complete pretty quickly, and rows should be present in the created Elasticsearch index. After entering continuous CDC mode, new rows should show up as and when they are inserted. Attached below is a quick video showing a Postgres to Elasticsearch CDC mirror.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/59af80e870dc4b6791e95e4a136ea8e1?sid=d5e9068f-1e6d-4b03-b688-49f4171ad057">https://www.loom.com/share/59af80e870dc4b6791e95e4a136ea8e1?sid=d5e9068f-1e6d-4b03-b688-49f4171ad057</a></div>
<p> </p>
<h2 id="heading-architecture-and-design-choices">Architecture and Design Choices</h2>
<p>We've <a target="_blank" href="https://blog.peerdb.io/building-a-streaming-platform-in-go-for-postgres">talked about</a> PeerDB's streaming architecture in detail before, but in summary PeerDB utilizes Go's goroutines and channels to efficiently read data from PostgreSQL using logical replication, and then pushes it to Elasticsearch in batches through the Bulk API. This approach enhances the execution time by enabling parallel processing.</p>
<p>Our data warehouse connectors store the data in a staging table before pushing it to the final table for cost and performance reasons. Due to Elasticsearch's architecture and query language, we are able to avoid this intermediate step and send the stream of processed records directly to Elasticsearch indices via the <a target="_blank" href="https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-bulk.html">bulk API</a>.</p>
<h3 id="heading-handling-updates-and-deletes-in-elasticsearch"><strong>Handling Updates and Deletes in Elasticsearch</strong></h3>
<p>PeerDB supports Elasticsearch as a target for both CDC and query replication. In most cases we recommend using CDC because of its ease of use, increased reliability and its ability to replicate DELETEs to Elasticsearch. However, this limits the scope of transformations that can be done before loading to Elasticsearch.</p>
<p>To support deduplication on the Elasticsearch side, we need a unique ID for each document that remains consistent so we can update or delete it as per the source. For tables with one column in the primary key, the value of the column itself can be used. For tables with multiple columns in the primary key, we instead choose to hash the values of the columns together, giving a small unique identifier irrespective of the width of the row.</p>
<pre><code class="lang-go"><span class="hljs-comment">// simplified Go code</span>
<span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">primaryKeyColsHash</span><span class="hljs-params">(record []any, colIndices []<span class="hljs-keyword">int</span>)</span> <span class="hljs-title">string</span></span> {
    hasher := sha256.New()

    <span class="hljs-keyword">for</span> _, colIndex := <span class="hljs-keyword">range</span> colIndices {
        <span class="hljs-comment">// write the value to the hasher</span>
        _, _ = fmt.Fprint(hasher, record[colIndex])
    }
    hashBytes := hasher.Sum(<span class="hljs-literal">nil</span>)
    <span class="hljs-keyword">return</span> base64.RawURLEncoding.EncodeToString(hashBytes)
}
</code></pre>
<pre><code class="lang-json"># Sample document uploaded by PeerDB to Elasticsearch.
# Note how the _id field is a (base64 encoded) hash of the
# primary key columns id and c1.
{
  <span class="hljs-attr">"_index"</span>: <span class="hljs-string">"public.oss2"</span>,
  <span class="hljs-attr">"_id"</span>: <span class="hljs-string">"SAgdSqEaQyGYWxOo8Dj2s0DbXsQXLTC_CWlds8-c4kY"</span>,
  <span class="hljs-attr">"_version"</span>: <span class="hljs-number">1</span>,
  <span class="hljs-attr">"_seq_no"</span>: <span class="hljs-number">0</span>,
  <span class="hljs-attr">"_primary_term"</span>: <span class="hljs-number">1</span>,
  <span class="hljs-attr">"found"</span>: <span class="hljs-literal">true</span>,
  <span class="hljs-attr">"_source"</span>: {
    <span class="hljs-attr">"c1"</span>: <span class="hljs-number">434017</span>,
    <span class="hljs-attr">"c2"</span>: <span class="hljs-number">922856</span>,
    <span class="hljs-attr">"id"</span>: <span class="hljs-number">8</span>,
    <span class="hljs-attr">"t"</span>: <span class="hljs-string">"pgbenchinsertc4b998821cc6b161e65489b3"</span>,
    <span class="hljs-attr">"updated_at"</span>: <span class="hljs-string">"2024-05-08T18:33:39.031107Z"</span>
  }
}
</code></pre>
<p>Query replication can be done in append mode, where any change creates a fresh document in Elasticsearch, or in upsert mode, where some columns are designated as key columns and documents are deduplicated on them in a way similar to CDC.</p>
<h3 id="heading-dynamic-mapping-for-data-types"><strong>Dynamic Mapping for Data Types</strong></h3>
<p>By default, PeerDB currently uses Elasticsearch's <a target="_blank" href="https://www.elastic.co/guide/en/elasticsearch/reference/current/dynamic-field-mapping.html">dynamic mapping</a> to automatically infer a data type mapping based on the contents of the documents in an index. In practice, numeric types are mapped to either <code>long</code> or <code>float</code>, timestamp types are mapped to <code>date</code>, and most other types map to <code>text</code>. A more detailed mapping is available <a target="_blank" href="https://docs.peerdb.io/datatypes/datatype-matrix">here</a>. This works for many use cases. If needed, an explicit mapping can be provided by the user during manual index creation, and PeerDB will load documents into this index.</p>
<h2 id="heading-conclusion"><strong>Conclusion</strong></h2>
<p>The Elasticsearch connector is in beta -- we already have customers who have moved billions of rows from Postgres to Elasticsearch using PeerDB. If you're an Elasticsearch user and wish to replicate data from Postgres to Elasticsearch using PeerDB, do give PeerDB a shot! We would love to help you out or get feedback:</p>
<ol>
<li><p><a target="_blank" href="https://app.peerdb.cloud/"><strong>Try PeerDB Cloud for free.</strong></a></p>
</li>
<li><p><a target="_blank" href="https://github.com/PeerDB-io/peerdb">Visit PeerDB's <strong>GitHub</strong> repository to Get Started.</a></p>
</li>
<li><p><a target="_blank" href="https://join.slack.com/t/peerdb-public/shared_invite/zt-1wo9jydev-EXInbMtCtpAKFFWdi7QvLQ">Join our Slack and say hi!</a></p>
</li>
</ol>
<p>Thanks for reading!</p>
]]></content:encoded></item><item><title><![CDATA[Row-level transformations in Postgres CDC using Lua]]></title><description><![CDATA[Earlier this week, we launched PeerDB Streams, our latest product offering for real-time replication from Postgres to queues and message brokers such as Kafka, Redpanda, Google PubSub, Azure Event Hubs, and others.
Today, we are announcing one of the...]]></description><link>https://blog.peerdb.io/row-level-transformations-in-postgres-cdc-using-lua</link><guid isPermaLink="true">https://blog.peerdb.io/row-level-transformations-in-postgres-cdc-using-lua</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[streaming]]></category><category><![CDATA[change data capture]]></category><category><![CDATA[kafka]]></category><category><![CDATA[Lua]]></category><category><![CDATA[postgres]]></category><category><![CDATA[Security]]></category><category><![CDATA[encryption]]></category><category><![CDATA[Databases]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Wed, 08 May 2024 15:16:11 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1715132433538/164a6649-1d6b-48ab-92eb-ec1a6cf7a405.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Earlier this week, we launched <a target="_blank" href="https://blog.peerdb.io/peerdb-streams-simple-native-postgres-change-data-capture">PeerDB Streams</a>, our latest product offering for real-time replication from Postgres to queues and message brokers such as <a target="_blank" href="https://kafka.apache.org/">Kafka</a>, <a target="_blank" href="https://redpanda.com/">Redpanda</a>, <a target="_blank" href="https://cloud.google.com/pubsub?hl=en">Google PubSub</a>, <a target="_blank" href="https://azure.microsoft.com/en-us/products/event-hubs">Azure Event Hubs</a>, and others.</p>
<p>Today, we are announcing one of the flagship features of this offering — support for <a target="_blank" href="https://docs.peerdb.io/lua/reference">row-level transformations</a> as part of Postgres Change Data Capture (CDC). You can write simple <a target="_blank" href="https://www.lua.org/">Lua</a> scripts to define a transformation and add it as part of the replication (MIRROR). With this feature, users will be able to seamlessly perform in-flight row-level transformations to Postgres data before it is streamed to the target.</p>
<p>In this blog, we will cover various use cases that require row-level transformations and how they can be accomplished using PeerDB. We will also walk through example use cases using sample Lua scripts. Toward the end, we will delve a bit deeper into why we chose Lua as the scripting language and how we implemented this feature.</p>
<h2 id="heading-row-level-transformation-in-postgres-cdc-use-cases">Row-Level Transformation in Postgres CDC: Use Cases</h2>
<p>There are multiple use cases that require row-level transformations during Postgres CDC. A few of the common scenarios include:</p>
<ol>
<li><p><strong>Masking PII Data</strong>: Replace sensitive PII with tokens or pseudonyms before data enters Kafka, obfuscating it from other micro-services in transactional outbox scenarios, thus enhancing privacy and compliance.</p>
</li>
<li><p><strong>Changing Data Format</strong>: Transform data into required formats like Protobuf, JSON, MsgPack, Avro and so on for seamless integration and optimized handling across systems.</p>
</li>
<li><p><strong>Generated Columns</strong>: Calculate new column values based on transformations of existing data, such as aggregations or derived metrics, and stream these new columns to enhance real-time data analysis and reporting.</p>
</li>
<li><p><strong>Unnesting JSONs</strong>: Extract elements from JSON objects and flatten them into separate fields within Kafka messages, improving the accessibility and usability of data across different consumer applications.</p>
</li>
<li><p><strong>Topic Routing:</strong> Distribute Change Data Capture (CDC) events to specific Kafka topics based on rules or conditions, facilitating targeted data streaming and processing.</p>
</li>
<li><p><strong>Data Encryption</strong>: Apply encryption to sensitive data before it is written to Kafka, enhancing security and preventing unauthorized access as data moves between systems.</p>
</li>
</ol>
<p>Now, let's see how some of the above use cases can be accomplished using PeerDB, through examples and sample Lua scripts.</p>
<h2 id="heading-sample-schema">Sample Schema</h2>
<p>I will be using the <code>users</code> table shown below to demonstrate the above use cases in PeerDB. This table includes various fields relevant to our testing scenarios.</p>
<pre><code class="lang-sql"><span class="hljs-keyword">CREATE</span> <span class="hljs-keyword">TABLE</span> <span class="hljs-keyword">users</span> (
    <span class="hljs-keyword">id</span> <span class="hljs-built_in">SERIAL</span> PRIMARY <span class="hljs-keyword">KEY</span>,
    first_name <span class="hljs-built_in">VARCHAR</span>(<span class="hljs-number">255</span>),
    last_name <span class="hljs-built_in">VARCHAR</span>(<span class="hljs-number">255</span>),
    ssn <span class="hljs-built_in">CHAR</span>(<span class="hljs-number">11</span>),
    payload JSONB,
    salary_in_usd <span class="hljs-built_in">NUMERIC</span>(<span class="hljs-number">10</span>, <span class="hljs-number">2</span>)
);
</code></pre>
<h2 id="heading-masking-the-ssn-column"><strong>Masking the SSN Column</strong></h2>
<p>You can create a simple Lua script to mask the SSN column in the users table and add it as part of the MIRROR. See below:</p>
<pre><code class="lang-sql">local json = require 'json'

local function maskSSN(ssn)
    if not ssn then
        return nil
    <span class="hljs-keyword">end</span>
    <span class="hljs-comment">-- Replace all but the last four digits of the SSN with "XXX-XX-"</span>
    <span class="hljs-keyword">return</span> string.gsub(ssn, <span class="hljs-string">"^(.-)(%d%d%d%d)$"</span>, <span class="hljs-string">"XXX-XX-%2"</span>)
<span class="hljs-keyword">end</span>

<span class="hljs-keyword">local</span> <span class="hljs-keyword">function</span> RowToMap(<span class="hljs-keyword">row</span>)
    <span class="hljs-keyword">local</span> <span class="hljs-keyword">map</span> = peerdb.RowTable(<span class="hljs-keyword">row</span>)
    <span class="hljs-keyword">for</span> <span class="hljs-keyword">col</span>, val <span class="hljs-keyword">in</span> pairs(<span class="hljs-keyword">map</span>) <span class="hljs-keyword">do</span>
        <span class="hljs-keyword">local</span> kind = peerdb.RowColumnKind(<span class="hljs-keyword">row</span>, <span class="hljs-keyword">col</span>)
        <span class="hljs-keyword">if</span> <span class="hljs-keyword">col</span> == <span class="hljs-string">'ssn'</span> <span class="hljs-keyword">then</span>
            <span class="hljs-comment">-- Apply the maskSSN function to the SSN column</span>
            <span class="hljs-keyword">map</span>[<span class="hljs-keyword">col</span>] = maskSSN(val)
        elseif kind == <span class="hljs-string">'bytes'</span> <span class="hljs-keyword">or</span> kind == <span class="hljs-string">'bit'</span> <span class="hljs-keyword">then</span>
            <span class="hljs-keyword">map</span>[<span class="hljs-keyword">col</span>] = json.bin(val)
        <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">return</span> <span class="hljs-keyword">map</span>
<span class="hljs-keyword">end</span>

<span class="hljs-keyword">local</span> RKINDMAP = {
    <span class="hljs-keyword">insert</span> = string.byte(<span class="hljs-string">'i'</span>, <span class="hljs-number">1</span>),
    <span class="hljs-keyword">update</span> = string.byte(<span class="hljs-string">'u'</span>, <span class="hljs-number">1</span>),
    <span class="hljs-keyword">delete</span> = string.byte(<span class="hljs-string">'d'</span>, <span class="hljs-number">1</span>),
}

<span class="hljs-keyword">function</span> onRecord(r)
    <span class="hljs-keyword">local</span> kind = RKINDMAP[r.kind]
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> kind <span class="hljs-keyword">then</span>
        <span class="hljs-keyword">return</span>
    <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">local</span> <span class="hljs-built_in">record</span> = {
        <span class="hljs-keyword">action</span> = kind,
        lsn = r.checkpoint,
        <span class="hljs-built_in">time</span> = r.commit_time,
        <span class="hljs-keyword">source</span> = r.source,
    }
    <span class="hljs-keyword">if</span> r.old <span class="hljs-keyword">then</span>
        record.old = RowToMap(r.old)
    <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">if</span> r.new <span class="hljs-keyword">then</span>
        record.new = RowToMap(r.new)
    <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">return</span> json.encode(<span class="hljs-built_in">record</span>)
<span class="hljs-keyword">end</span>
</code></pre>
<p>PeerDB offers a straightforward <a target="_blank" href="https://docs.peerdb.io/lua/reference">script editor</a> for creating the Lua script to define the transformation. After that, you can add this transformation <a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-kafka">through the UI while creating the MIRROR</a>. See below demo for reference:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/cd9acb9be8a943a3a0e1921bd49a41b9?sid=be164ff7-bd78-4844-a61b-0969a3e69bbc">https://www.loom.com/share/cd9acb9be8a943a3a0e1921bd49a41b9?sid=be164ff7-bd78-4844-a61b-0969a3e69bbc</a></div>
<p> </p>
<p>You can try this yourself in just 10 minutes by following this <a target="_blank" href="https://docs.peerdb.io/quickstart/streams-quickstart">Quickstart guide</a>.</p>
<h2 id="heading-changing-data-format-to-msgpack"><strong>Changing Data Format to MsgPack</strong></h2>
<p>For the <code>users</code> table mentioned above, to change the data format to <a target="_blank" href="https://msgpack.org/index.html">MsgPack</a>, you can use this <a target="_blank" href="https://github.com/PeerDB-io/examples/blob/main/msgpack.lua">example Lua script</a>. We've seen a few of our customers use MsgPack because it is more efficient than JSON: its compact binary format reduces data size and speeds up both data transmission and parsing.</p>
<p>See the 2-minute demo below, which shows how this is done with PeerDB.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/2fbc4cf1aafc4ea08d2eeda0ffbc127a?sid=6381cdd2-7c86-4d6c-8911-4262496e9d45">https://www.loom.com/share/2fbc4cf1aafc4ea08d2eeda0ffbc127a?sid=6381cdd2-7c86-4d6c-8911-4262496e9d45</a></div>
<p> </p>
<h2 id="heading-generated-additional-columns"><strong>Generated Additional Columns</strong></h2>
<p>For the <code>users</code> table, let's say I want to add a new column, <code>salary_in_cad</code>, as part of the replication, which converts the salary from US dollars to Canadian dollars. You can create a script as shown in this <a target="_blank" href="https://github.com/PeerDB-io/examples/blob/main/msgpack.lua">example</a> (see below) and add it as part of the MIRROR. Below is a snippet of the Lua script that does this:</p>
<pre><code class="lang-sql">local json = require "json"

local function RowToMap(row)
    if not row then
        return
    <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">local</span> <span class="hljs-keyword">map</span> = peerdb.RowTable(<span class="hljs-keyword">row</span>)
    map.salary_in_cad = map.salary_in_usd * <span class="hljs-number">1.4</span>
    <span class="hljs-keyword">return</span> <span class="hljs-keyword">map</span>
<span class="hljs-keyword">end</span>

<span class="hljs-keyword">local</span> OPMAP = {
    <span class="hljs-keyword">insert</span> = <span class="hljs-string">"c"</span>,
    <span class="hljs-keyword">update</span> = <span class="hljs-string">"u"</span>,
    <span class="hljs-keyword">delete</span> = <span class="hljs-string">"d"</span>,
}

<span class="hljs-keyword">function</span> onRecord(<span class="hljs-built_in">record</span>)
    <span class="hljs-keyword">local</span> op = OPMAP[record.kind]
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> op <span class="hljs-keyword">then</span>
        <span class="hljs-keyword">return</span>
    <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">return</span> json.encode {
        op = op,
        <span class="hljs-keyword">before</span> = RowToMap(record.old),
        <span class="hljs-keyword">after</span> = RowToMap(record.new),
        commitms = record.commit_time.unix_milli,
        <span class="hljs-keyword">table</span> = record.source,
        lsn = record.checkpoint,
    }
<span class="hljs-keyword">end</span>
</code></pre>
<h2 id="heading-unnesting-the-payload-jsonb-column"><strong>Unnesting the Payload JSONB column</strong></h2>
<p>For the users table, let's say you want to unnest the payload JSONB column to separate fields in Kafka. You can create the script as shown in this <a target="_blank" href="https://github.com/PeerDB-io/examples/blob/main/unnest.lua">example</a> and add it as a part of the <a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-kafka">MIRROR</a>.</p>
<h2 id="heading-distribute-load-of-the-users-table-across-topics">Distribute Load of the Users Table Across Topics</h2>
<p>For the users table, let's say you want to distribute the load across two Kafka topics, with odd IDs going to one topic and even IDs going to the other. You can create a script as shown in this <a target="_blank" href="https://github.com/PeerDB-io/examples/blob/main/topic_routing.lua">example</a> (see below) and add it as part of the <a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-kafka">MIRROR</a>.</p>
<pre><code class="lang-sql">local bit32 = require "bit32"
local json = require "json"

local OPMAP = {
    <span class="hljs-keyword">insert</span> = <span class="hljs-string">"c"</span>,
    <span class="hljs-keyword">update</span> = <span class="hljs-string">"u"</span>,
    <span class="hljs-keyword">delete</span> = <span class="hljs-string">"d"</span>,
}

<span class="hljs-keyword">function</span> onRecord(<span class="hljs-built_in">record</span>)
    <span class="hljs-keyword">local</span> op = OPMAP[record.kind]
    <span class="hljs-keyword">if</span> <span class="hljs-keyword">not</span> op <span class="hljs-keyword">then</span>
        <span class="hljs-keyword">return</span>
    <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">local</span> topic
    <span class="hljs-keyword">if</span> bit32.btest(record.row.id, <span class="hljs-number">1</span>) <span class="hljs-keyword">then</span>
        topic = <span class="hljs-string">"odd"</span>
    <span class="hljs-keyword">else</span>
        topic = <span class="hljs-string">"even"</span>
    <span class="hljs-keyword">end</span>
    <span class="hljs-keyword">return</span> {
        topic = topic,
        <span class="hljs-keyword">value</span> = json.encode {
            op = op,
            <span class="hljs-keyword">before</span> = record.old,
            <span class="hljs-keyword">after</span> = record.new,
            commitms = record.commit_time.unix_milli,
            <span class="hljs-keyword">table</span> = record.source,
            lsn = record.checkpoint,
        }
    }
<span class="hljs-keyword">end</span>
</code></pre>
<h2 id="heading-why-we-chose-lua">Why we chose Lua?</h2>
<p>We had multiple options, including WASM, but we chose Lua as it provides a fine balance with respect to engineering velocity, integration with our Go-based platform, and end-user usability. Here are the key reasons for our decision:</p>
<ul>
<li><p><strong>In-process scripting:</strong> Lua supports in-process scripting, avoiding the serialization and deserialization steps required by external plugin systems.</p>
</li>
<li><p><strong>Simplicity and flexibility:</strong> Lua's straightforward design as a glue language makes it easy to embed in various projects, with multiple robust implementations available.</p>
</li>
<li><p><strong>Compatibility with Go:</strong> Lua works well with Go’s garbage collector, simplifying memory management compared to using alternatives like WASM, which would necessitate complex integration with Go's memory management.</p>
</li>
<li><p><strong>Ease of use for end-users:</strong> Lua is an embedded language that allows for on-the-fly scripting without the need for compilation or additional setup steps, unlike systems like Debezium that use Java.</p>
</li>
<li><p><strong>Long-standing presence and resources:</strong> Although Lua is a verbose language, its long-standing presence has resulted in a wealth of resources. This also enables LLM-based coding assistants to be quite accurate, helping users to easily script out row-level transformations.</p>
</li>
</ul>
<h3 id="heading-conclusion"><strong>Conclusion</strong></h3>
<p>We hope you enjoyed reading the blog. We think custom transformations offer a lot of added flexibility and enable many use cases. If you use Kafka, Pub-Sub, Redpanda, or any other queues and wish to replicate data from Postgres to these using PeerDB, please check out the links below or reach out to us directly!</p>
<ol>
<li><p><a target="_blank" href="https://docs.peerdb.io/quickstart/streams-quickstart"><strong>PeerDB Streams Quickstart</strong></a></p>
</li>
<li><p><a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-kafka"><strong>Docs on Postgres to Kafka Replication</strong></a></p>
</li>
<li><p><a target="_blank" href="https://app.peerdb.cloud/"><strong>Try PeerDB Cloud for free</strong></a></p>
</li>
<li><p><a target="_blank" href="https://github.com/PeerDB-io/peerdb"><strong>Visit PeerDB's GitHub r</strong>epo<strong>sitory to Get Started</strong></a></p>
</li>
</ol>
]]></content:encoded></item><item><title><![CDATA[PeerDB Cloud is Now in Public Beta!]]></title><description><![CDATA[🚀 Today, we're excited to announce that PeerDB Cloud is officially entering public beta. If you're a data engineer or an organization looking for a fast, simple, and cost-effective way to replicate data from Postgres to data warehouses such as Snowf...]]></description><link>https://blog.peerdb.io/peerdb-cloud-is-now-in-public-beta</link><guid isPermaLink="true">https://blog.peerdb.io/peerdb-cloud-is-now-in-public-beta</guid><category><![CDATA[data-movement]]></category><category><![CDATA[PostgreSQL]]></category><category><![CDATA[ETL]]></category><category><![CDATA[Databases]]></category><category><![CDATA[snowflake]]></category><category><![CDATA[ClickHouse]]></category><category><![CDATA[kafka]]></category><category><![CDATA[redpanda]]></category><category><![CDATA[ELT]]></category><category><![CDATA[data pipeline]]></category><category><![CDATA[data-warehousing]]></category><category><![CDATA[replication]]></category><category><![CDATA[bigquery]]></category><dc:creator><![CDATA[Kaushik Iska]]></dc:creator><pubDate>Tue, 07 May 2024 15:51:37 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1715096989241/c1b9f9c6-16b5-439e-aaf7-5396402b3a9a.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>🚀 Today, we're excited to announce that <a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a> is officially entering public beta. If you're a data engineer or an organization looking for a fast, simple, and cost-effective way to replicate data from Postgres to data warehouses such as Snowflake, BigQuery, and ClickHouse, or to queues such as Kafka, Redpanda, and Google PubSub, PeerDB Cloud is ready to serve you. If you want to be white-glove onboarded to PeerDB Cloud by the founder directly, you can <a target="_blank" href="https://calendly.com/sai-peerdb/peerdb-cloud-onboarding">book some time here</a>.</p>
<p>We've been operating PeerDB Cloud in Private Beta for the past three months. As the system has matured and we've had the privilege of serving a growing number of customers, we're thrilled to now launch it into Public Beta.</p>
<h3 id="heading-what-is-peerdb-cloud"><strong>What is PeerDB Cloud?</strong></h3>
<p><a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a> is the fully managed offering of PeerDB. It is the easiest way to get started with <a target="_blank" href="http://peerdb.io">PeerDB</a> — in just a couple of clicks, you can have a production-ready PeerDB instance provisioned and a worry-free approach to Postgres replication. PeerDB Cloud comes bundled with PeerDB's core features like:</p>
<h3 id="heading-peerdb-cloud-comes-with-all-peerdb-features">PeerDB Cloud comes with all PeerDB features</h3>
<ol>
<li><p><a target="_blank" href="https://blog.peerdb.io/"><strong>Postgres Change Data Capture</strong></a> with latencies of less than 1 minute for Data Warehouses and single-digit milliseconds for queues.</p>
</li>
<li><p><a target="_blank" href="https://docs.peerdb.io/features/supported-connectors"><strong>High-quality target connectors</strong></a>, for Data Warehouses such as Snowflake, BigQuery, Postgres, ClickHouse, etc., and Queues such as Kafka, Redpanda, Google PubSub, Azure Event Hubs, and so on.</p>
</li>
<li><p><strong>Blazing Fast Parallel Initial Loads and Re-syncs:</strong> PeerDB is <a target="_blank" href="https://blog.peerdb.io/benchmarking-postgres-replication-peerdb-vs-airbyte">10x faster</a> compared to other tools. You can move terabytes of data in a few hours instead of days.</p>
</li>
<li><p><strong>Streaming Query Replication</strong> for production-ready replication based on watermark columns.</p>
</li>
<li><p><strong>Web UI and Unique SQL Interface for ETL:</strong> Easily manage your data with our intuitive Web UI and SQL interface.</p>
</li>
<li><p>And <a target="_blank" href="http://github.com/PeerDB-io/peerdb">many more</a></p>
</li>
</ol>
<h3 id="heading-peerdb-cloud-is-fully-managed-0-capex-and-opex-costs">PeerDB Cloud is fully managed - 0 CAPEX and OPEX costs</h3>
<p>In addition, <strong>PeerDB Cloud provides a fully-managed production-ready experience supporting enterprise-grade features such as:</strong></p>
<ol>
<li><p><strong>High Availability (HA):</strong> Every PeerDB Cloud instance comes with HA. Under the hood, we have replica instances across Availability Zones and mechanisms for auto failover as needed.</p>
</li>
<li><p><strong>Horizontal Autoscaling:</strong> As your replication load increases, we have mechanisms to auto-scale compute resources as needed.</p>
</li>
<li><p><strong>In-Place/Transparent Upgrades:</strong> Enjoy hassle-free rolling upgrades with no downtime. This helps keep you up-to-date with all the latest features and ensures that you stay current with no extra effort.</p>
</li>
<li><p><strong>Advanced Logs and Metrics:</strong> Monitor your system effectively with detailed logs and metrics. OpenTelemetry endpoint support is coming soon.</p>
</li>
<li><p><strong>Privacy and Security:</strong> Privacy and security are our top priorities at PeerDB Cloud, surpassing even performance and functionality. We offer SSH tunneling for secure connections and ensure encryption at rest and in transit. Our platform is GDPR compliant and is currently undergoing a SOC2 audit (2 months in), with compliance expected by mid-June. Here is our <a target="_blank" href="https://trust.peerdb.io/">trust report</a>.</p>
</li>
<li><p><strong>Dedicated Slack Channel &amp; Support SLAs:</strong> Every PeerDB Cloud customer gets a dedicated Slack channel for expert guidance during implementation, migration, and post-production support. PeerDB Cloud also comes with the Support SLAs below:</p>
<ol>
<li><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1715062042583/ab1914ce-d82b-484e-9d9a-bf6ec3cc5728.png" alt class="image--center mx-auto" /></li>
</ol>
</li>
</ol>
<h3 id="heading-save-up-to-5x-costs-and-predictable-pricing">Save up to 5x costs and Predictable Pricing</h3>
<p>Being laser-focused on Postgres replication, we have implemented multiple Postgres-native and infrastructural features to optimize costs. Our <a target="_blank" href="https://blog.peerdb.io/moving-a-billion-postgres-rows-on-a-100-budget">white paper</a> provides a detailed summary of all these optimizations. With this, we are able to save up to 5x in costs for our customers compared to other tools. The graph below shows how PeerDB compares to other data-movement tools (<a target="_blank" href="https://docs.google.com/spreadsheets/d/1U03oxmfZtY8TkKnEwrWcr9o-lRUF-fpEzE5MWD391MI/edit#gid=0">reference</a>).</p>
<p>In addition to this, PeerDB Cloud provides a predictable <a target="_blank" href="https://www.peerdb.io/#prices">pricing model</a>. Instead of charging based on the number of rows or the amount of data moved, you just pay for the vCPUs provisioned. This ensures that as your workload scales, your costs don't skyrocket.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1715060884659/0034ac4e-4341-4aca-ba25-b8b31d19f241.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-current-metrics">Current Metrics</h3>
<p>These behind-the-scenes metrics for PeerDB Cloud showcase our progress and reinforce our confidence in launching it to Public Beta.</p>
<p><strong>Volume of data moved in PeerDB cloud:</strong> PeerDB Cloud already serves 10+ production customers and is replicating 20TB of data from Postgres every week, amounting to approximately 100TB per month. The graph below shows the day-over-day growth over the past week. Note that the graph below captures Avro compressed data; if uncompressed, the volume would be significantly higher, ranging anywhere from 200 to 400TB of uncompressed data moved per month.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1715003521626/67f1e3ad-bf04-451e-b411-ad9c34a9326d.png" alt class="image--center mx-auto" /></p>
<p><strong>Sign-ups for PeerDB Cloud:</strong> Signups have grown at a consistent pace over the past few months, increasing by 100% month over month.</p>
<h3 id="heading-our-customers"><strong>Our Customers</strong></h3>
<p>In just three months, we have over 10 production customers using PeerDB Cloud for production-grade Postgres replication. Our customers are spread across various verticals including Fintech, IoT, Retail, Sales Marketing Automation, and more. Below is a snapshot of a few of our publicly referenceable customers.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1715004122364/e6263fc6-9b9b-4400-8e65-b4b96d02a8dd.png" alt class="image--center mx-auto" /></p>
<p>Here is a link to our <a target="_blank" href="https://www.peerdb.io/customers">customer stories</a>. They demonstrate how our customers were able to achieve 10 times faster and up to 5 times cheaper Postgres replication experiences with PeerDB. Here are a few testimonials:</p>
<p><em>"Our decision to choose PeerDB was reaffirmed by their comprehensive online resources, which instilled confidence in their expertise. Not only did they help us cut costs effectively, but their unparalleled customer service provided immediate and insightful assistance, making us feel supported and empowered in managing our PostgreSQL database."<strong>**- Sang Mercado, Head of Engineering, Harmonic AI</strong></em></p>
<p><em>“We’re using this connector already for our Postgres to ClickHouse ETL and it’s insanely fast and accurate! Can’t believe how well this works. The PeerDB team has been super helpful in getting us set up, helping us debug, and advising us on everything related to ClickHouse and Postgres. Great work guys!!”<strong>**- Neel Mehta, CTO of Fiber AI</strong></em></p>
<h3 id="heading-growing-with-postgres"><strong>Growing with Postgres 📈</strong></h3>
<p>Postgres has solidified itself as one of the most popular developer databases ever created. While MySQL lost some appeal after being acquired by Oracle, Postgres continues to earn developer trust. In 2023, it topped the Stack Overflow Developer Survey, and was named DBMS of the Year by DB-Engines.</p>
<p>At PeerDB, we believe that Postgres is going to become the database of the world. We are dedicated to contributing to this vision by making it effortless for any Postgres user to implement use cases involving data movement and ETL for Postgres. Providing a fully managed Postgres replication experience through PeerDB Cloud is a step in that direction.</p>
<h3 id="heading-future-roadmap"><strong>Future Roadmap</strong></h3>
<p>We're committed to continuous improvement and have exciting new features in development:</p>
<ul>
<li><p>OpenTelemetry endpoint to integrate with your own monitoring tools such as DataDog, PagerDuty, OpsGenie, and more.</p>
</li>
<li><p>WebHooks and REST API integration to create and manage PEERs and MIRRORs.</p>
</li>
<li><p>Expanding from AWS to other clouds, including GCP and Azure.</p>
</li>
<li><p>Support for private link to securely connect to your VPC. This feature is currently in private preview.</p>
</li>
<li><p>PeerDB Cost Analysis - Full visibility into your Data Warehouse costs and how to optimize them.</p>
</li>
</ul>
<h2 id="heading-join-the-peerdb-cloud-community">Join the PeerDB Cloud Community!</h2>
<p>💡 Ready to see what PeerDB Cloud can do for your data?</p>
<ul>
<li><p><a target="_blank" href="https://app.peerdb.cloud/"><strong>Sign-Up</strong></a> for PeerDB Cloud's public beta today.</p>
</li>
<li><p><a target="_blank" href="https://calendly.com/sai-peerdb/30min"><strong>Book a Chat</strong></a> with PeerDB founders to discuss how PeerDB can transform your data strategy.</p>
</li>
<li><p><a target="_blank" href="http://github.com/PeerDB-io/peerdb">Star our GitHub repo</a></p>
</li>
<li><p><a target="_blank" href="https://join.slack.com/t/peerdb-public/shared_invite/zt-1wo9jydev-EXInbMtCtpAKFFWdi7QvLQ"><strong>Join our Slack Channel</strong></a> to connect with the PeerDB community.</p>
</li>
</ul>
<p>PeerDB Cloud is poised to reshape how you manage and replicate your data. Start your journey with us now!</p>
]]></content:encoded></item><item><title><![CDATA[PeerDB Streams - Simple, Native Postgres Change Data Capture]]></title><description><![CDATA[We spent the past 7 months building a solid experience to replicate data from Postgres to Data Warehouses such as Snowflake, BigQuery, ClickHouse and Postgres.
Now, we want to expand and bring a similar experience for Queues. With that spirit, we are...]]></description><link>https://blog.peerdb.io/peerdb-streams-simple-native-postgres-change-data-capture</link><guid isPermaLink="true">https://blog.peerdb.io/peerdb-streams-simple-native-postgres-change-data-capture</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[change data capture]]></category><category><![CDATA[kafka]]></category><category><![CDATA[kafka topic]]></category><category><![CDATA[ETL]]></category><category><![CDATA[ELT]]></category><category><![CDATA[postgres]]></category><category><![CDATA[Databases]]></category><category><![CDATA[replication]]></category><category><![CDATA[streaming]]></category><category><![CDATA[debezium]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Mon, 06 May 2024 16:21:50 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1714852792511/699964b9-06b4-499a-a0d6-ce71ef4138d2.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We spent the past 7 months building a solid experience to replicate data from Postgres to Data Warehouses such as Snowflake, BigQuery, ClickHouse and Postgres.</p>
<p>Now, we want to expand and bring a similar experience for Queues. With that spirit, we are excited to announce <strong>PeerDB Streams</strong>. PeerDB Streams provides a simple and native way to replicate changes as they happen in Postgres to Queues / message brokers such as Kafka, Redpanda, Google PubSub, Azure Event Hubs, and so on. Under the hood, PeerDB Streams uses Postgres <a target="_blank" href="https://www.postgresql.org/docs/current/logicaldecoding-explanation.html">logical decoding</a> to enable Postgres Change Data Capture (CDC).</p>
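<p>For context, the sketch below shows what logical decoding looks like when exercised by hand from <code>psql</code> with the built-in <code>test_decoding</code> plugin (it assumes <code>wal_level = logical</code> and a hypothetical <code>orders</code> table). It is only meant to illustrate the mechanism PeerDB builds on; PeerDB creates and manages replication slots and decoding for you.</p>
<pre><code class="lang-sql">-- Illustration only: exercising logical decoding by hand with test_decoding.
-- Requires wal_level = logical; the orders table is a hypothetical example.
SELECT * FROM pg_create_logical_replication_slot('demo_slot', 'test_decoding');

INSERT INTO public.orders (id, amount) VALUES (1, 42.0);

-- Read the changes the slot has decoded since it was created.
SELECT * FROM pg_logical_slot_get_changes('demo_slot', NULL, NULL);

-- Drop the slot when done, otherwise it keeps retaining WAL.
SELECT pg_drop_replication_slot('demo_slot');
</code></pre>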
<h1 id="heading-the-problem">The Problem</h1>
<p>We selected Queues as our next target because we've heard from multiple Postgres users that existing CDC tools are complex and have a significant learning curve. <a target="_blank" href="https://github.com/debezium/debezium/">Debezium</a> is the most common technology for this use-case. It is proven and has large production usage. However, a common pain point among our users is that Debezium has a significant learning curve and requires institutional knowledge to set up and manage in production. It takes a few months to fully deploy Debezium in production. A few common issues from users include -</p>
<ol>
<li><p>Interacting through a command line interface or configuration files, understanding the various options / settings, and learning best practices for running Debezium in production requires a significant learning curve. Debezium UI, released to <a target="_blank" href="https://debezium.io/blog/2020/10/22/towards-debezium-ui/">address usability concerns</a>, is still in an <a target="_blank" href="https://debezium.io/documentation/reference/stable/operations/debezium-ui.html">incubating state</a>. Additionally, reading Debezium docs/resources to get started can be <a target="_blank" href="https://medium.com/@cooper.wolfe/i-hated-debezium-so-much-i-did-it-myself-b43b0efc20a9">overwhelming</a> and not the most approachable.</p>
</li>
<li><p>Supporting data formats (e.g., MsgPack) and transformations is not trivial and incurs an additional <a target="_blank" href="https://stackoverflow.com/questions/71381819/make-a-custom-transform-for-kafka-cdc-and-debezium">learning curve</a>. You need to write a Java project, build JAR packages, and set up a plugin path for Kafka Connect. It isn’t as simple as plugging in a premade template or writing a few lines of code.</p>
</li>
<li><p>Debezium is not as native as Kafka for other types of message brokers and does not offer the same level of configurability. For example, with Event Hubs, it is difficult to define custom partitioning schemes and stream to topics spread across namespaces and subscriptions.</p>
</li>
</ol>
<p><strong>TL;DR</strong> We believe that Debezium aims to provide a comprehensive experience for engineers to implement CDC rather than making it dead simple for them. So you can do a lot with Debezium but need to know a lot about it.</p>
<h1 id="heading-peerdb-streams-simple-native-postgres-change-data-capture-cdc"><strong>PeerDB Streams - Simple, Native Postgres Change Data Capture (CDC)</strong></h1>
<p>This is what we want to address with PeerDB. We are building a Simple, yet Comprehensive experience for Postgres Change Data Capture (CDC). The goal is to enable engineers to implement production-grade Postgres CDC with a minimal learning curve, within a few days.</p>
<p>PeerDB’s feature-set isn't at Debezium's level yet, and as PeerDB evolves, we might face similar usability challenges. However, we're putting Simplicity/Usability at the forefront and we believe that we can achieve the above goal. Here is how we are doing it –</p>
<h2 id="heading-simple-postgres-cdc-using-peerdb-ui"><strong>Simple Postgres CDC Using PeerDB UI</strong></h2>
<p>First and foremost, PeerDB offers a simple UI to set up source and target data sources (such as Postgres and Kafka) by creating PEERs and initiating CDC by creating a MIRROR.</p>
<p>Through the UI, users can monitor the progress of CDC, including throughput (per table) and latency; search through logs; set up alerts to Slack or Email based on replication slot growth; investigate Postgres-specific metrics, including slot size, wait events for replication, and more. The UI also offers advanced features, including tuning MIRRORs, pausing MIRRORs, adding tables to MIRRORs, and more. We have strived to make these features as intuitive as possible for users, for example, by using information toolbars and simple language. Below is a demo showing the PeerDB UI in action. Here is a <a target="_blank" href="https://docs.peerdb.io/quickstart/streams-quickstart">link</a> to the quick start for you to try PeerDB Streams in just a few minutes.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/ebcfb7646a1e48738835853b760e5d04?sid=a50b2865-48df-4ba7-94d4-631c2a778464">https://www.loom.com/share/ebcfb7646a1e48738835853b760e5d04?sid=a50b2865-48df-4ba7-94d4-631c2a778464</a></div>
<p> </p>
<h2 id="heading-enhanced-cli-experience-intuitive-sql-layer-for-managing-postgres-cdc"><strong>Enhanced CLI Experience: Intuitive SQL Layer for Managing Postgres CDC</strong></h2>
<p>Second, for users who prefer a CLI over the UI, we provide a Postgres-compatible SQL layer to initiate and manage CDC. This SQL layer offers the same level of comprehensiveness as the UI and we believe that it is far more intuitive and user-friendly compared to bash scripts and configuration files.</p>
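<p>For a flavor of what this looks like, below is a hypothetical session against a SQL layer of this kind: register the source and the target as PEERs, then start CDC by creating a MIRROR. The statement shapes and option names here are illustrative assumptions, not the exact PeerDB syntax; please refer to the PeerDB docs for the real commands.</p>
<pre><code class="lang-sql">-- Hypothetical sketch only: option names and exact syntax are assumptions;
-- see the PeerDB docs for the actual PEER / MIRROR commands.
CREATE PEER pg_source FROM POSTGRES WITH (
  host = 'db.example.com', port = '5432',
  user = 'replicator', password = '********', database = 'app'
);

CREATE PEER kafka_target FROM KAFKA WITH (
  servers = 'broker-1:9092'
);

CREATE MIRROR orders_cdc FROM pg_source TO kafka_target
WITH TABLE MAPPING (public.orders:orders_topic);
</code></pre>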
<h2 id="heading-simple-lua-scripts-for-row-level-transformations"><strong>Simple Lua Scripts for Row-Level Transformations</strong></h2>
<p>Third, users can perform row-level transformations before streaming CDC changes to Kafka. They can write Lua scripts to execute these transformations. This enables powerful features such as encrypting/masking personally identifiable information (PII), supporting various data formats (JSON, MsgPack, Flatbuffers, Protobuf, etc.), and more. To make it very simple for users, we offer a script editor along with a bunch of useful <a target="_blank" href="https://github.com/PeerDB-io/examples">templates</a>. Additionally, applying a transformation is optional, with the default data format being JSON.</p>
<h2 id="heading-native-connectors-to-non-kafka-targets">Native Connectors to non-Kafka targets</h2>
<p>Fourth, we offer <strong>native</strong> connectors to non-Kafka targets, including Google Pub/Sub and Azure Event Hubs. Behind the scenes, we utilize the native Go APIs/libraries provided by these services to build our connectors, instead of relying on the less <a target="_blank" href="https://github.com/Azure/azure-event-hubs-for-kafka?tab=readme-ov-file#other-issues">developed</a> Kafka-compatible layer over these queues. We support advanced features specific to these services. For example, with <a target="_blank" href="https://blog.peerdb.io/enterprise-grade-replication-from-postgres-to-azure-event-hubs">Azure Event Hubs</a>, users can perform CDC to topics distributed across different namespaces and subscriptions.</p>
<h3 id="heading-peerdb-streams-is-postgres-native">PeerDB Streams is Postgres Native</h3>
<p>Finally, we are laser-focused on Postgres and, as of now, don't support any other databases. This allows us to implement many Postgres-native optimizations. For example, we provide Postgres-native metrics and alerts, including replication slot growth, wait events for logical decoding, number of connections and so on. Features such as parallel snapshotting for 10x faster initial loads and decoding in-flight transactions are in private beta.</p>
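<p>To give a sense of the Postgres-native signals involved, the generic queries below show how replication slot growth and walsender wait events can be inspected directly in Postgres; PeerDB surfaces similar information through its metrics and alerts.</p>
<pre><code class="lang-sql">-- How much WAL each replication slot is retaining (a growing value means the
-- consumer is falling behind), and what the walsender processes are waiting on.
SELECT slot_name, active,
       pg_size_pretty(pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn)) AS retained_wal
FROM pg_replication_slots;

SELECT pid, state, wait_event_type, wait_event
FROM pg_stat_activity
WHERE backend_type = 'walsender';
</code></pre>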
<h2 id="heading-try-peerdb-streams">Try PeerDB Streams</h2>
<p>Check out this 10-minute <a target="_blank" href="https://docs.peerdb.io/quickstart/streams-quickstart">quickstart</a> to try PeerDB for Postgres CDC to Kafka.</p>
<p>Separately, you can try PeerDB through one of three offerings: <a target="_blank" href="https://github.com/PeerDB-io/peerdb">Open Source offering</a>, <a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud, our fully managed service</a>, and a self-hosted <a target="_blank" href="https://www.peerdb.io/sign-up">enterprise offering</a> that includes production-grade Helm charts.</p>
<p>Our vision is to provide the world’s best data-movement experience for Postgres. PeerDB Streams is another step in that direction. We built PeerDB Streams in close design partnership with a few Fintech and IoT customers implementing Postgres CDC for their transactional outbox use cases. The product has been battle-tested at scale and is constantly evolving. We would love to get your feedback on product experience, our thesis and anything else that comes to your mind. It would be super useful for us. Thank you!</p>
]]></content:encoded></item><item><title><![CDATA[PeerDB Launch Week]]></title><description><![CDATA[It will be almost 1 year since PeerDB started the YC Summer 23 program. To celebrate this, we challenged ourselves: how could we make a truly impactful announcement? 🤔
The answer wasn't one feature, but an entire week bursting with launches! 🚀
Even...]]></description><link>https://blog.peerdb.io/peerdb-launch-week-1</link><guid isPermaLink="true">https://blog.peerdb.io/peerdb-launch-week-1</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[PeerDB]]></category><category><![CDATA[launch week]]></category><category><![CDATA[software development]]></category><dc:creator><![CDATA[Kaushik Iska]]></dc:creator><pubDate>Fri, 03 May 2024 17:31:00 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1714756659295/c88c9289-ceb6-4352-b07c-d1ea65b6e4f8.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>It will be almost 1 year since PeerDB started the YC Summer 23 program. To celebrate this, we challenged ourselves: how could we make a truly impactful announcement? 🤔</p>
<p>The answer wasn't one feature, but an entire week bursting with launches! 🚀</p>
<p>Even with our small team, we knew this was ambitious. But after months of relentless work, we're thrilled to announce...</p>
<p>PeerDB Launch Week begins on <strong>Monday, May 6th</strong>! ✨</p>
<h3 id="heading-what-to-expect">What to expect?</h3>
<p>We don't want to spoil the surprise, so for now we are giving you a teaser of what to expect. Follow us on Twitter / X to keep up to date: <a target="_blank" href="https://twitter.com/PeerDBInc">@PeerDBInc</a>.</p>
]]></content:encoded></item><item><title><![CDATA[Simple Postgres to ClickHouse replication featuring MinIO]]></title><description><![CDATA[At PeerDB, we provide a fast and cost-effective way to replicate data from Postgres to Data Warehouses such as Snowflake, BigQuery, ClickHouse, and queues like Kafka, Red Panda and Google PubSub, among others.
A few months ago, we added a ClickHouse ...]]></description><link>https://blog.peerdb.io/simple-postgres-to-clickhouse-replication-featuring-minio</link><guid isPermaLink="true">https://blog.peerdb.io/simple-postgres-to-clickhouse-replication-featuring-minio</guid><category><![CDATA[ClickHouse]]></category><category><![CDATA[PostgreSQL]]></category><category><![CDATA[#minio]]></category><category><![CDATA[S3]]></category><category><![CDATA[S3-bucket]]></category><category><![CDATA[miniobucket]]></category><category><![CDATA[ETL]]></category><category><![CDATA[replication]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[change data capture]]></category><category><![CDATA[postgres]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Thu, 02 May 2024 17:55:58 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1714627263834/5b1077b4-24e0-44b1-bfd8-5f9faf731f8c.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>At <a target="_blank" href="https://www.peerdb.io/">PeerDB</a>, we provide a fast and cost-effective way to replicate data from <a target="_blank" href="https://www.postgresql.org/docs/">Postgres</a> to Data Warehouses such as <a target="_blank" href="https://www.snowflake.com/en/">Snowflake</a>, <a target="_blank" href="https://cloud.google.com/bigquery?utm_source=google&amp;utm_medium=cpc&amp;utm_campaign=na-US-all-en-dr-bkws-all-all-trial-e-dr-1707554&amp;utm_content=text-ad-none-any-DEV_c-CRE_665665924750-ADGP_Hybrid+%7C+BKWS+-+MIX+%7C+Txt-Data+Analytics-BigQuery-KWID_43700077225652815-kwd-47616965283&amp;utm_term=KW_bigquery-ST_bigquery&amp;gad_source=1&amp;gclid=CjwKCAjw88yxBhBWEiwA7cm6pVY6no_rLOIpdwp02v4Oa3S5UbPpKAHIFybbrJH-X3nX5PHS23brUBoCU0oQAvD_BwE&amp;gclsrc=aw.ds&amp;hl=en">BigQuery</a>, <a target="_blank" href="https://clickhouse.com/">ClickHouse</a>, and queues like Kafka, Red Panda and Google PubSub, among others.</p>
<p>A few months ago, we added a <a target="_blank" href="https://blog.peerdb.io/postgres-to-clickhouse-real-time-replication-using-peerdb">ClickHouse connector</a> for Postgres Change Data Capture (CDC). Surprisingly, this connector gained substantial traction and adoption within our community. This applies to both our fully managed service (PeerDB Cloud) and our Open Source offerings. Here is a <a target="_blank" href="https://www.peerdb.io/customers/peerdb-fiber-ai-customer-story">customer story</a> from one of our customers who uses the ClickHouse connector.</p>
<h2 id="heading-the-problem">The Problem</h2>
<p>However, there was one common piece of feedback from many of our Open Source users. The ClickHouse connector required an S3 bucket as a prerequisite, which added additional overhead for users. Non-AWS users and those without immediate access to S3 could not use the ClickHouse connector. This wasn't a problem in our fully managed offering (<a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a>), as we abstracted away the S3 bucket creation from our customers.</p>
<p>This blog describes how we solved this problem and made it extremely easy for our users to replicate data from Postgres to ClickHouse. We used <a target="_blank" href="https://min.io/">MinIO</a>, the open source S3 alternative, to stage the intermediary Avro files as part of the Change Data Capture (CDC) from Postgres to ClickHouse.</p>
<h2 id="heading-why-does-the-clickhouse-connector-need-s3">Why does the ClickHouse connector need S3?</h2>
<p>Under the hood, PeerDB uses the <a target="_blank" href="https://blog.peerdb.io/moving-a-billion-postgres-rows-on-a-100-budget#heading-data-in-transit">Avro format</a> for data in transit while replicating data from Postgres to Data Warehouses. Loading Avro files through Go wasn't trivial as the clickhouse-go driver didn't support Avro ingestion. Additionally, ClickHouse has native integration for loading data from S3 and is very efficient at it, as it attempts to parallelize as much work as possible, processing files in a streaming fashion. Therefore, we chose to use S3 as an intermediary storage for Avro files before importing them into ClickHouse.</p>
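<p>As a flavor of the mechanism, ClickHouse can ingest staged Avro files straight from S3-compatible storage with its <code>s3</code> table function. The statement below is illustrative only (bucket URL, credentials, and table name are placeholders), not the exact statement PeerDB issues:</p>
<pre><code class="lang-sql">-- Illustrative only: bucket URL, credentials, and table name are placeholders.
-- ClickHouse reads the matching Avro files and streams them into the table.
INSERT INTO analytics.events
SELECT *
FROM s3(
  'https://my-bucket.s3.amazonaws.com/peerdb-staging/*.avro',
  'ACCESS_KEY_ID', 'SECRET_ACCESS_KEY',
  'Avro'
);
</code></pre>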
<p>This method has proven effective, allowing <a target="_blank" href="https://www.peerdb.io/customers/peerdb-fiber-ai-customer-story">users</a> to efficiently replicate data from Postgres to ClickHouse with latencies under 30 seconds and high throughput rates.</p>
<h2 id="heading-minio-helps-make-the-peerdbs-clickhouse-connector-seamless">MinIO helps make the PeerDB's ClickHouse Connector Seamless</h2>
<p>By integrating MinIO container services into our <a target="_blank" href="https://github.com/PeerDB-io/peerdb/blob/main/docker-compose.yml#L189">Docker Compose files</a> for our Open Source offering, we've enabled an in-house S3-compatible storage solution that launches seamlessly with PeerDB. PeerDB uses <a target="_blank" href="https://github.com/PeerDB-io/peerdb/blob/main/docker-compose.yml#L4C3-L4C54">environment variables</a> to manage S3 bucket credentials, allowing for easy integration. Users can set these variables to match the MinIO bucket parameters, or they can plug in their own S3 bucket details. These parameters <a target="_blank" href="https://github.com/PeerDB-io/peerdb/blob/main/docker-compose.yml#L4C3-L4C54">default</a> to the packaged MinIO bucket parameters. As a result, users no longer need to provide a separate bucket for PeerDB’s ClickHouse integration, simplifying the setup process significantly.</p>
<p>A huge shoutout to MinIO for building a solid product that serves as an open source alternative to S3. Integrating MinIO's Docker container within PeerDB's Docker file was a one-week project. MinIO's APIs, being fully compatible with S3, allowed for seamless integration with PeerDB and ClickHouse.</p>
<h2 id="heading-result-even-simpler-postgres-to-clickhouse-replication-with-peerdb">Result: Even simpler Postgres to ClickHouse replication with PeerDB.</h2>
<h3 id="heading-simplifying-clickhouse-peer-creation-with-optional-s3-configuration">Simplifying ClickHouse Peer Creation with Optional S3 Configuration</h3>
<p>Integrating the MinIO Docker Container in our Open Source offering eliminates the need for users to specify S3 buckets to use our ClickHouse connector. While creating the <a target="_blank" href="https://docs.peerdb.io/connect/clickhouse">ClickHouse Peer</a>, adding S3 information is optional, as shown in the screenshot below.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1714519914016/13d54fe7-12bc-4e9c-a1b7-f2274343f781.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-set-up-a-postgres-to-clickhouse-mirror-in-under-a-minute">Set Up a Postgres to ClickHouse Mirror in Under a Minute</h3>
<p>Once the <a target="_blank" href="https://docs.peerdb.io/connect/postgres/rds_postgres">Postgres</a> and <a target="_blank" href="https://docs.peerdb.io/connect/clickhouse">ClickHouse Peers</a> are created, users can create MIRRORs to replicate data from Postgres to ClickHouse within a minute. See below video:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/fa1afec884724876a63aab522b40e445?sid=7d5383ed-0c51-4018-8920-3d8e95ad4c56">https://www.loom.com/share/fa1afec884724876a63aab522b40e445?sid=7d5383ed-0c51-4018-8920-3d8e95ad4c56</a></div>
<p> </p>
<h3 id="heading-use-the-minio-console-for-complete-visibility-into-internal-staging">Use the MinIO Console for complete visibility into internal staging</h3>
<p>MinIO also comes with a sleek UI that helps you manage the internal Avro files PeerDB creates as part of the replication process.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/b41d3ad81259407f9a99b7a74c8f1449?sid=2766cebd-20d9-493d-a95a-b8852c9c30b9">https://www.loom.com/share/b41d3ad81259407f9a99b7a74c8f1449?sid=2766cebd-20d9-493d-a95a-b8852c9c30b9</a></div>
<p> </p>
<p>We hope you enjoyed reading the blog. If you're a ClickHouse user and wish to replicate data from Postgres to ClickHouse using PeerDB, please check out the links below or reach out to us directly!</p>
<ol>
<li><p><a target="_blank" href="https://docs.peerdb.io/mirror/cdc-pg-clickhouse"><strong>Docs on Postgres to ClickHouse Replication.</strong></a></p>
</li>
<li><p><a target="_blank" href="https://app.peerdb.cloud/"><strong>Try PeerDB Cloud for free.</strong></a></p>
</li>
<li><p><a target="_blank" href="https://app.peerdb.cloud/"><strong>Visit PeerDB's Gi</strong></a><a target="_blank" href="https://github.com/PeerDB-io/peerdb"><strong>tHub r</strong></a>epo<a target="_blank" href="https://app.peerdb.cloud/"><strong>sitory to Get Started</strong></a><a target="_blank" href="https://github.com/PeerDB-io/peerdb"><strong>.</strong></a></p>
</li>
</ol>
]]></content:encoded></item><item><title><![CDATA[How can we make pg_dump and pg_restore 5 times faster?]]></title><description><![CDATA[pg_dump and pg_restore are reliable tools for backing up and restoring Postgres databases. They're essential for database migrations, disaster recovery and so on. They offer precise control over object selection for backup/restore, dump format option...]]></description><link>https://blog.peerdb.io/how-can-we-make-pgdump-and-pgrestore-5-times-faster</link><guid isPermaLink="true">https://blog.peerdb.io/how-can-we-make-pgdump-and-pgrestore-5-times-faster</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[postgres]]></category><category><![CDATA[Databases]]></category><category><![CDATA[migration]]></category><category><![CDATA[ETL]]></category><category><![CDATA[replication]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Thu, 25 Apr 2024 16:16:13 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1714061341343/dc949bbd-96be-4479-88ae-95e33ac2fc48.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><a target="_blank" href="https://www.postgresql.org/docs/current/app-pgdump.html">pg_dump</a> and <a target="_blank" href="https://www.postgresql.org/docs/current/app-pgrestore.html">pg_restore</a> are reliable tools for backing up and restoring <a target="_blank" href="https://www.postgresql.org/">Postgres</a> databases. They're essential for database migrations, disaster recovery and so on. They offer precise control over object selection for backup/restore, dump format options (plain or compressed), parallel table processing and so on. They ensure a consistent database snapshot is dumped and restored.</p>
<p>However, they are single-threaded at the table level. This significantly slows down the dump and restore of databases with a star schema common in real-world applications such as Time series and IoT. For databases over 1 TB, <code>pg_dump</code> and <code>pg_restore</code> can take days, increasing downtime during migrations and RTOs in disaster recovery scenarios.</p>
<p>In this blog, we'll discuss an idea called <strong>"Parallel Snapshotting"</strong>. This idea could be integrated into Postgres upstream in the future to make <code>pg_dump</code> and <code>pg_restore</code> parallelizable at a single table level. Parallel Snapshotting has already been implemented in <a target="_blank" href="https://github.com/PeerDB-io/peerdb">PeerDB</a>, an open-source Postgres replication tool. We will also cover a few interesting benchmarks of migrating a large table of 1.5TB from one Postgres Database to another with and without Parallel Snapshotting.</p>
<h2 id="heading-a-quick-primer-on-pgdump-and-pgrestore">A quick primer on pg_dump and pg_restore</h2>
<p><code>pg_dump</code> is the most reliable way to back up a PostgreSQL database. It enables the backup of a database at a consistent snapshot; that is, the backup guarantees a state that existed previously. The backup generated by <code>pg_dump</code> is a logical representation of the data in PostgreSQL, not a copy of the PostgreSQL data directory. It captures objects as they appear in PostgreSQL.</p>
<p><code>pg_restore</code> is the most reliable way to restore a backup generated by pg_dump from one PostgreSQL database to another.</p>
<p>Both <code>pg_dump</code> and <code>pg_restore</code> are Postgres-native; that is, they come packaged with community Postgres and can be used as command-line utilities, similar to <a target="_blank" href="https://www.postgresql.org/docs/current/app-psql.html">psql</a>.</p>
<h3 id="heading-pgdump-and-pgrestore-offer-fine-grain-control">pg_dump and pg_restore offer fine grain control</h3>
<p>They provide fine-grained control to manage the backup and restore processes. Below are a few flags that are commonly used:</p>
<ol>
<li><p>You have the <code>-F</code> (format) flag, which lets you choose the dump format, such as plain text or the compressed custom format. Compressed dumps are quite helpful when you have limited network bandwidth or want to save on network costs.</p>
</li>
<li><p>To speed up the backup and restore process, you can use the <code>-j</code> flag to dump and restore multiple tables in parallel.</p>
</li>
<li><p>You can pick and choose specific database objects you want to backup and restore, including tables and schemas.</p>
</li>
<li><p>You can also choose to dump only the schema or only the data using the <code>--schema-only</code> and <code>--data-only</code> flags.</p>
</li>
<li><p>There are many more flags that they provide that can be found in community <a target="_blank" href="https://www.postgresql.org/docs/current/app-pgdump.html">docs</a>.</p>
</li>
</ol>
<h3 id="heading-pgdump-and-pgrestore-can-be-very-slow-for-large-tables">pg_dump and pg_restore can be very slow for large tables</h3>
<p><strong>pg_dump and pg_restore are single threaded at a table level</strong></p>
<p>There is a painful issue that users often run into with <code>pg_dump</code> and <code>pg_restore</code>: they can be very slow for large tables. This is because they are single-threaded at the table level. They can dump and restore multiple tables in parallel, but for a single table they are single-threaded.</p>
<p>This means that in use cases where you have a single fact table and multiple dimension tables, <code>pg_dump</code> and <code>pg_restore</code> can get bottlenecked on the large fact table. This is very common in the star schema data model, which is used by many real-world use cases such as IoT, time series, and data warehousing.</p>
<h3 id="heading-migrating-a-15tb-table-can-take-15-days">Migrating a 1.5TB table can take 1.5 days</h3>
<p>The impact of the problem described above can be significant. Using <code>pg_dump</code> and <code>pg_restore</code> to migrate a 1.5 TB <a target="_blank" href="https://www.postgresql.org/docs/current/pgbench.html">pgbench_accounts</a> table from one Postgres database to another took 1.5 days. The benchmark was conducted under optimal conditions, i.e., using the correct flags and co-locating the source, the target, and the VM on which <code>pg_dump</code> and <code>pg_restore</code> were running in the same region, among other factors. This 1.5-day downtime is substantial when migrating or recovering mission-critical databases.</p>
<h2 id="heading-parallel-snapshotting-to-make-pgdump-amp-pgrestore-multi-threaded-per-table">Parallel Snapshotting to make pg_dump &amp; pg_restore multi-threaded per table</h2>
<p>Now, let's explore a concept called Parallel Snapshotting, which could make <code>pg_dump</code> and <code>pg_restore</code> multi-threaded at the single table level. Note that Parallel Snapshotting is not currently implemented in the upstream versions of <code>pg_dump</code> and <code>pg_restore</code>. It represents an idea/design that could enhance <code>pg_dump</code> and <code>pg_restore</code> in the future.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://youtu.be/5VbJcSsK0OM">https://youtu.be/5VbJcSsK0OM</a></div>
<p> </p>
<p>The video above captures migrating 5 GB of data from Postgres to Postgres within a minute using the Parallel Snapshotting feature in PeerDB.</p>
<h3 id="heading-ctid-forms-the-basis-of-parallel-snapshotting">CTID forms the basis of Parallel Snapshotting</h3>
<p><a target="_blank" href="https://www.postgresql.org/docs/current/ddl-system-columns.html#:~:text=ctid">CTID</a> forms the basis of Parallel Snapshotting. Every row in a Postgres table has an internal column called CTID, also known as the tuple identifier. CTID is unique for each row of the table. It represents the exact location of the row on disk—it is the combination of the page/block number and the page offset. You can also query the CTID column for a table through a simple SELECT as you are seeing in the below image.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1714054057800/03e06d8e-2368-453b-b8c0-58d7fbe16ac4.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-parallel-snapshotting-logically-partition-the-table-by-ctid-and-copy-multiple-partitions-simultaneously">Parallel Snapshotting - Logically Partition the Table by CTID and COPY Multiple Partitions Simultaneously</h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1714054167383/34f72a09-a6bb-48c2-813e-0b2eb48a467d.png" alt class="image--center mx-auto" /></p>
<p>Let's dive into the design of Parallel Snapshotting (a minimal SQL sketch of these steps follows the list):</p>
<ol>
<li><p>First, create a Postgres Snapshot using the function <code>pg_export_snapshot()</code>. This ensures that the dump and restore operate on a consistent snapshot of the database.</p>
</li>
<li><p>Second, using that snapshot, logically partition the large table based on CTIDs, i.e., create CTID ranges that encapsulate the table.</p>
</li>
<li><p>Once that is done, copy multiple such logical partitions in parallel from the source to the target.</p>
<ol>
<li><p>Essentially, you are running SELECT statements with these CTID ranges to read data from the source and write it to the target.</p>
</li>
<li><p>The SELECT statements with CTID ranges are very efficient because they use tid range scans, which are similar to index lookups on the CTID column.</p>
</li>
<li><p>Also, note that you are reading data in the order of how it is stored on the disk.</p>
</li>
</ol>
</li>
<li><p>We are using <code>COPY WITH BINARY</code> to <code>STDOUT</code> and from <code>STDIN</code>, which makes the dump and restore simultaneous.</p>
</li>
<li><p>We are also using cursors to ensure that the dump doesn’t exhaust memory.</p>
</li>
</ol>
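<p>To make the steps above concrete, here is a minimal SQL sketch of the mechanism, assuming the pgbench_accounts table from the benchmark and Postgres 14+ (which supports TID range scans). It is a simplification of the approach rather than exactly what PeerDB runs internally; PeerDB adds worker parallelism, cursors, and binary COPY piping between the source and the target.</p>
<pre><code class="lang-sql">-- Session A (kept open for the duration of the copy): export a consistent snapshot.
BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ;
SELECT pg_export_snapshot();   -- returns an id such as '00000004-0000006A-1'

-- Each worker session imports that snapshot and copies one CTID range.
-- The CTID bounds below are placeholders for a computed logical partition.
BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ;
SET TRANSACTION SNAPSHOT '00000004-0000006A-1';
COPY (
    SELECT * FROM pgbench_accounts
    WHERE ctid BETWEEN '(0,0)'::tid AND '(100000,0)'::tid   -- satisfied via a TID range scan
) TO STDOUT WITH (FORMAT binary);

-- The matching worker on the target ingests the same stream:
-- COPY pgbench_accounts FROM STDIN WITH (FORMAT binary);
</code></pre>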
<h3 id="heading-migrating-a-15tb-table-5-times-faster-with-parallel-snapshotting">Migrating a 1.5TB table 5 times faster with Parallel Snapshotting</h3>
<p>At PeerDB, we are building a Postgres replication tool to provide a fast and cost-effective way to move data from Postgres to data warehouses such as <a target="_blank" href="https://www.snowflake.com/en/">Snowflake</a>, BigQuery, <a target="_blank" href="https://clickhouse.com/">ClickHouse</a>, <a target="_blank" href="https://www.postgresql.org/">PostgreSQL</a>, and queues like <a target="_blank" href="https://kafka.apache.org/">Kafka</a>, <a target="_blank" href="https://redpanda.com/">Redpanda</a>, <a target="_blank" href="https://cloud.google.com/pubsub?hl=en">Google PubSub</a>, <a target="_blank" href="https://azure.microsoft.com/en-us/products/event-hubs">Azure Event Hubs</a>, etc.</p>
<p>To enable faster migrations from one Postgres database to another, we have implemented Parallel Snapshotting within our product. Through this feature, our customers are able to move terabytes of data in a few hours versus days.</p>
<p>We ran the same benchmark as above to move a 1.5 TB <a target="_blank" href="https://www.postgresql.org/docs/current/pgbench.html">pgbench_accounts</a> table from one Postgres database to another, and it took just 7 hours with PeerDB. This was 5x faster than using <code>pg_dump</code> and <code>pg_restore</code>. This speedup was possible through the Parallel Snapshotting feature. The performance can be further improved by increasing the number of parallel threads for the migration and by using beefier Postgres source and target databases.</p>
<h2 id="heading-conclusion-and-references">Conclusion and References</h2>
<p>The intent of this blog is to share the design principles we followed to enable faster database migrations and discuss how they can be extended to enhance <code>pg_dump</code> and <code>pg_restore</code> in the future. Hope you enjoyed reading the blog. Sharing a few relevant links for reference:</p>
<ol>
<li><p><a target="_blank" href="https://tech.gadventures.com/speeding-up-postgres-restores-de575149d17a">Speeding up Postgres restores</a></p>
</li>
<li><p><a target="_blank" href="https://techcommunity.microsoft.com/t5/azure-database-for-postgresql/faster-data-migrations-in-postgres/ba-p/2150850">Faster Data Migrations in Postgres</a></p>
</li>
<li><p><a target="_blank" href="https://blog.peerdb.io/faster-postgres-migrations-using-peerdb-part-1">Faster Postgres Migrations using PeerDB</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/watch?v=pgJwT9vcwI8">Podcast on Logical replication common issues</a></p>
</li>
<li><p>Try <a target="_blank" href="https://github.com/PeerDB-io/peerdb">PeerDB Open Source</a> for fast Postgres migration and replication</p>
</li>
<li><p>Try <a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a>, the fully managed offering of PeerDB</p>
</li>
</ol>
]]></content:encoded></item><item><title><![CDATA[PeerDB raises $3.6 million seed funding to revolutionize data movement for PostgreSQL]]></title><description><![CDATA[PeerDB offers a fast and cost-effective way to move data from PostgreSQL to data warehouses, such as Snowflake, and to queues like Kafka. This enables businesses to have real-time and reliable access to data, which is of utmost importance in this AI ...]]></description><link>https://blog.peerdb.io/peerdb-raises-3-6-million-seed-funding-to-revolutionize-data-movement-for-postgresql</link><guid isPermaLink="true">https://blog.peerdb.io/peerdb-raises-3-6-million-seed-funding-to-revolutionize-data-movement-for-postgresql</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Databases]]></category><category><![CDATA[ETL]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[devtools]]></category><category><![CDATA[enterprise]]></category><category><![CDATA[funding]]></category><category><![CDATA[Investment]]></category><category><![CDATA[Venture Capital]]></category><category><![CDATA[replication]]></category><category><![CDATA[data]]></category><category><![CDATA[postgres]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Thu, 11 Apr 2024 05:55:35 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1712769385469/0b629288-e7b5-47b9-af42-4894cfe327b4.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>PeerDB offers a fast and cost-effective way to move data from PostgreSQL to data warehouses, such as Snowflake, and to queues like Kafka. This enables businesses to have real-time and reliable access to data, which is of utmost importance in this AI era. PeerDB’s overarching vision is to become the de facto standard for data movement and ETL (extract, transform and load) for companies that run their businesses on Postgres.</p>
<p><strong>SAN FRANCISCO — April 11, 2024 —</strong><a target="_blank" href="https://www.peerdb.io/">PeerDB</a>, the leading data movement platform for PostgreSQL, today announced it received $3.6 million in seed round funding. Investors in the round include lead investor <a target="_blank" href="https://www.8vc.com/">8VC</a>, <a target="_blank" href="https://www.ycombinator.com/companies/peerdb">Y Combinator</a>, <a target="_blank" href="https://www.wayfinder.com/">Wayfinder Ventures</a>, <a target="_blank" href="https://winfunding.com/">Webb Investment Network</a>, <a target="_blank" href="https://www.flexcapital.com/">Flex Capital</a>, <a target="_blank" href="https://rogue.capital/">Rogue Capital</a>, <a target="_blank" href="https://www.pioneerfund.vc/">Pioneer Fund</a>, <a target="_blank" href="https://www.orangecollective.vc/">Orange Collective</a> and several angel investors.</p>
<p>PeerDB will use the funds to continue building its engineering team, propelling its go-to-market and client acquisition initiatives and supporting its growth. PeerDB revenue is doubling every two months.</p>
<p><em>"Postgres is becoming the database of the world and the de facto primary database for both enterprises and SMBs. Existing data movement and ETL tools are not built for Postgres – they often fail at scale due to painfully slow syncs, lack of reliability and lack of native features. The time has come for someone to give enough care to the world's most adopted open source database. Thanks to all our investors and customers for believing in us and sharing our vision,"</em> <strong>said PeerDB CEO and co-founder Sai Krishna Srirampur.</strong></p>
<p><em>"At PeerDB, we're tackling inefficiencies surrounding Postgres data movement. With fundamental optimizations like parallel snapshotting and handling the nuances around replication slots that are absent in the existing ETL tools, we’re focused on building a system specialized for Postgres at terabyte scale – our approach diverges from traditional methods that falter at scale and resort to resyncs. Our team of Postgres experts not only provides data movement services but also becomes an essential part of client teams, offering advice on database tuning and query optimization. As we brace for the influx of data driven by the era of LLMs, we're committed to ensuring Postgres data movement remains efficient and scalable,"</em> <strong>said PeerDB CTO and co-founder Kaushik Iska.</strong></p>
<h2 id="heading-the-problem">The Problem</h2>
<p>In the current data and AI landscape, existing data movement and ETL (extract, transform, and load) tools prioritize the breadth over the quality of connectors and are not optimized for Postgres. Users often face issues such as painfully slow syncs — syncing hundreds of gigabytes of data can take days; unreliability, characterized by frequent crashes and loss of data precision; and feature limitations, including a lack of configurability and support for native data types.  </p>
<p>PeerDB distinguishes itself by prioritizing the quality over the breadth of connectors and tailoring its design specifically for Postgres. Through this, PeerDB offers 10 times faster data movement at one-fifth the cost.</p>
<h2 id="heading-why-now">Why Now?</h2>
<p><em>"8VC's investment in PeerDB is driven by our conviction in the exponential growth trajectory of Postgres. We see an exceptional team in Sai and Kaushik, whose deep expertise in Postgres positions them uniquely in the marketplace. We believe that a solid narrative around data movement and ETL will play a pivotal role in the success and widespread adoption of Postgres,"</em> said <a target="_blank" href="https://www.8vc.com/team/bhaskar-ghosh"><strong>Bhaskar Ghosh</strong></a><strong>, partner at 8VC</strong>.</p>
<p>The time is ripe to develop a first-class data movement tool for Postgres, as it is becoming the world's most popular database. Currently ranked fourth in the <a target="_blank" href="https://db-engines.com/en/ranking">DB-Engines Ranking</a> of database management systems (DBMS) based on popularity, Postgres is the only DBMS in the top four experiencing growth. It also received DB-Engines' <a target="_blank" href="https://db-engines.com/en/blog_post/106">DBMS of the Year 2023</a> award for gaining more popularity than any other of the 417 monitored systems in 2023. Enterprise adoption of Postgres is on the rise, <a target="_blank" href="https://www.techtarget.com/searchdatamanagement/news/252472281/PostgreSQL-12-boosts-open-source-database-performance">with 50% of enterprise companies</a> already using it. This trend is set to significantly increase the volume of data stored and moved through Postgres, requiring an ETL tool that can handle this scale.</p>
<p>The introduction and funding of PeerDB is also timely due to the widespread focus on artificial intelligence (AI) and hyperscale data analytics, which often require movement of massive datasets from primary database platforms such as Postgres to data warehouses for AI-based analytics to help provide insights and inform business decisions.</p>
<h2 id="heading-peerdbs-vision-and-use-cases">PeerDB's Vision and Use cases</h2>
<p>PeerDB's vision is to become the de facto standard for data movement and ETL for companies that run their businesses on Postgres, encompassing use cases such as:</p>
<ul>
<li><p><strong>Fast and cost-effective replication to data warehouses:</strong> Replicate data from Postgres to analytical stores (<a target="_blank" href="https://en.wikipedia.org/wiki/Online_analytical_processing">OLAP</a>) such as Snowflake, BigQuery and ClickHouse for AI-based analytics informing business decisions in use cases like fraud or anomaly detection. PeerDB has already made its mark here with a rapidly growing <a target="_blank" href="https://www.peerdb.io/customers">customer base</a> that strongly challenges incumbents like Fivetran.</p>
</li>
<li><p><strong>Real-time streaming and change data capture:</strong> Low-latency replication from Postgres to queues, such as Kafka, enabling use cases like real-time alerting and micro services-based architectures. PeerDB already supports Kafka, Azure Event Hubs, and Google PubSub as targets, serving as an enterprise-grade alternative to Debezium.</p>
</li>
<li><p><strong>Database migrations:</strong> Migrating data from legacy databases like Oracle and SQL to Postgres to support modernization and digital transformation initiatives.</p>
</li>
<li><p><strong>Enterprise-grade Postgres high availability (HA) and backups</strong>: As enterprises modernize their database stack by migrating from Oracle and SQL Server to Postgres, managing HA and backups across regions and hybrid on-premises environments becomes critical. The infrastructure that PeerDB is developing can be extended to support such mission-critical use cases in the future.</p>
</li>
<li><p><strong>Vector ETL:</strong> Extracting unstructured data at scale, transforming it into vector embeddings with LLMs, and loading these into Postgres using <a target="_blank" href="https://github.com/pgvector/pgvector">pgvector</a>. This enables semantic searches for advanced AI applications.</p>
</li>
</ul>
<h2 id="heading-peerdb-is-built-for-postgres">PeerDB is built for Postgres</h2>
<p>Below are a few product differentiators of PeerDB:</p>
<ul>
<li><p><strong>Faster Postgres data movement:</strong> PeerDB implements <a target="_blank" href="https://blog.peerdb.io/parallelized-initial-load-for-cdc-based-streaming-from-postgres#heading-parallelized-initial-snapshot-for-cdc-based-streaming">parallel snapshotting</a>, the fastest way for moving Postgres data. This can dramatically reduce the time required to move massive datasets, often from days to hours, while ensuring consistency.</p>
</li>
<li><p><strong>Native Postgres data type support and replication:</strong> PeerDB specializes in natively replicating advanced data types like JSONB and geospatial, crucial for IoT apps and geospatial applications. With data available in native formats, users save the time and effort required for data transformation, since the data is already in the format necessary for their AI-based analytics and other applications.</p>
</li>
<li><p><strong>Cost optimizations:</strong> PeerDB can reduce data movement costs by up to five times compared to incumbent ETL tools. PeerDB published a <a target="_blank" href="https://blog.peerdb.io/moving-a-billion-postgres-rows-on-a-100-budget">white paper</a> detailing the data modeling and infrastructure optimizations employed to save costs for customers.</p>
</li>
</ul>
<h2 id="heading-customers">Customers</h2>
<p><a target="_blank" href="https://www.peerdb.io/customers">PeerDB customers</a> include:</p>
<ul>
<li><p><a target="_blank" href="https://www.peerdb.io/customers/harmonic-customer-story">Harmonic AI</a> replaced an incumbent ETL solution with PeerDB, saving $80,000 yearly and reducing their yearly data-movement costs by five times.</p>
</li>
<li><p><a target="_blank" href="https://www.peerdb.io/customers/expedock-customer-story">Expedock</a> uses PeerDB to deliver real-time AI-driven supply chain automation, replicating 700 million rows monthly from Postgres to Snowflake with under one minute latency and five times cost savings compared to their previous ETL tool.</p>
</li>
<li><p><a target="_blank" href="https://www.peerdb.io/customers/peerdb-fiber-ai-customer-story">Fiber AI</a> uses PeerDB to replicate terabytes of data from Postgres to ClickHouse in real time, powering their real-time search use case.</p>
</li>
<li><p><a target="_blank" href="https://www.peerdb.io/customers/peerdb-flatiron-health-customer-story">Flatiron Health</a> used PeerDB to migrate 35,000 tables with terabytes of data from Postgres to Snowflake within a week.</p>
</li>
</ul>
<p><em>"We’re using PeerDB already for our Postgres to ClickHouse ETL and it’s insanely fast and accurate! We can’t believe how well it works. The PeerDB team has been super helpful in getting us set up, helping us debug, and advising us on everything related to ClickHouse and Postgres. Great work, guys!"**</em>said Neel Mehta, CTO, Fiber AI.**</p>
<h2 id="heading-how-to-try-peerdb">How to try PeerDB?</h2>
<p>You can try PeerDB through one of <a target="_blank" href="https://www.peerdb.io/#prices">three offerings</a>: <a target="_blank" href="https://github.com/PeerDB-io/peerdb">Open source</a>, <a target="_blank" href="https://app.peerdb.cloud/">a fully managed cloud service</a> and a <a target="_blank" href="https://www.peerdb.io/sign-up">self-hosted enterprise offering</a>.</p>
<h2 id="heading-founders">Founders</h2>
<p>The co-founders of PeerDB, CEO <a target="_blank" href="https://www.linkedin.com/in/sai-krishna-srirampur-1741b019/">Sai Krishna Srirampur</a> and CTO <a target="_blank" href="https://www.linkedin.com/in/kaushikiska/">Kaushik Iska</a>, have been friends since high school and were roommates in college at the <a target="_blank" href="https://www.iiit.ac.in/">International Institute of Information Technology Hyderabad</a>, where they both studied computer science. Sai Krishna Srirampur was an early engineer at Citus Data, which was acquired by Microsoft. There, he led solutions engineering for all Postgres services on Microsoft Azure. Kaushik Iska built operating systems and led data teams at Google, SafeGraph, and Palantir Technologies. He also represented India in the International Collegiate Programming Contest (ACM ICPC) World Finals. They have been building Postgres products for a decade now and have closely worked with Postgres customers running into issues with existing ETL tools. To fill this gap, they founded PeerDB. Below is the image of the PeerDB Founding Team.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1712814146601/56d6fd3c-9065-421c-b5a0-ef9f34396e2b.png" alt="PeerDB Team" class="image--center mx-auto" /></p>
]]></content:encoded></item><item><title><![CDATA[PeerDB is GDPR Compliant]]></title><description><![CDATA[We are excited to share a significant achievement at PeerDB: we have achieved full compliance with the General Data Protection Regulation (GDPR). This milestone represents our unwavering dedication to data protection and privacy, further strengthenin...]]></description><link>https://blog.peerdb.io/peerdb-is-gdpr-compliant</link><guid isPermaLink="true">https://blog.peerdb.io/peerdb-is-gdpr-compliant</guid><category><![CDATA[#gdpr]]></category><category><![CDATA[Security]]></category><category><![CDATA[compliance ]]></category><category><![CDATA[PostgreSQL]]></category><category><![CDATA[snowflake]]></category><category><![CDATA[bigquery]]></category><category><![CDATA[ClickHouse]]></category><category><![CDATA[replication]]></category><category><![CDATA[Databases]]></category><dc:creator><![CDATA[Kunal Gupta]]></dc:creator><pubDate>Thu, 04 Apr 2024 15:37:56 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1712245065079/ca2c1d6c-0389-4d97-8805-44180c6a9485.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We are excited to share a significant achievement at <a target="_blank" href="https://peerdb.io/">PeerDB</a>: we have achieved full compliance with the <a target="_blank" href="https://gdpr-info.eu/">General Data Protection Regulation (GDPR)</a>. This milestone represents our unwavering dedication to data protection and privacy, further strengthening the trust our clients and partners have placed in us.</p>
<p><strong>Our Ongoing Dedication to Data Privacy and Security</strong></p>
<p>At PeerDB, we understand the critical importance of data privacy and security. Our team has worked diligently to ensure that our practices align with the stringent requirements of the GDPR. For a detailed overview of our security measures, we invite you to explore our updated <a target="_blank" href="https://trust.peerdb.io/">Trust Center</a>.</p>
<p><strong>Benefits of GDPR Compliance for Our Clients</strong></p>
<ul>
<li><p><strong>Enhanced Data Security:</strong> With GDPR compliance in place, we guarantee the highest level of security and confidentiality for all data entrusted to us. From data encryption to strict access controls, we have implemented comprehensive measures to safeguard your information.</p>
</li>
<li><p><strong>Transparency and Control:</strong> <a target="_blank" href="https://trust.peerdb.io/">Our Trust Center</a> serves as a central hub for information on our security infrastructure, organizational practices, and third-party engagements. We believe in transparency and empower our clients with the control they need over their data.</p>
</li>
<li><p><strong>Continual Improvement:</strong> Our commitment to data security doesn't end with GDPR compliance. We are dedicated to continuously evaluating and enhancing our security measures to not only meet current regulations but also stay ahead of emerging threats.</p>
</li>
</ul>
<p><strong>Looking Ahead: Our Next Steps in Security</strong></p>
<p>While we celebrate our GDPR compliance, we are also looking towards the future. As part of our ongoing commitment to security excellence, we are preparing for SOC 2 Type 2 certification and are currently undergoing the audit. This additional certification will further validate our dedication to maintaining high security standards for our services.</p>
<p><strong>Empowering Your Business with PeerDB Cloud</strong></p>
<p><a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a> provides a secure and scalable platform for businesses to fulfil all their Postgres Data Movement needs. With GDPR compliance at its core, <a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a> offers peace of mind, knowing that your data is protected in a robust and reliable environment.</p>
<p><strong>Why GDPR Compliance Matters</strong></p>
<p>Achieving GDPR compliance goes beyond meeting legal requirements—it's about building trust with our clients and exceeding industry standards. It demonstrates our commitment to ensuring the privacy and security of the data we handle.</p>
<p><strong>Your Trust, Our Priority</strong></p>
<p>At PeerDB, we are dedicated to being a reliable partner in your digital journey. We are here to assist you in leveraging our services with confidence, knowing that your data is protected. For more information on our GDPR compliance efforts, <a target="_blank" href="https://app.peerdb.cloud/">PeerDB Cloud</a>, or any other security-related inquiries, please visit our <a target="_blank" href="https://trust.peerdb.io/">Trust Center</a>.</p>
<p>Thank you for being part of this journey with us. We look forward to continuing to provide secure and trusted solutions for all your data movement needs.</p>
]]></content:encoded></item><item><title><![CDATA[Exploring versions of the Postgres logical replication protocol]]></title><description><![CDATA[Introduction
Logical Replication is one of the many ways a Postgres database can replicate data to other Postgres database (a.k.a standby). Logical replication directly reads from the write-ahead log (WAL), recording every database change, avoiding t...]]></description><link>https://blog.peerdb.io/exploring-versions-of-the-postgres-logical-replication-protocol</link><guid isPermaLink="true">https://blog.peerdb.io/exploring-versions-of-the-postgres-logical-replication-protocol</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Databases]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[postgres]]></category><category><![CDATA[replication]]></category><category><![CDATA[high availability]]></category><dc:creator><![CDATA[Kevin Biju]]></dc:creator><pubDate>Mon, 01 Apr 2024 16:57:40 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1711920289167/ed90c561-532b-4fea-8eb9-9ef617f7337a.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introduction"><strong>Introduction</strong></h2>
<p><a target="_blank" href="https://www.postgresql.org/docs/current/logical-replication.html">Logical Replication</a> is one of the <a target="_blank" href="https://www.postgresql.org/docs/16/different-replication-solutions.html">many</a> ways a Postgres database can replicate data to other Postgres database (a.k.a standby). Logical replication directly reads from the <a target="_blank" href="https://www.postgresql.org/docs/current/wal-intro.html">write-ahead log</a> (WAL), recording every database change, avoiding the need to intercept queries or periodically read the table. These changes are filtered, serialized and then sent to the standby servers where they can be applied. While logical replication is intended to be used by Postgres databases to send and receive changes, it also allows ETL tools like <a target="_blank" href="https://www.peerdb.io/">PeerDB</a> to get a reliable stream of changes that can be processed as needed.</p>
<p>Logical replication started by only allowing streaming of committed transactions. It then evolved to support streaming of in-flight transactions, followed by <a target="_blank" href="https://www.postgresql.org/docs/current/two-phase.html">two-phase commits</a>, and then parallel apply of in-flight transactions. This blog dives into this evolution and its impact on performance, and presents some useful benchmarks. It is useful for anyone who uses Postgres Logical Replication in practice!</p>
<h2 id="heading-components-of-logical-replication"><strong>Components of logical replication</strong></h2>
<p>For a quick rundown, a full logical replication setup involves several crucial components. <strong>Please skip this section if you are already familiar with the concepts of logical replication.</strong></p>
<p>1. <a target="_blank" href="https://www.postgresql.org/docs/current/logicaldecoding-explanation.html#LOGICALDECODING-REPLICATION-SLOTS"><strong>Replication Slot</strong></a>: A replication slot on the primary server is what reads changes from the WAL and passes it to the output plugin to be serialized and sent to the standby server (or ETL tool) to be applied. Periodically, the standby server sends a message to the primary to confirm that it has read the WAL to a certain point, at which point the slot can advance.</p>
<p>2. <a target="_blank" href="https://www.postgresql.org/docs/current/logical-replication-publication.html"><strong>Publication</strong></a>: A publication is essentially a filter on the WAL changes. Publications are very powerful and can filter out schemas, tables and even particular columns of tables. You can also choose to publish inserts and not updates and also apply custom logic to filter out certain rows. When a standby starts reading from a replication slot, a set of publications are passed as input.</p>
<p>3. <a target="_blank" href="https://www.postgresql.org/docs/current/logical-replication-subscription.html"><strong>Subscriptions</strong></a>: A subscription is basically the Postgres syntax for creating a logical replication connection to a primary server for replicating changes from a slot and a set of publications. The standby then reads data from the primary and replicates it as long as the subscription is active. While this is Postgres specific, other tools end up behaving like subscribed standbys and get the same output from the primary server.</p>
<p>4. <a target="_blank" href="https://www.postgresql.org/docs/current/logicaldecoding-explanation.html#LOGICALDECODING-EXPLANATION-OUTPUT-PLUGINS"><strong>Output plugins</strong></a>: The replication slot passes raw WAL change data to an output plugin which serializes it to a stream of messages. This helps with the interoperability of logical replication as the message format is independent of the underlying database version or configuration. The de-facto output plugin is a Postgres project called <code>pgoutput</code> but other plugins like <code>wal2json</code> and <code>decoderbufs</code> enjoy support among the community.</p>
<h2 id="heading-wait-logical-replication-has-versions"><strong>Wait, logical replication has versions?</strong></h2>
<p>When starting logical replication (<a target="_blank" href="https://www.postgresql.org/docs/current/protocol-logical-replication.html">START_REPLICATION</a>), there is a <a target="_blank" href="https://www.postgresql.org/docs/current/protocol-logical-replication.html">parameter</a> called <code>proto_version</code> that allows users to opt in to newer semantics of the logical replication protocol. Starting with Postgres 14 in September 2021, three new <code>proto_version</code>s of logical replication have been added in consecutive releases. Looking at the docs for <code>proto_version</code> right now, we see this:</p>
<pre><code class="lang-plaintext">proto_version
    Protocol version. Currently versions 1, 2, 3, and 4 are supported.

    Version 2 is supported only for server version 14 and above, 
    and it allows streaming of large in-progress transactions.

    Version 3 is supported only for server version 15 and above, 
    and it allows streaming of two-phase commits.

    Version 4 is supported only for server version 16 and above, and it 
    allows streams of large in-progress transactions to be applied 
    in parallel.
</code></pre>
<p>While these all sound like good things, it's not clear for the average reader what they mean or what problems are being tackled. And for the informed reader who knows what these changes mean, it'd still be nice to understand how they are implemented and their impact on real-world workloads.</p>
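<p>For reference, the protocol version is chosen by the client when it issues <code>START_REPLICATION</code> on a replication connection, as a <code>pgoutput</code> option alongside the publication list. A minimal sketch (slot name, start LSN and publication name are placeholders):</p>
<pre><code class="lang-sql">START_REPLICATION SLOT orders_slot LOGICAL 0/0
  (proto_version '2', publication_names 'orders_pub', streaming 'on')
</code></pre>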
<h3 id="heading-v1-the-status-quo"><strong>v1 - the status quo</strong></h3>
<p>To analyze the messages and semantics of the various protocol versions, we've written a small Go application called <code>polorex</code>. If you want to check out the code or try things out for yourself, check out the code in <a target="_blank" href="https://github.com/PeerDB-io/polorex">this repo</a>.</p>
<p>To simulate a workload, we are running 2 transactions concurrently, inserting rows into the same table. The transactions insert rows in 100 batches of 250,000, totalling 50 million rows. The workload is simulated by a subcommand of the <code>polorex</code> application. The transactions are read and analyzed by another subcommand called <code>txnreader</code> which connects to the database and continuously reads the replication slot.</p>
<pre><code class="lang-plaintext">./polorex txnreader -port 7132
[in a different terminal]
./polorex txngen -port 7132 -iterations 100 -batchsize 250000 -parallelism 2
</code></pre>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1706541313918/7fc71484-2f5c-47a4-9ad5-4c974251dd2c.png" alt class="image--center mx-auto" /></p>
<p>The transactions start at the green line and end at the red line. We can see how the transactions are being read only after they commit. It takes 3-4 minutes to decode both our transactions. Since we just committed 2 large transactions, the <code>pgoutput</code> plugin has to read a lot of WAL at once and then serialize it into 50 million <code>INSERT</code> messages to be sent over. While the graph shows that we are reading almost 250K inserts per second, one can see how this could quickly get out of hand for larger transactions with wider schemas. We could quickly fall behind the primary server purely due to this decoding overhead.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1706541553668/831dc1f1-7a86-4d24-a0ae-89c535f89dba.png" alt class="image--center mx-auto" /></p>
<p>Another issue which follows from this but is less obvious is with regards to the size of the replication slot. This is basically the amount of WAL being retained for the slot to decode changes without losing any data. Looking at the graph, it quickly rises as the transactions progress, <strong>but also stays high until both transactions are read</strong> at which point it falls dramatically. This can be an issue in workloads with high throughput and large transactions - the WAL being retained can reach hundreds of gigabytes within a matter of hours, thereby consuming the entire disk space and crashing the Postgres server.</p>
<p>With this insight in mind, we can see how version 2's promise of allowing <code>streaming of large in-progress transactions</code> sounds enticing. But there is also a simplicity in version 1 of only sending changes over when they are committed. We read a <code>BeginMessage</code> and everything from there onward is fair game to be replicated immediately. In contrast, an "in-progress" transaction could be rolled back at any point, and therefore all the changes read so far need to be staged somehow before being replicated.</p>
<h3 id="heading-v2-rows-down-the-stream"><strong>v2 - rows down the stream</strong></h3>
<p>To begin with, we restart <code>txnreader</code> with a flag to ask it to use protocol version 2 while connecting to the slot. We then rerun the same <code>txngen</code> workload.</p>
<pre><code class="lang-plaintext">./polorex txnreader -port 7132 -protocol 2
[in a different terminal]
./polorex txngen -port 7132 -iterations 100 -batchsize 250000 -parallelism 2
</code></pre>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1706548856165/1b570457-9d75-45b2-baa2-1b3e48ffaa73.png" alt class="image--center mx-auto" /></p>
<p>We are seeing a completely different story in terms of how the transactions are being processed here. It's clear that we're getting rows way before the transaction even commits. We're actually seeing <code>streaming of in-progress transactions</code>! Rows for a particular transaction come to us between a <code>StreamStartMessage</code> and <code>StreamStopMessage</code>, and we get several of these streams while rows are still being sent over. We are getting streams for both of our transactions before any of them commit, but we are still only reading 1 transaction at a time.</p>
<p>A transaction being streamed now commits using a <code>StreamCommitMessage</code>, but unlike the <code>Commit</code> message from earlier, we <strong>need</strong> to wait for this since the fate of the transaction is not known yet. Alternatively, we could receive a <code>StreamAbortMessage</code> which implies transaction rollback and so all our changes for said transaction should not be applied.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1706550671096/fd6abba6-31a0-4380-9cc4-e98207c4e9c6.png" alt class="image--center mx-auto" /></p>
<p>The improvements from streaming are nothing short of dramatic: transactions are fully read seconds after the rows finish inserting, approximately 4 minutes earlier than with version 1. As a result, the slot size also decreases much more quickly.</p>
<h3 id="heading-results-v2-enables-faster-decoding-and-shorter-peak-slot-size-duration"><strong>Results - v2 enables faster decoding and shorter peak slot size duration</strong></h3>
<p>To reiterate, there is no magical improvement in transaction reading performance or peak slot size. The transactions themselves take about the same time to process and generate the same amount of WAL, but since the replication happens in parallel with the transaction, we see better performance.</p>
<p>In version 2, transactions are fully decoded and the slot size decreases immediately after the transactions complete, whereas version 1 requires an additional 4 minutes. This can have a drastic impact on workloads with high throughput and sizable transactions - version 2 can be very helpful in enhancing logical decoding performance and keeping the slot size in check!</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1706568527289/9278eb76-9e32-4916-8b12-0641636edd3a.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-v3-and-v4-2pc-and-parallel-apply"><strong>v3 and v4 - 2PC and parallel apply</strong></h3>
<p>Version 3 introduces new message types to manage <a target="_blank" href="https://www.postgresql.org/docs/current/sql-prepare-transaction.html">two-phase commit</a> transactions. While significant in certain scenarios, the concept of two-phase commit remains relatively niche from an ELT standpoint.</p>
<p>Version 4 is less clear in its description, and even the documentation doesn't venture much further than this. As it turns out, it doesn't refer to applying multiple transactions in parallel, but to spreading out the load of applying a single large in-progress transaction over multiple processes on the standby. For this, new fields have been added to some existing messages. This is again a great feature for some workloads, but not very useful from the standpoint of something else pretending to be a Postgres standby.</p>
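<p>For a plain Postgres standby, this behaviour is opted into via the <code>streaming</code> option on the subscription; for example (Postgres 16+, names are placeholders):</p>
<pre><code class="lang-sql">-- apply large in-progress transactions using parallel apply workers on the standby
CREATE SUBSCRIPTION orders_sub
  CONNECTION 'host=primary.example.com dbname=app user=replicator'
  PUBLICATION orders_pub
  WITH (streaming = parallel);
</code></pre>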
<h3 id="heading-conclusion"><strong>Conclusion</strong></h3>
<p>Postgres logical replication is a powerful feature central to the distributed/HA Postgres ecosystem. By using version 2 of the logical replication protocol to stream in-flight transactions, we can efficiently manage WAL spikes during sizable transactions, enhancing logical decoding performance and mitigating disk full issues caused by replication slot growth. Additionally, this approach reduces the lag between the Postgres source and its readers.</p>
<p>At <a target="_blank" href="https://www.peerdb.io/">PeerDB</a>, we're developing a feature that utilizes version 2 of the logical replication protocol to consume changes from a Postgres database before they are committed. We believe this feature will significantly benefit Postgres users grappling with issues related to replication slot growth. Overall, version 2 of the logical replication protocol presents a promising solution for optimizing Postgres replication processes and improving overall reliability and performance.</p>
]]></content:encoded></item><item><title><![CDATA[Enterprise-grade Replication from Postgres to Azure Event Hubs]]></title><description><![CDATA[At PeerDB, we are building a fast and a cost-effective way to replicate data from Postgres to Data Warehouses and Queues. Today we are releasing our Azure Event Hubs connector. With this, you get a fast, simple, and reliable way to Change Data Captur...]]></description><link>https://blog.peerdb.io/enterprise-grade-replication-from-postgres-to-azure-event-hubs</link><guid isPermaLink="true">https://blog.peerdb.io/enterprise-grade-replication-from-postgres-to-azure-event-hubs</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Azure]]></category><category><![CDATA[Microsoft]]></category><category><![CDATA[kafka]]></category><category><![CDATA[postgres]]></category><category><![CDATA[streaming]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[enterprise software]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Fri, 15 Mar 2024 20:54:59 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1709774738214/8c7b3f01-51ec-47c5-9722-24a34130aa60.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>At <a target="_blank" href="https://www.peerdb.io/">PeerDB</a>, we are building a fast and a cost-effective way to replicate data from <a target="_blank" href="https://www.postgresql.org/">Postgres</a> to Data Warehouses and Queues. Today we are releasing our <a target="_blank" href="https://azure.microsoft.com/en-us/products/event-hubs">Azure Event Hubs</a> connector. With this, you get a fast, simple, and reliable way to Change Data Capture (CDC) from PostgreSQL to Azure Event Hubs, enabling downstream apps to consume a raw feed of data from your PostgreSQL database in real-time. This enables use cases such as real-time alerting for Fraud or Anomaly detection in Banking/IoT, Operational Analytics, and more.</p>
<p>In this blog, we delve into existing approaches to replicate Postgres to Event Hubs and their challenges, as well as how PeerDB addresses these challenges to provide an Enterprise-grade experience!</p>
<h2 id="heading-status-quo">Status Quo</h2>
<h3 id="heading-debezium-is-hard-to-use-and-is-not-built-for-azure-event-hubs">Debezium is hard to use and is not built for Azure Event Hubs</h3>
<p>A common way to replicate data from Postgres to Event Hubs is to use Open Source tools such as <a target="_blank" href="https://debezium.io/">Debezium</a>. Below are a few challenges that we’ve heard from customers trying Debezium with Azure Event Hubs.</p>
<ol>
<li><p><strong>Limited Configurability:</strong> Debezium offers limited customization for Azure Event Hubs, including the inability to perform advanced mapping between tables and topics, lack of support for custom partitioning schemes per topic, and inability to flatten nested JSONs, among other limitations.</p>
</li>
<li><p><strong>High Setup and Maintenance Costs:</strong> One of the common concerns we hear from customers is that setting up and managing Debezium at a production-grade level is challenging. It often requires several months of work by a data engineering team to fully implement.</p>
</li>
<li><p><strong>Not Native to Azure Event Hubs:</strong> Debezium leverages the Kafka protocol over Event Hubs to support the Event Hubs connector. The Kafka protocol is <a target="_blank" href="https://learn.microsoft.com/en-us/azure/event-hubs/apache-kafka-troubleshooting-guide">not as developed</a> as the native APIs provided by Event Hubs.</p>
</li>
</ol>
<h2 id="heading-peerdb-for-change-data-capture-cdc-from-postgres-to-azure-event-hubs">PeerDB for Change Data Capture (CDC) from Postgres to Azure Event Hubs</h2>
<p>In the past 6 months, we have invested heavily to make replication from Postgres to Azure Event Hubs as robust as possible. We have implemented multiple usability, security, and performance-related features required for enterprise customers. Below are a few highlights.</p>
<h3 id="heading-simple-to-use-sql-layer-that-makes-life-very-easy">Simple to Use - SQL Layer that makes life very easy!</h3>
<p>Along with a simple UI, PeerDB provides a Postgres-compatible SQL layer to manage replication from Postgres to Azure Event Hubs. You just need to run a couple of SQL commands to set up a highly reliable CDC pipeline: <a target="_blank" href="https://docs.peerdb.io/sql/commands/create-peer#eventhub-peer">CREATE PEER</a> to make PeerDB aware of the Postgres and Event Hubs peers; <a target="_blank" href="https://docs.peerdb.io/usecases/Real-time%20CDC/postgres-to-azure-eventhubs#step-2-real-time-cdc-from-postgresql-to-event-hubs">CREATE MIRROR</a> to kick off the replication job.</p>
<p>The Postgres-compatible SQL layer comes in very handy for managing replication from a fleet of Postgres databases across different tenants or microservices to Azure Event Hubs. You can script out your pipelines using Python or any other language and use any CI tool to manage your data pipelines.</p>
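<p>As a rough sketch of that flow (peer names, table-to-topic mappings and options below are illustrative placeholders rather than exact syntax; the linked CREATE PEER and CREATE MIRROR docs are the authoritative reference):</p>
<pre><code class="lang-sql">-- make PeerDB aware of the source and target (connection options elided)
CREATE PEER source_pg FROM POSTGRES WITH (host = '...', port = 5432, ...);
CREATE PEER target_eventhubs FROM EVENTHUBS WITH (...);

-- kick off CDC, mapping Postgres tables to Event Hubs topics
CREATE MIRROR orders_cdc FROM source_pg TO target_eventhubs
  WITH TABLE MAPPING (public.orders:orders_topic);
</code></pre>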
<p>The following demo showcases PeerDB in action, replicating data from Postgres, running a multi-tenant SaaS app, to Azure Event Hubs.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://www.loom.com/share/1846057942f141e4afdadc030f55a421">https://www.loom.com/share/1846057942f141e4afdadc030f55a421</a></div>
<p> </p>
<h3 id="heading-blazing-fast-performance-with-sub-second-latency">Blazing fast performance with Sub-Second latency</h3>
<p>Use cases requiring replication from Postgres to Azure Event Hubs are highly latency-sensitive. For instance, consider an IoT app publishing raw changes to Event Hubs. PeerDB implements multiple optimizations to provide sub-second latency at high throughputs (10K+ TPS). A few of the optimizations include:</p>
<ol>
<li><p><a target="_blank" href="https://blog.peerdb.io/building-a-streaming-platform-in-go-for-postgres">Streaming instead of batching</a></p>
</li>
<li><p>Always consuming the logical replication slot</p>
</li>
<li><p>Parallel apply for Azure Event Hubs</p>
</li>
<li><p><a target="_blank" href="https://devblogs.microsoft.com/azure-sdk/announcing-the-stable-release-of-the-azure-event-hubs-client-library-for-go/">Using native APIs (not the Kafka layer) to ingest into Azure Event Hubs</a></p>
</li>
</ol>
<h3 id="heading-highly-configurable-do-almost-anything-you-want">Highly Configurable - do almost anything you want!</h3>
<p>PeerDB provides many nuts and bolts to manage the behavior of CDC. You can control data formats/transformations, security/isolation, and performance while replicating data from Postgres to Azure Event Hubs. A few of them include:</p>
<ol>
<li><p><strong>Topics can be spread across Namespaces and Subscriptions:</strong> You can replicate data from multiple Postgres tables to Event Hubs spread across namespaces and even subscriptions. This ensures guaranteed isolation across topics, which could be critical in multi-tenant SaaS apps.</p>
</li>
<li><p><strong>Define custom partition keys and partition counts across topics:</strong> To configure performance across topics, you can define custom partition keys and partition counts per topic.</p>
</li>
<li><p><strong>Flatten JSON and JSONB columns:</strong> PeerDB allows you to deep flatten JSON and JSONB columns in Postgres into separate key&lt;&gt;value pairs on Azure Event Hubs.</p>
</li>
</ol>
<h3 id="heading-enterprise-grade-security-and-isolation">Enterprise grade Security and Isolation</h3>
<p>We designed the Azure Event Hubs connector specifically for Enterprise customers. Below are a few security features/items that PeerDB provides.</p>
<ol>
<li><p><strong>Guaranteed isolation across Azure Event Hubs topics:</strong> PeerDB provides the ability to replicate data from multiple tables in Postgres to separate topics spread across different namespaces and Azure subscriptions. This ensures guaranteed isolation across topics, which could be critical in multi-tenant SaaS apps, where you are providing raw DB feed to your customers.</p>
</li>
<li><p><strong>PeerDB Enterprise Offering:</strong> For enterprise customers, PeerDB provides the self-hosted offering, which comes with production-ready Helm charts and Enterprise-grade support. This enables you to provision PeerDB in <a target="_blank" href="https://azure.microsoft.com/en-us/products/kubernetes-service">Azure Kubernetes Services (AKS)</a> within your own VNET.</p>
</li>
</ol>
<h3 id="heading-production-ready-observability">Production ready Observability</h3>
<ol>
<li><p><strong>PeerDB UI:</strong> PeerDB comes with a comprehensive UI to monitor the replication jobs. You can monitor performance (throughput and latency), logs, and Postgres native metrics such as replication slot size. Additionally, you can create alerts for these metrics and send them to various channels such as Email and Slack.</p>
</li>
<li><p><strong>Integration with Azure Monitor:</strong> PeerDB Enterprise can run on Azure Kubernetes Services (AKS). AKS has out-of-the-box integration with Azure Monitor to manage metrics, logs and alerts.</p>
</li>
</ol>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1709922781369/d14b632a-d72e-4e22-9ef2-d9b4c73b5dbb.png" alt class="image--center mx-auto" /></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1709774232242/6d91c922-ab1f-45bc-921a-3a47f282918b.png" alt="Monitor throughput and latency of replication" class="image--center mx-auto" /></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1709774299256/b4d60297-8adc-45f8-a265-4b8146d4a94b.png" alt="Monitor Postgres replication slot growth" class="image--center mx-auto" /></p>
<h2 id="heading-conclusion"><strong>Conclusion</strong></h2>
<p>Hope you enjoyed reading this blog! The Azure Event Hubs connector is being used in production by a few large-scale Postgres Azure customers. If you are interested in trying this out, please reach out to us through the <a target="_blank" href="https://www.peerdb.io/sign-up">Contact Us</a> form on our website.</p>
<p>We are actively working to extend similar support to other queues including Kafka and Google Pub Sub. If you are interested in previewing PeerDB for these queues, reach out to us through the <a target="_blank" href="https://www.peerdb.io/sign-up">Contact Us</a> form. We also offer a <a target="_blank" href="https://app.peerdb.cloud">30 day free trial for PeerDB Cloud</a>.</p>
]]></content:encoded></item><item><title><![CDATA[Comparing Postgres Managed Services: AWS, Azure, GCP and Supabase]]></title><description><![CDATA[At PeerDB, we are building a fast and a cost-effective way to replicate data from Postgres to Data Warehouses such as Snowflake, BigQuery, ClickHouse, Postgres and so on. All our customers run Postgres at the heart of the data stack, running fully ma...]]></description><link>https://blog.peerdb.io/comparing-postgres-managed-services-aws-azure-gcp-and-supabase</link><guid isPermaLink="true">https://blog.peerdb.io/comparing-postgres-managed-services-aws-azure-gcp-and-supabase</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Databases]]></category><category><![CDATA[AWS]]></category><category><![CDATA[GCP]]></category><category><![CDATA[Azure]]></category><category><![CDATA[supabase]]></category><category><![CDATA[postgres]]></category><dc:creator><![CDATA[Sai Srirampur]]></dc:creator><pubDate>Mon, 04 Mar 2024 17:24:23 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1708971058926/279bf17a-8b1c-477d-aed3-ddf6f8f724fb.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>At <a target="_blank" href="https://www.peerdb.io/">PeerDB</a>, we are building a fast and a cost-effective way to replicate data from Postgres to Data Warehouses such as Snowflake, BigQuery, ClickHouse, Postgres and so on. All our customers run Postgres at the heart of the data stack, running fully managed or self-hosted Postgres databases.</p>
<p>We often get asked about the preferred managed service for PostgreSQL. In that spirit, we are writing this blog to compare four popular options incl. <a target="_blank" href="https://aws.amazon.com/rds/postgresql/">AWS RDS Postgres</a>, <a target="_blank" href="https://azure.microsoft.com/en-us/products/postgresql/?ef_id=_k_Cj0KCQiA5-uuBhDzARIsAAa21T8Lx70H4gC97Kz9axfkTXKAI9m0aNfNuqSTpVnuuCepfNo725BrSy0aAk-JEALw_wcB_k_&amp;OCID=AIDcmm5edswduu_SEM__k_Cj0KCQiA5-uuBhDzARIsAAa21T8Lx70H4gC97Kz9axfkTXKAI9m0aNfNuqSTpVnuuCepfNo725BrSy0aAk-JEALw_wcB_k_&amp;gad_source=1&amp;gclid=Cj0KCQiA5-uuBhDzARIsAAa21T8Lx70H4gC97Kz9axfkTXKAI9m0aNfNuqSTpVnuuCepfNo725BrSy0aAk-JEALw_wcB">Azure Flexible Server Postgres</a>, <a target="_blank" href="https://cloud.google.com/sql/postgresql?hl=en">GCP Cloud SQL for Postgres</a>, and <a target="_blank" href="https://supabase.com/docs/guides/database/overview">Supabase Postgres</a>, across Performance, Costs and Features. We also acknowledge other providers like <a target="_blank" href="https://tembo.io/">Tembo</a>, <a target="_blank" href="https://www.crunchydata.com/products/crunchy-bridge">Crunchy Bridge</a>, <a target="_blank" href="https://neon.tech/">Neon</a> and <a target="_blank" href="https://www.timescale.com/">TimescaleDB</a> which we'll cover in a future post.</p>
<p>Note that this comparison aims to serve as a helpful <strong>"first"</strong> checklist for developers choosing a managed service. There may be things we missed, and we apologize for any oversights. We are happy to adjust our analysis based on feedback.</p>
<h1 id="heading-setup">Setup</h1>
<p>To ensure an apples-to-apples comparison, we aimed to match the four options as closely as possible in terms of RAM, vCores, disk space, PostgreSQL version, region, etc. The table below captures the details of the initial setup.</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Cloud</strong></td><td><strong>AWS</strong></td><td><strong>GCP</strong></td><td><strong>Azure</strong></td><td><strong>Supabase</strong></td></tr>
</thead>
<tbody>
<tr>
<td><strong>Region</strong></td><td>us-east-1</td><td>us-east1</td><td>East US</td><td>East US</td></tr>
<tr>
<td><strong>PG Version</strong></td><td>16.1-R2</td><td>15, 16 unavailable</td><td>16</td><td>15, 16 unavailable</td></tr>
<tr>
<td><strong>DB Type</strong></td><td>db.m6i.large</td><td>Enterprise -&gt; Sandbox</td><td>Standard_D2s_v5</td><td>Large</td></tr>
<tr>
<td><strong>RAM</strong></td><td>8</td><td>8</td><td>8</td><td>8</td></tr>
<tr>
<td><strong>vCores</strong></td><td>2</td><td>2</td><td>2</td><td>2</td></tr>
<tr>
<td><strong>Disk Size</strong></td><td>100</td><td>100</td><td>100</td><td>100</td></tr>
<tr>
<td><strong>Disk Type / IOPs</strong></td><td>gp3 (3000)</td><td>3000</td><td>Premium SSD v2 (3000)</td><td>Not specified</td></tr>
<tr>
<td><strong>Default Arch</strong></td><td>x64</td><td>Not specified (probably x64)</td><td>x64</td><td>ARM</td></tr>
<tr>
<td><strong>HA</strong></td><td>Not enabled</td><td>Not enabled</td><td>Not enabled</td><td>Not enabled</td></tr>
<tr>
<td><strong>DB Disk Type (IOPS)</strong></td><td>SSD gp3 (3000)</td><td>3000</td><td>Premium SSD v2 (3000)</td><td>Not specified</td></tr>
</tbody>
</table>
</div><h1 id="heading-performance"><strong>Performance</strong></h1>
<h2 id="heading-benchmark-setup">Benchmark Setup</h2>
<p>All the performance tests were conducted using a VM (client) with the same compute capacity, colocated in the same region as the PostgreSQL database. We ran 3 main performance tests:</p>
<ol>
<li><p><a target="_blank" href="https://www.postgresql.org/docs/current/pgbench.html">pgbench</a> representing a typical Transactional (OLTP) workload</p>
</li>
<li><p>COPY command to Batch Insert (Upload) data to Postgres</p>
</li>
<li><p>SELECT command to Batch download data from Postgres</p>
</li>
</ol>
<h2 id="heading-pgbench">pgbench</h2>
<p>Across all the 4 managed PostgreSQL providers, <code>pgbench</code> was run for 24 hours with 8 parallel connections and 4 jobs <code>pgbench -c 8 -j 4 -P 30</code>. The graphs below capture a comparison of average throughput i.e. transactions per second (TPS), average latency and average CPU utilization for all the services.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1713398099459/9accafd1-7f9d-47c1-b853-7d116f19258a.png" alt class="image--center mx-auto" /></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1713398189849/3f8baab8-e5b0-4366-9951-b002cf674324.png" alt class="image--center mx-auto" /></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1708901669178/b0d17cf0-08b6-4009-9f53-3f9015a1146c.png" alt class="image--center mx-auto" /></p>
<p><strong>AWS RDS PostgreSQL led the pack with an average of 2.7K TPS and 2.884 ms average latency. Azure Flexible Server PostgreSQL ranked second, closely trailing AWS RDS by just ~12%. It recorded an average of 2.4K TPS and an average latency of 3.260 ms.</strong> Supabase and GCP Cloud SQL PostgreSQL followed. Average CPU utilization across all the services was almost the same i.e. around 80%, except for Supabase. This could be because Supabase uses ARM processors compared to others who use x86.</p>
<h2 id="heading-batch-upload-and-download">Batch Upload and Download</h2>
<p>For batch uploads, we used the COPY command to insert 1GB and 5GB files from the client to PostgreSQL. For batch downloads, we executed a SELECT query that retrieved 1GB and 5GB of data from a table in PostgreSQL to the client. The graphs below illustrate how each service performed in these tests:</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1708902517962/f21a8286-c4dd-4fed-bae5-8cdb8f318f00.png" alt class="image--center mx-auto" /></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1708902572000/1347b1bd-6776-4015-bfcb-2fb44daf674a.png" alt class="image--center mx-auto" /></p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1709248495715/ca3925ca-d942-4691-999c-2b1ba054994a.png" alt class="image--center mx-auto" /></p>
<p>In terms of batch upload with the COPY command, <strong>AWS RDS was again the leader, taking around 105s to ingest 5GB of data</strong>. GCP Cloud SQL was second with 113s. Azure Flexible Server and Supabase followed.</p>
<p>In terms of batch download using SELECT, the numbers were close across AWS, GCP, and Azure, <strong>with GCP slightly ahead, taking 51 seconds to download 5GB data</strong>. It was interesting to note that Supabase took longer than the others, requiring 160 seconds to download 5GB of data.</p>
<p>CPU utilization peaks during the COPY command were roughly consistent across AWS and GCP, at around 45-50%. Supabase was at approximately 57%. However, Azure peaked at 85%.</p>
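<p>For reference, the batch tests were of the following general shape, run from psql on the client VM using its client-side <code>\copy</code> (table and file names are illustrative):</p>
<pre><code class="lang-sql">-- batch upload: bulk load a local CSV file into the database
\copy public.batch_test FROM 'data_5gb.csv' WITH (FORMAT csv)

-- batch download: stream the table back to the client
\copy (SELECT * FROM public.batch_test) TO 'download_5gb.csv' WITH (FORMAT csv)
</code></pre>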
<h1 id="heading-costs"><strong>Costs</strong></h1>
<p>The table below captures costs across all 4 managed services for a Postgres database with 2 vCPUs, 8GB RAM and a 100GB disk. More details regarding the infra can be found in this <a target="_blank" href="https://docs.google.com/spreadsheets/d/1IjKBOT8R2QP065rx9G0F7RzLyoN0ZFT6LZbCn3Z1leI/edit?usp=sharing">sheet</a>.</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td></td><td><strong>AWS</strong></td><td><strong>GCP</strong></td><td><strong>Azure</strong></td><td><strong>Supabase</strong></td></tr>
</thead>
<tbody>
<tr>
<td><strong>Costs per month</strong></td><td>$129.94</td><td>$116.70</td><td>$129.94</td><td>$113.00</td></tr>
<tr>
<td><strong>Disk Cost per month</strong></td><td>$11.50</td><td>N/A</td><td>$11.50</td><td>N/A</td></tr>
<tr>
<td><strong>Total Cost per month</strong></td><td><strong>$141.44</strong></td><td><strong>$116.70</strong></td><td><strong>$141.44</strong></td><td><strong>$113.00</strong></td></tr>
</tbody>
</table>
</div><p>Notably, Supabase is the most cost-effective of the managed services, at $113 per month. This <strong>could</strong> be because Supabase uses machines with ARM processors, which are more cost-effective than x64. GCP Cloud SQL comes in second at $116 per month. AWS RDS and Azure Flexible Server are tied at $141.44 per month.</p>
<h1 id="heading-database-features"><strong>Database Features</strong></h1>
<p>Postgres Managed Services typically support various important features for running production and enterprise-grade Postgres deployments. A few important features include:</p>
<p><strong>Availability and Reliability:</strong></p>
<ol>
<li><p>High Availability (HA) to minimize downtime during DB failures/crashes.</p>
</li>
<li><p>Backups / Point-In-Time-Recovery to handle Disaster Recovery (DR) scenarios</p>
</li>
<li><p>Cross region read replicas for enterprise-grade DR</p>
</li>
</ol>
<p><strong>Performance and functionality:</strong></p>
<ol>
<li><p>Out-of-the-box features to help with query performance tuning.</p>
</li>
<li><p>Read-replicas to segregate and scale read workloads</p>
</li>
<li><p>Out of the box connection pooling</p>
</li>
<li><p>Extensions to enhance Postgres functionality</p>
</li>
</ol>
<p><strong>Security and Compliance:</strong></p>
<ol>
<li><p>SOC2 and HIPAA</p>
</li>
<li><p>Private Access</p>
</li>
</ol>
<p>The table below compares each of the four managed services based on the above features:</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Feature</strong></td><td><strong>AWS</strong></td><td><strong>GCP</strong></td><td><strong>Azure</strong></td><td><strong>Supabase</strong></td></tr>
</thead>
<tbody>
<tr>
<td><strong>PITR</strong></td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td></tr>
<tr>
<td><strong>HA</strong></td><td>Yes</td><td>Yes</td><td>Yes</td><td><a target="_blank" href="https://github.com/orgs/supabase/discussions/1504">Unclear</a></td></tr>
<tr>
<td><strong>HA across Availability Zones</strong></td><td><a target="_blank" href="https://aws.amazon.com/rds/features/multi-az/">Yes</a></td><td><a target="_blank" href="https://cloud.google.com/sql/docs/postgres/high-availability">Yes</a></td><td>Yes</td><td>No</td></tr>
<tr>
<td><strong>Cross region read replicas</strong></td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes (In early access)</td></tr>
<tr>
<td><strong>Availability SLA</strong></td><td><a target="_blank" href="https://aws.amazon.com/blogs/aws/rds-postgres-sla/">99.95</a></td><td><a target="_blank" href="https://cloud.google.com/sql/sla">99.95 with Enterprise, 99.99 with Enterprise Plus</a></td><td><a target="_blank" href="https://learn.microsoft.com/en-us/azure/reliability/reliability-postgresql-flexible-server#sla">99.95 within AZ, 99.99 with cross AZ HA deployments</a></td><td><a target="_blank" href="https://supabase.com/sla">99.9</a></td></tr>
<tr>
<td><strong>Performance Insights</strong></td><td><a target="_blank" href="https://aws.amazon.com/rds/performance-insights/">Yes</a></td><td><a target="_blank" href="https://cloud.google.com/sql/docs/postgres/using-query-insights">Yes</a></td><td><a target="_blank" href="https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/concepts-intelligent-tuning">Yes</a></td><td>Not out-of-the-box but through <a target="_blank" href="https://supabase.com/docs/guides/platform/performance">SQL queries</a></td></tr>
<tr>
<td><strong>Read replicas</strong></td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes (In early access)</td></tr>
<tr>
<td><strong>Connection Pooling</strong></td><td>Yes with <a target="_blank" href="https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/rds-proxy.html">RDS Proxy</a></td><td>No</td><td>Yes with <a target="_blank" href="https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/concepts-pgbouncer">PGBouncer</a></td><td>Yes with <a target="_blank" href="https://supabase.com/blog/supavisor-postgres-connection-pooler">Supavisor</a></td></tr>
<tr>
<td><strong>Number of Extensions</strong></td><td><a target="_blank" href="https://gist.github.com/saisrirampur/238f9b886f5543f639dea21a4c37abb7">92</a>, <a target="_blank" href="https://docs.aws.amazon.com/AmazonRDS/latest/PostgreSQLReleaseNotes/postgresql-extensions.html">Official Docs</a></td><td><a target="_blank" href="https://gist.github.com/saisrirampur/b15d6f9f3c6fb4bdc0adbe3cd42e3a16">74</a>, <a target="_blank" href="https://cloud.google.com/sql/docs/postgres/extensions">Official Docs</a></td><td><a target="_blank" href="https://gist.github.com/saisrirampur/3b705668c8a86386ac10ca4380ac0613">75</a>, <a target="_blank" href="https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/concepts-extensions#extension-versions">Official Docs</a></td><td><a target="_blank" href="https://gist.github.com/saisrirampur/06462d9dc9cfba122a0179d9145e5033">81</a>, <a target="_blank" href="https://supabase.com/docs/guides/database/extensions">Official Docs</a></td></tr>
<tr>
<td><strong>Private Access</strong></td><td><a target="_blank" href="https://aws.amazon.com/blogs/database/access-amazon-rds-across-vpcs-using-aws-privatelink-and-network-load-balancer/">Yes</a></td><td><a target="_blank" href="https://cloud.google.com/sql/docs/postgres/configure-private-services-access">Yes</a></td><td><a target="_blank" href="https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/concepts-networking-private-link">Yes</a></td><td>No</td></tr>
<tr>
<td><strong>SOC2</strong></td><td>Yes</td><td>Yes</td><td>Yes</td><td>Yes</td></tr>
<tr>
<td><strong>HIPAA</strong></td><td>Yes</td><td><a target="_blank" href="https://cloud.google.com/security/compliance/hipaa#overview">Yes</a></td><td>Yes</td><td>Yes</td></tr>
</tbody>
</table>
</div><h1 id="heading-conclusion">Conclusion</h1>
<p>Below is a summary of the results from the analyses conducted across the four managed services.</p>
<ol>
<li><p>AWS RDS Postgres was the most mature Postgres offering of all the managed services.</p>
<ol>
<li><p>Performance-wise, it surpassed Azure by just 12% and exceeded the others by over 45% in pgbench throughput and latency.</p>
</li>
<li><p>Feature-wise, it supports almost all of them in the Availability and Reliability, Performance, and Security and Compliance categories.</p>
</li>
<li><p>It supports the highest number of extensions, i.e., 92 of them.</p>
</li>
</ol>
</li>
<li><p>Azure Flexible Server takes second place in performance. It was very close to AWS, being only about 12% lower in performance. It matches AWS RDS Postgres in terms of features.</p>
</li>
<li><p>Managed services across all three clouds offer robust support for features related to Availability &amp; Reliability and Security &amp; Compliance, which are important for enterprise-grade workloads.</p>
</li>
<li><p>Supabase and GCP Cloud SQL Postgres are the most cost-effective of all the managed services.</p>
</li>
<li><p>Special mention to Supabase for supporting <a target="_blank" href="https://supabase.com/docs/guides/getting-started/features">features</a> that make the lives of app developers incredibly easy.</p>
</li>
</ol>
<p>Hope you enjoyed reading this blog. In future blogs we will add a few other managed services to this comparison and aim to go deeper in a few categories such as Performance.</p>
<h1 id="heading-references">References</h1>
<p><a target="_blank" href="https://docs.google.com/spreadsheets/d/1IjKBOT8R2QP065rx9G0F7RzLyoN0ZFT6LZbCn3Z1leI/edit?usp=sharing">Excel sheet capturing all our raw analysis to come up with this blog</a></p>
<p><strong>NOTE:</strong> The blog was updated on April 17, 2024. The primary modification involved changing the Azure Flexible Server VM type from AMD (Standard_D2ads_v5) to Intel (Standard_D2s_v5). This change can be easily configured through radio buttons while provisioning and is set as the default across various regions. Therefore, we deemed it a fair modification in the comparison.</p>
]]></content:encoded></item><item><title><![CDATA[Moving a Billion Postgres Rows on a $100 Budget]]></title><description><![CDATA[Inspired by the 1BR Challenge, I wanted to see how much it would cost to transfer 1 billion rows from Postgres to Snowflake. Moving 1 billion rows is no easy task. The process involves not just the transfer of data but ensuring its integrity, error r...]]></description><link>https://blog.peerdb.io/moving-a-billion-postgres-rows-on-a-100-budget</link><guid isPermaLink="true">https://blog.peerdb.io/moving-a-billion-postgres-rows-on-a-100-budget</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Databases]]></category><category><![CDATA[snowflake]]></category><category><![CDATA[replication]]></category><category><![CDATA[ETL]]></category><dc:creator><![CDATA[Kaushik Iska]]></dc:creator><pubDate>Wed, 21 Feb 2024 19:20:32 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1708528685112/bbd6936f-31e8-4f28-bca7-957eb2bf0e4e.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Inspired by the <a target="_blank" href="https://github.com/gunnarmorling/1brc">1BR Challenge</a>, I wanted to see how much it would cost to transfer 1 billion rows from Postgres to Snowflake. Moving 1 billion rows is no easy task. The process involves not just the transfer of data but ensuring its integrity, error recovery and consistency post-migration.</p>
<p>Central to this task is the selection of tools and techniques. We will discuss the use of open-source tools, customized scripts, ways to read data from Postgres, and Snowflake’s data loading capabilities. Key aspects like parallel processing, efficiently reading Postgres’ <a target="_blank" href="https://www.postgresql.org/docs/current/wal-intro.html">WAL</a>, data compression and incremental batch loading on Snowflake will be highlighted.</p>
<p>I will list and discuss some of the optimizations implemented to minimize compute, network, and warehouse costs. Additionally, I will highlight some of the trade-offs made as part of this process. Since most of the approaches covered in this blog stem from my explorations at <a target="_blank" href="https://github.com/PeerDB-io/peerdb">PeerDB</a> aimed at enhancing our product, the task was accomplished primarily through <a target="_blank" href="https://www.peerdb.io/">PeerDB</a>.</p>
<p>I want to make it clear that there are some feature gaps in comparison to a mature system, and it might not be practical for all use cases. However, it does handle the most common use cases effectively while significantly reducing costs. I also want to caveat that the estimations may be off in some ways, and I’d be happy to adjust based on feedback.</p>
<h1 id="heading-setup">Setup</h1>
<ul>
<li><p><strong>Initial data load:</strong> We will consider that there are 300M rows already in the table at the start of the task, and our system should handle the initial load of all the rows.</p>
</li>
<li><p><strong>Inserts, Updates and Deletes (Change Data Capture):</strong> The rest of the 700M rows will be a combination of inserts, updates and deletes. <a target="_blank" href="https://wiki.postgresql.org/wiki/TOAST">Including support for toast columns</a>.</p>
<ul>
<li>1024 rows changed per second for ~8 days.</li>
</ul>
</li>
<li><p><strong>Recoverability:</strong> We will reboot the system every 30 mins to ensure that it's robust and can recover from disasters.</p>
</li>
</ul>
<p>Now let us walk through an engineering design that optimally handles the above workload with the objective of <strong>minimizing costs</strong> and <strong>improving performance</strong>, one step at a time.</p>
<h1 id="heading-initial-load-from-postgres-to-snowflake">Initial Load from Postgres to Snowflake</h1>
<p>Let’s start with the first operation any data sync job has to do: load the initial set of data from the source to destination. There are a few challenges that come with this:</p>
<ol>
<li><p>How to efficiently retrieve large amounts of data from Postgres?</p>
</li>
<li><p>How to process the data with a minimal cost footprint?</p>
</li>
<li><p>How to efficiently load this data to Snowflake?</p>
</li>
</ol>
<h2 id="heading-optimal-data-retrieval-from-postgres">Optimal Data retrieval from Postgres</h2>
<p>Reading a table sequentially from Postgres is slow. It would take a long time to read 300M rows from Postgres. To make this process more efficient, <a target="_blank" href="https://duckdb.org/2022/09/30/postgres-scanner.html#parallelization">we have to parallelize</a>. We've got a clever way to quickly read parts of a table in Postgres using something called the TID Scan, which is a bit of a hidden gem. Basically, it lets us pick out specific chunks of data as stored on disk, identified by their <a target="_blank" href="https://www.postgresql.org/docs/current/ddl-system-columns.html#id-1.5.4.7.4.6.2.1">Tuple IDs</a> (CTIDs), which look like <code>(page, tuple)</code>. This optimizes IO utilization and is super handy for reading big tables efficiently.</p>
<p>Here’s how we do it: we divide the table into partitions based on the pages of the database, and each partition gets its own scan task handling about 500K rows. In other words, we partition the table into CTID ranges of roughly 500K rows each and process the partitions in parallel (16 at a time).</p>
<pre><code class="lang-sql"><span class="hljs-keyword">SELECT</span> <span class="hljs-keyword">count</span>(*) <span class="hljs-keyword">FROM</span> public.challenge_1br; <span class="hljs-comment">-- find the count</span>

<span class="hljs-comment">-- num_partitions = (count // rows_per_partition)</span>

<span class="hljs-keyword">SELECT</span> <span class="hljs-keyword">bucket</span>, <span class="hljs-keyword">MIN</span>(ctid) <span class="hljs-keyword">AS</span> <span class="hljs-keyword">start</span>, <span class="hljs-keyword">MAX</span>(ctid) <span class="hljs-keyword">AS</span> <span class="hljs-keyword">end</span>
<span class="hljs-keyword">FROM</span> (
    <span class="hljs-keyword">SELECT</span> NTILE(<span class="hljs-number">1000</span>) <span class="hljs-keyword">OVER</span> (<span class="hljs-keyword">ORDER</span> <span class="hljs-keyword">BY</span> ctid) <span class="hljs-keyword">AS</span> <span class="hljs-keyword">bucket</span>, ctid 
  <span class="hljs-keyword">FROM</span> public.challenge_1br
) subquery
<span class="hljs-keyword">GROUP</span> <span class="hljs-keyword">BY</span> <span class="hljs-keyword">bucket</span> <span class="hljs-keyword">ORDER</span> <span class="hljs-keyword">BY</span> <span class="hljs-keyword">start</span>;
</code></pre>
<p><img src="https://lh7-us.googleusercontent.com/qo41LQQwhVKZT9mWROXxYWr-eKYUu2_EcJ9Elcn49Mfk-vpuIBvz54sBmxWr7W2Z0quqiujKPkQWA3omYn_VGaYf8MWJDVNx4EzcGYFWa4ofE-zMfU9k6U76ZcBsZe5A4o0Tkf3p978w9bpqnN_3MkI" alt /></p>
<h2 id="heading-data-in-transit">Data in Transit</h2>
<p>It is important to process the data in a way that doesn’t overload the system. As we are operating under budget constraints, we need to use techniques that use the hardware effectively. We are going to be using the “<a target="_blank" href="https://twitter.com/garybernhardt/status/600783770925420546?s=20">your dataset fits in RAM</a>” paradigm of systems design. 300M rows for initial load does sound like a lot, but let’s see how we can make it fit in our RAM. We need to process the data to ensure <a target="_blank" href="https://blog.peerdb.io/role-of-data-type-mapping-in-database-replication">data types are mapped correctly to the destination</a>. We are going to convert the query results to <a target="_blank" href="https://avro.apache.org/docs/1.11.0/index.html">Avro</a> for faster loading into warehouses, and also <a target="_blank" href="https://avro.apache.org/docs/1.11.0/spec.html#Logical+Types">for its logical type support</a>.</p>
<h3 id="heading-how-big-is-the-data">How big is the data?</h3>
<p>Let us take a little detour to explore how big the data is. This is a good chance to look at some real-world examples to estimate things. Based on interacting with a lot of production customers, and talking to some experts, it’s safe to say that on average we see ~15 columns per table. In our table, let’s say each row is ~512 bytes.</p>
<pre><code class="lang-python"><span class="hljs-comment"># for initial load</span>
num_rows = <span class="hljs-number">300</span>_000_000
bytes_per_row = <span class="hljs-number">512</span>
total_num_bytes = num_rows * bytes_per_row
total_size_gb = total_num_bytes / <span class="hljs-number">1</span>_000_000_000
<span class="hljs-comment"># total initial load size 153.6 GB</span>

<span class="hljs-comment"># memory required during initial load</span>

num_rows_per_partition = <span class="hljs-number">500</span>_000
mb_per_partition = num_rows_per_partition * bytes_per_row / <span class="hljs-number">1</span>_000_000 <span class="hljs-comment"># 256 MB</span>
num_partitions_in_parallel = <span class="hljs-number">16</span>
required_memory = num_partitions_in_parallel * mb_per_partition <span class="hljs-comment"># 4096 MB</span>
</code></pre>
<h3 id="heading-required-memory">Required Memory</h3>
<p>Based on the above napkin math, we can see that with 4GB of RAM we should be able to do the initial load. We will allocate 8GB of RAM to account for other components.</p>
<h2 id="heading-efficiently-loading-data-into-snowflake">Efficiently loading data into Snowflake</h2>
<p>As mentioned earlier, we are going to store the query results as Avro on disk. We will additionally compress the Avro files using <a target="_blank" href="https://github.com/facebook/zstd">zstd</a> to reduce the disk footprint and save on network costs; a small sketch of that step follows. After that, let's take a brief detour to talk about bandwidth costs.</p>
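<p>A minimal sketch of the compression step, assuming the Python zstandard bindings; file paths and the compression level are placeholders rather than anything prescribed above:</p>
<pre><code class="lang-python"># Sketch: compress a written Avro partition with zstd before shipping it.
import zstandard as zstd

def compress_partition(avro_path):
    cctx = zstd.ZstdCompressor(level=3)  # a reasonable speed/ratio trade-off
    with open(avro_path, "rb") as src, open(avro_path + ".zst", "wb") as dst:
        cctx.copy_stream(src, dst)
</code></pre>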
<h3 id="heading-bandwidth-costs-they-can-break-the-bank">Bandwidth costs: They can break the bank!</h3>
<p>Let's look at the network costs across the major cloud providers; the variance in the numbers is notable.</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Cost per 10GB (egress)</strong></td><td><strong>AWS</strong></td><td><strong>GCP</strong></td><td><strong>Azure</strong></td></tr>
</thead>
<tbody>
<tr>
<td><strong>Within same AZ</strong></td><td>Free</td><td>Free</td><td>Free</td></tr>
<tr>
<td><strong>Within same region (different AZ)</strong></td><td>$0.1</td><td>$0.1</td><td>$0.1</td></tr>
<tr>
<td><strong>Across Regions</strong></td><td>$0.1 - $0.2 (<a target="_blank" href="https://aws.amazon.com/ec2/pricing/on-demand/#Data_Transfer">Depends on Destination</a>)</td><td>$0.2 - $1.4 (<a target="_blank" href="https://cloud.google.com/vpc/network-pricing#inter-region-data-transfer">Depends on source+destination</a>)</td><td>$0.2 - $1.6 (<a target="_blank" href="https://azure.microsoft.com/en-in/pricing/details/bandwidth/">Depends on region + intra/inter-continental</a>)</td></tr>
<tr>
<td><strong>To Internet</strong></td><td>$0.9 - $0.5 (10TB - 150TB)</td><td>$0.8 - $2.3 (<a target="_blank" href="https://cloud.google.com/vpc/network-pricing#internet_egress">Premium tier - Depends on Source+Destination</a>)</td><td>$1.81 - $0.5 (<a target="_blank" href="https://azure.microsoft.com/en-in/pricing/details/bandwidth/">MS Premium NW - Depends on source + usage</a>)</td></tr>
</tbody>
</table>
</div><p>It’s interesting to see the variance in the costs, so it’s best to have Postgres, our system and Snowflake in the same cloud provider and the same region. Let’s now calculate the network costs for this workload.</p>
<h3 id="heading-calculating-network-costs">Calculating Network Costs</h3>
<p>Another thing to be wary of is the warehouse configuration; we will come back to that right after the network math.</p>
<pre><code class="lang-python">bytes_per_row = 512
num_rows = 1_000_000_000
total_data_size_gb = 512
compressed_data_size_gb = 256  # avro + zstd gives at least 2x compression
bandwidth_cost_per_10gb = 0.1  # USD, same-region transfer

# total network costs
# total_data_size_gb * bandwidth_cost_per_10gb / 10, rounded down
network_costs_egress_from_postgres = 5.00  # USD
# compressed_data_size_gb * bandwidth_cost_per_10gb / 10
network_costs_egress_from_system_to_snowflake = 2.56  # USD

network_costs = 7.56  # USD
</code></pre>
<h3 id="heading-snowflake-warehouse-configuration">Snowflake Warehouse Configuration</h3>
<p>In many organizations, a significant portion of Snowflake expenses comes from compute usage, particularly when warehouses sit idle between tasks. Snowflake accrues compute costs based on warehouse operational time, from activation to suspension, and idle warehouse time can often contribute 10%-25% of the total Snowflake compute costs. The Baselit team wrote an excellent blog about this: <a target="_blank" href="https://baselit.ai/blogs/fastest-way-save-snowflake">read more about it here</a>.</p>
<p>We will do two things: set <code>AUTO_SUSPEND</code> to 60 seconds, so the warehouse idles for at most a minute after the last query before pausing, and keep the warehouse active for as little time as possible. This is the default configuration you get if you follow the <a target="_blank" href="https://docs.peerdb.io/connect/snowflake">PeerDB Snowflake setup guide</a>.</p>
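<p>For reference, here is a hedged sketch of applying that setting with the Snowflake Python connector; the account, credentials, role and warehouse name are placeholders, not values from the setup guide.</p>
<pre><code class="lang-python"># Sketch: cap idle time on the loading warehouse at 60 seconds.
# Account, credentials and warehouse name are placeholders.
import snowflake.connector

conn = snowflake.connector.connect(
    account="my_account",
    user="my_user",
    password="my_password",
    role="ACCOUNTADMIN",
)
conn.cursor().execute(
    "ALTER WAREHOUSE peerdb_wh SET AUTO_SUSPEND = 60 AUTO_RESUME = TRUE"
)
</code></pre>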
<h2 id="heading-inserts-updates-and-deletes">Inserts, Updates and Deletes</h2>
<p>The next challenge after the initial load is to read the change data from Postgres and replay it to Snowflake. We are going to do that using Postgres’ logical replication: at the start of replication, we create a replication slot with the <a target="_blank" href="https://www.postgresql.org/docs/current/logical-replication-architecture.html">pgoutput</a> plugin, which is the recommended way to read changes from the slot. Once we read the changes from the slot, we batch them and then load them to Snowflake.</p>
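<p>As a sketch of that setup step (the slot and publication names are placeholders, and the source is assumed to have <code>wal_level = logical</code>), creating the publication and the pgoutput slot looks roughly like this:</p>
<pre><code class="lang-python"># Sketch: create a publication and a pgoutput replication slot for CDC.
# Slot and publication names are placeholders.
import psycopg2

DSN = "postgresql://user:password@source-host:5432/db"  # placeholder

conn = psycopg2.connect(DSN)
conn.autocommit = True  # slot creation cannot run in a transaction that has written
cur = conn.cursor()
cur.execute("CREATE PUBLICATION peerdb_pub FOR TABLE public.challenge_1br")
cur.execute(
    "SELECT pg_create_logical_replication_slot(%s, %s)",
    ("peerdb_slot", "pgoutput"),
)
</code></pre>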
<p>As we discussed earlier, it is important to keep the Snowflake warehouse suspended for as long as we can, and batching helps with that. We accumulate records in batches of 1M, write them to Avro as before, and load them to an <a target="_blank" href="https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage">internal stage</a> in Snowflake. Once the data is loaded into the stage, we <a target="_blank" href="https://docs.snowflake.com/en/sql-reference/sql/merge">MERGE</a> the records from the stage into the destination table. This way most of the heavy lifting of conflict resolution is left to the warehouse, which simplifies our system.</p>
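<p>A rough sketch of one batch cycle is below. The stage, named file format, and key/column names are hypothetical, and a real pipeline would also handle deletes (for example a <code>WHEN MATCHED ... THEN DELETE</code> branch) and the full column list of the table.</p>
<pre><code class="lang-python"># Sketch: push one Avro batch to an internal stage, then MERGE it into the target.
# Stage name, file format and columns are placeholders.
MERGE_SQL = """
MERGE INTO public.challenge_1br AS dst
USING (
    SELECT $1:id::bigint AS id, $1:amount::double AS amount
    FROM @peerdb_stage/batch_0001.avro (FILE_FORMAT => 'peerdb_avro')
) AS src
ON dst.id = src.id
WHEN MATCHED THEN UPDATE SET dst.amount = src.amount
WHEN NOT MATCHED THEN INSERT (id, amount) VALUES (src.id, src.amount)
"""

def load_batch(conn, local_path):
    # conn is a snowflake.connector connection; the PUT uploads the local file
    cur = conn.cursor()
    cur.execute(f"PUT file://{local_path} @peerdb_stage AUTO_COMPRESS=FALSE")
    cur.execute(MERGE_SQL)
</code></pre>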
<h2 id="heading-tools">Tools</h2>
<p>At <a target="_blank" href="https://www.peerdb.io/">PeerDB</a>, we are building a specialized data-movement tool for Postgres with a laser focus on Postgres to Data Warehouse replication. Most of the above optimizations, including <a target="_blank" href="https://blog.peerdb.io/parallelized-initial-load-for-cdc-based-streaming-from-postgres">parallel initial load</a>, <a target="_blank" href="https://blog.peerdb.io/reducing-bigquery-costs-by-260x">reducing Data Warehouse costs</a>, <a target="_blank" href="https://blog.peerdb.io/role-of-data-type-mapping-in-database-replication">native data-type mapping</a>, <a target="_blank" href="https://github.com/PeerDB-io/peerdb/pull/111">support for TOAST columns</a>, and <a target="_blank" href="https://blog.peerdb.io/using-temporal-to-scale-data-synchronization-at-peerdb">fault-tolerance and auto recovery</a>, are already baked into the product. PeerDB is also <a target="_blank" href="https://github.com/PeerDB-io/peerdb">Free and Open</a>, so we chose PeerDB to implement the above workload.</p>
<h2 id="heading-hardware">Hardware</h2>
<p>Now that we have landed on 8GB of RAM, let us move on to picking the instance type.</p>
<p>Since ARM uses less energy than x64 (due to being RISC), ARM instances are around 25% cheaper than x64 machines. The tradeoff is clock speed: x64 machines run at around 2.9GHz with a 3.5GHz turbo (M6i instances), while ARM machines run at about 2.5GHz (Graviton2 - M6g), and M6i instances are about 30% more expensive than M6g instances.</p>
<p>The effective cost is $0.0409/GHz for x64 vs $0.03616/GHz for ARM, so x64 costs about 13% more per GHz. <strong>But cost per GHz is not the determining factor for CDC reads from Postgres, since a replication slot can only be read by a single process at a time.</strong></p>
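<p>In napkin form, using the per-GHz figures above:</p>
<pre><code class="lang-python"># Per-GHz premium of x64 over ARM, from the figures quoted above
x64_cost_per_ghz = 0.0409   # $/GHz
arm_cost_per_ghz = 0.03616  # $/GHz
premium = x64_cost_per_ghz / arm_cost_per_ghz - 1  # ~0.13, i.e. about 13% more per GHz
</code></pre>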
<p>For this current experiment, I went with <code>m6gd.large</code> as it offers a good balance of speed and disk.</p>
<p><strong>Optional read:</strong> In this blog we use AWS for our analysis. However, here are some other learnings we had on this topic. OVH Cloud currently <a target="_blank" href="https://github.com/ovh/public-cloud-roadmap/issues/343">does not support ARM</a> instances and has a similar $0.118/hour <code>c2-7</code> instance (in limited regions), which has a <a target="_blank" href="https://www.ovhcloud.com/asia/public-cloud/prices/#410">very low network speed</a> (250MBps) and 50GB of SSD. <a target="_blank" href="https://www.hetzner.com/cloud/">Hetzner</a> has a <code>CCX13</code> instance at $0.0292/hour (including a 118GB SSD) but no dedicated ARM instances.</p>
<p><img src="https://lh7-us.googleusercontent.com/Mmy4pyDQ_qnBrX7PJH830MLgSj0s4u1UOZOzMhlr32YrA8ewQzEMOCkk6e3bXkgeVYfwVYrLt_Ofs6YENVIos5N_daFgBp2t6KZ59n9X2EWvwJWmVsCmAU82yyHzAxQuc0BX7ccGmpZqpdsEszTO4lU" alt /></p>
<h1 id="heading-conclusion">Conclusion</h1>
<p>One question I'm often asked is: <strong>“Is this practical?”</strong> Yes, one machine can die, but systems built on a single machine have a <a target="_blank" href="https://twitter.com/danluu/status/1586180166631706624?s=20">remarkable amount of uptime</a>, especially when the state is stored in a durable way.</p>
<p>Back to the topic at hand. If we look at the total cost of the system we built (assuming <code>us-west-2</code> as the region), this is the breakdown over a month:</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td>Cost Category</td><td>Cost</td><td>Comment</td></tr>
</thead>
<tbody>
<tr>
<td>Hardware</td><td>$65.992 / month</td><td>AWS m6gd.large (2 vCPUs, 8 GB RAM); comes with 118 GB NVMe which is great!</td></tr>
<tr>
<td>Network</td><td>$7.56</td><td>AWS network transfer same region 500 GB (with compression)</td></tr>
<tr>
<td>Warehouse</td><td>N/A</td><td>Warehouse compute costs are common across vendors, so they are excluded here</td></tr>
<tr>
<td><strong>Total</strong></td><td><strong>$73.552</strong></td><td><strong>Hardware Costs + Network costs = $65.992 + $7.56 = $73.552 (Within $100 budget)</strong></td></tr>
</tbody>
</table>
</div><p>If we look at what various ETL tools charge for moving 1 billion rows, here is how it compares:</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Vendor</strong></td><td><strong>Cost per 1 billion records</strong></td></tr>
</thead>
<tbody>
<tr>
<td>Fivetran</td><td>$23,157.89</td></tr>
<tr>
<td>Airbyte</td><td>$11,760.00</td></tr>
<tr>
<td>Stitch Data</td><td>$4,166.67</td></tr>
<tr>
<td>Above Approach (using <a target="_blank" href="https://github.com/PeerDB-io/peerdb">PeerDB OSS</a>)</td><td>$73.552</td></tr>
</tbody>
</table>
</div><p>I am part of a company building software for moving data specifically from Postgres to data warehouses, and it's my job to figure out how to provide the best experience to our customers. Doing this project forced me to find the best bang for the buck, and to fold a lot of the explored features <a target="_blank" href="https://www.peerdb.io/">into PeerDB</a>. I hope it conveys some appreciation for what modern hardware is capable of, and how much you can get out of it.</p>
]]></content:encoded></item><item><title><![CDATA[PeerDB UI - Deeper Dive: Part 1]]></title><description><![CDATA[At PeerDB, we are building a fast and cost-effective way to replicate data from Postgres to Data Warehouses such as BigQuery, Snowflake and ClickHouse.
When building PeerDB UI, we wanted it to be minimal but effective. Features were driven by what th...]]></description><link>https://blog.peerdb.io/peerdb-ui-deeper-dive-part-1</link><guid isPermaLink="true">https://blog.peerdb.io/peerdb-ui-deeper-dive-part-1</guid><category><![CDATA[PostgreSQL]]></category><category><![CDATA[Design]]></category><category><![CDATA[replication]]></category><category><![CDATA[cdc]]></category><category><![CDATA[ClickHouse]]></category><category><![CDATA[snowflake]]></category><category><![CDATA[bigquery]]></category><category><![CDATA[PeerDB]]></category><dc:creator><![CDATA[Kaushik Iska]]></dc:creator><pubDate>Fri, 16 Feb 2024 17:44:22 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1708105260023/49420afa-9c62-42fc-a2ba-ae56037f0b56.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>At PeerDB, we are building a fast and cost-effective way to replicate data from Postgres to Data Warehouses such as BigQuery, Snowflake and ClickHouse.</p>
<p>When building PeerDB UI, we wanted it to be minimal but effective. Features were driven by what the customers really needed, while keeping the bloat low. For this article, I've asked the team to share their favorite part about the UI.</p>
<h2 id="heading-replication-slot-growth-chart">Replication Slot Growth Chart</h2>
<p>This chart shows the size of the replication slot in GB over time.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1708012762932/c9bfcbf5-8d28-4828-bd69-a0f99c3b60c9.jpeg" alt class="image--center mx-auto" /></p>
<h2 id="heading-activity-monitor">Activity Monitor</h2>
<p>This view captures all the activity and connections open for the database.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1708013023555/6d0d1b89-48ce-4f6a-b8f9-2952c8115f6f.jpeg" alt class="image--center mx-auto" /></p>
<h2 id="heading-slack-alerts">Slack Alerts</h2>
<p>Alerting configuration for Slack. Staying true to our minimal roots, we simply show the configured alerts channel.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1708013242662/5ff290b3-8506-4da0-8d77-8224d4acbfbf.jpeg" alt class="image--center mx-auto" /></p>
<h2 id="heading-mirror-rows-over-time">Mirror Rows Over Time</h2>
<p>A simple histogram showing the number of rows synced over time.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1708013503552/efeede23-46f9-499e-8e9a-cc43cfb5929b.jpeg" alt class="image--center mx-auto" /></p>
<h2 id="heading-timezone-selector">Timezone Selector</h2>
<p>A simple touch that lets you pick the timezone.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1708013553856/f8b065b8-2886-4158-9b8c-bc095383a744.png" alt class="image--center mx-auto" /></p>
<h1 id="heading-conclusion">Conclusion</h1>
<p>This is just a glimpse of the PeerDB UI. We covered some of the features that make it unique and helpful, and we hope to add many more.</p>
<p>We hope you enjoyed reading the blog. If you're a Postgres user and wish to replicate data from Postgres to Snowflake/BigQuery/ClickHouse using PeerDB, please check out the links below or reach out to us directly!</p>
<ol>
<li><p><a target="_blank" href="https://app.peerdb.cloud/"><strong>Try PeerDB Cloud for free.</strong></a></p>
</li>
<li><p><a target="_blank" href="https://github.com/PeerDB-io/peerdb"><strong>Visit PeerDB's GitHub repository to Get Started.</strong></a></p>
</li>
</ol>
]]></content:encoded></item></channel></rss>