Skip to main content
Shopify integration

Shopify data integration,
on autopilot.

Sync Shopify orders, products, customers, and inventory to any database, warehouse, or spreadsheet — in plain English. Source-available, self-hosted, no per-row fees.

TL;DR

rsync.ai connects to Shopify via the Admin GraphQL API (v2024-10), auto-discovers your resources, and runs scheduled syncs (every minute, hour, or day — described in plain English). Customer PII is excluded from default queries. Works with Postgres, MySQL, BigQuery, Snowflake, Google Sheets, S3, and more.

  • Plain-English pipeline setup — no YAML, no DAGs
  • Auto schema discovery for orders, products, customers, collections, inventory
  • Self-hosted — Shopify credentials never leave your network
  • Source-available under Elastic License 2.0 — code on GitHub

Shopify resources synced today

Schema for these is auto-discovered on connect via the Shopify Admin GraphQL API (v2024-10).

OrdersProductsProduct variantsCustomersInventory itemsInventory levelsCollectionsShop+ more on the roadmap (metafields, fulfillments, draft orders)

rsync.ai vs. the Shopify ETL alternatives

A quick honest comparison for syncing Shopify into your data stack.

Feature
rsync.aiyou
Fivetran
Airbyte
Zapier
Plain-English pipeline creation
Scheduled Shopify → DB / Sheets sync
Self-hosted (Shopify data stays on your servers)
Source-available connector code (auditable)
Shopify customer PII excluded by default
No per-row / per-MAR pricing
Sync to Google Sheets natively

Shopify integration — frequently asked

What Shopify resources can rsync.ai sync today?

Six resources today: orders, products, customers, collections, inventory items, and shop. Product variants and inventory levels come embedded with their parent. The connector talks to the Shopify Admin GraphQL API (version 2024-10). Schema is discovered the moment you connect — you tick the resources you want to sync.

Is the Shopify sync real-time, or scheduled?

Scheduled batch today. You set the cadence in plain English — "every 5 minutes", "hourly", or "every day at 2am". Webhook-based real-time sync is on the roadmap. In practice, a 1–5 minute schedule feels real-time for any store under ~10k orders a day.

How does rsync.ai handle Shopify API rate limits?

Shopify throttles its GraphQL API on a server-side cost budget. The connector paginates politely (up to 250 records per request) and backs off when Shopify says to slow down. If a sync interrupts, the next run resumes from the last saved cursor — no duplicate rows, no manual restart.

Does Shopify customer PII reach my destination?

By default, no. The connector skips customer email, phone, and addresses on every query. Those fields are never fetched, so they can't reach your destination. That keeps you safely inside the Shopify Basic plan — no protected-customer-data approval needed. If you do need them, edit the GraphQL query in your self-hosted copy.

Can I sync from multiple Shopify stores into one destination?

Yes. Add each Shopify store as a separate connection. Each store's data lands in its own schema — `shopify_brand_a.orders`, `shopify_brand_b.orders`, and so on. No collisions, and you can join across them at query time.

What destinations does rsync.ai support for Shopify?

Built-in destinations today: PostgreSQL, MySQL, MariaDB, SQLite, Snowflake, BigQuery, Redshift, ClickHouse, Google Sheets, S3, and a Kafka sink for streaming. Need something else? Point the AI Tool Generator at the destination's docs URL and it builds a working connector.

How is the Shopify connector licensed and priced?

rsync.ai is source-available under the Elastic License 2.0 — free to self-host, but it can't be resold as a managed service. The Shopify connector ships in the same repo. No per-row, per-MAR, or per-connector fees. You pay only for the compute you run it on.

Ship your Shopify pipeline today.

Pick a destination — MySQL, Postgres, or Google Sheets — and have data flowing in under 5 minutes.