MySQL Reverse ETL

Updated December 29, 2025

Import data from MySQL to any destination. This source saves you the trouble of writing code to extract, transform, and load data from your database. Instead, specify the query (or queries) to run, and we’ll handle the rest.

Best Practices

Before you add a Reverse ETL source, you should take some measures to ensure the security of your customers’ data and limit performance impacts to your database and Customer.io workspace.

Create a new database user/service account. Implement a database user with minimal privileges specifically for Customer.io import/sync operations. This user only requires read permissions, with access limited to the tables you want to sync from.
Avoid using your main database instance. Consider creating a read-only database instance with replication in place, lightening the load and preventing data loss on your main instance.
Sync only the data that you’ll use in Customer.io. Limiting your query improves performance and minimizes the potential to expose sensitive data. Select only the columns you care about, and make sure you use {{last_sync_time}} to limit your query to data that changed since the previous sync.
Limit your sync frequency so you don’t sync more than necessary and consume unnecessary resources. If the previous reverse ETL operation is still in progress when the next interval occurs, we’ll skip the operation and catch up your data on the next interval. You should monitor your first few reverse ETL intervals to ensure that your sync doesn’t impact your system’s security and performance—frequently skipped operations may indicate that you’re syncing too often.
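
As a rough sketch of the first practice above, the statements below create a dedicated read-only account. The user name, schema name, and table names are hypothetical placeholders, and the `'%'` host allows connections from any address, so you may prefer to restrict it to the IP addresses for your account region.

```sql
-- Hypothetical names: customerio_sync (user), app_db (schema), users and events (tables).
CREATE USER 'customerio_sync'@'%' IDENTIFIED BY 'replace-with-a-strong-password';

-- Read-only access, limited to the tables that your syncs select from.
GRANT SELECT ON app_db.users TO 'customerio_sync'@'%';
GRANT SELECT ON app_db.events TO 'customerio_sync'@'%';

-- Review the result: expect USAGE plus the two SELECT grants and nothing else.
SHOW GRANTS FOR 'customerio_sync'@'%';
```

Because the account holds only SELECT privileges, a mistyped query can't modify or delete data.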

Sending excessive data can impact your account’s performance

You should not run queries that return large data sets—millions of rows—more than once per day. Doing so may impact workspace performance, including delaying campaigns and messages.
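
One way to gauge whether a query falls into this range is to count the rows that changed over the last day before you enable the sync. This is a sketch only: it assumes a `users` table whose `last_updated` column stores Unix seconds, like the examples on this page.

```sql
-- Rough estimate of how many rows a once-a-day sync would return.
-- Assumes last_updated stores Unix seconds.
SELECT COUNT(*) AS rows_changed_last_day
FROM users
WHERE last_updated >= UNIX_TIMESTAMP(NOW() - INTERVAL 1 DAY);
```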

Granting us access to your database

We officially support MySQL 5.7 and newer. An older database version might work, but we can’t guarantee it.

We support both SSL and non-SSL database connections. As a part of setup, you’ll need to provide the credentials of a database user with read access to the tables you want to select data from.

If you use a firewall or an allowlist, you must allow the following IP addresses so we can connect to your database. Make sure you use the correct IP addresses for your account region.

US Region	EU Region
34.29.50.4	34.22.168.136
35.222.130.209	34.78.194.61
34.122.196.49	104.155.37.221

Set up your MySQL integration

As a part of this setup, you’ll provide Customer.io with MySQL user credentials that we’ll use to query your database. We recommend that you create a new user with Read Only access specifically for Customer.io, so you can manage Customer.io access to your database independent of any other MySQL users you have.

Your database must allow connections from the Customer.io IP addresses for your account region, listed under Granting us access to your database above. If our IP addresses are blocked, we won’t be able to connect to your database.

Go to Data & Integrations > Integrations. In the Directory tab, pick MySQL.
Provide your database information, including credentials to connect to your database, and click Save database.
- The Name is a friendly name you’ll use to recognize your database whenever you reference it in Customer.io.
- Enter Host address and the name of the database you want to connect to.
- Enter a database user’s credentials and click Add database. We suggest that you use a user with read-only credentials for your database. While our integration won’t write to your database, using read-only credentials ensures that you can’t inadvertently make changes to your database through your query.
- Toggle the options for SSL or SSH tunneling if necessary.
Set up a sync and click Next: Define Query. A sync is the type of data (identify, track, etc.) you want to import from your database. You can set up syncs for each type of data you want to import.
1. Provide a Name and Description for the sync. This helps you understand the sync at a glance when you look at your integration’s Overview later.
2. Select the type of data you want to import.
3. Set the Sync Frequency, indicating how often you want to query your database for new data. You should set the frequency such that sync operations don’t overlap. Learn more about sync frequency.
4. Select when you want to start the sync: whether you want to begin importing data immediately, or schedule the sync to start at a later date.
Enter the query that selects the data you want to import. See Queries below for more information about the information you’ll want to select for your sync. Click Run Query to preview results and make sure that your query selects the right information.
Click Enable to enable your sync.

Now you can set up additional syncs and connect your integration to one or more destinations.

Adding syncs

After you set up your incoming integration, you can add additional syncs to import different types of data from your database. For example, you might want to import identify data for your users, and track data for their actions. Subsequent syncs can rely on your existing database, or you can add another database within your integration.

In your integration, go to the Syncs tab and click Add Sync.
Select your database or add a new one and click Next: Create Sync.
Set up a sync and click Next: Define Query. A sync is the type of source data (identify, track, etc.) you want to import from your database; it is essentially the type of source call you want to make. You can set up syncs for each type of data you want to import.
1. Provide a Name and Description for the sync. This helps you understand the sync at a glance when you look at your source Overview later.
2. Select the type of data you want to import.
3. Set the Sync Frequency, indicating how often you want to query your database for new data. You should set the frequency such that sync operations don’t overlap. Learn more about sync frequency.
4. Select when you want to start the sync: whether you want to begin importing data immediately, or schedule the sync to start at a later date.
Enter the query that selects the data you want to import. See Queries below for more information about the information you’ll want to select for your sync. Click Run Query to preview results and make sure that your query selects the right information.
Click Enable to enable your sync.

Sync Frequency

You can sync data as often as every minute. However, we recommend that you set your sync frequency such that sync operations don’t overlap. If a sync operation is scheduled to start while the previous operation is still in progress, we’ll skip the next sync operation.

Semantic events: Deleting people, groups, and more

You may notice that this integration doesn’t have sync types to delete people, groups, or other objects. To do these kinds of operations, you’ll use what we call semantic events. These are events with specific names that indicate a delete operation. When your Track sync picks up events with an event name we recognize, we’ll perform the associated action—like deleting a person or group.

For example, if you send an event with the name User Deleted, we’ll delete the person from your workspace. See Customer.io Semantic Events for more information.

The semantic events we support are:

Event Name	Action
`Device Added or Updated`	Add or update a mobile device.
`Device Deleted`	Delete a mobile device.
`User Deleted`	Delete a person.
`Object Deleted`	Delete a custom object.
`Relationship Deleted`	Delete a relationship.
`Suppress Person`	Suppress a person.
`Unsuppress Person`	Unsuppress a person.
`Report Delivery Event`	Report in-app message events (like delivery, open, click) outside of our JavaScript integration.
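
As an illustration, a Track sync could emit `User Deleted` events from a table that records deletions. The `deleted_users` table and its columns are hypothetical, and `deleted_at` is assumed to hold Unix seconds.

```sql
-- Hypothetical table: deleted_users(user_id, deleted_at), with deleted_at in Unix seconds.
SELECT
  user_id AS userId,
  'User Deleted' AS event
FROM deleted_users
WHERE deleted_at >= {{last_sync_time}}
```

Each returned row becomes a track event whose name matches a semantic event in the table above.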

Queries for each sync type

When you create a database sync, you provide a query selecting the people or objects you want to import, and respective properties. You’ll build your queries using the same principles from our Pipelines API.

Each row returned from your query represents an individual operation (like an identify call, a track event, etc). Columns represent the traits or properties that you want to apply to the person, group, or event that your sync imports.

While we support queries that return millions of rows and hundreds of columns, syncing large amounts of data more than once a day can impact your account’s performance, including delaying campaigns or messages. When you set up your query, consider how much data you want to send and how often, and make sure you limit your results using the last_sync_time.

Make sure you compare timestamps against last_sync_time

Our examples below include a last_sync_time value. You must compare a timestamp to this value to avoid sending duplicate traffic to Customer.io, which could impact your workspace’s performance.

`last_sync_time` and limiting your results

You can send data to Customer.io only for records that have changed since the last sync by comparing timestamps against the last_sync_time value. This helps you avoid syncing the same records over and over again—which can cause syncs to take longer and, in extreme cases, can impact your workspace’s performance.

We expose last_sync_time as a Unix timestamp representing the date-time when the last successfully completed sync started. By comparing a timestamp against this value, you’ll only sync records that have changed since the last sync.

For your first sync, the last_sync_time is 0, so you’ll sync all records. After that, you’ll just get the changeset.
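
Because last_sync_time is a Unix timestamp, a column stored as DATETIME or TIMESTAMP may need a conversion before the comparison. The sketch below assumes a `users` table with such a `last_updated` column. `FROM_UNIXTIME` converts using the session time zone, so it also assumes the column's values are stored in that same zone.

```sql
-- Assumes users.last_updated is a DATETIME or TIMESTAMP column rather than Unix seconds.
-- Converting the placeholder (not the column) keeps an index on last_updated usable.
SELECT id AS userId, email_address AS email, fname, lname
FROM users
WHERE last_updated >= FROM_UNIXTIME({{last_sync_time}})
```

On the first sync the placeholder is 0, which converts to the start of the Unix epoch, so the query still returns all records.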

Identify

The identify method tells us who someone is and lets you assign unique traits to a person. A trait is a key-value pair that you associate with a person or an object, like a person’s name, the date they were created in your workspace, or a company’s billing date. Use these attributes to target people and personalize messages. Your query should compare a timestamp to the last_sync_time to ensure that you only import new data.

You can identify people by anonymousId and/or userId.

anonymousId only: This assigns traits to a person before you know who they are.
userId only: Identifies a user and sets traits.
both userId and anonymousId: Associates the data from the anonymousId with the person you identify by userId.

SELECT id AS userId, email_address AS email, fname, lname, msisdn AS phone
FROM users
WHERE last_updated >= {{last_sync_time}}

integrations object
Contains a list of booleans indicating the integrations that are enabled (true) or disabled (false). By default, all integrations are enabled (returning an empty object). Set "All": false to reverse this behavior.
- Enabled/Disabled integrations* boolean
timestamp string (date-time)
The ISO-8601 timestamp when the event originally took place. This is mostly useful when you backfill past events. If you’re not backfilling data, you can leave this field empty and we’ll use the current time or server time.
traits object
Additional properties that you know about a person. We’ve listed some common/reserved traits below, but you can add any traits that you might use in another system.
- createdAt string (date-time)
  We recommend that you pass date-time values as ISO 8601 date-time strings. We convert this value to fit destinations where appropriate.
- email string
  A person’s email address. In some cases, you can pass an empty userId and we’ll use this value to identify a person.
- Additional Traits* any type
  Traits that you want to set on a person. These can take any JSON shape.
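
To pass createdAt as an ISO 8601 string, as recommended above, you can format the value in the query itself. This sketch assumes a hypothetical `created_at` DATETIME column that stores UTC values, and a `last_updated` column in Unix seconds as in the earlier example.

```sql
-- Assumes users.created_at is a DATETIME stored in UTC and last_updated holds Unix seconds.
SELECT
  id AS userId,
  email_address AS email,
  DATE_FORMAT(created_at, '%Y-%m-%dT%H:%i:%sZ') AS createdAt
FROM users
WHERE last_updated >= {{last_sync_time}}
```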

Identify people by email or ID

If you identify people by email and a unique ID, you can use a CASE statement or the COALESCE function to set the userId to prioritize the customer ID when available, falling back to email for people who don’t have a unique ID yet. This kind of setup is common when you support both leads (identified by email) and customers (identified by a unique ID after they make a purchase, or otherwise convert).

COALESCE

The COALESCE function returns the first non-null value from the list of arguments:

SELECT
  COALESCE(CAST(user_id AS CHAR), email) AS userId,
  email,
  first_name,
  last_name
FROM users
WHERE last_updated >= {{last_sync_time}}

CASE

The CASE statement checks if user_id exists. If it does, it converts the ID to a string; otherwise, it uses the email address:

SELECT
  CASE
    WHEN user_id IS NOT NULL THEN CAST(user_id AS CHAR)
    ELSE email
  END AS userId,
  email,
  first_name,
  last_name
FROM users
WHERE last_updated >= {{last_sync_time}}
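
Both variants above fall back to email only when the ID is NULL. If your IDs can also be empty strings, one option is to turn blanks into NULL first with `NULLIF`. The `external_id` column here is a hypothetical VARCHAR used for illustration.

```sql
-- Hypothetical column: external_id is a VARCHAR that may be NULL, empty, or whitespace.
SELECT
  COALESCE(NULLIF(TRIM(external_id), ''), email) AS userId,
  email,
  first_name,
  last_name
FROM users
WHERE last_updated >= {{last_sync_time}}
```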

Track

The track method records things people do. Every track call represents an event.

You should track your audience’s activities with events both as performance indicators and so you can respond to your audience’s activities with campaigns in Journeys. Campaigns are automated workflows you set up to send people messages and perform other actions when they meet your criteria. For example, if your audience performs a Video Viewed or Item Purchased event, you might respond with other videos or products the person might enjoy.

Track calls require an event name describing what a person did. They must also include an anonymousId or a userId. Calls that you make with an anonymousId are associated with a userId when you identify someone by their userId.

In most cases, your query should compare a timestamp to the last_sync_time to ensure that you only import new events.

SELECT id AS userId, event_name AS event, products, total_price AS value
FROM events
WHERE timestamp > {{last_sync_time}}

event string
Required The name of the event
integrations object
Contains a list of booleans indicating the integrations that are enabled (true) or disabled (false). By default, all integrations are enabled (returning an empty object). Set "All": false to reverse this behavior.
- Enabled/Disabled integrations* boolean
properties object
Additional properties for your event.
- Event Properties* any type
  Additional properties that you want to capture in the event. These can take any JSON shape.
timestamp string (date-time)
The ISO-8601 timestamp when the event originally took place. This is mostly useful when you backfill past events. If you’re not backfilling data, you can leave this field empty and we’ll use the current time or server time.

Backfilling events

In your initial sync, the last_sync_time is 0, and we’ll capture all events that otherwise match your query. After that, we only capture events that occur after the last_sync_time—events that occurred since the previous sync. This prevents you from importing the same events multiple times, but also means that you can’t backfill event history.

If you need to backfill event history after your initial sync, you’ll need to set up a new sync to import the events you want to backfill. In general, you’ll:

Create a new sync with a new query that captures the events you want to backfill.
Run the sync to backfill events.
Disable the backfilling sync so that you don’t capture events that your normal event query would otherwise import.
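
A backfill query for the steps above might look like the following sketch. It selects a fixed historical window instead of comparing against last_sync_time. The column names and date range are hypothetical, and it assumes that a column named `timestamp` maps to the timestamp field described earlier.

```sql
-- One-off backfill for a fixed window. Hypothetical columns: user_id, event_name,
-- and occurred_at (a DATETIME stored in UTC).
SELECT
  user_id AS userId,
  event_name AS event,
  DATE_FORMAT(occurred_at, '%Y-%m-%dT%H:%i:%sZ') AS `timestamp`
FROM events
WHERE occurred_at >= '2024-01-01 00:00:00'
  AND occurred_at <  '2024-07-01 00:00:00'
```

Since this query has no last_sync_time filter, every run returns the same rows, which is why the backfilling sync should be disabled after it completes.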

Group

The Group method associates a person with a group, like a company, organization, project, online class or any other collective noun you come up with for the same concept. In Customer.io Journeys, we call groups objects. An object is a non-person entity that you can associate with one or more people, like a company, account, or online course. If the group/object or person in your group call doesn’t exist, this operation creates it.

Group calls require a groupId to represent the group. In almost every case, a group call should also include a userId to associate the person with the group. You can also include traits to provide additional information about the group (or the relationship between the person and the group). Find more details about the group method in our API specifications.

Your query should compare a timestamp to the last_sync_time to ensure that you only import new data.

Remember, group calls represent both an organization/group and relationships with users (by userId). Your query should include not only the groupId, but the userId so that you can capture relationships between users and groups.

If the userId doesn’t exist in Customer.io, we’ll create a new person to represent the new userId and their relationship to the group.

SELECT companyId AS groupId, objectTypeId, companyname, employees, personId AS userId
FROM companies
WHERE last_updated >= {{last_sync_time}}

Include objectTypeId when you send data to Customer.io

Customer.io supports different kinds of groups (called objects), where each object has an object type represented by an incrementing integer beginning at 1. If you send group calls to Customer.io, you should include the object type ID or we’ll assume that the object type is 1.
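
If your table doesn't store the object type, you can supply it as a literal in the query. This sketch uses a hypothetical `courses` table and assumes that online courses are object type 2 in your workspace, so check your own object type IDs before using it.

```sql
-- Hypothetical table: courses(course_id, title, student_id, last_updated).
-- Assumes online courses are object type 2 in the workspace.
SELECT
  course_id AS groupId,
  2 AS objectTypeId,
  title,
  student_id AS userId
FROM courses
WHERE last_updated >= {{last_sync_time}}
```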

groupId string
Required ID of the group
integrations object
Contains a list of booleans indicating the integrations that are enabled (true) or disabled (false). By default, all integrations are enabled (returning an empty object). Set "All": false to reverse this behavior.
- Enabled/Disabled integrations* boolean
timestamp string (date-time)
The ISO-8601 timestamp when the event originally took place. This is mostly useful when you backfill past events. If you’re not backfilling data, you can leave this field empty and we’ll use the current time or server time.
traits object
Additional information about the group.
- Group Traits* any type
  Additional traits you want to associate with this group.

Relationship attributes

In Customer.io, you can assign attributes to both the group (called a custom object in Customer.io) and to the relationship between the object and the person. A relationship is the connection between an object and a person in your workspace. For instance, if you have Account objects, people could have relationships to an Account if they’re admins. By default, attributes are stored on the custom object itself, but you can assign relationship attributes using the relationshipAttributes JSON object.

SELECT companyId AS groupId, objectTypeId, companyname, employees, personId AS userId,
    JSON_OBJECT(
        'is_manager', is_manager,
        'role', role,
        'start_date', start_date,
        'department', department
    ) AS relationshipAttributes
FROM companies
WHERE last_updated >= {{last_sync_time}}

Page

The Page method records page views on your website, along with optional extra information about the page a person visited.

Your query should compare a timestamp to the last_sync_time to ensure that you only import new data.

SELECT id AS userId, metatitle AS name, url, time_on_page
FROM pages
WHERE timestamp > {{last_sync_time}}

integrations object
Contains a list of booleans indicating the integrations that are enabled (true) or disabled (false). By default, all integrations are enabled (returning an empty object). Set "All": false to reverse this behavior.
- Enabled/Disabled integrations* boolean
name string
Required The name of the page.
properties object
Additional properties for your event.
- category string
  The category of the page. This might be useful if you have single-page routes or a flattened URL structure.
- Page Properties* any type
  Additional properties that you want to send with the page event. By default, we capture `url`, `title`, and other page details.
timestamp string (date-time)
The ISO-8601 timestamp when the event originally took place. This is mostly useful when you backfill past events. If you’re not backfilling data, you can leave this field empty and we’ll use the current time or server time.

Screen

The Screen method sends screen view events for mobile devices. These help you understand the screens that people use in your app.

Your query should compare a timestamp to the last_sync_time to ensure that you only import new data.

SELECT id AS userId, screen_name AS name, session_started
FROM screens
WHERE timestamp > {{last_sync_time}}

name string
Required The name of the screen the person visited.
properties object
Additional properties for your screen.
timestamp string (date-time)
The ISO-8601 timestamp when the event originally took place. This is mostly useful when you backfill past events. If you’re not backfilling data, you can leave this field empty and we’ll use the current time or server time.

Alias

The Alias method combines two previously unassociated user identities. Some integrations automatically reconcile profiles with different identifiers based on whether you send anonymousId, userId, or another trait that the integration expects to be unique. But for integrations that don’t, you may need to send alias requests to do this.

In general, you won’t need to use the alias call; we try to handle user identification gracefully so you don’t need to merge profiles. But you may need to send alias calls to manage user identities in some data-out integrations.

For example, in Mixpanel it’s used to associate an anonymous user with an identified user once they sign up.

SELECT id AS userId, old_id AS previousId
FROM user_resolution
WHERE timestamp >= {{last_sync_time}}

previousId string
Required The userId that you want to merge into the canonical profile.
userId string
Required The userId that you want to keep. This is required if you haven’t already identified someone with one of our web or server-side libraries.
