Release Status Released Availability Free
Status Page Mixpanel Status Page Default Historical Sync 7 days
Whitelist Tables/Columns Supported/Unsupported Default Replication Frequency 30 minutes
Full Table Endpoints 1 Incremental Endpoints 2
Destination Incompatibilities Possible incompatibilities. Learn more.

Connecting Mixpanel

Connecting your Mixpanel data to Stitch is a five-step process:

  1. Retrieve your Mixpanel API credentials
  2. Add Mixpanel as a Stitch data source
  3. Define the Historical Sync
  4. Define the Replication Frequency
  5. Select tables to sync

Retrieving Your Mixpanel API Credentials

  1. Sign into your Mixpanel account.
  2. Click Account, which is located in the upper right portion of the screen.
  3. In the modal that displays, click Projects.

    Note that this window is different than the Project Settings window, which is accessed using the gear icon in the lower left corner. The window you need is accessed only by clicking Account > Projects.

  4. Your API credentials will display:

    Mixpanel API credentials.

Leave this page open - you’ll need it to complete the setup in Stitch.

Add Mixpanel as a Stitch Data Source

  1. On the Stitch Dashboard page, click the Add an Integration button.
  2. Click the Mixpanel icon.

  3. Enter a name for the integration. This is the name that will display on the for the integration; it’ll also be used to create the schema in your data warehouse.

    For example, the name “Stitch Mixpanel” would create a schema called stitch_mixpanel in the data warehouse. This schema is where all the tables for this integration will be stored.

  4. Paste your API credentials in the the API Key and Secret fields, respectively.

Defining the Historical Sync

The Sync Historical Data setting will define the starting date for your Mixpanel integration. This means that data equal to or newer than this date will be replicated to your data warehouse.

Change this setting if you want to sync data beyond Mixpanel’s default setting of 7 days. For a detailed look at historical syncs, check out the Syncing Historical SaaS Data article.

Define the Replication Frequency

The Replication Frequency controls how often Stitch will attempt to replicate data from your Mixpanel integration. By default the frequency is set to 30 minutes, but you can change it to better suit your needs.

Before setting the Replication Frequency, note that:

  • The more often Mixpanel is set to replicate, the higher the number of replicated rows.
  • The number of rows in the source may not equal the number of rows replicated by Stitch. Tables that use Full Table Replication will result in a higher number of replicated rows.

  • If you’re using a data warehouses that doesn’t natively support nested structures, you’ll see a higher number of replicated rows due to the de-nesting Stitch performs.

To help prevent overages, we recommend setting the Replication Frequency to something less frequent - like 6 hours instead of 30 minutes. For tips on reducing your row count, check out the Reducing Your Row Count section of our Billing Guide.

After selecting a Replication Frequency, click Save Integration.

Selecting Tables to Sync

To complete the setup, you’ll need to select tables you want to sync to your data warehouse.

Check out the Schema section to learn more about the available tables in Mixpanel.

  1. In the Integration Details page, click the Tables to Replicate tab.
  2. Locate a table you want to replicate.
  3. To sync a table, click the checkbox next to the table’s name. A green checkmark means the table is set to sync.

  4. Repeat this process for all the tables you want to replicate.

Mixpanel’s Intial Sync

After you finish setting up Mixpanel, you might see its Sync Status show as Pending on either the Stitch Dashboard or in the Integration Details page.

For a new integration, a Pending status indicates that Stitch is in the process of scheduling the initial sync for the integration. This may take some time to complete.


Replicating Mixpanel Data

Because of how Mixpanel’s API is designed, two of the three tables in our Mixpanel integration - the mixpanel_export and mixpanel_funnels tables - can only be queried by day. This means that every time Stitch runs a replication job for a Mixpanel integration, the past day’s worth of data will be replicated for each of these tables.

To prevent the re-replication of data that will count against your row count, we recommend setting the Replication Frequency to something less frequent.


Mixpanel Schema

Note that these tables won't automatically be created in your data warehouse. You must set them to sync in the Integration Details page first.

Stitch's Mixpanel integration includes these tables:


mixpanel_engage

Replication Method: Full Table
Primary Key: distinct_id
Contains Nested Structures?: No

The mixpanel_engage table contains info from People Analytics, which will enable you to do user-level analysis.

Table Info & Attributes

mixpanel_engage Attributes

While we try to include everything Mixpanel has here, this may not be a full list of attributes. Refer to Mixpanel's documentation for a full list and description of each attribute.

  • distinct_id

  • created

  • email

  • first_name

  • last_name

  • last_seen

mixpanel_export

Replication Method: Incremental
Primary Key: event:time:distinct_id:_sdc_record_hash
Contains Nested Structures?: No

The mixpanel_export table contains raw event data.

Table Info & Attributes

Replication

Because of how Mixpanel’s API is designed, this table can only be queried by day. Thyis means that every time Stitch runs a replication job for a Mixpanel integration, the past day’s worth of data will be replicated for this table.

mixpanel_export Attributes

While we try to include everything Mixpanel has here, this may not be a full list of attributes.

  • distinct_id

  • event

  • mp_country_code

  • mp_lib

  • mp_reserved_browser

  • mp_reserved_browser_version

  • mp_reserved_city

  • mp_reserved_current_url

  • mp_reserved_initial_referrer

  • mp_reserved_initial_referring_domain

  • mp_reserved_lib_version

  • mp_reserved_reserved_os

  • mp_reserved_region

  • mp_reserved_screen_height

  • mp_reserved_screen_width

  • time

  • _sdc_record_hash

mixpanel_funnels

Replication Method: Incremental
Primary Key: funnel_id:date
Contains Nested Structures?: Yes

The mixpanel_funnels table contains contains info about your Mixpanel funnels.

Table Info & Attributes

mixpanel_funnels & Nested Structures

This table contains nested structures. If you use a data warehouse that doesn't natively support nested structures, some of the attributes listed below may be in a subtable.

These items are marked with a *

Replication

Because of how Mixpanel’s API is designed, this table can only be queried by day. Thyis means that every time Stitch runs a replication job for a Mixpanel integration, the past day’s worth of data will be replicated for this table.

mixpanel_funnels Attributes

While we try to include everything Mixpanel has here, this may not be a full list of attributes.

  • funnel_id

  • date

  • steps*: count, step_conv_ratio, goal, overall_conv_ratio, avg_time, event

  • analysis__completion

  • analysis__starting_amount

  • analysis__steps

  • analysis__worst



Questions? Feedback?

Did this article help? If you have questions or feedback, please reach out to us.