Understanding Your Customers

Disclaimer: Super Furniture is a fictional company used for this example. Below is a showcase of how HyperData can be used. Data included in this example are simulated and for demonstration purposes.

Problem

Super Furniture is an e-commerce store that sells furniture and home goods. The company is interested in better understanding its customers to drive top-line growth. To do so, Super Furniture is looking to leverage 3rd party data to enable granular customer segmentation, targeted advertising campaigns, personalized messaging, and market research.

The problem is that it is incredibly tedious & expensive to extract, transform, and validate large volumes of 3rd party data from different sources. It requires weeks to months of data engineering work and ongoing data pipeline maintenance to get real value from the data - this is an incredibly high upfront cost.

Solution

Super Furniture can use the HyperData API to simplify the data integration process and significantly reduce costs. HyperData transforms and validates the raw 3rd party data so that clean data can be loaded into Super Furniture's database in a unified and reliable format. HyperData does this at a fraction of the cost and time vs. Super Furniture building and maintaining the integrations in-house.

Source Data

Note that the data is simulated and is for demonstration purposes only.

The order data comes directly from the Super Furniture database and contains the records of each online order. We will use the zipcodes and emails from the transaction records to learn more about the customer base.

Here's a sample row from the dataset:

Transforming with Hyperdata

The source data is in different formats and will be updated at different frequencies so we will use the HyperData API to transform the source data into a single format for use. Here's the code we will use to do this:

import hdata as hd

# Load the raw data
orders = hd.Source(transformation_key='*****', data='orders.parquet')
zipcode = hd.Source(transformation_key='*****', data='zipcode.csv')
email_enrichment = hd.Source(transformation_key='*****', data='email.csv')

# Transform
sources = [orders, zipcode, email_enrichment]
entity, attribute, record = hd.transform(sources, auth_token='*****')

Here is the output of the transformation in CSV format for reference. The data is returned from the transform function as three pandas data frames.

With this clean data feed, Super Furniture can now run targeted advertising campaigns, customize communications with existing customers, and better understand their market. All of this is made possible by creating a single source of truth for external data with the HyperData API.

Last updated