Unified Data Model
The Entity-Attribute-Value Model
All data transformed with the HyperData API maps to the same EAV model.
Universally Unique Identifiers
We use UUIDs to index each entity and attribute for a couple of key reasons
Since UUIDs are globally unique, you can easily append new data without risking an index conflict where two entities or attributes have the same ID
The record table can be easily linked to the entity and attribute table
The entity table
The entity table can store just about anything—people, products, places, customers, websites, and so much more. It’s like a universal container for things in the real world that you have information on.
The attribute table
The attribute table serves as a detailed ledger, cataloging every data point related to an entity. Think of it as the specifics or the "about" details of anything you're tracking. This could be a person’s age, a product's price, a post's click-through rate, or any other characteristic.
The record table
The record table ties everything together by linking an entity and attribute to a specific time with a value. The "datetime" column can represent specific points in time such as "2023-04-01 11:30:13" or a stretch of time such as a whole month "2022-03". The "record_value" column stores the actual value of the entity-attribute pair with either a float, integer, string, or bytes-like object.
Example
The following is an example of the entity, attribute, and record tables that would be returned as the output of a transformation. The data in this table is completely fictitious and for demonstration purposes only.
7ba0f437-be4d-71c8-12e6-006ce5a65f2e
81 Bay Street, Toronto, ON M5J 0E7
CIBC square located in downtown Toronto
eef438f7-e448-aa48-c1f8-db568d27b591
twitter.com/cibc
The offical X page of CIBC
The Value of Standardization
Standardizing data is a crucial process that brings uniformity to information, making it easier to collect, analyze, and share across various platforms and systems. This harmonization not only enhances the accuracy and reliability of data analysis but also facilitates interoperability between different technologies and sectors. By adhering to common standards, organizations can ensure that their data is compatible, comparable, and easily integrated, leading to more informed decision-making, streamlined operations, and the ability to leverage insights across different datasets.
By consistently transforming data to the same model, HyperData eliminates the complexity and variability often encountered in data management. This standardization ensures that developers and organizations can reliably query and build applications on a stable foundation, without the concern of pipelines breaking due to data format inconsistencies. HyperData reduces maintenance efforts and provides the assurance that data structures will remain consistent, enabling more predictable and efficient application performance and data analysis.
Last updated