Introduction
The Aindo Synthetic Data platform is a web application that lets users generate synthetic data that closely mimics real-world datasets while preserving privacy and statistical accuracy.
The platform is built around three key components that work together:
- Sources: the starting point of the data generation process. Sources represent the original datasets you want to synthesize or anonymize, and serve as inputs for creating generators. Learn more about managing sources here.
- Generators: built from your source data, generators define how new datasets are created. Discover how to manage generators here.
- Generated Datasets: the datasets produced by an existing generator. Find more details on managing generated datasets here.
The typical process for generating a dataset on the platform follows these steps:
- Create a source by uploading or connecting to your original data.
- Create a generator using the source data.
- Generate datasets using the created generator.
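The three steps above can also be read as a chain of dependencies: a generated dataset points back to its generator, which points back to its source. The sketch below is a conceptual model only. The class names and fields are hypothetical, chosen for illustration, and are not part of any Aindo API; the platform itself is operated through its web interface.

```python
from dataclasses import dataclass


# Hypothetical model of the three components; not an Aindo API.
@dataclass(frozen=True)
class Source:
    """The original data you want to synthesize or anonymize."""
    name: str


@dataclass(frozen=True)
class Generator:
    """Built using a source."""
    name: str
    source: Source


@dataclass(frozen=True)
class GeneratedDataset:
    """Produced by an existing generator."""
    name: str
    generator: Generator


# Step 1: create a source from the original data.
source = Source(name="customers")
# Step 2: create a generator using that source.
generator = Generator(name="customers-generator", source=source)
# Step 3: generate datasets using the created generator.
datasets = [
    GeneratedDataset(name=f"customers-synthetic-{i}", generator=generator)
    for i in range(1, 3)
]
```

Every dataset in `datasets` can be traced back to the original data through `dataset.generator.source`.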
This workflow can be visually represented as follows:
Advanced Workflow: Using Generated Datasets as Sources
The platform also supports using generated datasets as sources for new generators.
See an example below:
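As a rough illustration of this chaining (a conceptual sketch, not the platform's own example or API), the model below lets a source record the generated dataset it came from. All names, including `as_source` and `lineage`, are hypothetical and exist only to show how the workflow can repeat.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


# Hypothetical model; not an Aindo API.
@dataclass(frozen=True)
class Source:
    name: str
    # Set only when this source was created from a generated dataset.
    origin: Optional[GeneratedDataset] = None


@dataclass(frozen=True)
class Generator:
    name: str
    source: Source


@dataclass(frozen=True)
class GeneratedDataset:
    name: str
    generator: Generator


def as_source(dataset: GeneratedDataset) -> Source:
    """Wrap a generated dataset so it can feed a new generator."""
    return Source(name=f"{dataset.name} (as source)", origin=dataset)


def lineage(dataset: GeneratedDataset) -> list[str]:
    """Walk back from a generated dataset to the original source."""
    names: list[str] = []
    current: Optional[GeneratedDataset] = dataset
    while current is not None:
        generator = current.generator
        names.extend([current.name, generator.name, generator.source.name])
        current = generator.source.origin
    return names


# Basic workflow: source, then generator, then generated dataset.
original = Source("customers")
first = GeneratedDataset("synthetic-a", Generator("generator-a", original))

# Advanced workflow: the generated dataset becomes the source of a new generator.
second = GeneratedDataset("synthetic-b", Generator("generator-b", as_source(first)))

print(lineage(second))
```

Here `lineage(second)` lists the names from the newest dataset back to the original `customers` source, passing through both generators.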