Evaluation
report
Collect summary statistics for the evaluation of synthetic data in terms of data quality and privacy protection.
Arguments:
data_train
- ARelationalData
object containing the original training data.data_test
- ARelationalData
object containing the original test data.data_synth
- ARelationalData
object containing the generated synthetic data.path
- A path to save the report.n_max_train
- The maximum number of samples per table (for train data) to use in the report.n_max_test
- The maximum number of samples per table (for test data) to use in the report.
compute_privacy_stats
Compute privacy statistics for the evaluation of synthetic data.
Arguments:
data_train
- ARelationalData
object containing the original training data.data_synth
- ARelationalData
object containing the generated synthetic data.n_max
- The maximum number of samples per table (for both train and synth data) to use in the computation.n_folds_std
- Number of folds to use in teh computation of the standard deviation. If None, the computation is not performed. Default: 10.
Returns:
A dictionary mapping each table to a PrivacyStats
object (or None in case of error).