Synthetic data format is wrong based upon real data - Please help #1943
Comments
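Hi there @Vasanthpravin. When you run metadata.detect_from_dataframe(), SDV tries to detect the sdtype of each column automatically. However, this process isn't perfect, and we always recommend double-checking the metadata to make sure it matches what you expect. You can display the metadata to see which sdtype was assigned to each column.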
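As a rough illustration of that double check, the sketch below compares a metadata dictionary against the pandas dtypes and lists numeric columns that were not detected as `numerical`. The helper `find_suspect_columns` and both sample dictionaries are hypothetical; the layout of `detected` is assumed to mirror what `metadata.to_dict()` returns, and the values are made up to match the output reported in this issue. A numeric column can legitimately be an `id`, so treat the result as a list of columns to review rather than a list of errors.

```python
import re

# Matches plain integer/float dtype names such as "int64", "uint8", "float32".
NUMERIC_DTYPE = re.compile(r"^(u?int|float)\d*$")

def find_suspect_columns(metadata_dict, dtypes):
    """Return {column: detected sdtype} for numeric columns not detected as 'numerical'."""
    suspects = {}
    for name, dtype in dtypes.items():
        sdtype = metadata_dict["columns"].get(name, {}).get("sdtype")
        if NUMERIC_DTYPE.match(dtype.lower()) and sdtype != "numerical":
            suspects[name] = sdtype
    return suspects

# Hypothetical detection result (assumed shape of metadata.to_dict()).
detected = {
    "METADATA_SPEC_VERSION": "SINGLE_TABLE_V1",
    "columns": {
        "Person id": {"sdtype": "unknown", "pii": True},
        "PhoneId": {"sdtype": "id"},
    },
}

# Hypothetical dtypes, e.g. built with {c: str(t) for c, t in df_pandas.dtypes.items()}.
dtypes = {"Person id": "int64", "PhoneId": "int64"}

suspects = find_suspect_columns(detected, dtypes)
for name, sdtype in suspects.items():
    print(f"{name}: numeric dtype, but detected sdtype is {sdtype!r}")
```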
Then, you can update the sdtype of multiple columns at once using the metadata's update_columns method.
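To make the effect of such an update concrete, here is a minimal sketch that works on a plain metadata dictionary rather than on the SDV object itself. The helper `set_sdtype` and the sample `detected` dictionary are hypothetical and only illustrate the idea; with SDV installed, the equivalent step is assumed to be a call along the lines of `metadata.update_columns(column_names=[...], sdtype='numerical')`.

```python
import copy

def set_sdtype(metadata_dict, column_names, sdtype):
    """Return a copy of a metadata dict with `sdtype` applied to the given columns."""
    updated = copy.deepcopy(metadata_dict)
    for name in column_names:
        if name not in updated["columns"]:
            raise KeyError(f"Unknown column: {name!r}")
        # Replace the whole entry so leftover keys such as "pii" are dropped.
        updated["columns"][name] = {"sdtype": sdtype}
    return updated

# Hypothetical detection result for the two columns in this issue.
detected = {
    "METADATA_SPEC_VERSION": "SINGLE_TABLE_V1",
    "columns": {
        "Person id": {"sdtype": "unknown", "pii": True},
        "PhoneId": {"sdtype": "id"},
    },
}

corrected = set_sdtype(detected, ["Person id", "PhoneId"], "numerical")
print(corrected["columns"])
```

Working on a copy keeps the originally detected metadata available for comparison. Once both columns are marked as numerical, the synthesizer should model them as numbers instead of generating placeholder strings such as sdv-pii-btwry or sdv-id-0.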
Then you can create your synthesizer object, fit the model, and sample:
Hi there @Vasanthpravin, I'm closing out this issue for now, as it seems like there isn't a clear bug here. But let me know if you're still running into the issue or uncover a related bug and we can re-open the issue!
SDV version: 1.12.0
Databricks: 13.3 LTS
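from sdv.metadata import SingleTableMetadata
from sdv.single_table import GaussianCopulaSynthesizer

# Convert the Spark DataFrame to pandas
df_pandas = df.toPandas()
# Auto-detect the metadata (column sdtypes) from the data
metadata = SingleTableMetadata()
metadata.detect_from_dataframe(df_pandas)
# Fit the synthesizer and sample 50 synthetic rows
synthesizer = GaussianCopulaSynthesizer(metadata=metadata)
synthesizer.fit(data=df_pandas)
synthetic_data = synthesizer.sample(num_rows=50)
display(synthetic_data)
# null_counts was renamed to show_counts in newer pandas versions
df_pandas.info(verbose=True, show_counts=False)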
The output is:
Person id      PhoneId
sdv-pii-btwry  sdv-id-0
Both columns are numeric in the real data, but SDV generates values like these. Please help.