I'm trying to synthesize a dataset with various columns for dates.
I'm using the Gaussian Copula Synthesizer, and the following transformer for these columns:
The columns are of type <M8[ns] or datetime64[ns]. They already contain a lot of missing values (NaT).
After synthesis, the sampled data contains ONLY missing values (NaT).
The dates are in yyyy-mm-dd format. I tried setting datetime_format to '%Y-%m-%d', but to no avail.
What did I do wrong?
Out of curiosity, does the NaT issue disappear if you use the default transformer, without updating to OptimizedTimestampEncoder? I'd be curious to know if the default workflow results in a roughly similar proportion of NaT values in the synthetic data.
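One way to make that comparison concrete is to compute the share of missing values in a date column before and after synthesis. This is only a minimal sketch: `missing_fraction` is a hypothetical helper, not part of SDV, and the sample lists stand in for a real and a synthetic column.

```python
def missing_fraction(values):
    """Return the share of entries that are missing.

    An entry counts as missing if it is None or if it is not equal to
    itself, which is how NaN and pandas NaT behave.
    """
    values = list(values)
    if not values:
        return 0.0
    missing = sum(1 for v in values if v is None or v != v)
    return missing / len(values)


# Illustrative data only, not taken from the reported dataset.
real_column = ["2021-03-14", None, None, "2019-07-30"]
synthetic_column = [None, None, None, None]

print(missing_fraction(real_column))       # 0.5
print(missing_fraction(synthetic_column))  # 1.0
```

For a pandas column, `series.isna().mean()` gives the same number directly.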
I'm also curious to know if you're hoping to generate synthetic data with the same datetime format as your real data, or if you're hoping to generate synthetic data with a different one? In SDV land, the datetime_format value should describe the format in your real data, not the aspirational format for the synthetic data.
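If the dates are stored as strings, a quick check of whether a candidate format really describes the real data could look like the sketch below. It uses only Python's standard `datetime.strptime`; `format_match_rate` is a hypothetical helper rather than an SDV function, and the sample values are made up. For columns that are already datetime64[ns] there is nothing to parse, so this check only applies to string-typed dates.

```python
from datetime import datetime


def format_match_rate(values, datetime_format):
    """Return the fraction of non-missing strings that parse with datetime_format."""
    present = [v for v in values if v is not None and v != ""]
    if not present:
        return 0.0
    matched = 0
    for value in present:
        try:
            datetime.strptime(value, datetime_format)
            matched += 1
        except ValueError:
            # The value does not follow the candidate format.
            pass
    return matched / len(present)


# Illustrative data only: yyyy-mm-dd strings with one missing entry.
real_dates = ["2021-03-14", "2020-12-01", None, "2019-07-30"]

print(format_match_rate(real_dates, "%Y-%m-%d"))  # 1.0
print(format_match_rate(real_dates, "%d/%m/%Y"))  # 0.0
```

A rate well below 1.0 would suggest that the format string does not describe the real data.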
Do you also mind sharing your SDV code? This would include how you're instantiating the synthesizer, customizing the synthesizer's behavior using parameters and methods, any constraints you're using, etc.