Hello,
I have a question about the mechanism proposed in your paper.
In the design of the membership inference attack, a requirement is that the adversary has access to a reference dataset drawn from the same distribution as the target model's training data. So in the implementation, for any dataset available in SDGym (e.g. adult, insurance), one sample is used as the adversary's prior information and another is used as the training set for the generative model that produces the synthetic data, with the size of each controlled by the config parameters `sizeRawT` and `sizeRawA`.
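For context, here is roughly how I read that sampling step. This is only a minimal sketch under my own assumptions (pandas DataFrames, a hypothetical `split_raw_data` helper, and disjoint samples), not the repo's actual code, so please correct me if I've misunderstood it:

```python
import pandas as pd


def split_raw_data(raw_df: pd.DataFrame, size_raw_t: int, size_raw_a: int, seed: int = 0):
    """Draw two samples from the raw dataset (my reading of the setup):
    one of size `size_raw_t` to train the target generative model,
    and one of size `size_raw_a` as the adversary's reference data.
    Helper name, disjointness, and the seed handling are my assumptions."""
    shuffled = raw_df.sample(frac=1, random_state=seed).reset_index(drop=True)
    raw_t = shuffled.iloc[:size_raw_t]                          # target model's training sample
    raw_a = shuffled.iloc[size_raw_t:size_raw_t + size_raw_a]   # adversary's reference sample
    return raw_t, raw_a


# Hypothetical usage with the SDGym 'adult' dataset and example sizes:
# raw_t, raw_a = split_raw_data(adult_df, size_raw_t=1000, size_raw_a=1000)
```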
In practice, however, when building a generative model it is usually beneficial to train on the entire available dataset so that the model can better learn the underlying distribution. It seems that the proposed mechanism therefore hinges on either a) using generative models that do not require large training sets, such as those listed in the configs (BayesNet, PrivBayes), or b) having a dataset large enough that GANs or large language models still have sufficient data for stable training after the sub-sampling. I'd love to hear your thoughts on this.
Would you also be able to share the config parameters used to generate results for CTGAN and PATEGAN in your paper? Thanks!