Disclosure control #1

tombisho · 2021-09-02T15:02:25Z

Look at Swiss knife disclosure measures and implement them

tombisho · 2021-09-02T15:09:35Z

Also check how the seed is set.

tombisho · 2021-10-15T14:03:47Z

Note that a variable to be synthesized first that has no predictors is a special case and its synthetic values are by default generated by random sampling with replacement from the original data ("sample" method). I
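To make the seed question and the "sample" behaviour concrete, here is a minimal Python sketch of seeded sampling with replacement. It is an illustration only, not the package's code: the function name `sample_synthetic`, the `seed` argument and the example ages are all hypothetical.

```python
import random

def sample_synthetic(original, n=None, seed=None):
    """Draw synthetic values by sampling with replacement from `original`.

    Hypothetical helper for illustration; not part of any library.
    """
    # A local generator keeps the seed explicit and leaves the global
    # random state untouched.
    rng = random.Random(seed)
    k = len(original) if n is None else n
    return rng.choices(original, k=k)

# Made-up example data.
real_ages = [23, 35, 35, 41, 58, 90]
first = sample_synthetic(real_ages, seed=2021)
second = sample_synthetic(real_ages, seed=2021)
```

Two things follow from the sketch: the same seed reproduces the same draw, and every synthetic value is a value that occurs in the original data, which is why this column needs attention from a disclosure point of view.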

tombisho · 2022-10-24T09:17:28Z

Concern: the density smoothing does not hide extreme values, so we might need to use top and bottom coding. How should the top and bottom thresholds be set? 90% of the real value? Might this hide large values and cause problems?
Add a label to the data to show it is synthetic (easy?)
Remove unique combinations of factors that are also in the real data
The first column is always sampled, so does it contain real data? Force it to be a factor? The smoothing and top and bottom coding are applied to it, which can hide the real values
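A rough Python sketch of two of the measures listed above, top and bottom coding and removal of replicated unique combinations. Everything here is an assumption made for illustration: the function names, the choice of the 10th and 90th percentiles as thresholds, the nearest-rank quantile rule, and the reading of "unique" as "occurs exactly once in both the synthetic and the real data".

```python
from collections import Counter

def quantile_bounds(reference, lower_q=0.1, upper_q=0.9):
    """Return (low, high) thresholds from `reference` by nearest rank."""
    ordered = sorted(reference)
    last = len(ordered) - 1
    return ordered[round(lower_q * last)], ordered[round(upper_q * last)]

def top_bottom_code(values, low, high):
    """Clamp every value into the closed interval [low, high]."""
    return [min(max(v, low), high) for v in values]

def drop_replicated_uniques(synthetic_rows, real_rows):
    """Drop synthetic rows whose factor combination occurs exactly once
    in the synthetic data and exactly once in the real data."""
    syn_counts = Counter(synthetic_rows)
    real_counts = Counter(real_rows)
    return [
        row for row in synthetic_rows
        if not (syn_counts[row] == 1 and real_counts[row] == 1)
    ]

# Made-up numeric column with one extreme value.
incomes = [10, 12, 13, 15, 16, 18, 19, 21, 22, 25, 400]
low, high = quantile_bounds(incomes)
coded = top_bottom_code(incomes, low, high)

# Made-up factor combinations (sex, occupation).
real = [("F", "nurse"), ("M", "nurse"), ("M", "nurse"), ("F", "pilot")]
synthetic = [("F", "pilot"), ("M", "nurse"), ("F", "nurse"), ("F", "nurse")]
kept = drop_replicated_uniques(synthetic, real)
```

In this example the extreme income of 400 is recoded to the upper threshold, and the synthetic ("F", "pilot") row is dropped because it replicates a unique real record. Where the thresholds should actually sit is still the open question raised above.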
