Announcing DP Wizard v0.6

Photo of people socializing at a reception at the Harvard Science and Engineering Complex

We’ve spent some time this fall on DP Wizard features designed particularly for users of research data repositories. One common challenge is a dataset that contains personal information: it can’t be released verbatim, but it should still be summarized. Differential privacy can be used to generate aggregate statistics, but for reproducible research, the steps in the analysis should also be described.
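
For readers new to the idea, here is a minimal sketch of a differentially private aggregate statistic: a count with Laplace noise added. It is an illustration only and not the code DP Wizard generates. The function names, the `epsilon` value, and the sample data are all made up for the example, and a naive floating-point sampler like this one is not suitable for production use.

```python
import random


def laplace_noise(scale: float, rng: random.Random) -> float:
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace distribution centered at 0 with that scale.
    return rng.expovariate(1 / scale) - rng.expovariate(1 / scale)


def dp_count(rows: list, epsilon: float, rng: random.Random) -> float:
    # Adding or removing one person changes a count by at most 1,
    # so the sensitivity is 1 and the noise scale is 1 / epsilon.
    return len(rows) + laplace_noise(1 / epsilon, rng)


rng = random.Random(0)          # seeded only so the sketch is repeatable
ages = [34, 45, 29, 61, 50]     # hypothetical private records
noisy = dp_count(ages, epsilon=1.0, rng=rng)
print(f"true count: {len(ages)}, noisy count: {noisy:.2f}")
```

A smaller `epsilon` means more noise and stronger privacy. Recording that choice alongside the released number is part of what makes the analysis reproducible.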

DP Wizard now makes this easier. Version 0.5 already added the option of generating synthetic data from a private CSV, in addition to descriptive statistics. With 0.6, besides downloading individual files, you can now download a zip file containing all the steps in the analysis. Before downloading, you can customize the README that will be included in the zip file.

There are many smaller improvements along the way, including summaries of your choices for each step and improved code generation. You can find a full list of changes in our CHANGELOG.

Let us know if you’ve found DP Wizard useful, or if there are particular features you need!