Resources

Searching for statistical data privacy resources? You’re in the right place. Here we provide a range of resources, from CRAN packages to influential articles.

Articles

Books and Book Chapters

CRAN (R) Packages

  • acro: A Tool for Automating the Statistical Disclosure Control of Research Outputs
  • AQuadtree: Confidentiality of Spatial Point Data
  • cellKey: Consistent Perturbation of Statistical Frequency and Magnitude Tables
  • diffpriv: Easy Differential Privacy
  • duawranglr: Securely Wrangle Dataset According to Data Usage Agreement
  • easySdcTable: Easy Interface to the Statistical Disclosure Control Package ‘sdcTable’ Extended with Own Implementation of ‘GaussSuppression’
  • pda: Privacy-Preserving Distributed Algorithms
  • ppmf: Read Census Privacy Protected Microdata Files
  • ppmHR: Privacy-Protecting Hazard Ratio Estimation in Distributed Data Networks
  • PPRL: Privacy Preserving Record Linkage
  • RegSDC: Information Preserving Regression-Based Tools for Statistical Disclosure Control
  • sdcHierarchies: Create and (Interactively) Modify Nested Hierarchies
  • sdcLog: Tools for Statistical Disclosure Control in Research Data Centers
  • SDCNWay: Tools to Evaluate Disclosure Risk
  • sdcSpatial: Statistical Disclosure Control for Spatial Data
  • sdcTable: Methods for Statistical Disclosure Control in Tabular Data
  • synthpop: Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control
  • uwedragon: Data Research, Access, Governance Network : Statistical Disclosure Control

Python Packages

  • Google’s Differential Privacy Libraries
  • Diffprivlib: General-Purpose Library for Experimenting with, Investigating, and Developing Applications in Differential Privacy
  • OpenDP: The Core Library of Differential Privacy Algorithms Powering the OpenDP Project
  • PipelineDP: Python Framework for Applying Differentially Private Aggregations to Large Datasets Using Batch Processing Systems such as Apache Spark, Apache Beam, and More
  • PySyft: Perform Data Science on Data that Remains in Someone Else’s Server
  • PyDP: OpenMined’s Python Differential Privacy Library
  • Tensorflow Privacy: Library for Training Machine Learning Models with Privacy for Training Data
  • Tumult Analytics: Tumult Labs’ Differential Privacy Library, running on PySpark

Additional Resources

If you have additional resources that you’d like to share on this page, join our group and let us know on our discussion forum.