Jörg Drechsler – Författare
Visar alla böcker från författaren Jörg Drechsler. Handla med fri frakt och snabb leverans.
2 produkter
2 produkter
Handbook of Sharing Confidential Data
Differential Privacy, Secure Multiparty Computation, and Synthetic Data
Inbunden, Engelska, 2024
2 246 kr
Skickas inom 10-15 vardagar
Statistical agencies, research organizations, companies, and other data stewards that seek to share data with the public face a challenging dilemma. They need to protect the privacy and confidentiality of data subjects and their attributes while providing data products that are useful for their intended purposes. In an age when information on data subjects is available from a wide range of data sources, as are the computational resources to obtain that information, this challenge is increasingly difficult. The Handbook of Sharing Confidential Data helps data stewards understand how tools from the data confidentiality literature—specifically, synthetic data, formal privacy, and secure computation—can be used to manage trade-offs in disclosure risk and data usefulness.Key features:• Provides overviews of the potential and the limitations of synthetic data, differential privacy, and secure computation• Offers an accessible review of methods for implementing differential privacy, both from methodological and practical perspectives• Presents perspectives from both computer science and statistical science for addressing data confidentiality and privacy• Describes genuine applications of synthetic data, formal privacy, and secure computation to help practitioners implement these approachesThe handbook is accessible to both researchers and practitioners who work with confidential data. It requires familiarity with basic concepts from probability and data analysis.
Synthetic Datasets for Statistical Disclosure Control
Theory and Implementation
Häftad, Engelska, 2011
1 273 kr
Skickas inom 10-15 vardagar
The aim of this book is to give the reader a detailed introduction to the different approaches to generating multiply imputed synthetic datasets. It describes all approaches that have been developed so far, provides a brief history of synthetic datasets, and gives useful hints on how to deal with real data problems like nonresponse, skip patterns, or logical constraints. Each chapter is dedicated to one approach, first describing the general concept followed by a detailed application to a real dataset providing useful guidelines on how to implement the theory in practice. The discussed multiple imputation approaches include imputation for nonresponse, generating fully synthetic datasets, generating partially synthetic datasets, generating synthetic datasets when the original data is subject to nonresponse, and a two-stage imputation approach that helps to better address the omnipresent trade-off between analytical validity and the risk of disclosure.The book concludes with a glimpse into the future of synthetic datasets, discussing the potential benefits and possible obstacles of the approach and ways to address the concerns of data users and their understandable discomfort with using data that doesn’t consist only of the originally collected values. The book is intended for researchers and practitioners alike. It helps the researcher to find the state of the art in synthetic data summarized in one book with full reference to all relevant papers on the topic. But it is also useful for the practitioner at the statistical agency who is considering the synthetic data approach for data dissemination in the future and wants to get familiar with the topic.