for the NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009
The final print program (with some minor post-workshop typographic corrections) can be found here (last updated 2009-09-24). Further information on the workshop can be found on the main page. Presentations are listed below, with links to the actual presentation document.
Opening remarks
- Tom Mesenbourg, Deputy Director of the U.S. Census Bureau
- John Abowd (Cornell University)
Session 1: : Imputation methods and synthesizers
- Jerry Reiter: Easily implemented, nonparametric synthesizers based on algorithmic methods in computer science (additional tables)
- Gary Benedetto and Simon Woodcock: Partially-synthetic linked employer-employee data
- Robert Creecy: The Feasibility of Creating a Fully Synthetic Decennial Census Microdata File
- Michael Larsen and Jennifer Huckett: Synthetic data methods using quantile regression and hot deck with rank swapping
- John Abowd, Fredrik Andersson, Matthew Graham, Lars Vilhuber and Jeremy Wu: Formal Privacy Guarantees and Analytical Validity of OnTheMap Public-use Data
Session 2: Synthetic data in public use micro-data products
- Martha Stinson, Gary Benedetto, and Melissa Bjelland: Summary of Methods and Preliminary Assessment of the SIPP Synthetic Beta
- Saki Kinney: Synthetic Longitudinal Business Database
- Jörg Drechsler: New Data Dissemination Approaches in Old Europe – Synthetic Datasets for a German Establishment Survey
- Sam Hawala and Rolando Rodriguez: Disclosure avoidance for group quarters in the American Community Survey: Details of the synthetic data method
- Trivellore Raghunathan: Diagnostic Tools for Assessing Validity of Synthetic Data Inferences
Session 3: Synthetic data and disclosure avoidance
- Arnold Reznek: Disclosure avoidance issues at the Census Bureau
- Stefan Bender: Pulling wool over users' eyes
- Nick Greenia: Confidentiality Issues with Tax Data
- Jennifer Madans: Disclosure avoidance issues at NCHS
Closing remarks
- Donald Rubin (Harvard University)
Funding for the conference and its preparation were provided by National Science Foundation (NSF) Grant SES-0922494, the U.S. Census Bureau's Center for Economic Studies, the Internal Revenue Service (IRS), and the Edmund Ezra Day Professorship at Cornell University.
Bibliography
Please cite the presentations as follows:
- J. Madans, "Disclosure avoidance issues at NCHS," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47681, 2009.
[URL] [Bibtex]@techreport{handle:1813:47681, author={Madans, Jennifer}, title={Disclosure avoidance issues at NCHS}, url={http://hdl.handle.net/1813/47681}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47681}, }
- N. Greenia, "Confidentiality Issues with Tax Data," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47680, 2009.
[URL] [Bibtex]@techreport{handle:1813:47680, author={Greenia, Nick}, title={Confidentiality Issues with Tax Data}, url={http://hdl.handle.net/1813/47680}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47680}, }
- S. Bender, "Pulling wool over users' eyes," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47679, 2009.
[URL] [Bibtex]@techreport{handle:1813:47679, author={Bender, Stefan}, title={Pulling wool over users' eyes}, url={http://hdl.handle.net/1813/47679}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47679}, }
- A. Reznek, "Disclosure avoidance issues at the Census Bureau," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47678, 2009.
[URL] [Bibtex]@techreport{handle:1813:47678, author={Reznek, Arnold}, title={Disclosure avoidance issues at the Census Bureau}, url={http://hdl.handle.net/1813/47678}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47678}, }
- T. Raghunathan, "Diagnostic Tools for Assessing Validity of Synthetic Data Inferences," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47677, 2009.
[URL] [Bibtex]@techreport{handle:1813:47677, author={Raghunathan, Trivellore}, title={Diagnostic Tools for Assessing Validity of Synthetic Data Inferences}, url={http://hdl.handle.net/1813/47677}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47677}, }
- S. Hawala and R. Rodriguez, "Disclosure avoidance for group quarters in the American Community Survey: Details of the synthetic data method," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47676, 2009.
[URL] [Bibtex]@techreport{handle:1813:47676, author={Hawala, Sam and Rodriguez, Rolando}, title={Disclosure avoidance for group quarters in the American Community Survey: Details of the synthetic data method}, url={http://hdl.handle.net/1813/47676}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47676}, }
- J. Drechsler, "New Data Dissemination Approaches in Old Europe – Synthetic Datasets for a German Establishment Survey," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47675, 2009.
[URL] [Bibtex]@techreport{handle:1813:47675, author={Drechsler, Jörg}, title={New Data Dissemination Approaches in Old Europe – Synthetic Datasets for a German Establishment Survey}, url={http://hdl.handle.net/1813/47675}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47675}, }
- S. Kinney, J. Reiter, R. Jarmin, and J. Miranda, "Synthetic Longitudinal Business Database," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47674, 2009.
[URL] [Bibtex]@techreport{handle:1813:47674, author={Kinney, Saki and Reiter, Jerry and Jarmin, Ron and Miranda, Javier}, title={Synthetic Longitudinal Business Database}, url={http://hdl.handle.net/1813/47674}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47674}, }
- M. Bjelland, G. Benedetto, and M. Stinson, "Summary of Methods and Preliminary Assessment of the SIPP Synthetic Beta," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47673, 2009.
[URL] [Bibtex]@techreport{handle:1813:47673, author={Bjelland, Melissa and Benedetto, Gary and Stinson, Martha}, title={Summary of Methods and Preliminary Assessment of the SIPP Synthetic Beta}, url={http://hdl.handle.net/1813/47673}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47673}, }
- F. Andersson, J. M. Abowd, M. Graham, J. Wu, and L. Vilhuber, "Formal Privacy Guarantees and Analytical Validity of OnTheMap Public-use Data," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47672, 2009.
[URL] [Bibtex]@techreport{handle:1813:47672, author={Andersson, Fredrik and Abowd, John M. and Graham, Matthew and Wu, Jeremy and Vilhuber, Lars}, title={Formal Privacy Guarantees and Analytical Validity of OnTheMap Public-use Data}, url={http://hdl.handle.net/1813/47672}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47672}, }
- M. Larsen and J. Huckett, "Synthetic data methods using quantile regression and hot deck with rank swapping," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47671, 2009.
[URL] [Bibtex]@techreport{handle:1813:47671, author={Larsen, Michael and Huckett, Jennifer}, title={Synthetic data methods using quantile regression and hot deck with rank swapping}, url={http://hdl.handle.net/1813/47671}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47671}, }
- R. Creecy, "The Feasibility of Creating a Fully Synthetic Decennial Census Microdata File," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47670, 2009.
[URL] [Bibtex]@techreport{handle:1813:47670, author={Creecy, Robert}, title={The Feasibility of Creating a Fully Synthetic Decennial Census Microdata File}, url={http://hdl.handle.net/1813/47670}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47670}, }
- G. Benedetto and S. Woodcock, "Partially-synthetic linked employer-employee data," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47669, 2009.
[URL] [Bibtex]@techreport{handle:1813:47669, author={Benedetto, Gary and Woodcock, Simon}, title={Partially-synthetic linked employer-employee data}, url={http://hdl.handle.net/1813/47669}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47669}, }
- J. Reiter, "Random Forest Models for Generating Partially Synthetic Data," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47668, 2009.
[URL] [Bibtex]@techreport{handle:1813:47668, author={Reiter, Jerry}, title={Random Forest Models for Generating Partially Synthetic Data}, url={http://hdl.handle.net/1813/47668}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47668}, }
- J. Reiter, "Easily Implemented, Nonparametric Synthesizers Based on Algorithmic Methods from Computer Science," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47667, 2009.
[URL] [Bibtex]@techreport{handle:1813:47667, author={Reiter, Jerry}, title={Easily Implemented, Nonparametric Synthesizers Based on Algorithmic Methods from Computer Science}, url={http://hdl.handle.net/1813/47667}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47667}, }
- R. Jarmin and A. Reznek, "NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009 Program," NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009, Presentation 1813:47666, 2009.
[URL] [Bibtex]@techreport{handle:1813:47666, author={Jarmin, Ron and Reznek, Arnold}, title={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009 Program}, url={http://hdl.handle.net/1813/47666}, type={Presentation}, year={2009}, institution={NSF-Census-IRS Workshop on Synthetic Data and Confidentiality Protection 2009}, number={1813:47666}, }
You can download the entire bibtex file here.