Skip to main content

Where is the Social Science Gateway?

The Social Science Gateway (SSG) grant has ended, please read here about ongoing availability of resources created as part of that project.

We support:

APDU Logo foas-logo-small (2)

SIPP Synthetic Beta file

The SIPP Synthetic Beta files are available on the VirtualRDC. Application forms and other documents are available at http://www.census.gov/sipp/synth_data.html. All but the application forms are also available on this site (see below).

Applications are judged solely on feasibility (i.e. the necessary variables are on the SSB). After projects are approved by the Census Bureau, researchers will be given accounts on the VirtualRDC (more specifically, on the Synthetic Data Server). More details regarding the use of the Synthetic Data Server are available at http://www.vrdc.cornell.edu/news/synthetic-data-server/.

Documentation

Documentation is available on this website as well as at the Census Bureau website:

  • Version 5.1:
    • U.S. Census Bureau, "Codebook for SIPP Synthetic Beta version 5.1," U.S. Census Bureau, 2013.
      [PDF] [URL] [Bibtex]
      @TECHREPORT{ssb_v5_1_codebook,
      author = {{U.S. Census Bureau}},
      title = {Codebook for {SIPP} {Synthetic} {Beta} version 5.1},
      institution = {U.S. Census Bureau},
      year = {2013},
      abstract = {This codebook documents version 5.0 of the SIPP Synthetic Beta (SSB).
      The SSB is a set of files containing individual-level data synthesized
      from linked survey and administrative data. The SSB is produced by
      the US Census Bureau as part of a joint project with the Social Security
      Administration (SSA), and the Internal Revenue Service (IRS). The
      goal of the project is to make some of the benefits of linked survey
      and administrative data available to researchers outside of restricted‐access
      Census Bureau facilities in a manner that protects the confidentiality
      of the underlying data.},
      owner = {vilhuber},
      timestamp = {2013.10.07},
      url = {http://www2.vrdc.cornell.edu/news/wp-content/uploads/2011/01/SSB_Codebook.pdf}
      }

An experimental online codebook is available at CED²AR (provided by the NSF-Census Research Network - Cornell Node)

  • Version 5.0:
    • U.S. Census Bureau, "DRB Memo September 20, 2010," U.S. Census Bureau, 2010.
      [PDF] [URL] [Bibtex]
      @TECHREPORT{drbmemo2010,
      author = {{U.S. Census Bureau}},
      title = {{DRB} {M}emo {S}eptember 20, 2010},
      institution = {U.S. Census Bureau},
      year = {2010},
      month = {September 20},
      owner = {vilhuber},
      timestamp = {2013.10.07},
      url = {http://www2.vrdc.cornell.edu/news/wp-content/uploads/2011/01/DRBMemoSeptember202010.pdf}
      }
    • U.S. Census Bureau, "Codebook for SIPP Synthetic Beta version 5.0," U.S. Census Bureau, 2010.
      [PDF] [URL] [Bibtex]
      @TECHREPORT{ssb_codebook,
      author = {{U.S. Census Bureau}},
      title = {Codebook for {SIPP} {Synthetic} {Beta} version 5.0},
      institution = {U.S. Census Bureau},
      year = {2010},
      abstract = {This codebook documents version 5.0 of the SIPP Synthetic Beta (SSB).
      The SSB is a set of files containing individual-level data synthesized
      from linked survey and administrative data. The SSB is produced by
      the US Census Bureau as part of a joint project with the Social Security
      Administration (SSA), and the Internal Revenue Service (IRS). The
      goal of the project is to make some of the benefits of linked survey
      and administrative data available to researchers outside of restricted‐access
      Census Bureau facilities in a manner that protects the confidentiality
      of the underlying data.},
      comment = {Original location: http://www.census.gov/sipp/SSB_Codebook.pdf},
      owner = {vilhuber},
      timestamp = {2013.10.07},
      url = {http://www2.vrdc.cornell.edu/news/wp-content/uploads/2011/01/SSB_Codebook.pdf}
      }
  • Version 4.x:
    • J. M. Abowd, G. Benedetto, and M. Stinson, "Using the SIPP Synthetic Beta for analysis," U.S. Census Bureau, Training provided to participants at a meeting at the U.S. Census Bureau on October 26, 2007 , , 2007.
      [PDF] [URL] [Bibtex]
      @TECHREPORT{sipp_synthetic_beta_training_final_20071026,
      author = {John M. Abowd and Gary Benedetto and Martha Stinson},
      title = {Using the {SIPP} {Synthetic} {Beta} for Analysis},
      institution = {U.S. Census Bureau},
      year = {2007},
      type = {Training provided to participants at a meeting at the U.S. Census
      Bureau on October 26, 2007},
      owner = {vilhuber},
      timestamp = {2013.10.08},
      url = {http://www2.vrdc.cornell.edu/news/?p=306}
      }
    • J. M. Abowd, M. Stinson, and G. Benedetto, "Final report to the Social Security Administration on the SIPP/SSA/IRS Public Use File Project," U.S. Census Bureau, 2006.
      [PDF] [URL] [Bibtex]
      @TECHREPORT{ssafinal,
      author = {John M. Abowd and Martha Stinson and Gary Benedetto},
      title = {Final Report to the {Social Security Administration} on the {SIPP/SSA/IRS}
      {Public} {Use} {File} {Project}},
      institution = {U.S. Census Bureau},
      year = {2006},
      owner = {vilhuber},
      timestamp = {2013.10.07},
      url = {http://www2.vrdc.cornell.edu/news/?p=308}
      }
    • U.S. Census Bureau, "Codebook for the SIPP Synthetic Beta version 4.1," U.S. Census Bureau, 2007.
      [PDF] [URL] [Bibtex]
      @TECHREPORT{technicaldescriptionsippsyntheticbetaoct42007,
      author = {{U.S. Census Bureau}},
      title = {Codebook for the {SIPP} {Synthetic} {Beta} Version 4.1},
      institution = {U.S. Census Bureau},
      year = {2007},
      month = {October},
      abstract = {This codebook documents version 5.0 of the SIPP Synthetic Beta (SSB).
      The SSB is a set of files containing individual-level data synthesized
      from linked survey and administrative data. The SSB is produced by
      the US Census Bureau as part of a joint project with the Social Security
      Administration (SSA), and the Internal Revenue Service (IRS). The
      goal of the project is to make some of the benefits of linked survey
      and administrative data available to researchers outside of restricted‐access
      Census Bureau facilities in a manner that protects the confidentiality
      of the underlying data.},
      owner = {vilhuber},
      timestamp = {2013.10.07},
      url = {http://www.census.gov/sipp/technicaldescriptionsippsyntheticbetaoct42007.pdf}
      }
    • U.S. Census Bureau, "DRB memo on disclosure testing the SIPP Synthetic Beta," U.S. Census Bureau, 2006.
      [URL] [Bibtex]
      @TECHREPORT{drbmemnov2006,
      author = {{U.S. Census Bureau}},
      title = {{DRB} Memo on Disclosure Testing the {SIPP} {Synthetic} {Beta}},
      institution = {U.S. Census Bureau},
      year = {2006},
      month = {September 20},
      owner = {vilhuber},
      timestamp = {2013.10.07},
      url = {http://www2.vrdc.cornell.edu/news/?p=307}
      }

Accessing the data

Once an account on the Synthetic Data Server has been established, you will find template programs and instructions on the server under

 /rdcprojects/co00517/SSB/data/
                                  current -> v5.0
                                  v4.2/
                                  v5.0/
                                  v5.1/
                                  users/
 /rdcprojects/co00517/SSB/programs/
                                  template/v4.2
                                  template/v5.0
                                  users/

Citing and Funding Acknowledgement

We ask that users of the data give credit to the different funders that contributed to the creation and distribution of the data:

The creation of the SIPP Synthetic Beta was funded by the US Census Bureau and SSA, with additional funding from NSF Grants #0427889 and #0339191.The Synthetic Data Server is funded through NSF grant SES-1042181

The data itself can be cited as

  • U.S. Census Bureau, "SIPP Synthetic Beta version 5.1," {U.S. Census Bureau} and Cornell University, Synthetic Data Server [distributor], Washington,DC and Ithaca, NY, USA, [Computer file] , , 2013.
    [URL] [Bibtex]
    @TECHREPORT{SSB5.1,
    author = {{U.S. Census Bureau}},
    title = {{SIPP} {S}ynthetic {B}eta Version 5.1},
    institution = {{U.S. Census Bureau} and Cornell University, Synthetic Data Server
    [distributor]},
    year = {2013},
    type = {[Computer file]},
    address = {Washington,DC and Ithaca, NY, USA},
    howpublished = {Computer file},
    organization = {Cornell University, Synthetic Data Server [distributor]},
    owner = {vilhuber},
    timestamp = {2013.06.10},
    url = {http://www2.vrdc.cornell.edu/news/data/sipp-synthetic-beta-file/}
    }
  • U.S. Census Bureau, "SIPP Synthetic Beta version 5.0," {U.S. Census Bureau} and Cornell University, Synthetic Data Server [distributor], Washington,DC and Ithaca, NY, USA, [Computer file] , , 2011.
    [URL] [Bibtex]
    @TECHREPORT{SSB5.0,
    author = {{U.S. Census Bureau}},
    title = {{SIPP} {S}ynthetic {B}eta Version 5.0},
    institution = {{U.S. Census Bureau} and Cornell University, Synthetic Data Server
    [distributor]},
    year = {2011},
    type = {[Computer file]},
    address = {Washington,DC and Ithaca, NY, USA},
    howpublished = {Computer file},
    organization = {Cornell University, Synthetic Data Server [distributor]},
    owner = {vilhuber},
    timestamp = {2013.06.10},
    url = {http://www2.vrdc.cornell.edu/news/data/sipp-synthetic-beta-file/}
    }