Skip to main content

Where is the Social Science Gateway?

The Social Science Gateway (SSG) grant has ended, please read here about ongoing availability of resources created as part of that project.

We support:

APDU Logo foas-logo-small (2)

Data @ VirtualRDC

Table of contents

The following pages may also be of interest:

1 Overview

The following datasets are available on the VirtualRDC, in either dummy (zero-obs) form, or with some content based on public-use data. Check the Data section for updated documentation on these datasets. Some documentation on the full dataset may be available on the CES website.

2 Types of datasets

The VirtualRDC does not host confidential data of any type. Here's what types of data we have:

  • public -use data: Public use data not (easily) available elsewhere, or because they have a use in the analysis of synthetic data.
  • Synthetic data: replicates the exact structure and the statistical properties of the original (confidential) dataset, without using any of the original confidential data content. Access to some of these data still requires using special compute servers (see Synthetic Data Server).
  • Zero-obs or dummy datasets: replicate the exact structure of the original dataset, but contain no data at all - no original data, no synthetic data, none.
  • Public-use based replica: replicates the exact structure of the original (confidential) dataset, with the content of a similar publi-use dataset reformatted to resemble the original dataset

3 Available public-use datasets

  • Download-only:
  • Access on compute servers only :
    • Public-use  files (constantly but idiosynchratically updated)
      /data/clean for clean SAS and Stata data sets
      /data/doc for documentation on the available files

      You will find SAS and Stata versions of many datasets, including QWI and OnTheMap, here. If you see a dataset missing which you would wish to have available, let us know.

    • There might also be a small set of public-use data in the RDC-compatible structure under
      /ssgprojects/virtualrdc/{economic,demographic,decennial,mixed}

4 Available synthetic datasets

5 Available dummy datasets

Zero-obs datasets have the basic structure of the (confidential) datasets, without containing any data. To effectively use them, users should reference a path as (in SAS)

%let prefix=/data/virtualrdc;
libname INPUTS "&prefix./economic/ocd/microdata";

When transferring programs to the Census RDC, setting

%let prefix=;

and making no other changes will let the programs work as-is on the confidential data in the Census RDC.

The following zero-obs datasets are available on VirtualRDC nodes:

  • LEHD Infrastructure files FROM 2004 SNAPSHOT (documentation)
    /mixed/lehd/s2004/brb/us/brb_us_brlist.sas7bdat
    /mixed/lehd/s2004/brb/us/brb_us_ecflist.sas7bdat
    /mixed/lehd/s2004/brb/us/brb_us_xwalk.sas7bdat
    /mixed/lehd/s2004/ecft26/ca/ecf_ca_sein_t26.sas7bdat
    /mixed/lehd/s2004/ecft26/ca/ecf_ca_seinunit_t26.sas7bdat
    /mixed/lehd/s2004/ecft26/ca/ecf_zz_sein_t26.sas7bdat
    /mixed/lehd/s2004/ecft26/ca/ecf_zz_seinunit_t26.sas7bdat
    /mixed/lehd/s2004/ecft26/ca/es202_ca_t26.sas7bdat
    /mixed/lehd/s2004/ecf/zz/ecf_zz_sein.sas7bdat
    /mixed/lehd/s2004/ecf/zz/ecf_zz_seinunit.sas7bdat
    /mixed/lehd/s2004/ehf/zz/ehf_zz_controltotals.sas7bdat
    /mixed/lehd/s2004/ehf/zz/ehf_zz_phf.sas7bdat
    /mixed/lehd/s2004/ehf/zz/ehf_zz.sas7bdat
    /mixed/lehd/s2004/ehf/zz/ehf_zz_sein_employment.sas7bdat
    /mixed/lehd/s2004/ehf/zz/ehf_zz_shf.sas7bdat
    /mixed/lehd/s2004/ehf/zz/ehf_zz_uhf.sas7bdat
    /mixed/lehd/s2004/ehf/zz/ehf_zz_uniqpik.sas7bdat
    /mixed/lehd/s2004/galt26/zz/gal_zz_2003_br_2001.sas7bdat
    /mixed/lehd/s2004/galt26/zz/gal_zz_2003_t26flags.sas7bdat
    /mixed/lehd/s2004/galt26/zz/gal_zz_2003_t26.sas7bdat
    /mixed/lehd/s2004/gal/zz/gal_zz_2003_acspow_2001.sas7bdat
    /mixed/lehd/s2004/gal/zz/gal_zz_2003_ahs_2002.sas7bdat
    /mixed/lehd/s2004/gal/zz/gal_zz_2003_es202only.sas7bdat
    /mixed/lehd/s2004/gal/zz/gal_zz_2003_maf_2004.sas7bdat
    /mixed/lehd/s2004/gal/zz/gal_zz_2003.sas7bdat
    /mixed/lehd/s2004/gal/zz/gal_zz_2003_xwlk_2001.sas7bdat
    /mixed/lehd/s2004/icft26/zz/icf_zz_t26.sas7bdat
    /mixed/lehd/s2004/icf/zz/icf_zz_implicates_age_sex.sas7bdat
    /mixed/lehd/s2004/icf/zz/icf_zz_implicates_county.sas7bdat
    /mixed/lehd/s2004/icf/zz/icf_zz_implicates_education.sas7bdat
    /mixed/lehd/s2004/icf/zz/icf_zz.sas7bdat
    /mixed/lehd/s2004/qwi/zz/qwi_zz_seinunit.sas7bdat
    /mixed/lehd/s2004/u2w/zz/u2w_zz.sas7bdat
  • Other Census RDC datasets:
    demographic/acs
    demographic/ahs
    demographic/cps
    demographic/sip
    economic/asm
    economic/brdis
    economic/cbo
    economic/cmf
    economic/crt
    economic/csr
    economic/ilbd
    economic/lbd
    economic/meps
    economic/ocd
    economic/qfr
    economic/rnd
  • Internal decennial census microdata dummy datasets
    decennial/1970cen
    decennial/1980cen
    decennial/1990cen