Table of contents |
The following pages may also be of interest:
- Prototype National QWI
- OnTheMap/LODES Synthetic Data
- QWI Public Use Data
- SIPP Synthetic Beta file
- LBD Synthetic Data v2
- Data archives
1 Overview
The following datasets are available on the VirtualRDC, in either dummy (zero-obs) form, or with some content based on public-use data. Check the Data section for updated documentation on these datasets. Some documentation on the full dataset may be available on the CES website.
2 Types of datasets
The VirtualRDC does not host confidential data of any type. Here's what types of data we have:
- public -use data: Public use data not (easily) available elsewhere, or because they have a use in the analysis of synthetic data.
- Synthetic data: replicates the exact structure and the statistical properties of the original (confidential) dataset, without using any of the original confidential data content. Access to some of these data still requires using special compute servers (see Synthetic Data Server).
- Zero-obs or dummy datasets: replicate the exact structure of the original dataset, but contain no data at all - no original data, no synthetic data, none.
- Public-use based replica: replicates the exact structure of the original (confidential) dataset, with the content of a similar publi-use dataset reformatted to resemble the original dataset
3 Available public-use datasets
- Download-only:
- Access on compute servers only :
- Public-use files (constantly but idiosynchratically updated)
/data/clean for clean SAS and Stata data sets /data/doc for documentation on the available files
You will find SAS and Stata versions of many datasets, including QWI and OnTheMap, here. If you see a dataset missing which you would wish to have available, let us know.
- There might also be a small set of public-use data in the RDC-compatible structure under
/ssgprojects/virtualrdc/{economic,demographic,decennial,mixed}
- Public-use files (constantly but idiosynchratically updated)
4 Available synthetic datasets
- OnTheMap synthetic data (1 replicate from Versions 3, 4, and 5 --the current version on the Census Bureau OnTheMap site). Please consult the VirtualRDC OnTheMap website
- Synthetic Longitudinal Business Database (using Synthetic Data Server)
- SIPP Synthetic Beta file (using Synthetic Data Server)
5 Available dummy datasets
Zero-obs datasets have the basic structure of the (confidential) datasets, without containing any data. To effectively use them, users should reference a path as (in SAS)
%let prefix=/data/virtualrdc; libname INPUTS "&prefix./economic/ocd/microdata";
When transferring programs to the Census RDC, setting
%let prefix=;
and making no other changes will let the programs work as-is on the confidential data in the Census RDC.
The following zero-obs datasets are available on VirtualRDC nodes:
- LEHD Infrastructure files FROM 2004 SNAPSHOT (documentation)
/mixed/lehd/s2004/brb/us/brb_us_brlist.sas7bdat /mixed/lehd/s2004/brb/us/brb_us_ecflist.sas7bdat /mixed/lehd/s2004/brb/us/brb_us_xwalk.sas7bdat /mixed/lehd/s2004/ecft26/ca/ecf_ca_sein_t26.sas7bdat /mixed/lehd/s2004/ecft26/ca/ecf_ca_seinunit_t26.sas7bdat /mixed/lehd/s2004/ecft26/ca/ecf_zz_sein_t26.sas7bdat /mixed/lehd/s2004/ecft26/ca/ecf_zz_seinunit_t26.sas7bdat /mixed/lehd/s2004/ecft26/ca/es202_ca_t26.sas7bdat /mixed/lehd/s2004/ecf/zz/ecf_zz_sein.sas7bdat /mixed/lehd/s2004/ecf/zz/ecf_zz_seinunit.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_controltotals.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_phf.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_sein_employment.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_shf.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_uhf.sas7bdat /mixed/lehd/s2004/ehf/zz/ehf_zz_uniqpik.sas7bdat /mixed/lehd/s2004/galt26/zz/gal_zz_2003_br_2001.sas7bdat /mixed/lehd/s2004/galt26/zz/gal_zz_2003_t26flags.sas7bdat /mixed/lehd/s2004/galt26/zz/gal_zz_2003_t26.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_acspow_2001.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_ahs_2002.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_es202only.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_maf_2004.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003.sas7bdat /mixed/lehd/s2004/gal/zz/gal_zz_2003_xwlk_2001.sas7bdat /mixed/lehd/s2004/icft26/zz/icf_zz_t26.sas7bdat /mixed/lehd/s2004/icf/zz/icf_zz_implicates_age_sex.sas7bdat /mixed/lehd/s2004/icf/zz/icf_zz_implicates_county.sas7bdat /mixed/lehd/s2004/icf/zz/icf_zz_implicates_education.sas7bdat /mixed/lehd/s2004/icf/zz/icf_zz.sas7bdat /mixed/lehd/s2004/qwi/zz/qwi_zz_seinunit.sas7bdat /mixed/lehd/s2004/u2w/zz/u2w_zz.sas7bdat
- Other Census RDC datasets:
demographic/acs demographic/ahs demographic/cps demographic/sip economic/asm economic/brdis economic/cbo economic/cmf economic/crt economic/csr economic/ilbd economic/lbd economic/meps economic/ocd economic/qfr economic/rnd
- Internal decennial census microdata dummy datasets
decennial/1970cen decennial/1980cen decennial/1990cen