Non-Web Data Resources


In addition to the resources available on the Data Freeway and to our collection of ICPSR datasets, the Data Center also has a wide variety of resources available in the EDC itself. These resources are usually CD-ROM databases accessible via the Data Center's public workstations. In some cases, the CD databases also have on-line counterparts, but the CD versions of the databases are generally more user-friendly than are their on-line counterparts.

Area Resource File (ARF)--The Area Resource File is a database of health resources data, measured at the county level for over 6,000 indicators. The information includes measures of employment in various health professions; availability of health facilities; frequency of utilization; hospital and Medicare expenditures; and demographic and geographic indicators by county. The database is installed on all of the EDC's public workstations. Data from the ARF are also available here and from the ICPSR. More information about the ARF is available from this website.

Banks Cross-National Time Series Data Archive (CNTS)--The Banks CNTS contains time-series data on a variety of economic, political, and social topics (e.g. GDP, regime type, population, infrastructure). A complete list of variables is available here. The temporal coverage is from 1815 to 2009, varying by country and topic. The geographic coverage is global. We cannot distribute the dataset in its entirety to users but can help users create subsets of countries, years, and variables which they can take with them for use in their analysis. An older version of the database with data up to 1994 is available in Excel here.

Consumer Expenditure Survey (CES) Microdata--The CES is conducted by the Bureau of Labor Statistics and provides data on the buying habits of consumers within the United States. For a given year, the data files consist of (a) quarterly "interview" surveys and related expenditure data, which provide information on households' income, demographics, and expenses, and (b) "diary" surveys that provide detailed data on short-term expenses for frequently-purchased items. The Data Center has microdata files on CD-ROM from 1996-2008. Some CES microdata are also available via the ICPSR and via the National Bureau of Economic Research. Aggregated data and statistics from the CES and more detailed information about the survey are available from the Bureau of Labor Statistics.

County and City Data Book 2000--The EDC has a CD version of the popular data source. The CD contains data in Excel format.

Current Population Survey Merged Outgoing Rotation Groups (MORG) Data--The CPS Merged Outgoing Rotation Groups Data consist of data from households that are being rotated out of the CPS sample, either temporarily or permanently. Households interviewed for the CPS are interviewed for 4 consecutive months, then rotated out of the sample for 8 months, then interviewed again for 4 months, and then rotated out of the sample permanently - it is these outgoing interview groups that are included in the MORG data. The files contain data for items such as labor force status, industry and occupation of employment, weekly earnings (not annual earnings/income), and standard demographic information. More information about the contents of the MORG data is available via the National Bureau of Economic Research. Note that the MORG data are not the same as the data from the Annual Social and Economic Supplement to the CPS that is conducted in March of each year. Patrons interested in the ASEC data should go to the NBER's collection of CPS data files, to IPUMS-CPS, or to the DataFerrett (see below) instead.

DataFerrett--The DataFerrett (Federal Electronic Research and Review Extraction Tool) is a joint effort between the Census Bureau and the CDC. The software allows for easier access to and subsetting of many datasets produced by the Census Bureau, the CDC, and the Bureau of Labor Statistics, such as the Current Population Survey series, the American Housing Survey, and the Behavioral Risk-Factor Surveillance System. Most of the data accessible via the Ferrett are microdata, but some aggregated data (e.g. tabulated data from SF3 of the 2000 Census) are also accessible. A full list of datasets currently available via the Ferrett is located at http://www.thedataweb.org/datasets.html. Users can create subsets of available datasets and do tabulations and crosstabluations within the Ferrett itself or create files for use in SAS, SPSS, Stata, or Excel. The software is loaded onto all the EDC public workstations and is also available for download here.

Datastream Advance--Datastream Advance contains a very extensive range of economic data including stock prices and indices, bond indices, exchange rates, commodity prices, and a wide range of macroeconomic indicators. The data are global in coverage, but with variation across regions and time. Users can export data directly into Excel. Datastream Advance is on all the EDC workstations.

EUGene--EUGene (Expected Utility Data Generation and Management Program) is an application that allows users to create custom subsets of datasets pertaining to international conflict. Users can export these subsets into both SPSS and Stata formats. A detailed list of the variables and datasets available via EUGene is located here (click on "Variables Available" at the left). The software is on all the Data Center's public workstations and is also available for download.

Eurostats Statistics DVD--The Eurostats Statistics DVD interface is loaded onto all EDC public workstations. It allows users to access a wide range of economic and social data for EU member-states at both the national and sub-national level. Some of the databases within the DVD also include data on non-EU members elsewhere in Europe as well as major developed countries such as Australia, Japan, and the United States. There is also a (very clunky) on-line interface to this resource (access is restricted to Emory's IP range only). Eurostat also makes much of these data available at http://ec.europa.eu/eurostat.

HIV/AIDS Surveillance Data Base--The HIV/AIDS Surveillance Data Base, which is produced by the International Program Center of the U.S. Census Bureau, is a compilation of data on the prevalence of HIV/AIDS, taken from various studies published in the medical and scientific literature. The data are available via a simple Microsoft Access interface that allows users to select information by country or survey population or different measures of prevalence.

International City/County Management Association (ICMA) Local Government Survey Data Sets--The Data Center has several datasets from ICMA's collection of surveys of local governments. These surveys cover various topics such as form/structure of local governments, barriers to local economic development, and funding for homeland security programs. The specific datasets in the EDC's collections are the Municipal Form of Governments surveys (1996, 2001, 2006), the County Form of Governments surveys (1997, 2002), the Economic Development surveys (1999, 2004), and the Homeland Security survey (2005).

International Labour Organization Key Indicators of the Labour Market (KILM)--The KILM database contains annual data for various labor market indicators in different countries. These indicators include various measures of employment and unemployment, labor force status, wages/compensation, and poverty. The data are global in coverage and cover the time period from 1980 onwards (with variation in coverage by country or indicator). The database is available for download and installation from this location.

International Monetary Fund Balance of Payments Statistics (BOPS)--The BOPS contains detailed data on countries' international economic transactions in their current accounts, capital accounts, and financial accounts. Topics covered include trade in goods and services, foreign direct investment, portfolio investment, and holdings of reserve assets. The BOPS data are also available via Databases at Emory and as ICPSR Study #8623.

International Monetary Fund Direction of Trade Statistics (DOTS)--The DOTS data contain monthly, quarterly, and yearly data on countries' exports and imports, broken down by individual trade partners. The data are also available via Databases at Emory and as ICPSR Study #7628.

International Monetary Fund Government Finance Statistics (GFS)--This IMF resource contains very detailed data on government spending and revenue, broken down by both various categories of expenditure/revenue and by different levels of government. Data coverage is from 1972 onwards, varying by country, and the data are split across two CD-ROM's - an "historical" database that covers 1972-1989 and a second database that covers 1990 onwards. We recommend that patrons use this database in conjunction with print editions of the Government Finance Statistics, which are available in Woodruff Library. The data are also available via Databases at Emory and as ICPSR Study #8624.

International Monetary Fund International Financial Statistics (IFS)--The CD version of the IFS is on all of the EDC's public workstations. The CD has a more straightforward interface than does the web version of the International Financial Statistics.

International Telecommunications Union World Telecommunications Indicators (WTI)--The WTI contains time- series data on IT/telecommunications infrastructure. The geographic coverage is global. The temporal coverage is annual from 1975 onwards and in 5-year increments from 1960 to 1975. The interface is the same as that for the WDI and allows users to export data directly into Excel. The WTI is on all the EDC's public workstations.

International Terrorism: Attributes of Terrorist Events (ITERATE)--ITERATE contains descriptive information on individual acts of terrorism, such as locations, casualties, responsible parties, methods used, evidence of state sponsorship, and coding of success. There are additional files for hijackings and hostage-taking incidents. The quantitative data are supplemented by textual descriptions of these events. For an additional description of the data, see this link at UCSD: http://libraries.ucsd.edu/ssds/iterate.html.

MARGene--MARGene (Minorities at Risk Data Generation and Management Program) is an application that allows users to create custom subsets of data from the Minorities at Risk Project. The MAR Project is affiliated with the Integrated Network for Societal Conflict Research and the Center for International Development and Conflict Research at the University of Maryland. The project tracks the status of nearly 300 "politically-active communal groups" and the MAR data are used extensively by scholars of ethnic conflict. The software is very similar to EUGene and users can export their desired groups, years, and variables into SPSS or Stata files. MARGene is on all the EDC public workstations. It is also available for download.

National Bureau of Economic Research (NBER) Trade Database--The NBER Trade Database contains multiple data sources measuring trade flows and countries' endowments in factors of production. It is available on CD-ROM in the Data Center.

National Practitioner Data Bank (NPDB)--The NPDB contains data pertaining to malpractice suits such as the nature and severity of malpractice allegations, the nature and amount of any payment(s) made during malpractice proceedings, and so forth. The data are in SPSS format. Please note that certain variables are limited in terms of detail for confidentiality reasons. The data are also available for download here (registration is required).

National Survey of Family Growth, Cycle 6, 2002 (NSFG)--The NSFG is an on-going survey series focusing on matters of family life/history and reproductive health. The topics covered include schooling, family background, marriage histories, contraceptive use, pregnancy outcomes, sexually transmitted diseases, and insurance, with additional variables for religion, race/ethnicity, and employment. The EDC has the latest version of the NSFG availabe on CD with the data ready for use in SAS, SPSS, or Stata formats. You can read more about the NSFG here. Various waves of the NSFG are also available via the Social Science Electronic Data Library and directly from the National Center for Health Statistics.

Neighborhood Change Database (NCDB)--The NCDB is produced by the Geolytics Corporation and contains data from the 1970, 1980, 1990, and 2000 U.S. Census. The NCDB allows users to "normalize" data from pre-2000 iterations of the Census and convert the data into 2000 Census tracts. Users can export the data into Excel-ready spreadsheets and also create digital tract-boundary files for use with GIS software such as ArcView. More information about the NCDB is available via the Geolytics website.

OECD Health Data--The OECD Health Data database contains much time-series data on health conditions and health policy in the OECD countries (e.g. mortality rates by cause of death, health-related lifestyle indicators, public and private health expenditures). Data within the database can be saved into Excel. The interface for the database is on all the EDC public workstations and is also available for download from SourceOECD.

State Personal Income--The SPI database is produced by the Regional Economic Information System division of the Bureau of Economic Analysis. It contains time-series state-level data on peronal income, earnings, wages, and employment broken down by industry, as well as state-level data on transfer payments and tax payments. The data can be exported into Excel-ready formats. Please note that the data on this CD are not necessarily compatible with the data available on-line from the BEA.

Statistical Abstract of the United States--The EDC has a CD version of the popular data source. The CD contains data in Excel format, and the data on the CD often cover longer periods of time than is the case with the print copies or with the on-line version of the Abstract.

TRAINS for Windows--TRAINS (Trade Analysis and Information System) is an UNCTAD database with product-specific data on trade flows, tariffs, and non-tariff barriers. There are two versions of the database - an older CD version with time-series data and an HTML-based version with more recent data. The interfaces for both versions are on all the EDC workstations. Note that the CD interface is not very user-friendly. Data from TRAINS are also now available on-line via the World Integrated Trade Solution.

UNIDO Industry Demand-Supply Balance (IDSB) Database--The IDSB Database is produced by the United Nations Industrial Development Organization (UNIDO) and has time-series industry-level data for multiple developed and developing countries. The database classifies industries using the International Standard Industrial Classification (ISIC) codes. The industry-level variables include consumption, output, exports, and imports. Users can create subsets of data and export them directly into spreadsheets.

UNIDO Industrial Statistics (INDSTAT) Databases--The INDSTAT databases are produced by the United Nations Industrial Development Organization (UNIDO) and have time-series industry-level data for multiple developed and developing countries. The databases classify industries using the International Standard Industrial Classification (ISIC) codes. The industry-level variables include numbers of establishments and employees, output, value-added, and capital formation. There are two INDSTAT databases - one at the three-digit ISIC level and one at the four-digit ISIC level.

Voteview for Windows--Voteview is software developed by Keith Poole and Harold Rosenthal for analysis of Congressional role-call votes. It includes data on all roll-call votes from the 1st-106th Congresses (1789-2000) and also provides ideological scores for members of Congress. Users can export data into Excel-ready spreadsheets and map out distributions of votes by congressional district or state. The software is also available for download here.

Ward's World Motor Vehicle Data 2001--The Ward's World Motor Vehicle Data CD contains time-series data on motor-vehicle sales and production. The geographic coverage is global and the data are in Excel.

WIIW Database on Foreign Direct Investment in Central, East, and Southeast Europe--The WIIW (the Vienna Institute for International Economic Studies) produces this collection, which provides various measures of FDI for transition countries in Central and Eastern Europe. The CD provides information on both stocks and flows of inward and outward FDI for the region, and provides additional breakdowns by either partner country or economic sector.

WIIW Handbook of Statistics - Central, East and Southeast Europe--The WIIW (the Vienna Institute for International Economic Studies) produces an annual compendium of data and statistics for transition countries in Central and Eastern Europe. This CD includes information on various macroeconomic variables (e.g. GDP, consumption, trade, wage) and on various social indicators (e.g. population demographics).

World Bank Africa Database (WBAD)--The WBAD is very similar to the World Development Indicators see below) but focuses exclusively on African countries and has a wider, larger range of variables in it. The World Bank Africa Database is also now available on-line as the Africa Development Indicators.

World Bank Global Development Finance (GDF)--The CD version of the GDF is on all of the EDC's public workstations. The GDF contains annual statistics on stocks and flows of public and private debt owed over 100 countries, with data coverage extending back to 1970 (varying by series) and projections for certain indicators extending up to 2010. The Global Development Finance database is also now available on-line from the World Bank.

World Bank World Development Indicators (WDI)--The CD-ROM version of the WDI is on all of the EDC's public workstations. The CD-ROM has a more powerful interface than that for the on-line version of the WDI in that it allows users more control over the format and arrangement of their data files. It also includes .pdf copies of the print edition of the WDI and more in the way of documentation of variable definitions and sources.

World Trade Analyzer (WTA)--The WTA provides bilateral data on trade in specific products between pairs of countries. The geographic coverage is global and the years covered go back to 1980. Users can export the data directly into Excel and can "shape" how the data will appear in their spreadsheets. We have two versions of the WTA in the Data Center - one for the years 1980-1999 and one for the years 1988-2005 (the latter version contains data on post-communist countries in Eastern Europe). The WTA is on all the EDC public workstations.