Getting Started with U.S. Census Data
U.S. Census QuickFacts data is an amazing resource which is available to the public free of charge. This treasure trove of information is an incredibly useful resource for performing many different types of analysis involving where people live in the United States as well as other demographic information, all of which is broken down by county and state. While U.S. Census QuickFacts data is available and summarized from multiple sources throughout the Internet, the U.S. Census Bureau makes the information directly available only via flat text files on their web site. Fear not! Without too much effort, we can take those flat files and import them into a transaction data source.
In this post, I’ll be covering the process of downing the U.S. Census QuickFacts data from their website (currently data from the 2010 census) and importing it as new tables into an existing SQL Server database using SQL Server Management Studio and a little bit of Microsoft Excel. I’m using SQL Server 2012 and Microsoft Excel 2013, but any recent version of both should work.
The first step in using U.S. Census QuickFacts data is to download it. This can be done via the U.S. Census Bureau Website at which there are three files of interest:
- DataSet.txt — the actual census data in a CSV file
- FIPS_CountyName.txt — Columns which give the codes for each country and state used within the DataSet file.
- DataDict.txt — descriptions of what each column name represents. This file is a useful reference, but we won’t be importing it into our database.
Start things off by downloading the three text files.
|U.S. Census data available for download from the Census Bureau website.|