1/18/2018 Lecture Notes
January 18, 2018
Welcome back! Professor Pacheco is out of town today. You should begin the class working on Assignment 2, which is to find some data from the New York State data site and tell a story with it using Instagram.
At 1 p.m., Professor Jodi Upton, our Knight Chair in Data and Explanatory Journalism, will arrive to give a guest lecture about data journalism.
1/16/2018 Lecture Notes: Welcome to Dataviz!
January 11, 2018
Tues 1/16/2018 Class: Welcome!
12:30-12:50: Welcome! An Introduction to using data visualization to tell stories.
- Quick roll call.
- Before next class, please register for the class blog (follow instructions from an email the system sent you) and fill out this survey.
- Review how the class site is organized.
- Walkthrough of the syllabus and schedule.
12:50-1: What is Dataviz?
- Some things can be better understood by seeing or exploring. Here are some good examples:
- News you can Use: Is it better to buy or rent?
- Understanding complexity. Which Supreme Court justices agree or disagree?
- Understanding behaviors. Where do voters in NYC live?
- Understanding sentiment. Emotional arcs during presidential addresses.
- Understanding geographic distribution. Perentage of people without health insurance by U.S. county.
- Understanding our world. http://Hint.fm/wind
- Having fun! Here are some visualizations about where in the world people have the best and worst sex and recurring themes in Arrested Development.
- More examples: Here are some more I have collected, and on the #Dataviz Twitter hashtag and Data is Beautiful Reddit.
- And how about some really bad examples!
1-1:15: Exercise: And now for a little magic. Introduce yourself through data!
- Open this URL. How did that data get in there? Let’s find out together.
- Google yourself and find an image that is publicly available on the web. Right-click the image and get the URL.
- Go to this Google spreadsheet and fill out your info. Put your image URL into the correct column, and put this code around it:
<img src=”YOURURLHERE” width=”200″ />
(Note: type this in, don’t copy and paste from this page).
- Take a look at this URL or the home page to see class data populating in real time.
Congrats! You Participated in a Data Visualization
We will go through some basic features of Excel, and formulas.
- Adding information as data
- Add a formula
- Columns and rows.
- Formula: using the equal sign for functions. Basic math.
- Sum columns or rows.
- Select an area.
- Format cells to change cell type (text, number).
- Making charts in Excel.
- Common formulas: adding, subtracting, dividing, multiplying, summing.
- CSV format versus native Excel format.
1:25-1:40: Putting it into practice: Sorting and filtering NYS bridge data
- Bridges across the country are badly in need of repair, and it can literally be a life or death issue. Here’s more about that.
- Download and unzip this data set of 51,000 bridges in New York State. bridges_blanksremoved.csv
- Open it in Excel. Scroll right until you find the column “critfrac” (column DL) which stands for critical fracture. A y12 or y24 means outdated design, so a single solid hit can bring the entire bridge down.
- Next, find the column “suffrtno” (column FC), which stands for Sufficiency Rating. Anything under 50 is considered dangerous.
- Also note the “totlcost” (total cost to fix in thousands of dollars) in column DV, and “avdayno” (average daily traffic) in column AK.
- How many bridges are in danger of collapsing due to critical fracture?
- How many bridges have an inadequate sufficiency rating?
- How many have both bad critical fracture and sufficiency rating numbers?
- How much traffic goes over the bridges with both bad critfrac and suffrtno ratings? (Use the data from “avdayno,” column AK).
- How much will it cost to fix the bridges with both bad critfrac and suffrtno ratings? (Use “totlcost”, column DV).Go through these yourself, then let’s review the answers and how to get them.
1:40-1:50: Looking ahead: