Sorting Data in Excel: Simple Analysis

Sorting a data set helps answer a basic question journalists like to ask: “Which ____ has the highest (or lowest) ______?”

Excel (and other spreadsheets such as the open source Calc) make sorting data easy. In fact, I often make sorting my first step when “interviewing” data because it quickly reveals high and low values and often highlights some that may seem questionable.

Let’s work through a simple sort in Excel. I’ll be using Excel 2007, but older versions have similar functions. Start by downloading the file “sorting.xls” and saving it to your computer. Open it and follow along:

1. We have a table of Census data from the 2006-2008 American Community Survey. It shows the median age of the population for each of 79 school districts in Virginia plus the state itself.

We want to know which district has the oldest and youngest populations. Let’s sort it!

2. Click once on one cell anywhere in the table. This will help Excel auto-discover your table in the next step.

(more…)

‘Trouble on the Tray’ Wins EWA Award

Good news for our USA TODAY team that researched, reported and wrote the “Trouble on the Tray” series on school lunch safety: The Education Writers Association yesterday named it a winner in the 2009 National Awards for Education Reporting. The series — reported by Blake Morrison, Peter Eisler and Elizabeth Weise with data analysis by yours truly — received first prize in the “Large Media — Investigative Reporting” category.

I’m giving a talk this week at the IRE Computer Assisted Reporting conference on how we acquired and analyzed the federal data that helped fuel the story.

Major stories in the series include:

Schools in the dark about tainted lunches
Why a recall of tainted beef didn’t include school lunches
Fast-food standards for meat top those for school lunches
26,500 school cafeterias lack required inspections

Our series spurred congressional calls for reforms to USDA policies, and in February the agency announced tighter requirements on companies that supply food to the National School Lunch Program, including stricter testing of meat.

Spreading data journalism in the newsroom

A reporter called recently for tips on setting up “a CAR desk” in the newsroom of a decent-sized community newspaper. The editor had watched the reporter’s success at gathering and analyzing data and, as typically happens,  now wanted the reporter to train the rest of the newsroom.

Here was my advice:

Focus on a few: Instead of holding building-wide Excel classes or database journalism seminars, start with just one or two reporters who show a combination of interest and decent technical smarts. That lets you go deep on a couple of beats rather than spread yourself thin. Also, success breeds success. Watching a few reporters land great stories will possibly spur interest from others.

Have the right goals: Goals like “publish one CAR story a week” miss the point. Better objectives are to have data-thinking ever present in the reporter’s mind, have the reporter well-versed in her beat’s data sources, and have the reporter develop basic data skills. From that, stories will flow.

Inventory data: Speaking of data sources, have each reporter you work with find out the sets of data local governments keep. File FOIA requests for table layouts and database schemas. Get the data, then study it. That will spur story ideas.

Crawl first, run later: All the hot talk in data journalism these days is on Web frameworks and visualizations, but there’s plenty of work for the beginner in the land of Excel and Access. Build those skills as a starting point.

Your thoughts? Add a comment below …

Anthony

About me

I'm a journalist who works with words, code and data. I'm also a husband, father, musician, gardener and occasional poet. I love finding and telling great stories. I'm inspired by art, music and design that elevate. I pursue the truth. Data journalism's the focus here, but other topics will crop up. Thanks for reading.
LINKS & TRIVIA

microblog

places

LinkedIn
Twitter
Delicious
Tumblr
Spelling Bee blog
- Tales from our trips to the Scripps National Spelling Bee
360 Sports Jam
- My teen journalist's sports blog

data journalists

Brian Boyer
Aaron Bycoffe
Jack Gillum
Gregory Korte
Aron Pilhofer
Mark Schaver
Matt Waite
Ben Welsh
Derek Willis
Matt Wynn

from the home office

RSS TOP USA TODAY STORIES