<?xml version="1.0" encoding="UTF-8"?> <rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" ><channel><title>Anthony DeBarros &#187; Analysis</title> <atom:link href="http://www.anthonydebarros.com/category/analysis/feed/" rel="self" type="application/rss+xml" /><link>http://www.anthonydebarros.com</link> <description>DATA. JOURNALISM. LIFE.</description> <lastBuildDate>Tue, 17 Jan 2012 14:16:00 +0000</lastBuildDate> <language>en</language> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <item><title>A Facelift for a Book List</title><link>http://www.anthonydebarros.com/2011/07/01/facelift-for-book-list/</link> <comments>http://www.anthonydebarros.com/2011/07/01/facelift-for-book-list/#comments</comments> <pubDate>Fri, 01 Jul 2011 15:35:00 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category> <category><![CDATA[Journalism]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=1415</guid> <description><![CDATA[The USA TODAY Best-Selling Books list has a new look and added interactivity, part of a relaunch of books coverage. It&#8217;s been a fun project that has been on my front burner for about three months. I get to work with all kinds of data at USA TODAY, but the book list has been a constant. When I [...]]]></description> <content:encoded><![CDATA[<p>The <a href="http://books.usatoday.com/list/index" target="_blank">USA TODAY Best-Selling Books list</a> has a new look and added interactivity, part of a relaunch of books coverage. It&#8217;s been a fun project that has been on my front burner for about three months.</p><p>I get to work with all kinds of data at <em>USA TODAY,</em> but the book list has been a constant. When I arrived at <em>USAT</em> in 1997, one of the first projects I took on was to build and analyze an archive of the list to mark its fifth anniversary. Since then, as that archive grew to hold nearly 18 years of data, we&#8217;ve used it to anchor stories about authors and trends in publishing. We&#8217;re awfully proud of the list, and people in the publishing industry tell us it&#8217;s one of the most accurate accounts of Americans&#8217; weekly reading habits.</p><p>Last year, we opened the archives up to developers via a <a href="http://developer.usatoday.com/docs/read/bestselling_books" target="_blank">Best-Selling Books API</a>. This year, giving the list itself a facelift was the next logical step.</p><p>We were fortunate to assemble a crack team of designers, developers and product managers who, in a short time, conceptualized, designed, redesigned, and coded an entirely new collection of book-related pages for our site. What&#8217;s new:<br /> <span id="more-1415"></span><br /> &#8211; The <a href="http://books.usatoday.com/list/index" target="_blank">list itself</a> has an all-new design, including book covers.<br /> &#8211; You can filter the list by genre or type. Handy if you enjoy a certain kind of book.<br /> &#8211; Dig into the archives. Search by title, author or in the book&#8217;s brief description. You also can see entire lists from earlier weeks. Here, for example, is the <a href="http://books.usatoday.com/list/index?date=1993-10-28" target="_blank">first book list <em>USA TODAY</em> published</a>, on Oct. 28, 1993.<br /> &#8211; Clicking a book title takes you to its own page, which includes stats, reviews from <a href="http://www.goodreads.com/" target="_blank">Goodreads</a> and latest Tweets about the title (more reader reviews to come). Here&#8217;s the page for <em><a href="http://books.usatoday.com/book/harper-lee-to-kill-a-mockingbird/l19769" target="_blank">To Kill a Mockingbird</a></em>, which has been on the list for 755 weeks so far.<br /> &#8211; Links to buy the title from some of readers&#8217; favorite sources. And since our developers optimized the site for tablets and phones, you can find, buy and read a book quickly.</p><p>All this is part of an overall redesign of <a href="http://books.usatoday.com/index" target="_blank">books coverage</a>, which in turn is part of a larger site redesign.</p><p>My role in all this was to help the team understand the intricacies and particulars of the list and suggest the most suitable interactions it could offer. Coding-wise, I wrote some SQL to support the fetching and searching of the lists and the individual title&#8217;s archive data.</p><p>What a great time, a nice next chapter after finishing our work on the <a title="Lessons From a Census Factory" href="http://www.anthonydebarros.com/2011/04/02/lessons-from-a-census-factory/" target="_blank">Census 2010 P.L. 94 release</a>. More to come &#8230;</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2011/07/01/facelift-for-book-list/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Which web browsers do journalists favor?</title><link>http://www.anthonydebarros.com/2011/03/15/which-web-browsers-do-journalists-favor/</link> <comments>http://www.anthonydebarros.com/2011/03/15/which-web-browsers-do-journalists-favor/#comments</comments> <pubDate>Wed, 16 Mar 2011 02:51:04 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=1272</guid> <description><![CDATA[After I started playing with Internet Explorer 9 tonight &#8212; and knowing that most developers, including Microsoft, want to wean the world from IE6 as soon as possible &#8212; I grew curious about the browsers favored by my site&#8217;s visitors. A quick dig into Google Analytics gave me the data for the last few months, [...]]]></description> <content:encoded><![CDATA[<p>After I started playing with Internet Explorer 9 tonight &#8212; and knowing that most developers, including Microsoft, want to <a href="http://ie6countdown.com/" target="_blank">wean the world from IE6</a> as soon as possible &#8212; I grew curious about the browsers favored by my site&#8217;s visitors. A quick dig into Google Analytics gave me the data for the last few months, and the Google Charts API let me build a quick pie:</p><p><img src="http://chart.apis.google.com/chart?chxs=0,676767,12.5&amp;chxt=x&amp;chs=480x325&amp;cht=p&amp;chco=000000&amp;chd=s:VPOHE&amp;chl=35%25+-+Firefox|24%25+-+Chrome|23%25+-+Safari|12%25+-+IE|6%25+-+Other&amp;chtt=Site+visits+by+browser%2C+November+2010-March+2011" alt="Site visits by browser, November 2010-March 2011" width="500" height="325" /></p><p>I can&#8217;t know for sure, but I suspect that most people who read my site are journalists or developers. Most traffic comes from links I post on Twitter or via search keywords that tend toward journalism, data, math and, lately, the Census.</p><p>Generally, you&#8217;re not an IE-centric crowd &#8212; just 12%. That&#8217;s lower than overall metrics, which tend to place Internet Explorer at anywhere from <a href="http://www.w3counter.com/globalstats.php?year=2011&amp;month=2" target="_blank">40%</a> or <a href="http://gs.statcounter.com/#browser-ww-monthly-201102-201102-bar" target="_blank">more</a> of the overall market.</p><p>Oh, and the percent using IE6? Less than 0.4%.</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2011/03/15/which-web-browsers-do-journalists-favor/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>The 2010 Best-Selling Books</title><link>http://www.anthonydebarros.com/2011/01/13/the-2010-best-selling-books/</link> <comments>http://www.anthonydebarros.com/2011/01/13/the-2010-best-selling-books/#comments</comments> <pubDate>Fri, 14 Jan 2011 03:55:04 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=1136</guid> <description><![CDATA[Update Jan. 17, 2012: The top selling books of 2011 are listed here, and in that table you can view lists back to 2007. The post below refers to the 2010 top-selling titles. Original post: Stieg Larsson&#8217;s Millennium Trilogy grabbed the top three spots in USA TODAY&#8217;s annual list of top-selling books, reflecting a broader move by [...]]]></description> <content:encoded><![CDATA[<p><strong>Update Jan. 17, 2012:</strong> The top selling books of 2011 are listed <a href="http://www.usatoday.com/life/books/news/story/2012-01-11/100-best-selling-books-of-2011/52504752/1">here</a>, and in that table you can view lists back to 2007. The post below refers to the 2010 top-selling titles.</p><p><strong>Original post:</strong></p><p>Stieg Larsson&#8217;s <em>Millennium Trilogy</em> grabbed the top three spots in USA TODAY&#8217;s annual list of top-selling books, reflecting a broader move by readers toward fiction this year. The <a href="http://www.usatoday.com/life/books/news/2011-01-12-top-books-2010_N.htm" target="_blank">list of 2010&#8242;s best-sellers</a> was part of a <a href="http://www.usatoday.com/life/books/news/2011-01-12-booktrends13_N.htm" target="_blank">package</a> published today wrapping up the year&#8217;s trends as reflected in USA TODAY&#8217;s <a href="http://books.usatoday.com/list/index">Best-Selling Books</a> list.</p><p>Shepherding the books list is one of those tasks I spend considerable time on and, over the years, it has become one of my favorite opportunities for data journalism. The well never seems to run dry on ideas, and with 17 years of data in our archives, there are plenty of opportunities to see how annual moves on the list stack up against long-term trends. Along with Larsson&#8217;s success, our book team&#8217;s report on trends highlighted <a href="http://www.usatoday.com/life/books/news/2011-01-13-topbooks201013_ST_N.htm" target="_blank">titles and authors reaching No. 1</a>, from Nicholas Sparks to George W. Bush.</p><p>Much of our book list data is open for developers. Check out the <a href="http://developer.usatoday.com/docs/read/bestselling_books" target="_blank">API for details</a>.</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2011/01/13/the-2010-best-selling-books/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Story hunting in birth, death data</title><link>http://www.anthonydebarros.com/2010/09/07/data-analysis-births-deaths/</link> <comments>http://www.anthonydebarros.com/2010/09/07/data-analysis-births-deaths/#comments</comments> <pubDate>Tue, 07 Sep 2010 14:30:39 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=924</guid> <description><![CDATA[The U.S. government&#8217;s annual count of births and deaths is among the most basic of demographics, but tracking it is one of my little obsessions. I keep annual totals in a spreadsheet and get all gooey inside when I can add another year to the pile. That happened last month, when the National Center for [...]]]></description> <content:encoded><![CDATA[<p><strong>The U.S. government&#8217;s annual</strong> count of births and deaths is among the  most basic of demographics, but tracking it is one of my little  obsessions. I keep annual totals in a spreadsheet and get all gooey  inside when I can add another year to the pile.</p><p>That happened last month, when the National Center for Health Statistics released <a href="http://www.cdc.gov/nchs/data/nvsr/nvsr58/nvsr58_25.htm" target="_blank">data</a> showing the number of births in the U.S. <a href="http://www.usatoday.com/news/health/2010-08-27-birth-decline_N.htm" target="_blank">has dropped for two years in a row</a>. One possible reason, the experts said, was the recession.</p><p>When it&#8217;s newsworthy, a yearly update to a longitudinal data set certainly is worth covering. But sometimes these basic demographics &#8212; including Census data &#8212; reveal even more when we take a long-term view.</p><p>For example, below are the annual number of births and deaths from 1933 to 2009 plotted in <a href="http://manyeyes.alphaworks.ibm.com/manyeyes/" target="_blank">Many Eyes</a>. Click the graphic to interact:</p><p><script src="http://manyeyes.alphaworks.ibm.com/manyeyes/visualizations/351ec052b1fc11dfa2b1000255111976/comments/3523b8beb1fc11dfa2b1000255111976.js?width=475&amp;height=350" type="text/javascript"></script></p><p>It&#8217;s simple &#8212; just two fever lines. But it&#8217;s chock full of  generational milestones that bear watching:</p><ul><li>The first baby boomers &#8212; those born in 1946 &#8212; turn 65 starting in January.</li><li>The Gen Xers that follow have closed in on middle age. They range from the early 30s to mid-40s (in fact, Gen X poster boy Kurt Cobain would have turned 43 this year).</li><li>Meanwhile, the first of the Millennials &#8212; the &#8220;echo boomers&#8221; whose numbers peaked in 1990 &#8212; are nearing 30.</li></ul><p>Each generation brings a new sensibility to the stages of life, and the relative size and makeup of each one &#8212; not to mention its cultural context &#8212; gives journalists plenty of opportunity for storytelling. Two examples:</p><ul><li>Much has been written about the big bump of post-World War II babies marching  closer to retirement (maybe), Social Security, and the years where  health care becomes a major concern. But what about the inevitable? Notice that the number of deaths in the U.S. has plateaued at about 2.4 million a year. That won&#8217;t last long with Boomers heading into the years where <a href="http://www.cdc.gov/nchs/data/nvsr/nvsr58/nvsr58_21.pdf" target="_blank">death rates rise dramatically</a>. How will 4 million deaths annually affect the funeral home business, the ability to buy a cemetary plot, and the overall industry around end-of-life care?</li><li>Along with Gen X came the &#8220;baby bust,&#8221; the years of rapidly declining birth rates that led to all  kinds of prognostications about the<a href="http://www.time.com/time/magazine/article/0,9171,963617,00.html" target="_blank"> shrinking of America</a>. That means our workforce now has a relative shortage of thirtysomethings. Does that mean more opportunity for Millennials to advance in the business world and less pressure for boomers to retire?</li></ul><p>These sorts of trends are slow-burning, but they reflect data trends that exert hidden but massive force on our culture, much like the tides. The savvy data journalist keeps an eye on them not just for what they say this year but what they reveal over time.</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2010/09/07/data-analysis-births-deaths/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Sorting Data in Excel: Simple Analysis</title><link>http://www.anthonydebarros.com/2010/05/12/sorting-data-in-excel-simple-analysis/</link> <comments>http://www.anthonydebarros.com/2010/05/12/sorting-data-in-excel-simple-analysis/#comments</comments> <pubDate>Wed, 12 May 2010 04:00:45 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category> <category><![CDATA[Excel]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=552</guid> <description><![CDATA[Sorting a data set helps answer a basic question journalists like to ask: &#8220;Which ____ has the highest (or lowest) ______?&#8221; Excel (and other spreadsheets such as the open source Calc) make sorting data easy. In fact, I often make sorting my first step when &#8220;interviewing&#8221; data because it quickly reveals high and low values [...]]]></description> <content:encoded><![CDATA[<p>Sorting a data set helps answer a basic question journalists like to ask: &#8220;Which ____ has the highest (or lowest) ______?&#8221;</p><p>Excel (and other spreadsheets such as the open source <a href="http://www.openoffice.org/" target="_blank">Calc</a>) make sorting data easy. In fact, I often make sorting my first step when &#8220;interviewing&#8221; data because it quickly reveals high and low values and often highlights some that may seem questionable.</p><p>Let&#8217;s work through a simple sort in Excel. I&#8217;ll be using Excel 2007, but older versions have similar functions. Start by <a href="http://www.anthonydebarros.com/wp-content/themes/portfolio_ad/docs/sorting.xls" target="_blank">downloading the file &#8220;sorting.xls&#8221;</a> and saving it to your computer. Open it and follow along:</p><p>1. We have a table of Census data from the 2006-2008 American Community Survey. It shows the median age of the population for each of 79 school districts in Virginia plus the state itself.</p><p><img class="alignnone size-full wp-image-567" style="border: 0pt none;" title="sorting1" src="http://www.anthonydebarros.com/wp-content/uploads/2010/05/sorting11.jpg" alt="" width="450" height="318" /></p><p>We want to know which district has the oldest and youngest populations. Let&#8217;s sort it!</p><p>2. Click once on one cell anywhere in the table. This will help Excel auto-discover your table in the next step.</p><p><img class="alignnone size-full wp-image-571" title="sorting2" src="http://www.anthonydebarros.com/wp-content/uploads/2010/05/sorting21.jpg" alt="" width="450" height="315" /></p><p><span id="more-552"></span><br /> 3. On the Excel ribbon, select the &#8220;Data&#8221; tab and click &#8220;Sort.&#8221;</p><p><img class="alignnone size-full wp-image-573" title="sorting3" src="http://www.anthonydebarros.com/wp-content/uploads/2010/05/sorting3.jpg" alt="" width="400" height="284" /></p><p>4. Two things happened. One, your entire table was selected (or highlighted). Two, a dialog box popped up to offer sorting options. Check off &#8220;My data has headers.&#8221; That will prevent your header row from getting sorted with the data, and it will add the three column names under the &#8220;Sort by&#8221; drop down.</p><p>5. Under &#8220;Sort by,&#8221; select &#8220;Median.&#8221; Under &#8220;Order,&#8221; select &#8220;Largest to Smallest.&#8221;</p><p><img class="alignnone size-full wp-image-578" title="sorting4" src="http://www.anthonydebarros.com/wp-content/uploads/2010/05/sorting4.jpg" alt="" width="450" height="311" /></p><p>6. Click &#8220;OK.&#8221; Excel sorts your table, ranking the districts by median age &#8212; from highest to lowest. Your first few rows should look like this:</p><p><img class="alignnone size-full wp-image-581" title="sorting5" src="http://www.anthonydebarros.com/wp-content/uploads/2010/05/sorting5.jpg" alt="" width="450" height="300" /></p><p>Now, we can do a quick scan and look for patterns. For example, several of the &#8220;oldest&#8221; counties are in southern Virginia, far away from the Northern Virginia economic engine. Meanwhile, the district with the lowest age is Harrisonburg City Public Schools &#8212; with a median age of a barely-legal 22.8. Could the fact that the city hosts <a href="http://www.emu.edu/" target="_blank">two</a> <a href="http://www.jmu.edu/" target="_blank">universities</a> have something to do with that?</p><p>Good fodder for reporting, all made possible by a simple Excel sort.</p><p>A couple of tips and cautions:</p><p>&#8211; A good general practice is to work on a copy of your original data. Because things happen.</p><p>&#8211; Excel does best at sorting when your table has a header row and is not contiguous to any unrelated data, such as footnotes. Insert blank rows and columns between the data you want to sort and any information you want to keep separate.</p><p>&#8211; I recommend selecting only one cell in your table before selecting the &#8220;Sort&#8221; button. If you grab more than one, Excel may attempt to sort only those cells rather than the whole table. The 2007 version asks if you want to expand the selection, but older versions sometimes do not. This creates the possibility that only some of your data would get sorted, which is a nightmare. Always make sure your entire table gets selected!</p><p>&#8211; You can sort by more than one field. In Excel 2007, click &#8220;Add level&#8221; in the sort dialog.</p><p>Questions? Tips of your own? Add them below &#8230;</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2010/05/12/sorting-data-in-excel-simple-analysis/feed/</wfw:commentRss> <slash:comments>3</slash:comments> </item> <item><title>&#8216;Trouble on the Tray&#8217; Wins EWA Award</title><link>http://www.anthonydebarros.com/2010/03/11/trouble-on-the-tray-wins-ewa-award/</link> <comments>http://www.anthonydebarros.com/2010/03/11/trouble-on-the-tray-wins-ewa-award/#comments</comments> <pubDate>Thu, 11 Mar 2010 17:32:10 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=432</guid> <description><![CDATA[Good news for our USA TODAY team that researched, reported and wrote the &#8220;Trouble on the Tray&#8221; series on school lunch safety: The Education Writers Association yesterday named it a winner in the 2009 National Awards for Education Reporting. The series &#8212; reported by Blake Morrison, Peter Eisler and Elizabeth Weise with data analysis by [...]]]></description> <content:encoded><![CDATA[<p><strong>Good news</strong> for our USA TODAY team that researched, reported and wrote the <a href="http://content.usatoday.com/topics/topic/School+Lunch+Safety" target="_blank">&#8220;Trouble on the Tray&#8221;</a> series on school lunch safety: The <a href="http://www.ewa.org/site/PageServer" target="_blank">Education Writers Association</a> yesterday named it a winner in the <a href="http://www.ewa.org/site/PageServer?pagename=contest_winners" target="_blank">2009 National Awards for Education Reporting</a>. The series &#8212; reported by Blake Morrison, Peter Eisler and Elizabeth Weise with data analysis by yours truly &#8212; received first prize in the &#8220;Large Media &#8212; Investigative Reporting&#8221; category.</p><p>I&#8217;m giving a talk this week at the <a href="http://data.nicar.org/conference/schedule/7" target="_blank">IRE Computer Assisted Reporting conference</a> on how we acquired and analyzed the federal data that helped fuel the story.</p><p>Major stories in the series include:</p><p>&#8211; <a href="http://www.usatoday.com/news/education/2009-11-16-del-rey_N.htm" target="_blank">Schools in the dark about tainted lunches</a><br /> &#8211; <a href="http://www.usatoday.com/news/education/2009-12-01-beef-recall-lunches_N.htm" target="_blank">Why a recall of tainted beef didn&#8217;t include school lunches</a><br /> &#8211; <a href="http://www.usatoday.com/news/education/2009-12-08-school-lunch-standards_N.htm" target="_blank">Fast-food standards for meat top those for school lunches</a><br /> &#8211; <a href="http://www.usatoday.com/news/education/2009-12-15-school-lunches-health-inspections_N.htm" target="_blank">26,500 school cafeterias lack required inspections<br /> </a></p><p>Our series spurred congressional calls for reforms to USDA policies, and in February the agency announced <a href="http://www.usatoday.com/news/health/2010-02-04-school-lunch_N.htm" target="_blank">tighter requirements on companies</a> that supply food to the National School Lunch Program, including stricter testing of meat.</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2010/03/11/trouble-on-the-tray-wins-ewa-award/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Spreading data journalism in the newsroom</title><link>http://www.anthonydebarros.com/2010/02/06/spreading-data-journalism/</link> <comments>http://www.anthonydebarros.com/2010/02/06/spreading-data-journalism/#comments</comments> <pubDate>Sat, 06 Feb 2010 15:09:06 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category> <category><![CDATA[Journalism]]></category> <category><![CDATA[Workflow]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=400</guid> <description><![CDATA[A reporter called recently for tips on setting up &#8220;a CAR desk&#8221; in the newsroom of a decent-sized community newspaper. The editor had watched the reporter&#8217;s success at gathering and analyzing data and, as typically happens,  now wanted the reporter to train the rest of the newsroom. Here was my advice: Focus on a few: [...]]]></description> <content:encoded><![CDATA[<p><strong>A reporter called </strong>recently for tips on setting up &#8220;<a href="http://en.wikipedia.org/wiki/Database_journalism" target="_blank">a CAR desk</a>&#8221; in the newsroom of a decent-sized community newspaper. The editor had watched the reporter&#8217;s success at gathering and analyzing data and, as typically happens,  now wanted the reporter to train the rest of the newsroom.</p><p>Here was my advice:</p><p><strong>Focus on a few:</strong> Instead of holding building-wide Excel classes or database journalism seminars, start with just one or two reporters who show a combination of interest and decent technical smarts. That lets you go deep on a couple of beats rather than spread yourself thin. Also, success breeds success. Watching a few reporters land great stories will possibly spur interest from others.</p><p><strong>Have the right goals: </strong>Goals like &#8220;publish one CAR story a week&#8221; miss the point. Better objectives are to have data-thinking ever present in the reporter&#8217;s mind, have the reporter well-versed in her beat&#8217;s data sources, and have the reporter develop basic data skills. From that, stories will flow.</p><p><strong>Inventory data:</strong> Speaking of data sources, have each reporter you work with find out the sets of data local governments keep. File FOIA requests for table layouts and database schemas. Get the data, then study it. That will spur story ideas.</p><p><strong>Crawl first, run later:</strong> All the hot talk in data journalism these days is on Web frameworks and visualizations, but there&#8217;s plenty of work for the beginner in the land of Excel and Access. Build those skills as a starting point.</p><p>Your thoughts? Add a comment below &#8230;</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2010/02/06/spreading-data-journalism/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>School Lunch series gets CW mention</title><link>http://www.anthonydebarros.com/2010/01/18/california-watch-school-lunch-series/</link> <comments>http://www.anthonydebarros.com/2010/01/18/california-watch-school-lunch-series/#comments</comments> <pubDate>Mon, 18 Jan 2010 18:31:35 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=376</guid> <description><![CDATA[USA TODAY&#8217;s series on school lunch quality, &#8220;Trouble on the Tray,&#8221; is included in a roundup of noteworthy investigative projects from 2009 by the non-profit investigative reporting team at California Watch. The stories were reported and written by Blake Morrison, Peter Eisler and Elizabeth Weise with data analysis by yours truly. Also mentioned: The Washington [...]]]></description> <content:encoded><![CDATA[<p>USA TODAY&#8217;s series on school lunch quality, <a href="http://content.usatoday.com/topics/topic/School+Lunch+Safety" target="_blank">&#8220;Trouble on the Tray,&#8221;</a> is included in a roundup of <a href="http://californiawatch.org/watchblog/top-editors-still-buzzing-about-2009-investigative-stories" target="_blank">noteworthy investigative projects from 2009</a> by the non-profit investigative reporting team at <a href="http://californiawatch.org/" target="_blank">California Watch</a>. The stories were reported and written by Blake Morrison, Peter Eisler and Elizabeth Weise with data analysis by yours truly.</p><p>Also mentioned: The Washington Post&#8217;s investigation of the <a href="http://www.washingtonpost.com/wp-srv/special/metro/red-line-crash/" target="_blank">Metro Red Line crash</a> and The New York Times&#8217; series on <a href="http://www.nytimes.com/2009/08/23/us/23water.html?_r=1" target="_blank">toxic water</a>.</p><p>From the item, by <a href="http://twitter.com/KatchesCW" target="_blank">Mark Katches</a>:</p><blockquote><p>Just to be clear, this is by no means a comprehensive list. It represents only a small, informal survey about stories that some highly respected investigative journalists are buzzing about.</p></blockquote><p>Indeed, every year I am amazed at the quality and depth of investigative reporting that American newsrooms continue to produce even as the industry fights hard times. It&#8217;s an honor to be mentioned in that company.</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2010/01/18/california-watch-school-lunch-series/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> <item><title>Mean vs. Median: A Beginner&#8217;s Guide</title><link>http://www.anthonydebarros.com/2009/12/27/mean-vs-median-excel/</link> <comments>http://www.anthonydebarros.com/2009/12/27/mean-vs-median-excel/#comments</comments> <pubDate>Mon, 28 Dec 2009 03:53:26 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category> <category><![CDATA[Excel]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=291</guid> <description><![CDATA[A common way to summarize a group of numbers &#8212; one most of us learned in grade school &#8212; is to find its mean, commonly called the average. But it&#8217;s not always the best measure. Let&#8217;s say six kids go on a field trip, ages 10, 11, 10, 9, 13 and 12. It&#8217;s easy to [...]]]></description> <content:encoded><![CDATA[<p><strong>A common way to summarize </strong>a group of numbers &#8212; one most of us learned in grade school &#8212; is to find its mean, commonly called the average. But it&#8217;s not always the best measure.</p><p>Let&#8217;s say six kids go on a field trip, ages 10, 11, 10, 9, 13 and 12. It&#8217;s easy to add the ages and divide by six to get the group&#8217;s average age:<br /> &nbsp;</p><div class="wp_syntax"><div class="code"><pre class="dos" style="font-family:monospace;"><span style="color: #33cc33;">(</span>10 + 11 + 10 + 9 + 13 + 12<span style="color: #33cc33;">)</span> / 6 = 10.8</pre></div></div><p>Because all the ages are close, the average of 10.8 gives us a good picture of the group as a whole. But averages are less helpful when the values are skewed toward one end or if they include outliers.</p><p>For example, what if we add a much older chaperone to our field trip? With ages of 10, 11, 10, 9, 13, 12 and 46, the average age of the group rises considerably:<br /> &nbsp;</p><div class="wp_syntax"><div class="code"><pre class="dos" style="font-family:monospace;"><span style="color: #33cc33;">(</span>10 + 11 + 10 + 9 + 13 + 12 + 46<span style="color: #33cc33;">)</span> / 7 = 15.9</pre></div></div><p>Now the mean is not an accurate representation. The outlier skews the average, and no journalist should feel comfortable reporting it.</p><p>This is where calculating a median is handy. The median is the midpoint in an ordered list of values &#8212; the point at which half the values are higher and half lower. If the median household income in East Middletownburg is $50,000, then half the households earn more and half less.</p><p><span id="more-291"></span>Using our field trip, we order the ages from lowest to highest:<br /> &nbsp;</p><div class="wp_syntax"><div class="code"><pre class="dos" style="font-family:monospace;">9, 10, 10, 11, 12, 13, 46</pre></div></div><p>The middle value is 11, and that&#8217;s the median. Half the values are higher, and half lower. If there had been an even number of values, we&#8217;d average the two middle values to find the median. For larger sets of numbers, you can use the MEDIAN function in Microsoft Excel.</p><p>Given this group, the median of 11 is a much better representation of the typical age than the average of 15.9. That&#8217;s what makes median such a useful statistical measure. Scan financial news, and you&#8217;ll see medians reported frequently. Reports on housing prices often use medians because a few sales of McMansions in a zip code that&#8217;s otherwise modest can make averages useless. Same for sports player salaries where one or two superstars can skew results.</p><p>A good test: calculate the average and the median for a group of values. If they&#8217;re close, then the group is probably normally distributed (the familiar bell curve), and the average is useful. If they&#8217;re far apart, then the values are not normally distributed and the median is the better representation.</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2009/12/27/mean-vs-median-excel/feed/</wfw:commentRss> <slash:comments>2</slash:comments> </item> <item><title>26,500 school cafeterias uninspected</title><link>http://www.anthonydebarros.com/2009/12/16/school-lunch-cafeteria-inspections/</link> <comments>http://www.anthonydebarros.com/2009/12/16/school-lunch-cafeteria-inspections/#comments</comments> <pubDate>Wed, 16 Dec 2009 14:55:31 +0000</pubDate> <dc:creator>Anthony</dc:creator> <category><![CDATA[Analysis]]></category><guid isPermaLink="false">http://www.anthonydebarros.com/?p=276</guid> <description><![CDATA[Thousands of school cafeterias went uninspected in the 2007-08 school year, we report today in the fourth major installment of our &#8220;Trouble on the Tray&#8221; investigation into school lunch safety. In today&#8217;s story, reporters Blake Morrison and Peter Eisler worked with me to examine data on the number of schools in each state that met [...]]]></description> <content:encoded><![CDATA[<p><strong>Thousands of school cafeterias </strong>went uninspected in the 2007-08 school year, <a href="http://www.usatoday.com/news/education/2009-12-15-school-lunches-health-inspections_N.htm" target="_blank">we report today</a> in the fourth major installment of our <a href="http://content.usatoday.com/topics/topic/School+Lunch+Safety" target="_blank">&#8220;Trouble on the Tray&#8221;</a> investigation into school lunch safety.</p><p>In today&#8217;s story, reporters Blake Morrison and Peter Eisler worked with me to examine <a href="http://www.usatoday.com/news/education/2009-12-15-school-lunches-health-inspections_N.htm#table" target="_blank">data</a> on the number of schools in each state that met a federal requirement to have two cafeteria inspections annually. We found that in eight states, more than half of schools reporting failed to meet that standard in 2006-07 and 2007-08 school years.</p><p>Meanwhile, the series continues to draw attention on Capitol Hill. This week, Sen. Kirsten Gillibrand, D-N.Y., <a href="http://www.usatoday.com/news/education/2009-12-14-food_N.htm?loc=interstitialskip" target="_blank">called on the federal government</a> to increase its standards for meat used in school lunches and to cut contracts with companies that repeatedly did not meet standards.</p> ]]></content:encoded> <wfw:commentRss>http://www.anthonydebarros.com/2009/12/16/school-lunch-cafeteria-inspections/feed/</wfw:commentRss> <slash:comments>0</slash:comments> </item> </channel> </rss>
<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced
Database Caching 1/45 queries in 0.024 seconds using disk: basic
Object Caching 521/612 objects using disk: basic

Served from: www.anthonydebarros.com @ 2012-02-05 05:28:05 -->
