We cant Cure Cancer, But we can Cure this Medicare Chart [Chart Busters]

Share

Facebook
Twitter
LinkedIn

This post is from GuestBuster Jeff Weir in our Chart Busters series.

Note: The post slightly longer, but worth every word. Just get a cup of coffee and soak in to this visualization goodness. (Also, click on any image to see its full version)

Over at the FlowingData blog, they’ve been talking about this pretty slick looking Choropleth Map that shows how Medicare returns vary across the United States:

Medicare Chart Critique Figure_1

The above shows total Medicare reimbursements in 2006, either by Hospital Referring Region or by State, depending on the radio button. Using the dropdown box, you can change it to this:

Medicare Chart Critique Figure_2

…which is how the data looks if you overlay it on a Giraffe. Oops, I forgot to rotate it before saying that. Bear with me a moment…

Medicare Chart Critique Figure_3

There. See the Giraffe now? Good.

A picture is worth a thousand words, or so they say. But is a Choropleth worth the many line charts and clowns that you could squeeze into the same valuable screen real estate? Let’s find out, by evaluating what this particular chart does well, and what it does poorly, and whether other charting methods might better convey its information.

Words and music.

Right off the bat, there’s a simple way that the authors could improve this chart. While they include a description below the chart to point out what the data is, and where it came from, they miss something just as important…what they concluded from all of this. So before we consider adding – say – bullet graphs, let’s consider adding some bullet points. A few sentences can tell readers important stuff that would otherwise remain hidden in an undownloaded PDF report. Insights like:

  • Care is often better in low-cost areas.
  • Growth in returns are only partly explained by advancing technology, and
  • Differences in growth rates across regions seem largely due to discretionary decisions by physicians that are influenced by the local availability of hospital beds, imaging centres and other resources-and a payment system that rewards growth and higher utilization.

Straight off the bat, this would make the graph a better graph…without even messing with its form.

But mess we must…

…because lurking below the chlorophyll green of this Choropleth Map are a few serious charting oversights. Ready? Let’s check ’em out.

Scale? Fail!

First, check out the legend.

Medicare Chart Critique Figure_4

Crikey…its bands are as discrete as Bruno. Its scale is about as even as my temperament. It varies about as much as =RANDBETWEEN(PaydayBankBalance, UsualOverdraft).

If you fire up Excel and look at the spread covered by each range, you see just how arbitrary the different price bands are:

Medicare Chart Critique Figure_5

Whoa…the spread of that $9k to $16k band is nearly 15 times larger than two of the other bands. That can’t be good, can it?

Nice profile

If you were to graph financial spread of each group against the aggregated number of Hospital Referral Regions that fall within each spread, you get something like a histogram. The difference between the sizes of these bands is about as different as the number of performers on stage at a Bob Dylan concert in 1964 compared with 1974. See for yourself:

Medicare Chart Critique Figure_6Medicare Chart Critique Figure_7

1964 1974

Oops, wrong graphic. Try this:

Medicare Chart Critique Figure_8

Normally histograms have equal widths for each band, but here I want to highlight just how unequal the bands used are. Plus, this lets us regroup the data into evenly spread $1k bands, and overlay it on the first distribution, to see how it compares. Here’s one that I prepared earlier, with the red line as the regrouped data…

Medicare Chart Critique Figure_9

Vastly different picture isn’t it. The red is kinda like Data Pig’s heart rate before he eats chocolate covered bacon on Saturdays, and the blue is how his ECG would look when he’s in the ambulance, on the way to the hospital.

This makes it very hard to answer that important question “…compared to what?” With such different sized bands, how can we compare one to another? How can we be sure that the distributions within each band will even allow us to?

For instance, take the highest band spread of $9k to $16k: without any further information to go on, we might assume that the median (i.e. middle) value for districts in this category is midway between the $9k to $16k boundaries, like this:

Medicare Chart Critique Figure_10

But that’s like assuming that Simon and Chartjunkle (oops, Garfunkel) have equal talent. We’d be wrong. Verywrong. In actual fact, there’s only three data points to the right of our guessed median line. And as for the 55 hospital regions in Group Five that fall to the left of it…well, they all get tarred with the same brush those worst three performers. The actual median for this group is a lot further left, as shown below:

Medicare Chart Critique Figure_11

This means that over half the data in this 5th band actually falls much closer to the far left of the graph than to the far right of the same group it’s been placed in.

You can see this better if you add a one-dimensional strip plot above the graph, which gives an idea of where the 300 odd values fall within the entire range:

Medicare Chart Critique Figure_12

Whoa…looks like we’ve got a few outliers to contend with.

What a State we’re in…

This seemingly arbitrary ‘bucketing’ effect is exacerbated when aggregating the different hospital regions into State-wide totals. Except this time regions are being penalised by arbitrary geographical boundaries, as well as the arbitrary financial ones above.

Take Texas for example. Aggregating everything up to the State level, Texas appears in that highest band. Yet at the Hospital Referral Region level, one third of its 22 different hospital fall below the national average, and the median for the whole State is around $8,800. So we better be careful making assumptions from a State-wide view, because the Choropleth averages some very diverse costs over some very large chunks of real estate.

To see just how diverse, let’s rank the entire US values from smallest to largest, and highlight where the Texas readings fall within that range:

Medicare Chart Critique Figure_13

What can we tell from this? Firstly, nearly all regions nationwide fall between $5k and $10k. Secondly, there are a few outliers that really skew the picture at the high end. Thirdly, in the Texas case, the State average is boosted somewhat by 3 Texan districts that happen to be among the worst 10 culprits nationwide – one of which is clearly an outlier at $15k. Unfortunately for the lower cost Texan regions, they’re guilty by geographical association…kinda like being kidnapped and held for a zillion dollar ransom, just because you happen to live in the same State as Bill Gates.

So what do we get by aggregating to State boundaries? Probably more blurring than insight. After all, what good would a weather report be to Texans if it only reported the average weather they could expect as a State! Instead, it’s better to keep the aggregation at the Hospital Referral Region level. That way, we can look at this:

Medicare Chart Critique Figure_14

…and ask things like “Wow, why such a difference between Waco and the surrounding bits of Texas?” and “What the hell is Alaska doing there?”

Legends in the making…

What’s far worse that this though is that when looking at the State-wide map, the legend is now really, really wrong.

Here’s the legend next to the actual State-wide figures, for comparison:

Medicare Chart Critique Figure_15

Whoops…the graph title has changed to reflect we’re now looking at Medicare spending per beneficiary per State; i.e. State averages. The legend is still looking at Hospital Referral Region averages, which have a much greater spread. For instance, the Choropleth shows six States as being dark green regions, and the legend says they fall somewhere within $9k to $16k. But the actual data shows they fall in a $9.4k to $9.6k range. Oops! Slight misrepresentation, there.

How to fix it

Obviously this graph really should use a quantitative scale with equal increments; one that changes to reflect the selection that users make. What’s more, colors should have just enough variation so as to highlight any important differences, without being overwhelming or mistaken for camouflage.

But is a Choropleth Map the best way to present this data in the first place? If you want something for people to play with online, then maybe…but if you want to compare things very closely to other things, then maybe not.

For sure, a Choropleth Map looks cool, and it has what Tusha Metha calls “natural context”. But from an analytical perspective, a Choropleth only really reports how one thing changes with regards to geography. If geography is a major determinant – or if you want to show people how things look in their own back yard compared to others – then perhaps this is the piece of kit you need. But if there’s other factors that have much more sway on your data than geography, then perhaps not. For instance, we might want to see whether population density plays a significant part in Medicare returns, given the likely economies of scale from providing healthcare to densely populated regions vs. urban regions. Nows the time to break out a scatter plot:

Medicare Chart Critique Figure_16

Hmmm…looks promising. (Note: I’ve used State-wide data for the above…ran out of time to track down densities in the different Hospital Referral Regions, which is what I’d prefer to do.)

Or we might want to zoom in on the best or worst offenders, and see just how different they are to each other, and to the median value:

Medicare Chart Critique Figure_17

Conclusion

I think a better, fairer Choropleth Map at the Hospital Referral Region level would be interesting. But I don’t think it would be enough. To quote from Stephen Few’s latest book Now you see it: “Color is good at drawing your attention to something if used sparingly, but is one of the ‘pre-attentive attributes’ that is not quantitatively perceived in and of themselves”.

Whereas lines and 2D precision are very precise ways to encode quantitative values.

So when it comes to answering the ‘Compared to what’ question, I don’t think you can beat this:

Medicare Chart Critique Figure_18

 

Choropleth Maps in Excel

For information on the implementation of Choropleth Maps in Excel, check out Tushar Mehta’s excellent resources.

For more information on the pros and cons of Choropleth Maps, check out the Clearly and Simply blog, where Robert has built on Tushar’s excellent approach to produces some great downloadable templates. He also offers advice on potential drawbacks of Choropleth Maps, such as:

  • No visualization of development over time
  • No information on exact values (unless you are implementing tooltips including the data)
  • Very limited direct comparability of the regions
  • Possible perception problems with regards to the size of regions (e.g. Rhode Island on a US map)
  • Possible misinterpretation because the size of a region may have a greater impact on the user’s visual perception than the intensity of the fill color
  • Requirement of real estate on a dashboard

His recommendation: carefully consider whether or not a Choropleth Map is the best visualization for your purposes. Check out his dashboard of Lithuania at a glance to see how he mitigates some of the potential problems by incorporating other graphs into the display.

I used Robert’s template to produce this State-wide Choropleth Map of total Medicare spending per enrollee, 2006 using the same Medicare ranges as the Choropleth that’s the subject of this post:

Medicare Chart Critique Figure_19

…then I replotted the graph using data that had been regrouped $1k bands:

Medicare Chart Critique Figure_20

While I don’t advocate this approach, it’s interesting that even though this is aggregated to State-wide totals, you can see significant differences between the graphs.

Right, that’s it. I’m off to the Hospital to see someone about my writers cramp…

About the Author

Jeff is a Business Analyst from Wellington, New Zealand who has recently discovered a strong interest in Data Visualization. He swears by Edward Tufte and Stephen Few as much as he swears at Excel 2007. He’s so new to advanced Excel, that 2 years ago he had to ask a work friend what the dollar signs in $A$1 meant. Now that he knows that, he’s trying to find out what the dollar signs in $A$2 mean.

Note from PHD:

Thank you Jeff. Your passion and knowledge is truly outstanding. I have a whole pack of donuts waiting for you.

Facebook
Twitter
LinkedIn

Share this tip with your colleagues

Excel and Power BI tips - Chandoo.org Newsletter

Get FREE Excel + Power BI Tips

Simple, fun and useful emails, once per week.

Learn & be awesome.

Welcome to Chandoo.org

Thank you so much for visiting. My aim is to make you awesome in Excel & Power BI. I do this by sharing videos, tips, examples and downloads on this website. There are more than 1,000 pages with all things Excel, Power BI, Dashboards & VBA here. Go ahead and spend few minutes to be AWESOME.

Read my storyFREE Excel tips book

Overall I learned a lot and I thought you did a great job of explaining how to do things. This will definitely elevate my reporting in the future.
Rebekah S
Reporting Analyst
Excel formula list - 100+ examples and howto guide for you

From simple to complex, there is a formula for every occasion. Check out the list now.

Calendars, invoices, trackers and much more. All free, fun and fantastic.

Advanced Pivot Table tricks

Power Query, Data model, DAX, Filters, Slicers, Conditional formats and beautiful charts. It's all here.

Still on fence about Power BI? In this getting started guide, learn what is Power BI, how to get it and how to create your first report from scratch.

39 Responses to “11 very useful excel keyboard shortcuts you may not know”

  1. Judy Fearn says:

    You asked about a favorite keyboard shortcut: I often right click the navigation arrows at the bottom of an Excel workbook to get a list of the worksheets. I can click the one I want without having to scroll left or right.

  2. Sam Krysiak says:

    I regularly use the networkdays(x,y,z) function to show the number of working days between two given dates. To exclude public holidays I reference a list of dates ("z" in the above reference) which I periodically update to reflect upcoming non-working days. To keep the sheet looking tidy for other users, I like to hide this column when I'm done, and then unhide it when I update the sheet.

    With 40 separate workbooks to edit, these shortcuts make it a breeze...

    ? Hide selected column: CTRL+0 [zero]
    ? Unhide hidden column(s) within selection: CTRL+SHIFT+) [closed parenthesis]

    If an "Autofit Selection" keyboard shortcut (not just a key sequence) existed, I'd be as happy as a clam!

  3. [...] 11 very useful excel keyboard shortcuts you may not know [...]

  4. [...] an Excel Conditional Formatting Rock Star 11 very useful excel keyboard shortcuts 73 Free Designer Quality Excel Chart Templates Tracking mutual fund / Stock portfolios using Excel [...]

  5. 1xoid1 says:

    Hello Chandoo, thanks for sharing this information. With some of the shortcuts I seem to have difficulties as they do not seem to work on the German keyboard.

    Can you maybe verify that those combos are only working with the keyboard setup you are using? What would be a good source to lookup combinations for other layouts?

    Regards, 1xoid1

  6. Chandoo says:

    @1xoid1 ... Thanks for visiting PHD and taking timeout to ask your question. Unfortunately all my German can be summarized to one phrase: "guten tag".

    I wont be able to help you, but I can request other readers to respond. So if you know German or use German keyboard and can answer 1xoid1's question, then you get a free donut.

    Guten Tag 🙂

  7. Martin Williamson says:

    To Sam Krysiak.
    Shortcut to Autofit Selection (assuming you mean autofit columns). If you right click toolbar, click customise. From Commands tab/Categories select Built-in Menus.

    In Categories window scroll down and select Columns and drag drop it onto toolbar. Then click the new toolbar Columns button and drag drop Autofit button onto your toolbar (note Autofit for Columns will no longer appear in your menus, only on toolbar).

    Remove Columns button from toolbar (if you want to keep clutter down) drag and drop it off of your toolbar.

    Close Customise box.

    Now to Autofit columns just press "Alt" then "A".

  8. Martin Williamson says:

    Comment 8 correction - 2nd paragraph should read
    "In Commands window...

  9. Robert says:

    @1xoid1:

    Read the following text as follows: The key ,[;] is the one right to the M on the German keyboard. Here are the differences you have to know when using a German keyboard:

    2. Press strg .[:] for inserting the current date (and strg shift .[:] for inserting current time)

    3. Press strg ,[;] to copy values from cell above

    8. Press strg shift –[_] to apply an outline border

    10. Press strg-shift S to activate the font drop down (Schriftgroesse)

    11. Press strg-shift G to activate the font size (Groesse)

    Number 10 and 11 do not work with Excel 2007 anymore, but strg-shift-P shows the font tab of the cell format dialogue in Excel 2007.

    All other shortcuts should work on a German keyboard exactly as Chandoo described them.

    More information needed? Download a complete list with all shortcuts for Microsoft Excel in German (for free):

    http://www.freeware-download.com/downloaddetails/5655.html

    @Chandoo: please do not send a donut, unless you are able to attach one to an email. Otherwise the donut might be able to walk by itself, when it arrives here in Germany...

  10. Robert says:

    I forgot to mention:

    For all readers using an English keyboard: Chip Pearson offers a comprehensive list of Excel shortcuts on the English keyboard:

    http://www.cpearson.com/excel/ShortCuts.aspx

  11. [...] your own keyboard shortcuts in Excel 2007, knowing a few keyboard shortcuts in excel is a huge help. Lyte Byte describes a nifty way to create your own key board shortcuts in [...]

  12. [...] Select a bunch of cells and click on the Sigma symbol on the standard tool bar. Alternatively you can use Alt+= keyboard shortcut. [...]

  13. Prashant R.Moholkar says:

    I do some data entries column A,column B ,Column C , A and B have 10 to 12 digit codes , C has the names ; Kindly suggest me a format or formula for excel to avoid duplication of entries in all the the three columns.

    Regards,
    Prashant

  14. Chandoo says:

    @Prashant... You can use conditional formatting to highlight duplicate entries in the three columns. That way whenever you type a dupe value in a cell the formatting would highlight the values so that you can avoid the error.

    check this post for more on using this way to handling duplicates: http://chandoo.org/wp/2008/03/13/want-to-be-an-excel-conditional-formatting-rock-star-read-this/

    If you are looking for a way to remove duplicates from an existing range, you can try one of the various techniques we have described here. Try these tips:

    http://chandoo.org/wp/2008/11/06/unique-duplicate-missing-items-excel-help/
    http://chandoo.org/wp/2008/08/01/15-fun-things-with-excel/

  15. [...] good alternative (although manual) is to use keyboard shortcuts CTRL + ; or CTRL + : to insert current date and time in the active cell. Since this places the [...]

  16. GesyimmeliA says:

    Your site doesn't correctly work in safari browser

    • Chandoo says:

      Hi GesyimmeliA: Can you tell me which version of Safari on which OS has this problem. I use Macbook at home and loaded the site quite often in Safari and never seen any layout or content issues. Are you facing any script issues while posting comments or somethings like that ?

  17. Daniel Shi says:

    Hey Chandoo. Great site. Learning lots.

    My favorite Excel shortcut has got to be Alt+Down when over an autofilter drop down. Learning that changed my life. That was one of the last things I needed to use a mouse for. Changed my life.

  18. [...] are a big advocate of keyboard shortcuts. I think learning a handful of keyboard shortcuts can improve your productivity tremendously, [...]

  19. Barbara says:

    My favourit keyboard shortcut is control and 1 (use the 1 above the letters on the keyboard, not the number pad) for format cells.

  20. DJ says:

    Favourite shortcut: alt + shift + right/left arrow for grouping/ungrouping!

  21. [...] Select a bunch of cells and click on the Sigma symbol on the standard tool bar. Alternatively you can use Alt+= keyboard shortcut. [...]

  22. [...] clicking on these: excel keyboard shortcuts, excel mouse tips & tricks, excel productivity tips part 1 & part [...]

  23. M Meraz says:

    Martin Williamson thanks for the autofit tip! You rock.

  24. Ayan says:

    In order to generate charts/bar graph with a single key:

    1. Select the data
    2. Press F11
    3. Magic.... 🙂

  25. DiverseIT says:

    F3 = Paste a Name or the entire list of Names
    Crtl + F3 = Name Manager
    Crtl + : = Inserts current time.
    F12 = Save As

  26. DiverseIT says:

    Mistake!
    Crtl + Shift + : = Inserts current time.

  27. JAY SHANKAR says:

    SIR U R THE BEST PERSON WHO SHARES A WONDERFULL AND IMPORTANT TIPS IN EXCEL. THANKS AND KEEP ROCKING.

  28. Amit says:

    How do i hide / unhide a work sheet using the keyboard.

  29. PARBATI says:

    input in one cell 1a23bc output in two cell one of 123 and other one is abc how to possible, please help me.

  30. Woj says:

    Hey cool shortcuts but excel have more shortcuts then you listet.

    i find a big database of supportet shortcuts for Excel 2007 here
    http://www.veodin.com/excel-2007-shortcuts/

  31. jayjaymartin says:

    Great article with some very useful follow-up comments and tips.

    One simple question … how do you vertically align the drop-down filter button in a cell with a larger than normal height?

    It’s easy enough to do so with a cell’s contents but the drop-down filter button stubbornly remains at the bottom and I need it at the top!

    I’ve looked everywhere and haven’t located an explanation to what I am sure is considered an Excel basic.

    Cheers

Leave a Reply