When I saw the Olympic medals won by each country by year infographic on nytimes my jaw almost dropped, go ahead see it and come back, I am sure you will love it too.
It is one of the coolest visualizations I have seen in the recent past and I see infographics all the time, its my passion.
So, I wanted to see if this infographic can be done in Excel, not pixel to pixel, but something close enough to pamper my ego. I was able to create something that looked like this:

Download the Total Olympic medals won by each country since 1896 excel sheet and play with it.
If you want to know how this is done, read on 🙂
1. My first challenge is to get the Olympic medal data per country
Thankfully, Olympics site has the medal counts by country data for each of the 25 editions of the summer games, [click here for 1896] I have copy-pasted the data to my sheet.
2. Next challenge is to find average latitude, longitude for all countries in the world
Thankfully CIA World fact book has the exact data for each country in a table, another ctrl+c, ctrl+v and I have the data in my sheet. [slightly refined data can be found on maxmind as well]
3. Now, the data is not clean
Unfortunately the data copied from Olympics site and CIA fact book doesn’t match as country names were different (USA, United States, United States of America for eg.), country names kept changing (do you know that Australia was called as Australasia sometime back.. :D). So I had to do quite some clean up (mainly using vlookup, filtering unique items etc.)
Finally, I had the data in a tabular format, country names, latitude, longitude, total medals won in rows, Olympic years in columns (1896 to 2004, except 1916, 1940 and 1944 when the games were canceled)
I had to convert latitude and longitude to y and x co-ordinates respectively so that I can plot them on 2 dimensions. I used this logic to do it:
x=(180+longitude)*(map-width/360)
y=(90-latitude)*(map-height/180)
4. Add a scroll bar form control and use it to select the year from 25 Olympic years
This was the easy step. I selected Menu > View > Toolbars > Forms to show the forms toolbar and then inserted a scroll bar control to my sheet. Then I associated it with a cell my sheet and limited the values to change between 1 and 25 (each increment for one of the 25 Olympic years)
![]()
Now, I have associated this scroll bar cell to fetch one Olympic years worth of data.
5. Create a bubble chart with the medal data
Now that I have the data in the format of x, y co-ordinates, medal count for each country for the selected year, I have created a bubble chart with this information, showing bubbles at each pair of (x,y) in the list.
6. Finally, show an outline map of the world in the background

The last step was easy, I searched for an outline map of the world and used it as my chart background, even though this is not part of the original NY Times infographic, it helps me in ensuring that the bubbles are indeed shown in the right places.
Of course there are some differences between my infographic of Olympic medal count and that of NY Times’, mainly,
- The bubbles overlap, but there is nothing I can do about it without writing additional logic. But as Nathan points out, non-overlapping bubbles may be slightly inaccurate.
- The other is, color of bubbles doesn’t change based on the continent it belongs to. Well, this can be done by editing the bubble colors manually, so I gave up.
- Finally, very few countries are omitted in this, mainly due to geopolitical changes, like Germanies getting united, Koreas getting separated, more countries becoming China :D, I did clean up 99% of the data, but there is always a troublesome country you never heard of.
Make sure you download and play with total Olympic medals won by each country since 1896 excel sheet
What do you think of this?
Also see: The art of excel charting – making ubercool dashboards
Junk the default charts, use this art grade templates instead
Did you fire a bullet graph today?















8 Responses to “Pivot Tables from large data-sets – 5 examples”
Do you have links to any sites that can provide free, large, test data sets. Both large in diversity and large in total number of rows.
Good question Ron. I suggest checking out kaggle.com, data.world or create your own with randbetween(). You can also get a complex business data-set from Microsoft Power BI website. It is contoso retail data.
Hi Chandoo,
I work with large data sets all the time (80-200MB files with 100Ks of rows and 20-40 columns) and I've taken a few steps to reduce the size (20-60MB) so they can better shared and work more quickly. These steps include: creating custom calculations in the pivot instead of having additional data columns, deleting the data tab and saving as an xlsb. I've even tried indexmatch instead of vlookup--although I'm not sure that saved much. Are there any other tricks to further reduce the file size? thanks, Steve
Hi Steve,
Good tips on how to reduce the file size and / or process time. Another thing I would definitely try is to use Data Model to load the data rather than keep it in the file. You would be,
1. connect to source data file thru Power Query
2. filter away any columns / rows that are not needed
3. load the data to model
4. make pivots from it
This would reduce the file size while providing all the answers you need.
Give it a try. See this video for some help - https://www.youtube.com/watch?v=5u7bpysO3FQ
Normally when Excel processes data it utilizes all four cores on a processor. Is it true that Excel reduces to only using two cores When calculating tables? Same issue if there were two cores present, it would reduce to one in a table?
I ask because, I have personally noticed when i use tables the data is much slower than if I would have filtered it. I like tables for obvious reasons when working with datasets. Is this true.
John:
I don't know if it is true that Excel Table processing only uses 2 threads/cores, but it is entirely possible. The program has to be enabled to handle multiple parallel threads. Excel Lists/Tables were added long ago, at a time when 2 processes was a reasonable upper limit. And, it could be that there simply is no way to program table processing to use more than 2 threads at a time...
When I've got a large data set, I will set my Excel priority to High thru Task Manager to allow it to use more available processing. Never use RealTime priority or you're completely locked up until Excel finishes.
That is a good tip Jen...