Today lets close some gaps.
Recently I saw this interesting chart on Economist Daily Charts page. This chart is based on World Economic Forum’s survey on how women compare to men in terms of various development parameters. First take a look at the chart prepared by Economist team.

So what are the gaps in this chart?
This chart fails to communicate because,
- All country charts look same, thus making it difficult to spot any deviations.
- We cannot quickly compare one country with another on any particular indicator.
- It does not provide a better context (for eg. how did these countries perform last year?)
But criticizing someone’s work is not awesome. Fixing it and making an even better chart, that has awesome written all over it. So that is what we are going to do.
Fixing the gaps in Gender Equality chart
First take a look at the improved chart. Play below video.
Step 1: Getting the data for this chart
Although folks at Economist have not included source data, the good people at WEF have provided detailed PDF reports (2013, 2012) where all the data is naked and waiting for us, analyst to pounce and go nuts.
I copy pasted table in to Excel.
While 2012 data loaded alright, 2013 loaded in a weird fashion.
So we move to step 2.
Step 2: Cleaning the data
I feel dirty every time I clean a piece of data 😉
But I also like it (cleaning part, not feeling dirty part). I learn some techniques when I am working with messy, sticky and disorganized data sets.
The 2013 data is pasted in to Excel in this format.

From this, we need to transform our data to:

If we know magic, we could point our wand at the table and say something like, Mobiliarbus Datum.
Alas. We are muggles. So lets rely on the most potent magic we know: Excel formulas. Using INDEX + MATCH combination, we can easily convert 2013 data to the format we want.
The actual formula to fetch overall rank (2nd item in the list for each country) is,
=INDEX(gaps2013,MATCH($B5,gaps2013,0)+1)
Explanation:
- gaps2013 is the range where all the 2013 gender gap survey data is copied
- B5 contains the name of the country for which we want the data.
- +1 because we want to get rank, not country name.
For more, read how to get VLOOKUP + 1 item.
Step 3: Set up form controls
Now that we have sparkling clean data, lets create necessary form controls on our output sheet.
![]()
We need 2 controls.
- A combo-box (drop-down) control so that user can select what field to sort the report on.
- A set of option buttons to specify which average to compare.
The combo-box is set up to use the list of values shown aside.
Related: Introduction to Excel Form Controls.
Lets link these to 2 cells, named sortCol & avgType on a different sheet. Call this sheet as calculations. All our formulas will go here.
Step 4: Find sort order based on the selected column
This is the tricky part. I am going to give highlights here and point you to a link where you can learn more.
- Fetch the column we want to sort in a range of cells.
If sorting a number column:
- Make the column unique by adding a very small running fraction.
- This ensures that if our data has duplicates, still our formula works.
- Find the sort order of each item using RANK() formula.
- Refer to Sorting KPIs using Formulas article for more on this technique.
If sorting a text column:
- Find the sort order using COUNTIF() formula.
- Refer to sorting text using formulas article.

Step 5: Re-arrange all data in the sort order
Using INDEX formula, rearrange all data according to the sort order.
Step 6: Calculate % change values
Based on 2012 & 2013 scores, calculate % change and place them in the last 5 columns.
Step 7: Calculate averages
Calculate averages (both G20 & all country values) for all the columns and place them somewhere on your calculations worksheet.
Related: Calculating the average of every nth item.
Step 8: Create charts
Here is the process for creating chart for Overall Score (2013). The same process is used to create all the charts.
- Select all the numbers in overall score column.
- Create a bar chart
- Select vertical axis and press CTRL+1 to format it.
- Select “Categories in reverse order.”
- Adjust series gap to 25%
- Set horizontal axis min to 0 and max to 1 and remove the axis.
- Remove vertical axis, grid lines
- Remove title
- Fill chart background & plot background with no color.
- Set chart outline to no outline.
- And you are done!
See the demo aside to understand the process.

Step 9: Add average as secondary series to the chart
Calculate which average to use in the chart based on the avgType value. And fetch that number to a cell.
Now add average to the chart as a line. This can be done by,
- Adding average point to the chart as second series
- Converting this series to scatter (XY) plot.
- Adjusting the X & Y values of the average point.
- Adding 100% positive (or negative) error bar
- Formatting the error bar to make it look like a line.
- Removing any axis, grid lines added in the process.
Step 10: Oh wow, this is getting long. Have a coffee
I guess this is now a fairly long process. But closing gender gaps (or gaps in the gender gap chart) is never easy. So have a cup of coffee or tea. Rejuvenate and come back.
Step 11: Create all other charts
Follow the same process and create rest of the charts.
One easy way to create rest of the charts is,
- Copy the first chart and paste it elsewhere.
- Select the bars and edit the range address in the formula bar.
- Select the average point and edit that too.
- Adjust axis if needed.
- And you are done!
Step 12: Put everything together
Create a nice table like structure in your output tab and put everything together. Re-size and position the charts as needed. Make sure the colors are nice. Add conditional formatting to highlight column being sorted and you are done!

Missing Steps
I have deliberately omitted a few steps in this process to keep it simple. For those of you with a keen eye:
- Using conditional formatting data bars for the % change column.
- Turning on / off last column in the report based on sort selection using conditional formatting.
- Adding data labels to the country names based on the sort selection.
Download this Excel chart
Click here to download this Excel chart. Play with it. Explore the chart settings, formats, formulas and controls to understand it better.
Conclusions – What does the Gender Inequality Chart say?
After all this analysis, 2 things are clear.
- In most countries, women have high equality with men when it comes to health or education.
- The real gap seems to be in politics & economical development of women.
While this may seem like common sense, it also means, World Economic Forum people should measure inequality on some more parameters. There is little point tracking and analyzing indicators related to health or education (especially in OECD or Western countries).
What do you think?
Want to fill gaps in your Excel knowledge
While no one appreciates gender inequality, we all love awesomeness inequality. There is nothing wrong in wanting to be more awesome than your peers. And here is how you can be unmatched…
- Using Panel charts to understand data
- Spot matrix charts – alternative to radars
- Analyzing performance of stocks using charts
- Suicides & Murders data – interactive analysis
- Visualizing world education rankings
- Survey data analysis with in-cell charts
Want some challenge… How would you analyze this data?
If you want some challenge, go ahead and download the file. It has all the data for 2012 & 2013. Analyze it and share with me your charts. You can email me at chandoo.d@gmail.com or upload your files somewhere and post the links in comments. I would love to see how you can analyze and present this data.












12 Responses to “29 Excel Formula Tips for all Occasions [and proof that PHD readers truly rock]”
Some great contributions here.
Gotta love the Friday 13th formula 😀
Great tips from you all! Thanks a lot for sharing! bsamson, particularly you helped me on a terribly annoying task. 🙂
(BTW, Chandoo, it's not exactly "Find if a range is normally distributed" what my suggestion does. It checks if two proportions are statistically different. I probably gave you a bad explanation on twitter, but it'd be probably better if you fix it here... 🙂 )
Great compilation Chandoo
For the "Clean your text before you lookup"
=VLOOKUP(CLEAN(TRIM(E20)),F5:G18,2,0)
I would like to share a method to convert a number-stored-as-text before you lookup:
=VLOOKUP(E20+0,F5:G18,2,0)
@Peder, yeah, I loved that formula
@Aires: Sorry, I misunderstood your formula. Corrected the heading now.
@John.. that is a cool tip.
Hey Chandoo,
That p-value formula is really great for a statistics person like me.
What a p-value essentially is, is the probability that the results obtained from a statistical test aren't valid. So for example, if my p value is .05, there's a 5% probability that my results are wrong.
You can play with this if you install the Data Analysis Toolpak (which will perform some statistical tests for you AND provide the P Value.)
Let's say for example I've got two weeks of data (separated into columns) with the number of hours worked per day. I want to find out if the total number of hours I worked in week two were really all the different than week one.
Week1 Week2
10 11
12 9
9 10
7 8
5 8
Go to Data > Data Analysis > T-Test Assuming Unequal Variances > OK
In the Variable 1 Box, select the range of data for week 1.
In the Variable 2 Box, select the range of data for week 2.
Check "Labels"
In the Alpha box, select a value (in percentage terms) for how tolerant you are of error.
.05 is the general standard; that is to say I am willing to accept a 95% level of confidence that my result is accuarate.
Select a range output.
Excel calculates a number of results: Average (mean) for each week's data, etc.
You'll notice however that there are two P Values; one-tail and two-tail. (one tail tests are for > or .05), the number of hours I worked in week two is statistically equivalent to the number of hours I worked in week one.
So here’s a way you might want to use this. You put up a new entry on your blog. You think it’s the best entry ever! So you pull your webstats for this week and compare it to last week. You gather data for each week on the length of time a visitor spends on your website. The question you’re trying to prove statistically is whether there’s an average increase in the amount of time spent on your website this week as compared to last week (as a result of your fancy new blog post). You can run the same statistical test I illustrated above to find out. Incidentally, it matters very little to the stat test whether the quantity of visitors differs or not.
Anyhow, the Data Analysis toolpack doesn't perform a lot of stat tests that folks like me would like to have access to. In those cases I have to either use different software, or write some very complicated mathematical formulas. Having this p-value formula makes my life a LOT easier!
Thanks!
Eric~
Fantastic stuf..One line explanation is cool.
Thanks to all the contributors
OS
Take FirstName, MI, LastName in access (you can fix it to work in excel) capitalize first letter of each and lowercase the rest and add ". " if MI exists then same for last name:
Full Name: Format(Left([FirstName],1),">") & Format(Right([FirstName]),Len([FirstName])-1),"") & ". ","") & Format(Left([LastName],1),">") & Format(Right([LastName],Len([LastName])-1),"<")
I teach excel, access, etc etc for a living and i have my access students build this formula one step at a time from the inside out to show how formulas can be made even if it looks complicated. Yes I know I could just do IsNull([MI]) and reverse the order in the Iif() function but the point here is to nest as many functions as possible one by one (also I illustrate how it will fail without the Not() as it is)
Extract the month from a date
The easiest formula for this is =MONTH(a1)
It will return a 1 for January, 2 for February etc.
if in a column we write the value of total person for eg. 10 if we spent 1.33 paise each person then how we get total amount in next column and the result will in round form plzzzzz solve my problem sir................... thank u
@Anjali
If the value 10 is in B2 and 1.33 paise is in C2 the formula in D2 could be =B2*C2
If the values are a column of values you can copy the formula down by copy/paste or drag the small black handle at the bottom right corner of cell D2
kindly share with me new forumulas.
How to convert a figure like 870.70 into 870 but 871.70 into 880 using excel formula ? Please help.