Closing gaps in this Gender Equality Gap chart…

Share

Facebook
Twitter
LinkedIn

Today lets close some gaps.

Recently I saw this interesting chart on Economist Daily Charts page. This chart is based on World Economic Forum’s survey on how women compare to men in terms of various development parameters. First take a look at the chart prepared by Economist team.

Gender inequality gaps in G20 countries - chart from The Economist

So what are the gaps in this chart?

This chart fails to communicate because,

  • All country charts look same, thus making it difficult to spot any deviations.
  • We cannot quickly compare one country with another on any particular indicator.
  • It does not provide a better context (for eg. how did these countries perform last year?)

But criticizing someone’s work is not awesome. Fixing it and making an even better chart, that has awesome written all over it. So that is what we are going to do.

Fixing the gaps in Gender Equality chart

First take a look at the improved chart. Play below video.

Download this Excel Chart.

Step 1: Getting the data for this chart

Although folks at Economist have not included source data, the good people at WEF have provided detailed PDF reports (2013, 2012) where all the data is naked and waiting for us, analyst to pounce and go nuts.

I copy pasted table in to Excel.

While 2012 data loaded alright, 2013 loaded in a weird fashion.

So we move to step 2.

Step 2: Cleaning the data

I feel dirty every time I clean a piece of data 😉

But I also like it (cleaning part, not feeling dirty part). I learn some techniques when I am working with messy, sticky and disorganized data sets.

The 2013 data is pasted in to Excel in this format.

 

2013 gender inequality data looked like this when copy pasted to Excel

From this, we need to transform our data to:

This is how we want to data to look like so that we can easily analyze and make charts - gender gap chart in Excel

If we know magic, we could point our wand at the table and say something like, Mobiliarbus Datum.

Alas. We are muggles. So lets rely on the most potent magic we know: Excel formulas. Using INDEX + MATCH combination, we can easily convert 2013 data to the format we want.

The actual formula to fetch overall rank (2nd item in the list for each country) is,

=INDEX(gaps2013,MATCH($B5,gaps2013,0)+1)

Explanation:

  • gaps2013 is the range where all the 2013 gender gap survey data is copied
  • B5 contains the name of the country for which we want the data.
  • +1 because we want to get rank, not country name.

For more, read how to get VLOOKUP + 1 item.

Step 3: Set up form controls

Now that we have sparkling clean data, lets create necessary form controls on our output sheet.

Form controls on our Interactive gender gap Excel chart

Sorting types in our interactive excel chart - gender inequality in G20 countriesWe need 2 controls.

  1. A combo-box (drop-down) control so that user can select what field to sort the report on.
  2. A set of option buttons to specify which average to compare.

The combo-box is set up to use the list of values shown aside.

Related: Introduction to Excel Form Controls.

Lets link these to 2 cells, named sortCol & avgType on a different sheet. Call this sheet as calculations. All our formulas will go here.

Step 4: Find sort order based on the selected column

This is the tricky part. I am going to give highlights here and point you to a link where you can learn more.

  • Fetch the column we want to sort in a range of cells.

If sorting a number column:

  1. Make the column unique by adding a very small running fraction.
  2. This ensures that if our data has duplicates, still our formula works.
  3. Find the sort order of each item using RANK() formula.
  4. Refer to Sorting KPIs using Formulas article for more on this technique.

If sorting a text column:

  1. Find the sort order using COUNTIF() formula.
  2. Refer to sorting text using formulas article.

Sorting values using formulas - example - Gender gap in G20 countries - interactive Excel chart

Step 5: Re-arrange all data in the sort order

Using INDEX formula, rearrange all data according to the sort order.

Step 6: Calculate % change values

Based on 2012 & 2013 scores, calculate % change and place them in the last 5 columns.

Step 7: Calculate averages

Calculate averages (both G20 & all country values) for all the columns and place them somewhere on your calculations worksheet.

Related: Calculating the average of every nth item.

Step 8: Create charts

Here is the process for creating chart for Overall Score (2013). The same process is used to create all the charts.

  1. Select all the numbers in overall score column.
  2. Create a bar chart
  3. Select vertical axis and press CTRL+1 to format it.
  4. Select “Categories in reverse order.”
  5. Adjust series gap to 25%
  6. Set horizontal axis min to 0 and max to 1 and remove the axis.
  7. Remove vertical axis, grid lines
  8. Remove title
  9. Fill chart background & plot background with no color.
  10. Set chart outline to no outline.
  11. And you are done!

See the demo aside to understand the process.

Steps you need to clean up charts - Gender inequality chart in Excel

Step 9: Add average as secondary series to the chart

Calculate which average to use in the chart based on the avgType value. And fetch that number to a cell.

Now add average to the chart as a line. This can be done by,

  1. Adding average point to the chart as second series
  2. Converting this series to scatter (XY) plot.
  3. Adjusting the X & Y values of the average point.
  4. Adding 100% positive (or negative) error bar
  5. Formatting the error bar to make it look like a line.
  6. Removing any axis, grid lines added in the process.

Step 10: Oh wow, this is getting long. Have a coffee

I guess this is now a fairly long process. But closing gender gaps (or gaps in the gender gap chart) is never easy. So have a cup of coffee or tea. Rejuvenate and come back.

Step 11: Create all other charts

Follow the same process and create rest of the charts.

One easy way to create rest of the charts is,

  1. Copy the first chart and paste it elsewhere.
  2. Select the bars and edit the range address in the formula bar.
  3. Select the average point and edit that too.
  4. Adjust axis if needed.
  5. And you are done!

Step 12: Put everything together

Create a nice table like structure in your output tab and put everything together. Re-size and position the charts as needed. Make sure the colors are nice. Add conditional formatting to highlight column being sorted and you are done!

Improved gender inequality chart - made in Excel with interactive features

Missing Steps

I have deliberately omitted a few steps in this process to keep it simple. For those of you with a keen eye:

  • Using conditional formatting data bars for the % change column.
  • Turning on / off last column in the report based on sort selection using conditional formatting.
  • Adding data labels to the country names based on the sort selection.

Download this Excel chart

Click here to download this Excel chart. Play with it. Explore the chart settings, formats, formulas and controls to understand it better.

Conclusions – What does the Gender Inequality Chart say?

After all this analysis, 2 things are clear.

  • In most countries, women have high equality with men when it comes to health or education.
  • The real gap seems to be in politics & economical development of women.

While this may seem like common sense, it also means, World Economic Forum people should measure inequality on some more parameters. There is little point tracking and analyzing indicators related to health or education (especially in OECD or Western countries).

What do you think?

Want to fill gaps in your Excel knowledge

While no one appreciates gender inequality, we all love awesomeness inequality. There is nothing wrong in wanting to be more awesome than your peers. And here is how you can be unmatched…

Want some challenge… How would you analyze this data?

If you want some challenge, go ahead and download the file. It has all the data for 2012 & 2013. Analyze it and share with me your charts. You can email me at chandoo.d@gmail.com or upload your files somewhere and post the links in comments. I would love to see how you can analyze and present this data.

 

 

Facebook
Twitter
LinkedIn

Share this tip with your colleagues

Excel and Power BI tips - Chandoo.org Newsletter

Get FREE Excel + Power BI Tips

Simple, fun and useful emails, once per week.

Learn & be awesome.

Welcome to Chandoo.org

Thank you so much for visiting. My aim is to make you awesome in Excel & Power BI. I do this by sharing videos, tips, examples and downloads on this website. There are more than 1,000 pages with all things Excel, Power BI, Dashboards & VBA here. Go ahead and spend few minutes to be AWESOME.

Read my storyFREE Excel tips book

Overall I learned a lot and I thought you did a great job of explaining how to do things. This will definitely elevate my reporting in the future.
Rebekah S
Reporting Analyst
Excel formula list - 100+ examples and howto guide for you

From simple to complex, there is a formula for every occasion. Check out the list now.

Calendars, invoices, trackers and much more. All free, fun and fantastic.

Advanced Pivot Table tricks

Power Query, Data model, DAX, Filters, Slicers, Conditional formats and beautiful charts. It's all here.

Still on fence about Power BI? In this getting started guide, learn what is Power BI, how to get it and how to create your first report from scratch.

12 Responses to “29 Excel Formula Tips for all Occasions [and proof that PHD readers truly rock]”

  1. Peder Schmedling says:

    Some great contributions here.
    Gotta love the Friday 13th formula 😀

  2. Aires says:

    Great tips from you all! Thanks a lot for sharing! bsamson, particularly you helped me on a terribly annoying task. 🙂

    (BTW, Chandoo, it's not exactly "Find if a range is normally distributed" what my suggestion does. It checks if two proportions are statistically different. I probably gave you a bad explanation on twitter, but it'd be probably better if you fix it here... 🙂 )

  3. John Franco says:

    Great compilation Chandoo

    For the "Clean your text before you lookup"
    =VLOOKUP(CLEAN(TRIM(E20)),F5:G18,2,0)

    I would like to share a method to convert a number-stored-as-text before you lookup:

    =VLOOKUP(E20+0,F5:G18,2,0)

  4. Chandoo says:

    @Peder, yeah, I loved that formula
    @Aires: Sorry, I misunderstood your formula. Corrected the heading now.
    @John.. that is a cool tip.

  5. Eric Lind says:

    Hey Chandoo,

    That p-value formula is really great for a statistics person like me.

    What a p-value essentially is, is the probability that the results obtained from a statistical test aren't valid. So for example, if my p value is .05, there's a 5% probability that my results are wrong.

    You can play with this if you install the Data Analysis Toolpak (which will perform some statistical tests for you AND provide the P Value.)

    Let's say for example I've got two weeks of data (separated into columns) with the number of hours worked per day. I want to find out if the total number of hours I worked in week two were really all the different than week one.

    Week1 Week2
    10 11
    12 9
    9 10
    7 8
    5 8

    Go to Data > Data Analysis > T-Test Assuming Unequal Variances > OK

    In the Variable 1 Box, select the range of data for week 1.
    In the Variable 2 Box, select the range of data for week 2.
    Check "Labels"
    In the Alpha box, select a value (in percentage terms) for how tolerant you are of error.

    .05 is the general standard; that is to say I am willing to accept a 95% level of confidence that my result is accuarate.

    Select a range output.

    Excel calculates a number of results: Average (mean) for each week's data, etc.

    You'll notice however that there are two P Values; one-tail and two-tail. (one tail tests are for > or .05), the number of hours I worked in week two is statistically equivalent to the number of hours I worked in week one.

    So here’s a way you might want to use this. You put up a new entry on your blog. You think it’s the best entry ever! So you pull your webstats for this week and compare it to last week. You gather data for each week on the length of time a visitor spends on your website. The question you’re trying to prove statistically is whether there’s an average increase in the amount of time spent on your website this week as compared to last week (as a result of your fancy new blog post). You can run the same statistical test I illustrated above to find out. Incidentally, it matters very little to the stat test whether the quantity of visitors differs or not.

    Anyhow, the Data Analysis toolpack doesn't perform a lot of stat tests that folks like me would like to have access to. In those cases I have to either use different software, or write some very complicated mathematical formulas. Having this p-value formula makes my life a LOT easier!

    Thanks!

    Eric~

  6. Balaji OS says:

    Fantastic stuf..One line explanation is cool.
    Thanks to all the contributors

    OS

  7. Locke says:

    Take FirstName, MI, LastName in access (you can fix it to work in excel) capitalize first letter of each and lowercase the rest and add ". " if MI exists then same for last name:
    Full Name: Format(Left([FirstName],1),">") & Format(Right([FirstName]),Len([FirstName])-1),"") & ". ","") & Format(Left([LastName],1),">") & Format(Right([LastName],Len([LastName])-1),"<")

    I teach excel, access, etc etc for a living and i have my access students build this formula one step at a time from the inside out to show how formulas can be made even if it looks complicated. Yes I know I could just do IsNull([MI]) and reverse the order in the Iif() function but the point here is to nest as many functions as possible one by one (also I illustrate how it will fail without the Not() as it is)

  8. Johan says:

    Extract the month from a date
    The easiest formula for this is =MONTH(a1)
    It will return a 1 for January, 2 for February etc.

  9. anjali says:

    if in a column we write the value of total person for eg. 10 if we spent 1.33 paise each person then how we get total amount in next column and the result will in round form plzzzzz solve my problem sir................... thank u

  10. Hui... says:

    @Anjali

    If the value 10 is in B2 and 1.33 paise is in C2 the formula in D2 could be =B2*C2

    If the values are a column of values you can copy the formula down by copy/paste or drag the small black handle at the bottom right corner of cell D2

  11. sajid says:

    kindly share with me new forumulas.

  12. Biswajit Baidya says:

    How to convert a figure like 870.70 into 870 but 871.70 into 880 using excel formula ? Please help.

Leave a Reply