Few days ago, we learned how to create a pie+donut combination chart to visualize polls around the world in 2014. It generated quite a bit of interesting discussion (47 comments so far). One of the comments was from Roberto, who along with Kris & Gábor runs The FrankensTeam an online library of advanced Excel tricks, charts and other mind-boggling spreadsheet wizardry.
I really liked Roberto’s comments on the original post and a charting solution he presented. So I asked him if he can do a guest post explaining the technique to our audience. He obliged and here we go.
Over to FrankensTeam.
Combine pie and xy scatter charts – guest post by The FrankensTeam
Fraü Blucher: I am Fraü Blucher. [horses whinny]
Igor: Steady.
Freddy: Uh, how do you do? I am Dr. Fronkensteen. This is my assistant. Inga, may I present Fraü Blucher. [horses whinny] I wonder what’s got into them.
First of all, we would like to say thank you to Chandoo for asking us to explain how to make this kind of chart.
Recently we have seen an interesting pie-based plot chart by Chandoo. Our proposed version combines 3 different chart types based on some background calculations. The final model is dynamic, you can add more data, and you have the choice to use 1D or 2D data table. All the calculations are prepared on the sheets up to 10 categories. In this guest post we would like to share our template file and show you some of our charting technique.
As an extra, at the end of the post you can find a link to our VBA code which could be used to rotate the chart labels.
Building blocks of the vote-chart
We combined 3 chart types:
- donut chart (two series)
- Outer grey slices
- Inner grey slices with month names
- pie chart (one series)
- Invisible data for placing country labels
- xy scatter chart (three series)
- Brown dots – Legislative
- Blue dots – President
- Orange dots – Referendum
Doughnut series
The two series: month_label and month serve to create the gray ring for the months.
The labels in a doughnut chart are always positioned at the center. By using two series (so two rings) and eliminating the border lines, the two rings seem to be one, but the labels can be positioned at the bottom by adding it to the innermost ring. The reason why we use two rings instead of moving the labels manually is very simple: this way the labels will always stay at the same position, even if you resize the chart. Also it is easier than manually adjust the label boxes.
The month names are linked to the labels from cells (you can see it on the formula bar if you click on one label) because only one axis label could be assigned to the chart, and we use it for the country names (those are more… :-))
XY scatter series
Scatter series are used to arrange the colored dots on the outer ring. This is a main difference from Chandoo’s version. We use 3 series to separate the three different vote categories: presidential, legislative and referendum, and to position the dots of the same country in radial direction as you can see on the original chart. The 3 series form 3 big circles with different radius: legislative is the outermost, referendum is the innermost, but we move the points from the inner circles to the outer, if there is no “higher” vote-type.
Naturally it is possible to adjust the size and shape of the indicators.
We will show you later how to calculate the scatter point positions. (Maybe at first sight it seems to be difficult but you will see it is easy to arrange them properly.)
Our file is prepared to handle more vote-types (or other categories). You will only need to add the new series to the chart!
Pie series
Pie chart is used to position and show labels with the names of the states. The chart itself is hidden (we set to no color and no line) so only the labels are visible.
The number of slices of the pie is determined by the maximum number of countries per month – it needs to be multiplied by 12. All the slices are sized equally and all has a label, but only the ones that we need will have the name of the state, for the rest, the label is an empty string “”.
Formulas behind the chart
For better understanding we separated the data and the support formulas to two sheets. We prepared the file to be able to work with two different types of data table.
You may have the type of vote in one column (1D):
Using some formulas, this table could easily be re-ordered to a pivot-table-like 2D format. This is what you can see in our file on sheet Transpose_data:
This table is the starting point to build up the help data for the charts.
You can find all the calculations on Support sheet. A key element of calculations is the total number of slices for the pie chart. We need to determine the maximum number of countries per month – this will be the number of slices for each month. We use a named formula: max_size_month for this data (here we adapted Chandoo’s MODE-based formula).
The total number of slices will be 12*max_size_month.
The second step is to determine the slice number for each country, and based on that, calculate the the slice angle in radians. If you think about trigonometry, you will remember that sine and cosine together with radius determines the x and y coordinates of the circle points.
We created a calculation table with the necessary formulas. This table is dynamic and prepared to process more data rows and more vote (or other) categories.
The dots are positioned on 3 circles. We use a fixed parameter in a name: circle_distance to set the radiuses of the circles.
We use a support range for both text labels: country names and month. For month names we avoid to use TEXT function with string parameter “mmm” because in non-english systems it will not work! Instead we use Custom cell formatting with code “mmm” – this kind of formatting is translated automatically to locals.
For country names we set the country to the same pie-slice where the dots are, all the rest will have an empty string as label. The column with country name formula will be assigned to the category axis of the chart, but the month names will be linked to the doughnut-series labels one by one, because it is not possible to set two different axis labels. 🙁
How to put it together?
- Select the Legislative x and Legislative y columns, and create a scatter chart.
- Add two more series using the President x and y and Referendum x and y columns.
- Set the axis minimum to -1 maximum to +1 for both of the axes.
- Delete the axes and the grid lines. You can see something like this:
The dots do not form a circle yet, but after you add the pie chart, the shape of the plot area will be a perfect square, so the circle will appear. - Add a new series named for_label using arr_pie both for x and y values:
- Set the chart type of this series to pie and set no fill, no border. Now the dots form perfect circle.
- Link the category axis for this data series to the support column with Label States. (In the Select Data dialogue box click on the “for_label” series, then the Edit button. Select the range from the sheet.)
- Add labels to the pie slices. Set it to show Category name and position Outside end.
- Add two more series (month and month_label) using arr_12 for the values.
- Set the chart type of these two series to doughnut, and set no borders. Color every second slice to darker gray.
- Add data labels for the inner circle, and link the labels one by one to the sheet cells with month names. (Select one label, click on the formula bar, type = and click on the appropriate cell you want to link the label to.)
- Finally you have to hide the 0 data points which appear in the middle of the chart. Add a new xy data series (named “white series”) with fixed values ={0} for x and y. Set a marker of series to the same color as the background of your chart, and use a marker large enough to cover the unnecessary point. 🙂
+1. You can add new xy series if you need – the calculations are already done on the sheets. It is not problem to use over-sized ranges, the error values will become 0 and will appear in the center of the circle – covered by the white series. BUT important for the proper covering, the white series must be the very-last series, so after adding new series, check the order, and move the white series to the bottom of the list.
Bonus: rotate the chart labels using VBA
As you can see on the above picture all the labels are horizontal. To rotate it to radial direction a piece of VBA code is needed. We created this code and published on our site – please feel free to use it for this chart or your other charts (see the link below).
Download the example files
Click here to download the files. Examine the formulas, chart settings and formatting to learn more. This is a highly advanced chart, so take some time to go thru it. You will learn a lot.
Learning points and links:
- Be careful using TEXT formula with string parameter in international environment! You can read about it here.
- Combining xy scatter with pie chart makes the plot area shape perfect square, so it is easy to create a perfect square area for drawing by the xy coordinates. You can read about it here.
- Rotate chart labels to radial or tangential direction is possible with this VBA code.
Added by Chandoo:
Thank you Frankens Team
Thank you so much Robert, Kris and Gábor for taking time to write this. It is a pleasure hosting your article here. I have been following your website for several months and every time I visit it, I end up learning something interesting, creative and just plain awesome. Thanks for sharing your knowledge, ideas and technique with all of us.
Like this chart? Say thanks to Frankens Team
If you enjoyed this chart, please say thanks to Frankens Team. Also visit their site to see how far you can with Excel.
15 Responses to “Highlight Employees by Performance Rating – Conditional Formatting Challenge”
While this might solve the question Shelly asked, there is another option that might be more useful - a pivot table could make a list of people who fall into the various categories, so, if you needed to simply see who got in the top bracket to give them a bonus, you would have that list
Simply sorting by the rankings would work too, but you would knock them out of alphabetical order.
Normal
0
false
false
false
EN-US
X-NONE
X-NONE
/* Style Definitions */
table.MsoNormalTable
{mso-style-name:"Table Normal";
mso-tstyle-rowband-size:0;
mso-tstyle-colband-size:0;
mso-style-noshow:yes;
mso-style-priority:99;
mso-style-parent:"";
mso-padding-alt:0in 5.4pt 0in 5.4pt;
mso-para-margin-top:0in;
mso-para-margin-right:0in;
mso-para-margin-bottom:10.0pt;
mso-para-margin-left:0in;
line-height:115%;
mso-pagination:widow-orphan;
font-size:11.0pt;
font-family:"Calibri","sans-serif";
mso-ascii-font-family:Calibri;
mso-ascii-theme-font:minor-latin;
mso-hansi-font-family:Calibri;
mso-hansi-theme-font:minor-latin;
mso-bidi-font-family:"Times New Roman";
mso-bidi-theme-font:minor-bidi;}
The solution I chose makes use of the percentile formula.
The percentile formula returns the value representing the K-th percentile of a range of values. The range of values is the first criteria, and K is the second criteria in the formula.
I applied Conditional Formatting according to the formulas in the order below:
5% =$C6>=PERCENTILE($C$6:$C$33,0.95) Dark Blue
15% =$C6>=PERCENTILE($C$6:$C$33,0.85) Light Blue
65% =$C6>=PERCENTILE($C$6:$C$33,0.1) Green
10% =$C6>=PERCENTILE($C$6:$C$33,0.05) Light Red
5% =$C6<PERCENTILE($C$6:$C$33,0.05) Dark Red
The issue I noted with this approach is that Zambi was not highlighted in my solution as it is in the solution provided. Unless I am mistaken, and I very well may be, the 10th percentile for this data set is at 2.21, so Zambi would fall above the 10th percentile with a PR of 2.3.
The first step to this was figuring out the 'buckets'; what scores should fall into each range. In attempting to match the formatting of the spreadsheet, I determined the buckets below.
5% = 95% to 100%
10% = 90% up to but not including 95%
65% = 10% up to but not including 90%
10% = 5% up to but not including 10%
5% = under 5%
After that, it is a relatively simple matter to plug the necessary values into the conditional formatting formulas as shown above.
One final consideration is that while the buckets above match the color banding on the spreadsheet, I believe that the original request suggests a different color banding with 6 buckets shown below.
Top 5% = 95 to 100% Dark blue
Top 10% = 85 up to but not including 95% Light blue
Top 65% = 35 up to but not including 85% Green
Bottom 10% = 10% down to but not including 5% Light Red
Bottom 5% = 5% or under Dark Red
This leaves one final bucket of 10 to 35% (exclusive of both values) that is not highlighted and so would remain white.
Thank you Chandoo and Shelly for an interesting and useful exercise. This is certainly a valuable technique to have in my reporting bag of tricks.
Use of PERCENTILE is a smarter way of doing it. Below is my solution.
First 5 % = Apply conditional formatting (Dark Blue) as highlight ">=" =PERCENTILE(C:C,0.95)
Next 15% = Apply conditional formatting (Lighter Blue) as highlight between =PERCENTILE(C:C,0.95)-0.01 and =PERCENTILE(C:C,0.8)
Next 65% = Apply conditional formatting as highlight (Olive Green) between =PERCENTILE(C:C,0.8)-0.01 and =PERCENTILE(C:C,0.15)
Next 10% = Apply conditional formatting as highlight (Lighter Red) between =PERCENTILE(C:C,0.15)-0.01 and =PERCENTILE(C:C,0.05)
Bottom 5% = Apply conditional formatting (Red) as less than =PERCENTILE(C:C,0.05)
I agree, this is a challenge faced by HR managers every year and use of percentile formulae is the most popular solution which permits further processing like making bell curve, applying increments based on segmentation etc.
Hi Chandoo,
I came at the same solution as yours (not looking at yours first) but I have hard coded the conditions in the conditional formatting. For example:
=AND($C6>=$D$10,$C6<$D$9)
I have done the same thing 5 times for each condition. This makes the formatting independent of the order of specification. I think it will work better across versions of excel.
To copy the same thing in all sheets, Shelly can copy these formatted cells with format painter and apply it to the relevant cells in next sheet and so on! I know 700 sheets will be difficult but I dont know of any other way to apply conditional formating rules to the whole sheet.
First i have used percentile formula in the next column of "percentile Threshold" where E5, E6.. is input to colour code.
The idea behind doing this is to replicate the formula for any range and any threshold
=PERCENTILE($C$3:$C$30,1-E5)
=PERCENTILE($C$3:$C$30,1-E6)
=PERCENTILE($C$3:$C$30,1-E7)
=PERCENTILE($C$3:$C$30,1-E8)
=PERCENTILE($C$3:$C$30,1-E9)
Now i have given logic to different employee by applying "if Formula"
=+IF(J3>=$G$5,1,IF(J3>=$G$6,2,IF(J3>=$G$7,3,IF(J3>=$G$8,4,5))))
where 'J" referes to PR and "G" refers to percentile derived from above mentioned formula.
once again it is replicable (just change reference points)
Now comes the major part of Conditional Formatting, i have used "use a formula to determine which cells to be formatted"
Formula =$j=5, format "required colour" Applies to "$I$3:$J$30"
plus put tick on stop if true
This solves the query, important point that this is repeatable and can be done for n number of departments
Thanks !
I had done some reading on it and in Excel 2010 a new function has been introduced, percentile.exc. Attaching a video which also talks why the old percentile function shouldn't be used as it acts erroneous at times. Might be worth a watch Chandoo,
http://www.itechtalk.com/thread10579.html
@Deepa
Quit correct.
Where ever you use statistical spreadsheet functions and are using excel 2010 you should use the new versions of the functions as MS did a lot of work to speed up and fix errors in the old functions.
Warning: If you use the new Excel 2010 statistical functions in Named Formulas most of them will crash excel so do keep that in mind.
Hello Chandoo,
When i first read the challenge file, i thought, the color that need to be applied for a given rule, also need to be picked dynamically as given in rule set. But in the solution file, i found that color is hard Coded. So in case, someone has same data, but wants different colors, he/she needs to goto manage rules and change colors.
Let me know if my understanding is correct, and if yes, can we also make the color to be applied dynamic?
Thanks
Kishore
HI I ALSO USED THE PERCENTILE FUNCTION. HOWEVER, I WENT A STEP FURTHER AND USING THE SMALL() FUNCTION I SORTED THE DATA BY PERCENTILE SO THE COLOSCHEME WOULD BE GROUPED BASED ON THE VALUE. THIS WAY IT IS BETTER AND EASIER TO VIEW.
[...] recently posted a challenge to help a reader with a [...]
Hi, i have got doubt regarding to the percentages that has been put in chandoo's spreadsheet, i cant understadn how he put directly. can some one please explain how chandoo put the percetages straight way that i stated below..
5%
15%
60%
10%
5%
I have stumbled on this post as the solution has been already given so I have taken the liberty to record a video where I show the implementation of it as well as adding a filtering feature which I hope can prove to be useful.
Thank you
http://www.xlninja.com/2012/06/28/how-to-use-excel-to-highlight-employee-performance-rating/
[...] scriu nici macar un cuvant din urmatorul articol. Astazi mi-am citit mailul si hopa challenge de la Chandoo. Cum puteam sa refuz asa ceva si m-am apucat de citit, iar dupa 5 min i-am spus sotului ca pe asta [...]
Question for Chandoo:
I came to your site late but am totally loving these challenges 🙂
I guess it all boils down to how the bins are set up.
I agree with the PERCENTILE.INC function.
pls help me understand where I am wrong.
I have determined following the bins:
bottom 5% <=2.00 (F6:F33 <=PERCENTILE(range,.05))
lower 15% (5+10) <= 2.40 (F6:F33 <=PERCENTILE(range,.15))
lower 80% (5+10+65) <=3.46 (F6:F33 <=PERCENTILE(range,.80))
lower 95% (5+10+65+15) <=4.00 (F6:F33 =PERCENTILE(range,.95))
top 5% <=4.20 (F6:F33 <=PERCENTILE(range,1.00))
I find that only Tom is highest scorer and unique top 5% achiever.
I notice that Chandoo has included Christy and Daniel in top 5% achievers. How can there be 3 people in top 5% out of a population of 28 (5% of 28 = 1.4, i.e. only one person can achieve that status)?
I tried different ways but cannot get to that distribution.
Rest of the work is simply organizing the conditional formatting rules with Stop If True box checked.
Thanks for your insights