A few days ago, we learned how to create a pie+donut combination chart to visualize polls around the world in 2014. It generated quite a bit of interesting discussion (47 comments so far). One of the comments was from Roberto, who along with Kris & Gábor runs The FrankensTeam, an online library of advanced Excel tricks, charts and other mind-boggling spreadsheet wizardry.
I really liked Roberto’s comments on the original post and the charting solution he presented. So I asked him if he could write a guest post explaining the technique to our audience. He obliged, and here we go.
Over to FrankensTeam.
Combine pie and xy scatter charts – guest post by The FrankensTeam
Fraü Blucher: I am Fraü Blucher. [horses whinny]
Igor: Steady.
Freddy: Uh, how do you do? I am Dr. Fronkensteen. This is my assistant. Inga, may I present Fraü Blucher. [horses whinny] I wonder what’s got into them.
First of all, we would like to say thank you to Chandoo for asking us to explain how to make this kind of chart.
Recently we saw an interesting pie-based plot chart by Chandoo. Our proposed version combines 3 different chart types based on some background calculations. The final model is dynamic: you can add more data, and you can choose between a 1D and a 2D data table. All the calculations are prepared on the sheets for up to 10 categories. In this guest post we would like to share our template file and show you some of our charting techniques.

As an extra, at the end of the post you can find a link to our VBA code which could be used to rotate the chart labels.
Building blocks of the vote-chart
We combined 3 chart types:
- donut chart (two series)
  - Outer grey slices
  - Inner grey slices with month names
- pie chart (one series)
  - Invisible data for placing country labels
- xy scatter chart (three series)
  - Brown dots – Legislative
  - Blue dots – President
  - Orange dots – Referendum

Doughnut series
The two series: month_label and month serve to create the gray ring for the months.
The labels in a doughnut chart are always positioned at the center. By using two series (so two rings) and removing the border lines, the two rings look like one, but the labels can be positioned at the bottom by adding them to the innermost ring. The reason we use two rings instead of moving the labels manually is simple: this way the labels always stay in the same position, even if you resize the chart. It is also easier than adjusting the label boxes manually.
The month names are linked to the labels from cells (you can see this in the formula bar if you click on a label) because only one set of axis labels can be assigned to the chart, and we use it for the country names (there are more of those… :-))
XY scatter series
Scatter series are used to arrange the colored dots on the outer ring. This is the main difference from Chandoo’s version. We use 3 series to separate the three vote categories (presidential, legislative and referendum) and to position the dots of the same country in a radial direction, as you can see on the original chart. The 3 series form 3 big circles with different radii: legislative is the outermost and referendum is the innermost, but we move the points from the inner circles outward if there is no “higher” vote type.
Naturally it is possible to adjust the size and shape of the indicators.
We will show you later how to calculate the scatter point positions. (Maybe at first sight it seems to be difficult but you will see it is easy to arrange them properly.)
Our file is prepared to handle more vote-types (or other categories). You will only need to add the new series to the chart!
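To make the "move outward" rule concrete, here is a minimal Python sketch of how it could be expressed outside Excel. It is only our reading of the rule, not the formula used in the workbook: the function name `assign_rings` and the ring numbering (0 = outermost) are hypothetical, and we assume the priority order is Legislative, President, Referendum from the outside in.

```python
# Assumed priority order, from the outermost ring to the innermost one.
VOTE_TYPES = ["Legislative", "President", "Referendum"]


def assign_rings(votes_held):
    """Map each vote type a country held to a ring index (0 = outermost).

    Vote types keep their relative order, but gaps are closed: if a
    "higher" vote type is missing, the lower ones move outward.
    """
    present = [vote for vote in VOTE_TYPES if vote in votes_held]
    return {vote: ring for ring, vote in enumerate(present)}


# All three vote types: each one stays on its own ring.
print(assign_rings({"Legislative", "President", "Referendum"}))
# Only a referendum: its dot moves to the outermost ring.
print(assign_rings({"Referendum"}))
```

A new vote type would only need to be appended to the hypothetical `VOTE_TYPES` list in this sketch.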
Pie series
The pie chart is used to position and show the labels with the names of the states. The chart itself is hidden (we set it to no fill and no line) so only the labels are visible.
The number of slices of the pie is the maximum number of countries per month multiplied by 12. All the slices are sized equally and all have a label, but only the ones we need show the name of the state; for the rest, the label is an empty string “”.
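The label layout can be sketched in a few lines of Python. This is an illustration under our own assumptions, not the workbook formula: the function `build_slice_labels`, the dictionary input and the sample country names are all hypothetical.

```python
def build_slice_labels(countries_by_month, slices_per_month):
    """Return one label per pie slice: a country name or an empty string.

    countries_by_month maps a month number (1-12) to a list of names.
    Every month gets exactly slices_per_month equally sized slices.
    """
    labels = []
    for month in range(1, 13):
        names = list(countries_by_month.get(month, []))
        if len(names) > slices_per_month:
            raise ValueError(f"month {month} has more names than slices")
        # Pad the unused slices of this month with empty labels.
        labels.extend(names + [""] * (slices_per_month - len(names)))
    return labels


# Hypothetical sample data: two votes in January, one in March.
sample = {1: ["Country A", "Country B"], 3: ["Country C"]}
labels = build_slice_labels(sample, slices_per_month=2)
print(len(labels))  # 12 months * 2 slices per month
print(labels[:6])
```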
Formulas behind the chart
For better understanding we separated the data and the support formulas into two sheets. We prepared the file to work with two different types of data table.
You may have the type of vote in one column (1D):

Using some formulas, this table could easily be re-ordered to a pivot-table-like 2D format. This is what you can see in our file on sheet Transpose_data:

This table is the starting point to build up the help data for the charts.
You can find all the calculations on the Support sheet. A key element of the calculations is the total number of slices for the pie chart. We need to determine the maximum number of countries per month, which will be the number of slices for each month. We use a named formula, max_size_month, for this (here we adapted Chandoo’s MODE-based formula).
The total number of slices will be 12*max_size_month.
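For readers who prefer to see the logic spelled out, the same quantity can be sketched in Python. This is not the MODE-based Excel formula from the file, just an equivalent count under the assumption that each data row carries a month number; the sample list is made up.

```python
from collections import Counter


def max_size_month(months):
    """Largest number of data rows that fall into the same month."""
    return max(Counter(months).values())


# Hypothetical month numbers, one per country row in the data table.
months = [1, 1, 3, 3, 3, 10]
slices_per_month = max_size_month(months)
total_slices = 12 * slices_per_month
print(slices_per_month, total_slices)
```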
The second step is to determine the slice number for each country and, based on that, calculate the slice angle in radians. If you think back to trigonometry, you will remember that sine and cosine together with the radius determine the x and y coordinates of the points on a circle.
We created a calculation table with the necessary formulas. This table is dynamic and prepared to process more data rows and more vote (or other) categories.

The dots are positioned on 3 circles. We use a fixed parameter stored in a name, circle_distance, to set the radii of the circles.
We use a support range for both text labels: country names and months. For month names we avoid using the TEXT function with the string parameter “mmm” because it will not work on non-English systems! Instead we use custom cell formatting with the code “mmm”, which is translated automatically to the local language.
For country names we assign the country to the same pie slice where its dots are; all the rest will have an empty string as label. The column with the country name formula will be assigned to the category axis of the chart, but the month names will be linked to the doughnut-series labels one by one, because it is not possible to set two different axis labels. 🙁
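The trigonometry behind the dot positions can be sketched as follows. This is a minimal Python illustration, not the exact formula in the Support sheet. We assume the angle is measured clockwise from 12 o'clock (the way a pie chart with default settings starts), that the dot sits in the middle of its slice, and that `outer_radius` and the default value of `circle_distance` are placeholder numbers chosen to fit the -1 to +1 axis range.

```python
import math


def dot_position(slice_number, total_slices, ring,
                 outer_radius=1.0, circle_distance=0.1):
    """Return (x, y) for a dot in the given slice and ring.

    slice_number is 1-based; ring 0 is the outermost circle and each
    further ring is circle_distance closer to the center.
    """
    # Angle at the middle of the slice, clockwise from 12 o'clock.
    angle = 2 * math.pi * (slice_number - 0.5) / total_slices
    radius = outer_radius - ring * circle_distance
    # With a clockwise angle from the top, sine gives x and cosine gives y.
    x = radius * math.sin(angle)
    y = radius * math.cos(angle)
    return x, y


# First slice of 36, on the outermost and on the innermost ring.
print(dot_position(1, 36, ring=0))
print(dot_position(1, 36, ring=2))
```

Dots of the same country share the angle and differ only in radius, which is what lines them up in the radial direction.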
How to put it together?
- Select the Legislative x and Legislative y columns, and create a scatter chart.
- Add two more series using the President x and y and Referendum x and y columns.
- Set the axis minimum to -1 and the maximum to +1 for both axes.
- Delete the axes and the grid lines. You can see something like this:

The dots do not form a circle yet, but after you add the pie chart, the shape of the plot area will be a perfect square, so the circle will appear.
- Add a new series named for_label using arr_pie both for x and y values:

- Set the chart type of this series to pie and set no fill, no border. Now the dots form a perfect circle.
- Link the category axis for this data series to the support column with Label States. (In the Select Data dialogue box click on the “for_label” series, then the Edit button. Select the range from the sheet.)
- Add labels to the pie slices. Set it to show Category name and position Outside end.
- Add two more series (month and month_label) using arr_12 for the values.
- Set the chart type of these two series to doughnut, and set no borders. Color every second slice to darker gray.
- Add data labels for the inner circle, and link the labels one by one to the sheet cells with month names. (Select one label, click on the formula bar, type = and click on the appropriate cell you want to link the label to.)
- Finally you have to hide the 0 data points which appear in the middle of the chart. Add a new xy data series (named “white series”) with fixed values ={0} for x and y. Set the marker of this series to the same color as the background of your chart, and use a marker large enough to cover the unwanted points. 🙂
+1. You can add new xy series if you need them; the calculations are already done on the sheets. It is not a problem to use over-sized ranges: the error values will become 0 and will appear in the center of the circle, covered by the white series. BUT, importantly, for proper covering the white series must be the very last series, so after adding new series, check the order and move the white series to the bottom of the list.
Bonus: rotate the chart labels using VBA
As you can see in the picture above, all the labels are horizontal. To rotate them to a radial direction, a piece of VBA code is needed. We created this code and published it on our site; please feel free to use it for this chart or your other charts (see the link below).

Download the example files
Click here to download the files. Examine the formulas, chart settings and formatting to learn more. This is a highly advanced chart, so take some time to go thru it. You will learn a lot.
Learning points and links:
- Be careful when using the TEXT function with a string parameter in an international environment! You can read about it here.
- Combining xy scatter with a pie chart makes the plot area a perfect square, so it is easy to create a perfect square area for drawing by xy coordinates. You can read about it here.
- Rotating chart labels to a radial or tangential direction is possible with this VBA code.
Added by Chandoo:
Thank you Frankens Team
Thank you so much Roberto, Kris and Gábor for taking the time to write this. It is a pleasure hosting your article here. I have been following your website for several months and every time I visit it, I end up learning something interesting, creative and just plain awesome. Thanks for sharing your knowledge, ideas and techniques with all of us.
Like this chart? Say thanks to Frankens Team
If you enjoyed this chart, please say thanks to Frankens Team. Also visit their site to see how far you can go with Excel.

40 Responses to “Looking up when the data won’t co-operate (case study)”
Nice Trick.. Clever use of cell references
Here is a formula I tried to create:
=SUMPRODUCT(((NOT(ISERROR(SEARCH(L5,B4:H14))))*1),(B5:H15))
It takes care of Caveat #1 (can handle text), but Caveat #2 remains.
In situations like this, I will often use VBA to restructure the data (2 columns: dates and values) on to a new worksheet. I can then use this 'clean' source for data analysis (formula or pivot table).
=SUMPRODUCT(((NOT(ISERROR(SEARCH(L5,B4:H14))))*1),(B5:H15)) and complex formulae in general are all very well but when you come back to them in a few weeks / months time, it is not at all easy to see what they do and what the limitations are.
Hi Chandoo,
I had used this type of cell ref. various times while calculating average.
But for the situation here try below formula . Note this is an array formula and must be confirmed with Ctrl+Shift+Enter.
=SMALL(IF(MMULT((L5=B4:H14)*IF(ISNUMBER(B5:H15),B5:H15),{1;1;1;1;1;1;1}),MMULT((L5=B4:H14)*IF(ISNUMBER(B5:H15),B5:H15),{1;1;1;1;1;1;1})),1)
Regards,
Hi, I think using SEARCH here will create a problem: say there is a text like SUN and another text SUNLIGHT; both results will be added by SUMPRODUCT.
Regards,
Array option.
=SUM(IF(MOD(ROW(B4:B14),2)=MOD(ROW(B4),2),IF(B4:H14=L5,B5:H15)))
Regards
@Elias,
Nice approach.
Although not requested - the formula I suggested closes all possibilities.
Criteria: _____ Value to retrieve:
Date__________ Numeric
Date__________Textual
Textual________Numeric
Textual________Textual
While your formula copes with only the 3 first combinations.
Michael (Micky) Avidan
“Microsoft® Answer” – Wiki author & Forums Moderator
“Microsoft®” MVP – Excel (2009-2015)
ISRAEL
@Michael,
Sorry but I don’t understand your point. I believe the challenge was to return the summary of a given date. What is your really volatile formula doing that mine is not?
Regards
@Elias,
I didn't say that the challenge differs from what you just mentioned/aimed to nor that your formula doesn't provide the requested result.
Please read my previous comment again and focus on the last combination (TEXT / TEXT).
I, myself, always try to provide a global Formula that is capable to handle all sorts of data.
Michael (Micky) Avidan
“Microsoft® Answer” – Wiki author & Forums Moderator
“Microsoft®” MVP – Excel (2009-2015)
ISRAEL
@Michael,
I see your point, but you are missing the below points if you are trying to cover all sorts of data.
What happens if the lookup value does not exist?
Do you want the first, second, summary, or concatenation of the values if the lookup value is repeated?
You see, there are too many possibilities to be covered with just one formula.
Regards
1) The range: B4:H15 was named: RNG.
2) The following Array Formula was "retrieved from my sleeve" and I hope it can be shortened.
3) The formula seems to take care of BOTH(!) caveats.
-------------------------------------------------------------------------------
=OFFSET(INDIRECT(ADDRESS(SMALL(IF(RNG=L5,ROW(RNG),""),1),MOD(SMALL(IF(RNG=L5,(ROW(RNG))+COLUMN(RNG)/10),1),1)*10)),1,)
-------------------------------------------------------------------------------
Michael (Micky) Avidan
“Microsoft® Answer” – Wiki author & Forums Moderator
“Microsoft®” MVP – Excel (2009-2015)
ISRAEL
@Michael,
Check what happened with your result if you type 41927 in D5.
Regards
Correct. Didn't predict that.
Will find time to work something out.
Michael (Micky) Avidan
“Microsoft® Answer” – Wiki author & Forums Moderator
“Microsoft®” MVP – Excel (2009-2015)
ISRAEL
@Elias,
Let's hope the following Array Formula "closes all open doors".
Again - it has nothing to do with your formula which works fine as long as the 3 first mentioned combinations are concerned.
-------------------------------------------------------------------------------
=INDEX(RNG,LARGE(IF(RNG=L5,MOD(ROW(RNG)-1,2)*(ROW(RNG)),""),1)-2,(MOD(SMALL(IF(RNG=L5,(ROW(RNG))+COLUMN(RNG)/10),1),1)*10)-1)
-------------------------------------------------------------------------------
Michael (Micky) Avidan
“Microsoft® Answer” – Wiki author & Forums Moderator
“Microsoft®” MVP – Excel (2009-2015)
ISRAEL
Ok, if you insist. The following will cover all the scenarios you listed. However, I’ll never recommend/use such a formula.
Defined names:
rDat = $B$4:$H$15
rRow =ROW(rDat)-MIN(ROW(rDat))+1
rCol =COLUMN(rDat)-MIN(COLUMN(rDat))+1
rInc =MOD(rRow,2)=MOD(MIN(rRow),2)
L6=INDEX(rDat,MAX(IF(rInc,IF(rDat=L5,rRow)))+1,MAX(IF(rInc,IF(rDat=L5,rCol))))
Array Enter
Regards
@Michael,
unfortunately, your array formula still seems to return wrong results (eg 3-Nov).
If data are organized like in the example, ie. looks like a calendar, the INDEX formula seems quite simple:
=INDEX($B$4:$H$15,ROUNDDOWN((L5-B4)/7,0)*2+2,MOD((L5-B4),7)+1)
Yours is effectively the same as what I just came up with, and I believe this is the optimal answer to this particular problem.
My solution, before I saw yours:
=OFFSET(B5,QUOTIENT(L5-B4,7)*2,MOD(L5-B4,7))
OFFSET will work for an arbitrary list size, but INDEX might be easier to read.
QUOTIENT does the round and division in a single step.
If there's an improvement over Elias's solution then I for one can't see it.
Perhaps a non-CSE version which would also mean that only two references (B4:H14 and B5:H15), as opposed to three (B4, B4:B14 and B5:H15), would require manually amending should the data range change, i.e.:
=SUMPRODUCT((ISEVEN(ROW(B4:H14)-MIN(ROW(B4:H14)))*(B4:H14=L5)*B5:H15))
I suppose we could make it a single, uniform range reference:
=SUMPRODUCT((ISEVEN(ROW(B4:H14)-MIN(ROW(B4:H14)))*(B4:H14=L5)*OFFSET(B4:H14,1,,,)))
which might be more appropriate should we e.g. wish to use a Defined Name for our range, i.e.:
=SUMPRODUCT((ISEVEN(ROW(Rng)-MIN(ROW(Rng)))*(Rng=L5)*OFFSET(Rng,1,,,)))
though whether that compensates for the extra, volatile function call is something to be debated.
Regards
I have tried something and then my Excel workbooks got shut down. Maybe that was too much?
Anyway here is what I've tried:
=SUMPRODUCT(INDEX(B5:H15;IF(ISEVEN(ROW(B5:H15));ROW(B5:B15)-ROW(B5)+1);{1\2\3\4\5\6\7}))
Guess that was wrong? Would this approach work anyway?
Looking forward to learn something from you Excel Experts.
Sorry, I hadn't taken notice of XOR LX's answer. I guess that's kind of what I was looking for.
@Michael Avidan
As it stands that is not a very rigorous construction.
You say "I, myself, always try to provide a global Formula that is capable to handle all sorts of data", which is a wonderful philosophy, but isn't it at least as important that we ensure that our formulas are independent of the row and column references of the data range in question, so that, should that range change, we do not have to re-work our solution?
What happens with your formula, for example, if RNG is instead re-located one row down, from B4:H15 to B5:H16?
When a formula is reliant upon the addition/subtraction of certain constants within the formula, which themselves are necessarily dependent upon the specific rows/columns in which the data lies at any given time (e.g. the -1 in MOD(ROW(RNG)-1,2)), then that formula is not a very flexible one.
Hence the reason for my choice of a slightly longer construction:
ROW(B4:H14)-MIN(ROW(B4:H14))
which ensures that this part of the calculation is not dependent upon the precise location of the data range within the worksheet, and so will give correct answers even if that range is re-located.
Regards
{=OFFSET(B4,MAX((B4:H15=L5)*ISODD(ROW(1:12))*ROW(1:12)),MAX((B4:H15=L5)*ISODD(ROW(1:12))*COLUMN(A:G))-1)}
Non-array formula:
=INDEX(B4:H15,SUMPRODUCT((B4:H15=L5)*(ROW(B4:H15)-ROW(B4)+1)*ISODD(ROW(B4:H15)-ROW(B4)+1))+1,SUMPRODUCT((B4:H15=L5)*(COLUMN(B4:H15)-COLUMN(B4)+1)*ISODD(ROW(B4:H15)-ROW(B4)+1)))
Using one range (B4:H15), one reference (B4), one lookup value (L5) and no INDIRECT or OFFSET.
My trial with defined names:
DateRange
=$B$4:$H$4,$B$6:$H$6,$B$8:$H$8,$B$10:$H$10,$B$12:$H$12,$B$14:$H$14
Position
=RANK('lookup problem'!$L$5,DateRange,1)
L6
=OFFSET(B4,ROUNDUP(Position/7,0)*2- 1,IF(MOD(Position,7)=0,6,MOD(Position,7)-1))
I'd probably just run with something like:
=SUMPRODUCT((B4:H14=L5)*(MOD(ROW(B4:H14),2)=MOD(ROW(B4),2))*B5:H15)
...which is basically the same as Elias' but without the IFs
The opposite of elegant but it works...
=INDEX(B4:H15,IFERROR(MATCH(L5,B4:B14,0),0)+IFERROR(MATCH(L5,C4:C14,0),0)+IFERROR(MATCH(L5,D4:D14,0),0)+IFERROR(MATCH(L5,E4:E14,0),0)+IFERROR(MATCH(L5,F4:F14,0),0)+IFERROR(MATCH(L5,G4:G14,0),0)+IFERROR(MATCH(L5,H4:H14,0),0)+1,IFERROR(MATCH(L5,B4:H4,0),0)+IFERROR(MATCH(L5,B6:H6,0),0)+IFERROR(MATCH(L5,B8:H8,0),0)+IFERROR(MATCH(L5,B10:H10,0),0)+IFERROR(MATCH(L5,B12:H12,0),0)+IFERROR(MATCH(L5,B14:H14,0),0))
=INDEX(B4:H15,
IFERROR(MATCH(L5,B4:B14,0),0)+
IFERROR(MATCH(L5,C4:C14,0),0)+
IFERROR(MATCH(L5,D4:D14,0),0)+
IFERROR(MATCH(L5,E4:E14,0),0)+
IFERROR(MATCH(L5,F4:F14,0),0)+
IFERROR(MATCH(L5,G4:G14,0),0)+
IFERROR(MATCH(L5,H4:H14,0),0)+1,
IFERROR(MATCH(L5,B4:H4,0),0)+
IFERROR(MATCH(L5,B6:H6,0),0)+
IFERROR(MATCH(L5,B8:H8,0),0)+
IFERROR(MATCH(L5,B10:H10,0),0)+
IFERROR(MATCH(L5,B12:H12,0),0)+
IFERROR(MATCH(L5,B14:H14,0),0))
Named Range
rownum = SUMPRODUCT(('lookup problem'!$B$4:$H$14='lookup problem'!$L$5)*ROW('lookup problem'!$B$4:$H$14)*ISEVEN(ROW('lookup problem'!$B$4:$H$14)))
Formula
=OFFSET($A$1,rownum,MATCH(L5,INDIRECT("$B"&rownum&":$H"&rownum),0))
How about SUM(IF(B4:H14=L5,B5:H15)) with array..it should work
Sorry, Chandoo, you can't find stuff this way in every possible scenario.
What if 2014-10-01 sales equaled 41.927, which is the serial number for 2014-10-15? SUMIF would fail to retrieve the correct answer. And your example data suggest that such a number is possible in your table.
It's better not to search through dates and numbers at the same time.
If I had to solve a problem like this, I'd reformat the table first so I get one column with dates and the other with numbers.
In this case, formula to form date column would be:
=INDIRECT(ADDRESS((INT((ROW()-4)/COUNT($B$4:$H$4))+1)*2+2;MOD(ROW()-4;COUNT($B$4:$H$4))+2;4;1))
and numbers would be the same formula with a slight adjustment (+3 instead of +2 at the end of the first argument):
=INDIRECT(ADDRESS((INT((ROW()-4)/COUNT($B$4:$H$4))+1)*2+3;MOD(ROW()-4;COUNT($B$4:$H$4))+2;4;1))
And now you got two columns that you can safely use for searching!
Oops, sorry, you actually mentioned that it doesn't work if number=date! I missed that part 🙁
={OFFSET(A1,SUM((B4:H14=L5)*ROW((B4:H14))),SUM((B4:H14=L5)*COLUMN((B4:H14)))-1)}
Works for all data... the solution I got for indirect looks little lengthy
I want to count last 20 records of a person, whose marks is greater than 2 and grade "manager". ....
Assume A1 has got names (James, John...etc...)
A2 "Manager"
A3 "2"
Someone please reply
I want to count last 20 records of a person, whose marks is greater than "2" and grade "Manager"
Assume A1 "geroge" A2 "Michael" A3 "George" etc...name can found anywhere in the rows
B1 "Manager" B2" clerk"
C1 "2" C2, "4"
please reply
Simplest I can come up with. No limitations for either 1 or 2. This does assume dates are an ordered list with 7 per row, and 2 rows per set. Assuming this is always true this will work for an arbitrary long list of dates.
=OFFSET(B5,QUOTIENT($L$5-$B$4,7)*2,MOD($L$5-$B$4,7))
@Marc,
Nice approach - however, as there are no "Negative Dates" - try:
=OFFSET(B5,INT((L5-B4)/7)*2,MOD(L5-B4,7))
——————————————————————————-
Michael (Micky) Avidan
“Microsoft® Answer” – Wiki author & Forums Moderator
“Microsoft®” MVP – Excel (2009-2015)
ISRAEL
=OFFSET(B4,ROUNDUP((L5-41911+1)/7,0)*2-1,MOD(L5-41911,7))
B4 has been used as reference cell for OFFSET().
FOR ROWS:
ROUNDUP(....,0) gives the integer value of a division. In case of presence of a remainder, ROUNDUP will add 1 to the Quotient.
As opposed to ROUNDUP(), the INT() or QUOTIENT() functions eliminate the remainder.
41911 = 01-Sept-2014, the first date in the data.
*2 has been used because there are 2 rows per set of data.
/7 has been used because there are 7 columns per set of data.
For columns
MOD(L5-41911,7))
Vijaykumar Shetye,
Panaji, Goa, India
This is how i did it
{=INDEX(B4:H15, MAX((L5=B4:H15)*ROW(B4:H15))-2, MAX((L5=B4:H15)*COLUMN(B4:H15))-1 )}
Here's my solution:
=INDEX(B4:H15,MATCH(1,MMULT(--(B4:H15=L5),TRANSPOSE(COLUMN(B4:H15)^0)),0)+1,MATCH(1,MMULT(TRANSPOSE(--(B4:H15=L5)),ROW(B4:H15)^0),0))
Sorry, forgot to mention Ctrl Shift Enter is needed.