Using an Array Formula to Find and Count the Maximum Text Occurrences in a Range


A week ago Tarun asked a question on the Chandoo.org Forums.

“I have got multiple names in each row and would like to have what name is repeated maximum number of times and how many times?

Eg. Ram, Amita, Obama, Ram, Willi, Ram, Amita, Chandoo, Ram, Willi

Ans: Ram (4 times)”

(The list and answers are edited)

Chandoo responded with a neat Array Formula:

=INDEX(B2:K2,MATCH(MAX(COUNTIF(B2:K2,B2:K2)), COUNTIF(B2:K2,B2:K2),0))&" ("&MAX(COUNTIF(B2:K2,B2:K2))&" times)"

Let’s take a look inside this and see how it works.

 

THE EXAMINATION

The formula has two parts joined by an &:

=INDEX(B2:K2,MATCH(MAX(COUNTIF(B2:K2,B2:K2)), COUNTIF(B2:K2,B2:K2),0))

and

" ("&MAX(COUNTIF(B2:K2,B2:K2))&" times)"

Each part is separate and can be used independently. The & character simply joins the two parts into a single string that answers Tarun’s question: Ram (4 times).

Now, let’s look at each part.

You can follow along with this forensic examination by downloading the Sample Data File.

 

=INDEX(B2:K2,MATCH(MAX(COUNTIF(B2:K2,B2:K2)), COUNTIF(B2:K2,B2:K2),0))

This is a single INDEX function with 2 components:

a Range, B2:K2, and

a Counter, MATCH(MAX(COUNTIF(B2:K2,B2:K2)), COUNTIF(B2:K2,B2:K2),0)

Typically an INDEX function uses 3 components:

=INDEX(Array, Row Number, [Column Number])

In this example the Range is a single Row, B2:K2.

When the Range is a single Row, INDEX treats a lone number as a position along that Row, so the Counter in the Row spot acts as the Column Number.

So the formula used:

=INDEX(B2:K2,MATCH(MAX(COUNTIF(B2:K2,B2:K2)), COUNTIF(B2:K2,B2:K2),0))

Is equivalent to:

=INDEX(B2:K2,1,MATCH(MAX(COUNTIF(B2:K2,B2:K2)), COUNTIF(B2:K2,B2:K2),0))

 

Now let’s jump ahead to the COUNTIF(B2:K2,B2:K2) bit.

If you copy =COUNTIF(B2:K2,B2:K2) to a cell, press F2 and then evaluate the formula using F9, you will see that it returns an array. The array is enclosed in curly brackets, { }:

={4,2,1,4,2,4,2,1,4,2}

This is the heart of the solution.

This shows, for each position in the range B2:K2, how many times that cell’s value occurs in the range B2:K2.
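
For readers who want to check the array outside Excel, here is a minimal Python sketch of what COUNTIF(B2:K2,B2:K2) computes. The list `names` stands in for B2:K2, and `countif_array` is a hypothetical helper name, not an Excel or library function. It only mimics COUNTIF's case-insensitive text comparison and ignores wildcards.

```python
# Stand-in for cells B2:K2 (the edited list from Tarun's question).
names = ["Ram", "Amita", "Obama", "Ram", "Willi",
         "Ram", "Amita", "Chandoo", "Ram", "Willi"]


def countif_array(values):
    """Hypothetical helper: for each value, count its occurrences in the list.

    COUNTIF ignores case when comparing text, so values are lowercased first.
    Wildcards and numeric criteria are not handled in this sketch.
    """
    lowered = [v.lower() for v in values]
    return [lowered.count(v) for v in lowered]


counts = countif_array(names)
print(counts)  # [4, 2, 1, 4, 2, 4, 2, 1, 4, 2]
```

Each position in the result lines up with the same position in the list, which is what lets MATCH and INDEX work together later.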

So the formula

=INDEX(B2:K2,MATCH(MAX(COUNTIF(B2:K2,B2:K2)), COUNTIF(B2:K2,B2:K2),0))

Is equivalent to

=INDEX(B2:K2,MATCH(MAX({4,2,1,4,2,4,2,1,4,2}), {4,2,1,4,2,4,2,1,4,2},0))

Looking at the MAX({4,2,1,4,2,4,2,1,4,2}) part, this simplifies to 4, the Maximum value of the array (Remember this line, we’ll come back to it later).

So our simplified formula is now: =INDEX(B2:K2,MATCH(4, {4,2,1,4,2,4,2,1,4,2},0))

Now look at the MATCH(4, {4,2,1,4,2,4,2,1,4,2},0) part of the equation.

MATCH is looking for the value 4 in the array {4,2,1,4,2,4,2,1,4,2}. The 0 requests an exact match, and MATCH returns the position of the first one it finds, which is the first value, Position 1.

So MATCH(4, {4,2,1,4,2,4,2,1,4,2},0) evaluates to 1.

So our equation =INDEX(B2:K2,MATCH(4, {4,2,1,4,2,4,2,1,4,2},0))

Is now simplified even more to =INDEX(B2:K2, 1)

INDEX then looks in B2:K2 and returns the first cell, “Ram” in this example.
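
The MAX, MATCH and INDEX steps can be traced the same way. This is a rough Python sketch, assuming the counts array from the COUNTIF step. The variable names are made up for illustration, and Python's 0-based positions are shifted by one to line up with the 1-based positions that MATCH and INDEX use.

```python
# Stand-in for cells B2:K2 and for the array returned by COUNTIF(B2:K2,B2:K2).
names = ["Ram", "Amita", "Obama", "Ram", "Willi",
         "Ram", "Amita", "Chandoo", "Ram", "Willi"]
counts = [4, 2, 1, 4, 2, 4, 2, 1, 4, 2]

# MAX({4,2,1,4,2,4,2,1,4,2})
highest = max(counts)

# MATCH(4, {...}, 0): position of the first exact match, counted from 1.
position = counts.index(highest) + 1

# INDEX(B2:K2, 1): the cell at that position in the single-row range.
winner = names[position - 1]

print(highest, position, winner)  # 4 1 Ram
```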

 

&" ("&MAX(COUNTIF(B2:K2,B2:K2))&" times)"

The second part of the equation is responsible for counting the number of times Ram occurs and displaying it with some text.

The quoted strings " (" and " times)" add the text around the Count.

Remember the section MAX(COUNTIF(B2:K2,B2:K2)), which was explained above and evaluates to 4 in this case.

So the " ("&MAX(COUNTIF(B2:K2,B2:K2))&" times)" part evaluates to:  (4 times)

With the initial & adding it to the text of the first part, Ram, for the final result: Ram (4 times)
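
Putting both parts together, the whole formula can be sketched as one small Python function. The name `most_frequent_label` is hypothetical, and the comparison here is case-sensitive for brevity, unlike COUNTIF.

```python
def most_frequent_label(values):
    """Sketch of the full formula: most frequent value plus its count as text."""
    # COUNTIF(range, range): occurrences of each value, position by position.
    counts = [values.count(v) for v in values]
    # MAX(...) of that array.
    highest = max(counts)
    # INDEX(range, MATCH(highest, counts, 0)): first value with the top count.
    winner = values[counts.index(highest)]
    # The & operator joins the name and the formatted count.
    return winner + " (" + str(highest) + " times)"


names = ["Ram", "Amita", "Obama", "Ram", "Willi",
         "Ram", "Amita", "Chandoo", "Ram", "Willi"]

print(most_frequent_label(names))  # Ram (4 times)
print(most_frequent_label(["Willi", "Amita", "Amita", "Willi"]))  # Willi (2 times)
```

The second call shows the tie behaviour: when two names share the top count, the one that appears first in the list is reported, because only the first exact match is used.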

 

LEARN MORE ABOUT ARRAY FORMULAS

You can learn more about Array Formulas at the following links:

http://www.cpearson.com/excel/ArrayFormulas.aspx

http://www.databison.com/index.php/excel-array-formulas-excel-array-formula-syntax-array-constants/

http://office.microsoft.com/en-us/excel-help/introducing-array-formulas-in-excel-HA001087290.aspx

 

Chandoo.org has several articles on Array Formulas

http://chandoo.org/wp/tag/array-formulas/

 

FORENSIC FORMULAS

Would you like to see more “Forensic” examinations of complex formulas?

Let us know in the comments below and it may become a regular section at Chandoo.org.

 


27 Responses to “Sum of Values Between 2 Dates [Excel Formulas]”

  1. dexter says:

    I would apply a filter and use function subtotal, with option 9. This way you can see multiple views based on the filter.

  2. Michael Azer says:

    hey Chandoo, the solutions you proposed are very efficient, but if I wanted to be fancy I would do it this way .. the references are as your example workbook.
    =SUM(INDIRECT("C"&(MATCH(F5,B5:B95)+4)):INDIRECT("C"&(MATCH(F6,B5:B95)+4)))

  3. Luke M says:

    I like things simple:
    =SUMIF(B5:B95,">="&F5,C5:C95)-SUMIF(B5:B95,">"&F6,C5:C95)

  4. Matt S says:

    use something like: =SUM(OFFSET(B1,0,0,DATEDIF(A1,D1,"d")))
    and have D1 be the date that I want to sum to.

  5. Tom J says:

    In Excel 2003 (and earlier) I'd use an array formula to calculate either with nested if statements (as shown here) or with AND.

    {=SUM(IF(B5:B95>F5,IF(B5:B95<F6,C5:C95,0),0))}

    Note that I truly made this for BETWEEN the dates, not including the dates

  6. Andrew says:

    I turned the data set into a table named Dailies.
    I named the two limits StartDate and EndDate.

    And used an array formula:

    {=SUM((Dailies[Date]>=StartDate)*(Dailies[Date]<=EndDate)*Dailies[Sales])}

  7. Frank Linssen says:

    If I would still be using the old Excel I would do it as follows:

    SUMIF($B$5:$B$95,"<="&H6,$C$5:$C$95)-SUMIF($B$5:$B$95,"<"&H5,$C$5:$C$95)

    Works as simple as it is.

    Regards

  8. ikkeman says:

    =sum(index(c:c,match(startdate,c:c,1)+1):index(c:c,match(enddate,c:c,1))

  9. ikkeman says:

    =sum(index(c:c,match(startdate,b:b,1)+1):index(c:c,match(enddate,b:b,1))

  10. ram says:

    Great examples and thanks to Chandoo. You have simplified my work.

  11. Rony says:

    Hi! great tips I have found in your page, have you seen this
    http://runakay.blogspot.com/2011/10/searching-in-multiple-excel-tabs.html

  12. [...] I'm not sure I understand your question fully, but have a look at this: Sum of Values Between 2 Dates [Excel Formulas] | Chandoo.org - Learn Microsoft Excel Online [...]

  13. Amanda says:

    Thank you! Thank you! Thank you!

  14. abdalurhman says:

    =SUMIF(A2:A11;">="&B13;B2:B11)-SUMIF(A2:A11;"<"&A11;B2:B11)

  15. Eliza says:

    awesome... thank yoo Chandoo!

  16. dockhem says:

    which is most efficient and fast, if all are efficient ?

  17. jmassiah says:

    Thank you for this formula, I've just spent ages trying to find something to work on my data, I knew it would be possible! Don't care if others think there are easier/other ways to do it, you explained it so I understood it and could apply it to what I was doing so I'm happy!

  18. Nagaraju says:

    The above said example is awesome for calculating values between dates,

    can you pls let know how to calculate sale values if we have 10 sales boys for
    ex: 1,rama
    2,krishna
    3,ashwin
    4,naga
    5,suresh

    how much rama sale value between 1/jan/2015 to 10/jun/15
    how much krishna sale value between 10/jan/2015 to 15/july/2015
    i think you understood can you pls let me know the formula for how to calculate the sale between diffrent sale man sale value from master data file

    Thanks,
    Nagaraju

  19. Viv says:

    Hi

    I have a list of people's names in column A, I have a list of dates in column B which records the dates they have been off sick, in column C I have either 1 if it is a full sick day or 0.5 if it is a half day.

    What I would like to do is to add up the number of dates a specific person has been off within two dates.

    For example, I want to look at my list of names and to find Joe Bloggs (column A), then add up all his sick days (column C). The start date will be in cell E1 and the end date will be in F1.

    If this possible using SUMIFS?

    List of names are in range A2:A100

    List of dates in B2:B100

    List of sick days (either 0.5 or 1 in C2:C100

    The start date is in cell E2

    The end date is in cell F2

    Your help would be greatly appreciated.

    • Loknathan says:

      Yes, with the help of SUMIFS you can have the solution.
      Note: you need have an extra col. D2 where you will input Name of the person.
      =SUMIFS(C2:C100,A2:A100,D2,C2:C100,">="&E2,C2:C100,"<"&F2)

      Col. A Col. B Col. C Col.D Col. E Col. F
      Name Date Sales
      ABC 28-Jun-11 1 MNO 28-Jun-11 25-Sep-11
      XYZ 29-Jun-11 0.5
      MNO 30-Jun-11 1
      PQR 1-Jul-11 1

      • Loknathan says:

        Typo ERROR / Correction in formula:
        Yes, with the help of SUMIFS you can have the solution.
        Note: you need have an extra col. D2 where you will input Name of the person.
        =SUMIFS(C2:C100,A2:A100,D2,B2:B100,">="&E2,B2:B100,"<"&F2)

  21. AC says:

    Thanks for this - it solved the problem that I was having. However can someone please explain to me why the "" needs to be around >= and <= as well as why we need to add & in order for the formula to work? Thanks in advance!

  22. Ufoo says:

    This formula works perfectly as well. Any ideas?: =SUM(INDEX(C5:C95,MATCH(H5,B5:B95,1)):INDEX(C5:C95,MATCH(H6,B5:B95,1)))

  23. Ufoo says:

    ikkeman had posted the same thing.

  24. murray says:

    I am trying to sum total a range of cells between date ranges ie column n has $ amounts column d has the transaction dates ie 1/3/2015 or 25/3/2015 or 25/4/2015 column b has the text saying drp or distribution - reinv

    In another cell I am trying to sum or total (in column n) with the value of a range of different dates (column d) that contain different text (column b) ie cell n48 is 50, n65 is 85, n165 is 36

    with the dates ie cell d48 is 1/3/2015, d65 is 25/3/2015 and d165 is 25/4/2015

    with different text that says drp or distribution - reinv ie cell b48 is drp, b65 is distribution - reinv, b165 is drp

    If I wanted to sum the amounts between 1/3/2015 to 31/3/2015 with drp then the total would be 50. Also if I wanted to sum the amounts between 1/4/2015 to 30/4/2015 with drp the sum total would be 36 If I wanted to sum the amounts between 1/3/2015 to 31/3/2015 with drp and distribution - reinv the sum would be 115

    What would the formula be for these different questions

    hope you can help, it has been driving me nuts and cant work it out
