Extracting Unique, Duplicate and Missing Items using Formulas [spreadcheats]

Share

Facebook
Twitter
LinkedIn

Often I wish Microsoft had spent the effort and time on a data genie (and a set of powerful formulas) that can automate common data cleanup tasks like extracting duplicates, makings lists unique, find missing items, remove spaces etc. Alas, instead they have provided features like clippy which are intrusive to say the least.

So as part of our second installment of spreadcheats we will learn how to tackle few of the most common data processing tasks:

Getting Unique Items from a List of Cells

There are 3 simple ways to do this:

  1. Using Advanced Data Filter
  2. Using countif() and auto filter
  3. Using formulas as described here

Assuming you have data as shown in the picture aside (and wishing you will have customers like those):

  • First add a column to the left of the list. Here we will use formulas to fill numbers based on the uniqueness of the cell next to it.
  • Essentially our formula should generate numbers in increasing order as long as the corresponding item is unique and not increase the number otherwise.
  • So the formula for order column can be like this: =IF(COUNTIF(list-upto-that-point, current element)=1,previous-order+1, previous-order)
    See the example below:

    remember, the first cell order is 1.
  • See how we are using both absolute and relative references to fetch the counts.
  • Now add another column to the right of the list, here we will fetch unique items.
  • We will use vlookup() to fetch each of the 12 unique items. The formula goes like this:
    =VLOOKUP(running number,$B$4:$C$22,2,FALSE)
    You can wrap the vlookup() with if() formula to avoid seeing #value errors.

That is all. Using this method you can extract unique items froma list.

Eliminating Doubles from a List

There are 2 ways in which you can find and remove duplicates(doubles) in excel lists with ease:

  1. Using countif() and then auto-filter
  2. Using formulas

The process for finding duplicates using formulas is same as that of finding unique items.

Instead of writing COUNTIF(list-upto-that-point, current element)=1, we now write COUNTIF(list-upto-that-point, current element)=2. Also the first element’s count should be changed to zero.

Once done the list should look like what you see on the side.

Finding Missing Items by comparing one list with another:

Even though this might seem like a different challenge, it is infact same as the above techniques. You need to use countif() to compare first list’s elements with second list. How? that is your home work.

Download and see these formulas in action:

Still having some doubts? Download the excel tutorial – unique & duplicate items and learn by poking around.

Facebook
Twitter
LinkedIn

Share this tip with your colleagues

Excel and Power BI tips - Chandoo.org Newsletter

Get FREE Excel + Power BI Tips

Simple, fun and useful emails, once per week.

Learn & be awesome.

Welcome to Chandoo.org

Thank you so much for visiting. My aim is to make you awesome in Excel & Power BI. I do this by sharing videos, tips, examples and downloads on this website. There are more than 1,000 pages with all things Excel, Power BI, Dashboards & VBA here. Go ahead and spend few minutes to be AWESOME.

Read my storyFREE Excel tips book

Overall I learned a lot and I thought you did a great job of explaining how to do things. This will definitely elevate my reporting in the future.
Rebekah S
Reporting Analyst
Excel formula list - 100+ examples and howto guide for you

From simple to complex, there is a formula for every occasion. Check out the list now.

Calendars, invoices, trackers and much more. All free, fun and fantastic.

Advanced Pivot Table tricks

Power Query, Data model, DAX, Filters, Slicers, Conditional formats and beautiful charts. It's all here.

Still on fence about Power BI? In this getting started guide, learn what is Power BI, how to get it and how to create your first report from scratch.

14 Responses to “How many ‘Friday the 13th’s are in this year? [Formula fun + challenge]”

  1. in C3=2016
    in C4=3
    in C5=1 (the first next year with three Friday the 13ths)

    =SMALL(IF(MMULT(--(MOD(DATE(C3+ROW(1:1000),COLUMN(A:L),13),7)=6),ROW(1:12)^0)=C4,C3+ROW(1:1000)),C5)

    formula check in the next 1000 years

  2. Brian says:

    This will generate a table of counts of Friday the 13th's by year. If I didn't screw it up the next year with three is 2026.

    I created a simple parameter table with a start date and end date that I wanted to evaluate. That calculates the number of days and generates a list of those days. Then filter and group. The generation of the list in power query (i.e. without populating a date table in excel) is pretty cool, otherwise this isn't really doing anything than creating a big date and filtering/counting.

    let
    Source = List.Dates(StartDateAsDate, Days2, #duration(1,0,0,0)),
    ConvertDateListToTable = Table.FromList(Source, Splitter.SplitByNothing(), null, null, ExtraValues.Error),
    AddDayOfMonthColumn = Table.AddColumn(ConvertDateListToTable, "DayOfMonth", each Date.Day([Column1])),
    AddYearColumn = Table.AddColumn(AddDayOfMonthColumn, "Year", each Date.Year([Column1])),
    AddDayOfWeekColumn = Table.AddColumn(AddYearColumn, "Day of Week", each Date.DayOfWeek([Column1])),
    FilterFriday13 = Table.SelectRows(AddDayOfWeekColumn, each ([DayOfMonth] = 13) and ([Day of Week] = 5)),
    Friday13thsByYear = Table.Group(FilterFriday13, {"Year"}, {{"Number of Friday the 13ths!", each Table.RowCount(_), type number}})
    in
    Friday13thsByYear

    • Brian says:

      With the parameters replaced by values should you want to play along at home. This runs for 20 years starting on 1/1/2016.

      let
      Source = List.Dates(#date(2016,1,1), 7300, #duration(1,0,0,0)),
      ConvertDateListToTable = Table.FromList(Source, Splitter.SplitByNothing(), null, null, ExtraValues.Error),
      AddDayOfMonthColumn = Table.AddColumn(ConvertDateListToTable, "DayOfMonth", each Date.Day([Column1])),
      AddYearColumn = Table.AddColumn(AddDayOfMonthColumn, "Year", each Date.Year([Column1])),
      AddDayOfWeekColumn = Table.AddColumn(AddYearColumn, "Day of Week", each Date.DayOfWeek([Column1])),
      FilterFriday13 = Table.SelectRows(AddDayOfWeekColumn, each ([DayOfMonth] = 13) and ([Day of Week] = 5)),
      Friday13thsByYear = Table.Group(FilterFriday13, {"Year"}, {{"Number of Friday the 13ths!", each Table.RowCount(_), type number}})
      in
      Friday13thsByYear

  3. Alex Groberman says:

    =MATCH(3,MMULT(N(WEEKDAY(DATE(C3+ROW(1:100)-1,COLUMN(A:L),13))=6),1^ROW(1:12)),)+C3-1

    • David N says:

      It should be pointed out that Alex's solution, unlike some others, has the additional advantage of being non-array. My solution was nearly identical but with -- and SIGN instead of N and 1^.

      =C3-1+MATCH(3,MMULT(--(WEEKDAY(DATE(C3-1+ROW(1:25),COLUMN(A:L),13))=6),SIGN(ROW(1:12))),0)

  4. SunnyKow says:

    Sub Friday13()

    Dim StartDate As Date
    Dim EndDate As Date
    Dim x As Long
    Dim r As Long

    Range("C7:C12").ClearContents
    StartDate = CDate("01/01/" & Range("C3"))
    EndDate = CDate("31/12/" & Range("C3"))
    r = 7
    For x = StartDate To EndDate
    If Day(x) = 13 And Weekday(x, vbMonday) = 5 Then
    Cells(r, 3) = Month(x)
    r = r + 1
    End If
    Next
    End Sub

    • SunnyKow says:

      Calculate next year with 3 Friday 13th. Good for 100 years different from year entered in cell C3

      Sub ThreeFriday13()

      Dim StartDate As Date
      Dim EndDate As Date
      Dim x As Long
      Dim WhatYear As Integer
      Dim Counter As Integer

      Range("E7").ClearContents
      StartDate = CDate("01/01/" & Range("C3") + 1)
      EndDate = CDate("31/12/" & Range("C3") + 100)
      Counter = 0

      For x = StartDate To EndDate
      If WhatYear Year(x) Then
      WhatYear = Year(x)
      'Different year so reset counter
      Counter = 0
      End If
      If Day(x) = 13 And Weekday(x, vbMonday) = 5 Then
      Counter = Counter + 1
      If Counter = 3 Then
      WhatYear = Year(x)
      Exit For
      End If
      End If
      Next
      Range("E7") = WhatYear

      End Sub

      • SunnyKow says:

        *RE-POST as not equal did not show earliuer
        Calculate next year with 3 Friday 13th. Good for 100 years different from year entered in cell C3

        Sub ThreeFriday13()

        Dim StartDate As Date
        Dim EndDate As Date
        Dim x As Long
        Dim WhatYear As Integer
        Dim Counter As Integer

        Range("E7").ClearContents
        StartDate = CDate("01/01/" & Range("C3") + 1)
        EndDate = CDate("31/12/" & Range("C3") + 100)
        Counter = 0

        For x = StartDate To EndDate
        If WhatYear NE Year(x) Then
        WhatYear = Year(x)
        'Different year so reset counter
        Counter = 0
        End If
        If Day(x) = 13 And Weekday(x, vbMonday) = 5 Then
        Counter = Counter + 1
        If Counter = 3 Then
        WhatYear = Year(x)
        Exit For
        End If
        End If
        Next
        Range("E7") = WhatYear

        End Sub

  5. Devesh says:

    I've a doubt with using array formula here.
    In sample workbook, I tried to replicate the formula again.
    =IFERROR(SMALL(IF(WEEKDAY(DATE($C$3,ROW($A$1:$A$12),13))=6,ROW($A$1:$A$12)),$B7),"")
    For this I selected C7 to C12, and typed the same formula and pressed ctrl+alt+Enter. But in all cells it is taking $B7 (and not $B7, $B8, $B9.... etc)
    and since it is array formula I can't edit individual cell.
    Please guide.
    Thanks

  6. Pablo says:

    Hi Chandoo,
    Cool stuff. You need to clarify that the answer of 5 represents the 1st month in the year that has a Friday the 13th, and not the number of Fridays the 13th in the year. Subtle, but important difference.
    Thanks,
    Pablo

  7. Micah Dail says:

    I like the MMULT() function far more, but here's how I would have tackled it. It uses an EDATE() base and MODE() over 100 years. I'm assuming that 100 years is enough time to catch the next year with 3 friday 13th's. Array entered, of course.

    {=MODE(IFERROR(YEAR(IF((WEEKDAY(EDATE(DATE(C3, 1, 13), ROW(INDIRECT("1:1200"))))=6), EDATE(DATE(C3, 1, 13), ROW(INDIRECT("1:1200"))), "")), ""))}

  8. Jason Morin says:

    Finding all the Friday the 13ths in a Year:

    =SUMPRODUCT((DAY(ROW(INDIRECT(DATE(C3,1,1)&":"&DATE(C3,12,31))))=13)*(TEXT(ROW(INDIRECT(DATE(C3,1,1)&":"&DATE(C3,12,31))),"ddd")="Fri"))

  9. jmdias says:

    {=sum(if(day.of.week(DATe($YEAR;{1;2;3;4;5;6;7;8;9;10;11;12};13);1)=6;1;0))}
    just list the years

Leave a Reply