Compare 2 sets of data by letter or word & highlight mismatches [vba]

Share

Facebook
Twitter
LinkedIn

We analysts like to compare. If you ever want to keep an analyst busy, just give her 2-3 options. She won’t return to your desk until the cows come home. My wife uses this trick all the time. Picture this:

[In late 2013]
Me: I want to buy a new phone
She: Do you want Nexus 5 or Galaxy S5 or iPhone 5s?

Its late 2014 and I am not done comparing.

So today, let’s talk about an interesting comparison scenario.

Comparing by letter or word

Imagine you are looking at 2 lists like this and you want to know where items differ. Not which items, but where.

Compare 2 lists of data and highlight mismatched letters or words in Excel - How to?

That means, you want to know which letters or words in each line are different.

VBA to the rescue

Unfortunately, none of the standard features of Excel (formulas, conditional formatting, pivot tables etc.) can help us with this situation. But we don’t have to give up. We can use a simple VBA macro to instantly compare 2 lists and highlight mismatched letters or words.

[Related: How to compare 2 lists in Excel, a quick round up of techniques]

A quick demo of our comparison macro:

Compare two texts in Excel and highlight unmatched letters or words using VBA Macros - Demo

How does this macro work?

When you set out to create macros like this, the first step is to define basic algorithm (logical steps in plain English). To compare 2 sets of data, we need to do below:

  1. For each item in list 1
  2. Get corresponding item in list 2
  3. If they don’t match
  4. For word match
    1. For each word in first text
    2. Get corresponding word in second text
    3. Compare
    4. If not matched, highlight in red color
    5. Repeat for other words
  5. For letter match
    1. Find the first mismatched letter
    2. Highlight all the letters from that point in second text
  6. Repeat for next item in list 1
  7. Done

Once you write down this logic, we simply go ahead and implement it in VBA code.

The exact workings of the macro are somewhat complex. So I made a video explaining how the code works & what it can do. Please watch it below.

Video explaining the comparison macro

[see this video on our YouTube Channel]

Download Example Workbook

Click here to download the comparison macro workbook. Examine the code to understand how it is constructed. Feel free to extend it to suit your work needs.

Do you compare lists like this?

Every now and then, I end up having a situation where I need to compare by letter or word. I find VBA macro based solution to be perfect for this.

What about you? Do you compare lists? Where do you struggle with such comparisons? How would you use this macro? Please share your thoughts & tips in comments.

Become incomparable, learn VBA

While VBA is pretty powerful & awesome, not many venture beyond the basic recorded macros. You can transform the work, career & skills by learning VBA. It is not at all difficult and anyone can learn it. Start with below links.

 

Facebook
Twitter
LinkedIn

Share this tip with your colleagues

Excel and Power BI tips - Chandoo.org Newsletter

Get FREE Excel + Power BI Tips

Simple, fun and useful emails, once per week.

Learn & be awesome.

Welcome to Chandoo.org

Thank you so much for visiting. My aim is to make you awesome in Excel & Power BI. I do this by sharing videos, tips, examples and downloads on this website. There are more than 1,000 pages with all things Excel, Power BI, Dashboards & VBA here. Go ahead and spend few minutes to be AWESOME.

Read my storyFREE Excel tips book

Overall I learned a lot and I thought you did a great job of explaining how to do things. This will definitely elevate my reporting in the future.
Rebekah S
Reporting Analyst
Excel formula list - 100+ examples and howto guide for you

From simple to complex, there is a formula for every occasion. Check out the list now.

Calendars, invoices, trackers and much more. All free, fun and fantastic.

Advanced Pivot Table tricks

Power Query, Data model, DAX, Filters, Slicers, Conditional formats and beautiful charts. It's all here.

Still on fence about Power BI? In this getting started guide, learn what is Power BI, how to get it and how to create your first report from scratch.

12 Responses to “Analyzing Search Keywords using Excel : Array Formulas in Real Life”

  1. Very interesting Chandoo, as always. Personally I find endless uses for formulae such as {=sum(if(B$2:B$5=$A2,$C$2$C$5))}, just the flexibility in absolute and relative relative referencing and multiple conditions gives it the edge over dsum and others methods.

    I've added to my blog a piece on SQL in VBA that I think might be of interest to you http://aviatormonkey.wordpress.com/2009/02/10/lesson-one-sql-in-vba/ . It's a bit techie, but I think you might like it.

    Keep up the good work, aviatormonkey

  2. Andy Pope says:

    Hi Chandoo,

    You might find this coded solution I posted on a forum interesting.

    http://www.excelforum.com/excel-programming/680810-create-tag-cloud-in-vba-possible.html

  3. [...] under certain circumstances.  One of the tips involved arranging search keywords in excel using Array Forumlas.  Basically, if you need to know how frequent a word or group of keywords appear, you can use this [...]

  4. Chandoo says:

    @Aviatormonkey: Thanks for sharing the url. I found it a bit technical.. but very interesting.

    @Andy: Looks like Jarad, the person who emailed me this problem has posted the same in excelforum too. Very good solution btw...

  5. bob says:

    Realy great article

    "You can take this basic model and extend it to include parameters like number of searches each key phrase has, how long the users stay on the site etc. to enhance the way tag cloud is generated and colored."

    How would you go about doing this? I think it would need some VB

  6. Thiago says:

    Hi,
    I found the usage very interesting, but is giving me hard time because the LENs formula that use ranges are not considering the full range, in other words, the LEN formula is only bringing results from the respective "line" cell.

    Using the example, when I place the formula to calculate the frequency for "windows" brings me only 1 result, not 11 as displayed in the example. It seems that the LEN formula using ranges is considering the respective line within the range, not the full range.

    Any hint?

  7. Hui... says:

    @Thiago
    You have to enter the formula as an Array Formula
    Enter the Formula and press Ctrl+Shift+Enter
    Not just Enter

  8. Gary says:

    is there a limit to the number of lines it can analyse.
    Ie i am trying to get this to work on a list of sentances 1500 long. 

    • Hui... says:

      @Gary
      In Excel 2010/2013 Excel is only limited by available memory,
      So just give it a go
      As always try on a copy of the file first if you have any doubts

  9. Sumit says:

    Apologies if I am missing something, but coudn't getting frequency be easier with Countif formula. Something like this - COUNTIF(Range with text,"*"&_cell with keyword_&"*")

  10. Ray says:

    Apologies if I missed, but what is the Array Formula to:

    1. Analyze a list of URL's or a list of word phrases to understand frequency;
    2. List in a nearby column from most used words to least used words;
    3. Next to the list of words the count of occurrences.

Leave a Reply