How to remove duplicates from 10 million phone # records

slangsomecash

New Member
Hello,

I simply cannot, for the life of me, figure this out.

Could anyone point me in the right direction?

All of the data is in a single column. The data only contains phone numbers.
 
That makes sense.

Now, how do I do that for 10 million records spread out over 10 separate sheets? Obviously, "Remove Duplicates" is insufficient for this project, since it does not dedupe across separate sheets (Excel can hold only 1,048,576 rows per sheet).
 
If you have access to Power Query, load all the data into it, append all the tables into one, and then remove duplicates on the combined table.

Then count the rows of the resulting table. If there are more than 1 million, split it into tables of 1 million rows each (using an Index column and Table.SelectRows) and return them to the worksheet.
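
For anyone who wants to see the logic outside Excel, here is a rough Python sketch of the same three steps: append, remove duplicates, split. It is not Power Query code, only an illustration. The function names, the sample phone numbers, and the tiny chunk size of 3 are all made up for the example; on the real data the chunk size would be 1,000,000 so that each piece fits on a sheet.

```python
def dedupe_across_sheets(sheets):
    """Append all sheets into one list and drop duplicates, keeping first occurrences."""
    seen = set()
    unique = []
    for sheet in sheets:
        for number in sheet:
            if number not in seen:
                seen.add(number)
                unique.append(number)
    return unique


def split_into_chunks(rows, chunk_size=1_000_000):
    """Split rows into consecutive chunks of at most chunk_size rows."""
    return [rows[i:i + chunk_size] for i in range(0, len(rows), chunk_size)]


# Hypothetical sample data standing in for the 10 sheets of phone numbers.
sheets = [
    ["555-0100", "555-0101", "555-0100"],
    ["555-0101", "555-0102"],
    ["555-0103", "555-0100"],
]

unique = dedupe_across_sheets(sheets)
chunks = split_into_chunks(unique, chunk_size=3)  # small size only for the demo
print(len(unique), [len(chunk) for chunk in chunks])
```

The set does the cross-sheet duplicate check, which is the part the built-in "Remove Duplicates" cannot do when the data lives on separate sheets. The chunking step plays the role of the Index column plus row filter described above.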
 