In this tutorial learn how to,
- Extract tabular data from one PDF to Excel
- Combine and extract tables from multiple PDFS to Excel
How to extract PDF table to Excel
Optional: If you need a sample PDF to practice these concepts, use the randomly made credit card statements I created. Download them from here.
Step 1: Go to Data ribbon & click on Get Data > File > PDF
From data ribbon, use the PDF option and point to the location on your computer (or web address).

Step 2: Select the table(s) you want in the navigator screen
Power Query will open up a navaigator screen. Just specify the table(s) you want. Refer to below illustration to know more about the navigator screen.

💡 Bonus tip: Use the composite table if you want to get a data table in your PDF that spans multiple pages. This is excellent for bank or credit card statements.
Step 3: Load or Transform data
If the preview in navigator looks satisfactory, just load it. Otherwise, click on “Transform data” to open query editor to make any final adjustments.
Combine & Extract data from multiple PDFs
Step 0: Place all your PDFs in a folder
Step 1: Folder connection
Instead of PDF option, use the Folder option in the Get Data.

Step 2: Choose “Combine” in file listing screen
Power Query will show you a screen with a list of all files it found in the folder. Choose any of the combine options here to combine the data from all files to one table.

Step 3: Select the table you want from Transfer Sample Screen
Now, you will see another navigator like screen. Just select the table you want in here. Power Query will go to each file in the folder, get the same table and combine them.
Step 4: Load or Edit the query
And enjoy.
Practice PDF Credit Card Statements
If you need a sample PDF to practice these concepts, use the randomly made credit card statements I created. Download them from here.
Video - Convert PDF to Excel
Still not sure how to extract data tables from PDF to Excel? Watch this short video and get it. See it below or on my YouTube channel.
PDF to Excel - FAQs
I don’t have PDF option in my Excel. What do I do?
You can use free Power BI Desktop to do the same. (Download Power BI for free here)
Once you have Power BI, open it, go to Get Data > PDF and follow the same steps as above tutorial.
Instead of loading the data, copy the entire table from Query Editor and paste it to Excel. See below illustration.

I have new files, how do I refresh?
Just place the files in the same folder.
Go to Excel and right click on the extracted table and select “Refresh”. Excel will update the details.
I want to exclude certain files in the folder when combining…
Open the query editor and go to the query that is responsible for your combining PDF process. Go to source step. This will show all the files in the folder.
Include a filter condition here. Power Query will warn about inserting a step. Proceed and you will be able to exclude files based on conditions.
Examples:
- Process files that have file name starting with certain letters
- Files created after certain date
- Having specific extension.
Remember: Power Query is case sensitive.
I want to pre-process or clean-up data before loading it into Excel
Open the query editor and add any necessary data transformation steps at the end.
Examples:
- Removing all foreign currency transactions from credit card statements
- Cleaning up account codes
- Rearranging columns in the PDF data table
For more on what you can do with Power Query, check out this tutorial.
Other questions…
Post a comment and I will try to help you.















21 Responses to “Distinct count in Excel pivot tables”
The distinct count option works well but I have found that if I have a date field and want to group by year, month, etc. that option seems to be disabled. I need to do both, distinct count and group by year/month.
Example data; sales orders with item quantities with dates.
Challenge; sum the item quantities, count the distinct orders and group by month. How do I do this?
Perhaps that's not possible due to the grouping?
@Al... When you use data model based pivots, you cannot group values manually anymore. Why not use Excel 2016's default date grouping option? In this case we have just a few dates, so Excel is not grouping them, but if you have an year's worth of data, when you make the pivot with date in the row label area, Excel automatically groups them. If you have fewer dates or want to use your own grouping, just create a table with all dates, add columns with month, week, year etc. Then connect this table (these types of tables are usually called as calendar tables) to your data on date field as a relationship. Now you can create reports by month, quarter etc easily.
Is this the only way to do it in 2013? I find it rather cumbersome to have to create another data table listing dates with the another column for MONTH() and YEAR() to be able to summarise data for senior level...
I know people find adding calendar tables cumbersome, but it is a best practice and let's you add more layers of analysis quite easily. For example, adding analysis by weekday vs. weekend or by financial quarter or YTD calculations (you would need either Power Pivot DAX or some very carefully setup pivot table value field settings)
I had absolutely no idea this was possible. Very useful, nice work!
Doesn't work for 2010 version though (or at least not my works version)
Hi ,
The post has the following in it :
These instructions work only in Excel 2016, Office 365 and Excel 2013.
when i have 2 different Pivot tables, one without the enabled “Add this data to data model” option, and the other one with it enabled.. is there anyway i can link slicers between them?
if the answer is NO,, what to do ?
Quick note, the “Add this data to data model” option is not available for the Mac version.
perhaps outside scope of this article but I have found when I attempt to create a pivot table from an external data source (connection to a sql view) the "Add this data to data model" becomes greyed out. Anybody experienced and found a solution so I can start getting distinct count in my pivot tables?
Is there a way to still add a calculated field when using distinct count?
I found I can't change the date source after tick the " add this data to the data model", can you help to adv how to change the date source in such case?
Is there a way to update the source once you have added to the data model? I receive a new spreadsheet weekly and would like to update the connection so my tables pull from the new source.
Hi Crhis, I like how you have hulk (superhero) as your avatar. Do you know that there is a superhero in Excel too? It's Power Query. You can use it to solve your problem in a simple click. Here an intro if you need some guidance.
Powerful Introduction to Power Query
A big Thank you. It worked.
Hi, have survey data that I need to analyze but the challenge is that my key fields are showing horizontally. I tried to transpose the fields using Power Query, but unfortunately the new fields are returning same values on a pivot table despite using distinct values
How I can a do a pivot table with discount conts in some columns and then generate shor report filter pages. pls it drives crazy
Hi. Why grand total pivot of distinct count is 13? shouldn't it be 67?
Great Answer! Saved me lots of time!
Thank you!!!
Worked awesome! Thanks!!
Hi Chandoo,
I am using pivot tables for distinct count and now I need to update them with new set of data. But when I update the source data, all the columns and formatting of Pivot table disappears and I need to build it from Scratch.
Is there a possibility that I can update the source data with new rows added and also retain my pivot tables?