How to Import PDF Files into Excel with Power Query

Sign up for our Excel webinar, times added weekly: www.excelcampus.com/blueprint...
This video teaches you how to import tables from a PDF file into Excel with Power Query. Table data that spans multiple pages in the PDF can be appended together into one cohesive Excel Table.
If you’d like to view the accompanying blog post on my website, you can access it here: www.excelcampus.com/powerquer...
Upload your own example PDF file for a future video tutorial:
www.excelcampus.com/power-que...
From PDF is a new feature in Power Query that is currently available on the Beta Channel for Microsoft/Office 365 subscribers. It will be rolling out to other channels in the coming months.
Importing PDF files into Excel has always been a challenge. This new feature of Power Query detects structured data tables within pages, making it easy to clean up and prepare the data in the Power Query Editor.
We can also append (combine or stack) multiple tables from multiple pages in the PDF file. Power Query allows us to automate this entire process, making it fast and easy to import multiple PDF files into Excel.
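To make the idea of appending concrete, here is a minimal Python sketch of stacking tables by column name. It is only a conceptual illustration, not Power Query's own M code or its implementation; the function name `append_tables` and the sample data are hypothetical. It assumes each page's table has already been read into a list of row dictionaries keyed by header.

```python
def append_tables(tables):
    """Stack tables (lists of row dicts) vertically, matching columns by header name."""
    # Collect column names in the order they are first seen.
    columns = []
    for table in tables:
        for row in table:
            for name in row:
                if name not in columns:
                    columns.append(name)
    # Build the combined table; a column missing from a row is filled with None.
    combined = []
    for table in tables:
        for row in table:
            combined.append({name: row.get(name) for name in columns})
    return columns, combined


# Hypothetical tables taken from two pages of the same PDF report.
page1 = [{"Date": "2020-01-01", "Amount": 10.0},
         {"Date": "2020-01-02", "Amount": 12.5}]
page2 = [{"Date": "2020-01-03", "Amount": 7.25}]

columns, rows = append_tables([page1, page2])
print(columns)
print(len(rows))
```

Because the rows are matched by header name, tables whose headers differ (or are missing) end up in separate columns instead of stacking, so it helps to make the column names consistent before appending.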
Related Videos:
How to Combine Excel Tables or Worksheets with Power Query:
• How To Combine Excel T...
Power Query Overview - Automate Data Tasks in Excel & Power BI:
• How To Automate Data T...
How to Install Power Query in Excel 2010 or 2013 for Windows:
• How To Install Power Q...
#MsExcel #ExcelCampus
00:00 Introduction
01:11 Import from PDF
02:48 Append Data

Comments: 165

  • @nonoobott8602
    @nonoobott8602 4 years ago

    You're an excellent teacher and instructor. I can hardly wait for this functionality on my power query. Thanks for sharing

  • @amitdagli3361
    @amitdagli3361 1 year ago

    You are the first person to show how to append queries in Power Query.

  • @wayneedmondson1065
    @wayneedmondson1065 4 years ago

    Hi Jon.. thanks for the preview into this new feature. Can't wait for it to be released to the general public. Thumbs up!!

  • @80andromeda08
    @80andromeda08 4 years ago

    You really are one of the best instructors I have ever known. Thank you for your amazing efforts. I hope this function is released very soon.

  • @michaeljones2843
    @michaeljones2843 4 years ago

    This is a game changer for me and I'd like to see more examples if possible. Thanks again!

  • @tomwelt1825
    @tomwelt1825 4 years ago

    Thanks, Jon, super content. I can't wait to try it when the From PDF feature is made available. Then I'd really be interested in your planned video on refreshing the data.

  • @ebeaucourt
    @ebeaucourt 2 years ago

    Would it be possible to make a video tutorial on importing multiple PDFs (all with the same structure) that each contain multiple tables? Thanks for your help and excellent tutorials.

  • @gladiadordoexcel2358
    @gladiadordoexcel2358 4 years ago

    Your videos are always very useful, with clear explanations. Congratulations from Guinea-Bissau (Africa).

  • @johnmonroe6206
    @johnmonroe6206 4 years ago

    Excellent video, Jon, thanks for sharing. Can't wait for this functionality!

  • @ExcelCampus

    @ExcelCampus

    4 years ago

    Thanks so much John. I appreciate your support! Looking forward to everyone getting their hands on it too. 👍

  • @lucasamorim5506
    @lucasamorim5506 3 years ago

    so thoroughly explained! thank you so much!

  • @markbui8093
    @markbui8093 3 years ago

    I truly don't understand the thumb-down votes for these videos. I suspect it's coming from competing KZread sites. Thank you Jon for taking the time to put together these tutorial videos for us. Please keep up the great work!!!

  • @ExcelCampus

    @ExcelCampus

    3 years ago

    Thanks Mark! 😃

  • @Amy732173
    @Amy732173 2 years ago

    This is a great video. I love it. Thank you Jon.

  • @jongskicastro3119
    @jongskicastro3119 4 years ago

    Thank you brother, I learned so much from you!!!

  • @axcolo
    @axcolo 4 years ago

    Excellent! Thanks Jon!

  • @dhunpagla3871
    @dhunpagla3871 4 years ago

    Excellent Mr. Jon ....Thanks

  • @marlonbarrion4040
    @marlonbarrion4040 4 years ago

    Thank you Jon! Does it work if, let's say, the table in the PDF is in picture format? Sometimes we can copy data from a PDF and paste it into Excel, but when the table is a picture we cannot copy and paste it, so I think the same applies if we use Power Query.

  • @kicksandblowingguns7555
    @kicksandblowingguns7555 4 years ago

    Dear Jon, good day. I'm a new subscriber. Your way of explaining is simple and extraordinary. Thanks for spreading your knowledge across the world. Looking forward to seeing more of your tutorials.

  • @darrylmorgan
    @darrylmorgan 4 years ago

    Great tutorial, looking forward to trying PDF imports when it becomes available... Thank you Jon :)

  • @ExcelCampus

    @ExcelCampus

    4 years ago

    Hope you enjoy it! 😊

  • @JagadeeshBharadwaj-1954
    @JagadeeshBharadwaj-1954 1 year ago

    Excellent explanation. Thanks a lot, Sir. 🙏

  • @aemero7477
    @aemero7477 2 years ago

    well demonstrated! Thank you!

  • @the_starlinkway7821
    @the_starlinkway7821 2 years ago

    @Excel Campus - Jon great video. Question: in my case, only my first table has headers and the remaining 21 tables don't. Following your steps, the output table does not append the tables correctly. For example, table 1 has 10 columns (with headers); table 2 is then appended at the bottom of the table but starting at column 11, then table 3 starts at column 21, and so on. How do you tell Excel to stack them vertically and recognize that the columns in each table are in the same positions?

  • @RonLeedy
    @RonLeedy 4 years ago

    Thanks for the great video. Already thinking of reports to pull in once it's public.

  • @suki9860
    @suki9860 2 years ago

    Thanks, Jon. Very clear tutorial. Quick question: any suggestions for the case where Column 2 data from the first table shows up in Column 1 in the second table because Column 1 is blank in table 2? In fact, all the table 2 columns get shifted because of two blank columns in table 2, i.e. Col 1 -> Blank, Col 2 -> Col 1, Col 3 -> Col 2, Col 4 -> Blank, and Col 5 -> Col 3. Thanks, Jon.

  • @sankhabiswas6356
    @sankhabiswas6356 4 years ago

    Is there any way to connect two or more data models from external files (including all relationship and measures ) in a new excel file?

  • @willchung7811
    @willchung7811 1 year ago

    Append as new! That's what I needed. Thank you.

  • @lwinnekins4303
    @lwinnekins4303 1 year ago

    Thanks, Jon, for this tutorial as it was a great help in understanding how to download PDF files using power query.

  • @ExcelCampus

    @ExcelCampus

    1 year ago

    Thanks for your feedback, L! :)

  • @muminjonurmanov9911
    @muminjonurmanov9911 4 years ago

    Hello Jon. Thanks for the tutorial. Which software do you use to zoom the window during your tutorials? I want to make tutorials on some other topic but also need that software. Thanks

  • @Spreigs
    @Spreigs 1 year ago

    I have a particularly difficult set of data. When Power Query imports it, it's not only splitting the data in the cells from the PDF table across columns but also across rows. In some cases the cell data is split both ways: some of the column to the right is on the left and split across the row below. I've tried a number of the MVPs' videos and it's better, but I'm still needing to go line by line. Thoughts?

  • @heyalbee
    @heyalbee 3 years ago

    I have a pdf with the same table formatting for numbers and text across 50 pages. I cannot get it to import. Excel keeps trying to connect to the file but never does. Are there restrictions to file size or complexity? Thank you.

  • @MrBileDuct
    @MrBileDuct 1 year ago

    You are awesome - Thank you!!!!

  • @kenmcmillan2637
    @kenmcmillan2637 4 years ago

    Do you know if the PDF feature will ever be available for the standalone version of Office?

  • @Debaurin
    @Debaurin 10 months ago

    Thanks for the explanation. I have a question: if I have several files using the same data template but with different entries, is there a way to import all the files into the same table?

  • @jaimealmenaravino9173
    @jaimealmenaravino9173 1 month ago

    Many thanks for the info. Good job

  • @chanelgabb2040
    @chanelgabb2040 9 months ago

    Your Videos are fantastic! Great new feature too!

  • @ExcelCampus

    @ExcelCampus

    9 months ago

    Glad you like them @chanelgabb2040! 😀

  • @sheetalbarapatre6808
    @sheetalbarapatre6808 4 years ago

    Hi Jon, thank you for making such informative videos on Excel. I know that my question is not relevant to the above, but I am not able to see the formatting options button beside the cell. Ref. video from Dec 14 2018. Can you please help? The version of Excel I am using is Microsoft Excel for Mac Version 16.37. Waiting for your reply. Thank you so much.

  • @heuteld
    @heuteld 4 years ago

    Thanks Jon...excellent video. Just curious - what tool are you using to draw the outline around a mouse selection in your video. Great presentation tool.

  • @jarisiri9766
    @jarisiri9766 2 years ago

    Thank you very much! This is the only video I've seen so far that shows the Append step. My Excel doesn't give a composite table option for all the tables together. This helps a lot.

  • @esthersuh3388
    @esthersuh3388 5 months ago

    I love this content because no other KZreadr demonstrates APPEND QUERY like this. Thank you trillions!

  • @ExcelCampus

    @ExcelCampus

    5 months ago

    Thank you for your feedback! 😀

  • @emmab7658
    @emmab7658 2 years ago

    I need to try this!! 🙂

  • @ibnhamid5838
    @ibnhamid5838 2 years ago

    Hi sir, thanks for the knowledge. Really appreciate it! Are you familiar with stock broker statements in PDF? Would you mind doing a tutorial on how to get them into a table and find P/L and the risk/potential ratio? Really appreciate it, sir. Thanks again.

  • @Soori8
    @Soori8 1 year ago

    Thank you Excellent

  • @user-vt5ln7qq4j
    @user-vt5ln7qq4j 4 years ago

    Fantastic!

  • @jacquesdoyon1043
    @jacquesdoyon1043 3 years ago

    I have a file that contains many tables (over 40+ pages) but each line may contain a variable number of entries, 2 to 4. Depending of the data present in the pdf file, some tables may show data in two columns but they may be column 1 and 3; others will show data in columns 1 and 4, and so on. All together, the data is a sparse matrix or hollow matrix. Is there a way to force Power Query to consider all tables the same way from the start, i.e. from where the data is showing up in the page?

  • @karenallen1193
    @karenallen1193 4 years ago

    Thanks John - cool feature, thanks for highlighting it and running through how to use it.

  • @mohamedadjal8502
    @mohamedadjal8502 3 years ago

    Hi Professor, you have put a lot of effort into these videos, thank you. I have a question: if we have, for example, the number 10.00 m in cell "a1", how can we get this number with the same format in cell "b1" using a text function or some other function? Thank you very much. 😃👍

  • @nagendravishwamitra3652
    @nagendravishwamitra3652 4 years ago

    I have Excel 365 but this feature is not displayed. How do I get the option to import PDFs?

  • @walidahmed209
    @walidahmed209 4 years ago

    Hello guys, what would be the reason for this error message when inserting a pivot table from a connection-only query? (DataFormat.Error) We couldn’t convert to Number

  • @deepikaanand8241
    @deepikaanand8241 2 years ago

    Hi Jon, Is it possible to fetch only limited fields from a PDF file like Vendor name, amount and address?

  • @jamesryandraper5476
    @jamesryandraper5476 2 years ago

    I’m curious to know if we can count how many tables there are in a pdf file? Currently working on automation project which needs a power query. Source file is a pdf. Please share if this is possible.

  • @rajahtkanagarajah1258
    @rajahtkanagarajah1258 1 year ago

    Hi Jon, great video. How do we do the same thing for multiple PDFs? I'm not able to replicate this when extracting from a file with many PDFs, where each PDF has a different number of pages with the same table columns. Do share if this is possible.

  • @sasda1234s
    @sasda1234s 2 years ago

    Well Explained! thank you for your efforts here :)

  • @sasda1234s

    @sasda1234s

    1 year ago

    I have a quick question, can we export data from password protected PDF file?

  • @preethisussankoppula2229
    @preethisussankoppula2229 2 years ago

    Hey..thank you for the video..can you please help with my question. I need page number details from pdf file to reflect in Excel sheet. is it possible?

  • @krishnannair4068
    @krishnannair4068 4 months ago

    Brilliant job. Great presentation. You are a great teacher. Thanks a ton 🙏🙏🙏

  • @ExcelCampus

    @ExcelCampus

    4 months ago

    Thanks and welcome! 😀

  • @ahmkowser5888
    @ahmkowser5888 4 years ago

    Hi John this is great

  • @rjobaanable
    @rjobaanable 3 years ago

    thank you very much Jon, but how does it work when you have multiple pdf files?

  • @stevennye5075
    @stevennye5075 4 years ago

    very useful!

  • @syedmukram9332
    @syedmukram9332 4 years ago

    May ALLAH Bless you Sir, Thanks so much

  • @LcTricker

    @LcTricker

    4 years ago

    Allah blesses all Excel / PowerQuery users

  • @KM-co5mx
    @KM-co5mx 4 years ago

    Hi Jon, one more question please... if the pdf file has an image of a table then I assume power query can not scan the table. Correct? I also have pdf files that contain images of tables that I want to bring into Excel. I assume Excel 365 doesn’t have a scan feature. Thanks Again!

  • @knowledgetransfer7460
    @knowledgetransfer7460 8 months ago

    Befitting video found finally which solved my problem.

  • @ExcelCampus

    @ExcelCampus

    8 months ago

    Glad it helped! 😀

  • @KATSIGOAL
    @KATSIGOAL 4 years ago

    is there an easy way to import data from pdf forms straight to excel spreadsheet???

  • @vinaynagaonkar8592
    @vinaynagaonkar8592 1 year ago

    Thank You Mr. Jon, that was a wonderful demonstration of using a Power Query and its Editor very effectively, This will help me a Lot😊

  • @ExcelCampus

    @ExcelCampus

    11 months ago

    You're very welcome, @vinaynagaonkar8592 ! 😀

  • @muhammadjaved5131
    @muhammadjaved5131 4 years ago

    Dear Sir, your videos are very helpful to me. Could you confirm when this PDF feature will be available in Office 2019?

  • @frankiecao442
    @frankiecao442 7 months ago

    Thank you for the amazing tutorial. Is it possible to remove the Super URLs if the PDF content includes them?

  • @lawrence5304
    @lawrence5304 1 year ago

    Hello Jon, how do I import a folder with multiple PDF files, where each PDF file has at least two tables, into Excel?

  • @beverlygilalta
    @beverlygilalta 2 years ago

    Thank you

  • @fluffyhoneytina
    @fluffyhoneytina 2 months ago

    how can we use xlookup on pdf data? it won't work for me.

  • @user-rm4cg2zn6x
    @user-rm4cg2zn6x 5 months ago

    Thank you Sir, Excellent

  • @ExcelCampus

    @ExcelCampus

    5 months ago

    You are most welcome 😀

  • @workingmumkitty
    @workingmumkitty 4 years ago

    Hi Jon, I don't have 'From PDF' option under 'get data', is this something to do with my Excel version pls?

  • @AronieroDYami

    @AronieroDYami

    3 years ago

    I have the same problem. I have Microsoft Office 365 ProPlus and I dont have "From PDF" option. Did you solve this somehow?

  • @tomtomdad

    @tomtomdad

    3 years ago

    I have the same problem. "From PDF" icon does not exist in my Excel.

  • @AronieroDYami

    @AronieroDYami

    3 years ago

    @@tomtomdad I forgot to update my comment here :P atm I have option "From PDF" I think i got this option 1 month ago. So you should get this option after 1-2 months of using Excel :P

  • @tomtomdad

    @tomtomdad

    3 years ago

    @@AronieroDYami Noted. thank you

  • @alterchannel2501
    @alterchannel2501 2 years ago

    Thank you for the video, but I really need help with this problem: I have multiple PDF files in a folder and each file has multiple pages containing tables I need, but every month, depending on the length of the data in the tables, the tables might be on a different page (PDF page number) than last month. Every table in each PDF is always formatted the same every month, but each table in the PDF is different from the others. I need to create a separate query for each table (for example, if I have a total of 20 PDF tables, I need 20 queries to process and import into 20 separate data models). How can I achieve that, making sure that if next month the PDF has more pages, it includes every table I need? Please help me.

  • @danyalahmedrasheed4216
    @danyalahmedrasheed4216 2 years ago

    Awesome!

  • @ihavenoklou4773
    @ihavenoklou4773 3 years ago

    I did the same thing but the pages are empty after transforming. I also did OCR but it's still the same. I can't figure out what I did wrong.

  • @smilejs
    @smilejs 2 months ago

    Thank you. How are we going to do if we need all the tables? Thanks

  • @AndreaUK1973
    @AndreaUK1973 2 years ago

    Hi Lovely! When I tried to import the pdf file, power query says nothing available on the navigation tab although there are 3 pages of tables in my pdf invoice. Is there anyway we could upload the pdf file that bypasses this situation? I’m not sure it’s a pdf settings issue that stop us to upload the info into excel that makes it “unreadable” or something? I have no idea just thoughts. Please tell us if you found a way?

  • @eexmann
    @eexmann 4 years ago

    Hi Jon, Eddy here from the Netherlands. Can't wait to get the new functionality so I can start using Power Query. I would like to send you a PDF file with the results of the solar collectors I have on my roof. Until now I have manually copied the data into an Excel sheet to track my power consumption as well as the power I save using the electricity from my solar cells.

  • @rayzucchero9478
    @rayzucchero9478 3 years ago

    Hi. Nice presentation, but why isn't Excel recognizing my decompressed pdf files? I split some bank statements by months, unzipped them on explorer, but Excel doesn't see them when I try to import - only sees the zipped folder. Thank you

  • @johnzani6965
    @johnzani6965 1 year ago

    Hi, I'm having issues with bulk converting PDFs into Excel using From Folder. I'm bringing in my bank statements for analysis each month. I have Jan-Aug ready in my folder to bulk import initially and then want to add Sept, Nov, Dec, etc. on a month-by-month basis from there on. Unfortunately, some of the PDF files have differing numbers of column headers, depending on the file month. They range from 3 columns to 8 columns. Extra columns occur randomly between the promoted named headers. There should only be the 6 promoted columns I want. I can't just remove the other columns because they contain my data. It's proving almost impossible to do a single query to automate the process. Any tips on this?

  • @helbox
    @helbox 3 years ago

    Hi. Is it possible to load in multiple PDF documents?

  • @altafamirali4946
    @altafamirali4946 1 year ago

    Thank you sir for your informative video. Is it possible to make a video tutorial about PDF file like bank statement

  • @naseershaikh2414
    @naseershaikh2414 3 years ago

    Sir, I have installed Office 365 but it doesn't show the import PDF option. Kindly help me in this regard, thanks.

  • @jaydeenfokuo1350
    @jaydeenfokuo1350 3 years ago

    Hi John, in my office 365 excel, when I go to data, get data, then file, I don't see the pdf option. How do I get the PDF option

  • @noufalc.m458
    @noufalc.m458 2 years ago

    How do I import PDF files from a folder when the PDFs contain more than one page? While generating the query it picks only the 1st page of each PDF. If there is any solution, please reply.

  • @arfaali1092
    @arfaali1092 3 years ago

    Is this feature available in excel 2013, because I don't find it

  • @AnbarasuAnnamalai
    @AnbarasuAnnamalai 4 years ago

    Nice, really cool

  • @ExcelCampus

    @ExcelCampus

    4 years ago

    Thank you! Cheers! 😀

  • @anira1504
    @anira1504 2 years ago

    Hello, could you please explain why the SUM formula does not work after exporting a PDF to Excel? Also, the sum and average are not shown at the bottom of the sheet as usual.

  • @cassandraterry3171
    @cassandraterry3171 3 years ago

    When I try to do this I see multiple pages, but no preview. Am I doing something wrong?

  • @loismathius8747
    @loismathius8747 4 years ago

    Awesome really help

  • @ExcelCampus

    @ExcelCampus

    4 years ago

    Glad it helped 😊

  • @KM-co5mx
    @KM-co5mx 4 years ago

    Greetings John, does the pdf file need to be in a certain format or permission settings? I was told that the pdf files that I wanna import via Excel are not importable because they are locked down.

  • @ExcelCampus

    @ExcelCampus

    4 years ago

    Hey Kurt, Great question! I don't believe the PDF feature supports password protected files yet. You would need to unprotect the file and save it without the password before importing. I tried to import a password protected PDF but got an error message. Hopefully they will make it possible to input the password in the future. Thanks again and have a nice day! 🙂

  • @damusandy
    @damusandy 1 year ago

    Thanks

  • @RAHULGUPTA-qb9cw
    @RAHULGUPTA-qb9cw 4 years ago

    Really thank u sir.

  • @ExcelCampus

    @ExcelCampus

    4 years ago

    Thanks Rahul! 🙂

  • @RAHULGUPTA-qb9cw

    @RAHULGUPTA-qb9cw

    4 years ago

    @@ExcelCampus sir my excel not showing explore PDF

  • @evelyncoleman7476
    @evelyncoleman7476 1 year ago

    Would you show how to put a pdf bank statement into excel as searchable

  • @aperson1181
    @aperson1181 3 years ago

    It is NOT on MS Office Professional Plus 2016. Microsoft wants your subscription fees and it seems to roll the feature only to Office 365.

  • @veryutils
    @veryutils 16 days ago

    Thank you for your informative video! I recommend trying the VeryPDF PDF to Excel Converter. This Windows software automatically converts PDF files to Excel spreadsheets and can also combine multiple PDFs into a single Excel file without losing formatting.

  • @HussamQazi
    @HussamQazi 2 months ago

    I have Bank statement with 50 pages. The issue I am facing that in Power query editor there are separate 50 pages and I have to do all settings 1 by 1 on every page which is a lot time consuming. Is there any way I can make all the pages as 1 page and apply the settings?

  • @hitenumrania
    @hitenumrania 3 years ago

    Hey Jon, Thanks for the informative video, unfortunately I do not have the "Get Data" option in my data tab on my MAC. I have seen some videos on the "Get Data" PDF to Excel conversion in MAC but can't figure out how to get the option on my system. I am running office 365 sub. Pls help

  • @subjectline

    @subjectline

    3 years ago

    Hi, I don't think Power Query exists for Mac. It's Windows only.

  • @user-js8vd9nr3n
    @user-js8vd9nr3n 4 months ago

    Good for very simple pages - not for power users.

  • @puregoddessuniversity
    @puregoddessuniversity 3 years ago

    I dont have the "get data" option in my excel?

  • @michaelmraz2707
    @michaelmraz2707 2 years ago

    How to select all the pages of the pdf in powerquery?

  • @LarryWright2
    @LarryWright2 2 years ago

    I have a complex example that would be very useful. I have 4 years of invoices from a vendor in PDFs. I want to merge them into a single XLS, then have a combined data set to run queries on the data, like searchable item charges, taxes, etc.

  • @alighasemi1141
    @alighasemi1141 4 years ago

    great

  • @zzzrajzzz
    @zzzrajzzz 2 years ago

    How do I remove the page breaks in pdf while loading in excel ?