From Microsoft Excel | Help & Support with your Formula, Macro, and VBA problems | A Reddit Community

Trying to automate extracting info from PDFs into a table with PowerQuery, but the files somehow aren't structured the same and it's messing up the results.

I thought since the PDFs looked like they were the same format (they're documents from a government agency), they would produce the same results if I ran them through PowerQuery. Somehow, they don't.

I need three pieces of data from each file. Somehow they all end up in different columns despite the files looking identical. I've tried my best to make it fit, but the moment I try to remove extraneous columns, the same error pops up because one of the files doesn't have a specific numbered column.

It's so frustrating. I don't even need it to look nice, I just need the info in a list for convenience. Is there anything I can do to make it work?
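
To make the problem concrete, here is a rough sketch (in Python rather than Power Query, purely as an illustration) of one possible way around it: instead of picking values by column position, look for the label text and take the first non-empty cell to its right. Everything in it is an assumption for the sake of the example: the labels ("Case number", "Applicant", "Issue date"), the sample rows, and the function names are all made up, and the real PDFs may not pair labels and values on the same row at all.

```python
from typing import Dict, List, Optional

Row = List[Optional[str]]


def value_after_label(rows: List[Row], label: str) -> Optional[str]:
    """Return the first non-empty cell to the right of a cell matching `label`.

    Matching ignores case and surrounding whitespace. Returns None when the
    label is not found or has nothing to its right.
    """
    wanted = label.strip().lower()
    for row in rows:
        for i, cell in enumerate(row):
            if cell is not None and cell.strip().lower() == wanted:
                for candidate in row[i + 1:]:
                    if candidate is not None and candidate.strip():
                        return candidate.strip()
    return None


def extract_record(rows: List[Row], labels: List[str]) -> Dict[str, Optional[str]]:
    """Build one output row per file, keyed by label instead of column index."""
    return {label: value_after_label(rows, label) for label in labels}


# Hypothetical extraction results: same information, different column layout.
file_a: List[Row] = [
    ["Case number", "A-1001", None],
    ["Applicant", "Jane Doe", None],
    ["Issue date", "2024-01-15", None],
]
file_b: List[Row] = [
    [None, "Case number", None, "B-2002"],
    [None, "Applicant", "", "John Roe"],
    [None, "Issue date", None, "2024-02-20"],
]

LABELS = ["Case number", "Applicant", "Issue date"]  # hypothetical field names

records = [extract_record(table, LABELS) for table in (file_a, file_b)]
for record in records:
    print(record)
```

Because nothing here refers to a numbered column, a file with one column more or fewer should not raise an error; a field that can't be found just comes back empty. Within Power Query itself, `Table.SelectColumns` accepts an optional `MissingField.Ignore` argument, which may at least stop the missing-column error, though it would not realign values that landed in different columns.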

submitted by /u/DoctorKrakens