Searchable PDF vs Image PDF

I have a lot of PDFs. I need to separate them based on scanned image PDFs and searchable PDFs. I would like one type to into one folder and the other type to go into another folder. I understand there is no program that can do this 100% but even if I can get 60% of them it will save me some time. I’d like to shoot for 80% accuracy. I know looking at file size might be an indicator and also some common searchable words. I am very new to PowerShell so I was hoping someone already has a script for this.

Thank you.

Welcome to the forum. :wave:t4:

I’d expect that to be very unlikely. :man_shrugging:t4:

Regardless of that - this forum is for scripting questions rather than script requests. We do not write customized and ready to use scripts or solutions on request. But we would be happy to assist with code you wrote by yourself. :wink: :+1:t4:

Hi, welcome to the forum :wave:

It might be easier to identify the scanned PDFs that you think, especially if they were created by the same scanner/scanner software.

If I check the first ten lines of files that I’ve scanned:

Get-Content .\myScanned.PDF -First 10

I can see they all contain:

/Creator (Canon )

On that basis it would be easy to identify which files to move using Select-String.

1 Like

Thank you. This was helpful.