Read information inside PDF

automationpdf

I'd like to automate the invoices I receive from a particular company. Unfortunately they name the files very strangely and the filename is not at all related to invoice number, or date. So I'm trying to find something to extract information from inside the PDF. I would look for particular strings and numbers. I can think of Spotlight that would be able to do that, but I don't think that's the best way. Suggestions and solutions to this particular problem are welcome.

Thank you!

Best Answer

For that kind of things I use Hazel. I have a very easy way to filter PDF contents based on rules. The below image contains a rule that I use to organise my invoices. Hazel keeps watching a folder, and acts on every new file there based on your defined rules.

Screenshot of Hazel configuration