View Full Version : Batch extract pages from PDF
longedge
07-02-2009, 08:41 AM
I'm looking for a way to batch extract, rename and save pages from pdf's. Anybody come across a tool to do it. The manual method of extract and save as works fine but I've got 8 pdf's each with about 300 pages :eek:
RobertB
07-02-2009, 01:36 PM
I think I understand you request.....but, it would seem impossible for an'action' to know which pages you want to extract....
What if you just deleted the pages (whichever is least time consuming) you do not want, save as.
If you have InDesign, you might have hope....by importing the PDF's to InDesign with a script and then refining your pagination.
http://www.tek-tips.com/viewthread.cfm?qid=1132821
longedge
07-02-2009, 02:21 PM
No I don't have InDesign so that's not an option for me.
Just to explain a little further. I have a number of pdf's each with multiple pages, each page of which refers to a different location. I have to save each page as a separate document named as the location which is part of the page header.
I can do it manually with Acrobat by going through the page extract process and choosing to delete pages from the 'master' as I go so the next page to do is always page 1.
If I can't find a way of doing it automatically it'll be passed on to 2 or 3 of my admin support colleagues. What I'm really worried about is human error. I tried it myself and having done 5 or 6 documents I had already mis-named one of them so with several thousand there's just too much room for mistakes :)
audioguy
07-02-2009, 03:00 PM
This looks like it'll do what you're asking.
http://www.softpedia.com/get/Office-tools/PDF/Advanced-Batch-PDF-Page-Extractor.shtml
Not sure if you can configure it to rename the files according to the page headers though.
Btw, you refer to the saved files as pages and documents, but you didn't specify the file format. Is it single page PDF? Image? Word Doc?
gvanassche
07-02-2009, 03:22 PM
this is what I use for this: Multivalent20060102.jar
a good article on it is here: http://kasperowski.com/2006/10/manipulating-pdf-files-with-multivalent.html
you can create a .BAT or .CMD for Vista and you can pipe the output of to a log in the _TempFolder (in batch it is %temp%) and show this in your AMS UI
hope this helps
gert
longedge
07-03-2009, 08:13 AM
Thanks for the pointers, I'll be having a look over the week-end.
longedge
07-13-2009, 11:08 AM
Just to finish the story off and add some detail. I ended up going back to square one here.
The pdf was generated from a report in one of my Access databases.
A well known Access Guru Stephen Lebans (http://www.lebans.com/) has a web site where there is lot's of free stuff among which is a sample database which includes a couple of dll's to output direct to pdf silently and with the option to name the file as it is saved out (using a variable).
It worked out really well in the end and I now have an automated process free from human input ( the bit where the mistakes are made ).
Powered by vBulletin™ Version 4.0.6 Copyright © 2012 vBulletin Solutions, Inc. All rights reserved.