View previous topic :: View next topic |
Author |
Message |
bludger Guru
Joined: 09 Apr 2003 Posts: 389
|
Posted: Wed Feb 04, 2004 10:38 am Post subject: PDF search utility |
|
|
Does anyone out there know of a utility to search through a number of pdf files for a text string similarly to grep. Acroread and xpdf can search pdf files one at a time, but I am looking for something to search through a number of files, preferably from the command line. |
|
Back to top |
|
|
superurban n00b
Joined: 17 May 2003 Posts: 37
|
Posted: Wed Feb 04, 2004 1:58 pm Post subject: |
|
|
hi,
I'm not sure, but maybe htdig will help you out. atleast it's capable of searching through pdfs.
good luck! |
|
Back to top |
|
|
bludger Guru
Joined: 09 Apr 2003 Posts: 389
|
Posted: Wed Feb 18, 2004 9:35 am Post subject: |
|
|
Thanks, I couldn't get htdig to work, but I did find another solution.
I simply ran
Code: | pdftotext -layout <filename> |
on each of the pdf files and then used grep etc. to search.
BTW. I couldn't work out how to send the output of pdftotext to stdout. The man page says
"If text-file is -', the text is sent to stdout" (at least on my screen). I am not sure which character is meant here.
I also found a utility pdfsearch at http://sourceforge.net/projects/pdfsearch/. It is not in portage, though and I have not tried it. |
|
Back to top |
|
|
funeagle Tux's lil' helper
Joined: 05 Aug 2003 Posts: 102 Location: London
|
Posted: Sat Apr 10, 2004 7:15 pm Post subject: |
|
|
Code: | pdftotext -layout somefile.pdf - |
|
|
Back to top |
|
|
|