CaptainBlood Advocate
Joined: 24 Jan 2010 Posts: 3864
|
Posted: Mon Sep 06, 2021 6:49 pm Post subject: how to pdf to text?[solved] |
|
|
Any idea which package to install to convert PDF to text?
Thks 4 ur attention, interest & support. _________________ USE="-* ..." in /etc/portage/make.conf here, i.e. a countermeasure to portage implicit braces, belt & diaper paradigm
LT: "I've been doing a passable imitation of the Fontana di Trevi, except my medium is mucus. Sooo much mucus. "
Last edited by CaptainBlood on Mon Sep 06, 2021 10:48 pm; edited 1 time in total |
|
fedeliallalinea Administrator
Joined: 08 Mar 2003 Posts: 31269 Location: here
|
Posted: Mon Sep 06, 2021 6:59 pm Post subject: |
|
|
I use OCRmyPDF to transform an image-only PDF into text. _________________ Questions are guaranteed in life; Answers aren't. |
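For scanned PDFs that have no text layer, the OCRmyPDF route could be sketched roughly as below. The helper name `ocr_to_text` and the output file names are hypothetical, and the sketch assumes the `ocrmypdf` command is installed; `--sidecar` is the OCRmyPDF option that writes the recognized text to a separate file next to the OCRed PDF.

```shell
# Hypothetical helper (not from the thread): OCR a scanned PDF and
# write the recognized text to a .txt sidecar file.
# Assumes the ocrmypdf command is installed and on PATH.
ocr_to_text() {
    src=$1
    base=${src%.pdf}
    if ! command -v ocrmypdf >/dev/null 2>&1; then
        echo "ocrmypdf not found" >&2
        return 127
    fi
    # --sidecar FILE: also write the OCR text to FILE
    ocrmypdf --sidecar "$base.txt" "$src" "$base-ocr.pdf"
}
```

Calling `ocr_to_text scan.pdf` would then be expected to produce `scan-ocr.pdf` and `scan.txt`.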
|
carcajou Apprentice
Joined: 10 Jun 2008 Posts: 248
|
Posted: Mon Sep 06, 2021 7:42 pm Post subject: |
|
|
Maybe app-text/tesseract?
Also, lately I have been using LibreOffice. It has worked quite well for me when I need to make quick PDF edits. The downside is that it usually treats the text as a bunch of separate text boxes. |
|
CaptainBlood Advocate
Joined: 24 Jan 2010 Posts: 3864
|
Posted: Mon Sep 06, 2021 8:23 pm Post subject: |
|
|
fedeliallalinea, Thks.
Do I need any of these: Code: | eix media-libs/jbig2enc
* media-libs/jbig2enc
Available versions: 0.28-r1 0.29 {gif jpeg png tiff webp} | Thks 4 ur attention, interest & support. |
|
CaptainBlood Advocate
Joined: 24 Jan 2010 Posts: 3864
|
Posted: Mon Sep 06, 2021 9:19 pm Post subject: |
|
|
kukibl wrote: | Maybe app-text/tesseract? | Yes, maybe.
The OCRmyPDF and tesseract docs don't seem handy for my use case:
The PDF files I wish to listen to seem to have a text layer already, as I can select text in a PDF viewer such as evince.
Building firefox is currently holding me back from trying anything further.
It shouldn't last long, though.
Thks |
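Whether a PDF already carries a text layer can also be checked from the command line rather than by selecting text in a viewer. The following is only a sketch: the helper name `has_text_layer` is hypothetical, and it assumes `pdftotext` from poppler is installed. Passing `-` as the output name makes `pdftotext` write to standard output.

```shell
# Hypothetical helper (not from the thread): succeed if pdftotext can
# extract at least one non-blank character from the PDF.
# Assumes pdftotext (poppler utilities) is installed.
has_text_layer() {
    # "-" as the output file name sends the extracted text to stdout;
    # grep -q succeeds as soon as it sees a non-whitespace character
    pdftotext "$1" - 2>/dev/null | grep -q '[^[:space:]]'
}
```

A file for which `has_text_layer file.pdf` fails is probably image-only and would need OCR first.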
|
AJM Apprentice
Joined: 25 Sep 2002 Posts: 195 Location: Aberdeen, Scotland
|
Posted: Mon Sep 06, 2021 9:35 pm Post subject: |
|
|
CaptainBlood wrote: |
The OCRmyPDF and tesseract docs don't seem handy for my use case:
The PDF files I wish to listen to seem to have a text layer already, as I can select text in a PDF viewer such as evince. |
How about pdftotext from app-text/poppler (might need the utils USE flag?) |
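A minimal sketch of that suggestion, assuming `pdftotext` is available (on Gentoo, from app-text/poppler built with the utils USE flag). The helper name `pdf_to_text` and its default output name are hypothetical; `-layout` is the `pdftotext` option that tries to keep the page's physical layout.

```shell
# Hypothetical helper (not from the thread): extract the existing text
# layer of a PDF into a plain-text file.
# Assumes pdftotext (poppler utilities) is installed.
pdf_to_text() {
    src=$1
    # Default output name: same as the input, with .pdf replaced by .txt
    out=${2:-${src%.pdf}.txt}
    # -layout tries to preserve the physical layout of the page
    pdftotext -layout "$src" "$out"
}
```

For example, `pdf_to_text book.pdf` would be expected to write `book.txt`, and `pdf_to_text book.pdf notes.txt` to write `notes.txt`.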
|
CaptainBlood Advocate
Joined: 24 Jan 2010 Posts: 3864
|
Posted: Mon Sep 06, 2021 10:47 pm Post subject: |
|
|
AJM wrote: | How about pdftotext from app-text/poppler (might need the utils USE flag?) | That might be the minimalistic thingie I couldn't find a path to at first...
poppler[utils] is already installed here; I missed it because I was looking for a standalone package.
Thks 4 ur attention, interest & support.
Last edited by CaptainBlood on Tue Sep 07, 2021 6:24 am; edited 1 time in total |
|
figueroa Advocate
Joined: 14 Aug 2005 Posts: 3005 Location: Edge of marsh USA
|
Posted: Tue Sep 07, 2021 4:21 am Post subject: |
|
|
I know you got this already, but:
Code: | $ ls /usr/bin | grep pdf |
and then:
Code: | $ equery b /usr/bin/pdftotext
* Searching for /usr/bin/pdftotext ...
app-text/poppler-21.07.0 (/usr/bin/pdftotext) |
_________________ Andy Figueroa
hp pavilion hpe h8-1260t/2AB5; spinning rust x3
i7-2600 @ 3.40GHz; 16 gb; Radeon HD 7570
amd64/23.0/split-usr/desktop (stable), OpenRC, -systemd -pulseaudio -uefi |
|