Gentoo Forums
PDF - reflow text or export to HTML/txt?
Havin_it
Veteran


Joined: 17 Jul 2005
Posts: 1247
Location: Edinburgh, UK

Posted: Tue Jul 15, 2008 9:21 pm    Post subject: PDF - reflow text or export to HTML/txt?

Hello,

I have a Massive Programming Book(TM) that I'm reading, and I'd like to be able to read chunks of it on my mobile phone with Acrobat Reader LE. I already have a PDF version of the book, but at 20MB it's far too large to even open on the phone. I can break it into smaller chunks using CUPS-PDF, but even then I have a problem: the page is still too large to read left-to-right on the phone's weedy screen (even in landscape).

So I'm wondering, is there some way I can get the text into a format I can read within the shabby confines of my phone's screen? The text in the original PDF is "real" text (not just an image of text) so it can be read programmatically, but without basic formatting (bullets, headers etc.) it's pretty unreadable when pasted into a .txt file.

I know that OpenOffice.org 3 will bring PDF editing potential, but I've not seen any information on using the betas of this with Gentoo so far (if you have any tips on this, I'm interested). The only other thing I found in Portage was pdf2html, but that appears to be for creating graphical snapshots of each page, which doesn't solve my problem.

Is there anything available that could solve this conundrum? Any suggestions welcome!
Enlight
Advocate


Joined: 28 Oct 2004
Posts: 3519
Location: Alsace (France)

Posted: Wed Jul 16, 2008 12:47 pm    Post subject:

If you know a bit of Perl, using CAM::PDF should be straightforward. It will allow you to copy some pages to another PDF, and tons of other stuff.

Using PDF::API2 is way more tricky and only needed if you want to mess with PDF internals.
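
For the reflow half of the original question, here is a minimal sketch in Python (not Perl, and not something anyone in this thread proposed). It assumes the book's text has already been dumped to a plain .txt file by some extraction tool, and that bullets survive as lines starting with "- ", "* " or "•". The function name reflow and the 40-column default are made up for illustration. It only re-wraps paragraphs and bullet items to a narrow column; it does not recover headers or other formatting.

```python
import textwrap

# Assumed bullet markers; adjust to whatever the extracted text actually contains.
BULLETS = ("- ", "* ", "\u2022 ")


def reflow(text, width=40):
    """Re-wrap hard-wrapped plain text to a narrow column.

    Blank lines end a paragraph; a line starting with a bullet marker
    starts a new list item. Each block is then wrapped to `width`.
    """
    blocks = []   # each block is a list of lines that belong together
    current = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            # Blank line: close the current paragraph or list item.
            if current:
                blocks.append(current)
                current = []
        elif line.startswith(BULLETS):
            # Bullet line: close the previous block and start a new item.
            if current:
                blocks.append(current)
            current = [line]
        else:
            # Continuation of the current paragraph or list item.
            current.append(line)
    if current:
        blocks.append(current)

    wrapped = []
    for block in blocks:
        joined = " ".join(block)
        # Hanging indent so wrapped bullet text lines up under the item.
        indent = "  " if joined.startswith(BULLETS) else ""
        wrapped.append(
            textwrap.fill(joined, width=width, subsequent_indent=indent)
        )
    return "\n\n".join(wrapped)
```

Usage would be something like print(reflow(open("chapter1.txt").read(), width=30)), where chapter1.txt is a hypothetical file name and the width is whatever fits the phone's screen.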
Havin_it
Veteran


Joined: 17 Jul 2005
Posts: 1247
Location: Edinburgh, UK

Posted: Thu Jul 17, 2008 1:51 am    Post subject:

Hi, thanks for the reply.

Sadly I know nothing of Perl, and I don't think I can justify trying to learn enough to do something with this module (right terminology?) to meet my needs. It looks tempting but I'm concentrating on learning Java right now (that's what the book is for!) and keeping up with my PHP.

Hmmm... wonder if there's anything similar out there for PHP? Will go and have a look...
All times are GMT