Re: [ILUG] PDF Files

From: Rick Moen (rick at domain linuxmafia.com)
Date: Mon 29 Apr 2002 - 22:35:08 IST


Quoting Nick Murtagh (murtaghn at domain tcd.ie):

> The previously mentioned pstotext got text out of that file. The resulting
> mess would encourage me not to bother with that in future.

Well, that's a bit of a mystery: Using ps2ascii from GNU Ghostscript
6.3, I got nothing but the aforementioned ~40 ctrl-L characters.
Certainly, in the future, I'll have a go at it using pstotext, too.
On-line sources, though, suggest that the latter is really pretty much
the same thing, differing in dealing a bit better with punctuation and
ligatures.
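
For anyone who wants to repeat the comparison, here is a minimal sketch of
the check described above, assuming that both ps2ascii and pstotext write
the extracted text to standard output when given a single input file. The
function names and the file name in the usage note are made up for
illustration; they come from neither tool.

```python
import shutil
import subprocess


def is_effectively_empty(text: str) -> bool:
    """Return True if the text holds nothing but whitespace and form feeds (ctrl-L)."""
    return text.strip(" \t\r\n\f\v") == ""


def count_form_feeds(text: str) -> int:
    """Count ctrl-L characters, which extractors tend to emit once per page."""
    return text.count("\f")


def extract_text(path: str, tool: str = "ps2ascii") -> str:
    """Run an extractor on `path` and return whatever it printed.

    Assumption: the tool is on PATH and prints the text to stdout when
    called with one file argument.
    """
    if shutil.which(tool) is None:
        raise FileNotFoundError(f"{tool} not found on PATH")
    result = subprocess.run(
        [tool, path], capture_output=True, text=True, check=True
    )
    return result.stdout
```

Calling is_effectively_empty(extract_text("some-file.pdf")) and then the
same with tool="pstotext" would show whether the two really differ on a
given file.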

In any event, I would suggest that, if the words of a mostly-words
document are accessible only through a graphical viewer with no
prospects whatsoever for extraction, then the document is damaged.


