public inbox for linux-newbie@vger.kernel.org
 help / color / mirror / Atom feed
From: beolach@juno.com
To: hammer@revobild.net
Cc: linux-newbie@vger.kernel.org
Subject: Re: is there some ps-to-text extractor ?
Date: Thu, 31 Jul 2003 08:07:12 GMT	[thread overview]
Message-ID: <20030731.010712.2373.1019401@webmail01.lax.untd.com> (raw)


I haven't used most of these, so I can't recommend any one over
the others, but it looks like all of these should work.

GNU Ghostscript is distributed with several conversion tools,
including ps2ascii. However, in the ps2ascii(1) man page, it
says "ps2ascii doesn't look at font encoding, and isn't very
good at dealing with kerning, so for PostScript (but not
currently PDF), you might consider pstotext" and points to
<http://www.research.digital.com/SRC/virtualpaper/pstotext.html>.

Also look at PreScript <http://www.nzdl.org/html/prescript.html>,
the PDF Conversion module <http://sourceforge.net/projects/pdf995/>,
and maybe GemDoc <http://www.gemini1consulting.com/gemdoc/>, but
note that GemDoc is for MS Windows, and is not free (other than 14
day trial).

Conway S. Smith

--- Heimo Claasen <hammer@revobild.net> wrote:
> 
> The "pdftotext" application (part of the "xpdf" package) is a real
> blessing; it needs some post-editing but still gets better results 
> with
> extracting the contents than the very own Adobe "service" to strip 
> this
> idiot formatting.
> 
> Question now: Is there any "ps-to-text" converter existing which 
> would do
> this with postscript/ghostscript files ?
> (I dindn't get something meaningful from a web search.)
> 
> // Heimo Claasen // <hammer at revobild dot net> // Brussels 
> 2003-07-30
> The WebPlace of ReRead - and much to read  ==>  
> http://www.revobild.net
> 

________________________________________________________________
The best thing to hit the internet in years - Juno SpeedBand!
Surf the web up to FIVE TIMES FASTER!
Only $14.95/ month - visit www.juno.com to sign up today!
-
To unsubscribe from this list: send the line "unsubscribe linux-newbie" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.linux-learn.org/faqs

             reply	other threads:[~2003-07-31  8:07 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2003-07-31  8:07 beolach [this message]
  -- strict thread matches above, loose matches on Subject: below --
2003-08-02  0:00 is there some ps-to-text extractor ? Heimo Claasen
2003-07-31  7:36 robin
2003-07-31  0:00 Heimo Claasen
2003-07-31  4:33 ` Dan Zlotnikov

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20030731.010712.2373.1019401@webmail01.lax.untd.com \
    --to=beolach@juno.com \
    --cc=hammer@revobild.net \
    --cc=linux-newbie@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox