public inbox for linux-newbie@vger.kernel.org
 help / color / mirror / Atom feed
* Re: is there some ps-to-text extractor ?
@ 2003-07-31  7:36 robin
  0 siblings, 0 replies; 5+ messages in thread
From: robin @ 2003-07-31  7:36 UTC (permalink / raw)
  To: Heimo Claasen, linux-newbie

> Question now: Is there any "ps-to-text"
> converter existing which would do
> this with postscript/ghostscript files ?
> (I dindn't get something meaningful from a web
> search.)

No? I tried
http://www.google.com/search?q=pstotext
and got
http://packages.debian.org/stable/text/pstotext.html

Have fun,
Robin
-
To unsubscribe from this list: send the line "unsubscribe linux-newbie" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.linux-learn.org/faqs

^ permalink raw reply	[flat|nested] 5+ messages in thread
* Re: is there some ps-to-text extractor ?
@ 2003-08-02  0:00 Heimo Claasen
  0 siblings, 0 replies; 5+ messages in thread
From: Heimo Claasen @ 2003-08-02  0:00 UTC (permalink / raw)
  To: linux-newbie

Thanks for the hints flowing !

Now I can try them all on a HUGE manual - 120+ pages entirely cosisting
of text - I guess it will reduce to <30 pp pure text, <bg>.

// Heimo Claasen // <hammer at revobild dot net> // Brussels 2003-08-02
The WebPlace of ReRead - and much to read  ==>  http://www.revobild.net

-
To unsubscribe from this list: send the line "unsubscribe linux-newbie" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.linux-learn.org/faqs

^ permalink raw reply	[flat|nested] 5+ messages in thread
* Re: is there some ps-to-text extractor ?
@ 2003-07-31  8:07 beolach
  0 siblings, 0 replies; 5+ messages in thread
From: beolach @ 2003-07-31  8:07 UTC (permalink / raw)
  To: hammer; +Cc: linux-newbie


I haven't used most of these, so I can't recommend any one over
the others, but it looks like all of these should work.

GNU Ghostscript is distributed with several conversion tools,
including ps2ascii. However, in the ps2ascii(1) man page, it
says "ps2ascii doesn't look at font encoding, and isn't very
good at dealing with kerning, so for PostScript (but not
currently PDF), you might consider pstotext" and points to
<http://www.research.digital.com/SRC/virtualpaper/pstotext.html>.

Also look at PreScript <http://www.nzdl.org/html/prescript.html>,
the PDF Conversion module <http://sourceforge.net/projects/pdf995/>,
and maybe GemDoc <http://www.gemini1consulting.com/gemdoc/>, but
note that GemDoc is for MS Windows, and is not free (other than 14
day trial).

Conway S. Smith

--- Heimo Claasen <hammer@revobild.net> wrote:
> 
> The "pdftotext" application (part of the "xpdf" package) is a real
> blessing; it needs some post-editing but still gets better results 
> with
> extracting the contents than the very own Adobe "service" to strip 
> this
> idiot formatting.
> 
> Question now: Is there any "ps-to-text" converter existing which 
> would do
> this with postscript/ghostscript files ?
> (I dindn't get something meaningful from a web search.)
> 
> // Heimo Claasen // <hammer at revobild dot net> // Brussels 
> 2003-07-30
> The WebPlace of ReRead - and much to read  ==>  
> http://www.revobild.net
> 

________________________________________________________________
The best thing to hit the internet in years - Juno SpeedBand!
Surf the web up to FIVE TIMES FASTER!
Only $14.95/ month - visit www.juno.com to sign up today!
-
To unsubscribe from this list: send the line "unsubscribe linux-newbie" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.linux-learn.org/faqs

^ permalink raw reply	[flat|nested] 5+ messages in thread
* is there some ps-to-text extractor ?
@ 2003-07-31  0:00 Heimo Claasen
  2003-07-31  4:33 ` Dan Zlotnikov
  0 siblings, 1 reply; 5+ messages in thread
From: Heimo Claasen @ 2003-07-31  0:00 UTC (permalink / raw)
  To: linux-newbie

The "pdftotext" application (part of the "xpdf" package) is a real
blessing; it needs some post-editing but still gets better results with
extracting the contents than the very own Adobe "service" to strip this
idiot formatting.

Question now: Is there any "ps-to-text" converter existing which would do
this with postscript/ghostscript files ?
(I dindn't get something meaningful from a web search.)

// Heimo Claasen // <hammer at revobild dot net> // Brussels 2003-07-30
The WebPlace of ReRead - and much to read  ==>  http://www.revobild.net

-
To unsubscribe from this list: send the line "unsubscribe linux-newbie" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.linux-learn.org/faqs

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2003-08-02  0:00 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-07-31  7:36 is there some ps-to-text extractor ? robin
  -- strict thread matches above, loose matches on Subject: below --
2003-08-02  0:00 Heimo Claasen
2003-07-31  8:07 beolach
2003-07-31  0:00 Heimo Claasen
2003-07-31  4:33 ` Dan Zlotnikov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox