Convert .rtf or .doc or .pdf or .htm to plain txt

Dave - 28 Jan 2005 13:14 GMT
Greetings,

Is anybody aware of any code that will allow me to read .rtf or .doc or .pdf
or .htm as plain text (so I can do a streamreader off them).  Thanks,

-Dave
David Browne - 28 Jan 2005 14:17 GMT
> Greetings,
>
> Is anybody aware of any code that will allow me to read .rtf or .doc or
> .pdf or .htm as plain text (so I can do a streamreader off them).  Thanks,

Each format would require a different tool.  Microsoft Word can do .rtf and,
of course, .doc.

But for PDF check out the pdftotext.exe from the XPDF library

 http://www.foolabs.com/xpdf/download.html

from their web site:

"Xpdf is an open source viewer for Portable Document Format (PDF) files.
(These are also sometimes also called 'Acrobat' files, from the name of
Adobe's PDF software.) The Xpdf project also includes a PDF text extractor,
PDF-to-PostScript converter, and various other utilities.

Xpdf runs under the X Window System on UNIX, VMS, and OS/2. The non-X
components (pdftops, pdftotext, etc.) also run on Win32 systems and should
run on pretty much any system with a decent C++ compiler. "

It's a command-line tool, so you would need to shell out to it and then open
a StreamReader against the output file.
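
The shell-out-and-read pattern described above can be sketched as follows. This is a minimal illustration in Python rather than .NET, and it assumes that pdftotext is on the PATH and accepts "-enc UTF-8" followed by the input PDF and the output text file. The helper names build_command and pdf_to_text are hypothetical and are not part of Xpdf.

```python
import subprocess
import tempfile
from pathlib import Path


def build_command(pdf_path, txt_path, exe="pdftotext"):
    """Build the argument list for a pdftotext call (hypothetical helper).

    Assumes the usual invocation: pdftotext [options] PDF-file text-file.
    """
    return [exe, "-enc", "UTF-8", str(pdf_path), str(txt_path)]


def pdf_to_text(pdf_path, exe="pdftotext"):
    """Shell out to pdftotext, then read the text file it produced."""
    with tempfile.TemporaryDirectory() as tmp:
        txt_path = Path(tmp) / "out.txt"
        # check=True raises CalledProcessError if the tool exits with an error.
        subprocess.run(build_command(pdf_path, txt_path, exe), check=True)
        # Read the result before the temporary directory is cleaned up.
        return txt_path.read_text(encoding="utf-8", errors="replace")
```

Calling pdf_to_text("report.pdf") would return the extracted text as a string, where "report.pdf" is a placeholder file name. The same steps apply in .NET: start the process, wait for it to exit, then open a StreamReader on the output file.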

David
Beringer - 28 Jan 2005 14:59 GMT
As a related topic:
Does anybody know of code examples on how to convert RTF to HTML, XML etc?

Thanks in advance,
Eric
Matt Berther - 28 Jan 2005 19:49 GMT
Hello Beringer,

I'm not completely sure about this, but wvWare[1] may do what you need.

[1] http://wvware.sourceforge.net/

--
Matt Berther
http://www.mattberther.com
Dave - 28 Jan 2005 18:38 GMT
David,

This tool from Foolabs does exactly what I was looking for.  I am looking to
use it, though, in the .NET Compact Framework.  Is there a way to do that?

-Dave
David Browne - 29 Jan 2005 21:25 GMT
> David,
>
> This tool from Foolabs does exactly what I was looking for.  I am looking
> to use it, though, in the .NET Compact Framework.  Is there a way to do
> that?

It's not managed code: It's a platform binary compiled in C++.  It might
run, or you might be able to compile it for your platform.

David