I would look at the open source PDF libraries first. There are quite a few
on SourceForge. Most are aimed at writing PDF rather than reading it, so I am
not 100% sure they will fit your needs. The next direction is Adobe or one of
the many third-party PDF libraries. Until you can get clean source out of the
PDF, you will not get clean output.
If this is a web application, I would shy away from the Office libs. They are
not a big deal in desktop or console apps, but in a web environment it is
wiser to go with a solution like WordWriter (OfficeWriter if you need other
formats) than to use the Word libs from Microsoft.

Gregory A. Beamer
MVP; MCP: +I, SE, SD, DBA
*************************************************
Think outside of the box!
*************************************************
> Hi everyone,
>
> Thnx.