Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
HomeAnnouncementsFree MagazinesWhite PapersSubmit Content
Discussion GroupsASP.NETWindows FormsLanguages.NET FrameworkVisual Studio.NET
Articles.NET FrameworkASP.NETToolsWindows Forms
.NET DirectoryOpen Source ProjectsUser GroupsWeb Resources
Related Topics
Visual Basic 6SQL ServerMS AccessOther DB ProductsMS Server ProductsMore Topics ...

.NET Forum / .NET Framework / .NET SDK / March 2004

Tip: Looking for answers? Try searching our database.

HTML parsing

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
z. f. - 04 Mar 2004 08:35 GMT
Hi,

what is the best way to parse HTML (coming from servers) in order to automate HTML/HTTP interaction.
i know i can use the internet explorer control, but this seems to be the most expensive way, but also it is convenience, since i can put values in input and submit forms etc.

on the other hand i can use webrequest control to get/post pages, but then i have to parse html my self.

is there something in the middle - not to use all resouce internet explorer control  uses (can i hide it, so it will not show anything, or play sound), and anyway is there some control that can allow me to navigate the DOM of an html page without all the GUI?

TIA.
Philipp Sumi - 04 Mar 2004 09:02 GMT
> play sound), and anyway is *there some control that can allow me to
> navigate the DOM of an html page without all the GUI?*

check out this one:
http://www.gotdotnet.com/Community/UserSamples/Details.aspx?SampleGuid=B90FDDCE-
E60D-43F8-A5C4-C3BD760564BC


(and don't crosspost please)

hth, philipp
Jesper V. Andersen - 04 Mar 2004 12:41 GMT
SgmlReader or HtmlAgility (you can find both on gotdotnet). They both work
(with a few minor twists) and allow you to navigate just about any HTML
document like it were wellformed XML ...

Regards
Jesper
Hi,

what is the best way to parse HTML (coming from servers) in order to
automate HTML/HTTP interaction.
i know i can use the internet explorer control, but this seems to be the
most expensive way, but also it is convenience, since i can put values in
input and submit forms etc.

on the other hand i can use webrequest control to get/post pages, but then i
have to parse html my self.

is there something in the middle - not to use all resouce internet explorer
control  uses (can i hide it, so it will not show anything, or play sound),
and anyway is there some control that can allow me to navigate the DOM of an
html page without all the GUI?

TIA.

Free Magazines

Get these publications absolutely FREE for up to 12 months. There are no hidden fees and no obligation. Simply choose a title, complete the application form and submit it. Read more ...

Oracle MagazineNetwork ComputingComputer WorldBio-IT WorldeWeekInformation WeekInfosecurity
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.