.NET Forum / .NET Framework / Performance / November 2006

High Performance Xml parser

rony - 27 Nov 2006 17:17 GMT
Hi,
I am looking for a component that lets me parse my XML files.
The reason I am asking is that my XML files are huge; they can
reach 1 GB, more or less.
Parsing such a file takes something like 5 hours.
Right now I am using XmlRead, XmlNode ... (I do not load the file into
memory).
Can you suggest better components to use?

** I tried SAX but I couldn't understand how it works, because there are
no examples for .NET and the documentation is very bad.
P.S.: I am writing in C#.

Regards, Rony
David Browne - 27 Nov 2006 17:41 GMT
> Hi,
> I am looking for a component that lets me parse my XML files.
[quoted text clipped - 8 lines]
> no examples for .NET and the documentation is very bad.
> P.S.: I am writing in C#.

System.Xml.XmlReader?
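
For anyone unsure what that looks like in practice, here is a minimal sketch of forward-only reading with System.Xml.XmlReader. It only counts elements with a given name, so it never builds a DOM. The class name `StreamingCount`, the method `CountElements`, and the element name in the usage line are made up for illustration and are not from this thread.

```csharp
using System;
using System.IO;
using System.Xml;

// Hypothetical helper: names are illustrative only.
static class StreamingCount
{
    // Counts elements whose local name matches elementName.
    // The reader moves forward node by node, so memory use does not
    // grow with the size of the document.
    public static long CountElements(TextReader input, string elementName)
    {
        var settings = new XmlReaderSettings
        {
            IgnoreWhitespace = true, // skip insignificant whitespace nodes
            IgnoreComments = true    // skip comment nodes
        };

        long count = 0;
        using (XmlReader reader = XmlReader.Create(input, settings))
        {
            while (reader.Read())
            {
                if (reader.NodeType == XmlNodeType.Element &&
                    reader.LocalName == elementName)
                {
                    count++;
                }
            }
        }
        return count;
    }
}
```

Usage might look like `StreamingCount.CountElements(new StreamReader("huge.xml"), "record")`, where both the file name and the element name are placeholders.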

David
joachim@yamagata-europe.com - 28 Nov 2006 08:44 GMT
> System.Xml.XmlReader?

For files above 1 GB this won't work. If the XML has not been formatted,
all the data is on one line, and you won't succeed in reading it.

I had to do something similar, so I wrote a buffered reader myself and
handled the parsing with an Aho-Corasick search engine with
callbacks placed on top of it. This is more or less the way SAX works.
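
To give a rough idea of the approach, here is a minimal sketch (not the actual code described above): an Aho-Corasick automaton built from a set of non-empty patterns, fed from a fixed-size buffer, firing a callback for each match. The class name `AhoCorasickScanner`, the 64 KB buffer size, and the callback signature are all assumptions made for this illustration.

```csharp
using System;
using System.Collections.Generic;
using System.IO;

// Hypothetical illustration; all names here are assumptions.
sealed class AhoCorasickScanner
{
    private sealed class Node
    {
        public readonly Dictionary<char, Node> Next = new Dictionary<char, Node>();
        public readonly List<string> Outputs = new List<string>();
        public Node Fail;
    }

    private readonly Node root = new Node();

    // Patterns are assumed to be non-empty strings.
    public AhoCorasickScanner(IEnumerable<string> patterns)
    {
        // Step 1: build a trie of all patterns.
        foreach (string pattern in patterns)
        {
            Node node = root;
            foreach (char c in pattern)
            {
                Node child;
                if (!node.Next.TryGetValue(c, out child))
                {
                    child = new Node();
                    node.Next[c] = child;
                }
                node = child;
            }
            node.Outputs.Add(pattern);
        }

        // Step 2: breadth-first pass to set failure links.
        var queue = new Queue<Node>();
        foreach (Node child in root.Next.Values)
        {
            child.Fail = root;
            queue.Enqueue(child);
        }
        while (queue.Count > 0)
        {
            Node node = queue.Dequeue();
            foreach (KeyValuePair<char, Node> edge in node.Next)
            {
                Node fail = node.Fail;
                while (fail != root && !fail.Next.ContainsKey(edge.Key))
                    fail = fail.Fail;

                Node target;
                edge.Value.Fail = fail.Next.TryGetValue(edge.Key, out target) ? target : root;
                // Inherit patterns that end at the failure target.
                edge.Value.Outputs.AddRange(edge.Value.Fail.Outputs);
                queue.Enqueue(edge.Value);
            }
        }
    }

    // Reads the input in fixed-size chunks and calls onMatch(pattern, startOffset)
    // for every occurrence. The automaton state is kept across chunks, so matches
    // that span a buffer boundary are still found.
    public void Scan(TextReader input, Action<string, long> onMatch)
    {
        var buffer = new char[64 * 1024];
        Node state = root;
        long position = 0;
        int read;
        while ((read = input.Read(buffer, 0, buffer.Length)) > 0)
        {
            for (int i = 0; i < read; i++, position++)
            {
                Node next;
                while (!state.Next.TryGetValue(buffer[i], out next) && state != root)
                    state = state.Fail;
                state = next ?? root;

                foreach (string pattern in state.Outputs)
                    onMatch(pattern, position - pattern.Length + 1);
            }
        }
    }
}
```

You would construct it with the tags you care about, for example `new AhoCorasickScanner(new[] { "<item>", "</item>" })`, and do the real work inside the callback. Note that this only finds character sequences; it does not validate the XML or handle entities, CDATA or namespaces.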

Parsing time for 2 GB was reduced to 3 minutes :-).

Drop me a line if you want to know more,

Regards,
Joachim