cURL / Mailing Lists / curl-library / Single Mail

curl-library

parsing html/xml, lynx -dump, APIs, Expat

From: James Wettenhall <wettenhall_at_wehi.edu.au>
Date: Thu, 15 May 2003 00:13:39 +1000 (EST)

Hi,

Just replying to my recent post: I'm wanting to save a
webpage as text in the way that lynx -dump does, but
preferably using an API, rather than a system call.

Thanks for the responses.

I found an XML parser library which looks pretty good:

http://sourceforge.net/projects/expat/

I think it requires strict XML, but that's OK, because I've
discovered that the HTML pages I was trying to convert to
text can be generated in XML instead of HTML.

Regards,
James

-------------------------------------------------------
Enterprise Linux Forum Conference & Expo, June 4-6, 2003, Santa Clara
The only event dedicated to issues related to Linux enterprise solutions
www.enterpriselinuxforum.com
Received on 2003-05-14