cURL / Mailing Lists / curl-and-php / Single Mail

curl-and-php

data scrapping help

From: haroon ahmad <haroon_at_fastcreators.com>
Date: Fri, 17 Aug 2007 10:25:43 +0500

Hi

I have to browse a site which has SE friendly URLs on the pattern

domain.com/category/

this kind of URL contains different subcategories names that can be clicked
to form a URL like

 domain.com/category/subcat/

then this kind of URL contains different names which produce URLs like

 domain.com/category/subcat/name.htm

and my main goal is to fetch information from this final URLs

so I want to write a script which can browse the website for all available
'subcat' and 'name' combinations and read data from the name pages i.e

domain.com/category/*/*.htm

can anybody guide me please how can i proceed for this kind of automatic
process that can browse all urls one by one and then fetch data, i can write
the fetching data part with cURL only problem is the logic for browsing urls
one by one.

thank you

_______________________________________________
http://cool.haxx.se/cgi-bin/mailman/listinfo/curl-and-php
Received on 2007-08-17