On Wed, 19 May 2004 15:57:23 +0100
Justin MacCarthy <justin at maccarthy.org> wrote:
> I want to use wget to give me a list of all URIs referenced on
> my domain (and only my domain), so I can remove old files. I'm
> sure there is a way to do this with wget, but I just can't see
> it (really bad flu today). I don't want to download anything,
> just list the documents so I can clean up a local copy.
Hope you're feeling better.
I don't think wget can really do what you want yet. It has a
--spider option, but that doesn't work recursively.
Something like momspider, checkbot, or linkchecker should do the
job.
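
If you'd rather roll your own, here is a rough sketch in Python of
the kind of crawl those tools perform. It is not how any of them is
actually implemented, just a minimal illustration: it starts at one
URL, follows href/src links, and keeps only URLs on the same host.
The names LinkParser, fetch and list_site_urls are made up for this
example. It does have to request each page to find the links in it,
but it writes nothing to disk.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collect href/src attribute values from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def fetch(url):
    """Return the body as text, or "" if it is not HTML or the request fails."""
    try:
        with urlopen(url, timeout=10) as resp:
            if "html" not in resp.headers.get("Content-Type", ""):
                return ""
            return resp.read().decode("utf-8", errors="replace")
    except OSError:  # URLError and HTTPError are subclasses of OSError
        return ""


def list_site_urls(start_url, fetcher=fetch):
    """Return a sorted list of same-host URLs reachable from start_url."""
    host = urlparse(start_url).netloc
    seen = {start_url}
    queue = [start_url]
    while queue:
        page = queue.pop()
        parser = LinkParser()
        parser.feed(fetcher(page))
        for link in parser.links:
            # Resolve relative links and drop any #fragment.
            url = urldefrag(urljoin(page, link)).url
            if urlparse(url).netloc == host and url not in seen:
                seen.add(url)
                queue.append(url)
    return sorted(seen)
```

Calling list_site_urls("http://www.example.org/") (a placeholder
address) and printing the result one per line gives a list you can
compare against the files in your local copy. It only sees links in
plain HTML attributes, so anything referenced from CSS or JavaScript
will be missed; treat the output as a starting point, not a delete
list.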
If the content is all static, rsync will clean up the differences
for you.
-fr.
--
Feargal Reilly.
PGP Key: 0x0E7EE8D8 (expires 06-Aug-2004)
Web: http://www.helgrim.com/ | ICQ: 109837009 | YIM: ectoraige
Visit http://ie.bsd.net - BSD's presence in Ireland