Hi all,
I need a little help with this, if anyone knows. We have a machine
running Debian 2.2, kernel 2.4.2, on a dual-CPU PIII. It's used for both
computation and general surfing/email etc.
We have two long running computations running on it, which have a
few weeks left and cannot be stopped, so a reboot is out. It would appear
that Netscape crashed on one of the users and left a corrupted or
unreadable /proc entry. As a result, ps and top both freeze: whenever
that directory in /proc is read, the process reading it seems to freeze.
We now have a load average of about 41:
gavin@buggeredmachine:/home/gavin$ uptime
10:54am up 50 days, 9:07, 17 users, load average: 41.09, 41.22, 41.24
Everything else seems to be fine, but we'd like to correct this without
a reboot. Do you think the CPU is wasting much time on the corrupted
processes? Do you think the computation (nice 19) is still using most of
the CPU or not (it would normally use 80%)? Can anyone see what has
caused this and, in particular, does anyone have a solution? Does it seem
likely that Netscape could do this?
Thanks in advance, mail me back if you want any more info. Please
cc my address as I am only on the summary email thingy.
Cheers
Gavin