On Fri, 3 Jan 2003, Sameer D. Sahasrabuddhe wrote:
On Fri, Jan 03, 2003 at 01:29:01AM +0530, Sameer D. Sahasrabuddhe wrote:
Our server seems to have run into a problem ... there's a dead process with an RSS of 2GB, and it refuses to die no matter what!
The swap is down to zero, the machine is thrashing, but we can't afford to restart!
We had kept the server running in this state overnight, with kswapd running at full priority - it's using around 40% CPU time - but no results! I guess it's time to restart the server.
How can I get more info about what happened to the server? The user involved says he just aborted some process with "Ctrl-C" but it refused to go away - it was some really heavy program, related to some project work of his. Is there any information about the system that I should save before I restart?
Sameer.
Have you tried attaching to the process using strace (strace -p <pid>)? That might give some clue as to where it's stuck.
-Rajesh
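
On the question of what to save before restarting: one option is to copy the stuck process's entries from /proc, since they disappear on reboot. The script below is only a minimal sketch, not something posted in this thread. The function names snapshot_process and process_state and the choice of files are assumptions made for illustration, and the wchan entry is only present on kernels that expose it. A State of "D" (uninterruptible sleep) would explain why the process ignores signals, including SIGKILL.

```python
import os
import sys


def snapshot_process(pid, proc_root="/proc"):
    """Collect a few /proc entries for one process into a dict.

    proc_root is a parameter only so the function can be pointed at a
    copy of /proc; on a live system leave it at the default.
    """
    base = os.path.join(proc_root, str(pid))
    snapshot = {}
    # status: state and memory figures; stat: raw counters;
    # wchan: kernel function the process sleeps in (if exposed);
    # cmdline: the command line, NUL-separated.
    for name in ("status", "stat", "wchan", "cmdline"):
        try:
            with open(os.path.join(base, name), "rb") as f:
                raw = f.read()
        except OSError as exc:
            # Missing or unreadable entries are recorded, not fatal.
            snapshot[name] = "<unreadable: %s>" % exc.__class__.__name__
            continue
        text = raw.replace(b"\0", b" ").decode("utf-8", "replace")
        snapshot[name] = text.strip()
    return snapshot


def process_state(snapshot):
    """Return the State field of the status text, or None if absent."""
    for line in snapshot.get("status", "").splitlines():
        if line.startswith("State:"):
            return line.split(":", 1)[1].strip()
    return None


if __name__ == "__main__" and len(sys.argv) == 2:
    snap = snapshot_process(int(sys.argv[1]))
    print("state:", process_state(snap))
    for key, value in snap.items():
        print("--- %s ---" % key)
        print(value)
```

Run it with the PID of the stuck process as the only argument and redirect the output to a file on a disk that survives the restart.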