How to track down the cause of Windows Server 2008 crashing?How can I remove specific events from the event log in Windows Server 2008?uptime.exe crashing on Windows Server 2008How to track down connectivity dropsWindows 2008 server unaccessible without traces in the event logVMware ESX virtual machine, Windows 2008 Server R2, memory full but process total nowhere near?Windows Server 2008 Shut down by itselfWindows 2008 started crashing regularly every weekWindows Server 2008 DNS Server DownWindows web server 2008 fatally crashing dailyAt my wits end. What could cause my server to randomly hard reset? (Seems to be related to ZFS)
Why would someone open a Netflix account using my Gmail address?
How to check if comma list is empty?
c++ conditional uni-directional iterator
How to not get blinded by an attack at dawn
Why when I add jam to my tea it stops producing thin "membrane" on top?
Can my American children re-enter the USA by International flight with a passport card? Being that their passport book has expired
Why can't I share a one use code with anyone else?
Does it matter what way the tires go if no directional arrow?
labelled end points on logic diagram
Holding rent money for my friend which amounts to over $10k?
Developers demotivated due to working on same project for more than 2 years
Should I communicate in my applications that I'm unemployed out of choice rather than because nobody will have me?
Do people who work at research institutes consider themselves "academics"?
What do you call the hair or body hair you trim off your body?
Windows 10 lock screen - display my own random images
Does addError() work outside of triggers?
What dog breeds survive the apocalypse for generations?
Will consteval functions allow template parameters dependent on function arguments?
Capital gains on stocks sold to take initial investment off the table
Can I say: "When was your train leaving?" if the train leaves in the future?
What is the effect of the Feeblemind spell on Ability Score Improvements?
Promotion comes with unexpected 24/7/365 on-call
Understanding Deutch's Algorithm
Does this "yield your space to an ally" rule my 3.5 group uses appear anywhere in the official rules?
How to track down the cause of Windows Server 2008 crashing?
How can I remove specific events from the event log in Windows Server 2008?uptime.exe crashing on Windows Server 2008How to track down connectivity dropsWindows 2008 server unaccessible without traces in the event logVMware ESX virtual machine, Windows 2008 Server R2, memory full but process total nowhere near?Windows Server 2008 Shut down by itselfWindows 2008 started crashing regularly every weekWindows Server 2008 DNS Server DownWindows web server 2008 fatally crashing dailyAt my wits end. What could cause my server to randomly hard reset? (Seems to be related to ZFS)
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;
I have Windows Server 2008 running under VMware.
Recently, its started to crash roughly every day, with continuous 100% CPU utilization, and no response in the GUI.
Is there a step-by-step technique to track down the source of this problem?
What logs would I look at?
p.s. The problem appeared around the time I tried to uninstall Acronis, and it blue screened. However, I'm not sure if the current faults are related to Acronis at all.
windows-server-2008 logging windows-event-log server-crashes
add a comment |
I have Windows Server 2008 running under VMware.
Recently, its started to crash roughly every day, with continuous 100% CPU utilization, and no response in the GUI.
Is there a step-by-step technique to track down the source of this problem?
What logs would I look at?
p.s. The problem appeared around the time I tried to uninstall Acronis, and it blue screened. However, I'm not sure if the current faults are related to Acronis at all.
windows-server-2008 logging windows-event-log server-crashes
add a comment |
I have Windows Server 2008 running under VMware.
Recently, its started to crash roughly every day, with continuous 100% CPU utilization, and no response in the GUI.
Is there a step-by-step technique to track down the source of this problem?
What logs would I look at?
p.s. The problem appeared around the time I tried to uninstall Acronis, and it blue screened. However, I'm not sure if the current faults are related to Acronis at all.
windows-server-2008 logging windows-event-log server-crashes
I have Windows Server 2008 running under VMware.
Recently, its started to crash roughly every day, with continuous 100% CPU utilization, and no response in the GUI.
Is there a step-by-step technique to track down the source of this problem?
What logs would I look at?
p.s. The problem appeared around the time I tried to uninstall Acronis, and it blue screened. However, I'm not sure if the current faults are related to Acronis at all.
windows-server-2008 logging windows-event-log server-crashes
windows-server-2008 logging windows-event-log server-crashes
edited Oct 7 '13 at 18:28
Contango
asked Jul 3 '09 at 11:38
ContangoContango
80031131
80031131
add a comment |
add a comment |
8 Answers
8
active
oldest
votes
You can also use the "Reliability and Performance Monitor" that is available under Windows Server 2008.
As you can see below, it automatically keeps a record of the reliability of the server, and assigns it a "reliability score" out of 10. This score starts at 10, and drops if the server experiences any crashes or unexpected shutdowns.
It even keeps a record of which programs were installed, and when, so you can diagnose if an installed program seemed to cause more faults.
You can also set it up to continuously log the CPU usage of programs, to see which program is causing the 100% CPU utilization.
add a comment |
If there is a crash-dump like c:windowsmemory.dmp you can use the WinDbg to analyze it. Usually you want to look for third party drivers in the dump. Step-by-step instructions can be found here.
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:15
In that page the link to the debugging tool is buried down the bottom somewhere. This is the direct link go.microsoft.com/fwlink/p/?LinkId=536682
– MikeKulls
May 11 '16 at 2:51
add a comment |
The System event log. The Application Event log. Google the message of the BSOD. Check the disk's integrity with chkdsk.
2
Problem is solved. Thanks for your comment - it helped me work out that the original problem was a bad SSD drive (it always crashed when it got to the same sector when I was running chkdsk).
– Contango
Jul 9 '09 at 17:16
add a comment |
You have two options:
- Look at records to try and figure out what caused past problems
- Look for signs of things that could lead to the CPU spikes in an attempt to replicate the problem
Logs are a good start for looking back at the history of the system, if you know the time where the problems start or the logs are quiet enough for you to notice a pattern leading to the pegged CPU. If the system BSOD you can throw the dmp's into windbg.
If you're looking for things that could lead to the CPU spikes:
- Process Explorer from sysinterals: look for odd processes or open handles to files or network shares that don't exist anymore. It may point you in the right direction to replicate the problem
- Windows Reliability and Performance Monitor / Perfmon: You can see how each process is acting in regards to Disk/CPU/Memory/Network usage as well as hundreds of other counters. They may give you a clue as to what is running away with the VM before it happens.
Once you have a good candidate for the problems you can turn on Process Monitor from sysinternals. It will dump every file and registry interaction that every process on the system is doing in real time. It can even be configured to load at boot and capture everything until you run the GUI next (be warned this is A LOT of data, so it's only advisable if you can replicate the problem quickly after boot)
There are a bunch of rabbit holes that an root cause analysis can take you down, feel free to let us know on how it goes.
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:14
add a comment |
If it is blue screening, check out the minidump file:
http://support.microsoft.com/kb/315271
... this will tell you (usually) the driver or piece of software that caused the crash.
Its not the BSOD. Thanks for the tip though - this will come in useful in the future.
– Contango
Jul 7 '09 at 8:58
add a comment |
2009-07-06 - I'm thinking its the hard drive.
I did a chkdsk, and it crashed with the same symptoms as before half way through the chkdsk. I'm using a Solid State Drive (SSD), the "PQI DK9128GD6R000A03 128GB SATA 2.5" SSD", with a MTBF of 1,500,000 hours. Despite having a MTBF of 133 years, it seem to have died after 2 weeks or normal use! To check my theory, I copied the VMware files to a standard hard drive. Ran chkdsk, and it worked like a charm. I'll see if the system survives a week of uptime, and if it does I can officially defenestrate my PQI SSD.
2009-07-07 - System crashed again. Back to the drawing board.
2009-07-08 - Rolled back a further 20 days to before I installed the SSD. We'll see if it crashes again (it did).
2009-07-09 - uninstalled OpenVPN, upgraded to the latest version of Skype, upgraded to SQL 2008 to SP1, removed TeamViewer. We'll see if it crashes again (it did, in the middle of an Acronis backup).
2009-07-09 - suspect that the amount of virtual memory available the VMware machine that runs the server is too small, I've got it at 4GB at the moment. Increasing it (this had no effect).
2009-07-09 - discovered that if the VMware container running Windows Server 2008 crashes with 100% CPU utilization, and I pause/restart it, then it uncrashes and resumes operation! This tends to point to a problem with VMware or its host OS (which is XP), rather than a problem within the Windows Server 2008 itself. Getting very close to the heart of the problem now.
2009-07-09 - Windows Server 2008 only crashes when the host OS is under very heavy load. Increased the number of CPU's it can utilize to 2 CPU's, this seems to have fixed the problem.
In conclusion:
- Original problem was caused by a bad hard drive with bad sectors (it was actually a 128GB SSD from PQI - wouldn't expect a Solid State Drive (SSD) to fail two weeks after purchase but this one did).
- Next problem was caused by the host OS that was running VMware coming under high load. Fixed this by allocating more RAM and increasing the size of the page file.
- If it happens again, I have a workaround (just pause/restart VMware v6.5 to "unfreeze" Windows Server 2008 running inside of it).
Problem solved, thanks guys!
add a comment |
Does it crash exactly each 24 hours (on the same time each day)?
If so, there is possibly a scheduled process that causes the crash.
Its not exactly every 24 hours; it seems to be random. Its roughly every day.
– Contango
Jul 6 '09 at 21:50
add a comment |
Could you please explain what do you mean by crash, is the server encountering BSOD or is it just hanging at 100% Cpu.
For troubleshooting, you can make the server log to a syslog server, run at intervals a script listing processes an their resources usage, writing his output to a network share.
If the server makes bsods try googling for the error code mentioned in the bsod.
Also, maybe the acronis uninstall left an error log with some informations in the installation folder.
Its not a BSOD. The VMware machine is hanging at 100% CPU, with a frozen mouse.
– Contango
Jul 7 '09 at 8:57
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "2"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fserverfault.com%2fquestions%2f35513%2fhow-to-track-down-the-cause-of-windows-server-2008-crashing%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
8 Answers
8
active
oldest
votes
8 Answers
8
active
oldest
votes
active
oldest
votes
active
oldest
votes
You can also use the "Reliability and Performance Monitor" that is available under Windows Server 2008.
As you can see below, it automatically keeps a record of the reliability of the server, and assigns it a "reliability score" out of 10. This score starts at 10, and drops if the server experiences any crashes or unexpected shutdowns.
It even keeps a record of which programs were installed, and when, so you can diagnose if an installed program seemed to cause more faults.
You can also set it up to continuously log the CPU usage of programs, to see which program is causing the 100% CPU utilization.
add a comment |
You can also use the "Reliability and Performance Monitor" that is available under Windows Server 2008.
As you can see below, it automatically keeps a record of the reliability of the server, and assigns it a "reliability score" out of 10. This score starts at 10, and drops if the server experiences any crashes or unexpected shutdowns.
It even keeps a record of which programs were installed, and when, so you can diagnose if an installed program seemed to cause more faults.
You can also set it up to continuously log the CPU usage of programs, to see which program is causing the 100% CPU utilization.
add a comment |
You can also use the "Reliability and Performance Monitor" that is available under Windows Server 2008.
As you can see below, it automatically keeps a record of the reliability of the server, and assigns it a "reliability score" out of 10. This score starts at 10, and drops if the server experiences any crashes or unexpected shutdowns.
It even keeps a record of which programs were installed, and when, so you can diagnose if an installed program seemed to cause more faults.
You can also set it up to continuously log the CPU usage of programs, to see which program is causing the 100% CPU utilization.
You can also use the "Reliability and Performance Monitor" that is available under Windows Server 2008.
As you can see below, it automatically keeps a record of the reliability of the server, and assigns it a "reliability score" out of 10. This score starts at 10, and drops if the server experiences any crashes or unexpected shutdowns.
It even keeps a record of which programs were installed, and when, so you can diagnose if an installed program seemed to cause more faults.
You can also set it up to continuously log the CPU usage of programs, to see which program is causing the 100% CPU utilization.
edited Oct 7 '13 at 18:49
answered Jul 10 '09 at 10:07
ContangoContango
80031131
80031131
add a comment |
add a comment |
If there is a crash-dump like c:windowsmemory.dmp you can use the WinDbg to analyze it. Usually you want to look for third party drivers in the dump. Step-by-step instructions can be found here.
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:15
In that page the link to the debugging tool is buried down the bottom somewhere. This is the direct link go.microsoft.com/fwlink/p/?LinkId=536682
– MikeKulls
May 11 '16 at 2:51
add a comment |
If there is a crash-dump like c:windowsmemory.dmp you can use the WinDbg to analyze it. Usually you want to look for third party drivers in the dump. Step-by-step instructions can be found here.
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:15
In that page the link to the debugging tool is buried down the bottom somewhere. This is the direct link go.microsoft.com/fwlink/p/?LinkId=536682
– MikeKulls
May 11 '16 at 2:51
add a comment |
If there is a crash-dump like c:windowsmemory.dmp you can use the WinDbg to analyze it. Usually you want to look for third party drivers in the dump. Step-by-step instructions can be found here.
If there is a crash-dump like c:windowsmemory.dmp you can use the WinDbg to analyze it. Usually you want to look for third party drivers in the dump. Step-by-step instructions can be found here.
edited May 4 at 14:06
community wiki
2 revs, 2 users 75%
Peter Hahndorf
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:15
In that page the link to the debugging tool is buried down the bottom somewhere. This is the direct link go.microsoft.com/fwlink/p/?LinkId=536682
– MikeKulls
May 11 '16 at 2:51
add a comment |
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:15
In that page the link to the debugging tool is buried down the bottom somewhere. This is the direct link go.microsoft.com/fwlink/p/?LinkId=536682
– MikeKulls
May 11 '16 at 2:51
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:15
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:15
In that page the link to the debugging tool is buried down the bottom somewhere. This is the direct link go.microsoft.com/fwlink/p/?LinkId=536682
– MikeKulls
May 11 '16 at 2:51
In that page the link to the debugging tool is buried down the bottom somewhere. This is the direct link go.microsoft.com/fwlink/p/?LinkId=536682
– MikeKulls
May 11 '16 at 2:51
add a comment |
The System event log. The Application Event log. Google the message of the BSOD. Check the disk's integrity with chkdsk.
2
Problem is solved. Thanks for your comment - it helped me work out that the original problem was a bad SSD drive (it always crashed when it got to the same sector when I was running chkdsk).
– Contango
Jul 9 '09 at 17:16
add a comment |
The System event log. The Application Event log. Google the message of the BSOD. Check the disk's integrity with chkdsk.
2
Problem is solved. Thanks for your comment - it helped me work out that the original problem was a bad SSD drive (it always crashed when it got to the same sector when I was running chkdsk).
– Contango
Jul 9 '09 at 17:16
add a comment |
The System event log. The Application Event log. Google the message of the BSOD. Check the disk's integrity with chkdsk.
The System event log. The Application Event log. Google the message of the BSOD. Check the disk's integrity with chkdsk.
answered Jul 3 '09 at 12:01
community wiki
Dave Markle
2
Problem is solved. Thanks for your comment - it helped me work out that the original problem was a bad SSD drive (it always crashed when it got to the same sector when I was running chkdsk).
– Contango
Jul 9 '09 at 17:16
add a comment |
2
Problem is solved. Thanks for your comment - it helped me work out that the original problem was a bad SSD drive (it always crashed when it got to the same sector when I was running chkdsk).
– Contango
Jul 9 '09 at 17:16
2
2
Problem is solved. Thanks for your comment - it helped me work out that the original problem was a bad SSD drive (it always crashed when it got to the same sector when I was running chkdsk).
– Contango
Jul 9 '09 at 17:16
Problem is solved. Thanks for your comment - it helped me work out that the original problem was a bad SSD drive (it always crashed when it got to the same sector when I was running chkdsk).
– Contango
Jul 9 '09 at 17:16
add a comment |
You have two options:
- Look at records to try and figure out what caused past problems
- Look for signs of things that could lead to the CPU spikes in an attempt to replicate the problem
Logs are a good start for looking back at the history of the system, if you know the time where the problems start or the logs are quiet enough for you to notice a pattern leading to the pegged CPU. If the system BSOD you can throw the dmp's into windbg.
If you're looking for things that could lead to the CPU spikes:
- Process Explorer from sysinterals: look for odd processes or open handles to files or network shares that don't exist anymore. It may point you in the right direction to replicate the problem
- Windows Reliability and Performance Monitor / Perfmon: You can see how each process is acting in regards to Disk/CPU/Memory/Network usage as well as hundreds of other counters. They may give you a clue as to what is running away with the VM before it happens.
Once you have a good candidate for the problems you can turn on Process Monitor from sysinternals. It will dump every file and registry interaction that every process on the system is doing in real time. It can even be configured to load at boot and capture everything until you run the GUI next (be warned this is A LOT of data, so it's only advisable if you can replicate the problem quickly after boot)
There are a bunch of rabbit holes that an root cause analysis can take you down, feel free to let us know on how it goes.
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:14
add a comment |
You have two options:
- Look at records to try and figure out what caused past problems
- Look for signs of things that could lead to the CPU spikes in an attempt to replicate the problem
Logs are a good start for looking back at the history of the system, if you know the time where the problems start or the logs are quiet enough for you to notice a pattern leading to the pegged CPU. If the system BSOD you can throw the dmp's into windbg.
If you're looking for things that could lead to the CPU spikes:
- Process Explorer from sysinterals: look for odd processes or open handles to files or network shares that don't exist anymore. It may point you in the right direction to replicate the problem
- Windows Reliability and Performance Monitor / Perfmon: You can see how each process is acting in regards to Disk/CPU/Memory/Network usage as well as hundreds of other counters. They may give you a clue as to what is running away with the VM before it happens.
Once you have a good candidate for the problems you can turn on Process Monitor from sysinternals. It will dump every file and registry interaction that every process on the system is doing in real time. It can even be configured to load at boot and capture everything until you run the GUI next (be warned this is A LOT of data, so it's only advisable if you can replicate the problem quickly after boot)
There are a bunch of rabbit holes that an root cause analysis can take you down, feel free to let us know on how it goes.
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:14
add a comment |
You have two options:
- Look at records to try and figure out what caused past problems
- Look for signs of things that could lead to the CPU spikes in an attempt to replicate the problem
Logs are a good start for looking back at the history of the system, if you know the time where the problems start or the logs are quiet enough for you to notice a pattern leading to the pegged CPU. If the system BSOD you can throw the dmp's into windbg.
If you're looking for things that could lead to the CPU spikes:
- Process Explorer from sysinterals: look for odd processes or open handles to files or network shares that don't exist anymore. It may point you in the right direction to replicate the problem
- Windows Reliability and Performance Monitor / Perfmon: You can see how each process is acting in regards to Disk/CPU/Memory/Network usage as well as hundreds of other counters. They may give you a clue as to what is running away with the VM before it happens.
Once you have a good candidate for the problems you can turn on Process Monitor from sysinternals. It will dump every file and registry interaction that every process on the system is doing in real time. It can even be configured to load at boot and capture everything until you run the GUI next (be warned this is A LOT of data, so it's only advisable if you can replicate the problem quickly after boot)
There are a bunch of rabbit holes that an root cause analysis can take you down, feel free to let us know on how it goes.
You have two options:
- Look at records to try and figure out what caused past problems
- Look for signs of things that could lead to the CPU spikes in an attempt to replicate the problem
Logs are a good start for looking back at the history of the system, if you know the time where the problems start or the logs are quiet enough for you to notice a pattern leading to the pegged CPU. If the system BSOD you can throw the dmp's into windbg.
If you're looking for things that could lead to the CPU spikes:
- Process Explorer from sysinterals: look for odd processes or open handles to files or network shares that don't exist anymore. It may point you in the right direction to replicate the problem
- Windows Reliability and Performance Monitor / Perfmon: You can see how each process is acting in regards to Disk/CPU/Memory/Network usage as well as hundreds of other counters. They may give you a clue as to what is running away with the VM before it happens.
Once you have a good candidate for the problems you can turn on Process Monitor from sysinternals. It will dump every file and registry interaction that every process on the system is doing in real time. It can even be configured to load at boot and capture everything until you run the GUI next (be warned this is A LOT of data, so it's only advisable if you can replicate the problem quickly after boot)
There are a bunch of rabbit holes that an root cause analysis can take you down, feel free to let us know on how it goes.
answered Jul 5 '09 at 8:13
BobBob
1,89432122
1,89432122
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:14
add a comment |
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:14
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:14
Problem is solved now. Thank you for your comments, it helped point me in the right direction.
– Contango
Jul 9 '09 at 17:14
add a comment |
If it is blue screening, check out the minidump file:
http://support.microsoft.com/kb/315271
... this will tell you (usually) the driver or piece of software that caused the crash.
Its not the BSOD. Thanks for the tip though - this will come in useful in the future.
– Contango
Jul 7 '09 at 8:58
add a comment |
If it is blue screening, check out the minidump file:
http://support.microsoft.com/kb/315271
... this will tell you (usually) the driver or piece of software that caused the crash.
Its not the BSOD. Thanks for the tip though - this will come in useful in the future.
– Contango
Jul 7 '09 at 8:58
add a comment |
If it is blue screening, check out the minidump file:
http://support.microsoft.com/kb/315271
... this will tell you (usually) the driver or piece of software that caused the crash.
If it is blue screening, check out the minidump file:
http://support.microsoft.com/kb/315271
... this will tell you (usually) the driver or piece of software that caused the crash.
answered Jul 3 '09 at 14:49
Adam BrandAdam Brand
5,48222439
5,48222439
Its not the BSOD. Thanks for the tip though - this will come in useful in the future.
– Contango
Jul 7 '09 at 8:58
add a comment |
Its not the BSOD. Thanks for the tip though - this will come in useful in the future.
– Contango
Jul 7 '09 at 8:58
Its not the BSOD. Thanks for the tip though - this will come in useful in the future.
– Contango
Jul 7 '09 at 8:58
Its not the BSOD. Thanks for the tip though - this will come in useful in the future.
– Contango
Jul 7 '09 at 8:58
add a comment |
2009-07-06 - I'm thinking its the hard drive.
I did a chkdsk, and it crashed with the same symptoms as before half way through the chkdsk. I'm using a Solid State Drive (SSD), the "PQI DK9128GD6R000A03 128GB SATA 2.5" SSD", with a MTBF of 1,500,000 hours. Despite having a MTBF of 133 years, it seem to have died after 2 weeks or normal use! To check my theory, I copied the VMware files to a standard hard drive. Ran chkdsk, and it worked like a charm. I'll see if the system survives a week of uptime, and if it does I can officially defenestrate my PQI SSD.
2009-07-07 - System crashed again. Back to the drawing board.
2009-07-08 - Rolled back a further 20 days to before I installed the SSD. We'll see if it crashes again (it did).
2009-07-09 - uninstalled OpenVPN, upgraded to the latest version of Skype, upgraded to SQL 2008 to SP1, removed TeamViewer. We'll see if it crashes again (it did, in the middle of an Acronis backup).
2009-07-09 - suspect that the amount of virtual memory available the VMware machine that runs the server is too small, I've got it at 4GB at the moment. Increasing it (this had no effect).
2009-07-09 - discovered that if the VMware container running Windows Server 2008 crashes with 100% CPU utilization, and I pause/restart it, then it uncrashes and resumes operation! This tends to point to a problem with VMware or its host OS (which is XP), rather than a problem within the Windows Server 2008 itself. Getting very close to the heart of the problem now.
2009-07-09 - Windows Server 2008 only crashes when the host OS is under very heavy load. Increased the number of CPU's it can utilize to 2 CPU's, this seems to have fixed the problem.
In conclusion:
- Original problem was caused by a bad hard drive with bad sectors (it was actually a 128GB SSD from PQI - wouldn't expect a Solid State Drive (SSD) to fail two weeks after purchase but this one did).
- Next problem was caused by the host OS that was running VMware coming under high load. Fixed this by allocating more RAM and increasing the size of the page file.
- If it happens again, I have a workaround (just pause/restart VMware v6.5 to "unfreeze" Windows Server 2008 running inside of it).
Problem solved, thanks guys!
add a comment |
2009-07-06 - I'm thinking its the hard drive.
I did a chkdsk, and it crashed with the same symptoms as before half way through the chkdsk. I'm using a Solid State Drive (SSD), the "PQI DK9128GD6R000A03 128GB SATA 2.5" SSD", with a MTBF of 1,500,000 hours. Despite having a MTBF of 133 years, it seem to have died after 2 weeks or normal use! To check my theory, I copied the VMware files to a standard hard drive. Ran chkdsk, and it worked like a charm. I'll see if the system survives a week of uptime, and if it does I can officially defenestrate my PQI SSD.
2009-07-07 - System crashed again. Back to the drawing board.
2009-07-08 - Rolled back a further 20 days to before I installed the SSD. We'll see if it crashes again (it did).
2009-07-09 - uninstalled OpenVPN, upgraded to the latest version of Skype, upgraded to SQL 2008 to SP1, removed TeamViewer. We'll see if it crashes again (it did, in the middle of an Acronis backup).
2009-07-09 - suspect that the amount of virtual memory available the VMware machine that runs the server is too small, I've got it at 4GB at the moment. Increasing it (this had no effect).
2009-07-09 - discovered that if the VMware container running Windows Server 2008 crashes with 100% CPU utilization, and I pause/restart it, then it uncrashes and resumes operation! This tends to point to a problem with VMware or its host OS (which is XP), rather than a problem within the Windows Server 2008 itself. Getting very close to the heart of the problem now.
2009-07-09 - Windows Server 2008 only crashes when the host OS is under very heavy load. Increased the number of CPU's it can utilize to 2 CPU's, this seems to have fixed the problem.
In conclusion:
- Original problem was caused by a bad hard drive with bad sectors (it was actually a 128GB SSD from PQI - wouldn't expect a Solid State Drive (SSD) to fail two weeks after purchase but this one did).
- Next problem was caused by the host OS that was running VMware coming under high load. Fixed this by allocating more RAM and increasing the size of the page file.
- If it happens again, I have a workaround (just pause/restart VMware v6.5 to "unfreeze" Windows Server 2008 running inside of it).
Problem solved, thanks guys!
add a comment |
2009-07-06 - I'm thinking its the hard drive.
I did a chkdsk, and it crashed with the same symptoms as before half way through the chkdsk. I'm using a Solid State Drive (SSD), the "PQI DK9128GD6R000A03 128GB SATA 2.5" SSD", with a MTBF of 1,500,000 hours. Despite having a MTBF of 133 years, it seem to have died after 2 weeks or normal use! To check my theory, I copied the VMware files to a standard hard drive. Ran chkdsk, and it worked like a charm. I'll see if the system survives a week of uptime, and if it does I can officially defenestrate my PQI SSD.
2009-07-07 - System crashed again. Back to the drawing board.
2009-07-08 - Rolled back a further 20 days to before I installed the SSD. We'll see if it crashes again (it did).
2009-07-09 - uninstalled OpenVPN, upgraded to the latest version of Skype, upgraded to SQL 2008 to SP1, removed TeamViewer. We'll see if it crashes again (it did, in the middle of an Acronis backup).
2009-07-09 - suspect that the amount of virtual memory available the VMware machine that runs the server is too small, I've got it at 4GB at the moment. Increasing it (this had no effect).
2009-07-09 - discovered that if the VMware container running Windows Server 2008 crashes with 100% CPU utilization, and I pause/restart it, then it uncrashes and resumes operation! This tends to point to a problem with VMware or its host OS (which is XP), rather than a problem within the Windows Server 2008 itself. Getting very close to the heart of the problem now.
2009-07-09 - Windows Server 2008 only crashes when the host OS is under very heavy load. Increased the number of CPU's it can utilize to 2 CPU's, this seems to have fixed the problem.
In conclusion:
- Original problem was caused by a bad hard drive with bad sectors (it was actually a 128GB SSD from PQI - wouldn't expect a Solid State Drive (SSD) to fail two weeks after purchase but this one did).
- Next problem was caused by the host OS that was running VMware coming under high load. Fixed this by allocating more RAM and increasing the size of the page file.
- If it happens again, I have a workaround (just pause/restart VMware v6.5 to "unfreeze" Windows Server 2008 running inside of it).
Problem solved, thanks guys!
2009-07-06 - I'm thinking its the hard drive.
I did a chkdsk, and it crashed with the same symptoms as before half way through the chkdsk. I'm using a Solid State Drive (SSD), the "PQI DK9128GD6R000A03 128GB SATA 2.5" SSD", with a MTBF of 1,500,000 hours. Despite having a MTBF of 133 years, it seem to have died after 2 weeks or normal use! To check my theory, I copied the VMware files to a standard hard drive. Ran chkdsk, and it worked like a charm. I'll see if the system survives a week of uptime, and if it does I can officially defenestrate my PQI SSD.
2009-07-07 - System crashed again. Back to the drawing board.
2009-07-08 - Rolled back a further 20 days to before I installed the SSD. We'll see if it crashes again (it did).
2009-07-09 - uninstalled OpenVPN, upgraded to the latest version of Skype, upgraded to SQL 2008 to SP1, removed TeamViewer. We'll see if it crashes again (it did, in the middle of an Acronis backup).
2009-07-09 - suspect that the amount of virtual memory available the VMware machine that runs the server is too small, I've got it at 4GB at the moment. Increasing it (this had no effect).
2009-07-09 - discovered that if the VMware container running Windows Server 2008 crashes with 100% CPU utilization, and I pause/restart it, then it uncrashes and resumes operation! This tends to point to a problem with VMware or its host OS (which is XP), rather than a problem within the Windows Server 2008 itself. Getting very close to the heart of the problem now.
2009-07-09 - Windows Server 2008 only crashes when the host OS is under very heavy load. Increased the number of CPU's it can utilize to 2 CPU's, this seems to have fixed the problem.
In conclusion:
- Original problem was caused by a bad hard drive with bad sectors (it was actually a 128GB SSD from PQI - wouldn't expect a Solid State Drive (SSD) to fail two weeks after purchase but this one did).
- Next problem was caused by the host OS that was running VMware coming under high load. Fixed this by allocating more RAM and increasing the size of the page file.
- If it happens again, I have a workaround (just pause/restart VMware v6.5 to "unfreeze" Windows Server 2008 running inside of it).
Problem solved, thanks guys!
edited Jul 9 '09 at 17:19
community wiki
10 revs
Gravitas
add a comment |
add a comment |
Does it crash exactly each 24 hours (on the same time each day)?
If so, there is possibly a scheduled process that causes the crash.
Its not exactly every 24 hours; it seems to be random. Its roughly every day.
– Contango
Jul 6 '09 at 21:50
add a comment |
Does it crash exactly each 24 hours (on the same time each day)?
If so, there is possibly a scheduled process that causes the crash.
Its not exactly every 24 hours; it seems to be random. Its roughly every day.
– Contango
Jul 6 '09 at 21:50
add a comment |
Does it crash exactly each 24 hours (on the same time each day)?
If so, there is possibly a scheduled process that causes the crash.
Does it crash exactly each 24 hours (on the same time each day)?
If so, there is possibly a scheduled process that causes the crash.
answered Jul 3 '09 at 12:04
community wiki
Gamecat
Its not exactly every 24 hours; it seems to be random. Its roughly every day.
– Contango
Jul 6 '09 at 21:50
add a comment |
Its not exactly every 24 hours; it seems to be random. Its roughly every day.
– Contango
Jul 6 '09 at 21:50
Its not exactly every 24 hours; it seems to be random. Its roughly every day.
– Contango
Jul 6 '09 at 21:50
Its not exactly every 24 hours; it seems to be random. Its roughly every day.
– Contango
Jul 6 '09 at 21:50
add a comment |
Could you please explain what do you mean by crash, is the server encountering BSOD or is it just hanging at 100% Cpu.
For troubleshooting, you can make the server log to a syslog server, run at intervals a script listing processes an their resources usage, writing his output to a network share.
If the server makes bsods try googling for the error code mentioned in the bsod.
Also, maybe the acronis uninstall left an error log with some informations in the installation folder.
Its not a BSOD. The VMware machine is hanging at 100% CPU, with a frozen mouse.
– Contango
Jul 7 '09 at 8:57
add a comment |
Could you please explain what do you mean by crash, is the server encountering BSOD or is it just hanging at 100% Cpu.
For troubleshooting, you can make the server log to a syslog server, run at intervals a script listing processes an their resources usage, writing his output to a network share.
If the server makes bsods try googling for the error code mentioned in the bsod.
Also, maybe the acronis uninstall left an error log with some informations in the installation folder.
Its not a BSOD. The VMware machine is hanging at 100% CPU, with a frozen mouse.
– Contango
Jul 7 '09 at 8:57
add a comment |
Could you please explain what do you mean by crash, is the server encountering BSOD or is it just hanging at 100% Cpu.
For troubleshooting, you can make the server log to a syslog server, run at intervals a script listing processes an their resources usage, writing his output to a network share.
If the server makes bsods try googling for the error code mentioned in the bsod.
Also, maybe the acronis uninstall left an error log with some informations in the installation folder.
Could you please explain what do you mean by crash, is the server encountering BSOD or is it just hanging at 100% Cpu.
For troubleshooting, you can make the server log to a syslog server, run at intervals a script listing processes an their resources usage, writing his output to a network share.
If the server makes bsods try googling for the error code mentioned in the bsod.
Also, maybe the acronis uninstall left an error log with some informations in the installation folder.
answered Jul 3 '09 at 12:26
MaxwellMaxwell
4,84812031
4,84812031
Its not a BSOD. The VMware machine is hanging at 100% CPU, with a frozen mouse.
– Contango
Jul 7 '09 at 8:57
add a comment |
Its not a BSOD. The VMware machine is hanging at 100% CPU, with a frozen mouse.
– Contango
Jul 7 '09 at 8:57
Its not a BSOD. The VMware machine is hanging at 100% CPU, with a frozen mouse.
– Contango
Jul 7 '09 at 8:57
Its not a BSOD. The VMware machine is hanging at 100% CPU, with a frozen mouse.
– Contango
Jul 7 '09 at 8:57
add a comment |
Thanks for contributing an answer to Server Fault!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fserverfault.com%2fquestions%2f35513%2fhow-to-track-down-the-cause-of-windows-server-2008-crashing%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown