Alpha just resets and reboots (321 Views)
Reply
Frequent Advisor
A.W.R
Posts: 191
Registered: ‎06-01-2006
Message 1 of 8 (321 Views)

Alpha just resets and reboots

Hi,

We have an ES40 Alpha server with 4 by EV6.7 (21264A) 667 processors, and 3GB of RAM on Tru64UNIX v5.1B rev 2650.

Periodically the system just resets and because we have the AUTO_ACTION parameter set to reboot it reboots. There are no errors in the binary error log and no messages in the messages files.

If anyone can provide input on why this maybe occurring it would be greatly appreciated.

Thanks
Andrew
Honored Contributor
Steven Schweda
Posts: 9,096
Registered: ‎02-23-2005
Message 2 of 8 (321 Views)

Re: Alpha just resets and reboots

> Periodically [...]

Do you really mean "periodically", that is,
"at regular intervals", like, say, every day
at 00:01, or do you really mean
"occasionally", as in "repeatedly and
unpredictably"?

Are you getting crash dumps?
Valued Contributor
John Manger
Posts: 56
Registered: ‎06-08-2005
Message 3 of 8 (321 Views)

Re: Alpha just resets and reboots

Its probably worthwhile attaching a serial console to capture what happens during these 'restarts'.

Also, check further back over recent weeks/months in the binary errlog and see if there were any h/w related warnings.

John M
Nobody can serve both God and Money
Frequent Advisor
A.W.R
Posts: 191
Registered: ‎06-01-2006
Message 4 of 8 (321 Views)

Re: Alpha just resets and reboots

Hi,

This happens at irregular intervals for no apparent reason that we can diagnose. The binary errlog is clean, there are no crash dumps. There are no messages in the messages file.

Thanks
Andrew
Honored Contributor
Steven Schweda
Posts: 9,096
Registered: ‎02-23-2005
Message 5 of 8 (321 Views)

Re: Alpha just resets and reboots

If it's just winking out and rebooting, with
no sign of an organized crash or shutdown,
then I'd tend to suspect a power problem
(supply interruption, or failing power
supply). I'm not sure how one might easily
test that bad-power-supply hypothesis,
however, other than swapping out the
suspect(s)
Honored Contributor
Vladimir Fabecic
Posts: 1,128
Registered: ‎03-03-2005
Message 6 of 8 (321 Views)

Re: Alpha just resets and reboots

ES40 machines had problems with RAM memory.
I had several cases like yours. Problem was bad memory DIMM.
Find some time for machine downtime and do memory tests.

>>> memexer 3

It can take a very long time to find failing DIMM.
In vino veritas, in VMS cluster
Honored Contributor
Honored Contributor
cnb
Posts: 1,197
Registered: ‎09-18-2008
Message 7 of 8 (321 Views)

Re: Alpha just resets and reboots

Most likely memory, an intermittent OCP or PS, but you can try some things first...

Use the console and RMC to check the system health via the env and status commands. Clear any ALERTS and look for anything out of spec. See these guides for reference:

http://h18000.www1.hp.com/alphaserver/download/es40fg_revb.pdf

http://h18000.www1.hp.com/alphaserver/download/es40og_revb.pdf


The next unplanned restart use RMC env to see what alerts are set.

Use '# consvar -s auto_action halt', the next time it halts, look in the NVRAM error logs via SRM commands.

Capture the following...

P00>>> show power
P00>>> cat el
P00>>> sho fru
P00>>> show error

Rgds,
Honored Contributor
Honored Contributor
cnb
Posts: 1,197
Registered: ‎09-18-2008
Message 8 of 8 (321 Views)

Re: Alpha just resets and reboots

Also use sys_check -escalate off hours to see if anything else looks weird.

Have you tried using evmget or evmwatch?

Just some more ideas...

Rgds,
The opinions expressed above are the personal opinions of the authors, not of HP. By using this site, you accept the Terms of Use and Rules of Participation.