Regular Advisor
smsc_1
Posts: 87
Registered: ‎11-19-2007
Message 1 of 8 (637 Views)

Many DEVICE_ERRORS on ERRLOG.SYS

Dear community,

Over the past few days, OpenVMS 8.4 has logged a lot of DEVICE_ERRORS.

What is not clear is what they refer to.

During one of the error bursts listed below, the application that runs on these machines hung because of I/O delays.

I have a two-node cluster, and both nodes log these errors. The disks are connected to an external MSA1000 controller, and the SHOW DEVICE command doesn't show any errors on the devices.

Could someone help me understand what is happening and how to find out what these errors refer to?

 

LOC1:USER> set def SYS$SYSROOT:[SYSERR]
LOC1:USER> analyze/err/elv tran /one_l /sin=5-DEC-2013
Output file SYS$OUTPUT: created at 17-DEC-2013 16:01:30.00


Output for SYS$SYSROOT:[SYSERR]ERRLOG.SYS;1

EVENT  EVENT_TYPE_____________________________  TIMESTAMP______________  NODE__  EVENT_CLASS____________________________
1      New File Created                          6-DEC-2013 00:12:10.01  LOC1  CONTROL_ENTRIES                        
2      System Configuration                      6-DEC-2013 00:12:10.01  LOC1  CONFIGURATION                          
3      Time Stamp                                6-DEC-2013 22:42:10.10  LOC1  CONTROL_ENTRIES                        
4      Device Error                              6-DEC-2013 22:50:18.11  LOC1  DEVICE_ERRORS                          
5      Device Error                              6-DEC-2013 22:50:18.11  LOC1  DEVICE_ERRORS                          
6      Device Error                              6-DEC-2013 22:50:18.20  LOC1  DEVICE_ERRORS                          
7      Device Error                              6-DEC-2013 22:50:18.26  LOC1  DEVICE_ERRORS                          
8      Device Error                              6-DEC-2013 22:50:19.45  LOC1  DEVICE_ERRORS                          
9      Device Error                              6-DEC-2013 22:50:20.46  LOC1  DEVICE_ERRORS                          
10     Device Error                              6-DEC-2013 22:50:49.24  LOC1  DEVICE_ERRORS                          
11     Time Stamp                                6-DEC-2013 22:52:10.10  LOC1  CONTROL_ENTRIES                        
12     Device Error                              6-DEC-2013 22:56:58.38  LOC1  DEVICE_ERRORS                          
13     Device Error                              6-DEC-2013 22:56:58.53  LOC1  DEVICE_ERRORS                          
14     Device Error                              6-DEC-2013 22:56:59.20  LOC1  DEVICE_ERRORS                          
15     Time Stamp                                6-DEC-2013 23:02:10.10  LOC1  CONTROL_ENTRIES                        
16     Device Error                              6-DEC-2013 23:10:37.40  LOC1  DEVICE_ERRORS                          
17     Device Error                              6-DEC-2013 23:10:37.40  LOC1  DEVICE_ERRORS                          
18     Device Error                              6-DEC-2013 23:10:37.41  LOC1  DEVICE_ERRORS                          
19     Time Stamp                                8-DEC-2013 20:22:10.25  LOC1  CONTROL_ENTRIES                        
20     Device Error                              8-DEC-2013 20:27:24.90  LOC1  DEVICE_ERRORS                          
21     Device Error                              8-DEC-2013 20:27:24.90  LOC1  DEVICE_ERRORS                          
22     Device Error                              8-DEC-2013 20:27:25.11  LOC1  DEVICE_ERRORS                          
23     Device Error                              8-DEC-2013 20:27:25.16  LOC1  DEVICE_ERRORS                          
24     Device Error                              8-DEC-2013 20:27:25.80  LOC1  DEVICE_ERRORS                          
25     Device Error                              8-DEC-2013 20:27:26.23  LOC1  DEVICE_ERRORS                          
26     Device Error                              8-DEC-2013 20:27:26.24  LOC1  DEVICE_ERRORS                          
27     Device Error                              8-DEC-2013 20:27:26.29  LOC1  DEVICE_ERRORS                          
28     Device Error                              8-DEC-2013 20:27:26.84  LOC1  DEVICE_ERRORS                          
29     Device Error                              8-DEC-2013 20:27:27.20  LOC1  DEVICE_ERRORS                          
30     Device Error                              8-DEC-2013 20:27:27.77  LOC1  DEVICE_ERRORS                          
31     Device Error                              8-DEC-2013 20:28:49.69  LOC1  DEVICE_ERRORS                          
32     Time Stamp                                8-DEC-2013 20:32:10.25  LOC1  CONTROL_ENTRIES                        
33     Device Error                              8-DEC-2013 20:41:58.47  LOC1  DEVICE_ERRORS                          
34     Device Error                              8-DEC-2013 20:41:58.67  LOC1  DEVICE_ERRORS                          
35     Time Stamp                                8-DEC-2013 21:02:10.25  LOC1  CONTROL_ENTRIES                        
36     Device Error                              8-DEC-2013 21:10:37.88  LOC1  DEVICE_ERRORS                          
37     Device Error                              8-DEC-2013 21:10:37.88  LOC1  DEVICE_ERRORS                          
38     Device Error                              8-DEC-2013 21:10:37.88  LOC1  DEVICE_ERRORS                          
39     Device Error                              8-DEC-2013 21:10:37.88  LOC1  DEVICE_ERRORS                          
40     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
41     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
42     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
43     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
44     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
45     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
46     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
47     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
48     Device Error                              8-DEC-2013 21:10:37.89  LOC1  DEVICE_ERRORS                          
49     Device Error                              8-DEC-2013 21:10:37.90  LOC1  DEVICE_ERRORS                          
50     Device Error                              8-DEC-2013 21:10:37.90  LOC1  DEVICE_ERRORS                          
51     Time Stamp                               13-DEC-2013 03:32:10.68  LOC1  CONTROL_ENTRIES                        
52     Device Error                             13-DEC-2013 03:35:36.93  LOC1  DEVICE_ERRORS                          
53     Device Error                             13-DEC-2013 03:35:36.99  LOC1  DEVICE_ERRORS                          
54     Device Error                             13-DEC-2013 03:35:39.20  LOC1  DEVICE_ERRORS                          
55     Device Error                             13-DEC-2013 03:35:40.37  LOC1  DEVICE_ERRORS                          
56     Device Error                             13-DEC-2013 03:35:43.50  LOC1  DEVICE_ERRORS                          
57     Device Error                             13-DEC-2013 03:35:53.60  LOC1  DEVICE_ERRORS                          
58     Device Error                             13-DEC-2013 03:41:58.33  LOC1  DEVICE_ERRORS                          
59     Device Error                             13-DEC-2013 03:41:58.44  LOC1  DEVICE_ERRORS                          
60     Device Error                             13-DEC-2013 03:41:58.61  LOC1  DEVICE_ERRORS                          
61     Time Stamp                               13-DEC-2013 04:02:10.68  LOC1  CONTROL_ENTRIES                        
62     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
63     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
64     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
65     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
66     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
67     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
68     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
69     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
70     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
71     Device Error                             13-DEC-2013 04:10:39.64  LOC1  DEVICE_ERRORS                          
72     Device Error                             13-DEC-2013 04:10:39.65  LOC1  DEVICE_ERRORS                          
73     Device Error                             13-DEC-2013 04:10:39.65  LOC1  DEVICE_ERRORS                          
74     Device Error                             13-DEC-2013 04:10:39.65  LOC1  DEVICE_ERRORS                          
75     Device Error                             13-DEC-2013 04:10:39.65  LOC1  DEVICE_ERRORS                          
76     Device Error                             13-DEC-2013 04:10:39.65  LOC1  DEVICE_ERRORS                          
77     Time Stamp                               17-DEC-2013 15:52:11.15  LOC1  CONTROL_ENTRIES                        


ERROR_LOG_SUMMARY______________________________________________________

Total number of events:                         77
Number of the first event:                      1
Number of the last event:                       77
Earliest event occurred:                         6-DEC-2013 00:12:10.01
Latest event occurred:                          17-DEC-2013 15:52:11.15
Number of events by event class:       
        CONFIGURATION                           1
        CONTROL_ENTRIES                         10
        DEVICE_ERRORS                           66
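
Even without a decoder for the individual entries, a one-line listing like the one above shows when the errors cluster. The following is a minimal Python sketch, not part of any HP tool, that groups the Device Error timestamps into bursts. The function names, the 60-second gap threshold, and the assumption that every entry follows the EVENT / EVENT_TYPE / TIMESTAMP / NODE / EVENT_CLASS layout shown above are illustrative choices only.

```python
import re
from datetime import datetime, timedelta

# VMS-style month abbreviations, mapped by hand to avoid locale-dependent parsing.
MONTHS = {name: i for i, name in enumerate(
    "JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC".split(), start=1)}

# One entry of the one-line listing, e.g.
# "4      Device Error      6-DEC-2013 22:50:18.11  LOC1  DEVICE_ERRORS"
ENTRY_RE = re.compile(
    r"^\s*\d+\s+(?P<type>.+?)\s+"
    r"(?P<day>\d{1,2})-(?P<mon>[A-Z]{3})-(?P<year>\d{4})\s+"
    r"(?P<h>\d{2}):(?P<m>\d{2}):(?P<s>\d{2})\.(?P<cs>\d{2})\s+"
    r"(?P<node>\S+)\s+(?P<cls>\S+)\s*$"
)


def parse_device_errors(lines):
    """Return the timestamps of all DEVICE_ERRORS entries, in listing order."""
    stamps = []
    for line in lines:
        match = ENTRY_RE.match(line)
        # Skip headers, summary lines, and entries of other event classes.
        if not match or match["cls"] != "DEVICE_ERRORS":
            continue
        month = MONTHS.get(match["mon"])
        if month is None:
            continue
        stamps.append(datetime(
            int(match["year"]), month, int(match["day"]),
            int(match["h"]), int(match["m"]), int(match["s"]),
            int(match["cs"]) * 10000,  # hundredths of a second -> microseconds
        ))
    return stamps


def group_bursts(stamps, max_gap=timedelta(seconds=60)):
    """Group consecutive timestamps whose spacing is at most max_gap."""
    bursts = []
    for stamp in stamps:
        if bursts and stamp - bursts[-1][-1] <= max_gap:
            bursts[-1].append(stamp)
        else:
            bursts.append([stamp])
    return bursts


if __name__ == "__main__":
    # A few entries copied from the listing above.
    sample = [
        "3      Time Stamp      6-DEC-2013 22:42:10.10  LOC1  CONTROL_ENTRIES",
        "4      Device Error    6-DEC-2013 22:50:18.11  LOC1  DEVICE_ERRORS",
        "10     Device Error    6-DEC-2013 22:50:49.24  LOC1  DEVICE_ERRORS",
        "12     Device Error    6-DEC-2013 22:56:58.38  LOC1  DEVICE_ERRORS",
    ]
    for burst in group_bursts(parse_device_errors(sample)):
        print(burst[0], "->", burst[-1], f"({len(burst)} errors)")
```

Saving the listing to a text file and passing its lines to `parse_device_errors` would give the same grouping for the whole log. Bursts of many entries within the same hundredth of a second usually point at a shared path or component rather than at a single disk, but that is only a heuristic.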

 

./ Lucas
Honored Contributor
Volker Halle
Posts: 5,209
Registered: ‎04-26-2004
Message 2 of 8 (630 Views)

Re: Many DEVICE_ERRORS on ERRLOG.SYS

Lucas,

 

forget about ANALYZE/ERROR/ELV; this tool is useless for translating most entries in ERRLOG.SYS.

Get and install DECevent V3.4 to decode disk-related errors. Or get an older version of the WEBES SEA (System Event Analyzer) tool, one that does not need a Windows Server to decode OpenVMS errorlog entries.

 

Volker.

Regular Advisor
smsc_1
Posts: 87
Registered: ‎11-19-2007
Message 3 of 8 (626 Views)

Re: Many DEVICE_ERRORS on ERRLOG.SYS

Thanks for the reply, Volker,

could you please point me to the download links for both DECevent V3.4 and WEBES SEA?

 

Many thanks

Lucas

./ Lucas
Honored Contributor
labadie_1
Posts: 1,221
Registered: ‎08-07-2003
Message 4 of 8 (620 Views)

Re: Many DEVICE_ERRORS on ERRLOG.SYS

You need to have a support contract with HP, and then HP will send you the DECevent kits.

Regular Advisor
smsc_1
Posts: 87
Registered: ‎11-19-2007
Message 5 of 8 (608 Views)

Re: Many DEVICE_ERRORS on ERRLOG.SYS

Well, unfortunately I don't have a contract right now. Can I upload the ERRLOG.SYS?

Does someone have the right software to analyze it?

 

Thanks

./ Lucas
Respected Contributor
Bob Blunt
Posts: 314
Registered: ‎05-01-2003
Message 6 of 8 (570 Views)

Re: Many DEVICE_ERRORS on ERRLOG.SYS

Lucas, you didn't mention what hardware you're using. DECevent is essentially useless for any Integrity systems and, in my experience, it won't help much with errors logged on OpenVMS V8.4. The format of the errors logged changed (to a "Common Event Header" format that DECevent doesn't understand without help). And, unfortunately, unless your machine(s) are under warranty there isn't anywhere to upload your ERRLOG.SYS for analysis. You might try opening a call with HP, using the serial number from the misbehaving system, and they'll let you know if your system is still covered.

 

bob

Honored Contributor
Volker Halle
Posts: 5,209
Registered: ‎04-26-2004
Message 7 of 8 (566 Views)

Re: Many DEVICE_ERRORS on ERRLOG.SYS

Bob,

 

the "Common Event Header" was introduced in OpenVMS Alpha V7.1-2 and caused the standard ANALYZE/ERROR_LOG to fail with '%ERF-F-CEHFND, New header format found. Install DECevent and run conversion utility'

DECevent V3.4 (the latest - but long retired - version) can still properly decode most disk-device-related errlog entries - even for Itanium systems. For CPU-related errlog entries, translation support in DECevent ended with AlphaServer GS140 6/xxx systems.

 

Volker.

Regular Advisor
smsc_1
Posts: 87
Registered: ‎11-19-2007
Message 8 of 8 (536 Views)

Re: Many DEVICE_ERRORS on ERRLOG.SYS

Well, I finally discovered the fault.

One of the MSA1000 Brocade switches was faulty (it kept going up and down continuously). Thanks, all, for the tips, especially about the HP software, which I'll ask HP for directly when they replace the switch.

 

BR

./ Lucas