Archives of the TeradataForum
Message Posted: Wed, 10 Sep 2003 @ 16:29:15 GMT
Subj: | | V2R5.00.21 Crash |
|
From: | | Aniruddha Mitra |
Hi:
We have a v2r5.00.21 running on win 2k sp3 (8 AMPS, and 4 PE/Dell 6650, 4 x 2 GHz, and 4096 MB RAM). The database behaves strange from
time to time. Pdestate -a says database is running, users are connected, but no one can connect. Bteq says rdbms crashed. On vprocmanager,
all amps are online, and crashcount is 0. tpareset -f hangs for ever. However reset from multitool works. AFter restart database behaves
fine. If I do a tpatrace, there is only one message repeated again and again. ANy idea what is wrong??
Thanks in advance for your help.
Aniruddha Mitra
08/29/03 19:13:41.00 (0/0 e5c) Waiting for 0/7 to exit... done in 0 seconds.
08/29/03 19:13:43.00 (0/0 1a34) Dump for PARTITION starting... FAILed with TIMEOUT in 420 seconds.
08/29/03 19:20:43.00 (0/0 11bc) Waiting for 0/7 to exit... done in 0 seconds.
08/29/03 19:20:43.00 (0/0 10f8) Dump for PARTITION starting... FAILed with TIMEOUT in 420 seconds.
08/29/03 19:27:43.00 (0/0 7c8) Dump for PARTITION starting... FAILed with TIMEOUT in 420 seconds.
08/29/03 19:34:48.00 (0/0 e5c) Dump for PARTITION starting... FAILed with TIMEOUT in 420 seconds.
08/29/03 19:41:48.00 (0/0 10f8) Waiting for 0/7 to exit... done in 0 seconds.
08/29/03 19:48:26.00 (0/0 1aec) Dump for PARTITION starting... FAILed with TIMEOUT in 420 seconds.
08/29/03 19:55:26.00 (0/0 1a34) Waiting for 0/7 to exit... done in 0 seconds.
09/02/03 09:43:09.00 (0/0 788) ===> Restart initiated by VPROCERROR event ERRSYSRESTART (10198).
09/02/03 09:43:09.00 (0/0 668) Reset received from 001-01 w/flags FORCE for ERRSYSRESTART (10198) in vproc 16384.
09/02/03 09:43:09.00 (0/0 668) Crash count/ceiling = 0/3
09/02/03 09:43:09.00 (0/0 668) Local system crashcount -> CLEAR.
09/02/03 09:43:09.00 (0/0 668) State is RESET/BEGIN.
09/02/03 09:43:10.00 (0/0 1b0c) ---- Reset from node 001-01 starting.
09/02/03 09:43:10.00 (0/0 1b0c) Crash count/ceiling = 1/3
09/02/03 09:43:10.00 (0/0 1b0c) State is RESET/STOPTASKS.
09/02/03 09:43:10.00 (0/0 1b0c) Stopping 58 programs... done in 0 seconds.
09/02/03 09:43:10.00 (0/0 1b0c) State is RESET/KILLTASKS.
09/02/03 09:43:11.00 (0/0 1b0c) Reset killing 58 programs...
09/02/03 09:43:11.00 (0/0 1b0c) Waiting for 0/2 to exit... done in 4 seconds.
09/02/03 09:43:15.00 (0/0 1b0c) Waiting for 13/10 to exit... done in 2 seconds.
09/02/03 09:43:17.00 (0/0 1b0c) Waiting for 07f4 to exit... done in 1 second.
09/02/03 09:43:18.00 (0/0 1b0c) Reset killing done in 7 seconds.
09/02/03 09:43:18.00 (0/0 1b0c) State is RESET/NETWORK.
09/02/03 09:43:18.00 (0/0 1b0c) State is RESET/VPROCSTOP.
09/02/03 09:43:18.00 (0/0 1b0c) Stopping vproc in slot 13(8192) 12(16383) 11
(16382) 10(16381) 9(16380) 8(7) 7(6) 6(5) 5(4) 4(3) 3(2) 2(1) 1(0)
09/02/03 09:43:22.00 (0/0 1b0c) State is RESET/FINAL.
09/02/03 09:43:22.00 (0/0 1b0c) Reset done in 12 seconds.
09/02/03 09:43:22.00 (0/0 1b0c) State is RESET/SAVETRACE.
09/02/03 09:43:22.00 (0/0 1b0c) ---- PDE starting.
09/02/03 09:43:22.00 (0/0 1b0c) State is START/BEGIN.
There are also a lot of 3610 errors
|