Please start any new threads on our new site at https://forums.sqlteam.com. We've got lots of great SQL Server experts to answer whatever question you can come up with.

 All Forums
 SQL Server 2005 Forums
 SQL Server Administration (2005)
 Cluster didn't fail over

Author  Topic 

AnAgSh
Starting Member

18 Posts

Posted - 2007-10-14 : 14:51:00
We have Active/Passive cluster on Windows 2003. It is a 2 nodes cluster. The active server came down and the Cluster didn't failover. Then we had to manually failover. Can you help me to decipher the Cluster.log to find out why it didn't failover?

00000d08.000012a8::2007/10/12-04:04:40.847 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000d08.000012a8::2007/10/12-04:04:40.847 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = HYT00; native error = 0; message = [Microsoft][SQL Native Client]Query timeout expired
00000d08.000012a8::2007/10/12-04:04:40.847 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [CP] CppRegNotifyThread checkpointing key Software\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLSERVER to id 4 due to timer
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\, CLS, 13522 => C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [CP] CpSaveData: checkpointing data id 4 to quorum node 1
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp to file Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsCreateDirectory Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182, status 183
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsOpenFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp => 3, 36cf960 status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsOpenFile Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT => 2, 36cf8d0 status 183
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] ReadFile 928 (regf) 32768 16384, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] WriteFile 95c (regf) 16384, status 0 (0=>0)
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] ReadFile 928 (regf) 32768 0, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsFlushBuffers 95c, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsCloseHandle 928, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsCloseHandle 95c, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [CP] CppRegNotifyThread checkpointing key Software\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLSERVER to id 4 due to timer
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\, CLS, 13523 => C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [CP] CpSaveData: checkpointing data id 4 to quorum node 1
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp to file Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsCreateDirectory Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182, status 183
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsOpenFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp => 3, 36cf960 status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsOpenFile Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT => 2, 36cf8d0 status 183
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] ReadFile 95c (regf) 32768 16384, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] WriteFile 928 (regf) 16384, status 0 (0=>0)
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] ReadFile 95c (regf) 32768 0, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsFlushBuffers 928, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsCloseHandle 95c, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsCloseHandle 928, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [CP] CppRegNotifyThread checkpointing key Software\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLSERVER to id 4 due to timer
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\, CLS, 13524 => C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [CP] CpSaveData: checkpointing data id 4 to quorum node 1
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp to file Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsCreateDirectory Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182, status 183
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsOpenFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp => 3, 36cf960 status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsOpenFile Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT => 2, 36cf8d0 status 183
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] ReadFile 7e0 (regf) 32768 16384, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] WriteFile 95c (regf) 16384, status 0 (0=>0)
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] ReadFile 7e0 (regf) 32768 0, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsFlushBuffers 95c, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsCloseHandle 7e0, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsCloseHandle 95c, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp, status 0
0000077c.00000844::2007/10/12-04:09:11.323 INFO [Qfs] GetDiskFreeSpaceEx Q:\MSCS\, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [CP] CppRegNotifyThread checkpointing key Software\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLSERVER to id 4 due to timer
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\, CLS, 13525 => C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [CP] CpSaveData: checkpointing data id 4 to quorum node 1
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp to file Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsCreateDirectory Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182, status 183
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsOpenFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp => 3, 36cf960 status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsOpenFile Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT => 2, 36cf8d0 status 183
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] ReadFile 928 (regf) 32768 16384, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] WriteFile 7e0 (regf) 16384, status 0 (0=>0)
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] ReadFile 928 (regf) 32768 0, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsFlushBuffers 7e0, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsCloseHandle 928, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsCloseHandle 7e0, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp, status

eyechart
Master Smack Fu Yak Hacker

3575 Posts

Posted - 2007-10-14 : 15:19:26
you need to open a ticket with microsoft professional services for this.



-ec
Go to Top of Page

rmiao
Master Smack Fu Yak Hacker

7266 Posts

Posted - 2007-10-14 : 16:31:07
Is cluster service running on passive node at that time? Did you check windows event logs for related errors?
Go to Top of Page

AnAgSh
Starting Member

18 Posts

Posted - 2007-10-14 : 18:16:54
It was running in the Active node. Here is the event log. This is the MS bug which requires a hotfix. We are planning to apply this fix. But we would like to know why cluster didn't failover

Event Type: Error
Event Source: MSSQLSERVER
Event Category: (2)
Event ID: 602
Date: 10/13/2007
Time: 9:04:40 PM
User: N/A
Computer: ********
Description:
Could not find an entry for table or index with partition ID 426772151009280 in database 2. This error can occur if a stored procedure references a dropped table, or metadata is corrupted. Drop and re-create the stored procedure, or execute DBCC CHECKDB.
Go to Top of Page

eyechart
Master Smack Fu Yak Hacker

3575 Posts

Posted - 2007-10-14 : 20:24:32
you probably should opena ticket with microsoft.



-ec
Go to Top of Page

rmiao
Master Smack Fu Yak Hacker

7266 Posts

Posted - 2007-10-14 : 20:30:53
This error is not related to cluster. You shouls look for message around time that active node went down. Did you see sql server log that started around that time?
Go to Top of Page
   

- Advertisement -