| Author |
Topic |
|
AnAgSh
Starting Member
18 Posts |
Posted - 2007-10-14 : 14:51:00
|
| We have an Active/Passive cluster on Windows 2003. It is a two-node cluster. The active server went down and the cluster didn't fail over, so we had to fail over manually. Can you help me decipher the Cluster.log to find out why it didn't fail over?

00000d08.000012a8::2007/10/12-04:04:40.847 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000d08.000012a8::2007/10/12-04:04:40.847 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = HYT00; native error = 0; message = [Microsoft][SQL Native Client]Query timeout expired
00000d08.000012a8::2007/10/12-04:04:40.847 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [CP] CppRegNotifyThread checkpointing key Software\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLSERVER to id 4 due to timer
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\, CLS, 13522 => C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [CP] CpSaveData: checkpointing data id 4 to quorum node 1
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp to file Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsCreateDirectory Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182, status 183
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsOpenFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp => 3, 36cf960 status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsOpenFile Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT => 2, 36cf8d0 status 183
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] ReadFile 928 (regf) 32768 16384, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] WriteFile 95c (regf) 16384, status 0 (0=>0)
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] ReadFile 928 (regf) 32768 0, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsFlushBuffers 95c, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsCloseHandle 928, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsCloseHandle 95c, status 0
0000077c.000003dc::2007/10/12-04:04:43.785 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D2.tmp, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [CP] CppRegNotifyThread checkpointing key Software\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLSERVER to id 4 due to timer
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\, CLS, 13523 => C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [CP] CpSaveData: checkpointing data id 4 to quorum node 1
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp to file Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsCreateDirectory Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182, status 183
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsOpenFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp => 3, 36cf960 status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsOpenFile Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT => 2, 36cf8d0 status 183
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] ReadFile 95c (regf) 32768 16384, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] WriteFile 928 (regf) 16384, status 0 (0=>0)
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] ReadFile 95c (regf) 32768 0, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsFlushBuffers 928, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsCloseHandle 95c, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsCloseHandle 928, status 0
0000077c.000003dc::2007/10/12-04:06:23.084 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D3.tmp, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [CP] CppRegNotifyThread checkpointing key Software\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLSERVER to id 4 due to timer
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\, CLS, 13524 => C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [CP] CpSaveData: checkpointing data id 4 to quorum node 1
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp to file Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsCreateDirectory Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182, status 183
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsOpenFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp => 3, 36cf960 status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsOpenFile Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT => 2, 36cf8d0 status 183
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] ReadFile 7e0 (regf) 32768 16384, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] WriteFile 95c (regf) 16384, status 0 (0=>0)
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] ReadFile 7e0 (regf) 32768 0, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsFlushBuffers 95c, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsCloseHandle 7e0, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsCloseHandle 95c, status 0
0000077c.000003dc::2007/10/12-04:08:08.259 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D4.tmp, status 0
0000077c.00000844::2007/10/12-04:09:11.323 INFO [Qfs] GetDiskFreeSpaceEx Q:\MSCS\, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [CP] CppRegNotifyThread checkpointing key Software\Microsoft\Microsoft SQL Server\MSSQL.1\MSSQLSERVER to id 4 due to timer
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsGetTempFileName C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\, CLS, 13525 => C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsRegSaveKey C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [CP] CpSaveData: checkpointing data id 4 to quorum node 1
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [CP] CppWriteCheckpoint checkpointing file C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp to file Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsCreateDirectory Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182, status 183
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsOpenFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp => 3, 36cf960 status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsOpenFile Q:\MSCS\0592a048-5457-426a-a46d-996441ef0182\00000004.CPT => 2, 36cf8d0 status 183
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] ReadFile 928 (regf) 32768 16384, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] WriteFile 7e0 (regf) 16384, status 0 (0=>0)
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] ReadFile 928 (regf) 32768 0, (0=>0) 0 status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsFlushBuffers 7e0, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsCloseHandle 928, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsCloseHandle 7e0, status 0
0000077c.000003dc::2007/10/12-04:09:56.652 INFO [Qfs] QfsDeleteFile C:\DOCUME~1\SVC_CL~1\LOCALS~1\Temp\CLS34D5.tmp, status |
|
|
eyechart
Master Smack Fu Yak Hacker
3575 Posts |
Posted - 2007-10-14 : 15:19:26
|
| You need to open a ticket with Microsoft professional services for this. -ec |
 |
|
|
rmiao
Master Smack Fu Yak Hacker
7266 Posts |
Posted - 2007-10-14 : 16:31:07
|
| Was the cluster service running on the passive node at that time? Did you check the Windows event logs for related errors? |
 |
|
|
AnAgSh
Starting Member
18 Posts |
Posted - 2007-10-14 : 18:16:54
|
| It was running on the active node. Here is the event log. This is an MS bug that requires a hotfix, and we are planning to apply it. But we would like to know why the cluster didn't fail over.

Event Type: Error
Event Source: MSSQLSERVER
Event Category: (2)
Event ID: 602
Date: 10/13/2007
Time: 9:04:40 PM
User: N/A
Computer: ********
Description: Could not find an entry for table or index with partition ID 426772151009280 in database 2. This error can occur if a stored procedure references a dropped table, or metadata is corrupted. Drop and re-create the stored procedure, or execute DBCC CHECKDB. |
 |
|
|
eyechart
Master Smack Fu Yak Hacker
3575 Posts |
Posted - 2007-10-14 : 20:24:32
|
| You probably should open a ticket with Microsoft. -ec |
 |
|
|
rmiao
Master Smack Fu Yak Hacker
7266 Posts |
Posted - 2007-10-14 : 20:30:53
|
| This error is not related to the cluster. You should look for messages around the time that the active node went down. Did you see a SQL Server log that started around that time? |
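One way to narrow a Cluster.log down to the messages around the time the node went down is a small filter script. The sketch below is only an illustration: it assumes every record starts with the `<process id>.<thread id>::<timestamp> <LEVEL>` header seen in the log pasted earlier in this thread, and that the levels are `INFO`, `WARN`, and `ERR`. The names `parse_records` and `records_near` are hypothetical helpers, not part of any Microsoft tool.

```python
import re
from datetime import datetime, timedelta

# Record header as it appears in the log pasted in this thread (assumed format):
# <process id>.<thread id>::<yyyy/mm/dd-hh:mm:ss.mmm> <LEVEL> <message>
RECORD_HEADER = re.compile(
    r"[0-9a-f]{8}\.[0-9a-f]{8}::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<level>INFO|WARN|ERR)\s+"
)


def parse_records(text):
    """Split raw log text into (timestamp, level, message) tuples.

    Splitting on the header pattern rather than on newlines also copes
    with records that were pasted without line breaks between them.
    """
    headers = list(RECORD_HEADER.finditer(text))
    records = []
    for i, match in enumerate(headers):
        # A message runs from the end of its header to the next header.
        end = headers[i + 1].start() if i + 1 < len(headers) else len(text)
        ts = datetime.strptime(match.group("ts"), "%Y/%m/%d-%H:%M:%S.%f")
        records.append((ts, match.group("level"), text[match.end():end].strip()))
    return records


def records_near(records, when, minutes=5, levels=("ERR", "WARN")):
    """Keep records with the given levels within +/- `minutes` of `when`."""
    window = timedelta(minutes=minutes)
    return [r for r in records if r[1] in levels and abs(r[0] - when) <= window]
```

For example, reading the log file into a string, passing it to `parse_records`, and then calling `records_near(records, datetime(2007, 10, 12, 4, 5))` would keep only the `ERR` and `WARN` records within five minutes of 04:05 on that day. Note that the Cluster.log timestamps may be in a different time zone than the Windows event log, so the window may need adjusting before comparing the two.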
 |
|
|
|
|
|
|
|