Tuesday, March 20, 2012

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one-node cluster that is logging the error messages below in cluster.log and the Event Viewer. The SQL Server 2005 SP2 job that fails reports a slightly different message. The OS is Windows 2003 SP1, and this is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure

Based on your error message, you are hitting the SynAttackProtect issue. Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228
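
To see whether SynAttackProtect is currently enabled on a node, one option is to run `reg query` and read the DWORD from its output. The snippet below is a minimal sketch that only parses such output. The registry path is the location usually documented for this TCP/IP setting and is an assumption here, and the sample text is illustrative rather than captured from the poster's machine.

```python
import re
from typing import Optional

# Assumed registry location of the setting (not stated in this thread).
KEY = r"HKLM\SYSTEM\CurrentControlSet\Services\Tcpip\Parameters"


def parse_syn_attack_protect(reg_output: str) -> Optional[int]:
    """Return the SynAttackProtect DWORD found in `reg query` output, or None."""
    match = re.search(r"SynAttackProtect\s+REG_DWORD\s+(0x[0-9a-fA-F]+)", reg_output)
    return int(match.group(1), 16) if match else None


# Illustrative output of:  reg query <KEY> /v SynAttackProtect
sample = """
HKEY_LOCAL_MACHINE\\SYSTEM\\CurrentControlSet\\Services\\Tcpip\\Parameters
    SynAttackProtect    REG_DWORD    0x1
"""

value = parse_syn_attack_protect(sample)
print("SynAttackProtect =", value)
```

A result of `None` means the value is not present in the output, in which case the operating system default applies.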

|||

I spent months trying to resolve this issue and eventually found it to be a hardware driver issue. I'm not sure what hardware you're running, but if it's an HP ProLiant, it's worth checking the advisory below: a simple driver update sorted out the problem we had (even though we were not using iLO).

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||Thanks for the info, Pete.|||

I have the same problem with a SQL Server cluster. I disabled SynAttackProtect, but it had no effect.

|||

Andrew,

Did you disable SynAttackProtect on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] OnlineThread: QP is not online.
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] OnlineThread: QP is not online.
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed

These errors last occurred on 20.08.2007. The time before that was 13.08.2007.

I have a two-node cluster with Windows Server 2003 R2 and MS SQL Server 2005 Standard.

|||

Andrew,

Does it happen every 7 days? Can you check the system event log to see whether there was a network outage at 20.08.2007 17:18:58?

Basically, the cluster service was using TCP to connect to SQL Server to check its health, and it couldn't. The original hint was "an existing connection was forcibly closed...", which pointed to SynAttackProtect.

Since you have already turned off the DoS protection flag, the only thing left to check is the system itself.
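
To watch for the same kind of failure from outside the cluster service, a rough stand-in for the health check is a periodic TCP connection attempt against the SQL Server port. The sketch below only tests whether the port accepts a connection. Judging by the `sqlexecdirect` entries in the logs, the real check goes further and executes a query over ODBC, so treat this as an approximation. The host name in the usage comment is hypothetical.

```python
import socket


def tcp_probe(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, resets, timeouts and name-resolution failures.
        return False


# Example call (hypothetical virtual server name; 1433 is the default SQL Server port):
#   tcp_probe("sqlvirtual.example.local", 1433)
```

Running the probe on a schedule and logging the timestamps of any `False` results gives something to compare against the event log entries.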

|||

Yes, I checked the system log: there are no errors at 20.08.2007 17:18:58 (the last event on 20.08 was at 15:08, and it was an Information event). All nodes in the cluster were available at 20.08.2007 17:18:58. Only SQL Server failed.

|||

Today at 10:57:50 it happened again!

I checked the system and application logs.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 The Cluster Service brought the Resource Group "SQL Server 2005" online.
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server Agent (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server Agent (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server Agent (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server Agent (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.
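
The two ClusSvc 1202 events above report the time delta between the nodes in units of 100 nanoseconds, which is hard to read at a glance. A small conversion, shown here as a sketch, puts the values in seconds. Whether the larger value is related to the failure is not established in this thread.

```python
HUNDRED_NS_PER_SECOND = 10_000_000  # 1 second = 10^7 units of 100 ns


def delta_to_seconds(units_100ns: int) -> float:
    """Convert a time delta given in 100-nanosecond units to seconds."""
    return units_100ns / HUNDRED_NS_PER_SECOND


# Values copied from the two event 1202 entries (10:57:57 and 10:57:58).
for raw in (269397841, 1751833):
    print(f"{raw} -> {delta_to_seconds(raw):.3f} s")
# 269397841 -> 26.940 s
# 1751833 -> 0.175 s
```

So the delta logged at 10:57:57 was about 27 seconds, and the one logged a second later was under 0.2 seconds.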

Application event log (I run the Russian edition of SQL Server 2005, so some messages were originally in Russian; they are shown here in English):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Unable to complete login process due to delay in opening server connection
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] ODBC sqldriverconnect failed
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] OnlineThread: QP is not online.
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed

I need help!

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The application log shows a connection error, which escalates to a ClusSvc error in the system log. Next, I would suggest checking for faulty hardware (e.g. NICs). I've seen strange, hard-to-diagnose issues caused by faulty hardware.
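
To put a number on that escalation, the timestamps of the first failed health check and of the ClusSvc 1069 event can be lined up. The sketch below assumes the `DD.MM.YYYY HH:MM:SS` prefix seen in the pasted exports. The sample lines are copied from the thread with the export's stray quotes removed, and the helper names are made up for illustration.

```python
import re
from datetime import datetime
from typing import Iterable, Optional

TS = re.compile(r"^(\d{2}\.\d{2}\.\d{4} \d{2}:\d{2}:\d{2}) ")


def timestamp(line: str) -> Optional[datetime]:
    """Parse the leading DD.MM.YYYY HH:MM:SS stamp of an exported event line."""
    m = TS.match(line)
    return datetime.strptime(m.group(1), "%d.%m.%Y %H:%M:%S") if m else None


def earliest(lines: Iterable[str], needle: str) -> Optional[datetime]:
    """Earliest timestamp among the lines that contain `needle`."""
    stamps = [timestamp(line) for line in lines if needle in line]
    stamps = [s for s in stamps if s is not None]
    return min(stamps) if stamps else None


app_log = [
    "23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] OnlineThread: QP is not online.",
    "23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed",
]
system_log = [
    "23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.",
]

first_failure = earliest(app_log, "sqlexecdirect failed")
resource_failed = earliest(system_log, "ClusSvc Error")
gap = (resource_failed - first_failure).total_seconds()
print(gap)  # 127.0
```

In the 23.08.2007 incident, the resource was marked as failed roughly two minutes after the first failed check.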

|||We have exactly the same error, and we are using an HP DL585 G2 running Win2k3 SP1. Are there any steps I can take to isolate it further (diagnostic tools)?

Thanks,
Tony B|||

Hello,

I have the same problem with a SQL 2005 SP2 cluster. I get several SQL 2005 errors in the application log and, shortly afterwards, an error in the system log about the cluster service.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem by taking these actions:

*Disabled the iLO driver.
*Updated the NIC drivers.
*Created a DWORD SynAttackProtect with value 0.


Any ideas on how to solve the problem?

Thanks
pablo

sqlsql

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one node cluster that is receiving the below error messages in the cluster.log and the Event Viewer. In the SQL Server 2005 SP2 job that fails we receive a slighlty different message. The OS is Windows 2003 SP1. This is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure

You encounter the SynAttachProtect issue (based on your error message). Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228

|||

I spent months trying to resolve this issue but found it to be a hardware driver issue. Not sure what hardware you're running but if its HP Proliant its worth checking this as it sorted the problem we had (even though we were not using iLO) with a simple driver update.

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||thanks for the info Pete.|||

I have a same problem with a SQL Server Cluster and I disabled SynAttackProtect. But this is take no effect

|||

Andrew,

You disable SYN on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native ClientCommunication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

The last time of this errors - 20.08.2007. Previos day of this errors - 13/08/2007.

I have a two-nodes cluster with Windows Server 2003 R2 and MS SQL Server 2005 Standart

|||

Andrew,

It happens every 7 day? Can you check system event log to see if there is any network outage at 20.08.2007 17:18:58.

Basically, the cluster service was using tcp to connect to sqlserver to check its health and it couldn't. The original hint was "an existing connection was forcibly closed..." which pointed to SynAttackProtect.

Since you already turned off DoS flag, the only thing left to check is the system itself.

|||

Yes, I check system log - there is no errors at 20.08.2007 17:18:58 (The last event at 20.08 was in 15:08 and it was a Information event). All nodes in cluster was available at at 20.08.2007 17:18:58. Only SQL Server was fail.

|||

!!!!

Today at 10:57:50 it's happened again !!!

I check the system and application log.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 "The Cluster Service brought the Resource Group ""SQL Server 2005"" online."
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.

Application Event Log (because I use a SQL Server 2005 Russian some event have a russian text. In brackets I write english tranlation):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Не удается завершить вход в систему из-за задержки при открытии соединения с сервером
"
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] ODBC sqldriverconnect failed
"
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP: An existing connection was forcibly closed by the remote host.

"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

I need help!

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The application log shows a connection error that escalates to a ClusSvc error in the system log. Next, I would suggest checking for faulty hardware (e.g. NICs). I've seen strange, hard-to-diagnose issues caused by faulty hardware.
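
Before looking at hardware, it can help to see how tightly the errors are bunched in time, so that window can be compared against the system log. Below is a minimal Python sketch, assuming the Application log was exported in exactly the line layout pasted above. The `summarize` function and the regular expressions are illustrative names of my own, not part of any Microsoft tool.

```python
import re
from collections import Counter
from datetime import datetime

# Matches the start of an exported Application log line as pasted in this thread.
# Assumption: source is MSSQLSERVER, category "(3)", event ID 19019.
LINE_RE = re.compile(
    r'^(?P<ts>\d{2}\.\d{2}\.\d{4} \d{2}:\d{2}:\d{2}) MSSQLSERVER Error \(3\) 19019 '
    r'\S+ (?P<node>\S+) "\[sqsrvres\] (?P<msg>.*)$'
)
NATIVE_RE = re.compile(r'native error = (\d+)')


def summarize(lines):
    """Return (first timestamp, last timestamp, Counter of native error codes)."""
    stamps = []
    codes = Counter()
    for line in lines:
        m = LINE_RE.match(line)
        if not m:
            continue  # skip closing quotes, blank lines and other event sources
        stamps.append(datetime.strptime(m.group("ts"), "%d.%m.%Y %H:%M:%S"))
        code = NATIVE_RE.search(m.group("msg"))
        if code:
            codes[int(code.group(1))] += 1
    if not stamps:
        return None, None, codes
    return min(stamps), max(stamps), codes


# A few lines taken from the logs in this thread.
sample = [
    '23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed',
    '"',
    '23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.',
    '23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.',
    '23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure',
]
first, last, codes = summarize(sample)
print(first, last, dict(codes))
```

The first and last timestamps give the window to look for in the system log (NIC resets, driver warnings, time-delta events), and the counter shows whether native error 2746 appears only at the start of the burst.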

|||We have exactly the same error, and we are using an HP DL 585 G2 running Win2k3 SP1. Are there any steps I can take to isolate it further (diagnostic tools)?

thanks,
Tony B|||

Hello,

I have the same problem with a SQL 2005 SP2 cluster. I get several SQL 2005 errors in the application log and, a little later, an error in the system log about the cluster service.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem by taking these actions:

*Disabled the iLO driver.
*Updated the NIC drivers.
*Created a DWORD SynAttackProtect with value 0.
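
To confirm that the SynAttackProtect value is really in place on a node, something like the following Python sketch could be run on each one. It assumes the value lives under the usual `Tcpip\Parameters` key; the function names `read_syn_attack_protect` and `describe` are my own illustrative helpers, and the script only reads the registry, it changes nothing.

```python
import sys

# Registry location of the TCP/IP parameters (under HKEY_LOCAL_MACHINE).
TCPIP_PARAMS = r"SYSTEM\CurrentControlSet\Services\Tcpip\Parameters"
VALUE_NAME = "SynAttackProtect"


def read_syn_attack_protect():
    """Return the DWORD value, or None if it is absent or this is not Windows."""
    if not sys.platform.startswith("win"):
        return None
    import winreg  # standard library, available on Windows only
    try:
        with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, TCPIP_PARAMS) as key:
            value, value_type = winreg.QueryValueEx(key, VALUE_NAME)
    except FileNotFoundError:
        return None  # key or value does not exist
    return value if value_type == winreg.REG_DWORD else None


def describe(value):
    """Turn the raw value into a short human-readable status."""
    if value is None:
        return "SynAttackProtect not set (OS default applies)"
    if value == 0:
        return "SynAttackProtect disabled (0)"
    return f"SynAttackProtect enabled ({value})"


print(describe(read_syn_attack_protect()))
```

If one node reports "disabled" and the other reports "not set", the change was applied on only one side of the cluster.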


Any ideas on how to solve the problem?

Thanks
pablo

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one node cluster that is receiving the below error messages in the cluster.log and the Event Viewer. In the SQL Server 2005 SP2 job that fails we receive a slighlty different message. The OS is Windows 2003 SP1. This is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure

You encounter the SynAttachProtect issue (based on your error message). Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228

|||

I spent months trying to resolve this issue but found it to be a hardware driver issue. Not sure what hardware you're running but if its HP Proliant its worth checking this as it sorted the problem we had (even though we were not using iLO) with a simple driver update.

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||thanks for the info Pete.|||

I have a same problem with a SQL Server Cluster and I disabled SynAttackProtect. But this is take no effect

|||

Andrew,

You disable SYN on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native ClientCommunication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

The last time of this errors - 20.08.2007. Previos day of this errors - 13/08/2007.

I have a two-nodes cluster with Windows Server 2003 R2 and MS SQL Server 2005 Standart

|||

Andrew,

It happens every 7 day? Can you check system event log to see if there is any network outage at 20.08.2007 17:18:58.

Basically, the cluster service was using tcp to connect to sqlserver to check its health and it couldn't. The original hint was "an existing connection was forcibly closed..." which pointed to SynAttackProtect.

Since you already turned off DoS flag, the only thing left to check is the system itself.

|||

Yes, I check system log - there is no errors at 20.08.2007 17:18:58 (The last event at 20.08 was in 15:08 and it was a Information event). All nodes in cluster was available at at 20.08.2007 17:18:58. Only SQL Server was fail.

|||

!!!!

Today at 10:57:50 it's happened again !!!

I check the system and application log.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 "The Cluster Service brought the Resource Group ""SQL Server 2005"" online."
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.

Application Event Log (because I use a SQL Server 2005 Russian some event have a russian text. In brackets I write english tranlation):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Unable to complete login process due to delay in opening server connection"
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] ODBC sqldriverconnect failed"
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure"
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed"
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure"
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online."
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host."
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed"

I need help!

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The application log shows a connection error that escalates to a ClusSvc error in the system log. Next, I would suggest checking for faulty hardware (e.g. the NICs). I've seen strange, hard-to-diagnose issues caused by faulty hardware.
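To see at a glance which connection errors dominate before the ClusSvc failure, the exported application log can be tallied by SQLSTATE and native error. The following is only a rough sketch: the function name `summarize_odbc_errors`, the regular expression, and the `SAMPLE` list are illustrative assumptions, and the sample entries are copied from the logs posted in this thread.

```python
import re
from collections import Counter

# Matches the "sqlstate = ...; native error = ...; message = ..." part of the
# [sqsrvres] entries (event 19019) as they appear in the exported log above.
ODBC_ERROR = re.compile(
    r"\[sqsrvres\]\s+(?P<func>\w+):\s+sqlstate = (?P<sqlstate>\w+); "
    r"native error = (?P<native>\d+); message = (?P<message>.*)"
)


def summarize_odbc_errors(lines):
    """Count (sqlstate, native error) pairs found in the given log lines."""
    counts = Counter()
    for line in lines:
        match = ODBC_ERROR.search(line)
        if match:
            counts[(match["sqlstate"], match["native"])] += 1
    return counts


# Hypothetical input: a few entries taken from the thread's application log.
SAMPLE = [
    '23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed"',
    '23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host."',
    '23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure"',
    '23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online."',
    '23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure"',
    '23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Unable to complete login process due to delay in opening server connection"',
]

if __name__ == "__main__":
    for (sqlstate, native), n in summarize_odbc_errors(SAMPLE).most_common():
        print(f"sqlstate={sqlstate} native_error={native}: {n} entries")
```

Running the same tally over each incident makes it easier to compare them, for instance to check whether every failure starts with native error 2746 (connection forcibly closed) before the generic link failures follow.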

|||We have exactly the same error, on an HP DL 585 G2 running Win2k3 SP1. Are there any steps I can take to isolate it further (diagnostic tools)?

thanks,
Tony B|||

Hello,

I have the same problem with a SQL Server 2005 SP2 cluster. I get several SQL Server 2005 errors in the application log and, a little later, an error in the system log about the cluster service.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem by taking these actions:

*Disabled the iLO driver.
*Updated the NIC drivers.
*Created a DWORD SynAttackProtect with value 0.
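
Since the SynAttackProtect change has to be in place on every node, it can help to verify the value on each one. Below is a minimal sketch that only builds the `reg query` and `reg add` command lines and prints them; it does not run anything. The helper name `reg_commands` is hypothetical, the node names are the ones listed above, and it assumes the value lives under the usual `Tcpip\Parameters` key and that the remote registry is reachable from where the commands are run.

```python
# Registry key that holds the TCP/IP SynAttackProtect value on Windows Server 2003.
TCPIP_PARAMETERS = r"HKLM\SYSTEM\CurrentControlSet\Services\Tcpip\Parameters"
VALUE_NAME = "SynAttackProtect"


def reg_commands(node, value=0):
    """Return (query, add) argument lists for reg.exe against a remote node.

    Nothing is executed here; the lists can be reviewed first and then
    passed to subprocess.run on a Windows machine if desired.
    """
    # reg.exe accepts remote keys in the form \\Machine\HKLM\...
    key = "\\\\" + node + "\\" + TCPIP_PARAMETERS
    query = ["reg", "query", key, "/v", VALUE_NAME]
    add = ["reg", "add", key, "/v", VALUE_NAME,
           "/t", "REG_DWORD", "/d", str(value), "/f"]
    return query, add


if __name__ == "__main__":
    # Node names as given in the post above.
    for node in ("clusternode1", "clusternode2"):
        query, add = reg_commands(node)
        print(" ".join(query))
        print(" ".join(add))
```

As noted earlier in the thread, the nodes were expected to be restarted after changing the value, so a query that returns 0 does not by itself show that the setting is active.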


Any ideas on how to solve the problem?

Thanks
pablo

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one node cluster that is receiving the below error messages in the cluster.log and the Event Viewer. In the SQL Server 2005 SP2 job that fails we receive a slighlty different message. The OS is Windows 2003 SP1. This is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure

You encounter the SynAttachProtect issue (based on your error message). Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228

|||

I spent months trying to resolve this issue but found it to be a hardware driver issue. Not sure what hardware you're running but if its HP Proliant its worth checking this as it sorted the problem we had (even though we were not using iLO) with a simple driver update.

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||thanks for the info Pete.|||

I have a same problem with a SQL Server Cluster and I disabled SynAttackProtect. But this is take no effect

|||

Andrew,

You disable SYN on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native ClientCommunication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

The last time of this errors - 20.08.2007. Previos day of this errors - 13/08/2007.

I have a two-nodes cluster with Windows Server 2003 R2 and MS SQL Server 2005 Standart

|||

Andrew,

It happens every 7 day? Can you check system event log to see if there is any network outage at 20.08.2007 17:18:58.

Basically, the cluster service was using tcp to connect to sqlserver to check its health and it couldn't. The original hint was "an existing connection was forcibly closed..." which pointed to SynAttackProtect.

Since you already turned off DoS flag, the only thing left to check is the system itself.

|||

Yes, I check system log - there is no errors at 20.08.2007 17:18:58 (The last event at 20.08 was in 15:08 and it was a Information event). All nodes in cluster was available at at 20.08.2007 17:18:58. Only SQL Server was fail.

|||

!!!!

Today at 10:57:50 it's happened again !!!

I check the system and application log.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 "The Cluster Service brought the Resource Group ""SQL Server 2005"" online."
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.

Application Event Log (because I use a SQL Server 2005 Russian some event have a russian text. In brackets I write english tranlation):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Не удается завершить вход в систему из-за задержки при открытии соединения с сервером
"
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] ODBC sqldriverconnect failed
"
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP (TCP Provider): An existing connection was forcibly closed by the remote host.

"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
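
To see at a glance which ODBC errors dominate an exported log like the one above, the messages can be tallied by SQLSTATE and native error. This is only a rough sketch: it assumes the log has been saved as plain text with one event per line, and the function name and sample lines are made up for illustration.

```python
import re
from collections import Counter

# Matches the "sqlstate = X; native error = N" fragment of [sqsrvres] messages.
PATTERN = re.compile(r"sqlstate = (\w+); native error = (\d+)")


def tally_odbc_errors(lines):
    """Count (sqlstate, native_error) pairs found in the given log lines."""
    counts = Counter()
    for line in lines:
        match = PATTERN.search(line)
        if match:
            counts[(match.group(1), int(match.group(2)))] += 1
    return counts


if __name__ == "__main__":
    # Shortened sample lines in the same shape as the events above.
    sample = [
        "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = ...",
        "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = ...",
        "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed",
    ]
    for (state, native), count in tally_odbc_errors(sample).most_common():
        print(state, native, count)
```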

I need help!

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The application log shows connection errors, which then escalate to a ClusSvc error in the system log. As a next step, I would suggest checking for faulty hardware (e.g. the NICs). I've seen strange, hard-to-diagnose issues caused by faulty hardware.

|||We have exactly the same error, and we are using an HP DL585 G2 running Win2k3 SP1. Are there any steps I can take to isolate it further (tools for diagnostics)?

thanks,
Tony B|||

Hello,

I have the same problem with a SQL 2005 SP2 cluster. I get several SQL 2005 errors in the application log and, shortly afterwards, an error in the system log about the cluster service.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem by taking these actions:

*Disabled the iLO driver.
*Updated the NIC drivers.
*Created a DWORD value SynAttackProtect set to 0.


Any ideas on how to solve the problem?

Thanks
pablo
