Showing posts with label receiving. Show all posts

Tuesday, March 27, 2012

Cluster resource SQL Server (InstanceName) failed to come offline

Hi -
Recently I added some additional memory to a server that's part of an
active/passive cluster (for a total of 8GB). After that, we started receiving an
error message when trying to move the cluster over to the passive server:
Source:ClusSvc
Event ID: 1117
Description: Cluster resource SQL Server(InstanceName) failed to come offline
I did some searching online but the only thing I could find was a thread on
google news from October 2003 (link included below). I am wondering if
anyone else has also experienced this particular problem and if so how they
may have solved it.
Google Groups Link:
http://groups.google.com/groups?hl=e...3DN%26tab%3Dwg
Thanks!
Brad Baker
Anything in the SQL Error log?
Did you re-configure SQL to use the additional RAM?
--
Mike Epprecht, Microsoft SQL Server MVP
Johannesburg, South Africa
Mobile: +27-82-552-0268
IM: mike@.epprecht.net
MVP Program: http://www.microsoft.com/mvp
Blog: http://www.msmvps.com/epprecht/
|||> Anything in the SQL Error log?
Other than the error mentioned in the event viewer, I didn't see anything
really pertinent.
> Did you re-configure SQL to use the additional RAM?
We enabled AWE and also configured SQL Server to use 7GB of RAM. The only thing
that might be noteworthy is that the active server has 8GB of RAM versus
the passive system, which has 4GB of RAM. We've run an unbalanced cluster for
quite some time, though, and only experienced this problem after adding the
last 2GB.
Brad
|||The error message indicates that the SQL Server resource failed to go offline. Instead of moving the group, try taking just the SQL Server resource offline and see what results or errors you get. If it goes offline, I would take all
resources in the SQL group offline, move the group to the other node, and bring one resource online at a time, starting with the resources that do not depend on any other resource: the disks first, followed by the SQL IP address
resource, the SQL network name, and then SQL Server. Also, I would uncheck the "restart" property for the resources while troubleshooting this issue (set it back to the default after the issue is resolved).
NOTE: Since AWE is enabled, max server memory is set to 7GB, and the passive node has 4GB, when you move the SQL group to the passive node the SQL Server instance will acquire almost all of the available memory
and leave only up to 128MB of memory free. For more info, please refer to the SQL Server BOL topic "Managing AWE Memory".
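The "one resource at a time, dependencies first" order described above can be sketched as a topological sort. This is only an illustration: the resource names and the dependency edges below are assumptions modelled on the order given in the post, not values read from a real cluster.

```python
from graphlib import TopologicalSorter

# Hypothetical dependency map: each resource lists the resources it depends on.
# Names and edges are illustrative assumptions, not taken from a real cluster.
dependencies = {
    "Disk": set(),
    "SQL IP Address": set(),
    "SQL Network Name": {"SQL IP Address"},
    "SQL Server": {"Disk", "SQL Network Name"},
}

def online_order(deps):
    """Return the resources in an order where every dependency comes first."""
    return list(TopologicalSorter(deps).static_order())

order = online_order(dependencies)
print(order)
```

Reversing the list gives a safe order for taking the resources offline again.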
Best Regards,
Uttam Parui
Microsoft Corporation
This posting is provided "AS IS" with no warranties, and confers no rights.

Cluster resource could not be found.

After rebooting one of my nodes, I am now receiving the following
message:
The disk associated with the cluster disk resource 'Disk T:' could not be
found. The expected signature of the disk was XXXXXXX. If the disk was
removed from the server cluster, the resource should be deleted. If the disk
was replaced, the resource must be deleted and created again in order to
bring the disk online. If the disk has not been removed or replaced, it may
be inaccessible at this time because it is reserved by another server cluster
node.
How can I create another GUID? Will the installation of the Windows SP1
create a new GUID?
Burley,
You are not alone. I have the same problem on MSCS.
|||Lemme guess, SCSI-based cluster?
Sometimes a SCSI-based cluster can have problems with one node blocking
access to the shared disk. This is most common during node startup and
especially common when two nodes are started up at the same time. Always
boot one node at a time in a SCSI cluster.
Geoff N. Hiten
Senior Database Administrator
Microsoft SQL Server MVP

Tuesday, March 20, 2012

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one-node cluster that is receiving the error messages below in the cluster.log and the Event Viewer. The SQL Server 2005 SP2 job that fails reports a slightly different message. The OS is Windows 2003 SP1. This is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
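
The repeated entries above are easier to compare once they are grouped by SQLSTATE and native error. The sketch below is a rough helper for that, assuming the `sqlstate = ...; native error = ...; message = ...` layout seen in these lines. The function name is made up for illustration and the sample is a shortened copy of the log above.

```python
import re
from collections import Counter

# Matches the ODBC detail part of a [sqsrvres] printODBCError line.
ODBC_RE = re.compile(
    r"sqlstate = (?P<sqlstate>\w+); native error = (?P<native>\d+); message = (?P<message>.*)"
)

def summarize_odbc_errors(lines):
    """Count (sqlstate, native error, message) triples found in the log lines."""
    counts = Counter()
    for line in lines:
        match = ODBC_RE.search(line)
        if match:
            key = (match["sqlstate"], int(match["native"]), match["message"].strip())
            counts[key] += 1
    return counts

sample = [
    "00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed",
    "00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.",
    "00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure",
    "00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure",
    "00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure",
]

summary = summarize_odbc_errors(sample)
for key, count in summary.most_common():
    print(count, key)
```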

Based on your error message, you are running into the SynAttackProtect issue. Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228

|||

I spent months trying to resolve this issue and found it to be a hardware driver issue. I'm not sure what hardware you're running, but if it's an HP ProLiant it's worth checking this, as a simple driver update sorted out the problem we had (even though we were not using iLO).

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||Thanks for the info, Pete.|||

I have the same problem with a SQL Server cluster. I disabled SynAttackProtect, but it had no effect.

|||

Andrew,

Did you disable SynAttackProtect on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

These errors last occurred on 20.08.2007. The previous occurrence was on 13.08.2007.

I have a two-node cluster with Windows Server 2003 R2 and MS SQL Server 2005 Standard.

|||

Andrew,

Does it happen every 7 days? Can you check the system event log to see whether there was any network outage at 20.08.2007 17:18:58?

Basically, the cluster service was using TCP to connect to SQL Server to check its health, and it couldn't. The original hint was "an existing connection was forcibly closed...", which pointed to SynAttackProtect.

Since you have already turned off the DoS protection flag (SynAttackProtect), the only thing left to check is the system itself.
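
To tell a plain network problem apart from a SQL Server problem, a simple TCP reachability probe can be run from the node at the time of the errors. The sketch below is a loose approximation only: the real cluster health check connects through ODBC and runs a query, while this just tests whether the port accepts a connection. The host name is a placeholder, and port 1433 (the default SQL Server port) is an assumption about the instance.

```python
import socket

def tcp_probe(host, port, timeout=5.0):
    """Return (True, None) if a TCP connection succeeds, else (False, error text)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True, None
    except OSError as exc:
        return False, str(exc)

# Placeholder target: replace with the SQL Server virtual name and port.
print(tcp_probe("localhost", 1433, timeout=2.0))
```

If the probe keeps succeeding while the cluster log still reports "Communication link failure", the problem is less likely to be basic connectivity.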

|||

Yes, I checked the system log. There are no errors at 20.08.2007 17:18:58 (the last event on 20.08 was at 15:08, and it was an Information event). All nodes in the cluster were available at 20.08.2007 17:18:58. Only SQL Server failed.

|||

Today at 10:57:50 it happened again!

I checked the system and application logs.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 "The Cluster Service brought the Resource Group ""SQL Server 2005"" online."
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server Agent (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server Agent (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server Agent (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server Agent (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.
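
The two event 1202 entries above report the time delta between the nodes in units of 100 nanoseconds, which is hard to read at a glance. A minimal conversion, assuming the values are plain counts of 100 ns units as the event text states:

```python
def delta_to_seconds(units_100ns):
    """Convert a count of 100-nanosecond units to seconds."""
    return units_100ns / 10_000_000

# The two deltas reported by ClusSvc event 1202 in the system log above.
for raw in (1751833, 269397841):
    print(raw, "->", round(delta_to_seconds(raw), 3), "s")
```

So the first delta is under 0.2 seconds and the second is close to 27 seconds. Whether the larger value is related to the failure is not something the logs establish, but it may be worth a look.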

Application event log (because I use the Russian version of SQL Server 2005, some events have Russian text; I give the English translation in brackets):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Не удается завершить вход в систему из-за задержки при открытии соединения с сервером (Unable to complete login process due to delay in opening server connection)
"
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] ODBC sqldriverconnect failed
"
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP (TCP Provider): An existing connection was forcibly closed by the remote host.

"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

I need help!

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The application log shows a connection error, which escalates to a ClusSvc error in the system log. Next, I would suggest checking for faulty hardware (e.g. the NICs). I've seen strange and hard-to-diagnose issues when it comes to faulty hardware.

|||We also have the exact same error, and we are using an HP DL585 G2 running Win2k3 SP1. Are there any steps I can take to isolate it further (diagnostic tools)?

thanks,
Tony B|||

Hello,

I have the same problem with a SQL 2005 SP2 cluster. I get several SQL 2005 errors in the application log and, a little later, an error in the system log about the cluster service.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem by taking these actions:

*Disabled the iLO driver.
*Updated the NIC drivers.
*Created a DWORD SynAttackProtect with value 0.


Any idea how to solve the problem?

Thanks
pablo

sqlsql

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one node cluster that is receiving the below error messages in the cluster.log and the Event Viewer. In the SQL Server 2005 SP2 job that fails we receive a slighlty different message. The OS is Windows 2003 SP1. This is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure

You encounter the SynAttachProtect issue (based on your error message). Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228

|||

I spent months trying to resolve this issue but found it to be a hardware driver issue. Not sure what hardware you're running but if its HP Proliant its worth checking this as it sorted the problem we had (even though we were not using iLO) with a simple driver update.

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||thanks for the info Pete.|||

I have a same problem with a SQL Server Cluster and I disabled SynAttackProtect. But this is take no effect

|||

Andrew,

You disable SYN on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native ClientCommunication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

The last time of this errors - 20.08.2007. Previos day of this errors - 13/08/2007.

I have a two-nodes cluster with Windows Server 2003 R2 and MS SQL Server 2005 Standart

|||

Andrew,

Does it happen every 7 days? Can you check the system event log to see whether there was any network outage at 20.08.2007 17:18:58?

Basically, the cluster service was using TCP to connect to SQL Server to check its health, and it couldn't. The original hint was "An existing connection was forcibly closed...", which pointed to SynAttackProtect.

Since you have already turned off the DoS protection flag (SynAttackProtect), the only thing left to check is the system itself.
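
The 7-day question follows from the two dates reported above (13.08.2007 and 20.08.2007). As a small worked check, here is a sketch in Python; the function name `days_between` is only illustrative, and the date format is assumed to be the day.month.year form used in the pasted logs.

```python
from datetime import datetime


def days_between(first: str, second: str, fmt: str = "%d.%m.%Y") -> int:
    """Return the whole number of days from `first` to `second`."""
    start = datetime.strptime(first, fmt)
    end = datetime.strptime(second, fmt)
    return (end - start).days


# The two occurrences reported in the thread.
print(days_between("13.08.2007", "20.08.2007"))  # 7
```

Two data points are not enough to prove a weekly schedule, but an exact 7-day gap is a reasonable prompt to look for weekly jobs (backups, scans, maintenance plans) that run at around that time.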

|||

Yes, I checked the system log: there are no errors at 20.08.2007 17:18:58 (the last event on 20.08 was at 15:08, and it was an Information event). All nodes in the cluster were available at 20.08.2007 17:18:58. Only SQL Server failed.

|||

!!!!

Today at 10:57:50 it happened again!!!

I checked the system and application logs.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 "The Cluster Service brought the Resource Group ""SQL Server 2005"" online."
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server Agent (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server Agent (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server Agent (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server Agent (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.
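
Event 1202 in the system log above reports the time delta between the nodes in units of 100 nanoseconds, which is hard to read at a glance. The following is a minimal sketch that converts the two logged values to seconds; the helper name `delta_to_seconds` is hypothetical, and the only input is the pair of values quoted in the log.

```python
# Event 1202 states its unit explicitly: "in 100 nanosecs".
TICKS_PER_SECOND = 10_000_000  # 1 s / 100 ns


def delta_to_seconds(ticks: int) -> float:
    """Convert a delta expressed in 100-nanosecond ticks to seconds."""
    return ticks / TICKS_PER_SECOND


for ticks in (1751833, 269397841):
    print(f"{ticks} ticks = {delta_to_seconds(ticks):.3f} s")
```

The first value works out to roughly 0.18 seconds and the second to roughly 27 seconds. The thread does not establish what caused the larger delta, so treat the conversion only as a way to read the log, not as a diagnosis.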

Application event log (because I use the Russian edition of SQL Server 2005, some events contain Russian text; I have added the English translation in brackets):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Не удается завершить вход в систему из-за задержки при открытии соединения с сервером (Unable to complete login process due to delay in opening server connection)
"
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] ODBC sqldriverconnect failed
"
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

I need help!

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The application log shows a connection error, which then escalates to a ClusSvc error in the system log. I would next suggest checking for faulty hardware (e.g. the NICs). I've seen strange, hard-to-diagnose issues caused by faulty hardware.

|||We have exactly the same error, and we are using an HP DL 585 G2 running Win2k3 SP1. Are there any steps I can take to isolate it further (diagnostic tools)?

thanks,
Tony B|||

Hello,

I have the same problem with a SQL 2005 SP2 cluster. I get several SQL 2005 errors in the application log and, a little later, an error about the cluster service in the system log.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem by taking these actions:

*Disabled the iLO driver.
*Updated the NIC drivers.
*Created a DWORD SynAttackProtect with value 0.


Any idea how to solve the problem?

Thanks
pablo
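
For readers who want to reproduce the SynAttackProtect step from the list above, here is a hedged sketch that only builds the `reg add` command line as a string; it does not touch the registry. The function name is hypothetical. The key path is the standard TCP/IP parameters key on Windows Server 2003, but verify it against the Microsoft KB article for your own system before running anything.

```python
# Standard TCP/IP parameters key on Windows Server 2003 (verify for your OS).
TCPIP_PARAMETERS_KEY = r"HKLM\SYSTEM\CurrentControlSet\Services\Tcpip\Parameters"


def syn_attack_protect_command(value: int = 0) -> str:
    """Build (but do not run) a `reg add` command that sets SynAttackProtect."""
    if value not in (0, 1):
        raise ValueError("SynAttackProtect is expected to be 0 (off) or 1 (on)")
    return (
        f'reg add "{TCPIP_PARAMETERS_KEY}" '
        f"/v SynAttackProtect /t REG_DWORD /d {value} /f"
    )


print(syn_attack_protect_command(0))
```

As discussed elsewhere in the thread, the setting has to be applied on every node and followed by a restart before it can be expected to have any effect.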

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one-node cluster that is receiving the error messages below in the cluster.log and the Event Viewer. In the SQL Server 2005 SP2 job that fails we receive a slightly different message. The OS is Windows 2003 SP1. This is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure

You are encountering the SynAttackProtect issue (based on your error message). Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228
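
The diagnosis above rests on the sqlstate and native error values in the cluster.log excerpt. A minimal sketch of how those values could be tallied from such a log follows; it assumes the line layout shown in the excerpt, and the names `ODBC_ERROR` and `summarize_odbc_errors` are illustrative, not part of any Microsoft tool.

```python
import re
from collections import Counter

# Matches the "[sqsrvres] printODBCError: ..." lines quoted in the thread.
ODBC_ERROR = re.compile(
    r"printODBCError: sqlstate = (?P<sqlstate>\w+); "
    r"native error = (?P<native>\d+); message = (?P<message>.*)"
)


def summarize_odbc_errors(lines):
    """Count (sqlstate, native error) pairs found in cluster.log lines."""
    counts = Counter()
    for line in lines:
        match = ODBC_ERROR.search(line)
        if match:
            counts[(match["sqlstate"], int(match["native"]))] += 1
    return counts


sample = [
    "ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed",
    "ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; "
    "native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure",
    "ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; "
    "native error = 0; message = [Microsoft][SQL Native Client]Communication link failure",
]
for (sqlstate, native), count in summarize_odbc_errors(sample).items():
    print(sqlstate, native, count)
```

A summary like this makes it easier to see whether a failure starts with native error 2746 ("An existing connection was forcibly closed by the remote host"), which is the message the SynAttackProtect suggestion is based on.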

|||

I spent months trying to resolve this issue and eventually found it to be a hardware driver issue. I'm not sure what hardware you're running, but if it's an HP ProLiant it's worth checking this advisory: a simple driver update sorted out the problem we had (even though we were not using iLO).

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||thanks for the info Pete.|||

I have the same problem with a SQL Server cluster. I disabled SynAttackProtect, but it had no effect.

|||

Andrew,

Did you disable SynAttackProtect on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

The last time of this errors - 20.08.2007. Previos day of this errors - 13/08/2007.

I have a two-nodes cluster with Windows Server 2003 R2 and MS SQL Server 2005 Standart

|||

Andrew,

It happens every 7 day? Can you check system event log to see if there is any network outage at 20.08.2007 17:18:58.

Basically, the cluster service was using tcp to connect to sqlserver to check its health and it couldn't. The original hint was "an existing connection was forcibly closed..." which pointed to SynAttackProtect.

Since you already turned off DoS flag, the only thing left to check is the system itself.

|||

Yes, I check system log - there is no errors at 20.08.2007 17:18:58 (The last event at 20.08 was in 15:08 and it was a Information event). All nodes in cluster was available at at 20.08.2007 17:18:58. Only SQL Server was fail.

|||

!!!!

Today at 10:57:50 it's happened again !!!

I check the system and application log.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 "The Cluster Service brought the Resource Group ""SQL Server 2005"" online."
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.

Application Event Log (because I use a SQL Server 2005 Russian some event have a russian text. In brackets I write english tranlation):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Не удается завершить вход в систему из-за задержки при открытии соединения с сервером
"
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] ODBC sqldriverconnect failed
"
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP: An existing connection was forcibly closed by the remote host.

"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

I need help !

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The app log indicates error with connection which escalates to a clussvc error in the system log. I would next suggest checking out faulty hardware (i.e. NIC's). I've seen strange and hard to diagnosed issue when it comes to faulty hardware.

|||We also have the same exact error and we are using HP DL 585 G2 running Win2k3 SP1. Any steps i can take to isolate further ( tools for dignostics ).

thanks,
Tony B|||

Hello,

I have the same problem with a sql 2005 SP2 cluter. I have several sql 2005 errors in aplication and a few later an error in system log about cluster service.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem taking this actions:

*Disable the ilo driver.
*Update NICs drivers.
*Create a DWORD SynAttackProtect with value 0.


?Any idea to solve the problem?

Thanks
pablo

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one-node cluster that is logging the error messages below in the cluster.log and the Event Viewer. The SQL Server 2005 SP2 job that fails gives us a slightly different message. The OS is Windows 2003 SP1. This is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
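
When a cluster.log contains many lines like the ones above, it can help to count which ODBC error codes actually occur. The following is a minimal Python sketch, not an official tool. The line layout it expects is inferred only from the sample lines in this post, and the names `LINE_RE`, `ODBC_RE` and `summarize` are made up for illustration.

```python
import re
from collections import Counter

# Layout assumed from the sample lines above:
# <process id>.<thread id>::<YYYY/MM/DD-HH:MM:SS.mmm> <level> <rest of line>
LINE_RE = re.compile(
    r"^(?P<pid>[0-9a-fA-F]{8})\.(?P<tid>[0-9a-fA-F]{8})::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<level>[A-Z]+)\s+(?P<message>.*)$"
)
# Matches the "sqlstate = ...; native error = ..." part of printODBCError lines.
ODBC_RE = re.compile(r"sqlstate = (?P<sqlstate>\w+); native error = (?P<native>\d+)")


def summarize(lines):
    """Count (sqlstate, native error) pairs found in ERR lines."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match or match.group("level") != "ERR":
            continue  # skip blank lines and anything that is not an error
        odbc = ODBC_RE.search(match.group("message"))
        if odbc:
            counts[(odbc.group("sqlstate"), int(odbc.group("native")))] += 1
    return counts


sample = [
    "00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: "
    "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed",
    "00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: "
    "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; "
    "message = [Microsoft][SQL Native Client]TCP Provider: An existing "
    "connection was forcibly closed by the remote host.",
    "00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: "
    "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; "
    "message = [Microsoft][SQL Native Client]Communication link failure",
]

for (sqlstate, native), count in summarize(sample).items():
    print(sqlstate, native, count)
```

In the log above, native error 2746 appears only at the start of the sequence and the later entries carry native error 0, so a count like this makes the first failure easier to spot.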

Based on your error message, you are running into the SynAttackProtect issue. Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228

|||

I spent months trying to resolve this issue and eventually found it to be a hardware driver issue. I'm not sure what hardware you're running, but if it's HP ProLiant, it's worth checking this advisory: a simple driver update sorted out the problem we had, even though we were not using iLO.

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||Thanks for the info, Pete.|||

I have the same problem with a SQL Server cluster. I disabled SynAttackProtect, but it had no effect.

|||

Andrew,

Did you disable SynAttackProtect on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

These errors last occurred on 20.08.2007. Before that, they occurred on 13.08.2007.

I have a two-node cluster running Windows Server 2003 R2 and MS SQL Server 2005 Standard.
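
The gap between the two occurrences mentioned above is easy to check. This is just a small Python sketch; the function name `days_between` is made up for illustration, and the dates are written in the day.month.year format used in the logs.

```python
from datetime import datetime


def days_between(first, second, fmt="%d.%m.%Y"):
    """Return the number of whole days from `first` to `second`."""
    start = datetime.strptime(first, fmt)
    end = datetime.strptime(second, fmt)
    return (end - start).days


print(days_between("13.08.2007", "20.08.2007"))
```

A gap of exactly one week can point at something scheduled (a job, a backup, a scan), but two occurrences are not enough to confirm a pattern.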

|||

Andrew,

Does it happen every 7 days? Can you check the system event log to see whether there was a network outage at 20.08.2007 17:18:58?

Basically, the cluster service was using TCP to connect to SQL Server to check its health, and it couldn't. The original hint was "an existing connection was forcibly closed...", which pointed to SynAttackProtect.

Since you have already turned off the DoS protection flag (SynAttackProtect), the only thing left to check is the system itself.
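
To see from another machine whether the SQL Server port can be reached at all around the time of a failure, a plain TCP probe is a reasonable first check. The Python sketch below is only an approximation of the idea. Judging by the log messages, the cluster resource DLL runs a query over ODBC (`sqlexecdirect`), which this sketch does not do; it only tests whether a TCP connection can be opened. The function name `tcp_probe`, the host name in the comment, and the use of port 1433 (the SQL Server default, which your instance may not use) are assumptions.

```python
import socket


def tcp_probe(host, port, timeout=3.0):
    """Try to open a TCP connection.

    Returns (True, None) on success, or (False, reason) on failure.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True, None
    except OSError as exc:  # covers refused, reset, timeout and DNS errors
        return False, f"{type(exc).__name__}: {exc}"


# Example call (hypothetical virtual server name, default port assumed):
# ok, reason = tcp_probe("sqlcluster.example.local", 1433)
# print("reachable" if ok else f"unreachable ({reason})")
```

Running such a probe in a loop with timestamps, from both cluster nodes, would show whether the connection failures are seen only by the node that owns the resource or by other hosts as well.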

|||

Yes, I checked the system log. There are no errors at 20.08.2007 17:18:58 (the last event on 20.08 was at 15:08, and it was an Information event). All nodes in the cluster were available at 20.08.2007 17:18:58. Only SQL Server failed.

|||

Today at 10:57:50 it happened again!

I checked the system and application logs.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 "The Cluster Service brought the Resource Group ""SQL Server 2005"" online."
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server Agent (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server Agent (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server Agent (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server Agent (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.
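
The two ClusSvc 1202 events above report the time delta between the nodes in units of 100 nanoseconds, which is hard to read at a glance. A small Python sketch of the conversion (the function name `delta_to_seconds` is made up for illustration):

```python
def delta_to_seconds(delta_100ns):
    """Convert a count of 100-nanosecond units to seconds."""
    return delta_100ns / 10_000_000  # 10,000,000 units of 100 ns per second


for raw in (269397841, 1751833):
    print(raw, "->", round(delta_to_seconds(raw), 3), "s")
```

So the first event reports a delta of about 26.9 seconds and the second about 0.18 seconds. Whether that is related to the failure is not something these logs establish, but it is a value worth keeping an eye on.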

Application event log (I use the Russian edition of SQL Server 2005, so some events had Russian text; it is translated into English here):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Unable to complete login process due to delay in opening server connection
"
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] ODBC sqldriverconnect failed
"
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

I need help!

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The application log shows a connection error, which then escalates to a ClusSvc error in the system log. Next, I would suggest checking for faulty hardware (e.g. the NICs). I've seen strange, hard-to-diagnose issues caused by faulty hardware.

|||We have the exact same error, and we are using an HP DL 585 G2 running Win2k3 SP1. Are there any steps I can take to isolate it further (diagnostic tools)?

thanks,
Tony B|||

Hello,

I have the same problem with a SQL Server 2005 SP2 cluster. I get several SQL Server 2005 errors in the application log and, a little later, an error about the cluster service in the system log.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem by taking these actions:

*Disabled the iLO driver.
*Updated the NIC drivers.
*Created a DWORD SynAttackProtect with value 0.
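
For anyone repeating the last step on several nodes, the registry change can be scripted. Here is a minimal Python sketch that only builds the `reg add` command line and prints it, so it can be reviewed before it is run on each node. The helper name `syn_attack_protect_command` is made up for illustration. The value lives under the Tcpip\Parameters key, and, as discussed earlier in the thread, the change has to be made on all nodes and followed by a restart.

```python
# Registry key that holds the TCP/IP stack parameters on Windows.
TCPIP_PARAMS = r"HKLM\SYSTEM\CurrentControlSet\Services\Tcpip\Parameters"


def syn_attack_protect_command(value=0):
    """Build a `reg add` command that sets the SynAttackProtect DWORD."""
    return [
        "reg", "add", TCPIP_PARAMS,
        "/v", "SynAttackProtect",  # value name
        "/t", "REG_DWORD",         # value type
        "/d", str(value),          # 0 turns the protection off
        "/f",                      # overwrite without prompting
    ]


# Print the command instead of running it, so it can be checked first.
print(" ".join(syn_attack_protect_command()))
```

Turning this protection off weakens the server against SYN flood attacks, so it is probably best treated as a diagnostic step on a protected network rather than a permanent setting.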


Any ideas on how to solve the problem?

Thanks
pablo

cluster error

I think I originally posted this in the wrong forum of SQL Server General Tools.

We have a one node cluster that is receiving the below error messages in the cluster.log and the Event Viewer. In the SQL Server 2005 SP2 job that fails we receive a slighlty different message. The OS is Windows 2003 SP1. This is not a 64-bit box. We have checked the permissions of the cluster service account in SQL Server to be sure they are correct. Can anyone please help with this issue?

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] OnlineThread: QP is not online.
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
00000a0c.0000089c::2007/06/12-12:51:27.459 ERR SQL Server <SQL Server>: [sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure

You encounter the SynAttachProtect issue (based on your error message). Take a look at section 4.1.2.

http://support.microsoft.com/kb/910228

|||

I spent months trying to resolve this issue but found it to be a hardware driver issue. Not sure what hardware you're running but if its HP Proliant its worth checking this as it sorted the problem we had (even though we were not using iLO) with a simple driver update.

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c00688313&jumpid=reg_R1002_USEN

Advisory: (Revision) Integrated Lights-Out Management (iLO) Interface Driver for Windows May Cause the System to Become Unresponsive if the Driver Does Not Allocate Extra Contiguous Memory Blocks Under the 4 GB Space

|||thanks for the info Pete.|||

I have a same problem with a SQL Server Cluster and I disabled SynAttackProtect. But this is take no effect

|||

Andrew,

You disable SYN on all nodes and restart? Can you post the exact error?

|||

Yes.

Errors (Windows Application Log):

20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native ClientCommunication link failure
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]TCP Provider: An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:27:21 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Communication link failure
"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP (TCP Provider): An existing connection was forcibly closed by the remote host.

"
20.08.2007 17:18:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

These errors last occurred on 20.08.2007. Before that, they occurred on 13.08.2007.

I have a two-node cluster running Windows Server 2003 R2 and MS SQL Server 2005 Standard.

|||

Andrew,

Does it happen every 7 days? Can you check the system event log to see whether there was any network outage at 20.08.2007 17:18:58?

Basically, the cluster service was using TCP to connect to SQL Server to check its health, and it couldn't. The original hint was "an existing connection was forcibly closed...", which pointed to SynAttackProtect.

Since you have already turned off the DoS flag, the only thing left to check is the system itself.
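
The health check described above can be roughly approximated from outside the cluster service with a plain TCP probe. This is only a minimal sketch: the real check runs a query over ODBC (the `sqlexecdirect` call in the log), whereas the code below only tests whether a TCP connection can be opened. The function names `probe_tcp` and `probe_line` are hypothetical, and the host and port to probe are assumptions, since the thread does not state them.

```python
import socket
from datetime import datetime


def probe_tcp(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection raises OSError (refused, reset, timeout, ...) on failure.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def probe_line(host: str, port: int, timeout: float = 5.0) -> str:
    """Format one timestamped result line, using the same date format as the logs."""
    status = "OK" if probe_tcp(host, port, timeout) else "FAILED"
    stamp = datetime.now().strftime("%d.%m.%Y %H:%M:%S")
    return f"{stamp} {host}:{port} {status}"
```

Called in a loop every few seconds against the SQL Server virtual server name and its listening port (1433 is the usual default for a default instance, but that is an assumption here), the FAILED lines could then be compared with the timestamps of the 19019 errors.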

|||

Yes, I checked the system log. There were no errors at 20.08.2007 17:18:58 (the last event on 20.08 was at 15:08, and it was an Information event). All nodes in the cluster were available at 20.08.2007 17:18:58. Only SQL Server failed.

|||

Today at 10:57:50 it happened again!

I checked the system and application logs.

System event log:

23.08.2007 11:01:14 ClusSvc Information Failover Mgr 1201 N/A MSNODE1 "The Cluster Service brought the Resource Group ""SQL Server 2005"" online."
23.08.2007 11:01:14 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (SQL Server Agent) (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:14 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (SQL Server Agent) (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:13 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the running state.
23.08.2007 11:01:07 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a start control.
23.08.2007 11:01:06 Service Control Manager Information None 7036 N/A MSNODE1 The SQL Server (MSSQLSERVER) service entered the stopped state.
23.08.2007 11:00:24 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The SQL Server (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 11:00:23 Service Control Manager Information None 7036 N/A MSNODE1 The Агент SQL Server (SQL Server Agent) (MSSQLSERVER) service entered the stopped state.
23.08.2007 10:59:57 Service Control Manager Information None 7035 POLAD\ClusterService MSNODE1 The Агент SQL Server (SQL Server Agent) (MSSQLSERVER) service was successfully sent a stop control.
23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1 Cluster resource 'SQL Server' in Resource Group 'SQL Server 2005' failed.

23.08.2007 10:57:58 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 1751833(in 100 nanosecs).
23.08.2007 10:57:24 TermServDevices Error None 1111 N/A MSNODE2 Driver Microsoft XPS Document Writer required for printer Microsoft XPS Document Writer is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver Canon MF5700 Series required for printer Canon MF5700 Series is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 TermServDevices Error None 1111 N/A MSNODE2 Driver hp LaserJet 1000 required for printer !!C528!hp LaserJet 1000 is unknown. Contact the administrator to install the driver before you log in again.
23.08.2007 10:57:23 Print Information None 2 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was created.
23.08.2007 10:57:22 Print Information None 9 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was set.
23.08.2007 10:57:57 ClusSvc Information Event Logger 1202 N/A MSNODE1 The time delta between node MSNODE1 and node MSNODE2 is 269397841(in 100 nanosecs).
23.08.2007 10:26:26 Print Warning None 3 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was deleted.
23.08.2007 10:26:26 Print Warning None 4 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 is pending deletion.
23.08.2007 10:26:26 Print Warning None 8 NT AUTHORITY\SYSTEM MSNODE2 Printer HP LaserJet 2100 PCL6 on c634.local.polad.ru (from N486) in session 1 was purged.
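
The two ClusSvc event 1202 entries above report the time delta between the nodes in units of 100 nanoseconds, which is hard to read at a glance. A small conversion sketch follows; the helper name `delta_to_seconds` is hypothetical, and the two values are copied from the log entries. The log alone does not show whether the larger delta is related to the failure.

```python
def delta_to_seconds(delta_100ns: int) -> float:
    """Convert a value expressed in 100-nanosecond units to seconds."""
    # 1 second = 10,000,000 units of 100 ns.
    return delta_100ns / 10_000_000


for raw in (1751833, 269397841):
    print(f"{raw} x 100 ns = {delta_to_seconds(raw):.3f} s")
# 1751833 x 100 ns = 0.175 s
# 269397841 x 100 ns = 26.940 s
```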

Application event log (because I use the Russian edition of SQL Server 2005, some events contain Russian text; the English translation is given in brackets):

23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] checkODBCConnectError: sqlstate = 08001; native error = 0; message = [Microsoft][SQL Native Client]Не удается завершить вход в систему из-за задержки при открытии соединения с сервером (Unable to complete login process due to delay in opening server connection)
"
23.08.2007 10:59:36 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] ODBC sqldriverconnect failed
"
23.08.2007 10:58:51 Application Hang Error (101) 1002 N/A MSNODE2 Hanging application cluadmin.exe, version 5.2.3790.3959, hang module hungapp, version 0.0.0.0, hang address 0x00000000.
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:59 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:58 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:57 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:55 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:54 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 0; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:53 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] OnlineThread: QP is not online.
"
23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Ошибка связи (Communication link failure)
"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] printODBCError: sqlstate = 08S01; native error = 2746; message = [Microsoft][SQL Native Client]Поставщик TCP (TCP Provider): An existing connection was forcibly closed by the remote host.

"
23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1 "[sqsrvres] CheckQueryProcessorAlive: sqlexecdirect failed
"

I need help!

-

Andrew Mishechkin

ICQ: 101861332

|||

Thanks for the logs.

The application log shows a connection error, which then escalates to a ClusSvc error in the system log. Next, I would suggest checking for faulty hardware (e.g. NICs). I've seen strange, hard-to-diagnose issues caused by faulty hardware.
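
The escalation described above can be timed directly from the posted logs. The sketch below parses a few entries and computes how long the connection errors lasted before the cluster service marked the resource as failed. The three sample lines are trimmed copies of entries from the 23.08.2007 logs; the helper names are hypothetical, and the source is simply taken as the third whitespace-separated field, which is enough for the single-word sources used here.

```python
from datetime import datetime

# Trimmed copies of three entries from the 23.08.2007 logs posted above.
LOG_LINES = [
    "23.08.2007 10:59:57 ClusSvc Error Failover Mgr 1069 N/A MSNODE1",
    "23.08.2007 10:57:51 MSSQLSERVER Error (3) 19019 N/A MSNODE1",
    "23.08.2007 10:57:50 MSSQLSERVER Error (3) 19019 N/A MSNODE1",
]


def parse_event(line: str) -> tuple[datetime, str]:
    """Split a log line into (timestamp, source)."""
    date_part, time_part, source = line.split()[:3]
    stamp = datetime.strptime(f"{date_part} {time_part}", "%d.%m.%Y %H:%M:%S")
    return stamp, source


def seconds_until_failover(lines: list[str]) -> float:
    """Seconds from the first MSSQLSERVER error to the next ClusSvc event."""
    events = sorted(parse_event(line) for line in lines)
    first_sql = next(t for t, src in events if src == "MSSQLSERVER")
    first_clus = next(t for t, src in events if src == "ClusSvc" and t >= first_sql)
    return (first_clus - first_sql).total_seconds()


print(seconds_until_failover(LOG_LINES))  # 127.0
```

For these entries, the first 19019 error (10:57:50) precedes the ClusSvc 1069 event (10:59:57) by 127 seconds.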

|||We also have the exact same error, and we are using an HP DL 585 G2 running Win2k3 SP1. Are there any steps I can take to isolate it further (tools for diagnostics)?

thanks,
Tony B|||

Hello,

I have the same problem with a SQL 2005 SP2 cluster. I get several SQL 2005 errors in the application log, and a little later an error in the system log about the cluster service.
This is the cluster:

*clusternode1: HP DL50G5. 4 P dual core with HTT. W2003 R2 EE SP2 x64.
*clusternode2: the same as clusternode1.


I tried to solve the problem by taking these actions:

*Disable the ilo driver.
*Update NICs drivers.
*Create a DWORD SynAttackProtect with value 0.
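
The last step in the list above can be written down as a `.reg` file so that it is applied identically on both nodes. The post does not say where the value was created; the sketch below assumes the usual TCP/IP parameters key, `HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\Tcpip\Parameters`, and the helper name `reg_file_for_dword` is hypothetical. It only generates the file text and changes nothing on the machine.

```python
# Assumed location of the value; the thread does not state the key explicitly.
TCPIP_PARAMETERS_KEY = r"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\Tcpip\Parameters"


def reg_file_for_dword(key: str, name: str, value: int) -> str:
    """Render .reg file text that sets one DWORD value under the given key."""
    if not 0 <= value <= 0xFFFFFFFF:
        raise ValueError("a DWORD must fit in 32 bits")
    return (
        "Windows Registry Editor Version 5.00\r\n"
        "\r\n"
        f"[{key}]\r\n"
        f'"{name}"=dword:{value:08x}\r\n'
    )


print(reg_file_for_dword(TCPIP_PARAMETERS_KEY, "SynAttackProtect", 0))
```

The printed text can be saved as a `.reg` file and imported on each node. A change to this value may need a restart before it takes effect.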


Any ideas on how to solve the problem?

Thanks
pablo
