Channel: High Availability (Clustering) forum
Viewing all 3614 articles

Issues with MSMQ over HTTP in Windows Cluster - Not Working

Background:

We have set up a cluster 'net_cluster' and configured a Message Queuing service 'net_clusterMsmq' in it. Please refer to the screenshot below for the cluster configuration. There are two physical servers in the cluster. The screenshot shows two virtual IP addresses, each of which points to one of the physical servers. We have created non-transactional private queues on the physical servers.



Please see this image https://social.technet.microsoft.com/Forums/getfile/668427

net_clusterMsmq has two Virtual IPs,  VIP1 and VIP2 which point to .NET1 and .NET2 respectively. Currently .NET1 is up and running. 

VIP1 ----> PIP1 (.NET1) (PIP = Physical IP)

VIP2 ----> PIP2 (.NET2)



Please see this image https://social.technet.microsoft.com/Forums/getfile/668451

H$ - This is the storage drive attached to whichever .NET server is currently active. It contains msmq\storage and msmq\mapping.



Issue

NOTE: Everything works, and has worked well for a long time, when I use OS:\net_clusterMsmq. The problem starts when I use HTTP.

We are having issues with MSMQ over HTTP in a Windows Cluster environment. When the app sends a message to the queue using HTTP:

  1. Outgoing Queues on the IIS (Web App) server shows the referenced queue in the "Waiting to Connect" state, with "Connection is ready to transfer messages" in the Connection History column. The queued messages stay stuck forever.
  2. IIS logs on the .NET1 server show:
    [VIP1 here] POST /msmq/private$/queuename - 80 - [IIS - Web app server IP here] - 200 0 0 46
    This shows that the POST request from the Web App IIS server was received by the .NET1 server with status 200. However, the s-ip field contains the virtual IP (VIP1) that points to the .NET1 server, because we send requests via the cluster node.


Below is what I have checked/tried so far with no luck:

  1. Checked if port 1801 is listening - Yes
  2. Modified the sample_map.xml file on H$ (the storage drive attached to the active .NET server) as well as in C:\Windows\System32\msmq\mapping and restarted the MSMQ service, but it didn't help.
    I did this because I found a blog post stating that the message request reaches the MSMQ server, but the local queue manager does not recognize the virtual IP (VIP1) and looks for the physical IP (PIP1) in the received message. Since it does not find it, it discards the message.
  3. Added ANONYMOUS LOGON with full rights to the destination queue on the .NET1 server.


NOTE: MSMQ over HTTP works fine in a non-cluster environment, so this is definitely a cluster-specific issue.






2012 R2 Guest Cluster Network Failure

We have a 2 node guest cluster (2012 R2) using a shared VHDX located on a CSV providing resilient File Server services and it seems to work well....most of the time.

We've had two instances lately where the File Server cluster has failed due to "network" issues, where the nodes cannot see each other and both are removed from the cluster. The problem is that we can't see any other VM reporting networking problems at the same time.

We did some firmware and driver updates on the physical nodes recently to resolve a known problem with VMQs and thought that our problems were solved. Unfortunately, we had a recurrence of the problem this morning, so we seem to be back to the drawing board.

Has anyone else had similar problems with Guest Clusters in 2012 R2?

Cheers for now

Russell

Making NLB highly available

I get how to use NLB to cluster services such as web, ftp, etc. and I've successfully set it up before.  I'm now setting up an ADFS farm and I got to wondering how do I make NLB itself highly available?  If my lone NLB server were to fail, I lose access to all the clustered services.  I've googled around for the best answer, but the results are for how to create clusters the NLB will manage.

I am thinking the best answer is to failover cluster my NLB server?  When I search for failover cluster nlb, I just get results talking about the difference between the two types of clustering.  Can you failover cluster NLB?  If so, how?  If not, what is the proper way to make NLB highly available?

Thanks for your help in advance.

Failover Cluster on Server 2012 r2

Hi

Does anyone know whether Failover Clustering is available on Windows Server 2012 R2 Standard?

Thank you!

CNO, VCO clustering in 2012 R2

I built a cluster and the CNO was created, but the VCO was not, and the CNO has not been granted the permissions it needs to create objects. Because I will create a listener for SQL Server HA, my question is: how can I automate granting those permissions when the CNO is created, so that I do not have to go through the manual process of adding CREATE OBJECT, READ, etc. to the CNO?

Hi, Is it Possible to Cluster Forest Trust Servers?

Hi, I am having a problem trying to cluster 2 servers from different domains. I have both of them in SCVMM as well, but I was wondering whether it is possible to cluster them too.

Hope to hear good news from someone.

Thanks a lot.

Sharepoint Website responding very slow, using windows server 2012 Network Load Balancing

Hello Team,

Greetings for the day!

I have 2 Windows Server 2012 servers with the Network Load Balancing role enabled, and SharePoint 2010 R2 (SharePoint farm) is installed on both of them. I have enabled Network Load Balancing, with a total of 5 IPs assigned between those 2 servers.

The SharePoint site is sometimes very slow to respond (1 min 30 sec) and sometimes responds very quickly (10 sec).

I have checked the performance of both servers, which is more than good.

I am not sure how to troubleshoot this further, and I am also not sure how to check which server a request is going to.

Please help me get out of this situation and optimize the performance.


Paresh Jain

Creating a VM checkpoint on file share storage takes a long time

Hi,

Why does creating a VM checkpoint take so long (~6 min) on file share storage?

The VM has 10 GB of RAM.

For the same VM on a local disk, creating a checkpoint takes <3 s.

When I reduce the VM's RAM to 1 GB, the checkpoint takes <1 min.

While the checkpoint was being created, I looked at Performance Monitor on the file share storage server and on the Hyper-V server, and disk and network activity looked the same as usual.

Thanks
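
A back-of-the-envelope check of the timings above, as a minimal sketch. It assumes (this is not confirmed in the post) that checkpoint time is dominated by writing the VM's memory state and that the saved state is roughly the size of the configured RAM. The function name and the reading of "GB" as GiB are illustrative assumptions.

```python
def effective_mib_per_s(ram_gib: float, seconds: float) -> float:
    """Effective write rate if a saved state of ram_gib GiB took `seconds`."""
    return ram_gib * 1024 / seconds


# (RAM in GiB, elapsed seconds) taken from the timings in the post.
# "<3 s" and "<1 min" are upper bounds on time, so those rates are lower bounds.
timings = {
    "file share, 10 GB RAM (~6 min)": (10, 6 * 60),
    "file share, 1 GB RAM (<1 min)": (1, 60),
    "local disk, 10 GB RAM (<3 s)": (10, 3),
}

for label, (ram_gib, seconds) in timings.items():
    rate = effective_mib_per_s(ram_gib, seconds)
    print(f"{label}: {rate:.1f} MiB/s")
```

Under those assumptions the 10 GB file share case works out to roughly 28 MiB/s, which would fit the observation that neither disk nor network looked busy during the checkpoint.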


The cluster Resource Hosting Subsystem (RHS) process was terminated and will be restarted

Server 2012 R2 on beefy PowerEdge R720 dual cluster

CAU worked fine on my cluster a long time ago, when there were only a few simple VMs.

As the setup got a bit more complicated (more VMs with more disks, more backend iSCSI storage, some VMs in a cluster setup), CAU just does not work reliably at all.

So this time I was doing the hosts' WU by hand (migrated VMs to Host A, updated Host B, restarted, updated again as some updates failed, restarted fresh again, Resume/Do not fail back roles, to be on the safe side...).

Then I selected some running VMs on Host A and live migrated them back to Host B... at which point Host B (the one just updated and freshly rebooted) threw a fit and killed all the VMs that were selected to be moved to it...

"The cluster Resource Hosting Subsystem (RHS) process was terminated and will be restarted. This is typically associated with cluster health detection and recovery of a resource. Refer to the System event log to determine which resource and resource DLL is causing the issue."

The cluster service did restart and did bring the machines up, but that is less than user friendly; I had to select them one more time to migrate (and this time they did).

But the whole solution just does not feel like an Enterprise product (in fact it would not be acceptable even for home use).

I do not expect any miracle solution, but if anybody has any experience to chip in, it would be appreciated.

Seb

Processor Information for Lizard

I've just put together a small cluster, with all nodes using Intel E5-2665 CPUs and all running Server 2012 R2. I'm trying to benchmark it using Lizard, but it is asking for information on the CPU.

"To calculate the theoretical efficiency of your HPC cluster, Lizard needs to determine the number of floating-point operations per clock cycle that each core in the processor of the head node of your HPC cluster is capable of performing, and uses that information as a reference for the rest of the compute nodes in your cluster. If Lizard cannot automatically determine this information about the processor, you will be required to provide it."

When I look at the Microsoft help page (https://msdn.microsoft.com/en-us/library/ee146526(v=ws.10).aspx), it just says to look at the manufacturer's website. I've done that and can't see the information I need.

Does anyone know where to look for the necessary info please?

Many thanks.
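
For what it is worth, the figure Lizard asks for feeds a simple peak-performance formula: cores x clock x floating-point operations per cycle. A minimal sketch, assuming the E5-2665 is an 8-core, 2.4 GHz Sandy Bridge part whose AVX units can retire 8 double-precision operations per core per cycle (one 4-wide add plus one 4-wide multiply). Those three values are assumptions to be checked against Intel's specifications, and the function name is illustrative.

```python
def peak_gflops(cores: int, clock_ghz: float, flops_per_cycle: int) -> float:
    """Theoretical peak in GFLOPS: cores x clock (GHz) x FLOPs per cycle."""
    return cores * clock_ghz * flops_per_cycle


# Assumed values for one Intel E5-2665 socket; verify before relying on them.
CORES_PER_SOCKET = 8   # assumption: 8 physical cores
BASE_CLOCK_GHZ = 2.4   # assumption: base clock, ignoring Turbo Boost
FLOPS_PER_CYCLE = 8    # assumption: AVX, double precision, add + multiply

per_socket = peak_gflops(CORES_PER_SOCKET, BASE_CLOCK_GHZ, FLOPS_PER_CYCLE)
print(f"Theoretical peak per socket: {per_socket:.1f} GFLOPS")
```

The value to enter in Lizard would then be the per-core, per-cycle figure (8 under these assumptions), not the GFLOPS total.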

Node being removed from Windows Server 2012 R2 cluster

Windows Server 2012 R2 multi-site cluster with 5 nodes.  Node 1 (at the main site) fails multiple times a day with the same issue in the cluster log as shown below.  All the other nodes log missed heartbeats.  If it were latency I would expect all nodes to fail at one point or another, but node1 is the only one that drops out.  The servers are all the same model with the same drivers.  The switch ports show no errors.  I see no dropped UDP packets in perfmon.  I have checked everything in the following blog: http://blogs.technet.com/b/askcore/archive/2012/02/08/having-a-problem-with-nodes-being-removed-from-active-failover-cluster-membership.aspx and this one: http://blogs.technet.com/b/askcore/archive/2012/07/09/having-a-problem-with-nodes-being-removed-from-active-failover-cluster-membership.aspx.

I do not want to change the cluster delay and threshold values except as a last resort.  The ping times between sites are down in the 3-5 ms range even during the issue, which is well below the recommended range.

Does anyone know what causes the failure that seems to kick off the issue?
"Failed to retrieve the results of overlapped I/O: (10054)"


Node1:
00000284.00002498::2015/06/15-05:58:35.919 DBG   [CHANNEL 169.254.x.x:~3343~]/recv: Failed to retrieve the results of overlapped I/O: (10054)
00000284.00002498::2015/06/15-05:58:35.919 DBG   [CHANNEL 169.254.x.x:~3343~] Closing due to error: (0).
00000284.00002498::2015/06/15-05:58:35.919 DBG   [CHANNEL 169.254.x.x:~3343~] Close().
00000284.00002498::2015/06/15-05:58:35.919 WARN  [CHANNEL 169.254.x.x:~3343~] failure, status (0)

All the other nodes:
00003700.000025a4::2015/06/15-05:58:33.825 DBG   [NETFTEVM] FTI NetFT event handler got event: LocalEndpoint 10.x.x.x:~3343~ has missed two consecutive heartbeats from 10.x.x.x:~3343~
00003700.000025a4::2015/06/15-05:58:33.825 DBG   [NETFTEVM] TM NetFT event handler got event: LocalEndpoint 10.x.x.x:~3343~ has missed two consecutive heartbeats from 10.x.x.x:~3343~
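
When comparing entries like these across nodes, it can help to parse the timestamps and line them up. Winsock error 10054 is WSAECONNRESET (connection reset by the peer). The sketch below parses the line layout seen in the excerpts above; the regular expression is inferred from those sample lines only, the reading of the leading hex pair as process and thread IDs is an assumption, and the function name is illustrative.

```python
import re
from datetime import datetime

# Layout inferred from the sample lines: <hex>.<hex>::<timestamp> <LEVEL> <message>
LINE_RE = re.compile(
    r"^(?P<pid>[0-9a-f]{8})\.(?P<tid>[0-9a-f]{8})::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<level>[A-Z]+)\s+(?P<msg>.*)$"
)


def parse_line(line):
    """Return a dict for one cluster log line, or None if it does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    return {
        "pid": int(m["pid"], 16),  # assumed to be the process ID in hex
        "tid": int(m["tid"], 16),  # assumed to be the thread ID in hex
        "time": datetime.strptime(m["ts"], "%Y/%m/%d-%H:%M:%S.%f"),
        "level": m["level"],
        "message": m["msg"],
    }


# Two lines copied from the excerpts above.
node1 = parse_line(
    "00000284.00002498::2015/06/15-05:58:35.919 DBG   "
    "[CHANNEL 169.254.x.x:~3343~]/recv: Failed to retrieve the results "
    "of overlapped I/O: (10054)"
)
others = parse_line(
    "00003700.000025a4::2015/06/15-05:58:33.825 DBG   [NETFTEVM] FTI NetFT "
    "event handler got event: LocalEndpoint 10.x.x.x:~3343~ has missed two "
    "consecutive heartbeats from 10.x.x.x:~3343~"
)

gap = (node1["time"] - others["time"]).total_seconds()
print(f"Missed heartbeats were logged {gap:.3f} s before the reset on Node1")
```

In these excerpts the other nodes log the missed heartbeats about two seconds before Node1 logs the connection reset, assuming the node clocks are in sync.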

Network Configuration for SOFS & Hyper-V Cluster

Hello,

Nice opportunity for a good laugh here..potentially.

Looking to deploy an SOFS and HV cluster over the coming summer. Going through the network configuration and have come up with the following (link to image below). Each cluster is comprised of 2 nodes, so nothing too heavy on the network side of things. 

SOFS cluster will utilise SMB 3.0 app shares, on a JBOD.

Would welcome suggestions/bad mistakes/judgement calls..

Network Configuration

Live Migration of Server 2012 R2 Remote Desktop Server Disconnects Users

Hi. I have a 2 node Server 2012 R2 failover cluster and amongst the services running on this cluster is a Server 2012 R2 remote desktop server (this has the session host, web access, licensing and connection broker roles). When I move the 2012 R2 remote desktop server to the other cluster node using live migration, users are disconnected from RDS and get an error to say "failed to reconnect session". Users can immediately, manually reinitiate their connection to the server.

I have other 2012 R2 servers running on this cluster, which don't have the remote desktop services roles installed, and these servers do not exhibit the same behaviour of disconnecting users after a live migration.

Have found only a few references to similar issues on a couple of other TechNet discussions, but no resolution etc...

I'm guessing this is not expected behaviour and there is some kind of configuration issue somewhere?

no disks suitable for cluster disks were found

I have a 2 node cluster and I added a 50GB volume to both nodes, but in Failover Cluster Manager the disk does not show up when I try to add disks.

I can see the disk in Disk Management.

print cluster - want to forward the spool file?

Hi

Can anyone give me an idea? I'm really not very experienced. We have a two-node print cluster running on Windows Server 2008, and we want to forward the spool file on to another printer. The spool file forwarder program we have runs as an application. Where would we need to install it, and which files (and where) would we need to monitor?

thanks :)


VSS and Failover Clustering

Hi,

I have a 4-node physical cluster.  External storage is fiber attached to an HP disk array.

The cluster runs about 25 virtual machines.  All the VMs are based on CSVs.

I'm trying to understand where VSS puts its snapshots when it is dealing with a failover cluster with VMs.

Does each VM use part of its virtual disks to store the VSS snapshot, or does the cluster snapshot each VM and add the VSS snapshot storage to the CSVs?

thanks for any help!

Mark

Need step-by-step instructions to install a SQL Server 2012 failover cluster

We need to install a SQL Server 2012 failover cluster within the next 12 hrs. If someone could send us step-by-step instructions for it, that would be a great help.

I am new to SQL cluster installation. We need help configuring MSDTC with the cluster.

We are running Windows Server 2012.

 

How to retrieve the groupcomponent information from MSCluster_ClusterSharedVolumeToPartition

I'm trying to find the CSV information by using the below WQL query. I pass in disk partition information to fetch the CSV information.

$csv = Get-WmiObject -Namespace "root/MSCluster" -Query "SELECT GroupComponent FROM MSCluster_ClusterSharedVolumeToPartition where PartComponent='MSCluster_DiskPartition.Name=`"C:\ClusterStorage\Volume1`"'"

Write-Host $csv.Name
Write-Host $csv.VolumeName


But it doesn't seem to be working :(




File cluster migration from Windows 2003 to 2008

Hi,

We need to migrate a file cluster from Windows Server 2003 to Windows Server 2008.

How can I do that?

Cluster Aware Updating, WSUS, and SCCM

Howdy,

Wasn't sure where to post this so hopefully it works OK here.

We use SCCM to handle pushing patches to all of our workstations and servers.  However, we don't have it push to Clustered machines since I don't think it supports Cluster Updating.  All of our machines are set to use the SCCM server as their WSUS server but we don't actually approve anything there since SCCM takes care of that for us.

So, what is the best way to go about patching our clustered machines?  Do I need to have a 2nd WSUS server or can I use the SCCM server?

Here's what I'm thinking so please let me know what I'm missing.

Option 1: Approve updates on the SCCM Server's WSUS program but set all workstations and servers to never check for updates so they don't get pushed out at all.  This should not affect SCCM so everything should patch how it works now.  However, I could then run CAU and I'm guessing that it would see the approved updates and compare those to the servers and know what they need and would patch them up properly?

Option 2: Have a second WSUS server and only point the Clustered Servers at it.  Then I could run CAU against this server and it should work normally.  Only problem would be figuring out how I could make sure the same updates were being installed from SCCM and WSUS so all the servers were in sync.

Option 3: I've seen some pretty complicated scripting methods to accomplish Cluster Updating via SCCM using Orchestrator or other things.  I don't know enough about those to tell whether they are easy to set up and get working, but since they bypass WSUS I guess they'd be a way of keeping everything in sync.

Are there any other options or would any of these just not work?  I'm guessing #1 would be a No since I generally see people saying to not touch WSUS on the SCCM server or bad things can happen.

Thanks!
