Channel: High Availability (Clustering) forum
Viewing all 3614 articles

Network Load Balancing: trouble adding another virtual machine to the cluster

I have Windows Server 2008 R2 Hyper-V with two virtual machines, each with one network adapter (first VM: IP 192.168.99.36, GW 192.168.99.1; second VM: IP 192.168.99.38, GW 192.168.99.1; NLB cluster IP 192.168.99.40, GW 192.168.99.1). I installed NLB on both VMs with 'Enable spoofing of MAC addresses' turned on. NLB is in unicast mode. I added the first VM to the NLB cluster without problems and the cluster works, but when I add the second VM, that VM loses network connectivity and the NLB cluster stops working. How can I resolve this problem?

Windows Failover Clustered IP question

Hello,

Over this past weekend, I created a test environment in my home lab. It contains a 2-node Windows Server 2016 failover cluster, with SQL Server 2017 Developer Edition installed on each node and one AlwaysOn Availability Group created and configured. There are actually three Windows Server 2016 machines in this test lab: one domain controller and the two clustered nodes. Here's my question:

While creating the 2-node WSFC, I remember a screen asking me to enter the Windows Server cluster IP, so I provided one. While creating the AlwaysOn Availability Group, there was also a screen asking for the AG name and an IP, so I again provided a static IP, different from the WSFC IP. Once everything was set up and running, I was able to connect to the SQL Server 2017 instance using only the AG cluster name, which is the whole point of an HA environment. So in SSMS, I connect to the AG using only the AG cluster name. Everything works beautifully, even after a manual failover to my secondary replica. My question is: what is the static IP for the WSFC used for, the one you are asked for when you set up the WSFC? When do I use it?

Thanks

Exchange Cluster Service offline

Hi all,

We have Exchange Server 2013 running a DAG. We have an issue with the cluster service because the cluster name will not come online. The cluster event log shows the error below:
"Cluster network name resource 'Cluster Name' cannot be brought online. Ensure that the network adapters for dependent IP address resources have access to at least one DNS server. Alternatively, enable NetBIOS for dependent IP addresses."

What we have tried: removing the cluster's A record from DNS, recreating it with full permissions, and setting the option that allows any authenticated user to update DNS.

The computer object (CNO) for the cluster is now online.

Any suggestions?

Cannot connect to the cluster name: You do not have administrative privileges on the cluster, contact your network administrator to request access. Error Code: 0x80070005 Access is Denied

Cannot connect to the cluster name.
You do not have administrative privileges on the cluster; contact your network administrator to request access.

Error Code: 0x80070005
Access is Denied

Screenshot attached.

We are facing this issue on both nodes. Kindly help us fix it.
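
For reference, the error code in this message is a standard Windows HRESULT, and its fields can be unpacked by hand. The sketch below is only an illustration in Python (the function name `decode_hresult` is made up for this example). It splits 0x80070005 into its failure bit, facility, and code, which gives facility 7 (Win32) and code 5, the Win32 "access denied" error. It says nothing about which permission is actually missing on the cluster.

```python
def decode_hresult(hresult: int) -> dict:
    """Split a 32-bit HRESULT into its standard fields (illustrative helper)."""
    return {
        "failure": bool((hresult >> 31) & 0x1),  # top bit set means failure
        "facility": (hresult >> 16) & 0x7FF,     # 11-bit facility field
        "code": hresult & 0xFFFF,                # low 16 bits: the error code
    }

fields = decode_hresult(0x80070005)
print(fields)
```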





2016 Hyper-V Cluster - Balancer not migrating machines when over the threshold.

Hi,

We have a 2016 Hyper-V cluster set up with the Balancer setting at Medium.  According to info online, this means it should migrate things off hosts when they hit 70% utilization.  However, this is not happening.

We applied Windows updates yesterday, so the hosts were drained for rebooting.  Today the load has not evened out.

Here are the current RAM usage listings:

Free RAM    Total RAM    Percent Used
120         256          53%
53          256          79%
42          256          84%
280         575          51%
496         575          14%

So why is the cluster not migrating things off Host 2 and 3 since they are both over the 70% utilization threshold?  Or am I misunderstanding how the balancer is supposed to work?
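
The "Percent Used" figures above follow from the free and total values, and the same arithmetic shows which hosts sit over the 70% mark. A minimal sketch in Python, assuming the rows are hosts 1 to 5 in the order listed and that the Medium setting compares against 70% as described in the post (the balancer's real decision logic is not reproduced here):

```python
# (free RAM, total RAM) per host, in the order listed in the post
hosts = [(120, 256), (53, 256), (42, 256), (280, 575), (496, 575)]
THRESHOLD = 70  # percent; the post's reading of the Medium setting (assumption)

def percent_used(free, total):
    """Used memory as a whole-number percentage of total."""
    return round((total - free) / total * 100)

# Host numbers start at 1 to match "Host 2 and 3" in the question
usage = {n: percent_used(f, t) for n, (f, t) in enumerate(hosts, start=1)}
over = [n for n, pct in usage.items() if pct > THRESHOLD]

print(usage)
print("Over threshold:", over)
```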

Thanks!

VM Cluster

I was reading the Microsoft documentation on Failover Clustering: https://docs.microsoft.com/en-us/previous-versions/windows/it-pro/windows-server-2012-R2-and-2012/dn265972(v%3dws.11)

There is a note that I don't understand:

"Make sure that there is not more than one virtual machine in a virtual machine clustered role. Starting with Windows Server 2012, we do not support this configuration. An example of this scenario is where multiple virtual machines have files on a common physical disk that is not part of Cluster Shared Volumes. A single virtual machine per clustered role improves the management experience and the functionality of virtual machines in a clustered environment, such as virtual machine mobility."

Can anyone elaborate on this for me, please?

tia,

gix



Can Cluster-Aware Updating be configured to accept that certain virtual machines, which are OFF, should be left on their current node during update run?

I have a few Hyper-V virtual machines in a 3-node Windows Server 2016 Failover Cluster, where I do not want to move them from their current Node.

I just want to shut them down manually before any maintenance. But neither CAU nor Failover Cluster Manager's node Pause/Drain seems willing to accept that they should be left where they are in their Off state.

Is there a way I can tell CAU or the Failover Cluster to leave them where they are?

I know that I can Choose Pause/No-Drain in Failover Cluster Manager, but there does not seem to be a similar option in CAU.

Windows Failover Cluster Best Practice - Should the Cluster Name and Resource Name be part of an OU where baseline GPOs are applied?

Hi all,

What is the best practice for a Windows failover cluster?

Should the Cluster Name and Resource Name be part of an OU where baseline GPOs are applied, or not?


Taking > 5 Minutes to Connect to SQL Server from Remote Computers After a Fail-over

Hi, all.

I want to bring this up front before I begin.  I am, by no means, an expert at clustering, or SQL Server.  However, I want to try and figure out why it takes so long for our tools to connect to the database server after a fail over.

Background:

OS: Server 2008

SQL Server 2005

2-node cluster with a NetApp SAN (connected to each node over iSCSI)

Observations:

  • Happens whether the fail-over is REAL or simulated (simulated being through the cluster manager).
  • When a fail-over occurs, all services go offline and then come back online in less than a minute (~45-60 seconds).
  • The active node does switch successfully when a fail-over occurs.
  • I can see the cluster storage / SAN (storage drive, Quorum, etc.) on the active node.
  • I am able to access the database locally, on either node, using Enterprise Management Studio.
  • What I can't do is access the database from another computer outside the network for a certain period of time.
  • The time that the database is unavailable can range from 6 minutes up to 13 minutes.  However, I think the time differences here are between simulated fail-over events (cluster manager) and forced fail-over events (shutting down the active node).
  • Using NETSTAT -A, I could not see anything as far as lost packets or collisions.
  • Ping test is fine from server to clients.
  • All clients are outside of the subnet, but within the factory network.
  • When the clients are connected to the server's database, there are no issues with connectivity.
  • This "pause" in DB connection from outside clients was not happening when it was first installed roughly a year ago (so I am GUESSING it's not a setup or configuration issue).

I can't just assume this is a network issue because my equipment is always "guilty until proven innocent."  At least in the IT department's eyes.. :), so I want to exhaust all possibilities that this could be an issue with the server itself.

Can anyone think of any more tests to try to rule out the server?  I'm all ears.  Please remember, I am not an expert, so sorry if I ask how to do half of the actions you suggest... :)

Thank you so much for any suggestions!

Best Regards,

Bill

Storage Spaces Direct does not reclaim pool disk space for a removed virtual disk

We have a Server 2016 (CU8) 2-node S2D Hyper-V cluster with several cluster shared volumes. Four of them have been decommissioned and deleted from the storage pool, including Remove-VirtualDisk for the detached objects. The pool shows the correct amount of free space (capacity volume). However, while the space of the first three virtual disks was given back for new allocation, the fourth virtual disk has been deleted from a PowerShell perspective, but the storage pool does not offer the freed space for allocating a new virtual disk, so it is essentially still allocating the blocks somewhere in the background. There is a discrepancy between the free space displayed in the storage pool view and what the virtual disk wizard offers for volume creation.

Any ideas how to clean up the storage pool to reclaim the dead disk space?

thank you!

Thorsten

Solution to configure the Hyper V Node in HA without Failover Cluster

Currently, we have clustered Hyper-V nodes in multiple data centers with iSCSI storage. The failover cluster causes repeated platform downtime even when only a single VM fails to respond to the cluster. We have disabled heartbeat monitoring for the VMs in the cluster, but we are still facing the issue.

Is there any way to configure the Hyper-V node VMs for HA without a failover cluster?


Expanded Disk using Diskpart, now cluster shared volume shows Failed to get the volume number \\&globalroot\device\***

I have a Nimble storage array that I expanded.

I then ran diskpart on the server to expand the disk.

When I go into Cluster Manager > Storage, the disk size is reported correctly, but the volume does not show the correct size.

I also have two error messages: 1069 (Resource Control Manager) and 5150 (Cluster Shared Volume).

Unable to create a new database, or restore an existing database on SQL 2014 FCI

Hello Technet Support,

We recently built a SQL Server 2014 Standard Edition Failover Cluster Instance consisting of only two nodes, running on Windows Server 2016.

When we try to create a new database or restore an existing one, the operation fails with the errors below.

Please assist.

Thank you, Anand


Anand Franklin

3-node HA cluster with 2 CSVs: can only 2 members use one CSV?

Hi, I'm going through a planning stage at the moment and wondered about the following. We have a 3-node cluster with one CSV, which all members access via iSCSI (if that matters). I want to add a second CSV, but only want 2 members to access it (via a direct SAS connection). Is this possible?

S2D Reserve Capacity

Hi,

We are starting to implement S2D in our environment. Can anyone confirm how the reserve capacity thing works? Our nodes have 20 capacity disks so I want at least 2 disks worth in each node for reserve.

How do I do that? Is it literally just a case of not filling up the pool to the amount you want? I.e:

20 x 600 GB disks per node (+ 4 x 800 GB SSD for cache), 2 nodes = 24 TB total physical storage
minus 4 x 600 GB for reserve = 21.6 TB
divided by 2 (2 nodes, 2-way mirror) = 10.8 TB usable

So don't fill up the pool more than 10.8TB and you'll have 4 disks worth of reserve capacity?

Is that how it works?
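
The arithmetic in this question can be written out as a quick check. This is only a sketch of the poster's own calculation in Python, assuming 2 nodes with 20 capacity disks each, decimal units (1 TB = 1000 GB), and that the cache SSDs do not count toward capacity. It does not show how S2D itself accounts for reserve capacity.

```python
# All values come from the post; the variable names are illustrative.
NODES = 2
DISKS_PER_NODE = 20
DISK_GB = 600
RESERVE_DISKS_PER_NODE = 2   # "at least 2 disks worth in each node"
MIRROR_COPIES = 2            # 2-way mirror keeps two copies of the data

total_gb = NODES * DISKS_PER_NODE * DISK_GB            # raw capacity
reserve_gb = NODES * RESERVE_DISKS_PER_NODE * DISK_GB  # left unallocated
after_reserve_gb = total_gb - reserve_gb
usable_gb = after_reserve_gb // MIRROR_COPIES          # after mirroring

print(f"Total:         {total_gb / 1000:.1f} TB")
print(f"After reserve: {after_reserve_gb / 1000:.1f} TB")
print(f"Usable:        {usable_gb / 1000:.1f} TB")
```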

Every time I read about setting reserve capacity I keep wanting to actually mark a physical disk as a reserve disk, like in the traditional hardware-RAID world.

Thanks



SCSI disk doesn't reconnect when failover cluster is mounted

Before creating the failover cluster, I created my storage LUNs for the VMs and the quorum. But when I create the cluster, the SCSI disks do not become available to it, and in the SCSI target connection they stay in the "reconnecting" status.

What should I do to make this storage available in the cluster?

IIS Role Based Failover Clustering - Failover Issue

Hi All,

I have deployed Windows failover clustering and added two nodes to it. Now we have a requirement to add role-based failover for IIS. I tried adding the WWW service as a role, but the result is not as expected: failover is not happening.

I have also downloaded a failover script for websites from TechNet and implemented it via Generic Script, but the site still does not fail over.

Can someone help me out with this?

Regards,

Ramesh k


[Oracle, MSCS] Service is not brought to another node when service is down

Hello, all

I deployed an environment in which two Oracle instances run on a Microsoft Cluster (HA).

The weird thing is that when I shut down the active node, or move the service to another node manually, both work fine.

However, when the service for one Oracle instance is stopped in services.msc, failover does not work.

The service is never moved to the other node.

When I first configured this environment, I wanted to ensure that HA was possible even if either of the Oracle instances failed.

When I look at the cluster events, I see event ID 1196 and event ID 1069.

I looked at DNS, referring to other technical documents, but did not get a clear answer. I would like to get a helpful opinion.

Thanks.

1090 & 7024 - adding new node

Hello,

How are you? I have created a new cluster on Hyper-V Server 2016. At the moment it has only one node.

When I try to add the second one, the following errors appear:

Event ID 1090:

The Cluster service cannot be started. An attempt to read configuration data from the Windows registry failed with error '2'. Please use the Failover Cluster Management snap-in to ensure that this machine is a member of a cluster. If you intend to add this machine to an existing cluster use the Add Node Wizard. Alternatively, if this machine has been configured as a member of a cluster, it will be necessary to restore the missing configuration data that is necessary for the Cluster Service to identify that it is a member of a cluster. Perform a System State Restore of this machine in order to restore the configuration data.

Event ID 7024:

The Cluster Service service terminated with the following service-specific error: 
The system cannot find the file specified.

Event ID 7031:

The Cluster Service service terminated unexpectedly.  It has done this 67 time(s).  The following corrective action will be taken in 15000 milliseconds: Restart the service.

On this second node, the "Cluster Service" appears as disabled. I tried starting it manually, and that also failed.

I removed the Failover Clustering feature, restarted, and installed it again, but it failed too.

Any ideas?

Thanks in advance

Regards

Windows 2016 file server stretch cluster - error 0x80071398 when moving to different node

Hi,

We want to deploy a highly available file server with automatic failover between two sites, and we followed the instructions from here:

https://docs.microsoft.com/en-us/windows-server/storage/storage-replica/stretch-cluster-replication-using-shared-storage

The only difference is that we have only one server in each site.

There were no issues during the configuration, but when we try to fail back or move the node to the other site, we get the following error:

"Error Code: 0x80071398 The operation failed because either the specified cluster node is not the owner of the group, or the node is not a possible owner of the group"

We have checked the possible owners and it all looks good. We even ran the following commands to make sure that the permissions are set correctly:

Get-ClusterResource | Set-ClusterOwnerNode Server1,Server2

Get-ClusterGroup | Set-ClusterOwnerNode Server1,Server2

We also tried evicting the second node and adding again to the cluster but still no luck.

Any help would be greatly appreciated.

