Troubleshooting
02 APR 2020
VMware vSphere 7.0
VMware vSAN 7.0
vSAN Monitoring and Troubleshooting
You can find the most up-to-date technical documentation on the VMware website at:
https://docs.vmware.com/
docfeedback@vmware.com
VMware, Inc.
3401 Hillview Ave.
Palo Alto, CA 94304
www.vmware.com
Copyright © 2018-2020 VMware, Inc. All rights reserved. Copyright and trademark information.
Contents
1 Introduction to vSAN
About vSAN Monitoring and Troubleshooting
vSAN Monitoring and Troubleshooting describes how to monitor and troubleshoot VMware vSAN® by
using the vSphere Client, esxcli and RVC commands, and other tools.
Intended Audience
This manual is intended for anyone who wants to monitor vSAN operation and performance, or
troubleshoot problems with a vSAN cluster. The information in this manual is written for experienced
system administrators who are familiar with virtual machine technology and virtual datacenter operations.
This manual assumes familiarity with VMware vSphere, including VMware ESXi, vCenter Server, and the
vSphere Client.
For more information about vSAN and how to create a vSAN cluster, see the vSAN Planning and
Deployment Guide.
For more information about vSAN features and how to configure a vSAN cluster, see Administering
VMware vSAN.
1 Introduction to vSAN
VMware vSAN is a distributed layer of software that runs natively as a part of the ESXi hypervisor. vSAN
aggregates local or direct-attached capacity devices of a host cluster and creates a single storage pool
shared across all hosts in the vSAN cluster.
While supporting VMware features that require shared storage, such as HA, vMotion, and DRS, vSAN
eliminates the need for external shared storage and simplifies storage configuration and virtual machine
provisioning activities.
2 Monitoring the vSAN Cluster
You can monitor the vSAN cluster and all the objects related to it.
You can monitor all of the objects in a vSAN environment, including hosts that participate in a vSAN
cluster and the vSAN datastore. For more information about monitoring objects and storage resources in
a vSAN cluster, see the vSphere Monitoring and Performance documentation.
The cluster Summary page includes a summary of vSAN capacity. You also can view more detailed
information in the Capacity monitor.
Procedure
Results
- The Capacity Overview displays the storage capacity of the vSAN datastore, including total space,
  used space, free space, reserved space, and the space that is actually written or physically
  consumed on the vSAN disks. For clusters that have deduplication and compression enabled, you
  can view the compression savings and the compression ratio.
- The Usable capacity analysis enables you to estimate the free space available based on the storage
  policy that you select, while keeping the deduplication ratio at 1.
- The Usage breakdown before dedup and compression displays usage by category, such as VM usage,
  user objects, and system usage. You can view a graphical representation of the usage categories.
  Click the graphic to view the different usage categories.
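The usable capacity estimate can be approximated by hand. The following Python sketch divides the free raw capacity by a per-policy overhead multiplier, with the deduplication ratio kept at 1. The multipliers, the policy labels, and the function name usable_capacity_gb are assumptions made for illustration and are not taken from this guide. The Capacity monitor might account for overheads that this sketch ignores.

```python
# Raw capacity consumed per unit of VM data, by storage policy.
# Assumption: commonly cited vSAN overheads, not values from this guide.
POLICY_MULTIPLIER = {
    "RAID-1 FTT=1": 2.0,    # two full mirrors
    "RAID-1 FTT=2": 3.0,    # three full mirrors
    "RAID-5 FTT=1": 4 / 3,  # 3 data + 1 parity
    "RAID-6 FTT=2": 1.5,    # 4 data + 2 parity
}


def usable_capacity_gb(free_raw_gb: float, policy: str,
                       dedup_ratio: float = 1.0) -> float:
    """Estimate how much new VM data fits in the free raw space."""
    if free_raw_gb < 0 or dedup_ratio <= 0:
        raise ValueError("capacity must be >= 0 and dedup ratio must be > 0")
    return free_raw_gb * dedup_ratio / POLICY_MULTIPLIER[policy]


if __name__ == "__main__":
    for name in POLICY_MULTIPLIER:
        print(f"{name}: {usable_capacity_gb(1000, name):.0f} GB usable of 1000 GB free")
```

Keeping dedup_ratio at its default of 1.0 mirrors the behavior described for the Usable capacity analysis.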
Category Description
User objects Displays iSCSI objects, block container volumes that are not
attached to a VM, user-created files, ISO files, VM templates, file
shares, file container volumes, and vSAN objects used by the
vSphere Replication service at the target site.
When you enable deduplication and compression, it might take several minutes for capacity updates to
be reflected in the Capacity monitor, as disk space is reclaimed and reallocated. For more information
about deduplication and compression, see "Using Deduplication and Compression" in Administering
VMware vSAN.
You can check the history of capacity usage in the vSAN datastore. Click Capacity History, select a time
range, and click Show Results.
Procedure
3 Click Physical Disks to review all hosts, cache devices, and capacity devices in the cluster. vSAN
displays information about capacity devices, such as total capacity, used capacity, reserved capacity,
and physical location. The physical location is based on the hardware location of cache and
capacity devices on vSAN hosts.
Procedure
1 Navigate to Storage.
You can view general information about the vSAN datastore, including capacity, capabilities, and the
default storage policy.
a Click Disk Management and select the disk group to display local devices in the table at the
bottom of the page.
b Click Capacity to review information about the amount of capacity provisioned and used in the
cluster, and also to review a breakdown of the used capacity by object type or by data type.
When one or more hosts are unable to communicate with the vSAN datastore, the information about
virtual objects might not be displayed.
Procedure
3 Under vSAN, select Virtual Objects to view the corresponding virtual objects in the vSAN cluster.
a Select an object type in the Affected inventory objects area at the top of the page to display
information about each object, such as health and availability, storage policy, and vSAN UUID.
You can also view the vSphere Replication objects.
b Select the check box on one of the virtual objects and click View Placement details to open the
Physical Placement dialog box. You can view the device information, such as name, identifier or
UUID, number of devices used for each virtual machine, and how they are mirrored across hosts.
c On the Physical Placement dialog box, select the Group components by host placement check
box to organize the objects by host and by disk.
Note At the cluster level, the Container Volumes filter displays detached container volumes. To view
attached volumes, select the VM to which the container is attached.
When a hardware device, host, or network fails, or if a host is placed into maintenance mode, vSAN
initiates resynchronization in the vSAN cluster. However, vSAN might briefly wait for the failed
components to come back online before initiating resynchronization tasks.
- Editing a virtual machine (VM) storage policy. When you change VM storage policy settings, vSAN
  might initiate object recreation and subsequent resynchronization of the objects.
  Certain policy changes might cause vSAN to create another version of an object and synchronize it
  with the previous version. When the synchronization is complete, the original object is discarded.
  vSAN ensures that VMs continue to run, and resynchronization does not interrupt their operation.
  This process might require additional temporary capacity.
- Recovering hosts from a permanent or long-term failure. If a host is unavailable for more than 60
  minutes (by default), vSAN creates copies of data to recover the full policy compliance.
- Evacuating data by using the Full data migration mode before you place a host in maintenance mode.
If a VM is not responding due to latency caused by resynchronization, you can throttle the IOPS used for
resynchronization.
Prerequisites
Verify that hosts in your vSAN cluster are running ESXi 6.5 or later.
Procedure
3 Click vSAN.
The Object Repair Timer defines how long vSAN waits before repairing a non-compliant object after
a host enters a failed state or maintenance mode. The default setting is 60 minutes. To change the
setting, edit the Object Repair Timer (Configure > vSAN > Services > Advanced Options).
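The effect of the Object Repair Timer can be illustrated with a small Python sketch. It only models the waiting rule described above (repair begins after a host has been unavailable for longer than the timer). The function name repair_expected and the sample timestamps are hypothetical, and the sketch does not query vSAN.

```python
from datetime import datetime, timedelta

# Default Object Repair Timer stated in the text: 60 minutes.
DEFAULT_REPAIR_TIMER = timedelta(minutes=60)


def repair_expected(unavailable_since: datetime, now: datetime,
                    timer: timedelta = DEFAULT_REPAIR_TIMER) -> bool:
    """Return True once the host has been unavailable for longer than the timer."""
    return now - unavailable_since > timer


if __name__ == "__main__":
    failed_at = datetime(2020, 4, 2, 9, 0)  # hypothetical failure time
    for minutes in (45, 90):
        now = failed_at + timedelta(minutes=minutes)
        print(minutes, "minutes absent ->", repair_expected(failed_at, now))
```

Passing a different timer value models a cluster where the advanced option has been changed.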
You can also view the following information about the objects that are resynchronized:
Objects Description
Bytes left to resync Data (in bytes) that is remaining before the resynchronization
is complete.
Total resyncing ETA Estimated time left for the resynchronization to complete.
The objects to be resynchronized are categorized as active,
queued, and suspended. The objects that are actively
synchronizing fall in the active category. The objects that are
in the queue for resynchronization are the queued objects.
The objects that were actively synchronizing but are now in
the suspended state fall in the suspended category.
You can also view the resynchronization objects based on various filters such as Intent and Status.
Using Show first, you can modify the view to display the number of objects.
If VMs are not responding due to latency caused by resynchronization, you can throttle the number of
IOPS used for resynchronization. Consider resynchronization throttling only if latencies are rising in the
cluster due to resynchronization, or if resynchronization traffic is too high on a host.
Resynchronization throttling can increase the time required to complete resynchronization. Reprotection
of non-compliant VMs might be delayed.
Procedure
3 Click vSAN.
5 (Optional) Click Show current resync traffic per host to view resynchronization activity.
6 Select the Enable throttling for resyncing objects traffic check box.
- Move the slider to the right to increase the amount of throughput allowed for resynchronization.
- Move the slider to the left to decrease the amount of throughput allowed for resynchronization.
A general rule is to throttle the throughput by half and allow some time for the cluster to adapt. If
further action is needed, throttle the throughput by half again until the cluster stabilizes.
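The halving rule can be sketched as a simple loop. In this Python sketch, the is_stable callback stands in for your own judgment that latencies have recovered after the cluster has had time to adapt. The function name, the callback, and the max_rounds safety limit are assumptions for illustration, and the throttle value is treated as a plain number without a specific unit.

```python
from typing import Callable, List


def throttle_until_stable(limit: float,
                          is_stable: Callable[[float], bool],
                          max_rounds: int = 5) -> List[float]:
    """Halve the resync throughput limit until the cluster is reported stable.

    Returns the list of limits that were applied, in order.
    """
    applied = []
    for _ in range(max_rounds):
        limit = limit / 2          # throttle the throughput by half
        applied.append(limit)
        if is_stable(limit):       # checked after the cluster has had time to adapt
            break
    return applied


if __name__ == "__main__":
    # Hypothetical cluster that stabilizes once the limit is 100 or lower.
    print(throttle_until_stable(400, lambda current: current <= 100))
```

Because throttling lengthens resynchronization, the loop stops at the first limit that is reported stable instead of continuing to the minimum.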
Cluster rebalancing evenly distributes resources across the cluster to maintain consistent performance
and availability.
The following operations can cause disk capacity to reach 80% and initiate cluster rebalancing:
- vSAN hosts are placed in maintenance mode with the Evacuate all data option.
- vSAN hosts are placed in maintenance mode with Ensure data accessibility when objects assigned
  PFTT=0 reside on the host.
Note To provide enough space for maintenance and reprotection, and to minimize automatic
rebalancing events in the vSAN cluster, consider keeping 30 percent of capacity available at all times.
Run the following RVC commands to monitor the rebalance operation in the cluster:
- vsan.check_limits. Verifies whether the disk space use is balanced in the cluster.
- vsan.whatif_host_failures. Analyzes the current capacity use per host, interprets whether a
  single host failure can force the cluster to run out of space for reprotection, and analyzes how a host
  failure might impact cluster capacity, cache reservation, and cluster components.
The physical capacity use shown as the command output is the average use of all devices in the
vSAN cluster.
For information about the RVC command options, see the RVC Command Reference Guide.
Your vSAN cluster can become unbalanced due to uneven I/O patterns to devices, or when you add hosts
or capacity devices. If the cluster becomes unbalanced, vSAN automatically rebalances the disks. This
operation moves components from over-utilized disks to under-utilized disks.
You can enable or disable automatic rebalance, and configure the variance threshold for triggering an
automatic rebalance. If any two disks in the cluster have a capacity variance that meets the rebalancing
threshold, vSAN begins rebalancing the cluster.
Disk rebalancing can impact the I/O performance of your vSAN cluster. To avoid this performance impact,
you can turn off automatic rebalance when peak performance is required.
Procedure
You can change the variance threshold to any percentage from 20 to 75.
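The rebalancing trigger described above can be expressed as a comparison between the most-used and least-used disks. The following Python sketch assumes that "capacity variance" means the difference in used-capacity percentage between two disks. The function name needs_rebalance and the sample values are hypothetical.

```python
from typing import Sequence


def needs_rebalance(disk_used_pct: Sequence[float], threshold_pct: float) -> bool:
    """Return True if any two disks differ in usage by at least the threshold.

    threshold_pct must be in the configurable range of 20 to 75.
    """
    if not 20 <= threshold_pct <= 75:
        raise ValueError("variance threshold must be between 20 and 75 percent")
    if len(disk_used_pct) < 2:
        return False
    # The largest pairwise difference is always max minus min.
    return max(disk_used_pct) - min(disk_used_pct) >= threshold_pct


if __name__ == "__main__":
    usage = [80.0, 45.0, 60.0]  # hypothetical used capacity per disk, in percent
    print(needs_rebalance(usage, threshold_pct=30))
```

Only the extremes need to be compared, because if any pair meets the threshold, the most-used and least-used disks do as well.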
What to do next
You can use the vSAN health service to check the disk balance. Expand the Cluster category, and select
vSAN Disk Balance.
The default alarms are automatically triggered when the events corresponding to the alarms are activated
or if one or all the conditions specified in the alarms are met. You cannot edit the conditions or delete the
default alarms. To configure alarms that are specific to your requirements, create custom alarms for
vSAN. See Creating a vCenter Server Alarm for a vSAN Event.
For information about monitoring alarms, events, and editing existing alarm settings, see the vSphere
Monitoring and Performance documentation.
Procedure
3 In the search box, type vSAN as the search term to display the alarms that are specific to vSAN.
Type vSAN Health Service Alarm to search for vSAN health service alarms.
4 From the list of alarms, click each alarm to view the alarm definition.
- esx.problem.vob.vsan.lsom.diskerror
- esx.problem.vob.vsan.pdl.offline
To review the list of VOB IDs for vSAN, open the vobd.log file located on your ESXi host in
the /var/log directory. The log file contains the following VOB IDs that you can use for creating vSAN
alarms.
esx.problem.vsan.net.redundancy.lost The vSAN network configuration does not have the required redundancy.
esx.problem.vsan.no.network.connectivity vSAN does not have existing networking configuration, which is in use.
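To find out which vSAN VOB IDs actually occur in a log before you create alarms for them, you can scan the log text for identifiers of the form shown above. The following Python sketch counts such identifiers. The regular expression, the function name count_vsan_vob_ids, and the sample log lines are assumptions for illustration. The sample lines do not reproduce the real vobd.log layout.

```python
import re
from collections import Counter
from typing import Iterable

# Matches dotted identifiers such as esx.problem.vsan.net.redundancy.lost
# or esx.problem.vob.vsan.lsom.diskerror (assumed pattern).
VSAN_VOB_ID = re.compile(r"esx\.(?:\w+\.)*vsan(?:\.\w+)+")


def count_vsan_vob_ids(lines: Iterable[str]) -> Counter:
    """Count each vSAN VOB ID found in the given log lines."""
    counts = Counter()
    for line in lines:
        counts.update(VSAN_VOB_ID.findall(line))
    return counts


# Hypothetical log lines; only the VOB IDs come from this guide.
SAMPLE_LINES = [
    "10:15:00 [esx.problem.vsan.net.redundancy.lost] redundancy lost",
    "10:16:00 [esx.problem.vob.vsan.lsom.diskerror] device error",
    "10:17:00 [esx.problem.vsan.net.redundancy.lost] redundancy lost",
    "10:18:00 unrelated message",
]

if __name__ == "__main__":
    for vob_id, count in count_vsan_vob_ids(SAMPLE_LINES).most_common():
        print(count, vob_id)
```

Because the function accepts any iterable of lines, an open file handle for a copy of vobd.log can be passed in place of SAMPLE_LINES.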
Prerequisites
You must have the required privilege level of Alarms.Create Alarm or Alarms.Modify Alarm.
Procedure
3 In the Name and Targets page, enter a name and description for the new alarm.
4 From the Target type drop-down menu, select the type of inventory object that you want this alarm to
monitor and click Next.
Depending on the type of target that you choose to monitor, the summary that follows Targets
changes.
5 In the Alarm Rule page, select a trigger from the drop-down menu.
The combined event triggers are displayed. You can set the rule for a single event only. You must
create multiple rules for multiple events.
b Select an option from the drop-down menu to set the threshold for triggering an alarm.
c Select the severity of the alarm from the drop-down menu. You can set the condition to either Show
as Warning or Show as Critical, but not both. You must create a separate alarm definition for
warning and critical status.
7 Select Send email notifications to send email notifications when alarms are triggered.
8 In the Email to text box, enter recipient addresses. Use commas to separate multiple addresses.
9 Select Send SNMP traps to send traps when alarms are triggered on a vCenter Server instance.
11 In the Run this script text box, enter the following script or command:
EXE executable files
    Full pathname of the command. For example, to run the cmd.exe command in the
    C:\tools directory, type:
    c:\tools\cmd.exe
BAT batch file
    Full pathname of the command as an argument to the c:\windows\system32\cmd.exe
    command. For example, to run the cmd.bat command in the C:\tools directory, type:
    c:\windows\system32\cmd.exe /c c:\tools\cmd.bat
12 Select an advanced action from the drop-down menu. You can define the advanced actions for virtual
machines and hosts. You can add multiple advanced actions for an alarm.
14 Select Reset the alarm to green and click Next to review the alarm definition.
15 Select Enable this alarm to enable the alarm and click Create.
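The rule for the Run this script text box in the procedure above (an executable is entered by its full pathname, and a batch file is passed as an argument to cmd.exe with /c) can be sketched as a small helper. The function name alarm_script_command is hypothetical, and the sketch only builds the string. It does not validate that the file exists.

```python
import ntpath

# Command interpreter path used in the batch file example above.
CMD_EXE = r"c:\windows\system32\cmd.exe"


def alarm_script_command(script_path: str) -> str:
    """Build the text to enter in the Run this script box for a Windows path."""
    extension = ntpath.splitext(script_path)[1].lower()
    if extension == ".exe":
        # Executables are entered by their full pathname.
        return script_path
    if extension == ".bat":
        # Batch files are passed as an argument to cmd.exe.
        return f"{CMD_EXE} /c {script_path}"
    raise ValueError(f"unsupported script type: {extension or 'none'}")


if __name__ == "__main__":
    print(alarm_script_command(r"c:\tools\cmd.exe"))
    print(alarm_script_command(r"c:\tools\cmd.bat"))
```

The ntpath module is used so that Windows-style paths are parsed the same way on any platform.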
Results
3 Monitoring vSAN Health
You can check the overall health of the vSAN cluster, including hardware compatibility, networking
configuration and operations, advanced vSAN configuration options, storage device health and virtual
machine object health.
- Proactive Tests
You can use the vSAN health checks to monitor the status of cluster components, diagnose issues, and
troubleshoot problems. The health checks cover hardware compatibility, network configuration and
operation, advanced vSAN configuration options, storage device health, and virtual machine objects.
The vSAN health checks are divided into categories. Each category contains individual health checks.
Hardware Compatibility
    Monitor the cluster components to ensure that they are using supported hardware,
    software, and drivers.
Physical disk
    Monitor the health of physical devices in the vSAN cluster.
Online health
    Monitor vSAN cluster health and send the data to VMware’s analytics backend system for
    advanced analysis. You must participate in the Customer Experience Improvement
    Program to use online health checks.
vSAN Build Recommendation
    Monitor vSAN build recommendations for vSphere Lifecycle Manager.
vSAN iSCSI target service
    Monitor the iSCSI target service, including the network configuration and runtime
    status.
Stretched cluster
    Monitor the health of a stretched cluster, if applicable. For more information on
    "Witness appliance upgrade to vSphere 7.0 or higher with caution", see KB 76948.
Hyperconverged cluster configuration compliance
    Monitor the status of hosts and settings configured through the Quickstart workflow.
vSAN periodically retests each health check and updates the results. To run the health checks and
update the results immediately, click the Retest button.
If you participate in the Customer Experience Improvement Program, you can run health checks and send
the data to VMware for advanced analysis. Click the Retest with Online health button.
For more information about vSAN health checks, see VMware Virtual SAN Health Check Plugin Guide.
- The Hosts tab displays the hosts participating in the vSAN cluster.
Prerequisites
Procedure
3 Under vSAN, select Skyline Health to review the vSAN health check categories.
- If the Test Result column displays a Warning (yellow) or Failed (red) icon, expand the category to
  review the results of individual health checks.
- In the Info section, you can click the Ask VMware button to open a knowledge base article that
  describes the health check and provides information about how to resolve the issue.
- You can click Silence Alert on a health check, so it does not display any warnings or failures.
Procedure
The browser redirects to the login page for the host client.
2 Enter the username and password for the host, and click Login.
4 In the main page, click the vSAN datastore to display the Monitor link in the navigator.
b Click the Hosts tab to display the hosts participating in the vSAN cluster.
6 (Optional) On the vSAN tab, click Edit Settings to correct configuration issues at the host level.
Select the values that match the configuration of your vSAN cluster, and click Save.
Proactive Tests
You can initiate a health test on your vSAN cluster to verify that the cluster components are working as
expected.
Run the VM creation test to verify the vSAN cluster health. Running the test creates a virtual machine on
each host in the cluster. The test creates a VM and deletes it. If the VM creation and deletion tasks are
successful, assume that the cluster components are working as expected and the cluster is functional.
Run the Network performance test to detect and diagnose connectivity issues, and to make sure the
network bandwidth between the hosts supports the requirements of vSAN. The test is performed between
the hosts in the cluster. It verifies the network bandwidth between hosts, and reports a warning if the
bandwidth is less than 850 Mbps.
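If you record bandwidth measurements yourself, the same warning rule can be applied to them. The following Python sketch flags host pairs whose measured bandwidth is below the 850 Mbps threshold stated above. The function name slow_links, the host names, and the measurements are hypothetical, and the sketch does not run the proactive test.

```python
from typing import Dict, List, Tuple

# Warning threshold stated for the Network performance test.
WARNING_THRESHOLD_MBPS = 850


def slow_links(measured_mbps: Dict[Tuple[str, str], float]) -> List[Tuple[str, str]]:
    """Return the host pairs whose bandwidth is below the warning threshold."""
    return sorted(pair for pair, mbps in measured_mbps.items()
                  if mbps < WARNING_THRESHOLD_MBPS)


if __name__ == "__main__":
    # Hypothetical measurements between hosts, in Mbps.
    results = {
        ("esx-01", "esx-02"): 9400.0,
        ("esx-01", "esx-03"): 620.0,
        ("esx-02", "esx-03"): 850.0,
    }
    for source, target in slow_links(results):
        print(f"warning: {source} -> {target} is below {WARNING_THRESHOLD_MBPS} Mbps")
```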
To access a proactive test, select your vSAN cluster in the vSphere Client, and click the Monitor tab. Click
vSAN > Proactive Tests.
4 Monitoring vSAN Performance
You can monitor the performance of your vSAN cluster. Performance charts are available for clusters,
hosts, physical disks, virtual machines, and virtual disks.
The performance service collects and analyzes performance statistics and displays the data in a
graphical format. You can use the performance charts to manage your workload and determine the root
cause of problems.
When the vSAN performance service is turned on, the cluster summary displays an overview of vSAN
performance statistics, including IOPS, throughput, and latency. You can view detailed performance
statistics for the cluster, and for each host, disk group, and disk in the vSAN cluster. You also can view
performance charts for virtual machines and virtual disks.
Note When you create a vSAN cluster in the vSphere Client, the Performance Service is disabled. You
can enable and configure the Performance Service.
To support the Performance Service, vSAN uses a Stats database object to collect statistical data. The
Stats database is a namespace object in the cluster's vSAN datastore.
Prerequisites
- All hosts in the vSAN cluster must be running ESXi 6.5 or later.
- Before you configure the vSAN Performance Service, make sure that the cluster is properly
  configured and has no unresolved health problems.
Procedure
7 (Optional) Click to enable the verbose mode. This check box appears only after enabling vSAN
Performance Service. When enabled, vSAN collects and saves the additional performance metrics to
a Stats DB object. If you enable the verbose mode for more than 5 days, a warning message appears
indicating that the verbose mode can be resource-intensive. Ensure that you do not enable it for a
longer duration.
8 (Optional) Click to enable the network diagnostic mode. This check box appears only after enabling
vSAN Performance Service. When enabled, vSAN collects and saves the additional network
performance metrics to a RAM disk stats object. If you enable the network diagnostic mode for more
than a day, a warning message appears indicating that the network diagnostic mode can be resource-
intensive. Ensure that you do not enable it for a longer duration.
9 Click Apply.
You can manually save a time range with a customized name. When you run a storage performance test,
the selected time range is saved automatically. You can save a time range for any of the performance
views.
Prerequisites
- All hosts in the vSAN cluster must be running ESXi 6.5 or later.
Procedure
3 Select any tab, such as vSAN - Backend. In the time range drop-down, select Save.
When the performance service is turned on, the cluster summary displays an overview of vSAN
performance statistics, including vSAN IOPS, throughput, and latency. At the cluster level, you can view
detailed statistical charts for virtual machine consumption and the vSAN back end.
Note To view iSCSI performance charts, all hosts in the vSAN cluster must be running ESXi 6.5 or later.
Prerequisites
The vSAN performance service must be turned on before you can view performance charts.
Procedure
4 Select VM. Select a time range for your query. vSAN displays performance charts for clients running
on the cluster, including IOPS, throughput, latency, congestions, and outstanding I/Os. The statistics
on these charts are aggregated from the hosts within the cluster.
5 Select Backend. Select a time range for your query. vSAN displays performance charts for the
cluster back-end operations, including IOPS, throughput, latency, congestions, and outstanding I/Os.
The statistics on these charts are aggregated from the hosts within the cluster.
6 Select iSCSI and select an iSCSI target or LUN. Select a time range for your query. vSAN displays
performance charts for iSCSI targets or LUNs, including IOPS, bandwidth, latency, and outstanding
I/O.
When the performance service is turned on, the host summary displays performance statistics for each
host and its attached disks. At the host level, you can view detailed statistical charts for virtual machine
consumption and the vSAN back end, including IOPS, throughput, latency, and congestion. Additional
charts are available to view the local client cache read IOPS and hit rate. At the disk group level, you can
view statistics for the disk group. At the disk level, you can view statistics for an individual storage device.
Prerequisites
The vSAN performance service must be turned on before you can view performance charts.
To view the following performance charts, hosts in the vSAN cluster must be running ESXi 6.5 or later:
Physical Adapters, VMkernel Adapters, VMkernel Adapters Aggregation, iSCSI, vSAN - Backend resync
I/Os, resync IOPS, resync throughput, Disk Group resync latency.
Procedure
4 Select VM. Select a time range for your query. vSAN displays performance charts for clients running
on the host, including IOPS, throughput, latency, congestions, and outstanding I/Os.
5 Select Backend. Select a time range for your query. vSAN displays performance charts for the host
back-end operations, including IOPS, throughput, latency, congestions, outstanding I/Os, and resync
I/Os.
6 Select Disks, and select a disk group. Select a time range for your query. vSAN displays
performance charts for the disk group, including front end (Guest) IOPS, throughput, and latency, as
well as overhead IOPS and latency. It also displays the read-cache hit rate, evictions, write-buffer
free percentage, capacity and usage, cache disk destage rate, congestions, outstanding I/O,
outstanding I/O size, delayed I/O percentage, delayed I/O average latency, internal queue IOPS,
internal queue throughput, resync IOPS, resync throughput, and resync latency.
7 Select Physical Adapters, and select a NIC. Select a time range for your query. vSAN displays
performance charts for the physical NIC (pNIC), including throughput, packets per second, and
packet loss rate.
8 Select Host Network, and select a VMkernel adapter, such as vmk1. Select a time range for your
query. vSAN displays performance charts for all network I/Os processed in the network adapters used
by vSAN, including throughput, packets per second, and packet loss rate.
9 Select iSCSI. Select a time range for your query. vSAN displays performance charts for all the iSCSI
services on the host, including IOPS, bandwidth, latency, and outstanding I/Os.
When the performance service is turned on, you can view detailed statistical charts for virtual machine
performance and virtual disk performance. VM performance statistics cannot be collected during
migration between hosts, so you might notice a gap of several minutes in the VM performance chart.
Note The performance service supports only virtual SCSI controllers for virtual disks. Virtual disks using
other controllers, such as IDE, are not supported.
Prerequisites
The vSAN performance service must be turned on before you can view performance charts.
Procedure
4 Select VM. Select a time range for your query. vSAN displays performance charts for the VM,
including IOPS, throughput, and latency.
5 Select Virtual Disk. Select a time range for your query. vSAN displays performance charts for the
virtual disks, including IOPS, delayed normalized IOPS, virtual SCSI IOPS, virtual SCSI throughput,
and virtual SCSI latency.
The vSAN performance diagnostics tool analyzes previously run benchmarks gathered from the vSAN
performance service. It can detect issues, suggest remediation steps, and provide supporting
performance graphs for further insight.
The vSAN performance service provides the data used to analyze vSAN performance diagnostics. vSAN
uses CEIP to send data to VMware for analysis.
Note Do not use vSAN performance diagnostics for general evaluation of performance on a production
vSAN cluster.
Prerequisites
- vCenter Server requires Internet access to download ISO images and patches.
Procedure
You can select a goal based on the performance improvement that you want to achieve, such as
maximum IOPS, maximum throughput, or minimum latency.
The default time range is the most recent hour. You can increase the range to include the last 24
hours, or define a custom time range within the last 90 days. If you used the HCIbench tool to run
performance benchmark tests on the vSAN cluster, the time ranges of those tests appear in the drop-
down menu.
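The time range rules above (a default of the most recent hour, and custom ranges limited to the last 90 days) can be checked before you submit a query. The following Python sketch is illustrative only. The function names default_range and validate_custom_range are hypothetical, and the sketch does not call any vSAN interface.

```python
from datetime import datetime, timedelta
from typing import Tuple

MAX_LOOKBACK = timedelta(days=90)      # custom ranges must fall within the last 90 days
DEFAULT_WINDOW = timedelta(hours=1)    # the default range is the most recent hour


def default_range(now: datetime) -> Tuple[datetime, datetime]:
    """Return the default query range: the most recent hour."""
    return now - DEFAULT_WINDOW, now


def validate_custom_range(start: datetime, end: datetime,
                          now: datetime) -> Tuple[datetime, datetime]:
    """Return (start, end) if the range is usable, otherwise raise ValueError."""
    if start >= end:
        raise ValueError("start must be earlier than end")
    if end > now:
        raise ValueError("range cannot end in the future")
    if start < now - MAX_LOOKBACK:
        raise ValueError("range must fall within the last 90 days")
    return start, end


if __name__ == "__main__":
    now = datetime(2020, 4, 2, 12, 0)  # hypothetical current time
    print(default_range(now))
    print(validate_custom_range(now - timedelta(days=7), now - timedelta(days=6), now))
```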
Results
When you click Show Results, vSAN transmits performance data to the vSphere backend analytics
server. After analyzing the data, the vSAN performance diagnostics tool displays a list of issues that might
have affected the benchmark performance for the chosen goal.
You can expand each issue to view more details, such as a list of affected items.
You also can click See More or Ask VMware to display a Knowledge Base article that describes
recommendations to address the issue and achieve your performance goal.
5 Handling Failures and Troubleshooting vSAN
If you encounter problems when using vSAN, you can use troubleshooting topics. The topics help you
understand the problem and offer you a workaround, when it is available.
VMware Technical Support routinely requests diagnostic information from your vSAN cluster when a
support request is addressed. The support bundle is an archive that contains diagnostic information
related to the environment, such as product specific logs, configuration files, and so on.
vSAN performs an automated upload of the support bundle, and does not allow you to review, obfuscate,
or otherwise edit the contents of your support data prior to it being sent to VMware. vSAN connects to the
FTP port 21 or HTTPS port 443 of the target server with the domain name vmware.com, to automatically
upload the support bundle.
Note Data collected in the support bundle may be considered sensitive. If your support data contains
regulated data, such as personal, health care, or financial data, you may want to avoid uploading the
support bundle.
Procedure
4 Click Upload.
Command Description
esxcli vsan network list Verify which VMkernel adapters are used for vSAN communication.
The esxcli vsan debug commands can help you debug and troubleshoot the vSAN cluster, especially
when vCenter Server is not available.
Debug commands:
Command Description
hostFailuresToTolerate: 1
Configuration:
RAID_1
Component: 47cbdc58-6928-333f-0c51-020010d5dfa3
Component State: ACTIVE, Address Space(B): 273804165120 (255.00GB),
Disk UUID: 52e95956-42cf-4d30-9cbe-763c616614d5, Disk Name: mpx.vmhba1..
Votes: 1, Capacity Used(B): 373293056 (0.35GB),
Physical Capacity Used(B): 369098752 (0.34GB), Host Name: sc-rdops...
Component: 47cbdc58-eebf-363f-cf2b-020010d5dfa3
Component State: ACTIVE, Address Space(B): 273804165120 (255.00GB),
Disk UUID: 52d11301-1720-9901-eb0a-157d68b3e4fc, Disk Name: mpx.vmh...
Votes: 1, Capacity Used(B): 373293056 (0.35GB),
Physical Capacity Used(B): 369098752 (0.34GB), Host Name: sc-rdops-vm..
Witness: 47cbdc58-21d2-383f-e45a-020010d5dfa3
Component State: ACTIVE, Address Space(B): 0 (0.00GB),
Disk UUID: 52bfd405-160b-96ba-cf42-09da8c2d7023, Disk Name: mpx.vmh...
Votes: 1, Capacity Used(B): 12582912 (0.01GB),
Physical Capacity Used(B): 4194304 (0.00GB), Host Name: sc-rdops-vm...
Type: vmnamespace
Path: /vmfs/volumes/vsan:52134fafd48ad6d6-bf03cb6af0f21b8d/New Virtual Machine
Group UUID: 00000000-0000-0000-0000-000000000000
Directory Name: New Virtual Machine
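Output in the layout shown above can be summarized with a short script, for example to confirm that every component and witness is ACTIVE and to total the votes. The following Python sketch parses an abbreviated copy of the sample output. The function name parse_components and the regular expressions are assumptions based only on this sample, so other vSAN versions or object types might need different patterns.

```python
import re
from typing import Dict, List

HEADER = re.compile(r"^(Component|Witness): (\S+)$")
STATE = re.compile(r"Component State: (\w+)")
VOTES = re.compile(r"Votes: (\d+), Capacity Used\(B\): (\d+)")

# Abbreviated from the sample output above (disk and host lines omitted).
SAMPLE = """
Component: 47cbdc58-6928-333f-0c51-020010d5dfa3
Component State: ACTIVE, Address Space(B): 273804165120 (255.00GB),
Votes: 1, Capacity Used(B): 373293056 (0.35GB),
Component: 47cbdc58-eebf-363f-cf2b-020010d5dfa3
Component State: ACTIVE, Address Space(B): 273804165120 (255.00GB),
Votes: 1, Capacity Used(B): 373293056 (0.35GB),
Witness: 47cbdc58-21d2-383f-e45a-020010d5dfa3
Component State: ACTIVE, Address Space(B): 0 (0.00GB),
Votes: 1, Capacity Used(B): 12582912 (0.01GB),
"""


def parse_components(text: str) -> List[Dict[str, object]]:
    """Collect kind, UUID, state, votes, and used bytes for each entry."""
    entries: List[Dict[str, object]] = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        header = HEADER.match(line)
        if header:
            entries.append({"kind": header.group(1), "uuid": header.group(2)})
            continue
        if not entries:
            continue  # ignore lines that precede the first component
        state = STATE.search(line)
        if state:
            entries[-1]["state"] = state.group(1)
        votes = VOTES.search(line)
        if votes:
            entries[-1]["votes"] = int(votes.group(1))
            entries[-1]["used_bytes"] = int(votes.group(2))
    return entries


if __name__ == "__main__":
    parsed = parse_components(SAMPLE)
    inactive = [entry["uuid"] for entry in parsed if entry.get("state") != "ACTIVE"]
    print("entries:", len(parsed), "inactive:", inactive)
```

The header pattern requires a colon directly after Component or Witness, so the Component State lines are not mistaken for the start of a new entry.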
To display the different performance views and metrics in vsantop, enter the following commands:
Command Description
^L Redraw screen
q Quit
Problem
An ESXi host that joins a vSAN cluster fails to have vSAN configured.
Cause
If a host does not meet hardware requirements or experiences other problems, vSAN might fail to
configure the host. For example, insufficient memory on the host might prevent vSAN from being
configured.
Solution
3 Resolve the problem that prevents vSAN from being configured on the host.
Problem
When you use force provisioning, you can provision a virtual machine object even when the policy
specified in the virtual machine profile cannot be satisfied with the resources available in the vSAN
cluster. The object is created, but remains in the non-compliant status.
vSAN is expected to bring the object into compliance when storage resources in the cluster become
available, for example, when you add a host. However, the object's status does not change to compliant
immediately after you add resources.
Cause
This occurs because vSAN regulates the pace of the reconfiguration to avoid overloading the system.
The amount of time it takes for compliance to be achieved depends on the number of objects in the
cluster, the I/O load on the cluster and the size of the object in question. In most cases, compliance is
achieved within a reasonable time.
Problem
Error messages indicate that vCenter Server has detected a problem with vSAN configuration.
Solution
Host with the vSAN service enabled is not in the vCenter cluster | Add the host to the vSAN cluster: 1 Right-click the host, and select Move To. 2 Select the vSAN cluster and click OK.
Host is in a vSAN enabled cluster but does not have vSAN service enabled | Verify whether vSAN network is properly configured and enabled on the host. See vSAN Planning and Deployment.
vSAN network is not configured | Configure vSAN network. See vSAN Planning and Deployment.
Host cannot communicate with all other nodes in the vSAN enabled cluster | Might be caused by network isolation. See the vSAN Planning and Deployment documentation.
Found another host participating in the vSAN service which is not a member of this host's vCenter cluster. | Make sure that the vSAN cluster configuration is correct and all vSAN hosts are in the same subnet. See vSAN Planning and Deployment.
vSAN also provides alerts about the type of component failure. See Using the VMkernel Observations for
Creating Alarms and Using the vSAN Default Alarms.
Degraded | A component is in degraded state if vSAN detects a permanent component failure and assumes that the component is not going to recover to working state. | vSAN starts rebuilding the affected components immediately. | Failure of a flash caching device; magnetic or flash capacity device failure; storage controller failure
Absent | A component is in absent state if vSAN detects a temporary component failure where the component might recover and restore its working state. | vSAN starts rebuilding absent components if they are not available within a certain time interval. By default, vSAN starts rebuilding absent components after 60 minutes. | Lost network connectivity; failure of a physical network adapter; ESXi host failure; unplugged flash caching device; unplugged magnetic disk or flash capacity device
If a failure occurs in the cluster, vSAN marks the components for an object as absent or degraded based
on the failure severity.
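The mapping from failure cause to component state can be sketched as a lookup table. This is a minimal illustration, not how vSAN implements it: the cause labels and the helper name `rebuild_delay_minutes` are hypothetical, and only the degraded/absent split and the 60-minute default come from this section.

```python
from enum import Enum


class ComponentState(Enum):
    DEGRADED = "degraded"  # permanent failure: rebuild starts immediately
    ABSENT = "absent"      # temporary failure: rebuild starts after a delay


# Failure causes as listed in this section. The key strings are
# illustrative labels, not identifiers used by vSAN itself.
FAILURE_CAUSES = {
    "flash_cache_device_failed": ComponentState.DEGRADED,
    "capacity_device_failed": ComponentState.DEGRADED,
    "storage_controller_failed": ComponentState.DEGRADED,
    "network_connectivity_lost": ComponentState.ABSENT,
    "physical_nic_failed": ComponentState.ABSENT,
    "esxi_host_failed": ComponentState.ABSENT,
    "flash_cache_device_unplugged": ComponentState.ABSENT,
    "capacity_device_unplugged": ComponentState.ABSENT,
}

# Default delay before absent components are rebuilt.
DEFAULT_ABSENT_REBUILD_DELAY_MIN = 60


def rebuild_delay_minutes(cause):
    """Return how long vSAN waits before rebuilding for a given cause."""
    state = FAILURE_CAUSES[cause]
    if state is ComponentState.DEGRADED:
        return 0
    return DEFAULT_ABSENT_REBUILD_DELAY_MIN
```

For example, `rebuild_delay_minutes("esxi_host_failed")` yields the 60-minute default, while a failed storage controller yields 0.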
Procedure
The home directories and virtual disks of the virtual machines in the cluster appear.
3 Select the check box on one of the virtual objects and click View Placement details to open the
Physical Placement dialog. You can view device information, such as name, identifier or UUID,
number of devices used for each virtual machine, and how they are mirrored across hosts.
If a failure has occurred in the vSAN cluster, the Placement and Availability is equal to Absent or
Degraded.
Compliance Status | The compliance status of a virtual machine object indicates whether it meets the requirements of the assigned VM storage policy.
Operational State | The operational state of an object can be healthy or unhealthy. It indicates the type and number of failures in the cluster. An object is healthy if an intact replica is available and more than 50 percent of the object's votes are still available. An object is unhealthy if an entire replica is not available or less than 50 percent of the object's votes are available. For example, an object might become unhealthy if a network failure occurs in the cluster and a host becomes isolated.
To determine the overall influence of a failure on a virtual machine, examine the compliance status and
the operational state. If the operational state remains healthy although the object is noncompliant, the
virtual machine can continue using the vSAN datastore. If the operational state is unhealthy, the virtual
machine cannot use the datastore.
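The rule above can be sketched in a few lines of code. This is a simplified model under stated assumptions: the `ObjectStatus` fields and both function names are hypothetical, and the vote counts in the usage note are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class ObjectStatus:
    # All field names are illustrative, not vSAN API names.
    intact_replica_available: bool
    votes_available: int
    votes_total: int
    policy_compliant: bool


def is_healthy(obj):
    """Healthy: an intact replica exists and more than 50 percent of votes remain."""
    return obj.intact_replica_available and obj.votes_available * 2 > obj.votes_total


def vm_can_use_datastore(obj):
    """The VM keeps using the datastore while the object is healthy,
    even if the object is noncompliant with its storage policy."""
    return is_healthy(obj)
```

With three votes in total, an object that keeps one intact replica and two votes after a host failure is noncompliant but still usable. If it retains only one vote, it is unhealthy and the virtual machine cannot use the datastore.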
Procedure
The home directories and virtual disks of the virtual machines in the cluster appear.
3 Select an object type in the Affected inventory objects area at the top of the page to display
information about each object, such as health and availability, storage policy, and vSAN UUID.
If the inventory object is Unhealthy, the vSphere Client indicates the reason for the unhealthy state in
brackets.
Procedure
b On the Summary tab, examine the value of the VM Storage Policy Compliance property under
VM Storage Policies.
c Select an object type in the Affected inventory objects area at the top of the page to display
information about each object, such as health and availability, storage policy, and vSAN UUID.
d Select the check box on one of the virtual objects and click View Placement details to open the
Physical Placement dialog. You can view device information, such as name, identifier or UUID,
number of devices used for each virtual machine, and how they are mirrored across hosts.
e On the Physical Placement dialog, select the Group components by host placement check box
to organize the objects by host and by disk.
Changes in the accessibility occur when the cluster experiences more failures than the policy for a virtual
machine object tolerates.
As a result of a failure in the vSAN cluster, a virtual machine object might become inaccessible. An
object is inaccessible if a full replica of the object is not available because the failure affects all replicas, or
when less than 50 percent of the object's votes are available.
According to the type of object that is inaccessible, virtual machines behave in the following ways:
Virtual machine inaccessibility is not a permanent state. After the underlying issue is resolved, and a full
replica and more than 50 percent of the object's votes are restored, the virtual machine automatically
becomes accessible again.
If a disk experiences sustained high latencies or congestion, vSAN considers the device a dying disk
and handles it by evacuating or rebuilding its data. No user action is required unless the cluster lacks
resources or has inaccessible objects.
Behavior of vSAN
vSAN responds to the storage device failure in the following ways.
Parameter | Behavior
Alarms | An alarm is generated from each host whenever an unhealthy device is diagnosed. A warning is issued whenever a disk is suspected of being unhealthy.
Health check | The Disk operation health check issues a warning for the dying disk.
Health status | On the Disk Management page, the health status of the dying disk is listed as Unhealthy. When vSAN completes evacuation of data, the health status is listed as DyingDiskEmpty.
Rebuilding data | vSAN examines whether the hosts and the capacity devices can satisfy the requirements for space and placement rules for the objects on the failed device or disk group. If such a host with capacity is available, vSAN starts the recovery process immediately because the components are marked as degraded. If resources are available, vSAN automatically reprotects the data.
If vSAN detects a disk with a permanent error, it makes a limited number of attempts to revive the disk by
unmounting and mounting it.
Behavior of vSAN
vSAN responds to the capacity device failure in the following ways.
Parameter | Behavior
Primary level of failures to tolerate | If the Primary level of failures to tolerate in the VM storage policy is equal to or greater than 1, the virtual machine objects are still accessible from another ESXi host in the cluster. If resources are available, vSAN starts an automatic reprotection. If the Primary level of failures to tolerate is set to 0, a virtual machine object is inaccessible if one of the object's components resides on the failed capacity device. Restore the virtual machine from a backup.
I/O operations on the capacity device | vSAN stops all running I/O operations for 5-7 seconds until it re-evaluates whether an object is still available without the failed component. If vSAN determines that the object is available, all running I/O operations are resumed.
Rebuilding data | vSAN examines whether the hosts and the capacity devices can satisfy the requirements for space and placement rules for the objects on the failed device or disk group. If such a host with capacity is available, vSAN starts the recovery process immediately because the components are marked as degraded. If resources are available, an automatic reprotect will occur.
Behavior of vSAN
vSAN responds to the failure of a flash caching device in the following way:
Parameter | Behavior
Primary level of failures to tolerate | If the Primary level of failures to tolerate in the VM storage policy is equal to or greater than 1, the virtual machine objects are still accessible from another ESXi host in the cluster. If resources are available, vSAN starts an automatic reprotection. If the Primary level of failures to tolerate is set to 0, a virtual machine object is inaccessible if one of the object's components is on the failed disk group.
I/O operations on the disk group | vSAN stops all running I/O operations for 5-7 seconds until it re-evaluates whether an object is still available without the failed component. If vSAN determines that the object is available, all running I/O operations are resumed.
Rebuilding data | vSAN examines whether the hosts and the capacity devices can satisfy the requirements for space and placement rules for the objects on the failed device or disk group. If such a host with capacity is available, vSAN starts the recovery process immediately because the components are marked as degraded.
Behavior of vSAN
vSAN responds to the host failure in the following way:
Parameter | Behavior
Primary level of failures to tolerate | If the Primary level of failures to tolerate in the VM storage policy is equal to or greater than 1, the virtual machine objects are still accessible from another ESXi host in the cluster. If resources are available, vSAN starts an automatic reprotection. If the Primary level of failures to tolerate is set to 0, a virtual machine object is inaccessible if the object's components reside on the failed host.
I/O operations on the host | vSAN stops all running I/O operations for 5-7 seconds until it re-evaluates whether an object is still available without the failed component. If vSAN determines that the object is available, all running I/O operations are resumed.
Rebuilding data | If the host does not rejoin the cluster within 60 minutes, vSAN examines whether some of the other hosts in the cluster can satisfy the requirements for cache, space and placement rules for the objects on the inaccessible host. If such a host is available, vSAN starts the recovery process. If the host rejoins the cluster after 60 minutes and recovery has started, vSAN evaluates whether to continue the recovery or stop it and resynchronize the original components.
Behavior of vSAN
vSAN responds to a network failure in the following way:
Parameter | Behavior
Primary level of failures to tolerate | If the Primary level of failures to tolerate in the VM storage policy is equal to or greater than 1, the virtual machine objects are still accessible from another ESXi host in the cluster. If resources are available, vSAN starts an automatic reprotection. If the Primary level of failures to tolerate is set to 0, a virtual machine object is inaccessible if the object's components are on the isolated hosts.
I/O operations on the isolated hosts | vSAN stops all running I/O operations for 5-7 seconds until it re-evaluates whether an object is still available without the failed component. If vSAN determines that the object is available, all running I/O operations are resumed.
Rebuilding data | If the host rejoins the cluster within 60 minutes, vSAN synchronizes the components on the host. If the host does not rejoin the cluster within 60 minutes, vSAN examines whether some of the other hosts in the cluster can satisfy the requirements for cache, space and placement rules for the objects on the inaccessible host. If such a host is available, vSAN starts the recovery process. If the host rejoins the cluster after 60 minutes and recovery has started, vSAN evaluates whether to continue the recovery or stop it and resynchronize the original components.
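The rebuild timing for an absent or isolated host can be read as a small decision function. The sketch below is illustrative only: `Action` and `next_action` are hypothetical names, the treatment of the exact 60-minute boundary is an assumption, and so is the behavior when a host returns after the delay but no recovery could start.

```python
from enum import Enum

# Default delay before vSAN rebuilds absent components, per this section.
REBUILD_DELAY_MINUTES = 60


class Action(Enum):
    WAIT = "wait for the host to return"
    RESYNC = "synchronize the components on the returning host"
    REBUILD = "start recovery on other hosts"
    EVALUATE = "continue recovery or stop it and resynchronize"
    NO_CAPACITY = "no host satisfies cache, space and placement rules"


def next_action(minutes_absent, host_rejoined, other_hosts_can_rebuild):
    """Map the state of an absent host to the documented behavior."""
    if minutes_absent < REBUILD_DELAY_MINUTES:
        # Within the delay window nothing is rebuilt yet.
        return Action.RESYNC if host_rejoined else Action.WAIT
    if not other_hosts_can_rebuild:
        # Assumption: with no recovery in progress, a returning host
        # is simply resynchronized.
        return Action.RESYNC if host_rejoined else Action.NO_CAPACITY
    # Delay expired and recovery could start on other hosts.
    return Action.EVALUATE if host_rejoined else Action.REBUILD
```

For instance, a host that has been gone for 90 minutes while other hosts have capacity maps to `Action.REBUILD`. If that host then returns, the result is `Action.EVALUATE`.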
Symptoms
If a host contains a single storage controller and multiple disk groups, and all devices in all disk groups
have failed, then you might assume that a failure in the common storage controller is the root cause.
Examine the VMkernel log messages to determine the nature of the fault.
If a host contains multiple controllers, and only the devices that are attached to an individual controller are
inaccessible, then you might assume that this controller has failed.
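The symptom check above amounts to grouping devices by controller and looking for a controller whose devices are all inaccessible. The following is a rough sketch under stated assumptions: the function name and the adapter names in the usage note are hypothetical, and the result is only a hint to be confirmed against the VMkernel log messages.

```python
def suspect_controllers(accessible_by_controller):
    """Return controllers whose attached devices are all inaccessible.

    accessible_by_controller maps a controller name to a list of
    booleans, one per attached device (True means accessible).
    """
    return sorted(
        name
        for name, accessible in accessible_by_controller.items()
        if accessible and not any(accessible)
    )
```

Calling `suspect_controllers({"vmhba0": [False, False], "vmhba1": [True, True]})` points at `vmhba0` only. A controller with a mix of working and failed devices is not flagged, because individual device failures are the more likely cause in that case.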
Behavior of vSAN
vSAN responds to a storage controller failure in the following way:
Parameter | Behavior
Primary level of failures to tolerate | If the Primary level of failures to tolerate in the VM storage policy is equal to or greater than 1, the virtual machine objects are still accessible from another ESXi host in the cluster. If resources are available, vSAN starts an automatic reprotection. If the Primary level of failures to tolerate is set to 0, a virtual machine object is inaccessible if the object's components reside on the disk groups that are connected to the storage controller.
Rebuilding data | vSAN examines whether the hosts and the capacity devices can satisfy the requirements for space and placement rules for the objects on the failed device or disk group. If such a host with capacity is available, vSAN starts the recovery process immediately because the components are marked as degraded.
Network Connection Lost Between Active Sites | If the network connection fails between the two active sites, the witness host and the preferred site continue to service storage operations, and keep data available. When the network connection returns, the two active sites are resynchronized.
Secondary Site Fails or Loses Network Connection | If the secondary site goes offline or becomes isolated from the preferred site and the witness host, the witness host and the preferred site continue to service storage operations, and keep data available. When the secondary site returns to the cluster, the two active sites are resynchronized.
Preferred Site Fails or Loses Network Connection | If the preferred site goes offline or becomes isolated from the secondary site and the witness host, the secondary site continues storage operations if it remains connected to the witness host. When the preferred site returns to the cluster, the two active sites are resynchronized.
Witness Host Fails or Loses Network Connection | If the witness host goes offline or becomes isolated from the preferred site or the secondary site, objects become noncompliant but data remains available. VMs that are currently running are not affected.
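The four stretched cluster scenarios can be summarized by looking at which of the three links (preferred to secondary, preferred to witness, secondary to witness) are still up. This is a simplified model, not vSAN's actual election logic: the function and parameter names are hypothetical, and the empty result for a fully partitioned cluster is an assumption rather than something this section states.

```python
def active_data_sites(pref_sec_link, pref_wit_link, sec_wit_link):
    """Return the data sites that keep servicing storage operations.

    Each argument is True when the link between the two named
    fault domains is up. An offline site has both of its links down.
    """
    if pref_sec_link:
        # Both data sites see each other; losing the witness leaves
        # objects noncompliant but data stays available.
        return {"preferred", "secondary"}
    if pref_wit_link:
        # Inter-site link lost or secondary site gone: the preferred
        # site continues together with the witness host.
        return {"preferred"}
    if sec_wit_link:
        # Preferred site gone: the secondary site continues if it
        # still reaches the witness host.
        return {"secondary"}
    # Assumption: no site reaches any other fault domain.
    return set()
```

When only the inter-site link fails, `active_data_sites(False, True, True)` returns the preferred site alone, which matches the first scenario.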
Troubleshooting vSAN
Examine the performance and accessibility of virtual machines to diagnose problems in the vSAN cluster.
Using hardware components, drivers, and firmware that are not compatible with vSAN might cause
problems in the operation of the vSAN cluster and the virtual machines running on it.
The hardware compatibility health checks verify your hardware against the VMware Compatibility Guide.
For more information about using the vSAN health service, see Chapter 3 Monitoring vSAN Health.
Regularly monitor the following performance indicators to identify faults in vSAN storage, for example, by
using the performance charts in the vSphere Client:
- Virtual Machine. I/O operations, memory and CPU usage, network throughput and bandwidth.
You can use the vSAN performance service to access detailed performance charts. For information about
using the performance service, see Chapter 4 Monitoring vSAN Performance. For more information about
using performance data in a vSAN cluster, see the vSAN Troubleshooting Reference Manual.
Problem
After you enable vSAN on a cluster, on the Summary tab for the cluster the Network Status for vSAN
appears as Misconfiguration detected.
Cause
One or more members of the cluster cannot communicate because of either of the following reasons:
- A host in the cluster does not have a VMkernel adapter for vSAN.
Solution
Join the members of the cluster to the same network. See vSAN Planning and Deployment.
Problem
A virtual machine on a vSAN datastore is in one of the following states that indicate a fault in the vSAN
cluster.
- The virtual machine is noncompliant and the compliance status of some of its objects is noncompliant.
  See Examine the Compliance of a Virtual Machine in vSAN.
- The virtual machine object is inaccessible or orphaned. See Examine the Failure State of a
  Component.
If an object replica is still available on another host, vSAN forwards the I/O operations of the virtual
machine to the replica.
Cause
If the object of the virtual machine can no longer satisfy the requirement of the assigned VM storage
policy, vSAN considers it noncompliant. For example, a host might temporarily lose connectivity. See
Object States That Indicate Problems in vSAN.
If vSAN cannot locate a full replica or more than 50 percent of the votes for the object, the virtual machine
becomes inaccessible. If vSAN detects that the .vmx file is not accessible because the VM Home
Namespace is corrupted, the virtual machine becomes orphaned. See Accessibility of Virtual Machines
Upon a Failure in vSAN.
Solution
If the cluster contains enough resources, vSAN automatically recovers the corrupted objects if the failure
is permanent.
If the cluster does not have enough resources to rebuild the corrupted objects, extend the space in the
cluster. See Administering VMware vSAN.
Problem
The operation for creating a virtual machine fails with an error status: Cannot complete file creation
operation.
Cause
The deployment of a virtual machine on vSAN might fail for several reasons.
- vSAN cannot allocate space for the virtual machine storage policies and virtual machine objects.
  Such a failure might occur if the datastore does not have enough usable capacity, for example, if a
  physical disk is temporarily disconnected from the host.
- The virtual machine has very large virtual disks and the hosts in the cluster cannot provide storage for
  them based on the placement rules in the VM storage policy.
  For example, if the Primary level of failures to tolerate in the VM storage policy is set to 1, vSAN
  must store two replicas of a virtual disk in the cluster, each replica on a different host. The datastore
  might have this space after aggregating the free space on all hosts in the cluster. However, the cluster
  might not contain two hosts that each provide enough space to store a separate replica of the
  virtual disk.
  vSAN does not move components between hosts or disk groups to free space for a new replica,
  even though the cluster might contain enough space for provisioning the new virtual machine.
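The gap between aggregate free space and per-host free space is easier to see with numbers. The following sketch is a deliberate simplification: it treats each replica as one piece that must fit on a single host, ignores witness components and per-device limits, and uses a hypothetical function name and made-up capacities.

```python
def can_place_mirrored_disk(free_gb_by_host, disk_gb, failures_to_tolerate=1):
    """Check whether enough distinct hosts can each hold a full replica.

    With mirroring, tolerating N failures requires N + 1 replicas,
    each on a different host.
    """
    replicas_needed = failures_to_tolerate + 1
    hosts_with_room = [
        host for host, free_gb in free_gb_by_host.items() if free_gb >= disk_gb
    ]
    return len(hosts_with_room) >= replicas_needed
```

Consider four hosts with 300, 200, 200, and 200 GB free and a 250 GB virtual disk. The cluster has 900 GB free in total, more than the 500 GB that two replicas need, but only one host can hold a full replica, so the check fails.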
Solution
c Examine the capacity and health status of the devices on the hosts in the cluster.
Problem
After you add a host to a stretched cluster in which some hosts are disconnected, on the Summary tab for
the cluster the Configuration Status for vSAN appears as Unicast agent unset on host.
Cause
When a new host joins a stretched cluster, vSAN must update the configuration on all hosts in the cluster.
If one or more hosts are disconnected from the vCenter Server, the update fails. The new host
successfully joins the cluster, but its configuration is incomplete.
Solution
Verify that all hosts are connected to vCenter Server, and click the link provided in the Configuration
Status message to update the configuration of the new host.
If you cannot rejoin the disconnected host, remove the disconnected host from the cluster, and click the
link provided in the Configuration Status message to update the configuration of the new host.
Problem
After you use the RVC tool to add a host to a stretched cluster, on the Summary tab for the cluster the
Configuration Status for vSAN appears as Unicast agent unset on host.
Cause
When a new host joins a stretched cluster, vSAN must update the configuration on all hosts in the cluster.
If you use the RVC tool to add the host, the update does not occur. The new host successfully joins the
cluster, but its configuration is incomplete.
Solution
Verify that all hosts are connected to vCenter Server, and click the link provided in the Configuration
Status message to update the configuration of the new host.
Problem
When you add or remove a witness host in a stretched cluster in which some hosts are disconnected, the
operation fails with an error status: The operation is not allowed in the current state. Not
all hosts in the cluster are connected to Virtual Center.
Cause
When the witness host joins or leaves a stretched cluster, vSAN must update the configuration on all
hosts in the cluster. If one or more hosts are disconnected from the vCenter Server, the witness host
cannot be added or removed.
Solution
Verify that all hosts are connected to vCenter Server, and retry the operation. If you cannot rejoin the
disconnected host, remove the disconnected host from the cluster, and then you can add or remove the
witness host.
Problem
vSAN locks a host's disk groups when the host reboots and it cannot get the KEK from the KMS. The
disks behave as if they are unmounted. Objects on the disks become inaccessible.
You can view a disk group's health status on the Disk Management page in the vSphere Client. An
Encryption health check warning notifies you that a disk is locked.
Cause
Hosts in an encrypted vSAN cluster do not store the KEK on disk. If a host reboots and cannot get the
KEK from the KMS, vSAN locks the host's disk groups.
Solution
To exit the locked state, you must restore communication with the KMS and reestablish the trust
relationship.
In vSAN, you should replace hardware devices when you encounter failures or if you must upgrade your
cluster.
Caution If you decommission the flash caching device without removing it from vSAN first, vSAN uses
a smaller amount of cache than expected. As a result, the cluster performance becomes degraded.
When you replace a flash caching device, the virtual machines on the disk group become inaccessible
and the components on the group are marked as degraded. See A Flash Caching Device Is Not
Accessible in a vSAN Cluster.
Prerequisites
- Verify that the storage controllers on the hosts are configured in passthrough mode and support the
  hot-plug feature.
  If the storage controllers are configured in RAID 0 mode, see the vendor documentation for
  information about adding and removing devices.
- If you upgrade the flash caching device, verify the following requirements:
  - If you upgrade the flash caching device, verify that the cluster contains enough space to migrate
    the data from the disk group that is associated with the flash device.
Procedure
3 Select the disk group that contains the device that you want to replace.
4 Select the flash caching device and click Remove selected disk(s) from disk group.
Results
After the flash caching device is deleted from the vSAN cluster, the cluster details reflect the current
cluster capacity and configuration settings. vSAN discards the disk group memberships, deletes
partitions, and removes stale data from all devices.
What to do next
When you unplug a capacity device without removing it from the vSAN cluster, the components on the
disk are marked as absent. If the capacity device fails, the components on the disk are marked as
degraded. When the number of failures of the object replica with the affected components exceeds the
FTT value, the virtual machines on the disk become inaccessible. See Capacity Device Not Accessible in
a vSAN Cluster.
Note If your vSAN cluster uses deduplication and compression, you must remove the entire disk group
from the cluster before you replace the device.
Prerequisites
- Verify that the storage controllers on the hosts are configured in passthrough mode and support the
  hot-plug feature.
  If the storage controllers are configured in RAID 0 mode, see the vendor documentation for
  information about adding and removing devices.
- Verify that the cluster contains enough space to migrate the data from the capacity device.
Procedure
3 Select the disk group that contains the device that you want to replace.
4 Select the flash capacity device or magnetic disk, and click Remove selected disk(s) from disk
group.
What to do next
If you remove a flash caching device, vSAN deletes the disk group that is associated with the flash device
and all its member devices.
Prerequisites
Verify that the storage controllers on the hosts are configured in passthrough mode and support the
hot-plug feature.
If the storage controllers are configured in RAID 0 mode, see the vendor documentation for information
about adding and removing devices.
Procedure
2 To identify the device ID of the failed device, run this command and learn the device ID from the
output.
What to do next
If you plan to shut down the vSAN cluster, you do not need to manually disable vSAN on the cluster.
Procedure
a Power off all virtual machines (VMs) running in the vSAN cluster, unless vCenter Server is
running on the cluster.
If vCenter Server is hosted in the vSAN cluster, do not power off the vCenter Server VM.
Click the Monitor tab, and select vSAN > Resyncing Objects.
c Click the Configure tab and ensure that DRS and HA are turned off so that the cluster does not
register host shutdowns as failures.
d Disable cluster member updates from vCenter Server using the following command.
esxcfg-advcfg -s 1 /VSAN/IgnoreClusterMemberListUpdates
e If vCenter Server is hosted in the vSAN cluster, power off the vCenter Server VM. The vSphere
Client becomes unavailable.
f Use the following command to place the ESXi hosts in the maintenance mode with no data
migration (no action).
g Power off the hosts after each host has successfully entered the maintenance mode.
On the physical box where ESXi is installed, press the power button until the power-on sequence
begins. The ESXi host starts, locates its VMs, and functions normally.
After you power on the hosts, verify that the vSAN cluster is intact.
You can ignore any misconfiguration status message if you did not make network configuration
changes and the vSAN cluster was working as expected before you shut down the cluster. The
message disappears after at least three hosts join the cluster.
b Use the following command to take the hosts out of maintenance mode.
c Verify that all the hosts are available in the cluster by running the following command on each
host.
e Re-enable cluster member updates from vCenter Server with the following command.
esxcfg-advcfg -s 0 /VSAN/IgnoreClusterMemberListUpdates
f Navigate to the vSAN cluster and click the Configure tab. Re-enable DRS and HA.
h Check the vSAN health service, and resolve any outstanding issues.
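The two esxcfg-advcfg commands in this procedure differ only in the value they set, so a script that drives the shutdown and restart can build both from one helper. This is a small sketch: the function name is hypothetical, and how the command is executed on each host (for example, over SSH) is left open.

```python
def member_update_command(ignore_updates):
    """Build the esxcfg-advcfg argument list used in this procedure.

    ignore_updates=True  disables cluster member updates (before shutdown).
    ignore_updates=False re-enables them (after the hosts are back).
    """
    value = "1" if ignore_updates else "0"
    return ["esxcfg-advcfg", "-s", value, "/VSAN/IgnoreClusterMemberListUpdates"]
```

Returning an argument list rather than a single string keeps the helper usable with process-spawning APIs that expect separate arguments. Join the list with spaces to get the command line shown in the steps.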