Jump to content United States-English
HP.com Home Products and Services Support and Drivers Solutions How to Buy
» Contact HP
More options
HP.com home
HP Integrity Virtual Machines Version 4.0 Installation, Configuration, and Administration > Chapter 11 Using HP Serviceguard with Integrity VM

Troubleshooting Serviceguard with Integrity VM

» 

Technical documentation

Complete book in PDF
» Feedback
Content starts here

 » Table of Contents

 » Glossary

 » Index

This section describes how to solve some of the problems that can occur using Serviceguard and Integrity VM.

Troubleshooting VMs as Serviceguard Packages

If the distributed guest does not start or failover, check both the /var/adm/syslog/syslog.log file and the package log file (/etc/cmcluster/guest—name/guest—name.log).

If a package fails to start, Serviceguard performs a package halt. The log files include a Halting package section after the Starting package section that contains the actual starting failure messages. When you view package log files after a package start failure, look at the Halting package section as well as the Starting package section.

If the distributed guest does not fail over, take the package down using the cmhaltpkg command. Make sure the guest has the resources it needs to run on the adoptive node by manually starting the package on the adoptive node with the same workload using the cmrunpkg command.

If the package does not start under manual control, stop the cluster and test the guest named compass1.

  1. Use the hpvmmodify command to set the guest to not be distributed. For example:

    # hpvmmodify -P compass1 -i NONE # hpvmmodify -P compass1 -j 0
  2. Use the hpvmstart command to start the guest with the same VM Host system and workload. Use the virtual console (hpvmconsole) to make sure the the guest OS is installed and that applications are running properly.

After testing the guest, create the Serviceguard package again.

If the guest does not start and displays errors about storage problems, and you are using logical volumes, the storage units might not be available to the VM Host. To make the storage units available , enter the appropriate commands, as follows:

  • For LVM logical volumes, enter the following commands:

    # vgchange -c n /dev/vgxx # vgchange -a y /dev/vgxx
  • For VxVM logical volumes, enter the following commands:

    # vxdg import diskgroup-name # vxvol -g diskgroup-name startall
  • If you are using files on a logical volume, also enter the following command:

    # mount /dev/vgxx /mount-point

After making sure the backing storage devices are available, restore them to their original state.

Some problems that arise from improper storage configuration include:

  • Whole disks — Verify that the VM Host has access to the disks. If not, the problem might be a hardware or storage subsystem issue.

  • LVM - Before starting a package, Serviceguard requires that all volume groups associated with the package are inactive. For details about deactivating LVMs, see the Managing Serviceguard manual.

  • VxVm — Before starting a package, Serviceguard requires that all disk groups associated with the package are deported. For details, see the Managing Serviceguard manual.

  • Files — Before starting a package, Serviceguard requires that file systems of file backing stores associated with the package are unmounted.

If the guest has problems accessing network, make sure the network devices are available on the VM Host system. Packages do not start if any of their defined subnets are unavailable. This causes multiple failures if no standby LANs are available, or when one or more switches, hubs, interfaces or cables fail.

A common issue when starting a package is the lack of available memory. For more information about providing the required memory resources, see Chapter 3 .

Creating Distributed Guests

This section describes how to use the hpvmsg_package script to help you configure guests as Serviceguard packages. If you create the Serviceguard package configuration and control scripts manually instead, use the following options to the hpvmcreate , hpvmmodify, or hpvmclone command to identify the Serviceguard package name and to mark the guest as a distributed guest:

  • Use the -i option to specify the Serviceguard package. (For example, -i SG_package_name.)

  • Use the -j 1 option to specify that the guest is a distributed guest.

For more information, see the hpvmsg_package file.

Networking

If the guest has network problems after failover:

  • Make sure the vswitches are configured configured on the adoptive node. If you are using the VLAN feature of Integrity VM vswitches, make sure that appropriate VLAN IDs are assigned to each port.

  • Adjust the values of the following Serviceguard parameters in the cluster configuration file. The correct settings for the HEARTBEAT_INTERVAL and the NODE_TIMEOUT parameter are system- and load-dependent.

    • The HEARTBEAT_INTERVAL parameter specifies the normal interval between the transmission of heartbeat messages from one node to the other in the cluster. The value of the HEARTBEAT_INTERVAL parameter is entered in microseconds; the default value is 1,000,000 microseconds. Setting the value of this parameter to less than the default is not recommended. The default should be used where possible. The maximum value recommended is 15 seconds, and the maximum value supported is 30 seconds. This value should be at least half the value of the NODE_TIMEOUT parameter.

    • The NODE_TIMEOUT parameter specifies the amount of time after which the Serviceguard node can determine that the other node is unavailable and can initiate cluster reformation. This parameter is entered in microseconds; the default value is 2,000,000.The minimum is two times the value of the HEARTBEAT_INTERVAL parameter. The maximum recommended value for this parameter is 30,000,000. The default setting yields the fastest cluster reformations. However, using the default value increases the potential for spurious reformations because of momentary system hangs or network-load spikes. For many installations, a setting of 5,000,000 to 8,000,000 (5 to 8 seconds) is more appropriate. The maximum value recommended is 30 seconds and the maximum value supported is 60 seconds.

Printable version
Privacy statement Using this site means you accept its terms Feedback to webmaster
© 2008 Hewlett-Packard Development Company, L.P.