|
HP OpenVMS Availability Manager User's Guide
Order Number:
BA554-90001
July 2006
This guide explains how to use HP Availability Manager software to
detect and correct system availability problems.
Revision/Update Information:
This guide supersedes the HP OpenVMS Availability Manager User's Guide, Version 2.5.
Operating System:
Data Analyzer:
Windows 2000 SP 4 or higher;
Windows XP SP 2;
OpenVMS Alpha Versions 8.2 and 8.3;
OpenVMS I64 Versions 8.2-1 and 8.3
Data Collector:
OpenVMS VAX Version 6.2 and 7.3;
OpenVMS Alpha Versions 8.2 and 8.3;
OpenVMS I64 Versions 8.2-1 and 8.3
Software Version:
HP Availability Manager Version 2.6
Hewlett-Packard Company Palo Alto, California
© 2006 Hewlett-Packard Development Company, L.P.
Confidential computer software. Valid license from HP required for
possession, use or copying. Consistent with FAR 12.211 and 12.212,
Commercial Computer Software, Computer Software Documentation, and
Technical Data for Commercial Items are licensed to the U.S. Government
under vendor's standard commercial license.
The information contained herein is subject to change without notice.
The only warranties for HP products and services are set forth in the
express warranty statements accompanying such products and services.
Nothing herein should be construed as constituting an additional
warranty. HP shall not be liable for technical or editorial errors or
omissions contained herein.
Intel and Itanium are trademarks or registered trademarks of Intel
Corporation or its subsidiaries in the United States and other
countries.
Microsoft and Windows are U.S. registered trademarks of Microsoft
Corporation.
Javatm is a US trademark of Sun Microsystems, Inc.
Printed in the US
ZK6552
Preface
Intended Audience
This guide is intended for system managers who install and use HP
Availability Manager software. It is assumed that the system managers
who use this product are familiar with Microsoft Windows terms and
functions.
Note
The term Windows as it is used in this manual refers
to either Windows 2000 or Windows XP but not to any other
Windows product.
|
Document Structure
This guide contains the following chapters and appendixes:
- Chapter 1 provides an overview of Availability Manager software,
including security features.
- Chapter 2 tells how to start the Availability Manager, use the
main System Overview window, select a group of nodes and individual
nodes, and use online help.
- Chapter 3 tells how to select nodes and display node data; it
also explains what node data is.
- Chapter 4 tells how to display OpenVMS Cluster summary and
detailed data; it also explains what cluster data is.
- Chapter 5 tells how to display and interpret events.
- Chapter 6 tells how to take a variety of corrective actions,
called fixes, to improve system availability.
- Chapter 7 describes the tasks you can perform to filter, select,
and customize the display of data and events.
- Appendix A contains a table of CPU process states that are
referred to in Section 3.2.2.4 and in Section 3.3.1.
- Appendix B contains a table of OpenVMS and Windows events that
can be displayed in the Event pane discussed in Chapter 5.
- Appendix C describes the events that can be signaled for each
type of OpenVMS data that is collected.
Related Documents
The following manuals provide additional information:
- HP OpenVMS System Manager's Manual describes tasks for managing an OpenVMS system. It
also describes installing a product with the POLYCENTER Software
Installation utility.
- HP OpenVMS System Management Utilities Reference Manual describes utilities you can use to manage an OpenVMS
system.
- HP OpenVMS Programming Concepts Manual explains OpenVMS lock management concepts.
For additional information about HP OpenVMS products and services,
visit the following World Wide Web address:
http://www.hp.com/go/openvms
|
Reader's Comments
HP welcomes your comments on this manual. Please send comments to
either of the following addresses:
Internet
|
openvmsdoc@hp.com
|
Postal Mail
|
Hewlett-Packard Company
OSSG Documentation Group, ZKO3-4/U08
110 Spit Brook Rd.
Nashua, NH 03062-2698
|
How to Order Additional Documentation
For information about how to order additional documentation, visit the
following World Wide Web address:
http://www.hp.com/go/openvms/doc/order
|
Conventions
The following conventions are used in this guide:
Ctrl/
x
|
A sequence such as Ctrl/
x indicates that you must hold down the key labeled Ctrl while
you press another key or a pointing device button.
|
PF1
x
|
A sequence such as PF1
x indicates that you must first press and release the key
labeled PF1 and then press and release another key or a pointing device
button.
|
[Return]
|
In examples, a key name enclosed in a box indicates that you press a
key on the keyboard. (In text, a key name is not enclosed in a box.)
In the HTML version of this document, this convention appears as
brackets, rather than a box.
|
...
|
A horizontal ellipsis in examples indicates one of the following
possibilities:
- Additional optional arguments in a statement have been omitted.
- The preceding item or items can be repeated one or more times.
- Additional parameters, values, or other information can be entered.
|
.
.
.
|
A vertical ellipsis indicates the omission of items from a code example
or command format; the items are omitted because they are not important
to the topic being discussed.
|
( )
|
In command format descriptions, parentheses indicate that you must
enclose choices in parentheses if you specify more than one.
|
[ ]
|
In command format descriptions, brackets indicate optional choices. You
can choose one or more items or no items. Do not type the brackets on
the command line. However, you must include the brackets in the syntax
for OpenVMS directory specifications and for a substring specification
in an assignment statement.
|
|
|
In command format descriptions, vertical bars separate choices within
brackets or braces. Within brackets, the choices are optional; within
braces, at least one choice is required. Do not type the vertical bars
on the command line.
|
{ }
|
In command format descriptions, braces indicate required choices; you
must choose at least one of the items listed. Do not type the braces on
the command line.
|
bold type
|
Bold type represents the introduction of a new term. It also represents
the name of an argument, an attribute, or a reason.
|
italic type
|
Italic type indicates important information, complete titles of
manuals, or variables. Variables include information that varies in
system output (Internal error
number), in command lines (/PRODUCER=
name), and in command parameters in text (where
dd represents the predefined code for the device type).
|
UPPERCASE TYPE
|
Uppercase type indicates a command, the name of a routine, the name of
a file, or the abbreviation for a system privilege.
|
Example
|
This typeface indicates code examples, command examples, and
interactive screen displays. In text, this type also identifies URLs,
UNIX commands and pathnames, PC-based commands and folders, and certain
elements of the C programming language.
|
-
|
A hyphen at the end of a command format description, command line, or
code line indicates that the command or statement continues on the
following line.
|
numbers
|
All numbers in text are assumed to be decimal unless otherwise noted.
Nondecimal radixes---binary, octal, or hexadecimal---are explicitly
indicated.
|
Chapter 1 Overview
This chapter answers the following questions:
- What is the HP Availability Manager?
- How does the Availability Manager work?
- How does the Availability Manager maintain security?
- How does the Availability Manager identify possible performance problems?
1.1 What Is the HP Availability Manager?
The HP Availability Manager is a system management tool that allows you
to monitor, from an OpenVMS or Windows node, one or more OpenVMS nodes
on an extended local area network (LAN).
The Availability Manager helps
system managers and analysts target a specific node or process for
detailed analysis. This tool collects system and process data from
multiple OpenVMS nodes simultaneously, analyzes the data, and displays
the output using a graphical user interface (GUI).
Features and Benefits
The Availability Manager offers many features that can help system managers
improve the availability, accessibility, and performance of OpenVMS
nodes and clusters.
Feature |
Description |
Immediate notification of problems
|
Based on its analysis of data, the Availability Manager notifies you
immediately if any node you are monitoring is experiencing a
performance problem, especially one that affects the node's
accessibility to users. At a glance, you can see whether a problem is a
persistent one that warrants further investigation and correction.
|
Centralized management
|
Provides centralized management of remote nodes within an extended
local area network (LAN).
|
Intuitive interface
|
Provides an easy-to-learn and easy-to-use graphical user interface
(GUI). An earlier version of the tool, DECamds, uses a
Motif GUI to display information about OpenVMS nodes. The Availability Manager
uses a Java
GUI to display information about OpenVMS nodes on an OpenVMS or a
Windows node.
|
Correction capability
|
Allows real-time intervention, including adjustment of node and process
parameters, even when remote nodes are hung.
|
Uses its own protocol
|
An important advantage of the
Availability Manager is that it uses its own network protocol. Unlike
most performance monitors,
the Availability Manager does not rely on TCP/IP or any other standard
protocol. Therefore, even if a standard protocol is unavailable, the
Availability Manager can continue to operate.
|
Customization
|
Using a wide range of customization options, you can customize the
Availability Manager to meet the requirements of your particular site.
For example, you can change the severity levels of the events that are
displayed and escalate their importance.
|
Scalability
|
Makes it easier to monitor multiple OpenVMS nodes.
|
Figure 1-1 is an example of the initial System Overview window of the
Availability Manager.
Figure 1-1 System Overview Window
The System Overview window is divided into the following sections:
- In the upper section of the display is a list of user-defined
groups and a list of nodes in each group. You can compress the display
to only the name of a group by clicking the handle preceding the group
name. The summary group line remains, showing the collected information
for all the nodes in the group.
If a node name displays a red icon,
you can hold the cursor over the icon, the node name, or the number in
the Events column to display a tooltip explaining what the problem is;
for example, for the node DBGAVC, the following message is displayed:
HIHRDP, high hard page fault rate
|
This section of the window is called the Group/Node
pane.
- In the lower section of the window events are posted, alerting you
to possible problems on your system. The items on the pane vary,
depending on the severity of the problem: the most severe problems are
displayed first. This section of the window is called the Event
pane.
1.2 How Does the Availability Manager Work?
The Availability Manager uses two types of nodes to monitor systems:
- One or more OpenVMS Data Collector nodes, which contain the
software that collects data.
- An OpenVMS or a Windows Data Analyzer node, which contains the
software that analyzes the
collected data.
The Data Analyzer and Data Collector nodes communicate over an extended
LAN
using an IEEE 802.3 Extended Packet format protocol. Once a connection
is established, the Data Analyzer instructs the Data Collector to
gather specific system and process data.
Although you can run the Data Analyzer as a member of a monitored
cluster, it is typically run on a system that is not a member of a
monitored cluster. In this way, the Data Analyzer will not hang if the
cluster hangs.
Only one Data Analyzer at a time should be running on each node;
however, more than one can be running in the LAN at any given time.
Figure 1-2 shows a possible configuration of Data Analyzer and Data
Collector nodes.
Figure 1-2 Availability Manager Node Configuration
In Figure 1-2, the Data Analyzer can monitor nodes A, B, and C across
the network. The password on node D does not match the password of the
Data Analyzer; therefore, the Data Analyzer cannot monitor node D.
For information about password security, see Section 1.3.
Requesting and Receiving Information
After installing the Availability Manager software, you can begin to
request information from one or more Data Collector nodes.
Requesting and receiving information requires the Availability Manager to
perform a number of steps, which are shown in Figure 1-3 and
explained after the figure.
Figure 1-3 Requesting and Receiving Information
The following steps correspond to the numbers in Figure 1-3.
- The GUI communicates users' requests for data
to the driver on the Data Analyzer node:
- On Windows systems, the Windows driver is part of the Windows kit.
- On OpenVMS systems, the OpenVMS driver is called the Data Collector
driver and is included in the Data Collector kit. This is the same
driver that is on the Data Collector node.
- The driver on the Data Analyzer sends users'
requests across the network to the driver on the Data Collector node.
- The driver on the Data Collector transmits the
requested information over the network to the driver on the Data
Analyzer node.
- The driver on the Data Analyzer node passes
the requested information to the GUI, which displays the data.
In step 4, the Availability Manager also checks the data for any events that
should be posted. The following section explains in more detail how
data analysis and event detection work.
Note
More than one Windows or OpenVMS Data Analyzer node can collect data
from the same Data Collector node.
|
Communicating Through a Private LAN Transport
The Availability Manager protocol is based on the 802.3 Extended Packet Format
(also
known as SNAP). The IEEE Availability Manager protocol values are as follows:
Protocol ID: 08-00-2B-80-48
Multicast Address: 09-00-2B-02-01-09
|
If your routers filter protocols in your network, add these values to
your network protocols so that the private transport is propagated over
the routers.
1.3 How Does the Availability Manager Maintain Security?
The Availability Manager uses passwords to maintain security. Passwords are
eight alphanumeric characters long.
The Data Analyzer stores passwords in its customization file.
On OpenVMS Data Collector nodes, passwords are part of a three-part
security code called a security triplet.
The following sections explain these security methods further.
1.3.1 Data Analyzer Password Security
For monitoring to take place, the password on a Data Analyzer node must
match the password section of the security triplet on each OpenVMS Data
Collector node. OpenVMS Data Collectors also impose other security
measures, which are explained in Section 1.3.2.
Figure 1-4 illustrates how you can use passwords to limit access to
node information.
Figure 1-4 Availability Manager Password Matching
As shown in Figure 1-4, the Testing Department's Data Analyzer, whose
password is HOMERUNS, can access only OpenVMS Data Collector nodes with
the HOMERUNS password as part of their security triplets. The same is
true of the Accounting Department's Data Analyzer, whose password is
BATTERUP; it can access only OpenVMS Data Collector nodes with the
BATTERUP password as part of their security triplets.
The Availability Manager sets a default password when you install the Data
Analyzer. To change that password, you must use the OpenVMS Security
Customization page (see Figure 7-21), which is explained in
Chapter 7.
1.3.2 OpenVMS Data Collector Security
OpenVMS Data Collector nodes have the following security features:
- Availability Manager data-transfer security
Each
OpenVMS node running as a Data Collector has a file containing a list
of security triplets. For Data Analyzer and Data Collector nodes to
exchange data, the passwords on these nodes must match. In
addition, the triplet specifies the type of access a Data Analyzer has.
By specifying the hardware address of the Data Analyzer, the triplet
can also restrict which Data Analyzer nodes are able to access the Data
Collector. Section 1.3.3 explains security triplets and how to edit
them.
- Availability Manager security log
An OpenVMS Data Collector logs all access denials and executed
write instructions to the operator communications manager (OPCOM).
Messages are displayed on all terminals that have OPCOM enabled (with
the REPLY/ENABLE command). OPCOM also puts messages in the
SYS$MANAGER:OPERATOR.LOG file. Each security log entry contains the
network address of the initiator. If access is denied, the log entry
also indicates whether a read or write was attempted. If a write
operation was performed, the log entry indicates the process identifier
(PID) of the affected process.
- OpenVMS file protection and process privileges
When the Availability Manager is installed, it creates a directory
(SYS$COMMON:[AMDS$AM]) and sets directory and file protections on it so
that only the SYSTEM account can read the files in that directory. For
additional security on these system-level directories and files, you
can create access control lists (ACLs) to restrict and set alarms on
write access to the security files. For more information about
creating ACLs, see the HP OpenVMS Guide to System Security.
1.3.3 Changing Security Triplets on OpenVMS Data Collector Nodes
To change security triplets on an OpenVMS Data Collector node, you must
edit the AMDS$DRIVER_ACCESS.DAT file, which is installed on all Data
Collector nodes. The following sections explain what a security triplet
is, how the Availability Manager uses it, and how to change it.
|