|Document revision date: 30 March 2001|
Order Number: AA--R237C--TE
This manual is a conceptual guide for experienced users responsible for optimizing performance on OpenVMS systems. For information about OpenVMS performance on AlphaServer GS80/160/320 systems, see the OpenVMS on AlphaServer GS-Series Systems Configuration and Performance Guidelines, available at http://www.openvms.compaq.com/gsseries/index.html.
Revision/Update Information: This manual supersedes the OpenVMS Performance Management, OpenVMS Version 7.2.
Software Version: OpenVMS Version 7.3
Compaq Computer Corporation
© 2001 Compaq Computer Corporation
Compaq, VAX, VMS, and the Compaq logo Registered in U.S. Patent and Trademark Office.
OpenVMS is a trademark of Compaq Information Technologies Group, L.P. in the United States and other countries.
All other product names mentioned herein may be the trademarks of their respective companies.
Confidential computer software. Valid license from Compaq required for possession, use, or copying. Consistent with FAR 12.211 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor's standard commercial license.
Compaq shall not be liable for technical or editorial errors or omissions contained herein.
The information in this document is provided "as is" without warranty of any kind and is subject to change without notice. The warranties for Compaq products are set forth in the express limited warranty statements accompanying such products. Nothing herein should be construed as constituting an additional warranty.
The following are trademarks of Compaq Computer Corporation: Alpha, ACMS, DDIF, DECdirect, DECnet, HSC, and MicroVAX.
The following are third-party trademarks:
Motif and UNIX are trademarks of The Open Group in the United States and other countries.
The Compaq OpenVMS documentation set is available on CD-ROM.
This document was prepared using DECdocument, Version V3.3-1e.
This manual presents techniques for evaluating, analyzing, and optimizing performance on a system running OpenVMS. Discussions address such wide-ranging concerns as:
The manual includes detailed procedures to help you evaluate resource utilization on your system and to diagnose and overcome performance problems resulting from memory limitations, I/O limitations, CPU limitations, human error, or combinations of these. The procedures feature sequential tests that use OpenVMS tools to generate performance data; the accompanying text explains how to evaluate it.
Whenever an investigation uncovers a situation that could benefit from adjusting system values, those adjustments are described in detail, and hints are provided to clarify the interrelationships of certain groups of values. When such adjustments are not the appropriate or available action, other options are defined and discussed.
Decision-tree diagrams summarize the step-by-step descriptions in the text. These diagrams should also serve as useful reference tools for subsequent investigations of system performance.
This manual does not describe methods for capacity planning, nor does it attempt to provide details about using OpenVMS RMS features (hereafter referred to as RMS). Refer to the Guide to OpenVMS File Applications for that information. Likewise, the manual does not discuss DECnet for OpenVMS performance issues, because the DECnet-Plus for OpenVMS Network Management manual provides that information.
This manual addresses system managers and other experienced users responsible for maintaining a consistently high level of system performance, for diagnosing problems on a routine basis, and for taking appropriate remedial action.
This manual is divided into 13 chapters and 4 appendixes, each covering a related group of performance management topics as follows:
For additional information on the topics covered in this manual, you can refer to the following documents:
For additional information about Compaq OpenVMS products and services, access the Compaq website at the following location:
Compaq welcomes your comments on this manual. Please send comments to either of the following addresses:
Compaq Computer Corporation
OSSG Documentation Group, ZKO3-4/U08
110 Spit Brook Rd.
Nashua, NH 03062-2698
Use the following World Wide Web address to order additional documentation:
If you need help deciding which documentation best meets your needs, call 800-282-6672.
In this manual, every use of DECwindows and DECwindows Motif refers to Compaq DECwindows Motif for OpenVMS software.
The following conventions are also used in this manual:
|bold text||This typeface represents the introduction of a new term. It also represents the name of an argument, an attribute, or a reason.|
|italic text||Italic text indicates important information, complete titles of manuals, or variables. Variables include information that varies in system output (Internal error number), in command lines (/PRODUCER= name), and in command parameters in text (where dd represents the predefined code for the device type).|
|UPPERCASE TEXT||Uppercase text indicates a command, the name of a routine, the name of a file, or the abbreviation for a system privilege.|
Monospace type indicates code examples and interactive screen displays.
In the C programming language, monospace type in text identifies the following elements: keywords, the names of independently compiled external functions and files, syntax summaries, and references to variables or identifiers introduced in an example.
Managing system performance involves being able to evaluate and coordinate system resources and workload demands.
A system resource is a hardware or software component or subsystem under the direct control of the operating system, which is responsible for data computation or storage. The following subsystems are system resources:
In addition to this manual, specific cluster information can be found in the Guidelines for OpenVMS Cluster Configurations and the OpenVMS Cluster Systems.
Performance management means optimizing your hardware and software resources for the current work load. This involves performing the following tasks:
To help you understand the scope and interrelationship of these issues, this chapter covers the following topics:
Because many different networking options are available, network I/O is not formally covered in this manual. General performance concepts discussed here apply to networking, and networking should be considered within the scope of analyzing any system performance problem. You should consult the documentation available for the specific products that you have installed for specific guidelines concerning configuration, monitoring, and diagnosis of a networking product.
Similarly, database products are extremely complex and perform much of
their own internal management. The settings of parameters external to
OpenVMS may have a profound effect upon how efficiently OpenVMS is
used. Thus, reviewing server application specific-material is a must if
you are to efficiently understand and resolve a related performance
1.1 System Performance Management Guidelines
Even if you are familiar with basic concepts discussed in this section,
there are some details discussed that are specific to this process, so
please read the entire section.
1.1.1 The Performance Management Process
Long term measurement and observation of your system is key to understanding how well it is working and is invaluable in identifying potential performance problems before they become so serious that the system grinds to a halt and it negatively affects your business. Thus, performance management should be a routine process of monitoring and measuring your systems to assure good operation through deliberate planning and resource management.
Waiting until a problem cripples a system before addressing system performance is not performance management, rather it is crisis management. Performance management involves:
You will often observe trends and thus be able to address performance issues before they become serious and adversely affect your business operations. Should an unforeseen problem occur, your historical data will likely prove invaluable for pinpointing the cause and rapidly and efficiently resolving the problem. Without past data from your formerly well-running system, you may have no basis upon which to judge the value of the metrics you can collect on your currently poorly running system. Without historical data you are guessing; resolution will take much longer and cost far more.
Upgrades and Reconfigurations
Some systems are so heavily loaded that the cost of additional functionality of new software can push the system beyond the maximum load that the system was intended to handle and thus deliver unacceptable response times and throughput. If your system is running near its limit now during peak workload periods, you want to ensure that you take the steps necessary to avoid pushing your system beyond its limits when you cannot afford it.
If your system is not a finely tuned, well-running machine, you are advised to use caution when considering changes to anything. Your system is already being pushed to, or beyond, its original designed capacity if you have observed users complaining about:
If this is the case, you need a performance audit to determine your current workload and the resources necessary to adequately support your current and possibly future workloads. Implementing changes not specifically designed to increase such a system's capacity or reduce its workload can degrade performance further. Thus, investing in a performance audit will pay off by delivering you a more reliable, productive, available, and lower maintenance system.
Many factors involved in upgrades and reconfigurations contribute to increased resource consumption. Future workloads your system will be asked to support may be unforseeable due to changes in the system, workload, and business.
Blind reconfiguration without measurement, analysis, modification, and
contingency plans can result in serious problems. Significant increases
in CPU, disk, memory, and LAN utilization demand serious consideration,
measurement, and planning for additional workload and upgrades.
1.1.2 Conducting a Performance Audit
The goals of a performance audit are to:
A proper performance audit will:
Without scientific measurement before installation and modification, as well as after, you will not acquire the data necessary to understand, plan for, and resolve potential problems in the immediate as well as distant future. Keep the following in mind:
Peak Workloads and the Cyclic Nature of Workloads
You must first identify periods of activity during which you cannot afford to have system performance degrade and then measure overall system activity during these periods.
These periods will vary from system to system minute to minute, hour to hour, day to day, week to week, and month to month. Holidays and other such periods are often significant factors and should be considered. These periods depend upon the business cycles that the system is supporting.
If the periods you have identified as critical cannot be measured at
this time, then measurements taken in the immediate future will have to
be used as the basis for estimates of the activity during those
periods. In such cases you will have to take measurements in the near
term and make estimates based on the data you collect in combination
with other data such as order rates from the previous calendar month or
year, as well as projections or forecasts. But factors other than the
CPU may become a bottleneck and slow down the system. For example, a
fixed number of assistants can only process so many orders per hour,
regardless of the number of callers waiting on the phone for service.
1.2 Strategies and Procedures
This manual describes several strategies and procedures for evaluating performance, evaluating system resources, and diagnosing resource limitations as shown in the following list:
As a system manager, you must be able to do the following:
Before you adjust any system parameters, you should:
You can observe system operation using the following tools:
On Alpha platforms, Compaq recommends using the DECevent utility instead of the Error Log utility, ANALYZE/ERROR_LOG. (You invoke the DECevent utility with the DCL command, DIAGNOSE.) You can use ANALYZE/ERROR_LOG on Alpha systems, but the DECevent utility provides more comprehensive reports.
|privacy and legal statement|